Overview

Computer vision and vision-language models on CLORE.AI GPUs.

Available Guides

Model
Use Case

Vision chat & QA

Multi-task vision

Video segmentation

Zero-shot detection

Use Cases

  • Image understanding - LLaVA, Florence-2

  • Object detection - GroundingDINO, YOLO

  • Segmentation - SAM2, Segment Anything

  • Captioning - Florence-2, LLaVA

GPU Requirements

Model
Minimum VRAM

LLaVA 7B

8GB

Florence-2

8GB

SAM2

8GB

GroundingDINO

6GB

Last updated

Was this helpful?