Overview
Computer vision and vision-language models on CLORE.AI GPUs.
Vision chat & QA
Multi-task vision
Video segmentation
Zero-shot detection
Image understanding - LLaVA, Florence-2
Object detection - GroundingDINO, YOLO
Segmentation - SAM2, Segment Anything
Captioning - Florence-2, LLaVA
LLaVA 7B - 8GB
Florence-2 - 8GB
SAM2 - 8GB
GroundingDINO - 6GB
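As a rough aid for choosing a GPU, the figures listed above can be turned into a small lookup. This is only a sketch: it assumes the listed sizes are minimum GPU memory requirements, which the page does not state explicitly, and the names `MODEL_MEMORY_GB` and `models_that_fit` are hypothetical helpers, not part of any CLORE.AI tooling.

```python
# Memory figures copied from the list above. Treating them as minimum
# GPU memory is an assumption; the page does not label the column.
MODEL_MEMORY_GB = {
    "LLaVA 7B": 8,
    "Florence-2": 8,
    "SAM2": 8,
    "GroundingDINO": 6,
}


def models_that_fit(gpu_memory_gb: float) -> list[str]:
    """Return the models whose listed memory figure fits on the given GPU."""
    return [
        name
        for name, needed_gb in MODEL_MEMORY_GB.items()
        if needed_gb <= gpu_memory_gb
    ]


if __name__ == "__main__":
    # Compare a 6GB card against an 8GB card.
    for size_gb in (6, 8):
        print(f"{size_gb}GB GPU: {', '.join(models_that_fit(size_gb))}")
```

Under that assumption, a 6GB card covers only GroundingDINO, while an 8GB card covers all four models listed here.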