Computer vision and vision-language models on CLORE.AI GPUs.
LLaVA - Vision chat & QA
Florence-2 - Multi-task vision
SAM2 - Video segmentation
GroundingDINO - Zero-shot detection
Image understanding - LLaVA, Florence-2
Object detection - GroundingDINO, YOLO
Segmentation - SAM2, Segment Anything
Captioning - Florence-2, LLaVA
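As a rough illustration, the task list above can be treated as a small lookup table when deciding which model to deploy. This is only a sketch: the names `MODELS_BY_TASK`, `models_for`, and `tasks_for` are hypothetical helpers invented for this example, not part of any CLORE.AI tooling, and the entries simply mirror the list above.

```python
# Hypothetical lookup table; entries copied from the task list above.
MODELS_BY_TASK = {
    "image understanding": ["LLaVA", "Florence-2"],
    "object detection": ["GroundingDINO", "YOLO"],
    "segmentation": ["SAM2", "Segment Anything"],
    "captioning": ["Florence-2", "LLaVA"],
}


def models_for(task: str) -> list[str]:
    """Return the models listed for a task (case-insensitive), or [] if unknown."""
    return MODELS_BY_TASK.get(task.strip().lower(), [])


def tasks_for(model: str) -> list[str]:
    """Return every task whose model list contains the given model name."""
    return [task for task, models in MODELS_BY_TASK.items() if model in models]


if __name__ == "__main__":
    print(models_for("Segmentation"))
    print(tasks_for("Florence-2"))
```

A reverse lookup such as `tasks_for` is handy when one model is already loaded on a GPU and the question is which other tasks it could cover without deploying a second one.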
LLaVA 7B
8GB
6GB
Last updated 18 days ago