Qwen2.5-VL Vision Language Model
Run Qwen2.5-VL, the leading open vision-language model, for image/video/document understanding on Clore.ai GPUs.
Key Features
Requirements
Component
3B
7B
72B
Quick Start
Option A: Ollama (Simplest)
Option B: Python / Transformers
Usage Examples
Image Understanding with Transformers
Video Analysis
Document OCR and Extraction
Ollama API for Batch Processing
Tips for Clore.ai Users
Troubleshooting
Problem
Fix
Last updated
Was this helpful?