LocalAI
Server Requirements
Parameter
Minimum
Recommended
What is LocalAI?
Supported Models
Type
Formats
Examples
Quick Deploy
Verify It's Working
Pre-Built Models
Model Name
Type
Description
Accessing Your Service
Docker Deploy (Alternative)
Download Models
From Model Gallery
From Hugging Face
Model Configuration
API Usage
Chat Completions (OpenAI Compatible)
Streaming
Embeddings
Image Generation
cURL Examples
Chat
Embeddings
Text-to-Speech (TTS)
Speech-to-Text (STT)
Reranking
Complete API Reference
Standard Endpoints (OpenAI Compatible)
Endpoint
Method
Description
Additional Endpoints
Endpoint
Method
Description
Get Version
Swagger Documentation
GPU Acceleration
CUDA Backend
Full GPU Offload
Multiple Models
Performance Tuning
For Speed
For Memory
Benchmarks
Model
GPU
Tokens/sec
Troubleshooting
HTTP 502 on http_pub URL
Model Not Loading
Slow Responses
Out of Memory
Image Generation Issues
Cost Estimate
GPU
CLORE/day
Approx USD/hr
Good For
Next Steps
Last updated
Was this helpful?