Fish Speech
Run Fish Speech multilingual TTS and zero-shot voice cloning on Clore.ai GPUs
Server Requirements
Parameter
Minimum
Recommended
Quick Deploy on CLORE.AI
1. Find a suitable server
2. Configure your deployment
3. Access the interface
Step-by-Step Setup
Step 1: SSH into your server
Step 2: Pull and run the Docker container
Step 3: Verify GPU access
Step 4: Check model download
Step 5: Access the WebUI
Step 6: (Optional) Enable API server
Usage Examples
Example 1: Basic Text-to-Speech via WebUI
Example 2: Zero-Shot Voice Cloning
Example 3: API-Based TTS (Python)
Example 4: Multilingual TTS
Example 5: Batch Processing Audio Files
Configuration
Docker Compose (Production Setup)
Key Configuration Options
Option
Default
Description
Model Variants
Model
Size
Languages
Notes
Performance Tips
1. Enable torch.compile for Faster Inference
2. Use Half-Precision (FP16)
3. Pre-load Reference Voices
4. GPU Memory Optimization
5. Batch Size Tuning
Troubleshooting
Issue: Container won't start — CUDA not found
Issue: Out of Memory (OOM) Error
Issue: Port 7860 not accessible
Issue: Model download fails / slow download
Issue: Audio quality is poor
Issue: WebUI loads but generation hangs
Links
Clore.ai GPU Recommendations
Use Case
Recommended GPU
Est. Cost on Clore.ai
Last updated
Was this helpful?