# Overview

Guides for running AI-powered audio processing, speech synthesis, and voice cloning on CLORE.AI GPUs.

## Text-to-Speech

| Tool                                                                              | Description                                 | Quality   |
| --------------------------------------------------------------------------------- | ------------------------------------------- | --------- |
| [Bark TTS](https://docs.clore.ai/guides/audio-and-voice/bark-tts)                 | Expressive multilingual TTS                 | Excellent |
| [XTTS](https://docs.clore.ai/guides/audio-and-voice/xtts-coqui)                   | Voice cloning + TTS                         | Great     |
| [F5-TTS](https://docs.clore.ai/guides/audio-and-voice/f5-tts)                     | Fast zero-shot TTS                          | Great     |
| [OpenVoice](https://docs.clore.ai/guides/audio-and-voice/openvoice-clone)         | Instant voice cloning                       | Good      |
| [Chatterbox TTS](https://docs.clore.ai/guides/audio-and-voice/chatterbox-tts)     | Zero-shot voice cloning                     | Good      |
| [ChatTTS](https://docs.clore.ai/guides/audio-and-voice/chattts)                   | Conversational text-to-speech               | Good      |
| [Dia TTS](https://docs.clore.ai/guides/audio-and-voice/dia-tts)                   | Multi-speaker dialog generation             | Good      |
| [Fish Speech](https://docs.clore.ai/guides/audio-and-voice/fish-speech)           | High-quality voice synthesis                | Great     |
| [Kani-TTS-2](https://docs.clore.ai/guides/audio-and-voice/kani-tts)               | Efficient voice cloning TTS                 | Good      |
| [Kokoro TTS](https://docs.clore.ai/guides/audio-and-voice/kokoro-tts)             | Ultra-fast lightweight TTS                  | Good      |
| [MeloTTS](https://docs.clore.ai/guides/audio-and-voice/melotts)                   | Multilingual text-to-speech                 | Good      |
| [MiniMax Speech 2.6](https://docs.clore.ai/guides/audio-and-voice/minimax-speech) | Commercial-grade TTS                        | Great     |
| [Qwen3-TTS](https://docs.clore.ai/guides/audio-and-voice/qwen3-tts)               | Multilingual voice cloning                  | Good      |
| [StyleTTS2](https://docs.clore.ai/guides/audio-and-voice/styletss2)               | Style-controllable TTS                      | Great     |
| [Voxtral TTS](https://docs.clore.ai/guides/audio-and-voice/voxtral-tts)           | Open-weight 4B TTS, 9 languages, 3s cloning | Excellent |
| [Zonos TTS](https://docs.clore.ai/guides/audio-and-voice/zonos-tts)               | Voice cloning with emotion control          | Good      |

## Voice Cloning

| Tool                                                                      | Training Required | Quality   |
| ------------------------------------------------------------------------- | ----------------- | --------- |
| [RVC](https://docs.clore.ai/guides/audio-and-voice/rvc-voice-clone)       | Yes               | Excellent |
| [OpenVoice](https://docs.clore.ai/guides/audio-and-voice/openvoice-clone) | No                | Good      |
| [XTTS](https://docs.clore.ai/guides/audio-and-voice/xtts-coqui)           | No (6 sec sample) | Great     |

## Audio Processing

| Tool                                                                          | Use Case                                      |
| ----------------------------------------------------------------------------- | --------------------------------------------- |
| [Whisper](https://docs.clore.ai/guides/audio-and-voice/whisper-transcription) | Speech-to-text transcription                  |
| [Demucs](https://docs.clore.ai/guides/audio-and-voice/demucs-separation)      | Vocal separation                              |
| [AudioCraft](https://docs.clore.ai/guides/audio-and-voice/audiocraft-music)   | Music generation                              |
| [Stable Audio](https://docs.clore.ai/guides/audio-and-voice/stable-audio)     | AI music and sound generation                 |
| [WhisperX](https://docs.clore.ai/guides/audio-and-voice/whisperx)             | Fast transcription with word-level timestamps |

## Related Guides

* [Talking Heads](https://docs.clore.ai/guides/talking-heads/talking-heads) - Animate faces with audio
