# Overview

AI-powered audio processing, speech synthesis, and voice cloning on CLORE.AI GPUs.

## Text-to-Speech

| Tool                                                                                            | Description                                  | Quality   |
| ----------------------------------------------------------------------------------------------- | -------------------------------------------- | --------- |
| [Bark TTS](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/bark-tts)                 | Expressive multilingual TTS                  | Excellent |
| [XTTS](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/xtts-coqui)                   | Voice cloning + TTS                          | Great     |
| [F5-TTS](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/f5-tts)                     | Fast zero-shot TTS                           | Great     |
| [OpenVoice](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/openvoice-clone)         | Instant voice cloning                        | Good      |
| [Chatterbox TTS](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/chatterbox-tts)     | Zero-shot voice cloning                      | Good      |
| [ChatTTS](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/chattts)                   | Conversational text-to-speech                | Good      |
| [Dia TTS](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/dia-tts)                   | Multi-speaker dialogue generation            | Good      |
| [Fish Speech](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/fish-speech)           | High-quality speech synthesis                | Great     |
| [Kani-TTS-2](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/kani-tts)               | Efficient TTS for voice cloning              | Good      |
| [Kokoro TTS](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/kokoro-tts)             | Ultra-fast, lightweight TTS                  | Good      |
| [MeloTTS](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/melotts)                   | Multilingual text-to-speech                  | Good      |
| [MiniMax Speech 2.6](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/minimax-speech) | Commercial-grade TTS                         | Great     |
| [Qwen3-TTS](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/qwen3-tts)               | Multilingual voice cloning                   | Good      |
| [StyleTTS2](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/styletss2)               | Style-controllable TTS                       | Great     |
| [Voxtral TTS](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/voxtral-tts)           | Open-weight 4B TTS, 9 languages, 3 s cloning | Excellent |
| [Zonos TTS](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/zonos-tts)               | Voice cloning with emotion control           | Good      |

## Voice Cloning

| Tool                                                                                    | Training required    | Quality   |
| --------------------------------------------------------------------------------------- | -------------------- | --------- |
| [RVC](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/rvc-voice-clone)       | Yes                  | Excellent |
| [OpenVoice](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/openvoice-clone) | No                   | Good      |
| [XTTS](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/xtts-coqui)           | No (6-second sample) | Great     |

## Audio Processing

| Tool                                                                                        | Use case                                      |
| ------------------------------------------------------------------------------------------- | --------------------------------------------- |
| [Whisper](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/whisper-transcription) | Speech-to-text transcription                  |
| [Demucs](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/demucs-separation)      | Vocal separation                              |
| [AudioCraft](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/audiocraft-music)   | Music generation                              |
| [Stable Audio](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/stable-audio)     | AI music and sound generation                 |
| [WhisperX](https://docs.clore.ai/guides/guides_v2-de/audio-and-stimme/whisperx)             | Fast transcription with word-level timestamps |

## Related Guides

* [Talking Heads](https://docs.clore.ai/guides/guides_v2-de/sprechende-kopfe/talking-heads) - Animate faces with audio