Text to Speech Models

Explore AI models that convert text to natural-sounding speech.

Filter by tier:

All Budget Standard Premium

Showing 12 of 12 text to speech models

TTS Budget

Chatterbox TTS

Fal.ai

Text-to-speech for memes, videos, games, and AI agents.

TTS Budget

ElevenLabs Flash V2.5

ElevenLabs

Fastest ElevenLabs model for ultra-low latency text-to-speech.

TTS Premium

ElevenLabs Multilingual V2

ElevenLabs

High-quality multilingual text-to-speech supporting 29 languages with emotional range.

TTS Standard

ElevenLabs Turbo V2.5

ElevenLabs

Fast text-to-speech optimized for low latency with good quality.

TTS Premium

ElevenLabs V3

ElevenLabs

Latest ElevenLabs model with best-in-class voice quality and expressiveness.

TTS Budget

Kokoro 82M

Jaaari (via Replicate)

Lightweight 82M parameter text-to-speech model with natural voices.

TTS Standard

Maya TTS

Fal.ai

State-of-the-art speech model for expressive voice generation.

Expressive voice generation

voice-cloning Standard

OpenVoice

MyShell (via Replicate)

Instant voice cloning with fine-grained control over style, emotion, and accent.

TTS Standard

Parler TTS

Replicate

Describe the voice you want in text. Generate speech matching your description.

TTS Premium

Speech 02 HD

MiniMax (via Replicate)

High-definition text-to-speech with premium voice quality.

TTS Standard

Speech 02 Turbo

MiniMax (via Replicate)

Fast text-to-speech variant of Speech 02 optimized for speed.

voice-cloning Standard

XTTS-v2

Coqui (via Replicate)

Clone any voice with just 6 seconds of audio. Supports 17 languages.

Clone voices in seconds

Text to Speech Models

Chatterbox TTS

ElevenLabs Flash V2.5

ElevenLabs Multilingual V2

ElevenLabs Turbo V2.5

ElevenLabs V3

Kokoro 82M

Maya TTS

OpenVoice

Parler TTS

Speech 02 HD

Speech 02 Turbo

XTTS-v2

Explore Other Categories

Text Generation

Chat

Code

Image Generation

Image Editing

Video Generation

Audio Generation

3D Generation