AI Models

Explore 12+ AI models for text, image, video, audio, and 3D generation. Compare capabilities, pricing tiers, and find the perfect model for your needs.

Showing 12 of 12 models in Text to Speech

TTS Budget

Chatterbox TTS

Fal.ai

Text-to-speech for memes, videos, games, and AI agents.

TTS Budget

ElevenLabs Flash V2.5

ElevenLabs

Fastest ElevenLabs model for ultra-low latency text-to-speech.

TTS Premium

ElevenLabs Multilingual V2

ElevenLabs

High-quality multilingual text-to-speech supporting 29 languages with emotional range.

TTS Standard

ElevenLabs Turbo V2.5

ElevenLabs

Fast text-to-speech optimized for low latency with good quality.

TTS Premium

ElevenLabs V3

ElevenLabs

Latest ElevenLabs model with best-in-class voice quality and expressiveness.

TTS Budget

Kokoro 82M

Jaaari (via Replicate)

Lightweight 82M parameter text-to-speech model with natural voices.

TTS Standard

Maya TTS

Fal.ai

State-of-the-art speech model for expressive voice generation.

Expressive voice generation

voice-cloning Standard

OpenVoice

MyShell (via Replicate)

Instant voice cloning with fine-grained control over style, emotion, and accent.

TTS Standard

Parler TTS

Replicate

Describe the voice you want in text. Generate speech matching your description.

TTS Premium

Speech 02 HD

MiniMax (via Replicate)

High-definition text-to-speech with premium voice quality.

TTS Standard

Speech 02 Turbo

MiniMax (via Replicate)

Fast text-to-speech variant of Speech 02 optimized for speed.

voice-cloning Standard

XTTS-v2

Coqui (via Replicate)

Clone any voice with just 6 seconds of audio. Supports 17 languages.

Clone voices in seconds

Browse by Category

Image Generation

DALL-E, Flux, Stable Diffusion

Video Generation

Sora, Veo, Kling, Runway

Text Generation

GPT-4, Claude, Gemini

Audio & TTS

ElevenLabs, OpenAI TTS

Browse by Provider

OpenAI Anthropic Google Stability AI Black Forest Labs Replicate