AI Models

Explore 74+ AI models for text, image, video, audio, and 3D generation. Compare capabilities, pricing tiers, and find the perfect model for your needs.

Showing 24 of 74 models

TTS Standard

Parler TTS

Replicate

Describe the voice you want in text. Generate speech matching your description.

image-restoration Standard

Photo Restoration

Fal.ai

Restore old or damaged photos. Fix colors, remove scratches, enhance resolution.

Restore old photos

face-generation Standard

PhotoMaker

TencentARC (via Replicate)

Create photos of a person in various styles while preserving their identity.

Video Standard

Pika 2.2

Pika Labs (via Fal.ai)

Popular video generation model known for creative effects and image animation.

Creative video effects

Video Standard

PixVerse 5.5

PixVerse (via Fal.ai)

Creative video generation model. Great for artistic and stylized video content.

face-generation Standard

Professional Headshot

flux-kontext-apps (via Replicate)

Transform casual photos into professional studio-quality headshots.

Instant pro headshots

Image Standard

Recraft V3

Recraft (via Replicate)

Specialized in vector-style and design-focused images.

Vector & design focused

Image Standard

Recraft V3 SVG

Recraft (via Replicate)

Generate scalable vector graphics (SVG) from text prompts.

Image Standard

Riverflow 2.0 Fast

Sourceful (via Replicate)

Agentic image generation model with font control, transparent backgrounds, and image editing capabilities.

Agentic image gen with typography

segmentation Standard

SAM 2

Meta (via Fal.ai)

Segment Anything Model 2 - segment any object in images or videos with prompts.

Segment anything

controlnet Standard

SDXL ControlNet Union

Fal.ai

Multiple control types in one model: canny, depth, pose, segmentation, and more.

All-in-one ControlNet

Video Standard

Seedance 1 Lite

ByteDance (via Replicate)

Faster and lower-cost Seedance variant for high-volume video generation.

Image Standard

Seedream 4

ByteDance (via Replicate)

Text-to-image and single-sentence editing at up to 4K resolution.

TTS Standard

Speech 02 Turbo

MiniMax (via Replicate)

Fast text-to-speech variant of Speech 02 optimized for speed.

Image Standard

Stable Diffusion 3

Stability AI (via Replicate)

Latest Stable Diffusion with improved text rendering and composition.

image-restoration Standard

SwinIR

jingyunliang (via Replicate)

General image restoration: denoising, deblurring, and super-resolution.

Video Standard

Sync Lipsync V2

Fal.ai

Advanced lipsync technology for realistic talking videos.

3D Standard

Trellis

Microsoft (via Fal.ai)

Native 3D generative model using Structured LATents (SLAT) for versatile, high-quality 3D asset creation from images.

SLAT-based 3D generation

Video Standard

Veo 3.1 Fast

Google (via Fal.ai)

Faster version of Veo 3.1 optimized for quick video generation with good quality.

Video Standard

Wan 2.2 Image-to-Video

Alibaba (via Fal.ai)

Alibaba's image-to-video model. Efficient and reliable video generation from images.

Video Standard

Wan 2.5 Image-to-Video

Wan Video (via Replicate)

Animate images into videos with natural motion and high fidelity.

Video Standard

Wan 2.5 Text-to-Video

Wan Video (via Replicate)

High-quality text-to-video generation with smooth motion.

speech-to-text Standard

Whisper

OpenAI (via Fal.ai)

OpenAI Whisper large v3 for accurate speech transcription and translation. Supports 99+ languages.

Accurate speech transcription

speech-to-text Standard

Wizper

Fal.ai

Optimized Whisper v3 by Fal.ai - same accuracy, 2x faster performance.

2x faster Whisper

Browse by Category

Image Generation

DALL-E, Flux, Stable Diffusion

Video Generation

Sora, Veo, Kling, Runway

Text Generation

GPT-4, Claude, Gemini

Audio & TTS

ElevenLabs, OpenAI TTS

Browse by Provider

OpenAI Anthropic Google Stability AI Black Forest Labs Replicate