AI Models

Explore 131+ AI models for text, image, video, audio, and 3D generation. Compare capabilities, pricing tiers, and find the perfect model for your needs.

Showing 24 of 131 models

Advanced Face Swap
face-swap Standard

Advanced Face Swap

Easel AI (via Fal.ai)

Swap faces between images with preserved lighting, texture, and natural blending.

Natural face swapping

Aura Flow
Image Standard

Aura Flow

Fal.ai

Fast and efficient image generation model.

Beatoven Music
Audio Standard

Beatoven Music

Beatoven (via Fal.ai)

Generate royalty-free instrumental music for any project.

Beatoven SFX
Audio Standard

Beatoven SFX

Beatoven (via Fal.ai)

Generate sound effects for videos, games, and multimedia.

BigColor
colorization Budget

BigColor

cjwbw (via Replicate)

Fast colorization model for quick B&W to color conversion.

BiRefNet
segmentation Standard

BiRefNet

Fal.ai

High-resolution dichotomous image segmentation for precise object extraction.

Bria Background Remove
BG Remove Budget

Bria Background Remove

Bria (via Fal.ai)

Automated background removal from images with high accuracy.

Chatterbox TTS
TTS Budget

Chatterbox TTS

Fal.ai

Text-to-speech for memes, videos, games, and AI agents.

Claude 3 Opus
Text Premium

Claude 3 Opus

Anthropic

Most capable Claude 3 model for highly complex tasks requiring deep analysis.

Claude 3.5 Haiku
Text Budget

Claude 3.5 Haiku

Anthropic

Fastest Claude model. Optimized for speed and cost while maintaining good quality.

Claude 3.5 Sonnet
Text Premium

Claude 3.5 Sonnet

Anthropic

Balanced model offering strong performance at a reasonable cost. Excellent for most tasks.

Claude Sonnet 4
Text Premium

Claude Sonnet 4

Anthropic

Latest Claude model with improved reasoning, coding, and instruction following.

Latest flagship reasoning

Creatify Aurora
Video Premium

Creatify Aurora

Creatify (via Fal.ai)

Generate studio-quality videos of avatars speaking or singing.

AI avatar videos

Crystal Upscaler
Upscale Standard

Crystal Upscaler

Clarity AI (via Fal.ai)

AI image upscaler that preserves fidelity, color, and detail.

DDColor
colorization Standard

DDColor

piddnad (via Replicate)

Automatic colorization of black-and-white photos with realistic colors.

Colorize B&W photos

DeepSeek Chat
Text Budget

DeepSeek Chat

DeepSeek

Powerful and cost-effective chat model from DeepSeek. Strong reasoning capabilities.

DeepSeek Coder
Code Budget

DeepSeek Coder

DeepSeek

Specialized coding model from DeepSeek. Excellent for code generation and analysis.

Demucs
music-separation Standard

Demucs

Meta (via Replicate)

Separate music into stems: vocals, drums, bass, and other instruments.

Split music into stems

Demucs 6-Stem
music-separation Standard

Demucs 6-Stem

Meta (via Replicate)

6-stem version separating vocals, drums, bass, guitar, piano, and other.

Depth Anything V2
depth-estimation Standard

Depth Anything V2

Fal.ai

State-of-the-art monocular depth estimation. Generate accurate depth maps from single images.

Accurate depth maps

Donut
ocr Standard

Donut

willywongi (via Replicate)

Extract structured data from receipts, invoices, and forms as JSON.

Receipt & invoice OCR

DWPose
pose-estimation Standard

DWPose

Fal.ai

Detect human poses including body, hands, and face keypoints.

Full-body pose detection

ElevenLabs Flash V2.5
TTS Budget

ElevenLabs Flash V2.5

ElevenLabs

Fastest ElevenLabs model for ultra-low latency text-to-speech.

ElevenLabs Multilingual V2
TTS Premium

ElevenLabs Multilingual V2

ElevenLabs

High-quality multilingual text-to-speech supporting 29 languages with emotional range.