AI Models

Explore 74+ AI models for text, image, video, audio, and 3D generation. Compare capabilities, pricing tiers, and find the perfect model for your needs.

Showing 24 of 74 models

face-swap Standard

Advanced Face Swap

Easel AI (via Fal.ai)

Swap faces between images with preserved lighting, texture, and natural blending.

Natural face swapping

Image Standard

Aura Flow

Fal.ai

Fast and efficient image generation model.

Audio Standard

Beatoven Music

Beatoven (via Fal.ai)

Generate royalty-free instrumental music for any project.

Audio Standard

Beatoven SFX

Beatoven (via Fal.ai)

Generate sound effects for videos, games, and multimedia.

segmentation Standard

BiRefNet

Fal.ai

High-resolution dichotomous image segmentation for precise object extraction.

Upscale Standard

Crystal Upscaler

Clarity AI (via Fal.ai)

AI image upscaler that preserves fidelity, color, and detail.

colorization Standard

DDColor

piddnad (via Replicate)

Automatic colorization of black-and-white photos with realistic colors.

Colorize B&W photos

music-separation Standard

Demucs

Meta (via Replicate)

Separate music into stems: vocals, drums, bass, and other instruments.

Split music into stems

music-separation Standard

Demucs 6-Stem

Meta (via Replicate)

6-stem version separating vocals, drums, bass, guitar, piano, and other.

depth-estimation Standard

Depth Anything V2

Fal.ai

State-of-the-art monocular depth estimation. Generate accurate depth maps from single images.

Accurate depth maps

ocr Standard

Donut

willywongi (via Replicate)

Extract structured data from receipts, invoices, and forms as JSON.

Receipt & invoice OCR

pose-estimation Standard

DWPose

Fal.ai

Detect human poses including body, hands, and face keypoints.

Full-body pose detection

TTS Standard

ElevenLabs Turbo V2.5

ElevenLabs

Fast text-to-speech optimized for low latency with good quality.

Image Standard

Flux 2 Flex

Black Forest Labs (via Fal.ai)

Flexible Flux 2 variant optimized for versatile image generation and editing.

Image Standard

Flux Dev

Black Forest Labs (via Replicate)

Development version of Flux. Good balance of quality and cost.

Image Standard

Flux Realism

Black Forest Labs (via Fal.ai)

Flux model fine-tuned for photorealistic output.

controlnet Standard

FLUX.1 Canny Dev

Black Forest Labs (via Replicate)

Open-weight edge-guided FLUX model for development.

controlnet Standard

FLUX.1 Depth Dev

Black Forest Labs (via Replicate)

Open-weight depth-guided FLUX model for development.

inpainting Standard

FLUX.1 Fill Dev

Black Forest Labs (via Fal.ai)

Open-weight FLUX inpainting model for development and fine-tuning.

inpainting Standard

Fooocus Inpaint

Fal.ai

Multi-mode inpainting: fill areas, improve details (face/hands), or modify content.

Text Standard

Gemini 2.0 Flash

Google

Fast and efficient Gemini model. Multimodal with massive context window.

Text Standard

Gemini 2.5 Flash

Google

Stable production-ready Gemini model with excellent speed and quality balance. Great for high-volume applications.

Text Standard

Gemini 3 Flash

Google

Google's latest and most intelligent flash model. Features enhanced reasoning, improved multimodal understanding, and faster response times.

Google's latest AI

ocr Standard

GOT-OCR 2.0

Fal.ai

Universal OCR for documents, scene text, tables, math formulas, sheet music, and more.

Universal OCR engine

Browse by Category

Image Generation

DALL-E, Flux, Stable Diffusion

Video Generation

Sora, Veo, Kling, Runway

Text Generation

GPT-4, Claude, Gemini

Audio & TTS

ElevenLabs, OpenAI TTS

Browse by Provider

OpenAI Anthropic Google Stability AI Black Forest Labs Replicate