AI Models

Explore 141+ AI models for text, image, video, audio, and 3D generation. Compare capabilities, pricing tiers, and find the perfect model for your needs.

Showing 24 of 141 models

Image Standard

ImagineArt 1.5

ImagineArt (via Fal.ai)

Professional-grade visuals with lifelike realism, strong aesthetics, and accurate text rendering.

face-generation Standard

InstantID

Replicate

Zero-shot identity-preserving generation. Create images of a person in any style/scene.

Identity-preserving generation

Video Premium

Kling O3 I2V Pro

Kuaishou (via Fal.ai)

Kling O3 pro image-to-video optimized for first/last frame transitions and style guidance.

Video Standard

Kling V2.1

Kwai (via Replicate)

Official Kling V2.1 model optimized for stable motion and cinematic composition.

Video Standard

Kling Video 2.6

Kuaishou (via Fal.ai)

Advanced video generation model from Kuaishou. Known for excellent character consistency and motion quality.

Consistent character video

Video Standard

Kling Video Image-to-Video

Kuaishou (via Fal.ai)

Transform images into videos with Kling. Excellent at maintaining character and scene consistency.

Video Premium

Kling Video V3 Pro I2V

Kuaishou (via Fal.ai)

Kling 3.0 Pro image-to-video endpoint with cinematic motion and native audio output.

TTS Budget

Kokoro 82M

Jaaari (via Replicate)

Lightweight 82M parameter text-to-speech model with natural voices.

ocr Budget

LaTeX OCR

mickeybeurskens (via Replicate)

Convert math equation images directly to LaTeX code.

Image Standard

LongCat Image

Fal.ai

6B parameter model excelling at multilingual text rendering, photorealism, and deployment efficiency.

Video Budget

LTX-2

Lightricks (via Fal.ai)

Fast and affordable video generation model. Good balance of quality and cost for everyday use.

Fast & affordable video

Video Standard

LTX-2 19B I2V

Lightricks (via Fal.ai)

LTX-2 19B image-to-video model with native audio generation for richer outputs.

Video Budget

LTX-2 Image-to-Video

Lightricks (via Fal.ai)

Animate images with LTX-2. Quick and cost-effective image-to-video generation.

Audio Premium

Lyria 2

Google (via Replicate)

Google's advanced music generation model for creating original compositions.

Google's AI music composer

depth-estimation Standard

Marigold Depth

Replicate

Diffusion-based depth estimation with fine details and sharp edges.

ocr Standard

Marker

cuuupid (via Replicate)

Convert PDF, EPUB, and documents to clean Markdown or JSON.

PDF to Markdown

TTS Standard

Maya TTS

Fal.ai

State-of-the-art speech model for expressive voice generation.

Expressive voice generation

3D Standard

Meshy

Meshy (via Fal.ai)

Complete 3D pipeline: text/image-to-3D with remeshing, retexturing, and rigging capabilities.

Complete 3D pipeline

Audio Standard

MiniMax Music V2

MiniMax (via Fal.ai)

Generate original music tracks from text descriptions.

Audio Standard

Music 1.5

MiniMax (via Replicate)

Generate royalty-free music tracks from text descriptions.

3D Standard

MVDream

ByteDance (via Replicate)

Multi-view diffusion for 3D generation from text. Creates consistent 3D assets from descriptions.

Image Premium

Nano Banana 2

Google (via Fal.ai)

Google's state-of-the-art fast image generation and editing model with strong prompt adherence.

Google fast flagship image model

Image Standard

Nano Banana Pro

Fal.ai

Fast and efficient image generation model. Excellent quality-to-speed ratio for rapid prototyping and production use.

Fast & efficient image gen

Video Premium

OmniHuman 1.5

ByteDance (via Fal.ai)

Advanced human generation and animation model.

Browse by Category

Image Generation

DALL-E, Flux, Stable Diffusion

Video Generation

Sora, Veo, Kling, Runway

Text Generation

GPT-4, Claude, Gemini

Audio & TTS

ElevenLabs, OpenAI TTS

Browse by Provider

OpenAI Anthropic Google Stability AI Black Forest Labs Replicate