AI Models

Explore 141+ AI models for text, image, video, audio, and 3D generation. Compare capabilities, pricing tiers, and find the perfect model for your needs.

Showing 24 of 141 models

ImagineArt 1.5
Image Standard

ImagineArt (via Fal.ai)

Professional-grade visuals with lifelike realism, strong aesthetics, and accurate text rendering.

InstantID
Face Generation Standard

Replicate

Zero-shot identity-preserving generation. Create images of a person in any style or scene.

Identity-preserving generation

Kling O3 I2V Pro
Video Premium

Kuaishou (via Fal.ai)

Kling O3 Pro image-to-video model optimized for first/last frame transitions and style guidance.

Kling V2.1
Video Standard

Kwai (via Replicate)

Official Kling V2.1 model optimized for stable motion and cinematic composition.

Kling Video 2.6
Video Standard

Kuaishou (via Fal.ai)

Advanced video generation model from Kuaishou. Known for excellent character consistency and motion quality.

Consistent character video

Kling Video Image-to-Video
Video Standard

Kuaishou (via Fal.ai)

Transform images into videos with Kling. Excellent at maintaining character and scene consistency.

Kling Video V3 Pro I2V
Video Premium

Kuaishou (via Fal.ai)

Kling 3.0 Pro image-to-video endpoint with cinematic motion and native audio output.

Kokoro 82M
TTS Budget

Jaaari (via Replicate)

Lightweight 82M-parameter text-to-speech model with natural voices.

LaTeX OCR
OCR Budget

mickeybeurskens (via Replicate)

Convert math equation images directly to LaTeX code.

LongCat Image
Image Standard

Fal.ai

6B-parameter model excelling at multilingual text rendering, photorealism, and deployment efficiency.

LTX-2
Video Budget

Lightricks (via Fal.ai)

Fast and affordable video generation model. Good balance of quality and cost for everyday use.

Fast & affordable video

LTX-2 19B I2V
Video Standard

Lightricks (via Fal.ai)

LTX-2 19B image-to-video model with native audio generation for richer outputs.

LTX-2 Image-to-Video
Video Budget

Lightricks (via Fal.ai)

Animate images with LTX-2. Quick and cost-effective image-to-video generation.

Lyria 2
Audio Premium

Google (via Replicate)

Google's advanced music generation model for creating original compositions.

Google's AI music composer

Marigold Depth
Depth Estimation Standard

Replicate

Diffusion-based depth estimation with fine details and sharp edges.

Marker
OCR Standard

cuuupid (via Replicate)

Convert PDF, EPUB, and other documents to clean Markdown or JSON.

PDF to Markdown

Maya TTS
TTS Standard

Fal.ai

State-of-the-art speech model for expressive voice generation.

Expressive voice generation

Meshy
3D Standard

Meshy (via Fal.ai)

Complete 3D pipeline: text/image-to-3D with remeshing, retexturing, and rigging capabilities.

Complete 3D pipeline

MiniMax Music V2
Audio Standard

MiniMax (via Fal.ai)

Generate original music tracks from text descriptions.

Music 1.5
Audio Standard

MiniMax (via Replicate)

Generate royalty-free music tracks from text descriptions.

MVDream
3D Standard

ByteDance (via Replicate)

Multi-view diffusion for 3D generation from text. Creates consistent 3D assets from descriptions.

Nano Banana 2
Image Premium

Google (via Fal.ai)

Google's state-of-the-art fast image generation and editing model with strong prompt adherence.

Google fast flagship image model

Nano Banana Pro
Image Standard

Fal.ai

Fast and efficient image generation model. Excellent quality-to-speed ratio for rapid prototyping and production use.

Fast & efficient image gen

OmniHuman 1.5
Video Premium

ByteDance (via Fal.ai)

Advanced human generation and animation model.
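
Each entry in this listing carries a name, a category, a pricing tier, and a provider, so comparing models by category and tier can be sketched as a simple filter over records. The snippet below is only an illustration: the `ModelEntry` type, the `CATALOG` list, and the `find_models` helper are hypothetical names invented here, not part of any provider's API, and the sample rows are copied from a few of the entries above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    """One catalog card (hypothetical structure, for illustration only)."""
    name: str
    category: str  # e.g. "Video", "Image", "TTS"
    tier: str      # "Budget", "Standard", or "Premium"
    provider: str


# A handful of entries copied from the listing above.
CATALOG = [
    ModelEntry("Kling V2.1", "Video", "Standard", "Kwai (via Replicate)"),
    ModelEntry("LTX-2", "Video", "Budget", "Lightricks (via Fal.ai)"),
    ModelEntry("LTX-2 Image-to-Video", "Video", "Budget", "Lightricks (via Fal.ai)"),
    ModelEntry("OmniHuman 1.5", "Video", "Premium", "ByteDance (via Fal.ai)"),
    ModelEntry("Kokoro 82M", "TTS", "Budget", "Jaaari (via Replicate)"),
    ModelEntry("Nano Banana 2", "Image", "Premium", "Google (via Fal.ai)"),
]


def find_models(catalog, category=None, tier=None):
    """Return names of entries matching the given category and/or tier.

    A filter left as None matches every entry.
    """
    return [
        m.name
        for m in catalog
        if (category is None or m.category == category)
        and (tier is None or m.tier == tier)
    ]


# Example: list the budget-tier video models in the sample.
print(find_models(CATALOG, category="Video", tier="Budget"))
```

Keeping category and tier as separate fields means either filter can be applied on its own, for example listing every Premium model regardless of category.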