AI Models
Explore 141+ AI models for text, image, video, audio, and 3D generation. Compare capabilities, pricing tiers, and find the perfect model for your needs.
Showing 24 of 141 models
ImagineArt 1.5
ImagineArt (via Fal.ai)
Professional-grade visuals with lifelike realism, strong aesthetics, and accurate text rendering.
InstantID
Replicate
Zero-shot identity-preserving generation. Create images of a person in any style/scene.
Identity-preserving generation
Kling O3 I2V Pro
Kuaishou (via Fal.ai)
Kling O3 pro image-to-video optimized for first/last frame transitions and style guidance.
Kling V2.1
Kwai (via Replicate)
Official Kling V2.1 model optimized for stable motion and cinematic composition.
Kling Video 2.6
Kuaishou (via Fal.ai)
Advanced video generation model from Kuaishou. Known for excellent character consistency and motion quality.
Consistent character video
Kling Video Image-to-Video
Kuaishou (via Fal.ai)
Transform images into videos with Kling. Excellent at maintaining character and scene consistency.
Kling Video V3 Pro I2V
Kuaishou (via Fal.ai)
Kling 3.0 Pro image-to-video endpoint with cinematic motion and native audio output.
Kokoro 82M
Jaaari (via Replicate)
Lightweight 82M parameter text-to-speech model with natural voices.
LaTeX OCR
mickeybeurskens (via Replicate)
Convert math equation images directly to LaTeX code.
LongCat Image
Fal.ai
6B parameter model excelling at multilingual text rendering, photorealism, and deployment efficiency.
LTX-2
Lightricks (via Fal.ai)
Fast and affordable video generation model. Good balance of quality and cost for everyday use.
Fast & affordable video
LTX-2 19B I2V
Lightricks (via Fal.ai)
LTX-2 19B image-to-video model with native audio generation for richer outputs.
LTX-2 Image-to-Video
Lightricks (via Fal.ai)
Animate images with LTX-2. Quick and cost-effective image-to-video generation.
Lyria 2
Google (via Replicate)
Google's advanced music generation model for creating original compositions.
Google's AI music composer
Marigold Depth
Replicate
Diffusion-based depth estimation with fine details and sharp edges.
Marker
cuuupid (via Replicate)
Convert PDF, EPUB, and documents to clean Markdown or JSON.
PDF to Markdown
Maya TTS
Fal.ai
State-of-the-art speech model for expressive voice generation.
Expressive voice generation
Meshy
Meshy (via Fal.ai)
Complete 3D pipeline: text/image-to-3D with remeshing, retexturing, and rigging capabilities.
Complete 3D pipeline
MiniMax Music V2
MiniMax (via Fal.ai)
Generate original music tracks from text descriptions.
Music 1.5
MiniMax (via Replicate)
Generate royalty-free music tracks from text descriptions.
MVDream
ByteDance (via Replicate)
Multi-view diffusion for 3D generation from text. Creates consistent 3D assets from descriptions.
Nano Banana 2
Google (via Fal.ai)
Google's state-of-the-art fast image generation and editing model with strong prompt adherence.
Google fast flagship image model
Nano Banana Pro
Fal.ai
Fast and efficient image generation model. Excellent quality-to-speed ratio for rapid prototyping and production use.
Fast & efficient image gen
OmniHuman 1.5
ByteDance (via Fal.ai)
Advanced human generation and animation model.