Semua Model

5s videosGood motionFast generation720p output

WAN 2.1

Efficient video generation with good motion quality from Alibaba.

5-10s videosA/V syncHigher quality1080p output

WAN 2.5

Latest WAN model with improved quality and motion coherence.

5-15s videosEnhanced motionHigher quality1080p output

WAN 2.6

Latest WAN model with enhanced motion and longer durations.

5s videosGood motionFast generation480p/1080p output

WAN 2.2 Text-to-Video

Efficient video generation with good motion.

1024x1024Fast generationHigh qualityGreat prompt following

Flux Dev

High-quality image generation with excellent prompt following.

1024x1024Ultra fastGood qualityHigh throughput

Flux Schnell

Ultra-fast image generation for rapid prototyping.

Text to Videoopenai

Sora 2

OpenAI's flagship video generation model.

5-12s videosPhotorealisticHigh qualityTemporal consistency

Text to Videoopenai

Sora 2 Pro

Premium Sora model with enhanced quality.

5-12s videosPremium qualityEnhanced detailBest-in-class

4-8s videosHigh qualityMultiple durationsNatural motion

Google Veo 3.1

Google's flagship video generation model.

4-8s videosFast generationGood quality

Google Veo 3.1 Fast

Fast Google video generation.

4-8s videosHigh qualityRealistic motionMultiple aspect ratios

Google Veo 3

High-quality Google video generation.

4-8s videosFast generationGood quality

Google Veo 3 Fast

Fast Google video generation.

4-8s videosGood qualityReliable output

Google Veo 2

Reliable Google video generation.

2-12s videosEnhanced motionStart/end frame control480P/720P

Seedance 1.5 Pro

Latest Seedance with enhanced motion and frame control.

2-12s videosPremium qualityEnhanced detail

Seedance 1.0 Pro

Premium Seedance with enhanced quality.

2-12s videosPro qualityFast generation

Seedance 1.0 Pro Fast

Pro quality with fast generation.

5-10s videosFastestCost-effective

Seedance 1.0 Lite

Lightweight Seedance for rapid generation.

Text to Videovidu

Vidu Q2

High-quality video generation from Vidu.

2-8s videosHigh qualityGreat motionNatural movement

Text to Videovidu

Vidu Q1

Fast and efficient 1080p video generation.

1080p outputGood qualityFast generationAnime style support

5-10s videosGood qualityReliable output

Kling v1

Original Kling model with reliable quality.

5-10s videosImproved qualityBetter motion

Kling v1.5

Improved Kling with better motion coherence.

5-10s videosEnhanced qualityBetter motion control

Kling v1.6

Latest v1 series with enhanced quality.

5-10s videosReference imagesElements modeFrame control

Kling O1

Reference-based video with multi-image support.

5-10s videosCinematic qualityProfessional gradeMotion control

Kling v2 Master

Premium v2 with cinematic quality.

5-10s videosHigh qualityBetter motionImage-to-video

Kling v2.1

Balanced speed and fidelity for v2 series.

5-10s videosPremium qualityBest motionProfessional grade

Kling v2.1 Master

Premium v2.1 with highest quality.

5-10s videosFast generationGood qualityCost-effective

Kling v2.5 Turbo

Fast generation with good quality.

5-10s videosAudio generationHighest qualityLatest model

Kling v2.6

Latest Kling with audio generation.

3-15s videosAudio generationImproved motionNext-gen quality

Kling v3.0 Standard

Next-gen Kling with improved quality and audio.

3-15s videosAudio generationImproved motionImage-to-video

Kling v3.0 Standard

Next-gen Kling image-to-video with audio.

3-15s videosAudio generationHighest qualityPremium tier

Kling v3.0 Pro

Premium next-gen Kling with highest quality.

3-15s videosAudio generationHighest qualityImage-to-video

Kling v3.0 Pro

Premium next-gen Kling image-to-video.

Video to Videokling

Kling Lipsync

Add lip-synced speech to any video.

2-10s videosLip syncTTS synthesisVoice selection

5-10s videosImage animationNatural motion

Kling v1 Image-to-Video

Animate images with v1 quality.

5-10s videosCinematic motionPremium quality

Kling v2 Master Image-to-Video

Premium image animation with cinematic quality.

6-10s videosNatural motionHigh qualityCinematic output

Hailuo 2.3

High-quality video generation with natural motion.

FastLow costGood qualityMultiple sizes

Z-Image

Fast and affordable text-to-image generation.

Text to Imageopenai

GPT Image

OpenAI's flagship image generation model.

1024x1024PhotorealisticGreat prompt followingMultiple sizes

Image to Imageopenai

GPT Image

OpenAI's image transformation model.

Image inputPhotorealisticGreat prompt following

Text to Imageopenai

GPT Image 1.5

OpenAI's latest image model with enhanced capabilities.

1024x1024Auto sizeTransparent backgroundsBetter text rendering

Image to Imageopenai

GPT Image 1.5

OpenAI's latest image editing model.

Image inputLogo preservationFace preservationBetter instruction following

Image to Imageopenai

GPT Image 1 Mini

OpenAI's budget-friendly image editing model.

Image inputLow costFast generationGood for iterations

Text to Imagexai

Grok Imagine Image

xAI's creative and versatile image generation model.

Up to 10 imagesMultiple aspect ratiosQuality settings1k/2k resolution

Image to Imagexai

Grok Imagine Image

xAI's creative image editing model.

Image inputCreative editingUp to 10 outputsAuto aspect ratio

Text to Imagevertex

Nano Banana Pro

Strongest prompt-following model with exceptional detail.

4K supportBest prompt followingHigh detailCharacter consistency

Image to Imagevertex

Nano Banana Pro

Image transformation with strongest prompt following.

Image inputBest prompt followingHigh detail

Text to Imagevertex

Nano Banana

Ultra-high character consistency at affordable price.

Character consistencyCost-effectiveGood quality

Image to Imagevertex

Nano Banana

Image transformation with character consistency.

Image inputCharacter consistencyCost-effective

Text renderingComplex promptsHigh accuracy

Qwen Image

Great at complex text rendering in images.

Image to Imagealibaba

Qwen Image

Image transformation with text rendering capability.

Image inputText renderingComplex prompts

FastCreativeLow costGood quality

WAN 2.2

Fast and creative image generation.

PhotorealisticCreative controlMultiple outputs

WAN 2.5

Photorealism and creative control for image generation.

Image to Imagealibaba

WAN 2.5

Photorealistic image transformation.

Image inputPhotorealisticCreative control

LoRA supportArt generationFlexible styles

Flux Dev Lora

General art and design with LoRA support.

Context-awareConsistent outputsHigh quality

Flux Kontext Pro

Consistent, in-context image generation.

Image inputContext-awareConsistent outputs

Flux Kontext Pro

Context-aware image transformation.

Max contextComplex scenesHighest detail

Flux Kontext Max

Maximum context for detailed scenes.

Image inputMax contextComplex edits

Flux Kontext Max

Maximum context for complex image edits.

Next-genImproved qualityFast generation

Flux 2

Next-gen Flux with improved quality.

Multiple image inputsNext-gen qualityFast

Flux 2

Next-gen image transformation.

PremiumHighest qualityProfessional

Flux 2 Pro

Premium Flux 2 with highest quality.

Multiple image inputsPremium qualityProfessional

Flux 2 Pro

Premium image transformation.

FlexibleGood qualityCost-effective

Flux 2 Flex

Flexible generation at lower cost.

Multiple image inputsFlexibleCost-effective

Flux 2 Flex

Flexible image transformation.

Image inputCohesive stylesFast

Seedream 4.0

Image transformation with cohesive styles.

Image inputEnhanced qualityHigh detail

Seedream 4.5

Image transformation with enhanced quality.

Image editingPrecise controlHigh qualityFast

SeedEdit 3.0

Advanced image editing with precise control.

Multi-image inputBatch outputConsistent styleUp to 10 images

Seedream 4.0 Edit Sequential

Batch edit multiple images with consistent style.

Multi-image inputBatch outputEnhanced qualityHigh resolution

Seedream 4.5 Edit Sequential

Premium batch editing with enhanced quality.

6-10s videosImage inputFast generationNatural motion

Hailuo 2.3 Fast

Fast image-to-video with natural motion.

6-10s videosImage inputHigh qualityNatural motion

Hailuo 2.3

High quality image-to-video animation.

6-10s videosPremium qualityEnhanced detail

Hailuo 2.3 Pro

Premium text-to-video with enhanced quality.

6-10s videosFirst/last frameHigh qualityCinematic output

Hailuo 02

High-quality text-to-video with frame control.

6-10s videosFirst frame requiredLast frame requiredHigh quality

Hailuo 02 (First/Last Frame)

First/Last frame to video.

6s videos720PCost-effectiveGood quality

T2V-01

Affordable text-to-video generation.

6s videosCamera controlCinematic shots720P

T2V-01 Director

Text-to-video with camera control.

6s videosImage inputMulti-resolutionNatural motion

I2V-01

Image-to-video with natural motion.

6s videosImage inputCamera controlMulti-resolution

I2V-01 Director

Image-to-video with camera control.

6s videosImage inputLive-action styleMulti-resolution

I2V-01 Live

Live-action style image animation.

6s videosSubject referenceIdentity preservationCharacter consistency

S2V-01

Subject-driven video generation.

Text to Musicminimax

Music 2.0

AI music generation from text prompts and lyrics.

Text-to-musicLyrics supportStyle controlHigh quality audio

Text to Musicminimax

Music 2.5

Next-gen AI music generation with high-fidelity audio.

Text-to-musicLyrics supportHigh-fidelity audioConfigurable bitrate & sample rate

40 languages300+ voicesVoice cloningHD quality

Speech 2.6 HD

Latest HD TTS with outstanding prosody.

40 languages300+ voicesFast generationLow latency

Speech 2.6 Turbo

Fast TTS with 40 language support.

Superior rhythmHigh stabilityVoice cloningQuality sound

Speech 02 HD

Superior rhythm and voice quality.

Enhanced multilingualFast generationGood rhythmStable output

Speech 02 Turbo

Fast TTS with enhanced multilingual.

Image to Videoalibaba

WAN 2.5

Image-to-video with A/V sync.

5-10s videosImage inputA/V syncNatural motion

Image to Videoalibaba

WAN 2.6

Image-to-video with enhanced motion.

5-15s videosImage inputEnhanced motionAudio support

Video to Videoalibaba

WAN 2.5 Video Extend

Extend videos with AI-generated continuation.

3-10s extensionVideo inputOptional audio480p/720p/1080p

Video to Videobytedance

ByteDance Video Upscaler

AI super-resolution video upscaling to 4K.

1080p/2K/4K outputDetail recoveryTemporal consistencyArtifact cleanup

Video to Videobytedance

Seedance 1.5 Pro Video Extend

Extend videos with natural motion and stable aesthetics.

4-12s extensionVideo input480p/720pAudio generation

Video to Videobytedance

Seedance 1.5 Pro Video Extend Fast

Extend videos with natural motion continuation.

4-12s extensionVideo input720p/1080pAudio generation

2-12s videosImage inputEnhanced motionStart/end frame control

Seedance 1.5 Pro

Latest image-to-video with enhanced motion.

2-12s videosImage inputPremium qualityEnhanced detail

Seedance 1.0 Pro

Premium image-to-video animation.

2-12s videosImage inputPro qualityFast generation

Seedance 1.0 Pro Fast

Pro quality image-to-video with fast generation.

5-10s videosImage inputStart/end framesFastest

Seedance 1.0 Lite

Lightweight image-to-video with frame control.

Image to Videoopenai

Sora 2

OpenAI image-to-video with audio.

4-12s videosImage inputPhotorealisticAudio support

Image to Videoopenai

Sora 2 Pro

Premium OpenAI image-to-video.

4-12s videosImage inputPremium qualityAudio support

Image to Videogoogle

Google Veo 3.1

Google image-to-video with frame support.

4-8s videosImage inputHigh qualityFrame support

Image to Videogoogle

Google Veo 3.1 Fast

Fast Google image-to-video.

4-8s videosImage inputFast generationGood quality

Text to Videoxai

Grok Imagine Video

xAI's creative and flexible video generation model.

1-15s videosFlexible durationMultiple aspect ratios480p/720p resolution

Image to Videoxai

Grok Imagine Video

xAI's creative image-to-video animation.

1-15s videosImage inputFlexible durationMultiple aspect ratios

Video to Videoxai

Grok Imagine Video Edit

xAI's prompt-driven video editing model.

Object manipulationStyle transferScene controlMotion preservation

Image to Videorunway

Runway Gen4 Turbo

High-quality image-to-video with cinematic motion.

5-10s videosImage inputCinematic motionHigh quality

Video to Videorunway

Runway Gen4 Aleph

Transform videos with natural language editing.

Video editingNatural languageStyle transferObject manipulation

Video to Videorunway

Runway Upscale V1

4X AI video upscaling with enhanced details.

4X upscalingAI-enhanced detailTemporal consistencyUp to 10 min

Image to Videovidu

Vidu Q2

Vidu image-to-video with audio.

4-8s videosImage inputHigh qualityAudio support

70+ languagesEmotional deliveryNatural dialogue5min audio

Eleven V3

Most expressive TTS model with dramatic delivery.

29 languagesMost stableLong-form content10min audio

Eleven Multilingual V2

Most stable TTS model for professional content.

32 languagesUltra-fast (~75ms)50% cheaper40min audio

Eleven Flash V2.5

Fastest TTS model with ultra-low latency.

32 languagesLow latency (~250ms)50% cheaper40min audio

Eleven Turbo V2.5

Balanced TTS model with quality and speed.

English onlyUltra-fast (~75ms)50% cheaperReal-time ready

Eleven Flash V2

Ultra-fast English-only TTS model.

English onlyLow latency (~250ms)50% cheaperHigh quality

Eleven Turbo V2

High-quality English TTS with low latency.

Audio to Audioelevenlabs

ElevenLabs Voice Changer

Transform voice recordings into different voices.

Voice conversionEmotion preservationMultilingual supportBackground noise removal

Video to Videoelevenlabs

ElevenLabs Dubbing

AI-powered video/audio dubbing to translate content into 29+ languages.

29+ LanguagesVoice PreservationEmotion RetentionAuto Speaker Detection

Video to Videoalibaba

WAN 2.2 Animate

Animate characters or replace them in videos using motion transfer.

Motion TransferCharacter AnimationExpression Replication720p Output

Video to Videokling

Kling 2.6 Pro Motion Control

Transfer motion from reference videos to character images.

Motion TransferCharacter AnimationPro QualityUp to 30s Output

Video to Videokling

Kling O1 Video Edit

Edit videos with natural language instructions.

Natural Language EditingObject RemovalStyle TransferScene Transformation

Video Editlucy edit ai

Lucy Edit Dev

Fast AI video editing with text prompts.

Temporal ConsistencyObject ModificationStyle TransferMotion Preservation

Video Editlucy edit ai

Lucy Edit Pro

Professional AI video editing with resolution control.

Resolution ControlEnhanced QualityTemporal ConsistencyProfessional Output

Video Editlucy edit ai

Lucy Restyle

AI video style transfer with motion preservation.

Style TransferTemporal ConsistencyMotion PreservationArtistic Transformation

Reference to Videokling

Kling Video O1

Generate videos from character/scene reference images.

Multi-ReferenceIdentity ConsistencySubject ExtractionCreative Video Generation

Reference to Videokling

Kling Video O1 Standard

Cost-effective reference-to-video generation.

Up to 10 ReferencesIdentity ConsistencyCost-EffectiveStandard Quality

Reference to Videoalibaba

WAN 2.6 Reference-to-Video

Transform video references into new video shots.

Multi-View ReferenceIdentity PreservationSmooth Motion720P/1080P Output

Speech to Textopenai

OpenAI Whisper

Fast, accurate multilingual speech-to-text.

MultilingualAuto Language DetectionPunctuationNo Coldstarts

Speech to Textopenai

OpenAI Whisper with Video

Transcribe video audio with timestamps.

Video InputTimestampsSubtitle SegmentsMultilingual

Speech to Textopenai

OpenAI Whisper Turbo

Fastest Whisper model with great accuracy.

Turbo SpeedMultilingualAuto DetectionBest Performance

Text to 3Dtencent

Hunyuan3D v3

Generate detailed 3D models from text descriptions with texture support.

Text to 3DPBR MaterialsLowPoly ModeGLB/OBJ Output

Single Image to 3DMulti-view SupportPBR MaterialsLowPoly Mode

Hunyuan3D v3 Image

Convert images to detailed 3D models with optional multi-view enhancement.

Single Image to 3DPBR Materials4K TexturesGLB Output

Hunyuan3D v2.1

Single image to 3D with PBR materials and high-fidelity textures.

Single Image to 3D4K TexturesFast GenerationGLB Output

Hunyuan3D v2 Base

High-fidelity 3D from single image with 4K textures.

Single Image to 3DUltra FastLow CostGLB Output

Hunyuan3D v2 Mini

Fastest and most affordable image-to-3D model.

Multi-View Input3 Required ImagesHigh AccuracyTextured/White Mesh

Hunyuan3D v2 Multi-View

Best accuracy from 3 reference images.

Text generationReasoningCode generationMultilingual

GPT-4o

OpenAI flagship multimodal model.

Text generationAdvanced reasoningCode generation1M context

GPT-4.1

Latest GPT-4 with improved coding and reasoning.

Deep reasoningChain of thoughtMath & scienceComplex problem solving

O3

Advanced reasoning model with chain-of-thought.

State-of-the-art reasoningCreative writingCode generationResearch

GPT-5

Most capable GPT model.

LLManthropic

Claude Sonnet 4.6

Latest Sonnet with near-Opus performance.

Extended thinkingAdvanced codingAgentic tasks1M context

LLManthropic

Claude Sonnet 4

Latest balanced model for coding and analysis.

Strong codingFast responseCreative writing200K context

LLManthropic

Claude 3.5 Haiku

Fastest Claude model, cost-efficient.

Ultra-fastCost-efficientGood reasoning200K context

Fast inferenceThinking modeCode generation1M context

Gemini 2.5 Flash

Fast and efficient with reasoning.

Fast inferenceCost-efficientGeneral purpose1M context

Gemini 2.0 Flash

Budget-friendly general purpose model.

Ultra-low costFast inferenceSimple tasks

Gemini 2.5 Flash Lite

Most affordable Gemini model.

State-of-the-art reasoningAdvanced coding1M contextMultimodal

Gemini 2.5 Pro

Premium Gemini with top-tier reasoning.