Grupos de Modelos:

Todos os Modelos

159 Modelos

Ordenar por:

Text to Image

Seedream 4.5

ByteDance

ByteDance's flagship text-to-image model with exceptional photorealism.

2048x2048PhotorealisticHigh aesthetic

Text to Image

Seedream 5.0 Lite

ByteDance

Latest Seedream with 2K/3K resolution and PNG output support.

2K/3K ResolutionPNG OutputMulti-Reference I2I

Text to Image

Seedream 4.0

ByteDance

High-performance image generation with excellent quality.

1024x1024FastGood quality

Text to Image

Seedream 3.0

ByteDance

Cost-effective image generation with good quality.

1024x1024FastCost-effective

Text to Video

WAN 2.1

Alibaba

Efficient video generation with good motion quality from Alibaba.

5s videosGood motionFast generation

Text to Video

WAN 2.5

Alibaba

Latest WAN model with improved quality and motion coherence.

5-10s videosA/V syncHigher quality

Text to Video

WAN 2.6

Alibaba

Latest WAN model with enhanced motion and longer durations.

5-15s videosEnhanced motionHigher quality

Text to Video

WAN 2.2 Text-to-Video

Alibaba

Efficient video generation with good motion.

5s videosGood motionFast generation

Text to Image

Flux Dev

Black Forest Labs

High-quality image generation with excellent prompt following.

1024x1024Fast generationHigh quality

Text to Image

Flux Schnell

Black Forest Labs

Ultra-fast image generation for rapid prototyping.

1024x1024Ultra fastGood quality

Text to Video

Sora 2

OpenAI

OpenAI's flagship video generation model.

5-12s videosPhotorealisticHigh quality

Text to Video

Sora 2 Pro

OpenAI

Premium Sora model with enhanced quality.

5-12s videosPremium qualityEnhanced detail

Text to Video

Google Veo 3.1

Google

Google's flagship video generation model.

4-8s videosHigh qualityMultiple durations

Text to Video

Google Veo 3.1 Fast

Google

Fast Google video generation.

4-8s videosFast generationGood quality

Text to Video

Google Veo 3

Google

High-quality Google video generation.

4-8s videosHigh qualityRealistic motion

Text to Video

Google Veo 3 Fast

Google

Fast Google video generation.

4-8s videosFast generationGood quality

Text to Video

Google Veo 2

Google

Reliable Google video generation.

4-8s videosGood qualityReliable output

Text to Video

Seedance 1.5 Pro

ByteDance

Latest Seedance with enhanced motion and frame control.

4-12s videosEnhanced motionStart/end frame control

Text to Video

Seedance 1.0 Pro

ByteDance

Premium Seedance with enhanced quality.

2-12s videosPremium qualityEnhanced detail

Text to Video

Seedance 1.0 Pro Fast

ByteDance

Pro quality with fast generation.

2-12s videosPro qualityFast generation

Text to Video

Seedance 1.0 Lite

ByteDance

Lightweight Seedance for rapid generation.

5-10s videosFastestCost-effective

Text to Video

Vidu Q2

Vidu

High-quality video generation from Vidu.

2-8s videosHigh qualityGreat motion

Text to Video

Vidu Q1

Vidu

Fast and efficient 1080p video generation.

1080p outputGood qualityFast generation

Text to Video

Kling v1

Kling

Original Kling model with reliable quality.

5-10s videosGood qualityReliable output

Text to Video

Kling v1.5

Kling

Improved Kling with better motion coherence.

5-10s videosImproved qualityBetter motion

Text to Video

Kling v1.6

Kling

Latest v1 series with enhanced quality.

5-10s videosEnhanced qualityBetter motion control

Image to Video

Kling O1

Kling

Reference-based video with multi-image support.

5-10s videosReference imagesElements mode

Text to Video

Kling v2 Master

Kling

Premium v2 with cinematic quality.

5-10s videosCinematic qualityProfessional grade

Image to Video

Kling v2.1

Kling

Balanced speed and fidelity for v2 series.

5-10s videosHigh qualityBetter motion

Text to Video

Kling v2.1 Master

Kling

Premium v2.1 with highest quality.

5-10s videosPremium qualityBest motion

Text to Video

Kling v2.5 Turbo

Kling

Fast generation with good quality.

5-10s videosFast generationGood quality

Text to Video

Kling v2.6

Kling

Latest Kling with audio generation.

5-10s videosAudio generationHighest quality

Text to Video

Kling v3.0 Standard

Kling

Next-gen Kling with improved quality and audio.

3-15s videosAudio generationImproved motion

Image to Video

Kling v3.0 Standard

Kling

Next-gen Kling image-to-video with audio.

3-15s videosAudio generationImproved motion

Text to Video

Kling v3.0 Pro

Kling

Premium next-gen Kling with highest quality.

3-15s videosAudio generationHighest quality

Image to Video

Kling v3.0 Pro

Kling

Premium next-gen Kling image-to-video.

3-15s videosAudio generationHighest quality

Video to Video

Kling Lipsync

Kling

Add lip-synced speech to any video.

2-10s videosLip syncTTS synthesis

Image to Video

Kling v1 Image-to-Video

Kling

Animate images with v1 quality.

5-10s videosImage animationNatural motion

Image to Video

Kling v2 Master Image-to-Video

Kling

Premium image animation with cinematic quality.

5-10s videosCinematic motionPremium quality

Text to Video

Hailuo 2.3

MiniMax

High-quality video generation with natural motion.

6-10s videosNatural motionHigh quality

Text to Image

Z-Image

Alibaba

Fast and affordable text-to-image generation.

FastLow costGood quality

Text to Image

GPT Image

OpenAI

OpenAI's flagship image generation model.

1024x1024PhotorealisticGreat prompt following

Image to Image

GPT Image

OpenAI

OpenAI's image transformation model.

Image inputPhotorealisticGreat prompt following

Text to Image

GPT Image 1.5

OpenAI

OpenAI's latest image model with enhanced capabilities.

1024x1024Auto sizeTransparent backgrounds

Image to Image

GPT Image 1.5

OpenAI

OpenAI's latest image editing model.

Image inputLogo preservationFace preservation

Image to Image

GPT Image 1 Mini

OpenAI

OpenAI's budget-friendly image editing model.

Image inputLow costFast generation

Text to Image

Grok Imagine Image

xAI

xAI's creative and versatile image generation model.

Up to 10 imagesMultiple aspect ratiosQuality settings

Image to Image

Grok Imagine Image

xAI

xAI's creative image editing model.

Image inputCreative editingUp to 10 outputs

Text to Image

Nano Banana Pro

Vertex

Strongest prompt-following model with exceptional detail.

4K supportBest prompt followingHigh detail

Text to Image

Nano Banana 2

Vertex

Pro quality at Flash speed with 4K and PNG support.

4K supportSub-second speedCharacter consistency

Image to Image

Nano Banana Pro

Vertex

Image transformation with strongest prompt following.

Image inputBest prompt followingHigh detail

Image to Image

Nano Banana 2

Vertex

Image transformation at Flash speed with multi-reference support.

Image inputMulti-Reference (up to 14)4K support

Text to Image

Nano Banana

Vertex

Ultra-high character consistency at affordable price.

Character consistencyCost-effectiveGood quality

Image to Image

Nano Banana

Vertex

Image transformation with character consistency.

Image inputCharacter consistencyCost-effective

Text to Image

Qwen Image

Alibaba

Great at complex text rendering in images.

Text renderingComplex promptsHigh accuracy

Image to Image

Qwen Image

Alibaba

Image transformation with text rendering capability.

Image inputText renderingComplex prompts

Text to Image

WAN 2.2

Alibaba

Fast and creative image generation.

FastCreativeLow cost

Text to Image

WAN 2.5

Alibaba

Photorealism and creative control for image generation.

PhotorealisticCreative controlMultiple outputs

Image to Image

WAN 2.5

Alibaba

Photorealistic image transformation.

Image inputPhotorealisticCreative control

Text to Image

Flux Dev Lora

Black Forest Labs

General art and design with LoRA support.

LoRA supportArt generationFlexible styles

Text to Image

Flux Kontext Pro

Black Forest Labs

Consistent, in-context image generation.

Context-awareConsistent outputsHigh quality

Image to Image

Flux Kontext Pro

Black Forest Labs

Context-aware image transformation.

Image inputContext-awareConsistent outputs

Text to Image

Flux Kontext Max

Black Forest Labs

Maximum context for detailed scenes.

Max contextComplex scenesHighest detail

Image to Image

Flux Kontext Max

Black Forest Labs

Maximum context for complex image edits.

Image inputMax contextComplex edits

Text to Image

Flux 2

Black Forest Labs

Next-gen Flux with improved quality.

Next-genImproved qualityFast generation

Image to Image

Flux 2

Black Forest Labs

Next-gen image transformation.

Multiple image inputsNext-gen qualityFast

Text to Image

Flux 2 Pro

Black Forest Labs

Premium Flux 2 with highest quality.

PremiumHighest qualityProfessional

Image to Image

Flux 2 Pro

Black Forest Labs

Premium image transformation.

Multiple image inputsPremium qualityProfessional

Text to Image

Flux 2 Flex

Black Forest Labs

Flexible generation at lower cost.

FlexibleGood qualityCost-effective

Image to Image

Flux 2 Flex

Black Forest Labs

Flexible image transformation.

Multiple image inputsFlexibleCost-effective

Image to Image

Seedream 4.0

ByteDance

Image transformation with cohesive styles.

Image inputCohesive stylesFast

Image to Image

Seedream 4.5

ByteDance

Image transformation with enhanced quality.

Image inputEnhanced qualityHigh detail

Image to Image

Seedream 5.0 Lite

ByteDance

Image transformation with 2K/3K and multi-reference input.

Image input2K/3K ResolutionMulti-Reference (2-14)

Image to Image

SeedEdit 3.0

ByteDance

Advanced image editing with precise control.

Image editingPrecise controlHigh quality

Image to Image

Seedream 4.0 Edit Sequential

ByteDance

Batch edit multiple images with consistent style.

Multi-image inputBatch outputConsistent style

Image to Image

Seedream 4.5 Edit Sequential

ByteDance

Premium batch editing with enhanced quality.

Multi-image inputBatch outputEnhanced quality

Text to Image

Seedream 5.0 Lite Edit Sequential

ByteDance

Batch text-to-image with 2K/3K and PNG support.

Batch output (up to 15)2K/3K ResolutionPNG Output

Image to Image

Seedream 5.0 Lite Edit Sequential

ByteDance

Batch image generation with 2K/3K and PNG support.

Multi-image input (up to 14)Batch output (up to 15)2K/3K Resolution

Image to Video

Hailuo 2.3 Fast

MiniMax

Fast image-to-video with natural motion.

6-10s videosImage inputFast generation

Image to Video

Hailuo 2.3

MiniMax

High quality image-to-video animation.

6-10s videosImage inputHigh quality

Text to Video

Hailuo 2.3 Pro

MiniMax

Premium text-to-video with enhanced quality.

6-10s videosPremium qualityEnhanced detail

Text to Video

Hailuo 02

MiniMax

High-quality text-to-video with frame control.

6-10s videosFirst/last frameHigh quality

Image to Video

Hailuo 02 (First/Last Frame)

MiniMax

First/Last frame to video.

6-10s videosFirst frame requiredLast frame required

Text to Video

T2V-01

MiniMax

Affordable text-to-video generation.

6s videos720PCost-effective

Text to Video

T2V-01 Director

MiniMax

Text-to-video with camera control.

6s videosCamera controlCinematic shots

Image to Video

I2V-01

MiniMax

Image-to-video with natural motion.

6s videosImage inputMulti-resolution

Image to Video

I2V-01 Director

MiniMax

Image-to-video with camera control.

6s videosImage inputCamera control

Image to Video

I2V-01 Live

MiniMax

Live-action style image animation.

6s videosImage inputLive-action style

Image to Video

S2V-01

MiniMax

Subject-driven video generation.

6s videosSubject referenceIdentity preservation

Text to Music

Music 2.0

MiniMax

AI music generation from text prompts and lyrics.

Text-to-musicLyrics supportStyle control

Text to Music

Music 2.5

MiniMax

Next-gen AI music generation with high-fidelity audio.

Text-to-musicLyrics supportHigh-fidelity audio

Text to Audio

Speech 2.6 HD

MiniMax

Latest HD TTS with outstanding prosody.

40 languages300+ voicesVoice cloning

Text to Audio

Speech 2.6 Turbo

MiniMax

Fast TTS with 40 language support.

40 languages300+ voicesFast generation

Text to Audio

Speech 02 HD

MiniMax

Superior rhythm and voice quality.

Superior rhythmHigh stabilityVoice cloning

Text to Audio

Speech 02 Turbo

MiniMax

Fast TTS with enhanced multilingual.

Enhanced multilingualFast generationGood rhythm

Image to Video

WAN 2.5

Alibaba

Image-to-video with A/V sync.

5-10s videosImage inputA/V sync

Image to Video

WAN 2.6

Alibaba

Image-to-video with enhanced motion.

5-15s videosImage inputEnhanced motion

Video to Video

WAN 2.5 Video Extend

Alibaba

Extend videos with AI-generated continuation.

3-10s extensionVideo inputOptional audio

Video to Video

ByteDance Video Upscaler

ByteDance

AI super-resolution video upscaling to 4K.

1080p/2K/4K outputDetail recoveryTemporal consistency

Video to Video

Seedance 1.5 Pro Video Extend

ByteDance

Extend videos with natural motion and stable aesthetics.

4-12s extensionVideo input480p/720p

Video to Video

Seedance 1.5 Pro Video Extend Fast

ByteDance

Extend videos with natural motion continuation.

4-12s extensionVideo input720p/1080p

Image to Video

Seedance 1.5 Pro

ByteDance

Latest image-to-video with enhanced motion.

4-12s videosImage inputEnhanced motion

Image to Video

Seedance 1.0 Pro

ByteDance

Premium image-to-video animation.

2-12s videosImage inputPremium quality

Image to Video

Seedance 1.0 Pro Fast

ByteDance

Pro quality image-to-video with fast generation.

2-12s videosImage inputPro quality

Image to Video

Seedance 1.0 Lite

ByteDance

Lightweight image-to-video with frame control.

5-10s videosImage inputStart/end frames

Image to Video

Sora 2

OpenAI

OpenAI image-to-video with audio.

4-12s videosImage inputPhotorealistic

Image to Video

Sora 2 Pro

OpenAI

Premium OpenAI image-to-video.

4-12s videosImage inputPremium quality

Image to Video

Google Veo 3.1

Google

Google image-to-video with frame support.

4-8s videosImage inputHigh quality

Image to Video

Google Veo 3.1 Fast

Google

Fast Google image-to-video.

4-8s videosImage inputFast generation

Text to Video

Grok Imagine Video

xAI

xAI's creative and flexible video generation model.

1-15s videosFlexible durationMultiple aspect ratios

Image to Video

Grok Imagine Video

xAI

xAI's creative image-to-video animation.

1-15s videosImage inputFlexible duration

Video to Video

Grok Imagine Video Edit

xAI

xAI's prompt-driven video editing model.

Object manipulationStyle transferScene control

Image to Video

Runway Gen4 Turbo

Runway

High-quality image-to-video with cinematic motion.

5-10s videosImage inputCinematic motion

Video to Video

Runway Gen4 Aleph

Runway

Transform videos with natural language editing.

Video editingNatural languageStyle transfer

Video to Video

Runway Upscale V1

Runway

4X AI video upscaling with enhanced details.

4X upscalingAI-enhanced detailTemporal consistency

Image to Video

Vidu Q2

Vidu

Vidu image-to-video with audio.

4-8s videosImage inputHigh quality

Text to Audio

Eleven V3

ElevenLabs

Most expressive TTS model with dramatic delivery.

70+ languagesEmotional deliveryNatural dialogue

Text to Audio

Eleven Multilingual V2

ElevenLabs

Most stable TTS model for professional content.

29 languagesMost stableLong-form content

Text to Audio

Eleven Flash V2.5

ElevenLabs

Fastest TTS model with ultra-low latency.

32 languagesUltra-fast (~75ms)50% cheaper

Text to Audio

Eleven Turbo V2.5

ElevenLabs

Balanced TTS model with quality and speed.

32 languagesLow latency (~250ms)50% cheaper

Text to Audio

Eleven Flash V2

ElevenLabs

Ultra-fast English-only TTS model.

English onlyUltra-fast (~75ms)50% cheaper

Text to Audio

Eleven Turbo V2

ElevenLabs

High-quality English TTS with low latency.

English onlyLow latency (~250ms)50% cheaper

Audio to Audio

ElevenLabs Voice Changer

ElevenLabs

Transform voice recordings into different voices.

Voice conversionEmotion preservationMultilingual support

Video to Video

ElevenLabs Dubbing

ElevenLabs

AI-powered video/audio dubbing to translate content into 29+ languages.

29+ LanguagesVoice PreservationEmotion Retention

Video to Video

WAN 2.2 Animate

Alibaba

Animate characters or replace them in videos using motion transfer.

Motion TransferCharacter AnimationExpression Replication

Video to Video

Kling 2.6 Pro Motion Control

Kling

Transfer motion from reference videos to character images.

Motion TransferCharacter AnimationPro Quality

Video to Video

Kling O1 Video Edit

Kling

Edit videos with natural language instructions.

Natural Language EditingObject RemovalStyle Transfer

Video Edit

Lucy Edit Dev

Lucy Edit AI

Fast AI video editing with text prompts.

Temporal ConsistencyObject ModificationStyle Transfer

Video Edit

Lucy Edit Pro

Lucy Edit AI

Professional AI video editing with resolution control.

Resolution ControlEnhanced QualityTemporal Consistency

Video Edit

Lucy Restyle

Lucy Edit AI

AI video style transfer with motion preservation.

Style TransferTemporal ConsistencyMotion Preservation

Reference to Video

Kling Video O1

Kling

Generate videos from character/scene reference images.

Multi-ReferenceIdentity ConsistencySubject Extraction

Reference to Video

Kling Video O1 Standard

Kling

Cost-effective reference-to-video generation.

Up to 10 ReferencesIdentity ConsistencyCost-Effective

Reference to Video

WAN 2.6 Reference-to-Video

Alibaba

Transform video references into new video shots.

Multi-View ReferenceIdentity PreservationSmooth Motion

Speech to Text

OpenAI Whisper

OpenAI

Fast, accurate multilingual speech-to-text.

MultilingualAuto Language DetectionPunctuation

Speech to Text

OpenAI Whisper with Video

OpenAI

Transcribe video audio with timestamps.

Video InputTimestampsSubtitle Segments

Speech to Text

OpenAI Whisper Turbo

OpenAI

Fastest Whisper model with great accuracy.

Turbo SpeedMultilingualAuto Detection

Text to 3D

Hunyuan3D v3

Tencent

Generate detailed 3D models from text descriptions with texture support.

Text to 3DPBR MaterialsLowPoly Mode

Image to 3D

Hunyuan3D v3 Image

Tencent

Convert images to detailed 3D models with optional multi-view enhancement.

Single Image to 3DMulti-view SupportPBR Materials

Image to 3D

Hunyuan3D v2.1

Tencent

Single image to 3D with PBR materials and high-fidelity textures.

Single Image to 3DPBR Materials4K Textures

Image to 3D

Hunyuan3D v2 Base

Tencent

High-fidelity 3D from single image with 4K textures.

Single Image to 3D4K TexturesFast Generation

Image to 3D

Hunyuan3D v2 Mini

Tencent

Fastest and most affordable image-to-3D model.

Single Image to 3DUltra FastLow Cost

Image to 3D

Hunyuan3D v2 Multi-View

Tencent

Best accuracy from 3 reference images.

Multi-View Input3 Required ImagesHigh Accuracy

OpenAI

LLM

GPT-4o

OpenAI

OpenAI flagship multimodal model.

Text generationReasoningCode generation

OpenAI

LLM

GPT-4.1

OpenAI

Latest GPT-4 with improved coding and reasoning.

Text generationAdvanced reasoningCode generation

OpenAI

LLM

O3

OpenAI

Advanced reasoning model with chain-of-thought.

Deep reasoningChain of thoughtMath & science

OpenAI

LLM

GPT-5

OpenAI

Most capable GPT model.

State-of-the-art reasoningCreative writingCode generation

Anthropic

LLM

Claude Sonnet 4.6

Anthropic

Latest Sonnet with near-Opus performance.

Extended thinkingAdvanced codingAgentic tasks

Anthropic

LLM

Claude Sonnet 4

Anthropic

Latest balanced model for coding and analysis.

Strong codingFast responseCreative writing

Anthropic

LLM

Claude 3.5 Haiku

Anthropic

Fastest Claude model, cost-efficient.

Ultra-fastCost-efficientGood reasoning

Google

LLM

Gemini 2.5 Flash

Google

Fast and efficient with reasoning.

Fast inferenceThinking modeCode generation

Google

LLM

Gemini 2.0 Flash

Google

Budget-friendly general purpose model.

Fast inferenceCost-efficientGeneral purpose

Google

LLM

Gemini 2.5 Flash Lite

Google

Most affordable Gemini model.

Ultra-low costFast inferenceSimple tasks

Google

LLM

Gemini 2.5 Pro

Google

Premium Gemini with top-tier reasoning.

State-of-the-art reasoningAdvanced coding1M context

Google

LLM

Gemini 3 Flash Preview

Google

Next-gen Flash model preview.

Next-gen performanceFast inferenceImproved reasoning

DeepSeek

LLM

DeepSeek R1

DeepSeek

Reasoning model with chain-of-thought.

Chain of thoughtMath & scienceComplex reasoning

DeepSeek

LLM

DeepSeek Chat

DeepSeek

Fast and affordable general-purpose model.

Fast inferenceUltra-low costGeneral purpose

Zhipu (Z.ai)

LLM

GLM-5

Zhipu (Z.ai)

Z.ai flagship model with advanced reasoning.

Advanced reasoningMultilingualCode generation

Zhipu (Z.ai)

LLM

GLM-4.6V

Zhipu (Z.ai)

Multimodal model with vision support.

Vision + textMultimodalGeneral purpose

Kimi (Moonshot)

LLM

Kimi K2.5

Kimi (Moonshot)

MoE model with 1T parameters.

1T parameters (MoE)Strong reasoningCode generation

Mostrando 159 de 159 modelos

Categoria

Todos os Modelos

Seedream 4.5

Seedream 5.0 Lite

Seedream 4.0

Seedream 3.0

WAN 2.1

WAN 2.5

WAN 2.6

WAN 2.2 Text-to-Video

Flux Dev

Flux Schnell

Sora 2

Sora 2 Pro

Google Veo 3.1

Google Veo 3.1 Fast

Google Veo 3

Google Veo 3 Fast

Google Veo 2

Seedance 1.5 Pro

Seedance 1.0 Pro

Seedance 1.0 Pro Fast

Seedance 1.0 Lite

Vidu Q2

Vidu Q1

Kling v1

Kling v1.5

Kling v1.6

Kling O1

Kling v2 Master

Kling v2.1

Kling v2.1 Master

Kling v2.5 Turbo

Kling v2.6

Kling v3.0 Standard

Kling v3.0 Standard

Kling v3.0 Pro

Kling v3.0 Pro

Kling Lipsync

Kling v1 Image-to-Video

Kling v2 Master Image-to-Video

Hailuo 2.3

Z-Image

GPT Image

GPT Image

GPT Image 1.5

GPT Image 1.5

GPT Image 1 Mini

Grok Imagine Image

Grok Imagine Image

Nano Banana Pro

Nano Banana 2

Nano Banana Pro

Nano Banana 2

Nano Banana

Nano Banana

Qwen Image

Qwen Image

WAN 2.2

WAN 2.5

WAN 2.5

Flux Dev Lora

Flux Kontext Pro

Flux Kontext Pro

Flux Kontext Max

Flux Kontext Max

Flux 2

Flux 2

Flux 2 Pro

Flux 2 Pro

Flux 2 Flex

Flux 2 Flex

Seedream 4.0

Seedream 4.5

Seedream 5.0 Lite

SeedEdit 3.0

Seedream 4.0 Edit Sequential

Seedream 4.5 Edit Sequential

Seedream 5.0 Lite Edit Sequential

Seedream 5.0 Lite Edit Sequential