All Models

159 models

Text to Image

Seedream 4.5

ByteDance

ByteDance's flagship text-to-image model with exceptional photorealism.

2048x2048, Photorealistic, High aesthetic
Text to Image

Seedream 5.0 Lite

ByteDance

Latest Seedream with 2K/3K resolution and PNG output support.

2K/3K Resolution, PNG Output, Multi-Reference I2I
Text to Image

Seedream 4.0

ByteDance

High-performance image generation with excellent quality.

1024x1024, Fast, Good quality
Text to Image

Seedream 3.0

ByteDance

Cost-effective image generation with good quality.

1024x1024, Fast, Cost-effective
Text to Video

WAN 2.1

Alibaba

Efficient video generation with good motion quality from Alibaba.

5s videos, Good motion, Fast generation
Text to Video

WAN 2.5

Alibaba

WAN 2.5 with improved quality and motion coherence.

5-10s videos, A/V sync, Higher quality
Text to Video

WAN 2.6

Alibaba

Latest WAN model with enhanced motion and longer durations.

5-15s videos, Enhanced motion, Higher quality
Text to Video

WAN 2.2 Text-to-Video

Alibaba

Efficient video generation with good motion.

5s videos, Good motion, Fast generation
Text to Image

Flux Dev

Black Forest Labs

High-quality image generation with excellent prompt following.

1024x1024, Fast generation, High quality
Text to Image

Flux Schnell

Black Forest Labs

Ultra-fast image generation for rapid prototyping.

1024x1024, Ultra fast, Good quality
Text to Video

Sora 2

OpenAI

OpenAI's flagship video generation model.

5-12s videos, Photorealistic, High quality
Text to Video

Sora 2 Pro

OpenAI

Premium Sora model with enhanced quality.

5-12s videos, Premium quality, Enhanced detail
Text to Video

Google Veo 3.1

Google

Google's flagship video generation model.

4-8s videos, High quality, Multiple durations
Text to Video

Google Veo 3.1 Fast

Google

Fast Google video generation.

4-8s videos, Fast generation, Good quality
Text to Video

Google Veo 3

Google

High-quality Google video generation.

4-8s videos, High quality, Realistic motion
Text to Video

Google Veo 3 Fast

Google

Fast Google video generation.

4-8s videos, Fast generation, Good quality
Text to Video

Google Veo 2

Google

Reliable Google video generation.

4-8s videos, Good quality, Reliable output
Text to Video

Seedance 1.5 Pro

ByteDance

Latest Seedance with enhanced motion and frame control.

4-12s videos, Enhanced motion, Start/end frame control
Text to Video

Seedance 1.0 Pro

ByteDance

Premium Seedance with enhanced quality.

2-12s videos, Premium quality, Enhanced detail
Text to Video

Seedance 1.0 Pro Fast

ByteDance

Pro quality with fast generation.

2-12s videos, Pro quality, Fast generation
Text to Video

Seedance 1.0 Lite

ByteDance

Lightweight Seedance for rapid generation.

5-10s videos, Fastest, Cost-effective
Text to Video

Vidu Q2

Vidu

High-quality video generation from Vidu.

2-8s videos, High quality, Great motion
Text to Video

Vidu Q1

Vidu

Fast and efficient 1080p video generation.

1080p output, Good quality, Fast generation
Text to Video

Kling v1

Kling

Original Kling model with reliable quality.

5-10s videos, Good quality, Reliable output
Text to Video

Kling v1.5

Kling

Improved Kling with better motion coherence.

5-10s videos, Improved quality, Better motion
Text to Video

Kling v1.6

Kling

Latest v1 series with enhanced quality.

5-10s videos, Enhanced quality, Better motion control
Image to Video

Kling O1

Kling

Reference-based video with multi-image support.

5-10s videos, Reference images, Elements mode
Text to Video

Kling v2 Master

Kling

Premium v2 with cinematic quality.

5-10s videos, Cinematic quality, Professional grade
Image to Video

Kling v2.1

Kling

Balanced speed and fidelity for v2 series.

5-10s videos, High quality, Better motion
Text to Video

Kling v2.1 Master

Kling

Premium v2.1 with highest quality.

5-10s videos, Premium quality, Best motion
Text to Video

Kling v2.5 Turbo

Kling

Fast generation with good quality.

5-10s videos, Fast generation, Good quality
Text to Video

Kling v2.6

Kling

Latest Kling with audio generation.

5-10s videos, Audio generation, Highest quality
Text to Video

Kling v3.0 Standard

Kling

Next-gen Kling with improved quality and audio.

3-15s videos, Audio generation, Improved motion
Image to Video

Kling v3.0 Standard

Kling

Next-gen Kling image-to-video with audio.

3-15s videos, Audio generation, Improved motion
Text to Video

Kling v3.0 Pro

Kling

Premium next-gen Kling with highest quality.

3-15s videos, Audio generation, Highest quality
Image to Video

Kling v3.0 Pro

Kling

Premium next-gen Kling image-to-video.

3-15s videos, Audio generation, Highest quality
Video to Video

Kling Lipsync

Kling

Add lip-synced speech to any video.

2-10s videos, Lip sync, TTS synthesis
Image to Video

Kling v1 Image-to-Video

Kling

Animate images with v1 quality.

5-10s videos, Image animation, Natural motion
Image to Video

Kling v2 Master Image-to-Video

Kling

Premium image animation with cinematic quality.

5-10s videos, Cinematic motion, Premium quality
Text to Video

Hailuo 2.3

MiniMax

High-quality video generation with natural motion.

6-10s videos, Natural motion, High quality
Text to Image

Z-Image

Alibaba

Fast and affordable text-to-image generation.

Fast, Low cost, Good quality
Text to Image

GPT Image

OpenAI

OpenAI's flagship image generation model.

1024x1024, Photorealistic, Great prompt following
Image to Image

GPT Image

OpenAI

OpenAI's image transformation model.

Image input, Photorealistic, Great prompt following
Text to Image

GPT Image 1.5

OpenAI

OpenAI's latest image model with enhanced capabilities.

1024x1024, Auto size, Transparent backgrounds
Image to Image

GPT Image 1.5

OpenAI

OpenAI's latest image editing model.

Image input, Logo preservation, Face preservation
Image to Image

GPT Image 1 Mini

OpenAI

OpenAI's budget-friendly image editing model.

Image input, Low cost, Fast generation
Text to Image

Grok Imagine Image

xAI

xAI's creative and versatile image generation model.

Up to 10 images, Multiple aspect ratios, Quality settings
Image to Image

Grok Imagine Image

xAI

xAI's creative image editing model.

Image input, Creative editing, Up to 10 outputs
Text to Image

Nano Banana Pro

Vertex

Strongest prompt-following model with exceptional detail.

4K support, Best prompt following, High detail
Text to Image

Nano Banana 2

Vertex

Pro quality at Flash speed with 4K and PNG support.

4K support, Sub-second speed, Character consistency
Image to Image

Nano Banana Pro

Vertex

Image transformation with strongest prompt following.

Image input, Best prompt following, High detail
Image to Image

Nano Banana 2

Vertex

Image transformation at Flash speed with multi-reference support.

Image input, Multi-Reference (up to 14), 4K support
Text to Image

Nano Banana

Vertex

Ultra-high character consistency at affordable price.

Character consistency, Cost-effective, Good quality
Image to Image

Nano Banana

Vertex

Image transformation with character consistency.

Image input, Character consistency, Cost-effective
Text to Image

Qwen Image

Alibaba

Great at complex text rendering in images.

Text rendering, Complex prompts, High accuracy
Image to Image

Qwen Image

Alibaba

Image transformation with text rendering capability.

Image input, Text rendering, Complex prompts
Text to Image

WAN 2.2

Alibaba

Fast and creative image generation.

Fast, Creative, Low cost
Text to Image

WAN 2.5

Alibaba

Photorealism and creative control for image generation.

Photorealistic, Creative control, Multiple outputs
Image to Image

WAN 2.5

Alibaba

Photorealistic image transformation.

Image input, Photorealistic, Creative control
Text to Image

Flux Dev Lora

Black Forest Labs

General art and design with LoRA support.

LoRA support, Art generation, Flexible styles
Text to Image

Flux Kontext Pro

Black Forest Labs

Consistent, in-context image generation.

Context-aware, Consistent outputs, High quality
Image to Image

Flux Kontext Pro

Black Forest Labs

Context-aware image transformation.

Image input, Context-aware, Consistent outputs
Text to Image

Flux Kontext Max

Black Forest Labs

Maximum context for detailed scenes.

Max context, Complex scenes, Highest detail
Image to Image

Flux Kontext Max

Black Forest Labs

Maximum context for complex image edits.

Image input, Max context, Complex edits
Text to Image

Flux 2

Black Forest Labs

Next-gen Flux with improved quality.

Next-gen, Improved quality, Fast generation
Image to Image

Flux 2

Black Forest Labs

Next-gen image transformation.

Multiple image inputs, Next-gen quality, Fast
Text to Image

Flux 2 Pro

Black Forest Labs

Premium Flux 2 with highest quality.

Premium, Highest quality, Professional
Image to Image

Flux 2 Pro

Black Forest Labs

Premium image transformation.

Multiple image inputs, Premium quality, Professional
Text to Image

Flux 2 Flex

Black Forest Labs

Flexible generation at lower cost.

Flexible, Good quality, Cost-effective
Image to Image

Flux 2 Flex

Black Forest Labs

Flexible image transformation.

Multiple image inputs, Flexible, Cost-effective
Image to Image

Seedream 4.0

ByteDance

Image transformation with cohesive styles.

Image input, Cohesive styles, Fast
Image to Image

Seedream 4.5

ByteDance

Image transformation with enhanced quality.

Image input, Enhanced quality, High detail
Image to Image

Seedream 5.0 Lite

ByteDance

Image transformation with 2K/3K and multi-reference input.

Image input, 2K/3K Resolution, Multi-Reference (2-14)
Image to Image

SeedEdit 3.0

ByteDance

Advanced image editing with precise control.

Image editing, Precise control, High quality
Image to Image

Seedream 4.0 Edit Sequential

ByteDance

Batch edit multiple images with consistent style.

Multi-image input, Batch output, Consistent style
Image to Image

Seedream 4.5 Edit Sequential

ByteDance

Premium batch editing with enhanced quality.

Multi-image input, Batch output, Enhanced quality
Text to Image

Seedream 5.0 Lite Edit Sequential

ByteDance

Batch text-to-image with 2K/3K and PNG support.

Batch output (up to 15), 2K/3K Resolution, PNG Output
Image to Image

Seedream 5.0 Lite Edit Sequential

ByteDance

Batch image generation with 2K/3K and PNG support.

Multi-image input (up to 14), Batch output (up to 15), 2K/3K Resolution
Image to Video

Hailuo 2.3 Fast

MiniMax

Fast image-to-video with natural motion.

6-10s videos, Image input, Fast generation
Image to Video

Hailuo 2.3

MiniMax

High-quality image-to-video animation.

6-10s videos, Image input, High quality
Text to Video

Hailuo 2.3 Pro

MiniMax

Premium text-to-video with enhanced quality.

6-10s videos, Premium quality, Enhanced detail
Text to Video

Hailuo 02

MiniMax

High-quality text-to-video with frame control.

6-10s videos, First/last frame, High quality
Image to Video

Hailuo 02 (First/Last Frame)

MiniMax

First/Last frame to video.

6-10s videos, First frame required, Last frame required
Text to Video

T2V-01

MiniMax

Affordable text-to-video generation.

6s videos, 720p, Cost-effective
Text to Video

T2V-01 Director

MiniMax

Text-to-video with camera control.

6s videos, Camera control, Cinematic shots
Image to Video

I2V-01

MiniMax

Image-to-video with natural motion.

6s videos, Image input, Multi-resolution
Image to Video

I2V-01 Director

MiniMax

Image-to-video with camera control.

6s videos, Image input, Camera control
Image to Video

I2V-01 Live

MiniMax

Live-action style image animation.

6s videos, Image input, Live-action style
Image to Video

S2V-01

MiniMax

Subject-driven video generation.

6s videos, Subject reference, Identity preservation
Text to Music

Music 2.0

MiniMax

AI music generation from text prompts and lyrics.

Text-to-music, Lyrics support, Style control
Text to Music

Music 2.5

MiniMax

Next-gen AI music generation with high-fidelity audio.

Text-to-music, Lyrics support, High-fidelity audio
Text to Audio

Speech 2.6 HD

MiniMax

Latest HD TTS with outstanding prosody.

40 languages, 300+ voices, Voice cloning
Text to Audio

Speech 2.6 Turbo

MiniMax

Fast TTS with support for 40 languages.

40 languages, 300+ voices, Fast generation
Text to Audio

Speech 02 HD

MiniMax

Superior rhythm and voice quality.

Superior rhythm, High stability, Voice cloning
Text to Audio

Speech 02 Turbo

MiniMax

Fast TTS with enhanced multilingual support.

Enhanced multilingual, Fast generation, Good rhythm
Image to Video

WAN 2.5

Alibaba

Image-to-video with A/V sync.

5-10s videos, Image input, A/V sync
Image to Video

WAN 2.6

Alibaba

Image-to-video with enhanced motion.

5-15s videos, Image input, Enhanced motion
Video to Video

WAN 2.5 Video Extend

Alibaba

Extend videos with AI-generated continuation.

3-10s extension, Video input, Optional audio
Video to Video

ByteDance Video Upscaler

ByteDance

AI super-resolution video upscaling to 4K.

1080p/2K/4K output, Detail recovery, Temporal consistency
Video to Video

Seedance 1.5 Pro Video Extend

ByteDance

Extend videos with natural motion and stable aesthetics.

4-12s extension, Video input, 480p/720p
Video to Video

Seedance 1.5 Pro Video Extend Fast

ByteDance

Extend videos with natural motion continuation.

4-12s extension, Video input, 720p/1080p
Image to Video

Seedance 1.5 Pro

ByteDance

Latest image-to-video with enhanced motion.

4-12s videos, Image input, Enhanced motion
Image to Video

Seedance 1.0 Pro

ByteDance

Premium image-to-video animation.

2-12s videos, Image input, Premium quality
Image to Video

Seedance 1.0 Pro Fast

ByteDance

Pro quality image-to-video with fast generation.

2-12s videos, Image input, Pro quality
Image to Video

Seedance 1.0 Lite

ByteDance

Lightweight image-to-video with frame control.

5-10s videos, Image input, Start/end frames
Image to Video

Sora 2

OpenAI

OpenAI image-to-video with audio.

4-12s videos, Image input, Photorealistic
Image to Video

Sora 2 Pro

OpenAI

Premium OpenAI image-to-video.

4-12s videos, Image input, Premium quality
Image to Video

Google Veo 3.1

Google

Google image-to-video with frame support.

4-8s videos, Image input, High quality
Image to Video

Google Veo 3.1 Fast

Google

Fast Google image-to-video.

4-8s videos, Image input, Fast generation
Text to Video

Grok Imagine Video

xAI

xAI's creative and flexible video generation model.

1-15s videos, Flexible duration, Multiple aspect ratios
Image to Video

Grok Imagine Video

xAI

xAI's creative image-to-video animation.

1-15s videos, Image input, Flexible duration
Video to Video

Grok Imagine Video Edit

xAI

xAI's prompt-driven video editing model.

Object manipulation, Style transfer, Scene control
Image to Video

Runway Gen4 Turbo

Runway

High-quality image-to-video with cinematic motion.

5-10s videos, Image input, Cinematic motion
Video to Video

Runway Gen4 Aleph

Runway

Transform videos with natural language editing.

Video editing, Natural language, Style transfer
Video to Video

Runway Upscale V1

Runway

4X AI video upscaling with enhanced details.

4X upscaling, AI-enhanced detail, Temporal consistency
Image to Video

Vidu Q2

Vidu

Vidu image-to-video with audio.

4-8s videos, Image input, High quality
Text to Audio

Eleven V3

ElevenLabs

Most expressive TTS model with dramatic delivery.

70+ languages, Emotional delivery, Natural dialogue
Text to Audio

Eleven Multilingual V2

ElevenLabs

Most stable TTS model for professional content.

29 languages, Most stable, Long-form content
Text to Audio

Eleven Flash V2.5

ElevenLabs

Fastest TTS model with ultra-low latency.

32 languages, Ultra-fast (~75ms), 50% cheaper
Text to Audio

Eleven Turbo V2.5

ElevenLabs

Balanced TTS model with quality and speed.

32 languages, Low latency (~250ms), 50% cheaper
Text to Audio

Eleven Flash V2

ElevenLabs

Ultra-fast English-only TTS model.

English only, Ultra-fast (~75ms), 50% cheaper
Text to Audio

Eleven Turbo V2

ElevenLabs

High-quality English TTS with low latency.

English only, Low latency (~250ms), 50% cheaper
Audio to Audio

ElevenLabs Voice Changer

ElevenLabs

Transform voice recordings into different voices.

Voice conversion, Emotion preservation, Multilingual support
Video to Video

ElevenLabs Dubbing

ElevenLabs

AI-powered video/audio dubbing to translate content into 29+ languages.

29+ Languages, Voice Preservation, Emotion Retention
Video to Video

WAN 2.2 Animate

Alibaba

Animate characters or replace them in videos using motion transfer.

Motion Transfer, Character Animation, Expression Replication
Video to Video

Kling 2.6 Pro Motion Control

Kling

Transfer motion from reference videos to character images.

Motion Transfer, Character Animation, Pro Quality
Video to Video

Kling O1 Video Edit

Kling

Edit videos with natural language instructions.

Natural Language Editing, Object Removal, Style Transfer
Video Edit

Lucy Edit Dev

Lucy Edit AI

Fast AI video editing with text prompts.

Temporal Consistency, Object Modification, Style Transfer
Video Edit

Lucy Edit Pro

Lucy Edit AI

Professional AI video editing with resolution control.

Resolution Control, Enhanced Quality, Temporal Consistency
Video Edit

Lucy Restyle

Lucy Edit AI

AI video style transfer with motion preservation.

Style Transfer, Temporal Consistency, Motion Preservation
Reference to Video

Kling Video O1

Kling

Generate videos from character/scene reference images.

Multi-Reference, Identity Consistency, Subject Extraction
Reference to Video

Kling Video O1 Standard

Kling

Cost-effective reference-to-video generation.

Up to 10 References, Identity Consistency, Cost-Effective
Reference to Video

WAN 2.6 Reference-to-Video

Alibaba

Transform video references into new video shots.

Multi-View Reference, Identity Preservation, Smooth Motion
Speech to Text

OpenAI Whisper

OpenAI

Fast, accurate multilingual speech-to-text.

Multilingual, Auto Language Detection, Punctuation
Speech to Text

OpenAI Whisper with Video

OpenAI

Transcribe video audio with timestamps.

Video Input, Timestamps, Subtitle Segments
Speech to Text

OpenAI Whisper Turbo

OpenAI

Fastest Whisper model with great accuracy.

Turbo Speed, Multilingual, Auto Detection
Text to 3D

Hunyuan3D v3

Tencent

Generate detailed 3D models from text descriptions with texture support.

Text to 3D, PBR Materials, LowPoly Mode
Image to 3D

Hunyuan3D v3 Image

Tencent

Convert images to detailed 3D models with optional multi-view enhancement.

Single Image to 3D, Multi-view Support, PBR Materials
Image to 3D

Hunyuan3D v2.1

Tencent

Single image to 3D with PBR materials and high-fidelity textures.

Single Image to 3D, PBR Materials, 4K Textures
Image to 3D

Hunyuan3D v2 Base

Tencent

High-fidelity 3D from single image with 4K textures.

Single Image to 3D, 4K Textures, Fast Generation
Image to 3D

Hunyuan3D v2 Mini

Tencent

Fastest and most affordable image-to-3D model.

Single Image to 3D, Ultra Fast, Low Cost
Image to 3D

Hunyuan3D v2 Multi-View

Tencent

Best accuracy from 3 reference images.

Multi-View Input, 3 Required Images, High Accuracy
OpenAI
LLM

GPT-4o

OpenAI

OpenAI flagship multimodal model.

Text generation, Reasoning, Code generation
OpenAI
LLM

GPT-4.1

OpenAI

Latest GPT-4 with improved coding and reasoning.

Text generation, Advanced reasoning, Code generation
OpenAI
LLM

O3

OpenAI

Advanced reasoning model with chain-of-thought.

Deep reasoning, Chain of thought, Math & science
OpenAI
LLM

GPT-5

OpenAI

Most capable GPT model.

State-of-the-art reasoning, Creative writing, Code generation
Anthropic
LLM

Claude Sonnet 4.6

Anthropic

Latest Sonnet with near-Opus performance.

Extended thinking, Advanced coding, Agentic tasks
Anthropic
LLM

Claude Sonnet 4

Anthropic

Balanced model for coding and analysis.

Strong coding, Fast response, Creative writing
Anthropic
LLM

Claude 3.5 Haiku

Anthropic

Fastest Claude model, cost-efficient.

Ultra-fast, Cost-efficient, Good reasoning
Google
LLM

Gemini 2.5 Flash

Google

Fast and efficient with reasoning.

Fast inference, Thinking mode, Code generation
Google
LLM

Gemini 2.0 Flash

Google

Budget-friendly general purpose model.

Fast inference, Cost-efficient, General purpose
Google
LLM

Gemini 2.5 Flash Lite

Google

Most affordable Gemini model.

Ultra-low cost, Fast inference, Simple tasks
Google
LLM

Gemini 2.5 Pro

Google

Premium Gemini with top-tier reasoning.

State-of-the-art reasoning, Advanced coding, 1M context
Google
LLM

Gemini 3 Flash Preview

Google

Next-gen Flash model preview.

Next-gen performance, Fast inference, Improved reasoning
DeepSeek
LLM

DeepSeek R1

DeepSeek

Reasoning model with chain-of-thought.

Chain of thought, Math & science, Complex reasoning
DeepSeek
LLM

DeepSeek Chat

DeepSeek

Fast and affordable general-purpose model.

Fast inference, Ultra-low cost, General purpose
Zhipu (Z.ai)
LLM

GLM-5

Zhipu (Z.ai)

Z.ai flagship model with advanced reasoning.

Advanced reasoning, Multilingual, Code generation
Zhipu (Z.ai)
LLM

GLM-4.6V

Zhipu (Z.ai)

Multimodal model with vision support.

Vision + text, Multimodal, General purpose
Kimi (Moonshot)
LLM

Kimi K2.5

Kimi (Moonshot)

MoE model with 1T parameters.

1T parameters (MoE), Strong reasoning, Code generation