Speech to Text
OpenAI Whisper
Fast, accurate multilingual speech-to-text.
Audio: .wav/.mp3, ≤15 MB
Hint: You can drag and drop a file or click to upload
示例
Sample Transcription
{"text": "Loading sample transcription..."}上方显示的是示例输出。提示词和设置已预填在左侧。
您的请求将花费 $0.000250 每次运行。 使用 $10 您可以运行此模型约 40000 次。
还有一件事:
相关模型
README
OpenAI / OpenAI Whisper — Speech to Text (openai-whisper)
OpenAI Whisper Large v3 provides instant, accurate multilingual transcription with automatic language detection and punctuation. No cold starts, ready-to-use REST API.
亮点
- •Multilingual support – Transcribe audio in 99+ languages.
- •Auto language detection – No need to specify input language.
- •Accurate punctuation – Automatically adds punctuation marks.
- •Fast inference – No cold starts, instant responses.
参数
- •audio* – Audio file to transcribe. Supports MP3, WAV, M4A, FLAC, OGG, and WebM formats.
- •language – Language code (e.g., "en", "es", "fr"). Auto-detected if not specified.
定价
$0.003000 每次生成
| 分辨率 | 价格 |
|---|---|
| Per minute | $0.003 |
如何使用
- 1.Upload an audio file (MP3, WAV, M4A, etc.).
- 2.Optionally specify the language code.
- 3.Receive accurate transcription with punctuation.
专业技巧
- •For best results, use clear audio with minimal background noise.
- •Specifying the language can improve accuracy for specific languages.
- •Supports long-form audio content efficiently.
更多可试用的模型
•Podcast transcription: Convert audio episodes to text.
•Meeting notes: Transcribe meetings and calls.
•Video subtitles: Generate subtitles from video audio.
•Voice notes: Convert voice memos to text.
Related Guides
Frequently Asked Questions
What is the OpenAI Whisper API?
Fast, accurate multilingual speech-to-text.
How much does OpenAI Whisper cost via API?
OpenAI Whisper costs $0.0030 per generation through Renderful's API. No subscription required — pay only for what you use.
How do I use OpenAI Whisper via API?
Sign up for a free Renderful API key, then send a POST request to the /v1/predictions endpoint with model "openai-whisper". See the documentation at renderful.ai/docs for code examples in Python, JavaScript, and cURL.
What type of content does OpenAI Whisper generate?
OpenAI Whisper is a speech to text model by OpenAI. Key features include: Multilingual, Auto Language Detection, Punctuation.
Is the OpenAI Whisper API fast?
OpenAI Whisper has fast generation speed. Results are delivered via polling or webhook callback for seamless integration.
