Speech to Text

OpenAI Whisper

Fast, accurate multilingual speech-to-text.

模式
Audio: .wav/.mp3, ≤15 MB

Hint: You can drag and drop a file or click to upload

示例
Sample Transcription
{"text": "Loading sample transcription..."}

上方显示的是示例输出。提示词和设置已预填在左侧。

您的请求将花费 $0.000250 每次运行。 使用 $10 您可以运行此模型约 40000 次。

还有一件事:

README

OpenAI / OpenAI WhisperSpeech to Text (openai-whisper)

OpenAI Whisper Large v3 provides instant, accurate multilingual transcription with automatic language detection and punctuation. No cold starts, ready-to-use REST API.

亮点

  • Multilingual support Transcribe audio in 99+ languages.
  • Auto language detection No need to specify input language.
  • Accurate punctuation Automatically adds punctuation marks.
  • Fast inference No cold starts, instant responses.

参数

  • audio*Audio file to transcribe. Supports MP3, WAV, M4A, FLAC, OGG, and WebM formats.
  • languageLanguage code (e.g., "en", "es", "fr"). Auto-detected if not specified.

定价

$0.003000 每次生成

分辨率价格
Per minute$0.003

如何使用

  1. 1.Upload an audio file (MP3, WAV, M4A, etc.).
  2. 2.Optionally specify the language code.
  3. 3.Receive accurate transcription with punctuation.

专业技巧

  • For best results, use clear audio with minimal background noise.
  • Specifying the language can improve accuracy for specific languages.
  • Supports long-form audio content efficiently.

更多可试用的模型

Podcast transcription: Convert audio episodes to text.
Meeting notes: Transcribe meetings and calls.
Video subtitles: Generate subtitles from video audio.
Voice notes: Convert voice memos to text.

Frequently Asked Questions

What is the OpenAI Whisper API?
Fast, accurate multilingual speech-to-text.
How much does OpenAI Whisper cost via API?
OpenAI Whisper costs $0.0030 per generation through Renderful's API. No subscription required — pay only for what you use.
How do I use OpenAI Whisper via API?
Sign up for a free Renderful API key, then send a POST request to the /v1/predictions endpoint with model "openai-whisper". See the documentation at renderful.ai/docs for code examples in Python, JavaScript, and cURL.
What type of content does OpenAI Whisper generate?
OpenAI Whisper is a speech to text model by OpenAI. Key features include: Multilingual, Auto Language Detection, Punctuation.
Is the OpenAI Whisper API fast?
OpenAI Whisper has fast generation speed. Results are delivered via polling or webhook callback for seamless integration.