Key Metrics
Input Limit
16K tokens
Output Limit
2K tokens
Input Cost
$1.25/1M
Output Cost
$5.00/1M
Sample API Code
Required Libraries
Notes
Speech-to-text model powered by GPT-4o mini. It offers improvements to word error rate and better language recognition and accuracy compared to original Whisper models. Use it for more accurate transcripts.
Capabilities
speech-to-text
audio transcription
language recognition
Supported Data Types
Input Types
audio
text
Output Types
text
Strengths & Weaknesses
Exceptional at
accurate transcription
improved word error rate
better language recognition
more accurate than original Whisper models
Good at
high performance
fast speed
Additional Information
Latest Update
Jun 1, 2024
Knowledge Cutoff
2024-06-01