Back to all models

Whisper

OpenAI · Whisper

Default
Latest in family

Key Metrics

Input Limit

N/A tokens

Output Limit

N/A tokens

Input Cost

$0.01/1M

Output Cost

N/A/1M

Sample API Code

Not specified

Required Libraries

Not specified
Not specified

Notes

General-purpose speech recognition model. Trained on a large dataset of diverse audio. Performs multilingual speech recognition, speech translation, and language identification. 'whisper-1' is the specific model ID. Performance: Average, Speed: Medium.

Capabilities

Speech recognition
Multilingual speech recognition
Speech translation
Language identification

Supported Data Types

Input Types

Audio

Output Types

Text

Strengths & Weaknesses

Exceptional at

general-purpose speech recognition
multilingual speech recognition
speech translation
language identification

Additional Information

Latest Update

Sep 1, 2021

Knowledge Cutoff