Key Metrics
Input Limit
N/A
Output Limit
N/A
Input Cost
$0.01/1M
Output Cost
N/A
Sample API Code
Not specified
Required Libraries
Not specified
Notes
whisper-1 is a general-purpose speech recognition model trained on a large dataset of diverse audio. It performs multilingual speech recognition, speech translation, and language identification. Use 'whisper-1' as the model ID in API requests. Rated performance: average. Rated speed: medium.
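The page lists no sample code, so here is a minimal sketch of how an audio file might be submitted to the model. It assumes the model is reached through OpenAI's hosted audio endpoints (/v1/audio/transcriptions and /v1/audio/translations) and that the default JSON response carries the result in a "text" field. The helper names build_audio_request and send are hypothetical and exist only for this illustration. Only the Python standard library is used.

```python
import json
import uuid
import urllib.request
from pathlib import Path

# Assumption: the model is served from OpenAI's hosted audio endpoints.
API_BASE = "https://api.openai.com/v1/audio"


def build_audio_request(audio_path, api_key, task="transcriptions", model="whisper-1"):
    """Build (but do not send) a multipart/form-data POST for the audio API.

    task="transcriptions" returns text in the spoken language;
    task="translations" returns English text.
    """
    if task not in ("transcriptions", "translations"):
        raise ValueError(f"unknown task: {task!r}")

    path = Path(audio_path)
    boundary = uuid.uuid4().hex

    # One form field for the model ID, one for the raw audio bytes.
    model_part = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
    ).encode()
    file_part = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{path.name}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode() + path.read_bytes() + b"\r\n"
    closing = f"--{boundary}--\r\n".encode()

    return urllib.request.Request(
        f"{API_BASE}/{task}",
        data=model_part + file_part + closing,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
        method="POST",
    )


def send(request):
    """Send the request and return the recognized text.

    Requires network access and a valid API key.
    """
    with urllib.request.urlopen(request) as resp:
        return json.load(resp)["text"]
```

For example, send(build_audio_request("meeting.mp3", api_key)) would return a transcript, and passing task="translations" would request English output instead. The official openai Python package wraps the same endpoints (client.audio.transcriptions.create and client.audio.translations.create) and is likely the more convenient choice in practice.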
Capabilities
Speech recognition
Multilingual speech recognition
Speech translation
Language identification
Supported Data Types
Input Types
Audio
Output Types
Text
Strengths & Weaknesses
Exceptional at
general-purpose speech recognition
multilingual speech recognition
speech translation
language identification
Additional Information
Latest Update
Sep 1, 2021
Knowledge Cutoff