GPT-4o Transcribe

OpenAI · GPT-4o

available

Latest in family

Key Metrics

Input Limit

16K tokens

Output Limit

2K tokens

Input Cost

$2.50/1M tokens

Output Cost

$10.00/1M tokens
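
To make the figures above concrete, here is a rough cost estimate for a single request. It is only a sketch: it reads "16K" and "2K" as 16,000 and 2,000 tokens, assumes the listed rates are in US dollars per million tokens and apply uniformly to every input and output token, and the names `estimate_cost` and the constants are illustrative, not part of any official tooling.

```python
from decimal import Decimal

# Figures copied from the Key Metrics above. Reading "16K" and "2K" as
# 16,000 and 2,000 tokens is an assumption, as is the USD currency.
INPUT_LIMIT_TOKENS = 16_000
OUTPUT_LIMIT_TOKENS = 2_000
INPUT_COST_PER_MILLION = Decimal("2.50")
OUTPUT_COST_PER_MILLION = Decimal("10.00")
ONE_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost of one request at the listed rates."""
    if not 0 <= input_tokens <= INPUT_LIMIT_TOKENS:
        raise ValueError(f"input_tokens must be between 0 and {INPUT_LIMIT_TOKENS}")
    if not 0 <= output_tokens <= OUTPUT_LIMIT_TOKENS:
        raise ValueError(f"output_tokens must be between 0 and {OUTPUT_LIMIT_TOKENS}")
    # Decimal keeps the currency arithmetic exact instead of using floats.
    input_cost = INPUT_COST_PER_MILLION * input_tokens / ONE_MILLION
    output_cost = OUTPUT_COST_PER_MILLION * output_tokens / ONE_MILLION
    return input_cost + output_cost
```

Under these assumptions, a request that uses both limits in full (16,000 input tokens and 2,000 output tokens) comes to $0.06.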

Sample API Code

Sample Python code not available from this page.
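
As a stand-in, the following is a minimal sketch of how a transcription call might look. It assumes the official `openai` Python package (version 1 or later), an `OPENAI_API_KEY` environment variable, and that the model is addressed as `gpt-4o-transcribe`; none of this comes from the page itself. The helper names `make_client` and `transcribe` are hypothetical.

```python
from pathlib import Path

MODEL_NAME = "gpt-4o-transcribe"  # assumed API identifier for this model


def make_client():
    """Build an SDK client; the key is read from OPENAI_API_KEY."""
    from openai import OpenAI  # third-party package: pip install openai
    return OpenAI()


def transcribe(audio_path, client):
    """Upload one audio file and return the transcript as plain text."""
    with Path(audio_path).open("rb") as audio_file:
        result = client.audio.transcriptions.create(
            model=MODEL_NAME,
            file=audio_file,
        )
    # With the default response format the result exposes a .text field.
    return result.text
```

With a valid key in place, `transcribe("meeting.mp3", make_client())` would return the transcript as a string; `meeting.mp3` is a placeholder file name. Passing the client in as an argument keeps the function easy to exercise with a stub instead of a live API call.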

Required Libraries

Notes

GPT-4o Transcribe is a speech-to-text model that uses GPT-4o to transcribe audio. Compared with the original Whisper models, it offers a lower word error rate, better language recognition, and higher accuracy. Use it when you need more accurate transcripts.

Capabilities

speech-to-text

audio transcription

language recognition

Supported Data Types

Input Types

audio

text

Output Types

text

Strengths & Weaknesses

Exceptional at

accurate transcription

reducing word error rate

language recognition

higher accuracy than the original Whisper models

Good at

speech-to-text conversion

Additional Information

Latest Update

Jun 1, 2024

Knowledge Cutoff

2024-06-01

Similar Models

babbage-002

OpenAI

available

ChatGPT-4o

OpenAI

available

computer-use-preview

OpenAI

Preview

Similar Capabilities

Multimodal input

13 models

Long context

13 models

Function calling

23 models