Back to all models

GPT-4o Realtime (2024-12-17)

OpenAI · GPT-4o Realtime

Available
Latest in family

Key Metrics

Input Limit

128K tokens

Output Limit

4.1K tokens

Input Cost

$5.00/1M

Output Cost

$20.00/1M

Sample API Code

Required Libraries

Notes

Snapshot version of GPT-4o Realtime, locked to behavior as of 2024-12-17. Inherits capabilities like realtime audio/text processing and function calling from the main GPT-4o Realtime model family.

Capabilities

realtime text processing
realtime audio processing
function calling
WebRTC interface support
WebSocket interface support

Supported Data Types

Input Types

text
audio

Output Types

text
audio

Strengths & Weaknesses

Exceptional at

realtime multimodal interaction (text and audio)
complex understanding

Good at

function calling
fast response times

Poor at

structured outputs (not supported)
fine-tuning (not supported)
distillation (not supported)
predicted outputs (not supported)
image input/output (not supported)

Additional Information

Latest Update

Dec 17, 2024

Knowledge Cutoff

Oct 01, 2023