GPT-4o mini Realtime

OpenAI · GPT-4o mini Realtime

Preview

Latest in family

Key Metrics

Input Limit

128K tokens

Output Limit

4.1K tokens

Input Cost

$0.60/1M

Output Cost

$2.40/1M

Sample API Code

Refer to OpenAI Python library documentation.

Required Libraries

openai

Notes

Default. Smaller realtime model for text and audio inputs and outputs. This is a preview release of the GPT-4o-mini Realtime model, capable of responding to audio and text inputs in realtime over WebRTC or a WebSocket interface. Intelligence: Average. Speed: Very fast.

Capabilities

realtime interaction

text input

audio input

text output

audio output

function calling

Supported Data Types

Input Types

text

audio

Output Types

text

audio

Strengths & Weaknesses

Exceptional at

Realtime audio interaction

Realtime text interaction

Very fast response speed

Good at

Conversational AI

Function calling

General tasks at average intelligence

Poor at

Image processing (not supported)

Structured outputs (not supported)

Fine-tuning (not supported)

Distillation (not supported)

Predicted outputs (not supported)

Additional Information

Latest Update

Dec 17, 2024

Knowledge Cutoff

Oct 01, 2023

Similar Models

babbage-002

OpenAI

available

ChatGPT-4o

OpenAI

Available

computer-use-preview

OpenAI

Preview

Similar Capabilities

Multimodal input

13 models

Long context

13 models

Function calling

23 models