GPT-4o mini Realtime - In-Depth Overview

OpenAI · GPT-4o

preview

Model ID: gpt-4o-mini-realtime-preview

GPT-4o mini Realtime is a preview AI model from OpenAI. This page covers its token limits, pricing, key capabilities (function calling, multimodal input, and multimodal output), sample API code, and performance strengths.

Key Metrics

Input Limit

128K tokens

Output Limit

4.1K tokens

Input Cost

$0.60 per 1M tokens

Output Cost

$2.40 per 1M tokens
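The input and output prices above turn into a per-request estimate with simple arithmetic. The sketch below is illustrative only: `estimate_cost` and its parameter names are hypothetical helpers, not part of any SDK, and it assumes the listed rates apply to every token in the request. Audio tokens may be billed at different rates, which this page does not list.

```python
# Rates copied from the Key Metrics above (USD per 1M tokens).
INPUT_USD_PER_MILLION = 0.60
OUTPUT_USD_PER_MILLION = 2.40


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: 10,000 input tokens and 1,000 output tokens.
    print(f"${estimate_cost(10_000, 1_000):.4f}")
```

Keeping the rates in named constants makes it easy to update the estimate if the published prices change.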

Sample API Code

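import asyncio
from openai import AsyncOpenAI  # pip install "openai[realtime]"

async def main():
    client = AsyncOpenAI()  # reads OPENAI_API_KEY from the environment
    # Realtime models use a WebSocket session, not Chat Completions.
    # Sketch assuming an SDK version that exposes client.beta.realtime.
    async with client.beta.realtime.connect(model="gpt-4o-mini-realtime-preview") as conn:
        await conn.session.update(session={"modalities": ["text"], "instructions": "You are a helpful assistant."})
        await conn.conversation.item.create(item={
            "type": "message", "role": "user",
            "content": [{"type": "input_text", "text": "Hello!"}],
        })
        await conn.response.create()  # ask the model to respond
        async for event in conn:
            if event.type == "response.text.delta":
                print(event.delta, end="", flush=True)  # stream text as it arrives
            elif event.type == "response.done":
                break

asyncio.run(main())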

Required Libraries

openai

Notes

A smaller realtime model that accepts text and audio input and produces text and audio output. It responds in real time over a WebRTC or WebSocket interface. This is a preview release.

Capabilities

function calling
multimodal input
multimodal output
realtime conversations
realtime transcription
voice activity detection
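Several of the capabilities above (function calling, voice activity detection) are typically switched on through session configuration. The following is a minimal sketch of building such a configuration message as plain data. The `session.update` event shape and the `server_vad` setting are assumptions based on the Realtime API beta, and `build_session_update` and the `get_weather` tool are hypothetical names used only for illustration.

```python
import json


def build_session_update(tools: list[dict]) -> dict:
    """Build a `session.update` client event (shape assumed, see lead-in)."""
    return {
        "type": "session.update",
        "session": {
            "modalities": ["text", "audio"],
            # Server-side voice activity detection decides when a turn ends.
            "turn_detection": {"type": "server_vad"},
            "tools": tools,
            "tool_choice": "auto",
        },
    }


# Hypothetical function tool, for illustration only.
get_weather_tool = {
    "type": "function",
    "name": "get_weather",
    "description": "Look up the current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}

if __name__ == "__main__":
    event = build_session_update([get_weather_tool])
    # The JSON string is what would be sent over the WebSocket connection.
    print(json.dumps(event, indent=2))
```

Building the event as a dictionary first keeps it easy to inspect and test before anything is sent over the network.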

Supported Data Types

Input Types

text
audio

Output Types

text
audio
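Audio input usually has to be serialized before it can travel inside a JSON message. The sketch below generates a short test tone and wraps it in an event, assuming the input format is 16-bit little-endian mono PCM at 24 kHz, sent base64-encoded in an `input_audio_buffer.append` event. Both the format and the event shape are assumptions drawn from the Realtime API beta rather than from this page, and the helper names are hypothetical.

```python
import base64
import math
import struct

SAMPLE_RATE_HZ = 24_000  # assumed sample rate for 16-bit PCM input


def sine_pcm16(freq_hz: float, seconds: float) -> bytes:
    """Generate a mono sine tone as little-endian 16-bit PCM bytes."""
    n = int(SAMPLE_RATE_HZ * seconds)
    samples = (
        int(32767 * 0.5 * math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE_HZ))
        for i in range(n)
    )
    return struct.pack(f"<{n}h", *samples)


def audio_append_event(pcm: bytes) -> dict:
    """Wrap raw PCM bytes in an `input_audio_buffer.append` event (assumed shape)."""
    return {
        "type": "input_audio_buffer.append",
        "audio": base64.b64encode(pcm).decode("ascii"),
    }


if __name__ == "__main__":
    event = audio_append_event(sine_pcm16(440.0, 0.5))
    print(event["type"], len(event["audio"]), "base64 characters")
```

In practice the bytes would come from a microphone or an audio file rather than a generated tone; the tone just keeps the example self-contained.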

Strengths & Weaknesses

Exceptional at

realtime text processing
realtime audio processing

Good at

fast response times

Additional Information

Latest Update

Dec 17, 2024

Knowledge Cutoff

Oct 1, 2023