Back to all models
GPT-4o mini Realtime - In-Depth Overview
OpenAI · GPT-4o
preview
Model ID: gpt-4o-mini-realtime-preview
Get all the details on GPT-4o mini Realtime, an AI model from OpenAI. This page covers its token limits, pricing structure, key capabilities such as function_calling, multimodal_input, multimodal_output, available API code samples, and performance strengths.
Key Metrics
Input Limit
128K tokens
Output Limit
4.1K tokens
Input Cost
$0.60/1M
Output Cost
$2.40/1M
Sample API Code
from openai import OpenAI
client = OpenAI()
response = client.chat.completions.create(
model="gpt-4o-mini-realtime-preview",
messages=[
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)
Required Libraries
openai
openai
Notes
Smaller realtime model for text and audio inputs and outputs, capable of responding to audio and text inputs in realtime over WebRTC or a WebSocket interface. It is a preview release.
Capabilities
function calling
multimodal input
multimodal output
realtime conversations
realtime transcription
voice activity detection
Supported Data Types
Input Types
text
audio
Output Types
text
audio
Strengths & Weaknesses
Exceptional at
realtime text processing
realtime audio processing
Good at
fast response times
Additional Information
Latest Update
Dec 17, 2024
Knowledge Cutoff
Oct 1, 2023