Back to all models
Get all the details on GPT-4o Audio Preview, an AI model from OpenAI. This page covers its token limits, pricing structure, key capabilities such as advanced_reasoning, audio_input, audio_output, available API code samples, and performance strengths.
Key Metrics
Input Limit
No data tokens
Output Limit
No data tokens
Input Cost
$40.00/1M
Output Cost
$80.00/1M
Sample API Code
from openai import OpenAI
client = OpenAI()
response = client.chat.completions.create(
model="gpt-4o-audio-preview-2024-12-17",
messages=[
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Hello!"}
]
)
print(response.choices[0].message.content)
Required Libraries
openai
openai
Notes
Preview model capable of audio inputs and outputs. Handling audio input/output typically requires specific API methods like the Realtime API or dedicated audio endpoints.
Capabilities
advanced reasoning
audio input
audio output
function calling
json mode
multimodal input
vision
web browsing via tool
Supported Data Types
Input Types
text
audio
Output Types
text
audio
Strengths & Weaknesses
Exceptional at
audio input processing
audio output generation
multimodal understanding
Good at
general reasoning
instruction following
Additional Information
Latest Update
Dec 17, 2024
Knowledge Cutoff
Oct 1, 2023