Back to all models
Get all the details on GPT-4o Audio, an AI model from OpenAI. This page covers its token limits, pricing structure, key capabilities such as function_calling, multimodal_input, streaming, available API code samples, and performance strengths.
Key Metrics
Input Limit
128K tokens
Output Limit
16.4K tokens
Input Cost
$2.50/1M
Output Cost
$10.00/1M
Sample API Code
from openai import OpenAI
client = OpenAI()
chat_completion = client.chat.completions.create(
messages=[
{
"role": "user",
"content": "Say this is a test",
}
],
model="gpt-4o-audio-preview",
)
print(chat_completion.choices[0].message.content)
Required Libraries
openai
openai
Notes
This is a preview release of the GPT-4o Audio models. These models accept audio inputs and outputs, and can be used in the Chat Completions REST API.
Capabilities
function calling
multimodal input
streaming
Supported Data Types
Input Types
text
audio
Output Types
text
audio
Strengths & Weaknesses
Exceptional at
multimodal audio processing
Additional Information
Latest Update
Dec 17, 2024
Knowledge Cutoff
Oct 1, 2023