Back to all models
Get all the details on GPT-4o Audio Preview, an AI model from OpenAI. This page covers its token limits, pricing structure, key capabilities such as advanced_reasoning, function_calling, audio_input, available API code samples, and performance strengths.
Key Metrics
Input Limit
No data tokens
Output Limit
No data tokens
Input Cost
$40.00/1M
Output Cost
$80.00/1M
Sample API Code
from openai import OpenAI
client = OpenAI()
speech_file_path = "speech.mp3"
response = client.audio.speech.create(
model="gpt-4o-audio-preview",
voice="alloy",
input="The quick brown fox jumped over the lazy dog."
)
response.stream_to_file(speech_file_path)
Required Libraries
openai
openai
Notes
A GPT-4o model capable of audio inputs and outputs. Information based on the gpt-4o-audio-preview-2024-12-17 version.
Capabilities
advanced reasoning
function calling
audio input
audio output
multimodal input
Supported Data Types
Input Types
audio
Output Types
audio
Strengths & Weaknesses
Exceptional at
audio input processing
audio output generation
realtime audio interaction
Additional Information
Latest Update
Dec 17, 2024
Knowledge Cutoff
Oct 1, 2023