Logo
Back to all models

GPT-4o Audio Preview - In-Depth Overview

OpenAI · GPT-4o

preview

Get all the details on GPT-4o Audio Preview, an AI model from OpenAI. This page covers its token limits, pricing structure, key capabilities such as advanced_reasoning, audio_input, audio_output, available API code samples, and performance strengths.

Key Metrics

Input Limit

No data tokens

Output Limit

No data tokens

Input Cost

$40.00/1M

Output Cost

$80.00/1M

Sample API Code

from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
  model="gpt-4o-audio-preview-2024-12-17",
  messages=[
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"}
  ]
)

print(response.choices[0].message.content)

Required Libraries

openai
openai

Notes

Preview model capable of audio inputs and outputs. Handling audio input/output typically requires specific API methods like the Realtime API or dedicated audio endpoints.

Capabilities

advanced reasoning
audio input
audio output
function calling
json mode
multimodal input
vision
web browsing via tool

Supported Data Types

Input Types

text
audio

Output Types

text
audio

Strengths & Weaknesses

Exceptional at

audio input processing
audio output generation
multimodal understanding

Good at

general reasoning
instruction following

Additional Information

Latest Update

Dec 17, 2024

Knowledge Cutoff

Oct 1, 2023