Logo
Back to all models

GPT-4o Audio Preview - In-Depth Overview

OpenAI · GPT-4o

outdated
Flagship

Get all the details on GPT-4o Audio Preview, an AI model from OpenAI. This page covers its token limits, pricing structure, key capabilities such as advanced_reasoning, function_calling, audio_input, available API code samples, and performance strengths.

Key Metrics

Input Limit

No data tokens

Output Limit

No data tokens

Input Cost

$40.00/1M

Output Cost

$80.00/1M

Sample API Code

from openai import OpenAI
client = OpenAI()
speech_file_path = "speech.mp3"
response = client.audio.speech.create(
  model="gpt-4o-audio-preview",
  voice="alloy",
  input="The quick brown fox jumped over the lazy dog."
)
response.stream_to_file(speech_file_path)

Required Libraries

openai
openai

Notes

A GPT-4o model capable of audio inputs and outputs. Information based on the gpt-4o-audio-preview-2024-12-17 version.

Capabilities

advanced reasoning
function calling
audio input
audio output
multimodal input

Supported Data Types

Input Types

audio

Output Types

audio

Strengths & Weaknesses

Exceptional at

audio input processing
audio output generation
realtime audio interaction

Additional Information

Latest Update

Dec 17, 2024

Knowledge Cutoff

Oct 1, 2023