GPT-4o Audio Preview - In-Depth Overview

OpenAI · GPT-4o

outdated

Flagship

Model ID: gpt-4o-audio-preview-2024-10-01

Get all the details on GPT-4o Audio Preview, an AI model from OpenAI. This page covers its token limits, pricing structure, key capabilities such as advanced_reasoning, function_calling, audio_input, available API code samples, and performance strengths.

Key Metrics

Input Limit

No data tokens

Output Limit

No data tokens

Input Cost

$40.00/1M

Output Cost

$80.00/1M

Sample API Code

from openai import OpenAI
client = OpenAI()
speech_file_path = "speech.mp3"
response = client.audio.speech.create(
  model="gpt-4o-audio-preview",
  voice="alloy",
  input="The quick brown fox jumped over the lazy dog."
)
response.stream_to_file(speech_file_path)