Logo
Back to all models

GPT-4o - In-Depth Overview

OpenAI · GPT-4o

outdated
Flagship

Model ID: gpt-4o-2024-08-06

Get all the details on GPT-4o, an AI model from OpenAI. This page covers its token limits, pricing structure, key capabilities such as multimodal_input, long_context, function_calling, available API code samples, and performance strengths.

Key Metrics

Input Limit

128K tokens

Output Limit

16.4K tokens

Input Cost

$2.50/1M

Output Cost

$10.00/1M

Sample API Code

from openai import OpenAI
client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o-2024-08-06",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello, how are you?"}
    ]
)
print(response.choices[0].message.content)

Required Libraries

openai
openai

Benchmarks

BenchmarkScoreSourceNotes
1262
OpenLLM Leaderboard-
1126
OpenLLM Leaderboard-
1000
OpenLLM LeaderboardScore for 'api-gpt-4o-search' which represents GPT-4o with search capabilities.
53.95
LiveBench-
39.75
LiveBench-
69.29
LiveBench-
41.48
LiveBench-
63.53
LiveBench-
44.68
LiveBench-
64.94
LiveBench-
72.08%
Vellum Leaderboard-
13.4%
Vellum Leaderboard-
56.1%
Vellum Leaderboard-
31%
Vellum Leaderboard-
60.3%
Vellum Leaderboard-
27.1%
Vellum Leaderboard-

Notes

GPT-4o ('o' for 'omni') is a versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is considered the best model for most tasks and is highly capable. This is a specific snapshot version of the model.

Supported Data Types

Input Types

text
image

Output Types

text
json

Strengths & Weaknesses

Exceptional at

general reasoning
multimodal understanding
complex reasoning
instruction following

Good at

code generation
vision tasks
structured outputs
function calling

Additional Information

Latest Update

Aug 6, 2024

Knowledge Cutoff

Oct 1, 2023