Back to all models
Get all the details on GPT-4o, an AI model from OpenAI. This page covers its token limits, pricing structure, key capabilities such as multimodal_input, long_context, function_calling, available API code samples, and performance strengths.
Key Metrics
Input Limit
128K tokens
Output Limit
16.4K tokens
Input Cost
$2.50/1M
Output Cost
$10.00/1M
Sample API Code
from openai import OpenAI
client = OpenAI()
response = client.chat.completions.create(
model="gpt-4o-2024-08-06",
messages=[
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Hello, how are you?"}
]
)
print(response.choices[0].message.content)
Required Libraries
openai
openai
Benchmarks
Benchmark | Score | Source | Notes |
---|---|---|---|
1262 | OpenLLM Leaderboard | - | |
1126 | OpenLLM Leaderboard | - | |
1000 | OpenLLM Leaderboard | Score for 'api-gpt-4o-search' which represents GPT-4o with search capabilities. | |
53.95 | LiveBench | - | |
39.75 | LiveBench | - | |
69.29 | LiveBench | - | |
41.48 | LiveBench | - | |
63.53 | LiveBench | - | |
44.68 | LiveBench | - | |
64.94 | LiveBench | - | |
72.08% | Vellum Leaderboard | - | |
13.4% | Vellum Leaderboard | - | |
56.1% | Vellum Leaderboard | - | |
31% | Vellum Leaderboard | - | |
60.3% | Vellum Leaderboard | - | |
27.1% | Vellum Leaderboard | - |
Notes
GPT-4o ('o' for 'omni') is a versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is considered the best model for most tasks and is highly capable. This is a specific snapshot version of the model.
Supported Data Types
Input Types
text
image
Output Types
text
json
Strengths & Weaknesses
Exceptional at
general reasoning
multimodal understanding
complex reasoning
instruction following
Good at
code generation
vision tasks
structured outputs
function calling
Additional Information
Latest Update
Aug 6, 2024
Knowledge Cutoff
Oct 1, 2023