Details on o3, an AI model from OpenAI. This page covers its token limits, pricing, key capabilities (multimodal input, long context, function calling), sample API code, and performance strengths.
Key Metrics
Input Limit
200K tokens
Output Limit
100K tokens
Input Cost
$10.00/1M
Output Cost
$40.00/1M
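The limits and prices listed above translate into a per-request cost with simple arithmetic. The sketch below is illustrative only: the helper name `estimate_cost` and the constant names are assumptions, and the figures are copied from this page as a snapshot, so actual billing may differ.

```python
# Figures copied from the Key Metrics on this page; treat them as a snapshot.
INPUT_LIMIT_TOKENS = 200_000
OUTPUT_LIMIT_TOKENS = 100_000
INPUT_COST_PER_MILLION = 10.00   # USD per 1M input tokens
OUTPUT_COST_PER_MILLION = 40.00  # USD per 1M output tokens


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens > INPUT_LIMIT_TOKENS:
        raise ValueError("input exceeds the listed 200K-token limit")
    if output_tokens > OUTPUT_LIMIT_TOKENS:
        raise ValueError("output exceeds the listed 100K-token limit")
    return (input_tokens * INPUT_COST_PER_MILLION
            + output_tokens * OUTPUT_COST_PER_MILLION) / 1_000_000


if __name__ == "__main__":
    print(f"${estimate_cost(50_000, 5_000):.2f}")
```

For example, a request with 50,000 input tokens and 5,000 output tokens comes to (50,000 × $10 + 5,000 × $40) / 1,000,000 = $0.70.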
Sample API Code
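from openai import OpenAI

# The client reads the API key from the OPENAI_API_KEY environment variable.
client = OpenAI()

response = client.chat.completions.create(
    model="o3",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello, how are you?"},
    ],
)

# Print the text of the first (and here only) completion choice.
print(response.choices[0].message.content)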
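Function calling is listed among the model's capabilities. As a minimal sketch of how an application might wire it up, the block below defines one tool in the Chat Completions function-tool format and a local dispatcher for the calls the model requests. The `get_weather` function, the `REGISTRY` mapping, and the `run_tool_call` helper are hypothetical names introduced for illustration; no network request is made here.

```python
import json


def get_weather(city: str) -> str:
    """Hypothetical local function the model could ask the application to run."""
    return f"Weather lookup for {city} is not implemented in this sketch."


# Tool definition in the Chat Completions "function" tool format.
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

# Maps tool names to the local callables that implement them.
REGISTRY = {"get_weather": get_weather}


def run_tool_call(name: str, arguments_json: str) -> str:
    """Decode the JSON arguments of one tool call and run the matching function."""
    if name not in REGISTRY:
        raise KeyError(f"unknown tool: {name}")
    kwargs = json.loads(arguments_json)
    return REGISTRY[name](**kwargs)
```

In use, `TOOLS` would be passed as `tools=TOOLS` to `client.chat.completions.create`. When the model requests a tool, each entry in `response.choices[0].message.tool_calls` carries a `function.name` and a `function.arguments` JSON string that can be handed to `run_tool_call`.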
Required Libraries
openai
Benchmarks
| Benchmark | Score | Source | Notes |
|---|---|---|---|
| | 1409 | OpenLLM Leaderboard | Score for o3-2025-04-16 |
| | 1190 | OpenLLM Leaderboard | Score for o3-2025-04-16 |
| | 1303 | OpenLLM Leaderboard | Score for o3-2025-04-16 |
| | 80.71 | LiveBench | Score for o3 High |
| | 93.33 | LiveBench | Score for o3 High |
| | 76.71 | LiveBench | Score for o3 High |
| | 85.00 | LiveBench | Score for o3 High |
| | 67.02 | LiveBench | Score for o3 High |
| | 76.00 | LiveBench | Score for o3 High |
| | 86.17 | LiveBench | Score for o3 High |
| | 91.6% | Vellum | - |
| | 83.3% | Vellum | - |
| | 69.1% | Vellum | - |
| | 20.32 | Vellum | - |
Notes
o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following. Use it to think through multi-step problems that involve analysis across text, code, and images.
Supported Data Types
Input Types
text
image
Output Types
text
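Since the model accepts both text and image input, a single user message can combine the two. The sketch below builds such a message in the Chat Completions content-parts format, embedding the image as a base64 data URL. The helper name `build_image_message` and its parameters are assumptions made for illustration; it only constructs the payload and sends nothing.

```python
import base64


def build_image_message(prompt: str, image_bytes: bytes,
                        mime_type: str = "image/png") -> dict:
    """Build one user message that pairs a text prompt with an inline image."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {
                "type": "image_url",
                # The image travels inline as a data URL rather than a web link.
                "image_url": {"url": f"data:{mime_type};base64,{encoded}"},
            },
        ],
    }
```

The returned dictionary can be placed in the `messages` list of a `client.chat.completions.create` call with `model="o3"`; the response is text, matching the output type listed above.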
Strengths & Weaknesses
Exceptional at
complex reasoning
math
science
code generation
visual reasoning
technical writing
instruction following
multimodal understanding
Good at
general reasoning
Additional Information
Latest Update
Apr 16, 2025
Knowledge Cutoff
Jun 1, 2024