GPT-4o - In-Depth Overview

OpenAI · GPT-4o

Current
Flagship
Latest in family

Get all the details on GPT-4o, an AI model from OpenAI. This page covers its token limits, pricing, key capabilities (such as multimodal input, code interpretation via tool, and computer use via tool), sample API code, and performance strengths.

Key Metrics

Input Limit

128K tokens

Output Limit

16.4K tokens

Input Cost

$2.50 / 1M tokens

Output Cost

$10.00 / 1M tokens
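The limits and prices above can be turned into a quick per-request estimate. The following is a minimal sketch, not an official calculator: the helper names `estimate_cost_usd` and `fits_limits` are hypothetical, and it assumes "128K" means 128,000 tokens and "16.4K" means 16,384 tokens. Treat the constants as a snapshot of this page rather than live pricing.

```python
# Figures copied from the Key Metrics on this page (a snapshot, not live pricing).
INPUT_LIMIT_TOKENS = 128_000      # "128K tokens", assumed to mean 128,000
OUTPUT_LIMIT_TOKENS = 16_384      # "16.4K tokens", assumed to mean 16,384
INPUT_USD_PER_MILLION = 2.50      # input cost per 1M tokens
OUTPUT_USD_PER_MILLION = 10.00    # output cost per 1M tokens


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


def fits_limits(input_tokens: int, output_tokens: int) -> bool:
    """Check both counts against the limits listed on this page."""
    return (input_tokens <= INPUT_LIMIT_TOKENS
            and output_tokens <= OUTPUT_LIMIT_TOKENS)
```

For instance, a request with 10,000 input tokens and 1,000 output tokens comes to $0.025 + $0.010 = $0.035 under these rates.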

Sample API Code

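from openai import OpenAI

# The client reads the API key from the OPENAI_API_KEY environment variable.
client = OpenAI()

# Send a system prompt and one user message to the dated GPT-4o snapshot.
response = client.chat.completions.create(
    model="gpt-4o-2024-11-20",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"},
    ],
)

# Print the text of the first (and here only) completion choice.
print(response.choices[0].message.content)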
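Since streaming is listed among the capabilities, the sample above could be adapted to receive the reply incrementally. This is a sketch under assumptions: `collect_text` and `stream_reply` are hypothetical helper names, and the `openai` import is deferred so the pure helper can be used without the package or an API key.

```python
from typing import Iterable, Iterator, Optional


def collect_text(deltas: Iterable[Optional[str]]) -> str:
    """Join streamed text fragments, skipping None or empty pieces."""
    return "".join(piece for piece in deltas if piece)


def stream_reply(prompt: str,
                 model: str = "gpt-4o-2024-11-20") -> Iterator[Optional[str]]:
    """Yield text deltas for one prompt (needs openai and OPENAI_API_KEY)."""
    from openai import OpenAI  # deferred so collect_text works without it

    client = OpenAI()
    stream = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        stream=True,  # ask for incremental chunks instead of one response
    )
    for chunk in stream:
        if chunk.choices:  # guard against chunks with no choices
            yield chunk.choices[0].delta.content


# Usage (makes a real API call):
# print(collect_text(stream_reply("Hello!")))
```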

Required Libraries

openai

Benchmarks

| Benchmark | Score | Source | Notes |
| | 0.52 | OpenLLM Leaderboard | Average Win Rate for chatgpt-4o-latest-20241120 |
| | 964 | OpenLLM Leaderboard | - |
| | 0.46 | OpenLLM Leaderboard | Average Win Rate for chatgpt-4o-latest-20241120 |
| | 1000 | OpenLLM Leaderboard | Score for api-gpt-4o-search |
| | 72.08% | Vellum LLM Leaderboard | - |
| | 13.4% | Vellum LLM Leaderboard | - |
| | 56.1% | Vellum LLM Leaderboard | - |
| | 31% | Vellum LLM Leaderboard | - |
| | 60.3% | Vellum LLM Leaderboard | - |
| | 27.1% | Vellum LLM Leaderboard | - |
| | 53.95 | LiveBench Leaderboard | - |
| | 39.75 | LiveBench Leaderboard | - |
| | 69.29 | LiveBench Leaderboard | - |
| | 41.48 | LiveBench Leaderboard | - |
| | 63.53 | LiveBench Leaderboard | - |
| | 44.68 | LiveBench Leaderboard | - |
| | 64.94 | LiveBench Leaderboard | - |

Notes

GPT-4o (“o” for “omni”) is OpenAI's versatile, high-intelligence flagship model. It accepts text and image inputs and produces text outputs, including Structured Outputs. OpenAI describes it as the best model for most tasks and as its most capable model outside the o-series.

Capabilities

multimodal input
code interpretation via tool
computer use via tool
function calling
json mode
streaming
fine tuning
advanced reasoning
vision
web browsing via tool
file search via tool
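To illustrate the function calling capability listed above, here is a minimal sketch of a Chat Completions request that offers the model one tool. The `get_weather` tool and the helper names are hypothetical and exist only for this example. The request is built as a plain dictionary so it can be inspected without calling the API.

```python
import json
from typing import Any, Dict

# Hypothetical tool definition, for illustration only.
GET_WEATHER_TOOL: Dict[str, Any] = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {  # JSON Schema describing the arguments
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}


def build_tool_request(question: str,
                       model: str = "gpt-4o-2024-11-20") -> Dict[str, Any]:
    """Build keyword arguments for chat.completions.create with one tool."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": question}],
        "tools": [GET_WEATHER_TOOL],
    }


def parse_tool_arguments(arguments_json: str) -> Dict[str, Any]:
    """Tool-call arguments arrive as a JSON string; decode them to a dict."""
    return json.loads(arguments_json)


# Usage (makes a real API call):
# from openai import OpenAI
# resp = OpenAI().chat.completions.create(**build_tool_request("Weather in Paris?"))
# for call in resp.choices[0].message.tool_calls or []:
#     print(call.function.name, parse_tool_arguments(call.function.arguments))
```

The model only proposes the call; running `get_weather` and sending its result back in a follow-up message is left to the application.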

Supported Data Types

Input Types

text
image
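As a sketch of how text and image inputs can be combined in one user message, the helper below embeds image bytes as a base64 data URL. The function name `image_message` is hypothetical, and the PNG default for `mime_type` is an assumption. The resulting dictionary can be placed in the `messages` list of a Chat Completions request.

```python
import base64
from typing import Any, Dict


def image_message(question: str, image_bytes: bytes,
                  mime_type: str = "image/png") -> Dict[str, Any]:
    """Build a user message holding a text part and an inline image part."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": question},
            {
                "type": "image_url",
                # The image travels inline as a data URL rather than a web link.
                "image_url": {"url": f"data:{mime_type};base64,{encoded}"},
            },
        ],
    }


# Usage (makes a real API call; "photo.png" is a placeholder path):
# from openai import OpenAI
# with open("photo.png", "rb") as fh:
#     msg = image_message("What is in this picture?", fh.read())
# resp = OpenAI().chat.completions.create(model="gpt-4o-2024-11-20", messages=[msg])
# print(resp.choices[0].message.content)
```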

Output Types

text
json
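For the JSON output type, one option is JSON mode, requested through the `response_format` parameter. The sketch below uses hypothetical helper names. It includes a system message that mentions JSON, since JSON mode expects the prompt itself to ask for JSON.

```python
import json
from typing import Any, Dict


def build_json_request(question: str,
                       model: str = "gpt-4o-2024-11-20") -> Dict[str, Any]:
    """Build keyword arguments for a JSON-mode chat completion."""
    return {
        "model": model,
        "messages": [
            # JSON mode expects the word "JSON" to appear in the conversation.
            {"role": "system", "content": "Reply with a single JSON object."},
            {"role": "user", "content": question},
        ],
        "response_format": {"type": "json_object"},
    }


def parse_json_reply(content: str) -> Dict[str, Any]:
    """Decode the reply text and confirm it is a JSON object."""
    data = json.loads(content)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    return data


# Usage (makes a real API call):
# from openai import OpenAI
# resp = OpenAI().chat.completions.create(**build_json_request("List three colors."))
# print(parse_json_reply(resp.choices[0].message.content))
```

JSON mode aims for syntactically valid JSON but does not enforce a particular schema, so validating the parsed fields remains the caller's job.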

Strengths & Weaknesses

Exceptional at

general reasoning
multimodal understanding
instruction following

Good at

code generation
vision tasks

Additional Information

Latest Update

Nov 20, 2024

Knowledge Cutoff

Oct 1, 2023