GPT-4o mini - In-Depth Overview

OpenAI · GPT-4o

Current

Latest in family

Model ID: gpt-4o-mini

Get all the details on GPT-4o mini, an AI model from OpenAI. This page covers its token limits, pricing structure, key capabilities such as multimodal_input, function_calling, structured_outputs, available API code samples, and performance strengths.

Key Metrics

Input Limit

128K tokens

Output Limit

16.4K tokens

Input Cost

$0.15/1M

Output Cost

$0.60/1M

Sample API Code

from openai import OpenAI;client = OpenAI();response = client.chat.completions.create(model="gpt-4o-mini",messages=[{"role": "system", "content": "You are a helpful assistant."},{"role": "user", "content": "Hello!"}]);print(response.choices[0].message.content)

Required Libraries

openai

Benchmarks

Benchmark	Score	Source	Notes
lmarena text	1269	OpenLLM Leaderboard	Rank 45
lmarena vision	1124	OpenLLM Leaderboard	Rank 27
lmarena search	961	OpenLLM Leaderboard	Rank 11 (api-gpt-4o-mini-search)
livebench global average	43.41	LiveBench	-
livebench reasoning average	25.64	LiveBench	-
livebench coding average	55.02	LiveBench	-
livebench mathematics average	38.05	LiveBench	-
livebench data analysis average	55.10	LiveBench	-
livebench language average	29.88	LiveBench	-
livebench if average	56.80	LiveBench	-
gpqa	79.7%	Vellum Leaderboard	-
aime 2024	87.3%	Vellum Leaderboard	-
swe bench	61%	Vellum Leaderboard	-
math 500	97.9%	Vellum Leaderboard	-
bfcl	65.12%	Vellum Leaderboard	-
alder polyglot	60.4%	Vellum Leaderboard	-
grind	50%	Vellum Leaderboard	-

Notes

GPT-4o mini (“o” for “omni”) is a fast, affordable small model for focused tasks. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is ideal for fine-tuning, and model outputs from a larger model like GPT-4o can be distilled to GPT-4o-mini to produce similar results at lower cost and latency.