This page summarizes GPT-4o mini, an AI model from OpenAI: its token limits, pricing, key capabilities (multimodal input, function calling, and structured outputs), sample API code, and benchmark results.
Key Metrics
Input Limit
128K tokens
Output Limit
16.4K tokens
Input Cost
$0.15 per 1M tokens
Output Cost
$0.60 per 1M tokens
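The limits and prices above can be combined into a quick per-request cost estimate. The following is a minimal sketch, not an official calculator: `estimate_cost` is a hypothetical helper, and it assumes that "128K" means 128,000 tokens and "16.4K" means 16,384 tokens.

```python
# Figures taken from the Key Metrics above.
# Assumptions: 128K = 128,000 tokens and 16.4K = 16,384 tokens.
INPUT_LIMIT_TOKENS = 128_000
OUTPUT_LIMIT_TOKENS = 16_384
INPUT_COST_PER_MILLION = 0.15   # USD per 1M input tokens
OUTPUT_COST_PER_MILLION = 0.60  # USD per 1M output tokens


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens > INPUT_LIMIT_TOKENS:
        raise ValueError("input exceeds the 128K-token input limit")
    if output_tokens > OUTPUT_LIMIT_TOKENS:
        raise ValueError("output exceeds the 16.4K-token output limit")
    total = (
        input_tokens * INPUT_COST_PER_MILLION
        + output_tokens * OUTPUT_COST_PER_MILLION
    )
    return total / 1_000_000


# A request with 10,000 input tokens and 1,000 output tokens.
cost = estimate_cost(input_tokens=10_000, output_tokens=1_000)
print(f"${cost:.4f}")  # prints $0.0021
```

Because output tokens cost four times as much as input tokens, long completions tend to dominate the bill even when prompts are large.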
Sample API Code
from openai import OpenAI

client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"},
    ],
)
print(response.choices[0].message.content)
Required Libraries
openai
Benchmarks
| Benchmark | Score | Source | Notes |
|---|---|---|---|
| | 1269 | OpenLLM Leaderboard | Rank 45 |
| | 1124 | OpenLLM Leaderboard | Rank 27 |
| | 961 | OpenLLM Leaderboard | Rank 11 (api-gpt-4o-mini-search) |
| | 43.41 | LiveBench | - |
| | 25.64 | LiveBench | - |
| | 55.02 | LiveBench | - |
| | 38.05 | LiveBench | - |
| | 55.10 | LiveBench | - |
| | 29.88 | LiveBench | - |
| | 56.80 | LiveBench | - |
| | 79.7% | Vellum Leaderboard | - |
| | 87.3% | Vellum Leaderboard | - |
| | 61% | Vellum Leaderboard | - |
| | 97.9% | Vellum Leaderboard | - |
| | 65.12% | Vellum Leaderboard | - |
| | 60.4% | Vellum Leaderboard | - |
| | 50% | Vellum Leaderboard | - |
Notes
GPT-4o mini (“o” for “omni”) is a fast, affordable small model for focused tasks. It accepts text and image inputs and produces text outputs, including Structured Outputs. It is well suited to fine-tuning: outputs from a larger model such as GPT-4o can be distilled into GPT-4o mini to produce similar results at lower cost and latency.
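To illustrate the Structured Outputs mentioned above, here is a minimal sketch that builds a Chat Completions request with a JSON Schema `response_format` and validates the decoded reply. The ticket schema and the helpers `build_structured_request` and `parse_ticket` are hypothetical and exist only for illustration. The network call is left as a comment so the block runs without the `openai` package or an API key.

```python
import json

# Hypothetical schema for illustration; the field names are not from the source.
TICKET_SCHEMA = {
    "type": "object",
    "properties": {
        "title": {"type": "string"},
        "priority": {"type": "string", "enum": ["low", "medium", "high"]},
    },
    "required": ["title", "priority"],
    "additionalProperties": False,
}


def build_structured_request(user_text: str) -> dict:
    """Build keyword arguments for client.chat.completions.create()."""
    return {
        "model": "gpt-4o-mini",
        "messages": [
            {"role": "system", "content": "Extract a support ticket from the message."},
            {"role": "user", "content": user_text},
        ],
        "response_format": {
            "type": "json_schema",
            "json_schema": {"name": "ticket", "strict": True, "schema": TICKET_SCHEMA},
        },
    }


def parse_ticket(raw_content: str) -> dict:
    """Decode the JSON text from message.content and check required fields."""
    ticket = json.loads(raw_content)
    missing = [key for key in TICKET_SCHEMA["required"] if key not in ticket]
    if missing:
        raise ValueError(f"missing fields: {missing}")
    return ticket


request = build_structured_request("The login page crashes, please fix it urgently.")

# To send the request (needs the openai package and an API key):
#   from openai import OpenAI
#   response = OpenAI().chat.completions.create(**request)
#   ticket = parse_ticket(response.choices[0].message.content)
```

With `strict` set to true the reply is expected to conform to the schema, so the check in `parse_ticket` is only a defensive second layer.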
Supported Data Types
Input Types
text
image
Output Types
text
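Since the model accepts both text and image input, a single user message can carry both. The sketch below shows one way to build such a message for the Chat Completions API, embedding the image as a base64 data URL. `build_image_message` is a hypothetical helper, and the image bytes are a placeholder rather than a real picture.

```python
import base64


def build_image_message(prompt: str, image_bytes: bytes, mime_type: str = "image/png") -> dict:
    """Build a user message with one text part and one image part."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {
                "type": "image_url",
                "image_url": {"url": f"data:{mime_type};base64,{encoded}"},
            },
        ],
    }


# Placeholder bytes for illustration; in practice read a real image file,
# e.g. open("photo.png", "rb").read().
message = build_image_message("Describe this image.", b"placeholder image bytes")

# To send it (needs the openai package and an API key):
#   from openai import OpenAI
#   response = OpenAI().chat.completions.create(
#       model="gpt-4o-mini", messages=[message]
#   )
#   print(response.choices[0].message.content)
```

A publicly reachable `https://` URL can be passed in the `url` field instead of a data URL, which avoids the base64 step.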
Strengths & Weaknesses
Exceptional at
speed
affordability
fine-tuning
focused tasks
Good at
text generation
image understanding
structured outputs
function calling
Poor at
audio input
audio output
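Function calling, listed under "Good at" above, works by describing local functions to the model and then executing whichever call it requests. The following is a minimal sketch of the tool definition and the local dispatch step. `get_weather`, `LOCAL_FUNCTIONS`, and `run_tool_call` are hypothetical names, and the model's reply is simulated here so the block runs offline.

```python
import json


def get_weather(city: str) -> str:
    """Hypothetical local function the model is allowed to call."""
    return f"Sunny in {city}"


# Tool description in the format accepted by the `tools` parameter.
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

LOCAL_FUNCTIONS = {"get_weather": get_weather}


def run_tool_call(name: str, arguments_json: str) -> str:
    """Dispatch one tool call; the arguments arrive as a JSON string."""
    func = LOCAL_FUNCTIONS[name]
    return func(**json.loads(arguments_json))


# Simulated tool call. With the real API these values come from
# response.choices[0].message.tool_calls[i].function.name / .arguments
# after calling client.chat.completions.create(..., tools=TOOLS).
result = run_tool_call("get_weather", '{"city": "Paris"}')
print(result)  # prints Sunny in Paris
```

In a full exchange, the result would be sent back to the model in a message with role `tool` and the matching `tool_call_id` so it can compose the final answer.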
Additional Information
Latest Update
Jul 18, 2024
Knowledge Cutoff
Oct 1, 2023