GPT-4.1 mini - In-Depth Overview

OpenAI · GPT-4.1

Current

Latest in family

Get all the details on GPT-4.1 mini, an AI model from OpenAI. This page covers its token limits, pricing, key capabilities such as fine-tuning, function calling, and long context, sample API code, and benchmark performance.

Key Metrics

Input Limit

1.0M tokens

Output Limit

32.8K tokens

Input Cost

$0.40/1M

Output Cost

$1.60/1M
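
The limits and prices above can be turned into a quick budget check. The following is a minimal sketch, assuming the rounded figures shown on this page and treating the input and output limits as independent; the helper names `fits_limits` and `estimate_cost_usd` are hypothetical and not part of any OpenAI library.

```python
# Figures copied from the Key Metrics above (rounded as shown on this page).
INPUT_LIMIT_TOKENS = 1_000_000   # "1.0M tokens"
OUTPUT_LIMIT_TOKENS = 32_800     # "32.8K tokens"
INPUT_USD_PER_1M = 0.40          # "$0.40/1M"
OUTPUT_USD_PER_1M = 1.60         # "$1.60/1M"


def fits_limits(input_tokens: int, output_tokens: int) -> bool:
    """Return True if both token counts are within the listed limits."""
    return (0 <= input_tokens <= INPUT_LIMIT_TOKENS
            and 0 <= output_tokens <= OUTPUT_LIMIT_TOKENS)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in US dollars."""
    return (input_tokens * INPUT_USD_PER_1M
            + output_tokens * OUTPUT_USD_PER_1M) / 1_000_000
```

Under these assumptions, a request with 200,000 input tokens and 10,000 output tokens fits the limits and costs roughly $0.096.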

Sample API Code

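from openai import OpenAI

# The client reads the API key from the OPENAI_API_KEY environment variable.
client = OpenAI()

# Send a system prompt and a user message to GPT-4.1 mini.
response = client.chat.completions.create(
    model="gpt-4.1-mini",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"},
    ],
)

# Print the text of the first returned choice.
print(response.choices[0].message.content)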

Required Libraries

openai

Benchmarks

Benchmark	Score	Source	Notes
lmarena text	1322	OpenLLM Leaderboard	Elo rating
lmarena webdev	1189	OpenLLM Leaderboard	Elo rating
lmarena vision	1237	OpenLLM Leaderboard	Elo rating
lmarena search	961	OpenLLM Leaderboard	Elo rating
livebench global average	59.05	LiveBench	Percentage
livebench reasoning average	53.78	LiveBench	Percentage
livebench coding average	72.11	LiveBench	Percentage
livebench mathematics average	58.78	LiveBench	Percentage
livebench data analysis average	61.34	LiveBench	Percentage
livebench language average	38.00	LiveBench	Percentage
livebench instruction following (IF) average	70.31	LiveBench	Percentage
aime 2024	49.6	Vellum	Percentage
gpqa	65	Vellum	Percentage
swe bench	23.6	Vellum	Percentage
aider polyglot	34.7	Vellum	Percentage

Notes

Balances intelligence, speed, and cost, which makes it an attractive model for many use cases.

Capabilities

fine tuning

function calling

long context

multimodal input

streaming

structured outputs
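
Function calling and streaming from the list above can be sketched with the same Chat Completions API used in the sample code. This is a minimal sketch, not a definitive implementation: the `get_weather` tool, its parameters, and the helper names `build_tool_request` and `collect_stream_text` are hypothetical. The helpers only build request arguments and read streamed chunks, so they can be tried without a network call.

```python
from typing import Any, Iterable


def build_tool_request(user_message: str) -> dict[str, Any]:
    """Build keyword arguments for client.chat.completions.create().

    The get_weather tool is a hypothetical example; replace it with a
    function your own application implements.
    """
    weather_tool = {
        "type": "function",
        "function": {
            "name": "get_weather",  # hypothetical tool name
            "description": "Look up the current weather for a city.",
            "parameters": {  # JSON Schema describing the arguments
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
    return {
        "model": "gpt-4.1-mini",
        "messages": [{"role": "user", "content": user_message}],
        "tools": [weather_tool],
    }


def collect_stream_text(chunks: Iterable[Any]) -> str:
    """Join the text deltas of a response created with stream=True."""
    parts = []
    for chunk in chunks:
        # Some chunks carry no choices or no text (e.g. role-only deltas).
        if chunk.choices and chunk.choices[0].delta.content:
            parts.append(chunk.choices[0].delta.content)
    return "".join(parts)
```

With a real client, `client.chat.completions.create(**build_tool_request("Weather in Paris?"))` returns any requested calls in `response.choices[0].message.tool_calls`, and passing `stream=True` to the same method yields chunks that `collect_stream_text` can join.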

Supported Data Types

Input Types

text

image

Output Types

text
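
Since the model accepts both text and image input, a single user message can mix the two. The following is a minimal sketch in the Chat Completions message format; the helper names `build_image_message` and `to_data_url` are hypothetical, and any URL passed in is a placeholder.

```python
import base64
from typing import Any


def build_image_message(prompt: str, image_url: str) -> dict[str, Any]:
    """Build one user message that combines text with an image reference."""
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {"type": "image_url", "image_url": {"url": image_url}},
        ],
    }


def to_data_url(image_bytes: bytes, mime_type: str = "image/png") -> str:
    """Encode local image bytes as a data URL, an alternative to a web URL."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"
```

The resulting dictionary can be placed in the `messages` list of the sample API call in place of the plain-text user message.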

Strengths & Weaknesses

Good at

balanced performance

intelligence

speed

cost efficiency

multimodal understanding

coding tasks

data analysis tasks

Poor at

language tasks

Additional Information

Latest Update

Apr 14, 2025

Knowledge Cutoff

Jun 1, 2024

Similar Models

Gemini 2.5 Flash Preview

Google

preview

Gemini 2.0 Flash

Google

Current

Similar Capabilities

audio understanding

3 models

grounding with search

1 model

image understanding

3 models