Logo
Back to all models

GPT-4.1 mini - In-Depth Overview

OpenAI · GPT-4.1

Current
Latest in family

Get all the details on GPT-4.1 mini, an AI model from OpenAI. This page covers its token limits, pricing structure, key capabilities such as fine_tuning, function_calling, long_context, available API code samples, and performance strengths.

Key Metrics

Input Limit

1.0M tokens

Output Limit

32.8K tokens

Input Cost

$0.40/1M

Output Cost

$1.60/1M

Sample API Code

from openai import OpenAI
client = OpenAI()
response = client.chat.completions.create(
  model="gpt-4.1-mini",
  messages=[
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"}
  ]
)
print(response.choices[0].message.content)

Required Libraries

openai
openai

Benchmarks

BenchmarkScoreSourceNotes
1322
OpenLLM LeaderboardElo rating
1189
OpenLLM LeaderboardElo rating
1237
OpenLLM LeaderboardElo rating
961
OpenLLM LeaderboardElo rating
59.05
LiveBenchPercentage
53.78
LiveBenchPercentage
72.11
LiveBenchPercentage
58.78
LiveBenchPercentage
61.34
LiveBenchPercentage
38.00
LiveBenchPercentage
70.31
LiveBenchPercentage
49.6
VellumPercentage
65
VellumPercentage
23.6
VellumPercentage
34.7
VellumPercentage

Notes

Balanced for intelligence, speed, and cost. Provides a balance between intelligence, speed, and cost that makes it an attractive model for many use cases.

Capabilities

fine tuning
function calling
long context
multimodal input
streaming
structured outputs

Supported Data Types

Input Types

text
image

Output Types

text

Strengths & Weaknesses

Good at

balanced performance
intelligence
speed
cost efficiency
multimodal understanding
coding tasks
data analysis tasks

Poor at

language tasks

Additional Information

Latest Update

Apr 14, 2025

Knowledge Cutoff

Jun 1, 2024