Gemini 2.5 Flash Preview - In-Depth Overview

Google · Gemini

preview

Model ID: google-gemini-2.5-flash-preview

This page covers Gemini 2.5 Flash Preview, an AI model from Google: its token limits, pricing, key capabilities (multimodal input, long context, and thinking), sample API code, and benchmark results.

Key Metrics

Input Limit

1M tokens

Output Limit

No data

Input Cost

$0.15 per 1M tokens

Output Cost

$0.60 per 1M tokens
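To make the per-token prices concrete, here is a minimal sketch that estimates the cost of a single request. It assumes flat pricing at the rates listed above; the function name `estimate_cost` and the token counts are illustrative, and actual billing may differ.

```python
# Prices from the Key Metrics above, in USD per 1M tokens.
INPUT_COST_PER_M = 0.15
OUTPUT_COST_PER_M = 0.60


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper, flat pricing assumed)."""
    total = input_tokens * INPUT_COST_PER_M + output_tokens * OUTPUT_COST_PER_M
    return total / 1_000_000


# Example: a 200,000-token prompt with a 4,000-token reply.
print(f"${estimate_cost(200_000, 4_000):.4f}")
```

At these rates, input dominates the cost of long-context requests even though output tokens are four times more expensive each.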

Sample API Code

import google.generativeai as genai

# Configure API key (replace with your key)
genai.configure(api_key="YOUR_API_KEY")

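# Initialize the model.
# Note: the API may expect a dated preview name, such as
# 'gemini-2.5-flash-preview-04-17' (the variant cited under Benchmarks).
model = genai.GenerativeModel('gemini-2.5-flash-preview')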

# Generate content
response = model.generate_content("Write a short story about a robot learning to love.")

# Print the response
print(response.text)

Required Libraries

google-generativeai (Python, installed with pip)

@google/generative-ai (JavaScript, installed with npm)

Benchmarks

Benchmark	Score	Source	Notes
lmarena text	1394	lmarena	Score for gemini-2.5-flash-preview-04-17
lmarena webdev	1145	lmarena	Score for gemini-2.5-flash-preview-04-17
lmarena vision	1273	lmarena	Score for gemini-2.5-flash-preview-04-17
livebench global average	69.93	LiveBench	Score for Gemini 2.5 Flash Preview (2025-05-06)
livebench reasoning average	73.47	LiveBench	Score for Gemini 2.5 Flash Preview (2025-05-06)
livebench coding average	60.33	LiveBench	Score for Gemini 2.5 Flash Preview (2025-05-06)
livebench mathematics average	81.80	LiveBench	Score for Gemini 2.5 Flash Preview (2025-05-06)
livebench data analysis average	65.53	LiveBench	Score for Gemini 2.5 Flash Preview (2025-05-06)
livebench language average	59.43	LiveBench	Score for Gemini 2.5 Flash Preview (2025-05-06)
livebench instruction following (IF) average	79.02	LiveBench	Score for Gemini 2.5 Flash Preview (2025-05-06)
aime 2024	88	Vellum	Score from Vellum Leaderboard
gpqa	78.3	Vellum	Score from Vellum Leaderboard
aider polyglot	51.1	Vellum	Score from Vellum Leaderboard

Notes

Preview model. Google describes it as its first hybrid reasoning model, with a 1M-token context window and configurable thinking budgets. Preview models may change before becoming stable and have more restrictive rate limits. The free tier is subject to rate limits rather than specific monthly token caps.
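Because preview models have more restrictive rate limits, client code often needs to retry failed requests. The following is a minimal sketch of exponential backoff with jitter, not part of any Google SDK. The helper name `call_with_backoff` and its parameters are hypothetical, and it catches every exception for brevity; real code should catch only the rate-limit error raised by the client library in use.

```python
import random
import time


def call_with_backoff(request_fn, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call request_fn, retrying with exponential backoff plus jitter.

    Hypothetical helper: request_fn is any zero-argument callable,
    for example a lambda wrapping model.generate_content(...).
    """
    for attempt in range(max_attempts):
        try:
            return request_fn()
        except Exception:
            # Assumption: any exception is treated as retryable.
            if attempt == max_attempts - 1:
                raise  # Out of attempts: surface the last error.
            # Wait base_delay * 2^attempt, plus random jitter up to base_delay.
            delay = base_delay * (2 ** attempt) + random.uniform(0, base_delay)
            sleep(delay)
```

A call might look like `call_with_backoff(lambda: model.generate_content(prompt))`. The `sleep` parameter is injectable so the helper can be exercised in tests without actually waiting.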