Logo
Back to all models

Gemini 1.5 Flash - In-Depth Overview

Google · Gemini

Current

Get all the details on Gemini 1.5 Flash, an AI model from Google. This page covers its token limits, pricing structure, key capabilities such as context_caching, grounding_with_google_search, long_context, available API code samples, and performance strengths.

Key Metrics

Input Limit

1M tokens

Output Limit

1M tokens

Input Cost

$0.07/1M

Output Cost

$0.30/1M

Sample API Code

import google.generativeai as genai; genai.configure(api_key='YOUR_API_KEY'); model = genai.GenerativeModel(model_name='gemini-1.5-flash-latest'); prompt_response = model.generate_content('Write a story about a magic backpack.'); print(prompt_response.text)

Required Libraries

google-generativeai
@google/generative-ai

Benchmarks

BenchmarkScoreSourceNotes
1227
lmarena.ai-
1072
lmarena.ai-

Notes

Our fastest multimodal model with great performance for diverse, repetitive tasks and a 1 million token context window. Pricing is tiered based on prompt size (<= 128k tokens vs > 128k tokens).

Capabilities

context caching
grounding with google search
long context
multimodal input
tuning

Supported Data Types

Input Types

text
image
video
audio

Output Types

text

Strengths & Weaknesses

Exceptional at

long context processing
multimodal understanding
diverse tasks
repetitive tasks
speed

Additional Information

Latest Update

May 13, 2025

Knowledge Cutoff

No data