Gemini 1.5 Flash - In-Depth Overview

Google · Gemini

Status: Current

Model ID: google-gemini-1.5-flash

This page covers Gemini 1.5 Flash, an AI model from Google: its token limits, pricing structure, key capabilities (multimodal input, long context, and context caching), sample API code, required libraries, and benchmark results.

Key Metrics

Input Limit

1M tokens

Output Limit

8,192 tokens

Input Cost

$0.07 per 1M tokens

Output Cost

$0.30 per 1M tokens
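
As a rough illustration of how the per-million-token rates above translate into a per-request cost, the Python sketch below multiplies token counts by the listed prices. The helper name `estimate_cost_usd` and the example token counts are hypothetical, and the sketch assumes the flat rates shown on this page, ignoring any prompt-size tiers.

```python
# Rates copied from the Key Metrics on this page, in USD per 1M tokens.
INPUT_USD_PER_MILLION = 0.07
OUTPUT_USD_PER_MILLION = 0.30


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Example values only: a 100,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost_usd(100_000, 2_000):.4f}")
```

Keeping the rates in named constants makes it easy to update the estimate if the published prices change.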

Sample API Code

import google.generativeai as genai

# Replace the placeholder with your own API key.
genai.configure(api_key='YOUR_API_KEY')

# Load the model, send a single text prompt, and print the reply.
model = genai.GenerativeModel(model_name='gemini-1.5-flash-latest')
prompt_response = model.generate_content('Write a story about a magic backpack.')
print(prompt_response.text)

Required Libraries

google-generativeai (Python package, installed with pip)

@google/generative-ai (Node.js package, installed with npm)

Benchmarks

Benchmark	Score	Source	Notes
lmarena text	1227	lmarena.ai	-
lmarena vision	1072	lmarena.ai	-

Notes

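Google describes Gemini 1.5 Flash as its fastest multimodal model, with strong performance on diverse, repetitive tasks and a 1 million token context window. Pricing is tiered by prompt size: prompts of up to 128k tokens are billed at one rate, and prompts longer than 128k tokens at a higher rate.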
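
The tier rule in the note above can be sketched in Python as follows. This is a minimal illustration and not Google's billing logic. It assumes that "128k" means 128,000 tokens and that the whole request is billed at the rate of the tier selected by prompt size. The "standard" rates are the ones listed on this page; the "long" rates are placeholder values, since this page does not list them. All function and constant names are hypothetical.

```python
# Assumption: "128k" is read as 128,000 tokens.
TIER_THRESHOLD_TOKENS = 128_000

# USD per 1M tokens. "standard" uses the rates listed on this page;
# "long" holds PLACEHOLDER values, not published prices.
RATES = {
    "standard": {"input": 0.07, "output": 0.30},
    "long": {"input": 0.14, "output": 0.60},  # hypothetical values
}


def pricing_tier(prompt_tokens: int) -> str:
    """Return the tier name for a prompt of the given size."""
    return "standard" if prompt_tokens <= TIER_THRESHOLD_TOKENS else "long"


def tiered_cost_usd(prompt_tokens: int, output_tokens: int, rates=RATES) -> float:
    """Estimate request cost, billing the whole request at the selected tier's rates."""
    tier = rates[pricing_tier(prompt_tokens)]
    total = prompt_tokens * tier["input"] + output_tokens * tier["output"]
    return total / 1_000_000


if __name__ == "__main__":
    for tokens in (50_000, 128_000, 500_000):
        print(tokens, pricing_tier(tokens), round(tiered_cost_usd(tokens, 1_000), 6))
```

Passing the rate table as a parameter lets the placeholder "long" rates be swapped for the published ones without touching the tier logic.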