caching

Available in 6 models across 1 provider

Providers

Google

Models with this Capability

gemini-2.5-pro-preview-05-06

Google · Gemini 2.5
Status: preview
Input: 1.0M tokens
Output: 65.5K tokens
Input Cost: $1.25/1M tokens
Output Cost: $10.00/1M tokens

Exceptional at:

Complex reasoning
Multimodal
+1
Multimodal input
Long context
Function calling
+6

gemini-2.0-flash

Google · Gemini 2.0
Status: GA
Input: 1.0M tokens
Output: 8.2K tokens
Input Cost: $0.10/1M tokens
Output Cost: $0.40/1M tokens

Exceptional at:

Speed
Real-time streaming
Multimodal input
Long context
Function calling
+6

gemini-2.0-flash-lite

Google · Gemini 2.0
Status: GA
Input: 1.0M tokens
Output: 8.2K tokens
Input Cost: $0.07/1M tokens
Output Cost: $0.30/1M tokens

Exceptional at:

Cost efficiency
Low latency
Multimodal input
Long context
Function calling
+2

gemini-1.5-pro

Google · Gemini 1.5
Status: GA
Input: 2.1M tokens
Output: 8.2K tokens
Input Cost: $1.25/1M tokens
Output Cost: $5.00/1M tokens

Exceptional at:

Complex reasoning
Long context processing
Multimodal input
Long context
Function calling
+4

gemini-1.5-flash

Google · Gemini 1.5
Status: GA
Input: 1.0M tokens
Output: 8.2K tokens
Input Cost: $0.07/1M tokens
Output Cost: $0.30/1M tokens

Exceptional at:

Fast performance
Versatile tasks
Multimodal input
Long context
Function calling
+5

gemini-1.5-flash-8b

Google · Gemini 1.5
Status: GA
Input: 1.0M tokens
Output: 8.2K tokens
Input Cost: $0.04/1M tokens
Output Cost: $0.15/1M tokens

Exceptional at:

High volume
Cost efficiency
Multimodal input
Long context
Function calling
+5
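
The per-1M-token prices listed for each model can be turned into a rough per-request estimate. The sketch below is illustrative only: `PRICES` copies the list prices from the cards above, `estimate_cost` is a hypothetical helper name, and it assumes billing scales linearly with token count. It does not model any discount for cached tokens, since this page does not state one.

```python
from decimal import Decimal

# List prices in USD per 1M tokens, copied from the model cards: (input, output).
PRICES = {
    "gemini-2.5-pro-preview-05-06": (Decimal("1.25"), Decimal("10.00")),
    "gemini-2.0-flash": (Decimal("0.10"), Decimal("0.40")),
    "gemini-2.0-flash-lite": (Decimal("0.07"), Decimal("0.30")),
    "gemini-1.5-pro": (Decimal("1.25"), Decimal("5.00")),
    "gemini-1.5-flash": (Decimal("0.07"), Decimal("0.30")),
    "gemini-1.5-flash-8b": (Decimal("0.04"), Decimal("0.15")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: list-price cost in USD for a single request.

    Assumes linear per-token billing and no cached-token discount.
    """
    input_price, output_price = PRICES[model]
    return (input_price * input_tokens + output_price * output_tokens) / MILLION


if __name__ == "__main__":
    # Example: a 200K-token prompt with a 2K-token reply on gemini-2.0-flash.
    print(estimate_cost("gemini-2.0-flash", 200_000, 2_000))
```

Using `Decimal` rather than `float` keeps the arithmetic exact for prices quoted to the cent, which matters when summing many small requests.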
