Caching

Available in 6 models from 1 provider

Providers

Google

Models with this Capability

gemini-2.5-pro-preview-05-06

Google · Gemini 2.5 · preview

Input: 1.0M tokens
Output: 65.5K tokens
Input Cost: $1.25/1M tokens
Output Cost: $10.00/1M tokens

Exceptional at: Complex reasoning, Multimodal, +1 more

Multimodal input, Long context, Function calling, +6 more

gemini-2.0-flash

Google · Gemini 2.0 · GA

Input: 1.0M tokens
Output: 8.2K tokens
Input Cost: $0.10/1M tokens
Output Cost: $0.40/1M tokens

Exceptional at: speed, realtime streaming

Multimodal input, Long context, Function calling, +6 more

gemini-2.0-flash-lite

Google · Gemini 2.0 · GA

Input: 1.0M tokens
Output: 8.2K tokens
Input Cost: $0.07/1M tokens
Output Cost: $0.30/1M tokens

Exceptional at: cost efficiency, low latency

Multimodal input, Long context, Function calling, +2 more

gemini-1.5-pro

Google · Gemini 1.5 · GA

Input: 2.1M tokens
Output: 8.2K tokens
Input Cost: $1.25/1M tokens
Output Cost: $5.00/1M tokens

Exceptional at: complex reasoning, long context processing

Multimodal input, Long context, Function calling, +4 more

gemini-1.5-flash

Google · Gemini 1.5 · GA

Input: 1.0M tokens
Output: 8.2K tokens
Input Cost: $0.07/1M tokens
Output Cost: $0.30/1M tokens

Exceptional at: fast performance, versatile tasks

Multimodal input, Long context, Function calling, +5 more

gemini-1.5-flash-8b

Google · Gemini 1.5 · GA

Input: 1.0M tokens
Output: 8.2K tokens
Input Cost: $0.04/1M tokens
Output Cost: $0.15/1M tokens

Exceptional at: high volume, cost efficiency

Multimodal input, Long context, Function calling, +5 more
