gemini-1.5-pro

Google · Gemini 1.5

Flagship

Latest in family

Key Metrics

Input Limit

2.1M tokens

Output Limit

8.2K tokens

Input Cost

$1.25/1M

Output Cost

$5.00/1M

Sample API Code

import google.generativeai as genai

# Configure the API key
genai.configure(api_key="YOUR_API_KEY")

# Set up the model
model = genai.GenerativeModel('gemini-1.5-pro')

# Generate content
response = model.generate_content("Explain the difference between transformers and RNNs in deep learning.")

print(response.text)

Required Libraries

google-generativeai

@google/generative-ai

Notes

Mid-size multimodal model optimized for a wide-range of reasoning tasks. Can process large amounts of data at once, including 2 hours of video, 19 hours of audio, codebases with 60,000 lines of code, or 2,000 pages of text.

Capabilities

Multimodal input

Long context

Function calling

JSON mode

System instructions

Code execution

Caching

Supported Data Types

Input Types

text

image

audio

video

Output Types

text

Strengths & Weaknesses

Exceptional at

complex reasoning

long context processing

Good at

multimodal understanding

code generation

Additional Information

Latest Update

Sep 1, 2024

Knowledge Cutoff

2023-11

ELO Score

1280

Similar Models

gemini-2.5-pro-preview-05-06

Google

preview

gemini-2.5-flash-preview-04-17

Google

preview

gemini-2.0-flash

Google

Similar Capabilities

Advanced reasoning

5 models

Document understanding

6 models

Search grounding

1 models