Key Metrics
Input Limit: 2.1M tokens
Output Limit: 8.2K tokens
Input Cost: $1.25/1M tokens
Output Cost: $5.00/1M tokens
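The listed prices can be turned into a quick per-request estimate. The sketch below is a minimal illustration, assuming the figures above apply as flat per-token rates; the function name `estimate_cost` and the constant names are hypothetical, and any tiered or discounted pricing is not modeled.

```python
# Prices copied from the Key Metrics above (USD per 1M tokens).
INPUT_COST_PER_M = 1.25
OUTPUT_COST_PER_M = 5.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (flat-rate assumption)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_COST_PER_M + output_tokens * OUTPUT_COST_PER_M
    return total / 1_000_000


# Example: a 500K-token prompt with a 2K-token reply.
print(f"${estimate_cost(500_000, 2_000):.4f}")
```

Because output tokens cost four times as much as input tokens at these rates, long prompts with short replies stay comparatively cheap.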
Sample API Code
import google.generativeai as genai
# Configure the API key
genai.configure(api_key="YOUR_API_KEY")
# Set up the model
model = genai.GenerativeModel('gemini-1.5-pro')
# Generate content
response = model.generate_content("Explain the difference between transformers and RNNs in deep learning.")
print(response.text)
Required Libraries
google-generativeai (Python, installed with pip)
@google/generative-ai (JavaScript/Node.js, installed with npm)
Notes
Mid-size multimodal model optimized for a wide range of reasoning tasks. It can process large amounts of data at once, including 2 hours of video, 19 hours of audio, codebases with 60,000 lines of code, or 2,000 pages of text.
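Before sending a very large input, it can help to check that it fits the context window. The following is a rough sketch, assuming the rounded 2.1M-token input limit listed on this page and an API key stored in a `GOOGLE_API_KEY` environment variable (an assumed name); `fits_context` is a hypothetical helper, while `count_tokens` is the SDK call that reports token usage.

```python
import os

INPUT_LIMIT_TOKENS = 2_100_000  # rounded input limit as listed on this page


def fits_context(token_count: int, reserve: int = 0) -> bool:
    """True if token_count plus an optional reserve fits the input limit."""
    return token_count >= 0 and token_count + reserve <= INPUT_LIMIT_TOKENS


api_key = os.environ.get("GOOGLE_API_KEY")  # assumed variable name
if api_key:
    # Imported here so the helper above works without the SDK installed.
    import google.generativeai as genai

    genai.configure(api_key=api_key)
    model = genai.GenerativeModel("gemini-1.5-pro")
    # Ask the service how many tokens the content would use.
    total = model.count_tokens("Paste a long document here.").total_tokens
    print(total, fits_context(total))
```

The `reserve` argument leaves headroom for follow-up turns or instructions appended later.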
Capabilities
Multimodal input
Long context
Function calling
JSON mode
System instructions
Code execution
Caching
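Two of the capabilities above, system instructions and JSON mode, can be combined in one request. This is a minimal sketch rather than a definitive recipe: the instruction text, the prompt, the `parse_reply` helper, and the `GOOGLE_API_KEY` variable name are all assumptions made for illustration.

```python
import json
import os

# JSON mode is requested through the response MIME type.
GENERATION_CONFIG = {"response_mime_type": "application/json"}
SYSTEM_INSTRUCTION = (
    "You are a concise tutor. Reply with a JSON object "
    "containing the keys 'term' and 'definition'."
)


def parse_reply(text: str) -> dict:
    """Parse a JSON-mode reply and check that it is a JSON object."""
    data = json.loads(text)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    return data


api_key = os.environ.get("GOOGLE_API_KEY")  # assumed variable name
if api_key:
    # Imported here so the helper above works without the SDK installed.
    import google.generativeai as genai

    genai.configure(api_key=api_key)
    model = genai.GenerativeModel(
        "gemini-1.5-pro",
        generation_config=GENERATION_CONFIG,
        system_instruction=SYSTEM_INSTRUCTION,
    )
    response = model.generate_content("Define 'attention' in deep learning.")
    print(parse_reply(response.text))
```

Validating the reply with `parse_reply` guards against the case where the model returns valid JSON that is not the expected object shape.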
Supported Data Types
Input Types
text
image
audio
video
Output Types
text
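A non-text input can be sent alongside a text prompt. The sketch below assumes a local image named `example.png` (a hypothetical file) and an API key in a `GOOGLE_API_KEY` environment variable (an assumed name); `input_kind` is an illustrative helper that maps a file to one of the input types listed above. Video and audio uploads may need extra handling, such as waiting for processing, that is not shown here.

```python
import mimetypes
import os

SUPPORTED_INPUT_KINDS = {"text", "image", "audio", "video"}


def input_kind(path: str):
    """Return the file's top-level media type if it is a listed input type, else None."""
    mime, _ = mimetypes.guess_type(path)
    kind = mime.split("/")[0] if mime else None
    return kind if kind in SUPPORTED_INPUT_KINDS else None


api_key = os.environ.get("GOOGLE_API_KEY")  # assumed variable name
path = "example.png"                        # hypothetical local file
if api_key and os.path.exists(path) and input_kind(path) == "image":
    # Imported here so the helper above works without the SDK installed.
    import google.generativeai as genai

    genai.configure(api_key=api_key)
    model = genai.GenerativeModel("gemini-1.5-pro")
    uploaded = genai.upload_file(path)  # upload the file, then reference it
    response = model.generate_content([uploaded, "Describe this image."])
    print(response.text)
```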
Strengths & Weaknesses
Exceptional at
complex reasoning
long context processing
Good at
multimodal understanding
code generation
Additional Information
Latest Update: Sep 1, 2024
Knowledge Cutoff: 2023-11
Elo Score: 1280