AI Models with Vision Support
This page lists large language models that offer vision support. Compare the models below by input and output token limits and by cost per 1M tokens to find the best option for projects that require vision.
Models with this Capability
Gemini 2.5 Flash Preview
Google · Gemini
ID: google-gemini-2.5-flash-preview
Input
1M tokens
Output
0 tokens
Input Cost
$0.15/1M
Output Cost
$0.60/1M
Gemini 2.0 Flash
Google · Gemini
ID: google-gemini-2.0-flash
Input
1M tokens
Output
0 tokens
Input Cost
$0.10/1M
Output Cost
$0.40/1M
Gemini 2.0 Flash
Google · Gemini
ID: google-gemini-2.0-flash-live
Input
1M tokens
Output
0 tokens
Input Cost
$0.10/1M
Output Cost
$0.40/1M
Claude Sonnet 3.7
Anthropic · Claude 3
ID: claude-3-7-sonnet-20250219
Input
200K tokens
Output
64K tokens
Input Cost
$3.00/1M
Output Cost
$15.00/1M
Claude Haiku 3.5
Anthropic · Claude 3.5
ID: claude-3-5-haiku-20241022
Input
200K tokens
Output
8.2K tokens
Input Cost
$0.80/1M
Output Cost
$4.00/1M
Gemini 1.5 Flash-8B
Google · Gemini
ID: google-gemini-1.5-flash-8b
Input
1M tokens
Output
0 tokens
Input Cost
$0.04/1M
Output Cost
$0.15/1M
Claude 3.5 Haiku
Anthropic · Claude 3.5
ID: anthropic-claude-3-5-haiku
Input
200K tokens
Output
8.2K tokens
Input Cost
$0.80/1M
Output Cost
$4.00/1M
Claude 3.5 Sonnet
Anthropic · Claude
ID: anthropic-claude-3-5-sonnet
Input
200K tokens
Output
4.1K tokens
Input Cost
$3.00/1M
Output Cost
$15.00/1M
Claude 3.5 Sonnet
Anthropic · Claude
ID: claude-3-5-sonnet-20241022
Input
200K tokens
Output
8.2K tokens
Input Cost
$3.00/1M
Output Cost
$15.00/1M
Claude Haiku 3
Anthropic · Claude 3
ID: anthropic-claude-3-haiku
Input
200K tokens
Output
4.1K tokens
Input Cost
$0.25/1M
Output Cost
$1.25/1M
Claude Sonnet 3.5
Anthropic · Claude 3.5
ID: claude-3-5-sonnet-20240620
Input
200K tokens
Output
8.2K tokens
Input Cost
$3.00/1M
Output Cost
$15.00/1M
Claude Sonnet 3
Anthropic · Claude 3
ID: claude-3-sonnet-20240229
Input
200K tokens
Output
4.1K tokens
Input Cost
$3.00/1M
Output Cost
$15.00/1M
Claude Sonnet 3.7
Anthropic · Claude
ID: claude-3-7-sonnet-latest
Input
200K tokens
Output
64K tokens
Input Cost
$3.00/1M
Output Cost
$15.00/1M
GPT-4o
OpenAI · GPT-4o
ID: gpt-4o-2024-11-20
Input
128K tokens
Output
16.4K tokens
Input Cost
$2.50/1M
Output Cost
$10.00/1M
GPT-4o mini Audio
OpenAI · GPT-4o
ID: gpt-4o-mini-audio-preview-2024-12-17
Input
128K tokens
Output
4.1K tokens
Input Cost
$0.15/1M
Output Cost
$0.60/1M
omni-moderation
OpenAI · omni-moderation
ID: omni-moderation-latest
Input
0 tokens
Output
0 tokens
Input Cost
$0.00/1M
Output Cost
$0.00/1M
o3-2025-04-16
OpenAI · o3
ID: o3-2025-04-16
Input
200K tokens
Output
100K tokens
Input Cost
$10.00/1M
Output Cost
$40.00/1M
o3
OpenAI · OpenAI
ID: o3
Input
200K tokens
Output
100K tokens
Input Cost
$10.00/1M
Output Cost
$40.00/1M
Claude Opus 4
Anthropic · Claude
ID: claude-opus-4-20250514
Input
200K tokens
Output
32K tokens
Input Cost
$15.00/1M
Output Cost
$75.00/1M
GPT-4o
OpenAI · GPT-4o
ID: gpt-4o
Input
128K tokens
Output
16.4K tokens
Input Cost
$2.50/1M
Output Cost
$10.00/1M
Claude Haiku 3
Anthropic · Claude 3
ID: claude-3-haiku-20240307
Input
200K tokens
Output
4.1K tokens
Input Cost
$0.25/1M
Output Cost
$1.25/1M
Claude Opus 3
Anthropic · Claude 3
ID: claude-3-opus-20240229
Input
200K tokens
Output
4.1K tokens
Input Cost
$15.00/1M
Output Cost
$75.00/1M
Claude 3.5 Sonnet
Anthropic · Claude
ID: claude-3-5-sonnet-latest
Input
200K tokens
Output
8.2K tokens
Input Cost
$3.00/1M
Output Cost
$15.00/1M
Gemini 2.5 Pro Preview
Google · Gemini
ID: google-gemini-2.5-pro-preview
Input
1M tokens
Output
0 tokens
Input Cost
$1.25/1M
Output Cost
$10.00/1M
Claude 3.7 Sonnet
Anthropic · Claude 3
ID: anthropic-claude-3-7-sonnet
Input
200K tokens
Output
4.1K tokens
Input Cost
$3.00/1M
Output Cost
$15.00/1M
GPT-4o
OpenAI · GPT-4o
ID: gpt-4o-2024-08-06
Input
128K tokens
Output
16.4K tokens
Input Cost
$2.50/1M
Output Cost
$10.00/1M
Claude Opus 3
Anthropic · Claude 3
ID: anthropic-claude-3-opus
Input
200K tokens
Output
4.1K tokens
Input Cost
$15.00/1M
Output Cost
$75.00/1M
Claude Sonnet 4
Anthropic · Claude 4
ID: claude-sonnet-4-20250514
Input
200K tokens
Output
64K tokens
Input Cost
$3.00/1M
Output Cost
$15.00/1M
Claude Sonnet 4
Anthropic · Claude
ID: claude-sonnet-4-0
Input
200K tokens
Output
64K tokens
Input Cost
$3.00/1M
Output Cost
$15.00/1M
Claude Opus 4
Anthropic · Claude 4
ID: claude-opus-4-0
Input
200K tokens
Output
32K tokens
Input Cost
$15.00/1M
Output Cost
$75.00/1M
Claude Haiku 3.5
Anthropic · Claude
ID: claude-3-5-haiku-latest
Input
200K tokens
Output
8.2K tokens
Input Cost
$0.80/1M
Output Cost
$4.00/1M
Claude Opus 3
Anthropic · Claude 3
ID: claude-3-opus-latest
Input
200K tokens
Output
4.1K tokens
Input Cost
$15.00/1M
Output Cost
$75.00/1M
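Estimating Request Cost
The per-1M-token prices listed above can be turned into a rough per-request estimate. The sketch below is illustrative only: the prices are copied from three entries in this listing, while the `PRICES` table, the `estimate_cost` helper, and the token counts are hypothetical names and values chosen for the example. It also assumes that any image input has already been converted to a token count, which each provider does differently.

```python
# Prices in USD per 1M tokens (input, output), copied from the listing above.
# The dictionary and helper names are assumptions made for this example.
PRICES = {
    "gpt-4o": (2.50, 10.00),
    "claude-3-5-haiku-20241022": (0.80, 4.00),
    "claude-opus-4-20250514": (15.00, 75.00),
}


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD of one request.

    Each price is quoted per 1M tokens, so the token counts are scaled
    by 1,000,000 after multiplying by the matching price.
    """
    input_price, output_price = PRICES[model_id]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 10,000 input tokens and 1,000 output tokens.
    for model_id in PRICES:
        cost = estimate_cost(model_id, 10_000, 1_000)
        print(f"{model_id}: ${cost:.4f}")
```

Because output tokens cost several times more than input tokens for every model in this listing, the expected response length can matter as much as the prompt size when comparing options.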
Similar Capabilities
Multimodal Input
Found in 30 models with Vision
Long Context
Found in 24 models with Vision
Thinking
Found in 3 models with Vision
Audio Understanding
Found in 2 models with Vision
Video Understanding
Found in 2 models with Vision
Image Understanding
Found in 2 models with Vision