AI Models with Multimodal Input Support
This page lists Large Language Models that offer Multimodal Input. Compare models, see how they implement this feature, and find the best option for projects requiring robust Multimodal Input.
Providers
Models with this Capability
Claude 3 Haiku
Anthropic · Claude 3
Input
200K tokens
Output
4.1K tokens
Input Cost
$0.25/1M
Output Cost
$1.25/1M
Gemini 2.5 Flash Preview
Google · Gemini
Input
1M tokens
Output
0 tokens
Input Cost
$0.15/1M
Output Cost
$0.60/1M
Exceptional at:
Gemini 2.0 Flash
Google · Gemini
Input
1M tokens
Output
0 tokens
Input Cost
$0.10/1M
Output Cost
$0.40/1M
Exceptional at:
Gemini 2.0 Flash
Google · Gemini
Input
1M tokens
Output
0 tokens
Input Cost
$0.10/1M
Output Cost
$0.40/1M
Exceptional at:
GPT-4.1 nano
OpenAI · GPT-4.1
Input
1.0M tokens
Output
32.8K tokens
Input Cost
$0.10/1M
Output Cost
$0.40/1M
Exceptional at:
Gemini 1.5 Flash-8B
Google · Gemini
Input
1M tokens
Output
0 tokens
Input Cost
$0.04/1M
Output Cost
$0.15/1M
Claude 3.7 Sonnet
Anthropic · Claude 3
Input
200K tokens
Output
4.1K tokens
Input Cost
$3.00/1M
Output Cost
$15.00/1M
Exceptional at:
Claude 3.5 Haiku
Anthropic · Claude 3.5
Input
200K tokens
Output
8.2K tokens
Input Cost
$0.80/1M
Output Cost
$4.00/1M
Exceptional at:
Claude 3 Haiku
Anthropic · Claude 3
Input
200K tokens
Output
4.1K tokens
Input Cost
$0.25/1M
Output Cost
$1.25/1M
Exceptional at:
GPT-4o mini Realtime
OpenAI · GPT-4o
Input
128K tokens
Output
4.1K tokens
Input Cost
$0.60/1M
Output Cost
$2.40/1M
Exceptional at:
GPT-4 Turbo
OpenAI · GPT-4
Input
128K tokens
Output
4.1K tokens
Input Cost
$10.00/1M
Output Cost
$30.00/1M
Exceptional at:
GPT-4o
OpenAI · GPT-4o
Input
128K tokens
Output
16.4K tokens
Input Cost
$2.50/1M
Output Cost
$10.00/1M
Exceptional at:
GPT-4o
OpenAI · GPT-4o
Input
128K tokens
Output
16.4K tokens
Input Cost
$2.50/1M
Output Cost
$10.00/1M
Exceptional at:
GPT-4o mini Audio
OpenAI · GPT-4o
Input
128K tokens
Output
4.1K tokens
Input Cost
$0.15/1M
Output Cost
$0.60/1M
Exceptional at:
o1-pro
OpenAI · o1
Input
200K tokens
Output
100K tokens
Input Cost
$150.00/1M
Output Cost
$600.00/1M
Exceptional at:
omni-moderation
OpenAI · omni-moderation
Input
0 tokens
Output
0 tokens
Input Cost
$0.00/1M
Output Cost
$0.00/1M
Exceptional at:
GPT-4o Audio Preview
OpenAI · GPT-4o
Input
0 tokens
Output
0 tokens
Input Cost
$40.00/1M
Output Cost
$80.00/1M
Exceptional at:
omni-moderation
OpenAI · Moderation
Input
0 tokens
Output
0 tokens
Input Cost
$0.00/1M
Output Cost
$0.00/1M
Exceptional at:
ChatGPT-4o
OpenAI · GPT-4o
Input
128K tokens
Output
4.1K tokens
Input Cost
$5.00/1M
Output Cost
$15.00/1M
Exceptional at:
GPT-4o mini
OpenAI · GPT-4o
Input
128K tokens
Output
16.4K tokens
Input Cost
$0.15/1M
Output Cost
$0.60/1M
Exceptional at:
GPT-4o mini Realtime
OpenAI · GPT-4o
Input
128K tokens
Output
4.1K tokens
Input Cost
$0.60/1M
Output Cost
$2.40/1M
Exceptional at:
o3-2025-04-16
OpenAI · o3
Input
200K tokens
Output
100K tokens
Input Cost
$10.00/1M
Output Cost
$40.00/1M
Exceptional at:
o3
OpenAI · OpenAI
Input
200K tokens
Output
100K tokens
Input Cost
$10.00/1M
Output Cost
$40.00/1M
Exceptional at:
GPT-4o
OpenAI · GPT-4o
Input
128K tokens
Output
16.4K tokens
Input Cost
$2.50/1M
Output Cost
$10.00/1M
Exceptional at:
GPT-4o Audio
OpenAI · GPT-4o
Input
128K tokens
Output
16.4K tokens
Input Cost
$2.50/1M
Output Cost
$10.00/1M
Exceptional at:
GPT-4o Realtime
OpenAI · GPT-4o
Input
128K tokens
Output
4.1K tokens
Input Cost
$5.00/1M
Output Cost
$20.00/1M
Exceptional at:
GPT-4.1 mini
OpenAI · GPT-4.1
Input
1.0M tokens
Output
32.8K tokens
Input Cost
$0.40/1M
Output Cost
$1.60/1M
Gemini 2.0 Flash-Lite
Google · Gemini
Input
0 tokens
Output
0 tokens
Input Cost
$0.07/1M
Output Cost
$0.30/1M
Exceptional at:
Gemini 1.5 Flash
Google · Gemini
Input
1M tokens
Output
1M tokens
Input Cost
$0.07/1M
Output Cost
$0.30/1M
Exceptional at:
Gemini 2.5 Pro Preview
Google · Gemini
Input
1M tokens
Output
0 tokens
Input Cost
$1.25/1M
Output Cost
$10.00/1M
Exceptional at:
Gemini 1.5 Pro
Google · Gemini
Input
2M tokens
Output
0 tokens
Input Cost
$1.25/1M
Output Cost
$5.00/1M
Exceptional at:
Claude 3 Opus
Anthropic · Claude 3
Input
200K tokens
Output
4.1K tokens
Input Cost
$15.00/1M
Output Cost
$75.00/1M
Exceptional at:
GPT-4o
OpenAI · GPT-4o
Input
128K tokens
Output
16.4K tokens
Input Cost
$2.50/1M
Output Cost
$10.00/1M
Exceptional at:
GPT-4o Audio Preview
OpenAI · GPT-4o
Input
0 tokens
Output
0 tokens
Input Cost
$40.00/1M
Output Cost
$80.00/1M
Exceptional at:
omni-moderation-latest
OpenAI · OpenAI
Input
0 tokens
Output
0 tokens
Input Cost
$0.00/1M
Output Cost
$0.00/1M
Exceptional at:
GPT-4o mini Realtime
OpenAI · GPT-4o
Input
128K tokens
Output
4.1K tokens
Input Cost
$0.60/1M
Output Cost
$2.40/1M
Exceptional at:
o4-mini
OpenAI · o-series
Input
200K tokens
Output
100K tokens
Input Cost
$1.10/1M
Output Cost
$4.40/1M
Exceptional at:
GPT-4.1
OpenAI · GPT-4.1
Input
1.0M tokens
Output
32.8K tokens
Input Cost
$2.00/1M
Output Cost
$8.00/1M
Exceptional at:
GPT-4o mini Audio
OpenAI · GPT-4o
Input
128K tokens
Output
16.4K tokens
Input Cost
$0.15/1M
Output Cost
$0.60/1M
Exceptional at:
Similar Capabilities
Long Context
Found in 21 models with Multimodal Input
Vision
Found in 17 models with Multimodal Input
Function Calling
Found in 26 models with Multimodal Input
Thinking
Found in 4 models with Multimodal Input
Audio Understanding
Found in 3 models with Multimodal Input
Video Understanding
Found in 3 models with Multimodal Input