Logo

AI Models with Multimodal Input Support

This page lists Large Language Models that offer Multimodal Input. Compare models, see how they implement this feature, and find the best option for projects requiring robust Multimodal Input.

Providers

Google
Anthropic
OpenAI

Models with this Capability

Gemini 2.5 Flash Preview

Google · Gemini

ID: google-gemini-2.5-flash-preview

preview

Input

1M tokens

Output

0 tokens

Input Cost

$0.15/1M

Output Cost

$0.60/1M

Exceptional at:

mathematics

Gemini 2.0 Flash

Google · Gemini

ID: google-gemini-2.0-flash

Current

Input

1M tokens

Output

0 tokens

Input Cost

$0.10/1M

Output Cost

$0.40/1M

Exceptional at:

instruction following

Gemini 2.0 Flash

Google · Gemini

ID: google-gemini-2.0-flash-live

Current

Input

1M tokens

Output

0 tokens

Input Cost

$0.10/1M

Output Cost

$0.40/1M

Exceptional at:

instruction following

Claude Sonnet 3.7

Anthropic · Claude 3

ID: claude-3-7-sonnet-20250219

outdated

Input

200K tokens

Output

64K tokens

Input Cost

$3.00/1M

Output Cost

$15.00/1M

Exceptional at:

extended thinking
web development

GPT-4.1 nano

OpenAI · GPT-4.1

ID: gpt-4.1-nano

Current

Input

1.0M tokens

Output

32.8K tokens

Input Cost

$0.10/1M

Output Cost

$0.40/1M

Exceptional at:

cost-effectiveness
speed

Claude Haiku 3.5

Anthropic · Claude 3.5

ID: claude-3-5-haiku-20241022

Current

Input

200K tokens

Output

8.2K tokens

Input Cost

$0.80/1M

Output Cost

$4.00/1M

Exceptional at:

speed
efficiency
+1

Gemini 1.5 Flash-8B

Google · Gemini

ID: google-gemini-1.5-flash-8b

Current

Input

1M tokens

Output

0 tokens

Input Cost

$0.04/1M

Output Cost

$0.15/1M

Claude 3.5 Haiku

Anthropic · Claude 3.5

ID: anthropic-claude-3-5-haiku

Current

Input

200K tokens

Output

8.2K tokens

Input Cost

$0.80/1M

Output Cost

$4.00/1M

Exceptional at:

speed

Claude 3.5 Sonnet

Anthropic · Claude

ID: claude-3-5-sonnet-20241022

outdated

Input

200K tokens

Output

8.2K tokens

Input Cost

$3.00/1M

Output Cost

$15.00/1M

Claude Haiku 3

Anthropic · Claude 3

ID: anthropic-claude-3-haiku

outdated

Input

200K tokens

Output

4.1K tokens

Input Cost

$0.25/1M

Output Cost

$1.25/1M

Exceptional at:

quick and accurate targeted performance
fast response applications

Claude Sonnet 3.5

Anthropic · Claude 3.5

ID: claude-3-5-sonnet-20240620

outdated

Input

200K tokens

Output

8.2K tokens

Input Cost

$3.00/1M

Output Cost

$15.00/1M

GPT-4o mini Realtime

OpenAI · GPT-4o

ID: gpt-4o-mini-realtime

preview

Input

128K tokens

Output

4.1K tokens

Input Cost

$0.60/1M

Output Cost

$2.40/1M

Exceptional at:

realtime processing
audio input
+1

Claude Sonnet 3

Anthropic · Claude 3

ID: claude-3-sonnet-20240229

outdated

Input

200K tokens

Output

4.1K tokens

Input Cost

$3.00/1M

Output Cost

$15.00/1M

Exceptional at:

reasoning
human-like interactions

GPT-4 Turbo

OpenAI · GPT-4

ID: gpt-4-turbo

outdated

Input

128K tokens

Output

4.1K tokens

Input Cost

$10.00/1M

Output Cost

$30.00/1M

Exceptional at:

long context processing
function calling
+2

GPT-4o

OpenAI · GPT-4o

ID: gpt-4o-2024-05-13

outdated

Input

128K tokens

Output

16.4K tokens

Input Cost

$2.50/1M

Output Cost

$10.00/1M

Exceptional at:

high intelligence
versatility
+4

GPT-4o

OpenAI · GPT-4o

ID: gpt-4o-2024-11-20

Current

Input

128K tokens

Output

16.4K tokens

Input Cost

$2.50/1M

Output Cost

$10.00/1M

Exceptional at:

general reasoning
multimodal understanding
+1

GPT-4o mini Audio

OpenAI · GPT-4o

ID: gpt-4o-mini-audio-preview-2024-12-17

preview

Input

128K tokens

Output

4.1K tokens

Input Cost

$0.15/1M

Output Cost

$0.60/1M

Exceptional at:

audio to text transcription
text to audio synthesis
+1

o1-pro

OpenAI · o1

ID: o1-pro

Current

Input

200K tokens

Output

100K tokens

Input Cost

$150.00/1M

Output Cost

$600.00/1M

Exceptional at:

complex reasoning
multi turn interactions

omni-moderation

OpenAI · omni-moderation

ID: omni-moderation-latest

Current

Input

0 tokens

Output

0 tokens

Input Cost

$0.00/1M

Output Cost

$0.00/1M

Exceptional at:

harmful content detection
multimodal moderation

GPT-4o Audio Preview

OpenAI · GPT-4o

ID: gpt-4o-audio-preview-2024-10-01

outdated

Input

0 tokens

Output

0 tokens

Input Cost

$40.00/1M

Output Cost

$80.00/1M

Exceptional at:

audio input processing
audio output generation
+1

omni-moderation

OpenAI · Moderation

ID: omni-moderation-2024-09-26

Current

Input

0 tokens

Output

0 tokens

Input Cost

$0.00/1M

Output Cost

$0.00/1M

Exceptional at:

identifying harmful content in text and images

ChatGPT-4o

OpenAI · GPT-4o

ID: chatgpt-4o

Current

Input

128K tokens

Output

4.1K tokens

Input Cost

$5.00/1M

Output Cost

$15.00/1M

Exceptional at:

text understanding
vision tasks
+1

GPT-4o mini

OpenAI · GPT-4o

ID: gpt-4o-mini

Current

Input

128K tokens

Output

16.4K tokens

Input Cost

$0.15/1M

Output Cost

$0.60/1M

Exceptional at:

fast
affordable
+2

GPT-4o mini Realtime

OpenAI · GPT-4o

ID: gpt-4o-mini-realtime-preview

preview

Input

128K tokens

Output

4.1K tokens

Input Cost

$0.60/1M

Output Cost

$2.40/1M

Exceptional at:

realtime text processing
realtime audio processing

o3-2025-04-16

OpenAI · o3

ID: o3-2025-04-16

Current

Input

200K tokens

Output

100K tokens

Input Cost

$10.00/1M

Output Cost

$40.00/1M

Exceptional at:

math
science
+6

o3

OpenAI · OpenAI

ID: o3

Current

Input

200K tokens

Output

100K tokens

Input Cost

$10.00/1M

Output Cost

$40.00/1M

Exceptional at:

complex reasoning
math
+6

Claude Opus 4

Anthropic · Claude

ID: claude-opus-4-20250514

Current

Input

200K tokens

Output

32K tokens

Input Cost

$15.00/1M

Output Cost

$75.00/1M

Exceptional at:

highest intelligence
complex reasoning
+5

GPT-4o

OpenAI · GPT-4o

ID: gpt-4o

Current

Input

128K tokens

Output

16.4K tokens

Input Cost

$2.50/1M

Output Cost

$10.00/1M

Exceptional at:

multimodal understanding
complex reasoning
+2

GPT-4o Audio

OpenAI · GPT-4o

ID: gpt-4o-audio-preview

preview

Input

128K tokens

Output

16.4K tokens

Input Cost

$2.50/1M

Output Cost

$10.00/1M

Exceptional at:

audio input processing
audio output generation

GPT-4o Realtime

OpenAI · GPT-4o

ID: gpt-4o-realtime-preview

preview

Input

128K tokens

Output

4.1K tokens

Input Cost

$5.00/1M

Output Cost

$20.00/1M

Exceptional at:

realtime audio processing
realtime text processing
+1

Claude Haiku 3

Anthropic · Claude 3

ID: claude-3-haiku-20240307

outdated

Input

200K tokens

Output

4.1K tokens

Input Cost

$0.25/1M

Output Cost

$1.25/1M

GPT-4.1 mini

OpenAI · GPT-4.1

ID: gpt-4.1-mini

Current

Input

1.0M tokens

Output

32.8K tokens

Input Cost

$0.40/1M

Output Cost

$1.60/1M

Claude Opus 3

Anthropic · Claude 3

ID: claude-3-opus-20240229

outdated

Input

200K tokens

Output

4.1K tokens

Input Cost

$15.00/1M

Output Cost

$75.00/1M

Exceptional at:

complex reasoning
advanced coding
+4

Claude 3.5 Sonnet

Anthropic · Claude

ID: claude-3-5-sonnet-latest

Current

Input

200K tokens

Output

8.2K tokens

Input Cost

$3.00/1M

Output Cost

$15.00/1M

Gemini 2.0 Flash-Lite

Google · Gemini

ID: google-gemini-2.0-flash-lite

Current

Input

0 tokens

Output

0 tokens

Input Cost

$0.07/1M

Output Cost

$0.30/1M

Exceptional at:

instruction following

Gemini 1.5 Flash

Google · Gemini

ID: google-gemini-1.5-flash

Current

Input

1M tokens

Output

1M tokens

Input Cost

$0.07/1M

Output Cost

$0.30/1M

Exceptional at:

long context processing
multimodal understanding
+3

Gemini 2.5 Pro Preview

Google · Gemini

ID: google-gemini-2.5-pro-preview

preview

Input

1M tokens

Output

0 tokens

Input Cost

$1.25/1M

Output Cost

$10.00/1M

Exceptional at:

complex reasoning
coding
+7

Gemini 1.5 Pro

Google · Gemini

ID: google-gemini-1.5-pro

Current

Input

2M tokens

Output

0 tokens

Input Cost

$1.25/1M

Output Cost

$5.00/1M

Exceptional at:

long context processing
complex reasoning
+1

Claude 3.7 Sonnet

Anthropic · Claude 3

ID: anthropic-claude-3-7-sonnet

Current

Input

200K tokens

Output

4.1K tokens

Input Cost

$3.00/1M

Output Cost

$15.00/1M

Exceptional at:

agentic coding
web development tasks
+1

GPT-4o

OpenAI · GPT-4o

ID: gpt-4o-2024-08-06

outdated

Input

128K tokens

Output

16.4K tokens

Input Cost

$2.50/1M

Output Cost

$10.00/1M

Exceptional at:

general reasoning
multimodal understanding
+2

GPT-4o Audio Preview

OpenAI · GPT-4o

ID: gpt-4o-audio-preview-2024-12-17

preview

Input

0 tokens

Output

0 tokens

Input Cost

$40.00/1M

Output Cost

$80.00/1M

Exceptional at:

realtime audio processing
realtime text processing

Claude Opus 3

Anthropic · Claude 3

ID: anthropic-claude-3-opus

outdated

Input

200K tokens

Output

4.1K tokens

Input Cost

$15.00/1M

Output Cost

$75.00/1M

Exceptional at:

complex reasoning
multimodal understanding
+1

omni-moderation-latest

OpenAI · OpenAI

ID: omni-moderation

Current

Input

0 tokens

Output

0 tokens

Input Cost

$0.00/1M

Output Cost

$0.00/1M

Exceptional at:

multimodal content moderation
identifying harmful content (text and images)

Claude Sonnet 4

Anthropic · Claude 4

ID: claude-sonnet-4-20250514

Current

Input

200K tokens

Output

64K tokens

Input Cost

$3.00/1M

Output Cost

$15.00/1M

Exceptional at:

reasoning
coding
+3

Claude Sonnet 4

Anthropic · Claude

ID: claude-sonnet-4-0

Current

Input

200K tokens

Output

64K tokens

Input Cost

$3.00/1M

Output Cost

$15.00/1M

Exceptional at:

reasoning
coding
+4

Claude Opus 4

Anthropic · Claude 4

ID: claude-opus-4-0

Current

Input

200K tokens

Output

32K tokens

Input Cost

$15.00/1M

Output Cost

$75.00/1M

Exceptional at:

complex reasoning
advanced coding
+4

GPT-4o mini Realtime

OpenAI · GPT-4o

ID: gpt-4o-mini-realtime-preview-2024-12-17

preview

Input

128K tokens

Output

4.1K tokens

Input Cost

$0.60/1M

Output Cost

$2.40/1M

Exceptional at:

realtime processing
audio input
+1

Claude Haiku 3.5

Anthropic · Claude

ID: claude-3-5-haiku-latest

Current

Input

200K tokens

Output

8.2K tokens

Input Cost

$0.80/1M

Output Cost

$4.00/1M

Exceptional at:

speed
fast inference

o4-mini

OpenAI · o-series

ID: o4-mini

Current

Input

200K tokens

Output

100K tokens

Input Cost

$1.10/1M

Output Cost

$4.40/1M

Exceptional at:

reasoning
coding
+1

GPT-4.1

OpenAI · GPT-4.1

ID: gpt-4.1

Current

Input

1.0M tokens

Output

32.8K tokens

Input Cost

$2.00/1M

Output Cost

$8.00/1M

Exceptional at:

complex reasoning
problem solving across domains
+4

GPT-4o mini Audio

OpenAI · GPT-4o

ID: gpt-4o-mini-audio-preview

preview

Input

128K tokens

Output

16.4K tokens

Input Cost

$0.15/1M

Output Cost

$0.60/1M

Exceptional at:

audio input processing
audio output generation

GPT Image 1

OpenAI · GPT Image

ID: gpt-image-1

Current

Input

0 tokens

Output

0 tokens

Input Cost

$10.00/1M

Output Cost

$40.00/1M

Exceptional at:

image generation
image editing

Claude Opus 3

Anthropic · Claude 3

ID: claude-3-opus-latest

outdated

Input

200K tokens

Output

4.1K tokens

Input Cost

$15.00/1M

Output Cost

$75.00/1M

Exceptional at:

top level intelligence
fluency
+1

Similar Capabilities