Side-by-side, source-of-truth pricing for every model in the AiPricingLab catalog. Includes context-tier breakdowns (≤200k vs >200k), image quality tiers (1K / 2K / 4K), multimodal input/output rates, and prompt-caching multipliers.
90
models
10
providers
8
categories
Category
Provider
Showing 90 of 90 models
Language models
63
GPT-5.5
OpenAIgpt-5.5
OpenAI flagship reasoning model (≤272k context).
Input
≤272K context
$5 / 1M tok
Cached input
≤272K context
$0.5 / 1M tok
Output
≤272K context
$30 / 1M tok
GPT-5.5 Pro
OpenAIgpt-5.5-pro
Input
≤272K context
$30 / 1M tok
Output
≤272K context
$180 / 1M tok
GPT-5.4
OpenAIgpt-5.4
Input
≤272K context
$2.5 / 1M tok
Cached input
≤272K context
$0.25 / 1M tok
Output
≤272K context
$15 / 1M tok
GPT-5.4 mini
OpenAIgpt-5.4-mini
Input
$0.75 / 1M tok
Cached input
$0.075 / 1M tok
Output
$4.5 / 1M tok
GPT-5.4 nano
OpenAIgpt-5.4-nano
Input
$0.2 / 1M tok
Cached input
$0.02 / 1M tok
Output
$1.25 / 1M tok
GPT-5.4 Pro
OpenAIgpt-5.4-pro
Input
≤272K context
$30 / 1M tok
Output
≤272K context
$180 / 1M tok
GPT-5.2
OpenAIgpt-5.2
Input
$1.75 / 1M tok
Cached input
$0.175 / 1M tok
Output
$14 / 1M tok
GPT-5.2 Pro
OpenAIgpt-5.2-pro
Input
$21 / 1M tok
Output
$168 / 1M tok
GPT-5.1
OpenAIgpt-5.1
Input
$1.25 / 1M tok
Cached input
$0.125 / 1M tok
Output
$10 / 1M tok
GPT-5
OpenAIgpt-5
Input
$1.25 / 1M tok
Cached input
$0.125 / 1M tok
Output
$10 / 1M tok
GPT-5 mini
OpenAIgpt-5-mini
Input
$0.25 / 1M tok
Cached input
$0.025 / 1M tok
Output
$2 / 1M tok
GPT-5 nano
OpenAIgpt-5-nano
Input
$0.05 / 1M tok
Cached input
$0.005 / 1M tok
Output
$0.4 / 1M tok
GPT-5 Pro
OpenAIgpt-5-pro
Input
$15 / 1M tok
Output
$120 / 1M tok
GPT-4.1
OpenAIgpt-4.1
Input
$2 / 1M tok
Cached input
$0.5 / 1M tok
Output
$8 / 1M tok
GPT-4.1 mini
OpenAIgpt-4.1-mini
Input
$0.4 / 1M tok
Cached input
$0.1 / 1M tok
Output
$1.6 / 1M tok
GPT-4.1 nano
OpenAIgpt-4.1-nano
Input
$0.1 / 1M tok
Cached input
$0.025 / 1M tok
Output
$0.4 / 1M tok
GPT-4o
OpenAIgpt-4o
intext, image, audioouttext
Input
$2.5 / 1M tok
Cached input
$1.25 / 1M tok
Output
$10 / 1M tok
GPT-4o mini
OpenAIgpt-4o-mini
Input
$0.15 / 1M tok
Cached input
$0.075 / 1M tok
Output
$0.6 / 1M tok
o1
OpenAIo1
Input
$15 / 1M tok
Cached input
$7.5 / 1M tok
Output
$60 / 1M tok
o1-pro
OpenAIo1-pro
Input
$150 / 1M tok
Output
$600 / 1M tok
o3
OpenAIo3
Input
$2 / 1M tok
Cached input
$0.5 / 1M tok
Output
$8 / 1M tok
o3-pro
OpenAIo3-pro
Input
$20 / 1M tok
Output
$80 / 1M tok
o3-mini
OpenAIo3-mini
Input
$1.1 / 1M tok
Cached input
$0.55 / 1M tok
Output
$4.4 / 1M tok
o4-mini
OpenAIo4-mini
Input
$1.1 / 1M tok
Cached input
$0.275 / 1M tok
Output
$4.4 / 1M tok
GPT-4o (2024-05-13)
deprecated
OpenAIgpt-4o-2024-05-13
Input
$5 / 1M tok
Output
$15 / 1M tok
o1-mini
OpenAIo1-mini
Input
$1.1 / 1M tok
Cached input
$0.55 / 1M tok
Output
$4.4 / 1M tok
o3-deep-research
OpenAIo3-deep-research
Input
$10 / 1M tok
Cached input
$2.5 / 1M tok
Output
$40 / 1M tok
o4-mini-deep-research
OpenAIo4-mini-deep-research
Input
$2 / 1M tok
Cached input
$0.5 / 1M tok
Output
$8 / 1M tok
computer-use-preview
preview
OpenAIcomputer-use-preview
Input
$3 / 1M tok
Output
$12 / 1M tok
GPT-4 Turbo (2024-04-09)
deprecated
OpenAIgpt-4-turbo-2024-04-09
Input
$10 / 1M tok
Output
$30 / 1M tok
GPT-4 0125 Preview
deprecated
OpenAIgpt-4-0125-preview
Input
$10 / 1M tok
Output
$30 / 1M tok
GPT-4 1106 Preview
deprecated
OpenAIgpt-4-1106-preview
Input
$10 / 1M tok
Output
$30 / 1M tok
GPT-4 1106 Vision Preview
deprecated
OpenAIgpt-4-1106-vision-preview
Input
$10 / 1M tok
Output
$30 / 1M tok
GPT-4 0613
deprecated
OpenAIgpt-4-0613
Input
$30 / 1M tok
Output
$60 / 1M tok
GPT-4 0314
deprecated
OpenAIgpt-4-0314
Input
$30 / 1M tok
Output
$60 / 1M tok
GPT-4 32k
deprecated
OpenAIgpt-4-32k
Input
$60 / 1M tok
Output
$120 / 1M tok
GPT-3.5 Turbo
deprecated
OpenAIgpt-3.5-turbo
Input
$0.5 / 1M tok
Output
$1.5 / 1M tok
GPT-3.5 Turbo 0125
deprecated
OpenAIgpt-3.5-turbo-0125
Input
$0.5 / 1M tok
Output
$1.5 / 1M tok
GPT-3.5 Turbo 1106
deprecated
OpenAIgpt-3.5-turbo-1106
Input
$1 / 1M tok
Output
$2 / 1M tok
GPT-3.5 Turbo 0613
deprecated
OpenAIgpt-3.5-turbo-0613
Input
$1.5 / 1M tok
Output
$2 / 1M tok
GPT-3.5 0301
deprecated
OpenAIgpt-3.5-0301
Input
$1.5 / 1M tok
Output
$2 / 1M tok
GPT-3.5 Turbo Instruct
deprecated
OpenAIgpt-3.5-turbo-instruct
Input
$1.5 / 1M tok
Output
$2 / 1M tok
GPT-3.5 Turbo 16k 0613
deprecated
OpenAIgpt-3.5-turbo-16k-0613
Input
$3 / 1M tok
Output
$4 / 1M tok
davinci-002
deprecated
OpenAIdavinci-002
Input
$2 / 1M tok
Output
$2 / 1M tok
babbage-002
deprecated
OpenAIbabbage-002
Input
$0.4 / 1M tok
Output
$0.4 / 1M tok
Claude Opus 4.7
Anthropicclaude-opus-4-7
New tokenizer — may use up to 35% more tokens than 4.6.
Input
$5 / 1M tok
Cache write (5 min)
$6.25 / 1M tok
Cache write (1 hour)
$10 / 1M tok
Cache hit / refresh
$0.5 / 1M tok
Output
$25 / 1M tok
Claude Opus 4.6
Anthropicclaude-opus-4-6
Input
$5 / 1M tok
Cache write (5 min)
$6.25 / 1M tok
Cache write (1 hour)
$10 / 1M tok
Cache hit / refresh
$0.5 / 1M tok
Output
$25 / 1M tok
Claude Opus 4.5
Anthropicclaude-opus-4-5
Input
$5 / 1M tok
Cache write (5 min)
$6.25 / 1M tok
Cache write (1 hour)
$10 / 1M tok
Cache hit / refresh
$0.5 / 1M tok
Output
$25 / 1M tok
Claude Opus 4.1
Anthropicclaude-opus-4-1
Input
$15 / 1M tok
Cache write (5 min)
$18.75 / 1M tok
Cache write (1 hour)
$30 / 1M tok
Cache hit / refresh
$1.5 / 1M tok
Output
$75 / 1M tok
Claude Opus 4
Anthropicclaude-opus-4
Input
$15 / 1M tok
Cache write (5 min)
$18.75 / 1M tok
Cache write (1 hour)
$30 / 1M tok
Cache hit / refresh
$1.5 / 1M tok
Output
$75 / 1M tok
Claude Sonnet 4.6
Anthropicclaude-sonnet-4-6
Input
$3 / 1M tok
Cache write (5 min)
$3.75 / 1M tok
Cache write (1 hour)
$6 / 1M tok
Cache hit / refresh
$0.3 / 1M tok
Output
$15 / 1M tok
Claude Sonnet 4.5
Anthropicclaude-sonnet-4-5
Input
$3 / 1M tok
Cache write (5 min)
$3.75 / 1M tok
Cache write (1 hour)
$6 / 1M tok
Cache hit / refresh
$0.3 / 1M tok
Output
$15 / 1M tok
Claude Sonnet 4
Anthropicclaude-sonnet-4
Input
$3 / 1M tok
Cache write (5 min)
$3.75 / 1M tok
Cache write (1 hour)
$6 / 1M tok
Cache hit / refresh
$0.3 / 1M tok
Output
$15 / 1M tok
Claude Sonnet 3.7
deprecated
Anthropicclaude-sonnet-3-7
Input
$3 / 1M tok
Cache write (5 min)
$3.75 / 1M tok
Cache write (1 hour)
$6 / 1M tok
Cache hit / refresh
$0.3 / 1M tok
Output
$15 / 1M tok
Claude Haiku 4.5
Anthropicclaude-haiku-4-5
Input
$1 / 1M tok
Cache write (5 min)
$1.25 / 1M tok
Cache write (1 hour)
$2 / 1M tok
Cache hit / refresh
$0.1 / 1M tok
Output
$5 / 1M tok
Claude Haiku 3.5
Anthropicclaude-haiku-3-5
Input
$0.8 / 1M tok
Cache write (5 min)
$1 / 1M tok
Cache write (1 hour)
$1.6 / 1M tok
Cache hit / refresh
$0.08 / 1M tok
Output
$4 / 1M tok
Claude Opus 3
deprecated
Anthropicclaude-opus-3
Input
$15 / 1M tok
Cache write (5 min)
$18.75 / 1M tok
Cache write (1 hour)
$30 / 1M tok
Cache hit / refresh
$1.5 / 1M tok
Output
$75 / 1M tok
Claude Haiku 3
deprecated
Anthropicclaude-haiku-3
Input
$0.25 / 1M tok
Cache write (5 min)
$0.3 / 1M tok
Cache write (1 hour)
$0.5 / 1M tok
Cache hit / refresh
$0.03 / 1M tok
Output
$1.25 / 1M tok
Gemini 3.1 Pro
preview
Googlegemini-3-1-pro
Latest performance, intelligence, and usability improvements for multimodal understanding, agentic capabilities, and vibe-coding.
intext, image, video, audioouttext
Input
≤200k tokens
$2 / 1M tok
>200k tokens
$4 / 1M tok
Output
≤200k tokens
$12 / 1M tok
>200k tokens
$18 / 1M tok
Context cache
≤200k tokens
$0.2 / 1M tok
>200k tokens
$0.4 / 1M tok
Cache storage
per hour
$4.5 / 1M tok / hour
Grounding (Search)
after 5,000 free/month
$14 / 1k searches
Gemini 3.1 Flash-Lite
preview
Googlegemini-3-1-flash-lite
Most cost-efficient model, optimized for high-volume agentic tasks, translation, and simple data processing.
intext, image, video, audioouttext
Input
Text / image / video
$0.25 / 1M tok
Audio
$0.5 / 1M tok
Output
$1.5 / 1M tok
Context cache
Text / image / video
$0.025 / 1M tok
Audio
$0.05 / 1M tok
Cache storage
per hour
$1 / 1M tok / hour
Grounding (Search)
after 5,000 free/month
$14 / 1k searches
Gemini 2.5 Pro
Googlegemini-2-5-pro
Input
Text / image / video
$1.25 / 1M tok
Output
$10 / 1M tok
Gemini 3 Flash
preview
Googlegemini-3-flash
Most intelligent model built for speed, combining frontier intelligence with superior search and grounding.
intext, image, video, audioouttext
Input
Text / image / video
$0.5 / 1M tok
Audio
$1 / 1M tok
Output
$3 / 1M tok
Context cache
Text / image / video
$0.05 / 1M tok
Audio
$0.1 / 1M tok
Cache storage
per hour
$1 / 1M tok / hour
Grounding (Search)
after 5,000 free/month
$14 / 1k searches
Llama 3.3 70B
Metallama-3-3-70b
Open weights — pricing varies by host (Together AI, Replicate, Bedrock, etc.).
Input
-
Image generation
9
Gemini 3.1 Flash Image
preview
Googlegemini-3-1-flash-image-preview
Designed for speed and efficiency. High-throughput image generation.