LLM API budget planning

LLM API Cost Calculator

Estimate monthly API spend from input and output token volume, then compare popular and low-cost models using normalized USD prices per 1M tokens.

Page updated: 2026-07-12T13:07:48Z Data confirmed: 2026-07-12T20:07:24Z Prices normalized to USD per 1M tokens Calculator estimates planning cost, not final billing

Start with your workload

Change token volume once; every model estimate updates.

Use this when budgeting a chatbot, RAG pipeline, coding assistant, or batch analysis workload before choosing a provider.

Default scenario1M input + 500K output

Popular models12

Low-cost shortlist12

Comparable unitPer 1M tokens

Estimate your workload cost

Monthly token volume

Use input and output tokens separately; output-heavy apps can change the winner.

Monthly input tokens Monthly output tokens

This estimate uses available model metadata. Provider invoices may include routing, caching, discounts, minimums, or account-specific terms.

Popular Model Cost Estimates

Open popular models

Model	Provider	Input / 1M	Output / 1M	Your Cost	Context	Popularity
🔥DeepSeek V4 Flash	DeepSeek	$0.077	$0.154	$0.15	1.05M	#1
🔥Hy3 preview	Tencent	$0.063	$0.21	$0.17	262.14K	#2
🔥MiMo-V2.5	Xiaomi	$0.105	$0.28	$0.24	1.05M	#3
🔥MiniMax M3	MiniMax	$0.3	$1.2	$0.9	1.05M	#4
🔥DeepSeek V4 Pro	DeepSeek	$0.435	$0.87	$0.87	1.05M	#5
🔥Claude Sonnet 4.6	Anthropic	$3	$15	$10.5	1M	#6
🔥Owl Alpha	OpenRouter	$0	$0	$0	1.05M	#7
🔥Claude Opus 4.7	Anthropic	$5	$25	$17.5	1M	#8
🔥DeepSeek V3.2	DeepSeek	$0.2145	$0.3217	$0.38	131.07K	#9
🔥Gemini 3 Flash Preview	Google	$0.5	$3	$2	1.05M	#10
🔥Step 3.7 Flash	StepFun	$0.2	$1.15	$0.77	256K	#11
🔥Nemotron 3 Ultra (free)	NVIDIA	$0	$0	$0	1M	#12

Low-Cost Shortlist

Browse cheapest models

Model	Provider	Your Cost	Input / 1M	Output / 1M	Context
🔥Owl Alpha	OpenRouter	$0	$0	$0	1.05M
🔥Nemotron 3 Ultra (free)	NVIDIA	$0	$0	$0	1M
🔥Laguna M.1 (free)	Poolside	$0	$0	$0	262.14K
Nemotron 3 Super (free)	NVIDIA	$0	$0	$0	1M
gpt-oss-120b (free)	OpenAI	$0	$0	$0	131.07K
Laguna XS.2 (free)	Poolside	$0	$0	$0	262.14K
GLM 4.5 Air (free)	Z.ai	$0	$0	$0	131.07K
gpt-oss-20b (free)	OpenAI	$0	$0	$0	131.07K
Gemma 4 31B (free)	Google	$0	$0	$0	262.14K
Nemotron 3 Nano 30B A3B (free)	NVIDIA	$0	$0	$0	256K
Kimi K2.6 (free)	MoonshotAI	$0	$0	$0	262.14K
Nemotron 3 Nano Omni (free)	NVIDIA	$0	$0	$0	256K

Cost Planning FAQ

How does the calculator estimate cost?

It multiplies input tokens by the model's input price per 1M tokens, then adds output tokens multiplied by the model's output price per 1M tokens.

Why separate input and output tokens?

Chatbots, agents, and code assistants often spend more on output tokens, while retrieval and classification workloads may be input-heavy. Separating them prevents a cheap-looking model from winning the wrong workload.

What should I do after estimating cost?

Open the model page or compare it against a close alternative, then verify current provider limits, discounts, and availability before production use.