LLM API budget planning

LLM API Cost Calculator

Estimate monthly API spend from input and output token volume, then compare popular and low-cost models using normalized USD prices per 1M tokens.

Prices normalized to USD per 1M tokens. Calculator estimates planning cost, not final billing.

Start with your workload

Change token volume once; every model estimate updates.

Use this when budgeting a chatbot, RAG pipeline, coding assistant, or batch analysis workload before choosing a provider.

Default scenario: 1M input + 500K output
Popular models: 12
Low-cost shortlist: 12
Comparable unit: Per 1M tokens

Estimate your workload cost

Monthly token volume

Enter input and output tokens separately; an output-heavy workload can change which model is cheapest.

This estimate uses available model metadata. Provider invoices may include routing, caching, discounts, minimums, or account-specific terms.

Popular Model Cost Estimates

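| Model | Provider | Input / 1M | Output / 1M | Your Cost | Context | Popularity |
| --- | --- | --- | --- | --- | --- | --- |
| 🔥 DeepSeek V4 Flash | DeepSeek | $0.098 | $0.196 | $0.2 | 1.05M | #1 |
| 🔥 Hy3 preview | Tencent | $0.063 | $0.21 | $0.17 | 262.14K | #2 |
| 🔥 MiMo-V2.5 | Xiaomi | $0.14 | $0.28 | $0.28 | 1.05M | #3 |
| New 🔥 MiniMax M3 | MiniMax | $0.3 | $1.2 | $0.9 | 1.05M | #4 |
| 🔥 DeepSeek V4 Pro | DeepSeek | $0.435 | $0.87 | $0.87 | 1.05M | #5 |
| 🔥 Claude Sonnet 4.6 | Anthropic | $3 | $15 | $10.5 | 1M | #6 |
| 🔥 Owl Alpha | OpenRouter | $0 | $0 | $0 | 1.05M | #7 |
| 🔥 Claude Opus 4.7 | Anthropic | $5 | $25 | $17.5 | 1M | #8 |
| 🔥 DeepSeek V3.2 | DeepSeek | $0.2288 | $0.3432 | $0.4 | 131.07K | #9 |
| 🔥 Gemini 3 Flash Preview | Google | $0.5 | $3 | $2 | 1.05M | #10 |
| New 🔥 Step 3.7 Flash | StepFun | $0.2 | $1.15 | $0.77 | 256K | #11 |
| New 🔥 Nemotron 3 Ultra (free) | NVIDIA | $0 | $0 | $0 | 1M | #12 |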

Low-Cost Shortlist

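| Model | Provider | Your Cost | Input / 1M | Output / 1M | Context |
| --- | --- | --- | --- | --- | --- |
| 🔥 Owl Alpha | OpenRouter | $0 | $0 | $0 | 1.05M |
| New 🔥 Nemotron 3 Ultra (free) | NVIDIA | $0 | $0 | $0 | 1M |
| 🔥 Laguna M.1 (free) | Poolside | $0 | $0 | $0 | 262.14K |
| Nemotron 3 Super (free) | NVIDIA | $0 | $0 | $0 | 1M |
| gpt-oss-120b (free) | OpenAI | $0 | $0 | $0 | 131.07K |
| Laguna XS.2 (free) | Poolside | $0 | $0 | $0 | 262.14K |
| GLM 4.5 Air (free) | Z.ai | $0 | $0 | $0 | 131.07K |
| gpt-oss-20b (free) | OpenAI | $0 | $0 | $0 | 131.07K |
| Gemma 4 31B (free) | Google | $0 | $0 | $0 | 262.14K |
| Nemotron 3 Nano 30B A3B (free) | NVIDIA | $0 | $0 | $0 | 256K |
| Kimi K2.6 (free) | MoonshotAI | $0 | $0 | $0 | 262.14K |
| Nemotron 3 Nano Omni (free) | NVIDIA | $0 | $0 | $0 | 256K |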

Cost Planning FAQ

How does the calculator estimate cost?

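It converts each token count to millions, multiplies the input tokens by the model's input price per 1M tokens and the output tokens by its output price per 1M tokens, then adds the two amounts. For the default scenario (1M input + 500K output), a model priced at $3 input and $15 output costs 1 × $3 + 0.5 × $15 = $10.50.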
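
A minimal Python sketch of that arithmetic may help when reproducing the estimate in a spreadsheet or script. The function name `estimate_cost` and its parameters are illustrative assumptions, not the calculator's actual code, and the result ignores caching, discounts, and other billing terms.

```python
TOKENS_PER_UNIT = 1_000_000  # prices on this page are quoted per 1M tokens


def estimate_cost(input_tokens, output_tokens,
                  input_price_per_1m, output_price_per_1m):
    """Return the estimated cost in USD for one model and one workload."""
    input_cost = input_tokens / TOKENS_PER_UNIT * input_price_per_1m
    output_cost = output_tokens / TOKENS_PER_UNIT * output_price_per_1m
    return input_cost + output_cost


# Default scenario from this page: 1M input + 500K output tokens,
# priced at $3 input / $15 output per 1M tokens.
monthly = estimate_cost(1_000_000, 500_000, 3.0, 15.0)
print(f"${monthly:.2f}")
```

Keeping the two prices as separate arguments mirrors the calculator's inputs, so the same function can be reused for any row in the tables.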

Why separate input and output tokens?

Chatbots, agents, and code assistants often spend more on output tokens, while retrieval and classification workloads may be input-heavy. Separating them prevents a cheap-looking model from winning the wrong workload.
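
To see how the input/output mix can flip a comparison, the sketch below prices two models from this page under two workloads. The prices are the listed per-1M rates; the helper names (`PRICES`, `estimate_cost`, `cheapest`) and the output-heavy workload of 100K input + 5M output tokens are assumptions chosen for illustration.

```python
# (input price, output price) in USD per 1M tokens, as listed on this page
PRICES = {
    "Hy3 preview": (0.063, 0.21),
    "DeepSeek V4 Flash": (0.098, 0.196),
}


def estimate_cost(input_tokens, output_tokens, input_price, output_price):
    """Estimated USD cost: tokens times per-1M price, summed over both sides."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest(input_tokens, output_tokens, prices=PRICES):
    """Return the name of the model with the lowest estimated cost."""
    return min(
        prices,
        key=lambda name: estimate_cost(input_tokens, output_tokens, *prices[name]),
    )


# Default scenario (1M input + 500K output): the lower input price wins.
print(cheapest(1_000_000, 500_000))

# Hypothetical output-heavy workload (100K input + 5M output):
# the lower output price wins instead.
print(cheapest(100_000, 5_000_000))
```

Under these assumptions the first call selects Hy3 preview and the second selects DeepSeek V4 Flash, because the second workload is dominated by output tokens.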

What should I do after estimating cost?

Open the model page or compare it against a close alternative, then verify current provider limits, discounts, and availability before production use.