Workload-first model discovery

LLM API Model Finder

Filter the model catalog by token volume, use case, provider, and minimum context window before opening model pages or comparison search.

Pricing data updated:  Prices normalized to USD per 1M tokens Ranking updates as workload and filters change

Shortlist by constraints

Start from workload shape instead of a model name.

Use this finder when you know the rough monthly token volume, context requirement, provider preference, or workload type but still need a practical shortlist.

Models tracked353
Priced models353
Ranked models20
1M+ context65

The finder is a discovery aid, not a benchmark. Compare the shortlisted models and verify current provider terms before production use.

Estimate your workload cost

Monthly token volume

Output-heavy workloads can change the shortlist.

Costs use the selected token volume and normalized model prices. Missing price fields appear as N/A and rank behind priced models for cost-first views.

ModelProviderInput / 1MOutput / 1MYour CostContextRankNext Step
🔥DeepSeek V4 FlashDeepSeek$0.112$0.224$0.221.05M#1Open model
🔥Hy3 previewTencent$0.066$0.26$0.2262.14K#2Open model
🔥Claude Sonnet 4.6Anthropic$3$15$10.51M#3Open model
🔥Owl AlphaOpenRouter$0$0$01.05M#4Open model
🔥DeepSeek V3.2DeepSeek$0.252$0.378$0.44131.07K#5Open model
🔥Gemini 3 Flash PreviewGoogle$0.5$3$21.05M#6Open model
🔥Claude Opus 4.7Anthropic$5$25$17.51M#7Open model
🔥DeepSeek V4 ProDeepSeek$0.435$0.87$0.871.05M#8Open model
🔥Step 3.5 FlashStepFun$0.1$0.3$0.25262.14K#9Open model
🔥MiniMax M2.7MiniMax$0.279$1.2$0.88204.8K#10Open model
🔥Nemotron 3 Super (free)NVIDIA$0$0$01M#11Open model
🔥Kimi K2.6MoonshotAI$0.73$3.49$2.48262.14K#12Open model

How to use the shortlist

Cost-first decisions

Sort by Your cost when your monthly bill is the first constraint. Check both input and output prices before choosing a chatbot, agent, or batch workload model.

Context-first decisions

Set a minimum context window before comparing RAG, long-document, or codebase workflows. A cheaper model may still be the wrong fit when context is tight.

Provider fit

Use provider filtering when procurement, region, account limits, or existing integrations matter. Then compare close alternatives inside the provider hub.