Workload-first model discovery

LLM API Model Finder

Filter the model catalog by token volume, use case, provider, and minimum context window before opening model pages or comparison search.

Prices are normalized to USD per 1M tokens. Results update as workload and filters change.

Shortlist by constraints

Start from workload shape instead of a model name.

Use this finder when you know the rough monthly token volume, context requirement, provider preference, or workload type but still need a practical shortlist.

Models tracked: 367
Priced models: 367
Popular models: 294
1M+ context: 75

The finder is a discovery aid, not a benchmark. Compare the shortlisted models and verify current provider terms before production use.

Estimate your workload cost

Monthly token volume

Output-heavy workloads can change the shortlist.

Costs use the selected token volume and normalized model prices. Models without price data rank behind priced models for cost-first views.
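The cost rule above can be sketched in a few lines of Python. This is a minimal illustration, not the finder's actual implementation: the function names `monthly_cost` and `cost_sort_key` are hypothetical, and the example workload of 1M input tokens plus 500K output tokens is an assumption inferred from the figures in the results table (it reproduces the listed $10.5 for a model priced at $3 input and $15 output), not something the page states.

```python
from typing import Optional


def monthly_cost(
    input_price: Optional[float],
    output_price: Optional[float],
    input_tokens: int,
    output_tokens: int,
) -> Optional[float]:
    """Return the monthly cost in USD, or None when price data is missing.

    Prices are expressed in USD per 1M tokens, matching the normalized
    prices described above.
    """
    if input_price is None or output_price is None:
        return None
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cost_sort_key(cost: Optional[float]) -> tuple:
    """Sort key that places unpriced models (None) behind priced ones."""
    return (cost is None, cost if cost is not None else 0.0)


# Assumed workload: 1M input tokens and 500K output tokens per month.
print(monthly_cost(3.0, 15.0, 1_000_000, 500_000))  # 10.5

# Unpriced entries sort last in a cost-first view.
print(sorted([None, 2.0, 0.5], key=cost_sort_key))  # [0.5, 2.0, None]
```

Returning `None` for missing prices, rather than zero, keeps unpriced models from looking free in a cost-first ranking.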

Badge  | Model                   | Provider   | Input / 1M | Output / 1M | Your Cost | Context | Popularity
-------+-------------------------+------------+------------+-------------+-----------+---------+-----------
🔥     | DeepSeek V4 Flash       | DeepSeek   | $0.098     | $0.196      | $0.2      | 1.05M   | #1
🔥     | Hy3 preview             | Tencent    | $0.063     | $0.21       | $0.17     | 262.14K | #2
🔥     | MiMo-V2.5               | Xiaomi     | $0.14      | $0.28       | $0.28     | 1.05M   | #3
New 🔥 | MiniMax M3              | MiniMax    | $0.3       | $1.2        | $0.9      | 1.05M   | #4
🔥     | DeepSeek V4 Pro         | DeepSeek   | $0.435     | $0.87       | $0.87     | 1.05M   | #5
🔥     | Claude Sonnet 4.6       | Anthropic  | $3         | $15         | $10.5     | 1M      | #6
🔥     | Owl Alpha               | OpenRouter | $0         | $0          | $0        | 1.05M   | #7
🔥     | Claude Opus 4.7         | Anthropic  | $5         | $25         | $17.5     | 1M      | #8
🔥     | DeepSeek V3.2           | DeepSeek   | $0.2288    | $0.3432     | $0.4      | 131.07K | #9
🔥     | Gemini 3 Flash Preview  | Google     | $0.5       | $3          | $2        | 1.05M   | #10
New 🔥 | Step 3.7 Flash          | StepFun    | $0.2       | $1.15       | $0.77     | 256K    | #11
New 🔥 | Nemotron 3 Ultra (free) | NVIDIA     | $0         | $0          | $0        | 1M      | #12

How to use the shortlist

Cost-first decisions

Sort by the Your Cost column when your monthly bill is the first constraint. Check both input and output prices before choosing a model for a chatbot, agent, or batch workload, because the cheaper option depends on the mix of input and output tokens.
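To see why both prices matter, the sketch below compares two models from the results table under an input-heavy and an output-heavy workload. The prices are taken from the table; the token volumes (10M and 1M) and the helper names `cost` and `cheapest` are assumptions chosen only for illustration.

```python
# (input price, output price) in USD per 1M tokens, from the results table.
PRICES = {
    "MiniMax M3": (0.3, 1.2),
    "DeepSeek V4 Pro": (0.435, 0.87),
}


def cost(model: str, input_millions: float, output_millions: float) -> float:
    """Monthly cost in USD for volumes given in millions of tokens."""
    input_price, output_price = PRICES[model]
    return input_millions * input_price + output_millions * output_price


def cheapest(input_millions: float, output_millions: float) -> str:
    """Name of the cheapest model for the given workload mix."""
    return min(PRICES, key=lambda m: cost(m, input_millions, output_millions))


# Input-heavy workload: 10M input tokens, 1M output tokens.
print(cheapest(10, 1))   # MiniMax M3

# Output-heavy workload: 1M input tokens, 10M output tokens.
print(cheapest(1, 10))   # DeepSeek V4 Pro
```

The lower input price wins when prompts dominate, while the lower output price wins when generation dominates, so the same two models swap places as the mix changes.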

Context-first decisions

Set a minimum context window before comparing RAG, long-document, or codebase workflows. A cheaper model may still be the wrong fit when context is tight.
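A context-first shortlist can be sketched as a filter followed by a cost sort. This is an illustrative sketch only: the `Model` dataclass and `shortlist` function are hypothetical names, and the context sizes are the rounded values displayed in the results table rather than exact provider limits.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    context_tokens: int   # rounded from the displayed context window
    your_cost: float      # "Your Cost" in USD, as displayed in the table


MODELS = [
    Model("DeepSeek V4 Flash", 1_050_000, 0.20),
    Model("Hy3 preview", 262_140, 0.17),
    Model("DeepSeek V3.2", 131_070, 0.40),
]


def shortlist(models: list[Model], min_context: int) -> list[Model]:
    """Keep models that meet the context floor, then sort cheapest first."""
    eligible = [m for m in models if m.context_tokens >= min_context]
    return sorted(eligible, key=lambda m: m.your_cost)


# With a 200K floor, the cheapest model still qualifies.
print([m.name for m in shortlist(MODELS, 200_000)])
# ['Hy3 preview', 'DeepSeek V4 Flash']

# With a 500K floor, the cheapest model drops out.
print([m.name for m in shortlist(MODELS, 500_000)])
# ['DeepSeek V4 Flash']
```

Applying the context floor before sorting by cost ensures that a low price never pulls an undersized model onto the shortlist.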

Provider fit

Use provider filtering when procurement, region, account limits, or existing integrations matter. Then compare close alternatives inside the provider hub.