Workload-first model discovery

LLM API Model Finder

Filter the model catalog by token volume, use case, provider, and minimum context window before opening model pages or comparison search.

Page updated: 2026-06-30T20:07:34Z Data confirmed: 2026-07-01T11:07:32Z Prices normalized to USD per 1M tokens Results update as workload and filters change

Shortlist by constraints

Start from workload shape instead of a model name.

Use this finder when you know the rough monthly token volume, context requirement, provider preference, or workload type but still need a practical shortlist.

Models tracked378

Priced models378

Popular models294

1M+ context78

The finder is a discovery aid, not a benchmark. Compare the shortlisted models and verify current provider terms before production use.

Estimate your workload cost

Monthly token volume

Output-heavy workloads can change the shortlist.

Monthly input tokens Monthly output tokens

Costs use the selected token volume and normalized model prices. Models without price data rank behind priced models for cost-first views.

Search Use case Provider Minimum context Sort

Recommended shortlist

Open all models catalog Compare shortlisted models

Model	Provider	Input / 1M	Output / 1M	Your Cost	Context	Popularity	Next Step
🔥DeepSeek V4 Flash	DeepSeek	$0.098	$0.196	$0.2	1.05M	#1	Open model
🔥Hy3 preview	Tencent	$0.063	$0.21	$0.17	262.14K	#2	Open model
🔥MiMo-V2.5	Xiaomi	$0.105	$0.28	$0.24	1.05M	#3	Open model
New🔥MiniMax M3	MiniMax	$0.3	$1.2	$0.9	1.05M	#4	Open model
🔥DeepSeek V4 Pro	DeepSeek	$0.435	$0.87	$0.87	1.05M	#5	Open model
🔥Claude Sonnet 4.6	Anthropic	$3	$15	$10.5	1M	#6	Open model
🔥Owl Alpha	OpenRouter	$0	$0	$0	1.05M	#7	Open model
🔥Claude Opus 4.7	Anthropic	$5	$25	$17.5	1M	#8	Open model
🔥DeepSeek V3.2	DeepSeek	$0.2288	$0.3432	$0.4	131.07K	#9	Open model
🔥Gemini 3 Flash Preview	Google	$0.5	$3	$2	1.05M	#10	Open model
New🔥Step 3.7 Flash	StepFun	$0.2	$1.15	$0.77	256K	#11	Open model
New🔥Nemotron 3 Ultra (free)	NVIDIA	$0	$0	$0	1M	#12	Open model

How to use the shortlist

Cost-first decisions

Sort by Your cost when your monthly bill is the first constraint. Check both input and output prices before choosing a chatbot, agent, or batch workload model.

Context-first decisions

Set a minimum context window before comparing RAG, long-document, or codebase workflows. A cheaper model may still be the wrong fit when context is tight.

Provider fit

Use provider filtering when procurement, region, account limits, or existing integrations matter. Then compare close alternatives inside the provider hub.