NVIDIA LLM API Models & Pricing

Browse NVIDIA LLM API models with normalized prompt pricing, output pricing, context windows, release dates, and popularity signals.

12Models tracked
12Priced models
12Popular models
3Latest releases

Provider shortlist

Use NVIDIA as a model-level shortlist, not a single default choice.

Start from the cost, context, and popularity picks below, then open the model pages or head-to-head comparisons before choosing an API.

Cost pick Nemotron 3 Ultra (free) $0 sample workload
Context pick Nemotron 3 Ultra (free) 1M context window
Popular pick Nemotron 3 Ultra (free) #12 popularity signal
When to avoid Verify model fit Do not choose by provider name alone. Compare model-level input price, output price, context window, release timing, and current availability before production use.

NVIDIA comparisons

Search NVIDIA comparisons
ComparisonSample Cost WinnerLarger ContextNewest Release
🔥Claude Opus 4.7 vs New🔥Nemotron 3 Ultra (free)Nemotron 3 Ultra (free)Tie
🔥Claude Sonnet 4.6 vs New🔥Nemotron 3 Ultra (free)Nemotron 3 Ultra (free)Tie
🔥DeepSeek V3.2 vs New🔥Nemotron 3 Ultra (free)Nemotron 3 Ultra (free)Nemotron 3 Ultra (free)
🔥DeepSeek V4 Flash vs New🔥Nemotron 3 Ultra (free)Nemotron 3 Ultra (free)DeepSeek V4 Flash
🔥DeepSeek V4 Pro vs New🔥Nemotron 3 Ultra (free)Nemotron 3 Ultra (free)DeepSeek V4 Pro
🔥Gemini 3 Flash Preview vs New🔥Nemotron 3 Ultra (free)Nemotron 3 Ultra (free)Gemini 3 Flash Preview
🔥Hy3 preview vs New🔥Nemotron 3 Ultra (free)Nemotron 3 Ultra (free)Nemotron 3 Ultra (free)
🔥MiMo-V2.5 vs New🔥Nemotron 3 Ultra (free)Nemotron 3 Ultra (free)MiMo-V2.5

Cross-Provider Alternatives

Open alternatives hub

Use this shortlist when NVIDIA is not a hard requirement and your first constraint is workload cost.

AlternativeProviderSample CostInput / 1MOutput / 1MContext
🔥Owl AlphaOpenRouter$0$0$01.05M
🔥Laguna M.1 (free)Poolside$0$0$0262.14K
gpt-oss-120b (free)OpenAI$0$0$0131.07K
Laguna XS.2 (free)Poolside$0$0$0262.14K
GLM 4.5 Air (free)Z.ai$0$0$0131.07K
gpt-oss-20b (free)OpenAI$0$0$0131.07K

NVIDIA model catalog

Browse all models
ModelPromptOutputContextPopularity
New🔥Nemotron 3 Ultra (free)$0$01M#12
Nemotron 3 Super (free)$0$01M#23
Nemotron 3 Nano 30B A3B (free)$0$0256K#75
Nemotron 3 Nano 30B A3B$0.05$0.2262.14K#89
NewNemotron 3 Ultra$0.5$2.21M#91
Nemotron 3 Nano Omni (free)$0$0256K#94
Nemotron 3 Super$0.085$0.41M#96
Nemotron Nano 9B V2 (free)$0$0128K#105
Nemotron Nano 12B 2 VL (free)$0$0128K#107
NewNemotron 3.5 Content Safety (free)$0$0128K#181
Nemotron Nano 9B V2$0.04$0.16131.07K#206
Llama 3.3 Nemotron Super 49B V1.5$0.4$0.4131.07K#209