Gemma 4 31B

Google API model specs with normalized input price, output price, context window, sample workload cost, and related comparisons.

Page updated:  Data confirmed:  Prices normalized to USD per 1M tokens Sample workload: 1M input + 500K output

Budget brief

Gemma 4 31B is estimated at $0.29 for the standard workload.

Gemma 4 31B is best suited for cost-sensitive production traffic. A 1M input token plus 500K output token workload is estimated at $0.29.

Input price$0.12per 1M tokens
Output price$0.35per 1M tokens
Context window262.14Kreported maximum
Popularity#3626.6B tokens signal volume

Use this as a first-pass planning estimate, then verify provider limits, routing, discounts, and availability before production deployment.

Model Specs
ProviderGoogle
Model IDgoogle/gemma-4-31b-it
Prompt Price
per 1M tokens
$0.12
Completion Price
per 1M tokens
$0.35
Sample Workload Cost
1M input + 500K output
$0.29
Context Window262.14K
Release Date
Popularity#36
Popularity Signal26.6B tokens

Estimate your workload cost

Estimate this model for your workload

Prices are normalized to USD per 1M tokens.
Gemma 4 31B Calculating… Estimated monthly API cost
Unit prices $0.12 input / $0.35 output Per 1M tokens

This estimate uses normalized public API pricing per 1M tokens. It is a planning aid, not a billing quote. Verify provider pricing, limits, and terms before production use.

Alternative path

Alternative Shortlist

Open model finder

Use these rows to build a test shortlist around Gemma 4 31B. Lower cost, same-provider fit, and larger context are separate decisions, so each group is ranked by a different signal.

Lower-cost alternatives

Cross-provider candidates with a lower standard 1M input plus 500K output estimate.

ModelProviderSample CostContextWhy it is hereNext Step
🔥Owl AlphaOpenRouter$01.05MLower standard workload estimate from another provider.Open model Compare
New🔥Nemotron 3 Ultra (free)NVIDIA$01MLower standard workload estimate from another provider.Open model Compare
🔥Laguna M.1 (free)Poolside$0262.14KLower standard workload estimate from another provider.Open model Compare
Nemotron 3 Super (free)NVIDIA$01MLower standard workload estimate from another provider.Open model Compare

Same-provider swaps

Lower-cost options from the same provider, useful when account setup or procurement is already fixed.

ModelProviderSample CostContextWhy it is hereNext Step
Gemma 4 31B (free)Google$0262.14KSame provider with a lower standard workload estimate.Open model Compare
Gemma 4 26B A4B (free)Google$0262.14KSame provider with a lower standard workload estimate.Open model Compare
Lyria 3 Pro PreviewGoogle$01.05MSame provider with a lower standard workload estimate.Open model Compare
Lyria 3 Clip PreviewGoogle$01.05MSame provider with a lower standard workload estimate.Open model Compare

Larger-context upgrades near this budget

Models with more context that stay within a close sample-cost band when price data is available.

ModelProviderSample CostContextWhy it is hereNext Step
Llama 4 ScoutMeta$0.2510MMore context while staying near this model's sample-cost band.Open model Compare
🔥Owl AlphaOpenRouter$01.05MMore context while staying near this model's sample-cost band.Open model Compare
🔥DeepSeek V4 FlashDeepSeek$0.181.05MMore context while staying near this model's sample-cost band.Open model Compare
🔥MiMo-V2.5Xiaomi$0.241.05MMore context while staying near this model's sample-cost band.Open model Compare
Model Introduction

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

Best Fit

Gemma 4 31B is best suited for cost-sensitive production traffic.

Cost Example

A 1M input token plus 500K output token workload is estimated at $0.29.

Decision Shortcuts

Compare this model

Search head-to-head pages that include Gemma 4 31B and review input price, output price, context, and sample workload cost.

Find comparisons

Google catalog

See other Google models before narrowing your shortlist.

Open provider hub

Cheaper alternatives

Start from models sorted by a standard cost estimate when budget is the first constraint.

Browse low-cost models

High-Interest Comparisons

Search this model
ComparisonCost-first PickContext Pick
🔥Gemini 3 Flash Preview vs Gemma 4 31BGemma 4 31BGemini 3 Flash Preview
Gemma 4 31B vs Solar Pro 3Gemma 4 31BGemma 4 31B
Gemma 4 31B vs Free Models RouterFree Models RouterGemma 4 31B
Gemma 4 31B vs Mistral Medium 3.5Gemma 4 31BTie
Nemotron 3 Super (free) vs Gemma 4 31BNemotron 3 Super (free)Nemotron 3 Super (free)
gpt-oss-120b vs Gemma 4 31Bgpt-oss-120bGemma 4 31B

Popular Comparisons

Search all comparisons
ComparisonNewest Release
🔥Claude Opus 4.6 vs Gemma 4 31B
🔥Claude Opus 4.7 vs Gemma 4 31B
🔥Claude Sonnet 4.6 vs Gemma 4 31B
🔥DeepSeek V3.2 vs Gemma 4 31B
🔥DeepSeek V4 Flash vs Gemma 4 31B
🔥DeepSeek V4 Pro vs Gemma 4 31B
🔥Gemini 2.5 Flash Lite vs Gemma 4 31B
🔥Gemini 2.5 Flash vs Gemma 4 31B
🔥Gemini 3 Flash Preview vs Gemma 4 31B
Gemini 3.1 Flash Lite Preview vs Gemma 4 31B