| Provider | Google |
|---|---|
| Model ID | google/gemma-4-26b-a4b-it |
| Prompt Price per 1M tokens | $0.06 |
| Completion Price per 1M tokens | $0.33 |
| Sample Workload Cost 1M input + 500K output | $0.23 |
| Context Window | 262.14K |
| Release Date | |
| Popularity | #28 |
| Popularity Signal | 37.8B tokens |
Gemma 4 26B A4B
Google API model specs with normalized input price, output price, context window, sample workload cost, and related comparisons.
Budget brief
Gemma 4 26B A4B is best suited for cost-sensitive production traffic. The standard workload of 1M input tokens plus 500K output tokens is estimated at $0.23.
Use this as a first-pass planning estimate, then verify provider limits, routing, discounts, and availability before production deployment.
Estimate your workload cost
This estimate uses normalized public API pricing per 1M tokens. It is a planning aid, not a billing quote. Verify provider pricing, limits, and terms before production use.
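The sample workload figure can be reproduced from the per-1M-token prices in the spec table. The sketch below is a minimal illustration, not the site's calculator: the function name `estimate_cost` is hypothetical, and half-up rounding to cents is an assumption chosen because it matches the $0.23 shown on this page.

```python
from decimal import Decimal, ROUND_HALF_UP

# Prices per 1M tokens, copied from the spec table on this page.
PROMPT_PRICE_PER_M = Decimal("0.06")
COMPLETION_PRICE_PER_M = Decimal("0.33")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the unrounded USD cost of one workload (hypothetical helper)."""
    prompt_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * PROMPT_PRICE_PER_M
    completion_cost = (
        Decimal(output_tokens) / TOKENS_PER_MILLION * COMPLETION_PRICE_PER_M
    )
    return prompt_cost + completion_cost


# Standard workload used on this page: 1M input tokens + 500K output tokens.
standard = estimate_cost(1_000_000, 500_000)

# Assumption: the displayed figure is rounded half-up to whole cents.
rounded = standard.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)

print(f"unrounded: ${standard}, displayed: ${rounded}")
```

The unrounded total is $0.06 + $0.165 = $0.225, which sits exactly on a rounding boundary. That is why the rounding mode matters here: half-even rounding would give $0.22 instead of $0.23.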
Alternative path
Alternative Shortlist
Use these rows to build a test shortlist around Gemma 4 26B A4B. Lower cost, same-provider fit, and larger context are separate decisions, so each group is ranked by a different signal.
Lower-cost alternatives
Cross-provider candidates with a lower standard 1M input plus 500K output estimate.
| Model | Provider | Sample Cost | Context | Why it is here |
|---|---|---|---|---|
| 🔥Owl Alpha | OpenRouter | $0 | 1.05M | Lower standard workload estimate from another provider. |
| New🔥Nemotron 3 Ultra (free) | NVIDIA | $0 | 1M | Lower standard workload estimate from another provider. |
| 🔥Laguna M.1 (free) | Poolside | $0 | 262.14K | Lower standard workload estimate from another provider. |
| Nemotron 3 Super (free) | NVIDIA | $0 | 1M | Lower standard workload estimate from another provider. |
Same-provider swaps
Lower-cost options from the same provider, useful when account setup or procurement is already fixed.
| Model | Provider | Sample Cost | Context | Why it is here |
|---|---|---|---|---|
| Gemma 4 31B (free) | Google | $0 | 262.14K | Same provider with a lower standard workload estimate. |
| Gemma 4 26B A4B (free) | Google | $0 | 262.14K | Same provider with a lower standard workload estimate. |
| Lyria 3 Pro Preview | Google | $0 | 1.05M | Same provider with a lower standard workload estimate. |
| Lyria 3 Clip Preview | Google | $0 | 1.05M | Same provider with a lower standard workload estimate. |
Larger-context upgrades near this budget
Models with more context that stay within a close sample-cost band when price data is available.
| Model | Provider | Sample Cost | Context | Why it is here |
|---|---|---|---|---|
| Llama 4 Scout | Meta | $0.25 | 10M | More context while staying near this model's sample-cost band. |
| 🔥Owl Alpha | OpenRouter | $0 | 1.05M | More context while staying near this model's sample-cost band. |
| 🔥DeepSeek V4 Flash | DeepSeek | $0.18 | 1.05M | More context while staying near this model's sample-cost band. |
| 🔥MiMo-V2.5 | Xiaomi | $0.24 | 1.05M | More context while staying near this model's sample-cost band. |
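A larger-context shortlist like the one above can be built by filtering on two conditions: more context than the baseline, and a sample cost under some ceiling. The sketch below is illustrative only. The page does not state how wide its "close sample-cost band" is, so the `max_extra_usd` default of $0.05 is a hypothetical value, as are the `Candidate` type and the function name. Context sizes are the rounded display values from the tables, not exact token limits.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str
    provider: str
    sample_cost_usd: float  # 1M input + 500K output estimate
    context_tokens: int     # rounded display value from the tables


BASELINE = Candidate("Gemma 4 26B A4B", "Google", 0.23, 262_140)

CANDIDATES = [
    Candidate("Llama 4 Scout", "Meta", 0.25, 10_000_000),
    Candidate("Owl Alpha", "OpenRouter", 0.00, 1_050_000),
    Candidate("DeepSeek V4 Flash", "DeepSeek", 0.18, 1_050_000),
    Candidate("MiMo-V2.5", "Xiaomi", 0.24, 1_050_000),
]


def larger_context_near_budget(candidates, baseline, max_extra_usd=0.05):
    """Keep models with more context whose cost stays under the ceiling.

    Assumption: the band is one-sided (baseline cost + max_extra_usd).
    Results are ordered by largest context first, then lowest cost.
    """
    ceiling = baseline.sample_cost_usd + max_extra_usd
    picks = [
        c
        for c in candidates
        if c.context_tokens > baseline.context_tokens
        and c.sample_cost_usd <= ceiling
    ]
    return sorted(picks, key=lambda c: (-c.context_tokens, c.sample_cost_usd))


shortlist = larger_context_near_budget(CANDIDATES, BASELINE)
for c in shortlist:
    print(f"{c.name:20} {c.provider:12} ${c.sample_cost_usd:.2f} {c.context_tokens:>12,}")
```

Setting `max_extra_usd=0.0` limits the shortlist to models that cost no more than the baseline, which drops Llama 4 Scout and MiMo-V2.5 from this data.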
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...
Decision Shortcuts
Compare this model
Search head-to-head pages that include Gemma 4 26B A4B and review input price, output price, context, and sample workload cost.
Google catalog
See other Google models before narrowing your shortlist.
Cheaper alternatives
Start from models sorted by a standard cost estimate when budget is the first constraint.
Long-context alternatives
Compare large-context models for retrieval, documents, and codebase review.
High-Interest Comparisons
| Comparison | Cost-first Pick | Context Pick |
|---|---|---|
| Gemma 4 26B A4B vs Nemotron 3 Nano 30B A3B (free) | Nemotron 3 Nano 30B A3B (free) | Gemma 4 26B A4B |
| Kimi K2.6 vs Gemma 4 26B A4B | Gemma 4 26B A4B | Tie |
| 🔥Gemini 2.5 Flash vs Gemma 4 26B A4B | Gemma 4 26B A4B | Gemini 2.5 Flash |
| Gemma 4 26B A4B vs Granite 4.1 8B | Granite 4.1 8B | Gemma 4 26B A4B |
| Gemma 4 26B A4B vs Free Models Router | Free Models Router | Gemma 4 26B A4B |
| 🔥Owl Alpha vs Gemma 4 26B A4B | Owl Alpha | Owl Alpha |
Popular Comparisons
| Comparison | Newest Release |
|---|---|
| 🔥Claude Opus 4.6 vs Gemma 4 26B A4B | |
| 🔥Claude Opus 4.7 vs Gemma 4 26B A4B | |
| 🔥Claude Sonnet 4.6 vs Gemma 4 26B A4B | |
| 🔥DeepSeek V3.2 vs Gemma 4 26B A4B | |
| 🔥DeepSeek V4 Flash vs Gemma 4 26B A4B | |
| 🔥DeepSeek V4 Pro vs Gemma 4 26B A4B | |
| 🔥Gemini 2.5 Flash Lite vs Gemma 4 26B A4B | |
| 🔥Gemini 2.5 Flash vs Gemma 4 26B A4B | |
| 🔥Gemini 3 Flash Preview vs Gemma 4 26B A4B | |
| 🔥Gemini 3.1 Flash Lite vs Gemma 4 26B A4B | |