Gemma 4 26B A4B

Google API model specs with normalized input price, output price, context window, sample workload cost, and related comparisons.

Page updated: 2026-05-26T15:08:08Z Data confirmed: 2026-06-28T01:07:24Z Prices normalized to USD per 1M tokens Sample workload: 1M input + 500K output

Budget brief

Gemma 4 26B A4B is estimated at $0.23 for the standard workload.

Gemma 4 26B A4B is best suited for cost-sensitive production traffic. A 1M input token plus 500K output token workload is estimated at $0.23.

Input price$0.06per 1M tokens

Output price$0.33per 1M tokens

Context window262.14Kreported maximum

Popularity#2837.8B tokens signal volume

Use this as a first-pass planning estimate, then verify provider limits, routing, discounts, and availability before production deployment.

Model Specs

Provider	Google
Model ID	`google/gemma-4-26b-a4b-it`
Prompt Price per 1M tokens	$0.06
Completion Price per 1M tokens	$0.33
Sample Workload Cost 1M input + 500K output	$0.23
Context Window	262.14K
Release Date	2026-04-03
Popularity	#28
Popularity Signal	37.8B tokens

Estimate your workload cost

Estimate this model for your workload

Prices are normalized to USD per 1M tokens.

Monthly input tokens Monthly output tokens

Gemma 4 26B A4B Calculating… Estimated monthly API cost

Unit prices $0.06 input / $0.33 output Per 1M tokens

This estimate uses normalized public API pricing per 1M tokens. It is a planning aid, not a billing quote. Verify provider pricing, limits, and terms before production use.

Alternative path

Alternative Shortlist

Open model finder

Use these rows to build a test shortlist around Gemma 4 26B A4B. Lower cost, same-provider fit, and larger context are separate decisions, so each group is ranked by a different signal.

Lower-cost alternatives

Cross-provider candidates with a lower standard 1M input plus 500K output estimate.

Model	Provider	Sample Cost	Context	Why it is here	Next Step
🔥Owl Alpha	OpenRouter	$0	1.05M	Lower standard workload estimate from another provider.	Open model Compare
New🔥Nemotron 3 Ultra (free)	NVIDIA	$0	1M	Lower standard workload estimate from another provider.	Open model Compare
🔥Laguna M.1 (free)	Poolside	$0	262.14K	Lower standard workload estimate from another provider.	Open model Compare
Nemotron 3 Super (free)	NVIDIA	$0	1M	Lower standard workload estimate from another provider.	Open model Compare

Same-provider swaps

Lower-cost options from the same provider, useful when account setup or procurement is already fixed.

Model	Provider	Sample Cost	Context	Why it is here	Next Step
Gemma 4 31B (free)	Google	$0	262.14K	Same provider with a lower standard workload estimate.	Open model Compare
Gemma 4 26B A4B (free)	Google	$0	262.14K	Same provider with a lower standard workload estimate.	Open model Compare
Lyria 3 Pro Preview	Google	$0	1.05M	Same provider with a lower standard workload estimate.	Open model Compare
Lyria 3 Clip Preview	Google	$0	1.05M	Same provider with a lower standard workload estimate.	Open model Compare

Larger-context upgrades near this budget

Models with more context that stay within a close sample-cost band when price data is available.

Model	Provider	Sample Cost	Context	Why it is here	Next Step
Llama 4 Scout	Meta	$0.25	10M	More context while staying near this model's sample-cost band.	Open model Compare
🔥Owl Alpha	OpenRouter	$0	1.05M	More context while staying near this model's sample-cost band.	Open model Compare
🔥DeepSeek V4 Flash	DeepSeek	$0.18	1.05M	More context while staying near this model's sample-cost band.	Open model Compare
🔥MiMo-V2.5	Xiaomi	$0.24	1.05M	More context while staying near this model's sample-cost band.	Open model Compare

Model Introduction

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...

Best Fit

Gemma 4 26B A4B is best suited for cost-sensitive production traffic.

Cost Example

A 1M input token plus 500K output token workload is estimated at $0.23.

Decision Shortcuts

Compare this model

Search head-to-head pages that include Gemma 4 26B A4B and review input price, output price, context, and sample workload cost.

Find comparisons

Google catalog

See other Google models before narrowing your shortlist.

Open provider hub

Cheaper alternatives

Start from models sorted by a standard cost estimate when budget is the first constraint.

Browse low-cost models

Long-context alternatives

Compare large-context models for retrieval, documents, and codebase review.

Browse long-context models

High-Interest Comparisons

Search this model

Comparison	Cost-first Pick	Context Pick
Gemma 4 26B A4B vs Nemotron 3 Nano 30B A3B (free)	Nemotron 3 Nano 30B A3B (free)	Gemma 4 26B A4B
Kimi K2.6 vs Gemma 4 26B A4B	Gemma 4 26B A4B	Tie
🔥Gemini 2.5 Flash vs Gemma 4 26B A4B	Gemma 4 26B A4B	Gemini 2.5 Flash
Gemma 4 26B A4B vs Granite 4.1 8B	Granite 4.1 8B	Gemma 4 26B A4B
Gemma 4 26B A4B vs Free Models Router	Free Models Router	Gemma 4 26B A4B
🔥Owl Alpha vs Gemma 4 26B A4B	Owl Alpha	Owl Alpha

Popular Comparisons

Search all comparisons

Comparison	Newest Release
🔥Claude Opus 4.6 vs Gemma 4 26B A4B	2026-04-03
🔥Claude Opus 4.7 vs Gemma 4 26B A4B	2026-04-16
🔥Claude Sonnet 4.6 vs Gemma 4 26B A4B	2026-04-03
🔥DeepSeek V3.2 vs Gemma 4 26B A4B	2026-04-03
🔥DeepSeek V4 Flash vs Gemma 4 26B A4B	2026-04-24
🔥DeepSeek V4 Pro vs Gemma 4 26B A4B	2026-04-24
🔥Gemini 2.5 Flash Lite vs Gemma 4 26B A4B	2026-04-03
🔥Gemini 2.5 Flash vs Gemma 4 26B A4B	2026-04-03
🔥Gemini 3 Flash Preview vs Gemma 4 26B A4B	2026-04-03
🔥Gemini 3.1 Flash Lite vs Gemma 4 26B A4B	2026-05-07