Model · Gemma 4 · Reviewed June 2026
standardGemma Terms of Use12B paramsOpen weights
Gemma 4 12B
Google · Gemma 4
12B parameter open-weight model. Google's flagship 12B open-weight model with 128K context. Q4 fits in 8–10 GB VRAM; strong default for MacBooks with 16 GB unified memory where KV-cache growth at long context is manageable. Benchmark against Qwen3 14B on your prompts — Gemma 4 leads on certain reasoning tasks.
Editorial review
Reviewed byOpenSourcesAI EditorialLast updatedJune 2026SourcesHuggingFace model card (google/gemma-4-12b-it), official docs, OpenSourcesAI editorial review.
VRAM figures are empirical estimates. Actual usage varies by runtime, context length, and system configuration. Verify on your specific hardware before production use.
Ready to run this model locally?
Find a compatible interface in our Local AI Tools directory →