Model · Gemma 3 · Reviewed June 2026
compactGemma Terms of Use4B paramsOpen weights
Gemma 3 4B
Google · Gemma 3
4B parameter open-weight model. 128K context at 4B scale is a standout feature. Q4 fits in 4 GB VRAM. Useful for RAG and summarisation on constrained hardware.
Editorial review
Reviewed byOpenSourcesAI EditorialLast updatedJune 2026SourcesHuggingFace model card (google/gemma-3-4b-it), official docs, OpenSourcesAI editorial review.
VRAM figures are empirical estimates. Actual usage varies by runtime, context length, and system configuration. Verify on your specific hardware before production use.
Ready to run this model locally?
Find a compatible interface in our Local AI Tools directory →