Model · DeepSeek-R1 · Reviewed June 2026

standardMIT8B paramsOpen weights

DeepSeek-R1 Distill 8B

DeepSeek · DeepSeek-R1

8B parameter open-weight model. Llama-3 distillation of R1 reasoning. Competitive reasoning at 8B scale. Thinking chains visible in output. Q4 fits in 8 GB VRAM.

Editorial review

Reviewed byOpenSourcesAI EditorialLast updatedJune 2026SourcesHuggingFace model card (deepseek-ai/DeepSeek-R1-Distill-Llama-8B), official docs, OpenSourcesAI editorial review.

VRAM figures are empirical estimates. Actual usage varies by runtime, context length, and system configuration. Verify on your specific hardware before production use.