DeepSeek-R1 Distill 8B

DeepSeek · DeepSeek-R1

8B parameter open-weight model. Llama-3 distillation of R1 reasoning. Competitive reasoning at 8B scale. Thinking chains visible in output. Q4 fits in 8 GB VRAM.

Editorial review

Reviewed byOpenSourcesAI EditorialLast updatedJune 2026SourcesHuggingFace model card (deepseek-ai/DeepSeek-R1-Distill-Llama-8B), official docs, OpenSourcesAI editorial review.

VRAM figures are empirical estimates. Actual usage varies by runtime, context length, and system configuration. Verify on your specific hardware before production use.

Ready to run this model locally?

Find a compatible interface in our Local AI Tools directory →

DeepSeek-R1 Distill 8B

Editorial review

Related models