Local LLM hardware tool
Enter your CPU, GPU, graphics card memory (VRAM), system RAM, and target workflow. OpenSourcesAI will suggest good, better, and best local model profiles to try, plus a practical app and runtime path for testing them on your machine.
VRAM
VRAM is the dedicated memory on your GPU. Common values are 4 GB, 8 GB, 10 GB, 12 GB, 16 GB, or 24 GB.
Quantized model
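A 4-bit quantized model stores each weight in about 4 bits instead of the usual 16, so it needs roughly a quarter of the memory for its weights, usually with a small quality tradeoff. That makes it easier to run on a normal PC.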
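As a rough illustration of the memory saving, the sketch below estimates weight memory as parameter count times bits per weight. The function names, the 7-billion-parameter example, and the 1.5 GB headroom figure are assumptions for illustration only. Real usage is higher because of the context cache and runtime overhead, and real quantized formats average slightly more than 4 bits per weight.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights only, in GB (1 GB = 1e9 bytes)."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


def fits_in_vram(model_gb: float, vram_gb: float, headroom_gb: float = 1.5) -> bool:
    """Check whether the weights fit, leaving some room for everything else.

    headroom_gb is a hypothetical allowance, not a measured value.
    """
    return model_gb + headroom_gb <= vram_gb


# Hypothetical 7-billion-parameter model at 16-bit and 4-bit precision.
fp16_gb = weight_memory_gb(7, 16)  # about 14 GB of weights
q4_gb = weight_memory_gb(7, 4)     # about 3.5 GB of weights

print(f"16-bit: {fp16_gb:.1f} GB, fits in 8 GB VRAM: {fits_in_vram(fp16_gb, 8)}")
print(f"4-bit:  {q4_gb:.1f} GB, fits in 8 GB VRAM: {fits_in_vram(q4_gb, 8)}")
```

Under these assumptions, the 16-bit version would not fit on an 8 GB card, while the 4-bit version would.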
Runtime
Ollama and LM Studio are beginner-friendly apps for testing local models. llama.cpp is a lower-level inference engine that many local model tools build on.
RAG
RAG (retrieval-augmented generation) means the app first looks up relevant passages in your documents or notes, then the model answers using those sources.
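The retrieve-then-answer flow can be sketched in a few lines. This is a toy version under stated assumptions: it ranks notes by simple word overlap, whereas real RAG setups typically use embedding-based search, and it stops at building the prompt rather than calling a model. The function names and sample notes are made up for illustration.

```python
import re


def tokenize(text: str) -> set[str]:
    """Lowercase the text and return its set of words and numbers."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(question: str, notes: list[str], top_k: int = 1) -> list[str]:
    """Return the top_k notes sharing the most words with the question."""
    question_words = tokenize(question)
    ranked = sorted(
        notes,
        key=lambda note: len(question_words & tokenize(note)),
        reverse=True,
    )
    return ranked[:top_k]


def build_prompt(question: str, sources: list[str]) -> str:
    """Put the retrieved sources ahead of the question for the model."""
    context = "\n".join(f"- {source}" for source in sources)
    return f"Answer using only these sources:\n{context}\n\nQuestion: {question}"


# Hypothetical notes standing in for your own documents.
notes = [
    "The router admin password is stored in the blue notebook.",
    "Backups run every Sunday at 02:00 to the external drive.",
]
question = "When do backups run?"
sources = retrieve(question, notes)
prompt = build_prompt(question, sources)
print(prompt)
```

The resulting prompt is what would be sent to a local model, so the answer is grounded in the retrieved note rather than in the model's memory alone.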
Open-weight
Open-weight means the model's weights are publicly available to download, but the license may still restrict how you can use them.