Local LLM hardware tool

Find local AI models that should realistically fit your PC.

Enter your CPU, GPU, graphics-card memory (VRAM), system RAM, and target workflow. OpenSourcesAI will suggest good, better, and best local model profiles to try, plus a practical app and runtime path for testing them on your machine.

This checker uses rule-of-thumb estimates, not live performance tests. Treat the result as a starting point, then confirm it by testing a quantized (compressed) model locally. Lower-memory PCs may show only one realistic local match.
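As a rough illustration of what a rules-based check could look like, the sketch below filters a list of model profiles by whether their estimated memory fits within the available VRAM, then labels the largest fits as good, better, and best. The profile names, memory figures, and the 1.5 GB headroom are all hypothetical placeholders, not the rules this checker actually uses.

```python
from dataclasses import dataclass


@dataclass
class ModelProfile:
    name: str             # placeholder label, not a real model
    est_memory_gb: float  # assumed memory footprint of the quantized model


def pick_tiers(vram_gb: float, profiles: list[ModelProfile],
               headroom_gb: float = 1.5) -> dict[str, str]:
    """Return up to three profiles that fit, labelled good/better/best.

    headroom_gb is an assumed reserve for the context cache and the
    desktop itself; it is an illustrative value, not a measured one.
    """
    budget = vram_gb - headroom_gb
    fitting = sorted(
        (p for p in profiles if p.est_memory_gb <= budget),
        key=lambda p: p.est_memory_gb,
    )
    top = fitting[-3:]  # the largest profiles that still fit
    labels = ["good", "better", "best"][: len(top)]
    return {label: p.name for label, p in zip(labels, top)}


# Hypothetical profiles for demonstration only.
PROFILES = [
    ModelProfile("small-3b", 2.5),
    ModelProfile("medium-8b", 5.5),
    ModelProfile("large-14b", 9.5),
    ModelProfile("xl-32b", 20.0),
]

if __name__ == "__main__":
    for vram in (4, 8, 12):
        print(f"{vram} GB VRAM -> {pick_tiers(vram, PROFILES)}")
```

With these placeholder numbers, a 4 GB card gets a single match while a 12 GB card gets all three tiers, which mirrors why lower-memory PCs may see only one suggestion.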


Quick definitions

VRAM

Graphics-card memory

VRAM is the dedicated memory on your GPU. Common values are 4 GB, 8 GB, 10 GB, 12 GB, 16 GB, or 24 GB.

Quantized model

A compressed local model

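A quantized model stores its weights at lower precision, for example 4 bits per weight instead of the 16 bits models are typically released with. It uses less memory, usually with a small quality tradeoff, so it is easier to run on a typical PC.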
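The memory saving can be estimated with simple arithmetic: parameter count times bits per weight, divided by 8 to get bytes. The sketch below is a back-of-the-envelope estimate for the weights alone; the function name is illustrative, and real quantized files tend to be somewhat larger because they also store scaling data and need extra memory for the context cache.

```python
def estimate_weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


if __name__ == "__main__":
    # Compare one model size at three common precisions.
    for bits in (16, 8, 4):
        gb = estimate_weight_memory_gb(7, bits)
        print(f"7B parameters at {bits}-bit: ~{gb:.1f} GB")
```

By this estimate, a 7-billion-parameter model drops from about 14 GB at 16-bit to about 3.5 GB at 4-bit, which is the difference between needing a high-end GPU and fitting on an 8 GB card.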

Runtime

The app that runs the model

Ollama and LM Studio are beginner-friendly apps for testing local models. llama.cpp is a lower-level inference engine used by many local model tools.

RAG

Answers from your files

RAG (retrieval-augmented generation) means the app first retrieves relevant passages from your documents or notes, then the model answers using those passages as sources.
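The retrieve-then-answer flow can be sketched in a few lines. This toy version ranks documents by how many words they share with the question and pastes the best matches into the prompt; real RAG tools usually use embedding-based search instead of word overlap, and every function name and file name here is hypothetical.

```python
import re


def tokenize(text: str) -> set[str]:
    """Lowercase the text and split it into a set of words and numbers."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(question: str, documents: dict[str, str], top_k: int = 2) -> list[str]:
    """Return names of the documents sharing the most words with the question."""
    q_words = tokenize(question)
    scored = [(len(q_words & tokenize(body)), name) for name, body in documents.items()]
    scored = [(score, name) for score, name in scored if score > 0]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))  # best score first
    return [name for _, name in scored[:top_k]]


def build_prompt(question: str, documents: dict[str, str], top_k: int = 2) -> str:
    """Build the text a local model would receive: sources first, then the question."""
    names = retrieve(question, documents, top_k)
    context = "\n\n".join(f"[{name}]\n{documents[name]}" for name in names)
    return f"Answer using only the sources below.\n\n{context}\n\nQuestion: {question}"


if __name__ == "__main__":
    docs = {
        "gpu.txt": "The desktop has a GPU with 12 GB of VRAM.",
        "notes.txt": "Grocery list: eggs, milk, bread.",
    }
    print(build_prompt("How much VRAM does the GPU have?", docs))
```

The resulting prompt would then be sent to whichever runtime is running the model; that step is omitted here because it depends on the app you choose.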

Open-weight

Public model weights

Open-weight means the model's weights are publicly downloadable, but the license may still restrict how you can use them.