Guide

How to Choose a Model for Coding, RAG, Summarization, and Agents

The best model is the one that performs reliably on your task, budget, latency target, and license constraints.

Who this is for

Developers comparing open models for practical apps.

Use real repo tasks and measure patch quality, not just code benchmark claims.

Separate embedding, retrieval, reranking, and answer generation choices. A better retriever can beat a bigger generator.

Prioritize tool-call reliability, context handling, and recovery from mistakes.

Leaderboard performance does not guarantee performance on your prompts, users, or documents.

Use them as a shortlist signal, then run your own evaluation on real tasks.

Use the model and tool directories to choose the concrete pieces for your local AI stack. Sponsor and affiliate placements will be added later.