Model family

Phi Models

Microsoft Phi models focus on small language models, edge deployment, reasoning efficiency, and low-resource local experiments.

MicrosoftUpdated 2026EdgeSmallVisionLocal

Best for

Edge

Use this family hub to compare Phi variants for edge workflows, then open the detail page for deeper deployment notes.

Small

Use this family hub to compare Phi variants for small workflows, then open the detail page for deeper deployment notes.

Vision

Use this family hub to compare Phi variants for vision workflows, then open the detail page for deeper deployment notes.

Local

Use this family hub to compare Phi variants for local workflows, then open the detail page for deeper deployment notes.

Variants

Phi models grouped by workflow

Latest / flagship

Local-friendly

Compare

All Phi models in the directory

ModelTypeBest forLocal runner notesLicenseDetail
Phi-4 MiniEdgeBuilders testing small local models on laptops, CPUs, and constrained hardware.A small-model option for local experiments on laptops, CPUs, and low-VRAM machines.MITOpen
Phi-4ReasoningSmall language model reasoning and assistant workflowsUse the exact checkpoint and quantization that matches your hardware and latency target.Check exact model cardOpen
Phi-4 MultimodalMultimodalSmall multimodal and edge experimentsUse the exact checkpoint and quantization that matches your hardware and latency target.Check exact model cardOpen
Phi-3.5 MoEReasoningEfficient reasoning model comparisonsUse the exact checkpoint and quantization that matches your hardware and latency target.Check exact model cardOpen
Phi-3.5 VisionVisionSmall vision-language experimentsUse the exact checkpoint and quantization that matches your hardware and latency target.Check exact model cardOpen
Phi-3 MediumChatMid-size local assistant workflowsUse the exact checkpoint and quantization that matches your hardware and latency target.Check exact model cardOpen
Phi-3 MiniEdgeLow-resource local assistant workflowsUse the exact checkpoint and quantization that matches your hardware and latency target.Check exact model cardOpen
Phi-3 SmallEdgeSmall local assistant workflowsUse the exact checkpoint and quantization that matches your hardware and latency target.Check exact model cardOpen
Phi-2EdgeLegacy small model baselineUse the exact checkpoint and quantization that matches your hardware and latency target.Check exact model cardOpen