Model family
Phi Models
Microsoft Phi models focus on small language models, edge deployment, reasoning efficiency, and low-resource local experiments.
Best for
Edge
Use this family hub to compare Phi variants for edge workflows, then open the detail page for deeper deployment notes.
Small
Use this family hub to compare Phi variants for small workflows, then open the detail page for deeper deployment notes.
Vision
Use this family hub to compare Phi variants for vision workflows, then open the detail page for deeper deployment notes.
Local
Use this family hub to compare Phi variants for local workflows, then open the detail page for deeper deployment notes.
Variants
Phi models grouped by workflow
Latest / flagship
Phi-4 Mini
Microsoft · Phi
Best for: Builders testing small local models on laptops, CPUs, and constrained hardware.
Local: A small-model option for local experiments on laptops, CPUs, and low-VRAM machines.
Phi-4
Microsoft · Phi
Best for: Small language model reasoning and assistant workflows
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Phi-4 Multimodal
Microsoft · Phi
Best for: Small multimodal and edge experiments
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Phi-3.5 MoE
Microsoft · Phi
Best for: Efficient reasoning model comparisons
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Phi-3.5 Vision
Microsoft · Phi
Best for: Small vision-language experiments
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Local-friendly
Phi-3 Medium
Microsoft · Phi
Best for: Mid-size local assistant workflows
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Phi-3 Mini
Microsoft · Phi
Best for: Low-resource local assistant workflows
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Phi-3 Small
Microsoft · Phi
Best for: Small local assistant workflows
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Phi-2
Microsoft · Phi
Best for: Legacy small model baseline
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Compare
All Phi models in the directory
| Model | Type | Best for | Local runner notes | License | Detail |
|---|---|---|---|---|---|
| Phi-4 Mini | Edge | Builders testing small local models on laptops, CPUs, and constrained hardware. | A small-model option for local experiments on laptops, CPUs, and low-VRAM machines. | MIT | Open |
| Phi-4 | Reasoning | Small language model reasoning and assistant workflows | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Phi-4 Multimodal | Multimodal | Small multimodal and edge experiments | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Phi-3.5 MoE | Reasoning | Efficient reasoning model comparisons | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Phi-3.5 Vision | Vision | Small vision-language experiments | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Phi-3 Medium | Chat | Mid-size local assistant workflows | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Phi-3 Mini | Edge | Low-resource local assistant workflows | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Phi-3 Small | Edge | Small local assistant workflows | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Phi-2 | Edge | Legacy small model baseline | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |