Model family
Qwen Models
Qwen models are strong choices for multilingual chat, coding, math, vision-language, and local developer workflows.
Best for
Chat
Use this family hub to compare Qwen variants for chat workflows, then open the detail page for deeper deployment notes.
Code
Use this family hub to compare Qwen variants for code workflows, then open the detail page for deeper deployment notes.
Reasoning
Use this family hub to compare Qwen variants for reasoning workflows, then open the detail page for deeper deployment notes.
Vision
Use this family hub to compare Qwen variants for vision workflows, then open the detail page for deeper deployment notes.
Variants
Qwen models grouped by workflow
Latest / flagship
Qwen3 235B A22B
Alibaba Qwen · Qwen
Best for: Builders testing frontier-style open-weight reasoning and coding in hosted or multi-GPU environments.
Qwen3.5
Alibaba Qwen · Qwen
Best for: Teams comparing current Qwen releases for chat, coding, multilingual, and agent workflows.
Qwen3.6
Alibaba Qwen · Qwen
Best for: Builders watching newer Qwen releases for assistant, coding, multilingual, and agent workflows.
Qwen3 235B A22B Thinking
Alibaba Qwen · Qwen
Best for: Multilingual chat, assistant workflows, and Qwen-family comparisons.
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Qwen3 30B A3B
Alibaba Qwen · Qwen
Best for: Multilingual chat, assistant workflows, and Qwen-family comparisons.
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Qwen3 32B
Alibaba Qwen · Qwen
Best for: Multilingual chat, assistant workflows, and Qwen-family comparisons.
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Qwen3 14B
Alibaba Qwen · Qwen
Best for: Multilingual chat, assistant workflows, and Qwen-family comparisons.
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Qwen3 8B
Alibaba Qwen · Qwen
Best for: Multilingual chat, assistant workflows, and Qwen-family comparisons.
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Qwen2.5 Coder 14B
Alibaba Qwen · Qwen
Best for: Coding assistants, repository help, and developer workflow evaluation.
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Coding
Qwen3 Coder
Alibaba Qwen · Qwen
Best for: Developers comparing open coding models for Continue, Aider, Cline, and Roo Code workflows.
Local: Smaller or quantized coder variants can be tested locally for IDE and coding-agent workflows.
Qwen2.5 Coder 32B
Alibaba Qwen · Qwen
Best for: Coding assistants, repository help, and developer workflow evaluation.
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Qwen2.5 Coder 7B
Alibaba Qwen · Qwen
Best for: Coding assistants, repository help, and developer workflow evaluation.
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Reasoning
Qwen2.5 72B Instruct
Alibaba Qwen · Qwen
Best for: Multilingual chat, assistant workflows, and Qwen-family comparisons.
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Qwen2.5 Math 72B
Alibaba Qwen · Qwen
Best for: Math-heavy prompts, reasoning tests, and educational workflow evaluation.
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Vision / multimodal
Qwen3 VL
Alibaba Qwen · Qwen
Best for: Builders adding visual understanding to open AI workflows.
Local: Can be tested locally when compatible checkpoints and runtimes are available; multimodal serving is more demanding than text-only models.
Qwen2.5 VL
Alibaba Qwen · Qwen
Best for: Vision-language assistants, document images, and multimodal agent workflows.
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Qwen2 VL
Alibaba Qwen · Qwen
Best for: Vision-language baseline comparisons and multimodal prototypes.
Local: Use the exact checkpoint and quantization that matches your hardware and latency target.
Embedding and reranking
Compare
All Qwen models in the directory
| Model | Type | Best for | Local runner notes | License | Detail |
|---|---|---|---|---|---|
| Qwen3 235B A22B | Chat | Builders testing frontier-style open-weight reasoning and coding in hosted or multi-GPU environments. | Usually a server or multi-GPU model; use quantized builds or hosted inference for practical testing. | Apache 2.0 | Open |
| Qwen3 VL | Vision | Builders adding visual understanding to open AI workflows. | Can be tested locally when compatible checkpoints and runtimes are available; multimodal serving is more demanding than text-only models. | Check exact model card | Open |
| Qwen3 Coder | Code | Developers comparing open coding models for Continue, Aider, Cline, and Roo Code workflows. | Smaller or quantized coder variants can be tested locally for IDE and coding-agent workflows. | Check exact model card | Open |
| Qwen3 Embedding | Embedding | Builders who want a newer embedding family to compare against E5, BGE, and Jina. | Smaller embedding variants are practical for local RAG and retrieval experiments. | Check exact model card | Open |
| Qwen3.5 | Chat | Teams comparing current Qwen releases for chat, coding, multilingual, and agent workflows. | Local fit depends on the exact checkpoint size and quantized builds. | Check exact model card | Open |
| Qwen3.6 | Chat | Builders watching newer Qwen releases for assistant, coding, multilingual, and agent workflows. | Local fit depends on the exact checkpoint size, serving stack, and quantized builds. | Check exact model card | Open |
| Qwen3 235B A22B Thinking | Chat | Multilingual chat, assistant workflows, and Qwen-family comparisons. | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Qwen3 30B A3B | Chat | Multilingual chat, assistant workflows, and Qwen-family comparisons. | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Qwen3 32B | Chat | Multilingual chat, assistant workflows, and Qwen-family comparisons. | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Qwen3 14B | Chat | Multilingual chat, assistant workflows, and Qwen-family comparisons. | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Qwen3 8B | Chat | Multilingual chat, assistant workflows, and Qwen-family comparisons. | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Qwen2.5 72B Instruct | Chat | Multilingual chat, assistant workflows, and Qwen-family comparisons. | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Qwen2.5 Coder 32B | Code | Coding assistants, repository help, and developer workflow evaluation. | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Qwen2.5 Coder 14B | Code | Coding assistants, repository help, and developer workflow evaluation. | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Qwen2.5 Coder 7B | Code | Coding assistants, repository help, and developer workflow evaluation. | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Qwen2.5 Math 72B | Reasoning | Math-heavy prompts, reasoning tests, and educational workflow evaluation. | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Qwen2.5 VL | Vision | Vision-language assistants, document images, and multimodal agent workflows. | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |
| Qwen2 VL | Vision | Vision-language baseline comparisons and multimodal prototypes. | Use the exact checkpoint and quantization that matches your hardware and latency target. | Check exact model card | Open |