Model family

Qwen Models

Qwen models are strong choices for multilingual chat, coding, math, vision-language, and local developer workflows.

Alibaba QwenUpdated 2026ChatCodeReasoningVisionEmbeddingAgents

Best for

Chat

Use this family hub to compare Qwen variants for chat workflows, then open the detail page for deeper deployment notes.

Code

Use this family hub to compare Qwen variants for code workflows, then open the detail page for deeper deployment notes.

Reasoning

Use this family hub to compare Qwen variants for reasoning workflows, then open the detail page for deeper deployment notes.

Vision

Use this family hub to compare Qwen variants for vision workflows, then open the detail page for deeper deployment notes.

Variants

Qwen models grouped by workflow

Latest / flagship

ChatFrontier 2026reasoningcoding

Qwen3 235B A22B

Alibaba Qwen · Qwen

Best for: Builders testing frontier-style open-weight reasoning and coding in hosted or multi-GPU environments.

Details →

ChatFrontier 2026chatcoding

Qwen3.5

Alibaba Qwen · Qwen

Best for: Teams comparing current Qwen releases for chat, coding, multilingual, and agent workflows.

Details →

ChatFrontier 2026chatcoding

Qwen3.6

Alibaba Qwen · Qwen

Best for: Builders watching newer Qwen releases for assistant, coding, multilingual, and agent workflows.

Details →

ChatFrontier 2026chatmultilingual

Qwen3 235B A22B Thinking

Alibaba Qwen · Qwen

Best for: Multilingual chat, assistant workflows, and Qwen-family comparisons.

Local: Use the exact checkpoint and quantization that matches your hardware and latency target.

Details →

ChatFrontier 2026chatmultilingual

Qwen3 30B A3B

Alibaba Qwen · Qwen

Best for: Multilingual chat, assistant workflows, and Qwen-family comparisons.

Local: Use the exact checkpoint and quantization that matches your hardware and latency target.

Details →

ChatFrontier 2026chatmultilingual

Qwen3 32B

Alibaba Qwen · Qwen

Best for: Multilingual chat, assistant workflows, and Qwen-family comparisons.

Local: Use the exact checkpoint and quantization that matches your hardware and latency target.

Details →

ChatFrontier 2026chatmultilingual

Qwen3 14B

Alibaba Qwen · Qwen

Best for: Multilingual chat, assistant workflows, and Qwen-family comparisons.

Local: Use the exact checkpoint and quantization that matches your hardware and latency target.

Details →

ChatFrontier 2026chatmultilingual

Qwen3 8B

Alibaba Qwen · Qwen

Best for: Multilingual chat, assistant workflows, and Qwen-family comparisons.

Local: Use the exact checkpoint and quantization that matches your hardware and latency target.

Details →

CodeOpen weights where releasedcodingdeveloper

Qwen2.5 Coder 14B

Alibaba Qwen · Qwen

Best for: Coding assistants, repository help, and developer workflow evaluation.

Local: Use the exact checkpoint and quantization that matches your hardware and latency target.

Details →

Coding

CodeCodingcodingagents

Qwen3 Coder

Alibaba Qwen · Qwen

Best for: Developers comparing open coding models for Continue, Aider, Cline, and Roo Code workflows.

Local: Smaller or quantized coder variants can be tested locally for IDE and coding-agent workflows.

Details →

CodeOpen weights where releasedcodingdeveloper

Qwen2.5 Coder 32B

Alibaba Qwen · Qwen

Best for: Coding assistants, repository help, and developer workflow evaluation.

Local: Use the exact checkpoint and quantization that matches your hardware and latency target.

Details →

CodeOpen weights where releasedcodingdeveloper

Qwen2.5 Coder 7B

Alibaba Qwen · Qwen

Best for: Coding assistants, repository help, and developer workflow evaluation.

Local: Use the exact checkpoint and quantization that matches your hardware and latency target.

Details →

Reasoning

ChatOpen weights where releasedchatmultilingual

Qwen2.5 72B Instruct

Alibaba Qwen · Qwen

Best for: Multilingual chat, assistant workflows, and Qwen-family comparisons.

Local: Use the exact checkpoint and quantization that matches your hardware and latency target.

Details →

ReasoningOpen weights where releasedmathreasoning

Qwen2.5 Math 72B

Alibaba Qwen · Qwen

Best for: Math-heavy prompts, reasoning tests, and educational workflow evaluation.

Local: Use the exact checkpoint and quantization that matches your hardware and latency target.

Details →

Vision / multimodal

VisionMultimodalvisionmultimodal

Qwen3 VL

Alibaba Qwen · Qwen

Best for: Builders adding visual understanding to open AI workflows.

Local: Can be tested locally when compatible checkpoints and runtimes are available; multimodal serving is more demanding than text-only models.

Details →

VisionOpen weights where releasedvisionmultimodal

Qwen2.5 VL

Alibaba Qwen · Qwen

Best for: Vision-language assistants, document images, and multimodal agent workflows.

Local: Use the exact checkpoint and quantization that matches your hardware and latency target.

Details →

VisionOpen weights where releasedvisionmultimodal

Qwen2 VL

Alibaba Qwen · Qwen

Best for: Vision-language baseline comparisons and multimodal prototypes.

Local: Use the exact checkpoint and quantization that matches your hardware and latency target.

Details →

Embedding and reranking

EmbeddingRAGembeddingrag

Qwen3 Embedding

Alibaba Qwen · Qwen

Best for: Builders who want a newer embedding family to compare against E5, BGE, and Jina.

Local: Smaller embedding variants are practical for local RAG and retrieval experiments.

Details →

Compare

All Qwen models in the directory

Model	Type	Best for	Local runner notes	License	Detail
Qwen3 235B A22B	Chat	Builders testing frontier-style open-weight reasoning and coding in hosted or multi-GPU environments.	Usually a server or multi-GPU model; use quantized builds or hosted inference for practical testing.	Apache 2.0	Open
Qwen3 VL	Vision	Builders adding visual understanding to open AI workflows.	Can be tested locally when compatible checkpoints and runtimes are available; multimodal serving is more demanding than text-only models.	Check exact model card	Open
Qwen3 Coder	Code	Developers comparing open coding models for Continue, Aider, Cline, and Roo Code workflows.	Smaller or quantized coder variants can be tested locally for IDE and coding-agent workflows.	Check exact model card	Open
Qwen3 Embedding	Embedding	Builders who want a newer embedding family to compare against E5, BGE, and Jina.	Smaller embedding variants are practical for local RAG and retrieval experiments.	Check exact model card	Open
Qwen3.5	Chat	Teams comparing current Qwen releases for chat, coding, multilingual, and agent workflows.	Local fit depends on the exact checkpoint size and quantized builds.	Check exact model card	Open
Qwen3.6	Chat	Builders watching newer Qwen releases for assistant, coding, multilingual, and agent workflows.	Local fit depends on the exact checkpoint size, serving stack, and quantized builds.	Check exact model card	Open
Qwen3 235B A22B Thinking	Chat	Multilingual chat, assistant workflows, and Qwen-family comparisons.	Use the exact checkpoint and quantization that matches your hardware and latency target.	Check exact model card	Open
Qwen3 30B A3B	Chat	Multilingual chat, assistant workflows, and Qwen-family comparisons.	Use the exact checkpoint and quantization that matches your hardware and latency target.	Check exact model card	Open
Qwen3 32B	Chat	Multilingual chat, assistant workflows, and Qwen-family comparisons.	Use the exact checkpoint and quantization that matches your hardware and latency target.	Check exact model card	Open
Qwen3 14B	Chat	Multilingual chat, assistant workflows, and Qwen-family comparisons.	Use the exact checkpoint and quantization that matches your hardware and latency target.	Check exact model card	Open
Qwen3 8B	Chat	Multilingual chat, assistant workflows, and Qwen-family comparisons.	Use the exact checkpoint and quantization that matches your hardware and latency target.	Check exact model card	Open
Qwen2.5 72B Instruct	Chat	Multilingual chat, assistant workflows, and Qwen-family comparisons.	Use the exact checkpoint and quantization that matches your hardware and latency target.	Check exact model card	Open
Qwen2.5 Coder 32B	Code	Coding assistants, repository help, and developer workflow evaluation.	Use the exact checkpoint and quantization that matches your hardware and latency target.	Check exact model card	Open
Qwen2.5 Coder 14B	Code	Coding assistants, repository help, and developer workflow evaluation.	Use the exact checkpoint and quantization that matches your hardware and latency target.	Check exact model card	Open
Qwen2.5 Coder 7B	Code	Coding assistants, repository help, and developer workflow evaluation.	Use the exact checkpoint and quantization that matches your hardware and latency target.	Check exact model card	Open
Qwen2.5 Math 72B	Reasoning	Math-heavy prompts, reasoning tests, and educational workflow evaluation.	Use the exact checkpoint and quantization that matches your hardware and latency target.	Check exact model card	Open
Qwen2.5 VL	Vision	Vision-language assistants, document images, and multimodal agent workflows.	Use the exact checkpoint and quantization that matches your hardware and latency target.	Check exact model card	Open
Qwen2 VL	Vision	Vision-language baseline comparisons and multimodal prototypes.	Use the exact checkpoint and quantization that matches your hardware and latency target.	Check exact model card	Open

Qwen Models

Best for

Chat

Code

Reasoning

Vision

Qwen models grouped by workflow

Latest / flagship

Qwen3 235B A22B

Qwen3.5

Qwen3.6

Qwen3 235B A22B Thinking

Qwen3 30B A3B

Qwen3 32B

Qwen3 14B

Qwen3 8B

Qwen2.5 Coder 14B

Coding

Qwen3 Coder

Qwen2.5 Coder 32B

Qwen2.5 Coder 7B

Reasoning

Qwen2.5 72B Instruct

Qwen2.5 Math 72B

Vision / multimodal

Qwen3 VL

Qwen2.5 VL

Qwen2 VL

Embedding and reranking

Qwen3 Embedding

All Qwen models in the directory

Related tools, stacks, and guides