Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts models built upon extensive training. Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support. Qwen3-8B is an 8.2 billion parameter causal language model that uniquely supports seamless switching between thinking mode for complex logical reasoning, math, and coding, and non-thinking mode for efficient general-purpose dialogue within a single model. The model offers significantly enhanced reasoning capabilities that surpass previous QwQ and Qwen2.5 instruct models on mathematics, code generation, and commonsense logical reasoning. It features superior human preference alignment for creative writing, role-playing, multi-turn dialogues, and instruction following, along with expertise in agent capabilities for precise integration with external tools. The model supports over 100 languages and dialects with strong multilingual instruction following and translation capabilities. It natively supports a context length of 32,768 tokens and can handle up to 131,072 tokens with YaRN scaling techniques.
Alibaba
available local models on Mirai:
available local models on Mirai:
Name
Quantisation
Size
Qwen2.5-Coder-0.5B-Instruct
uint4
0.5B
Quant.
uint4
Size
0.5B
Qwen2.5-Coder-1.5B-Instruct
uint4
1.5B
Quant.
uint4
Size
1.5B
Qwen2.5-Coder-14B-Instruct
uint4
14B
Quant.
uint4
Size
14B
Qwen2.5-Coder-32B-Instruct
uint4
32B
Quant.
uint4
Size
32B
Qwen2.5-Coder-3B-Instruct
uint4
3B
Quant.
uint4
Size
3B
Qwen2.5-Coder-7B-Instruct
uint4
7B
Quant.
uint4
Size
7B
Qwen3-0.6B
uint4
0.6B
Quant.
uint4
Size
0.6B
Qwen3-0.6B-MLX-4bit
uint4
0.6B
Quant.
uint4
Size
0.6B
Qwen3-0.6B-MLX-8bit
uint4
0.6B
Quant.
uint4
Size
0.6B
Qwen3-1.7B
uint4
1.7B
Quant.
uint4
Size
1.7B
Qwen3-1.7B-MLX-4bit
uint4
1.7B
Quant.
uint4
Size
1.7B
Qwen3-1.7B-MLX-8bit
uint4
1.7B
Quant.
uint4
Size
1.7B
Qwen3-14B
uint4
14B
Quant.
uint4
Size
14B
Qwen3-14B-AWQ
uint4
14B
Quant.
uint4
Size
14B
Qwen3-14B-MLX-4bit
uint4
14B
Quant.
uint4
Size
14B
Qwen3-14B-MLX-8bit
uint4
14B
Quant.
uint4
Size
14B
Qwen3-32B
uint4
32B
Quant.
uint4
Size
32B
Qwen3-32B-AWQ
uint4
32B
Quant.
uint4
Size
32B
Qwen3-32B-MLX-4bit
uint4
32B
Quant.
uint4
Size
32B
Qwen3-4B
uint4
4B
Quant.
uint4
Size
4B
Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts models built upon extensive training. Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support. Qwen3-8B is an 8.2 billion parameter causal language model that uniquely supports seamless switching between thinking mode for complex logical reasoning, math, and coding, and non-thinking mode for efficient general-purpose dialogue within a single model. The model offers significantly enhanced reasoning capabilities that surpass previous QwQ and Qwen2.5 instruct models on mathematics, code generation, and commonsense logical reasoning. It features superior human preference alignment for creative writing, role-playing, multi-turn dialogues, and instruction following, along with expertise in agent capabilities for precise integration with external tools. The model supports over 100 languages and dialects with strong multilingual instruction following and translation capabilities. It natively supports a context length of 32,768 tokens and can handle up to 131,072 tokens with YaRN scaling techniques.
Alibaba
available local models on Mirai:
Name
Quantisation
Size
Qwen2.5-Coder-0.5B-Instruct
uint4
0.5B
Quant.
uint4
Size
0.5B
Qwen2.5-Coder-1.5B-Instruct
uint4
1.5B
Quant.
uint4
Size
1.5B
Qwen2.5-Coder-14B-Instruct
uint4
14B
Quant.
uint4
Size
14B
Qwen2.5-Coder-32B-Instruct
uint4
32B
Quant.
uint4
Size
32B
Qwen2.5-Coder-3B-Instruct
uint4
3B
Quant.
uint4
Size
3B
Qwen2.5-Coder-7B-Instruct
uint4
7B
Quant.
uint4
Size
7B
Qwen3-0.6B
uint4
0.6B
Quant.
uint4
Size
0.6B
Qwen3-0.6B-MLX-4bit
uint4
0.6B
Quant.
uint4
Size
0.6B
Qwen3-0.6B-MLX-8bit
uint4
0.6B
Quant.
uint4
Size
0.6B
Qwen3-1.7B
uint4
1.7B
Quant.
uint4
Size
1.7B
Qwen3-1.7B-MLX-4bit
uint4
1.7B
Quant.
uint4
Size
1.7B
Qwen3-1.7B-MLX-8bit
uint4
1.7B
Quant.
uint4
Size
1.7B
Qwen3-14B
uint4
14B
Quant.
uint4
Size
14B
Qwen3-14B-AWQ
uint4
14B
Quant.
uint4
Size
14B
Qwen3-14B-MLX-4bit
uint4
14B
Quant.
uint4
Size
14B
Qwen3-14B-MLX-8bit
uint4
14B
Quant.
uint4
Size
14B
Qwen3-32B
uint4
32B
Quant.
uint4
Size
32B
Qwen3-32B-AWQ
uint4
32B
Quant.
uint4
Size
32B
Qwen3-32B-MLX-4bit
uint4
32B
Quant.
uint4
Size
32B
Qwen3-4B
uint4
4B
Quant.
uint4
Size
4B