Type
Type
Local
From
From
Alibaba
Quantisation
Quantisation
uint8
Precision
Precision
No
Size
Size
1.7B
Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts models built upon extensive training. The Qwen3-1.7B model is a 1.7 billion parameter causal language model with a 32,768 token context length, featuring unique support for seamless switching between thinking mode for complex logical reasoning, math, and coding, and non-thinking mode for efficient general-purpose dialogue within a single model. Key capabilities include significantly enhanced reasoning abilities that surpass previous QwQ and Qwen2.5 instruct models on mathematics, code generation, and commonsense logical reasoning. The model excels in human preference alignment with strong performance in creative writing, role-playing, multi-turn dialogues, and instruction following. It also demonstrates expertise in agent capabilities with precise tool integration in both thinking and non-thinking modes, and supports over 100 languages and dialects with strong multilingual instruction following and translation capabilities.
Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts models built upon extensive training. The Qwen3-1.7B model is a 1.7 billion parameter causal language model with a 32,768 token context length, featuring unique support for seamless switching between thinking mode for complex logical reasoning, math, and coding, and non-thinking mode for efficient general-purpose dialogue within a single model. Key capabilities include significantly enhanced reasoning abilities that surpass previous QwQ and Qwen2.5 instruct models on mathematics, code generation, and commonsense logical reasoning. The model excels in human preference alignment with strong performance in creative writing, role-playing, multi-turn dialogues, and instruction following. It also demonstrates expertise in agent capabilities with precise tool integration in both thinking and non-thinking modes, and supports over 100 languages and dialects with strong multilingual instruction following and translation capabilities.