Type: Local
From: Alibaba
Quantisation: uint4
Precision: No
Size: 14B
Qwen3-14B-MLX-4bit is a 14.8 billion parameter causal language model from the latest Qwen3 series, quantised to 4 bits and optimized for the MLX framework. It advances reasoning, instruction-following, and multilingual capability, with a native context length of 32,768 tokens that can be extended to 131,072 tokens using YaRN scaling. Within a single architecture, the model switches seamlessly between a thinking mode for complex logical reasoning, mathematics, and coding, and a non-thinking mode for efficient general-purpose dialogue, so each request can use the mode best suited to it. Qwen3-14B surpasses the earlier QwQ and Qwen2.5 models in reasoning, shows stronger human preference alignment for creative writing and role-playing, offers solid agent capabilities for precise tool integration, and supports instruction following in over 100 languages.
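To illustrate the YaRN extension mentioned above, the Qwen3 documentation describes enabling it via a `rope_scaling` entry in the model's `config.json`; a sketch of that fragment is shown below (the `factor` of 4.0 scales the native 32,768-token window to 131,072 tokens — treat the exact keys as assumptions to verify against the model repository's own config):

```json
{
  "rope_scaling": {
    "rope_type": "yarn",
    "factor": 4.0,
    "original_max_position_embeddings": 32768
  }
}
```

Note that YaRN is a static scaling method: enabling it can degrade quality on short inputs, so the Qwen documentation recommends turning it on only when long-context processing is actually required.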