This is a 4-bit quantized version of Google's Gemma 3 1B instruction-tuned model converted to MLX format for efficient inference on Apple silicon devices. The model is optimized for text generation tasks and maintains the instruction-following capabilities of the original Gemma 3 1B model while reducing memory requirements through 4-bit quantization.
available local models on Mirai:
available local models on Mirai:
Name
Quantisation
Size
gemma-3-1b-it
uint4
1B
Quant.
uint4
Size
1B
gemma-3-27b-it
uint4
27B
Quant.
uint4
Size
27B
gemma-3-4b-it
uint4
4B
Quant.
uint4
Size
4B
gemma-3-1b-it-4bit
uint4
1B
Quant.
uint4
Size
1B
gemma-3-1b-it-8bit
uint4
1B
Quant.
uint4
Size
1B
gemma-3-27b-it-4bit
uint4
27B
Quant.
uint4
Size
27B
gemma-3-27b-it-8bit
uint4
27B
Quant.
uint4
Size
27B
gemma-3-4b-it-4bit
uint4
4B
Quant.
uint4
Size
4B
gemma-3-4b-it-8bit
uint4
4B
Quant.
uint4
Size
4B
This is a 4-bit quantized version of Google's Gemma 3 1B instruction-tuned model converted to MLX format for efficient inference on Apple silicon devices. The model is optimized for text generation tasks and maintains the instruction-following capabilities of the original Gemma 3 1B model while reducing memory requirements through 4-bit quantization.
available local models on Mirai:
Name
Quantisation
Size
gemma-3-1b-it
uint4
1B
Quant.
uint4
Size
1B
gemma-3-27b-it
uint4
27B
Quant.
uint4
Size
27B
gemma-3-4b-it
uint4
4B
Quant.
uint4
Size
4B
gemma-3-1b-it-4bit
uint4
1B
Quant.
uint4
Size
1B
gemma-3-1b-it-8bit
uint4
1B
Quant.
uint4
Size
1B
gemma-3-27b-it-4bit
uint4
27B
Quant.
uint4
Size
27B
gemma-3-27b-it-8bit
uint4
27B
Quant.
uint4
Size
27B
gemma-3-4b-it-4bit
uint4
4B
Quant.
uint4
Size
4B
gemma-3-4b-it-8bit
uint4
4B
Quant.
uint4
Size
4B