Llamba-8B-8bit-mlx

Run locally Apple devices with Mirai

Type

Type

Local

From

From

Cartesia

Quantisation

Quantisation

uint8

Precision

Precision

No

Size

Size

8B

Source

Source

Hugging Face Logo

The Llamba models are efficient, high-performance language models designed for edge computing applications as part of Cartesia's Edge library. These recurrent-based models leverage distillation techniques to achieve strong performance across standard benchmarks while maintaining computational efficiency. Available in three sizes (1B, 3B, and 8B parameters), Llamba models are optimized for deployment on resource-constrained devices and support multiple frameworks including PyTorch and MLX for Metal hardware acceleration.

1
Choose framework
2
Run the following command to install Mirai SDK
SPMhttps://github.com/trymirai/uzu-swift
3
Set Mirai API keyGet API Key
4
Apply code
Loading...

Llamba-8B-8bit-mlx

Run locally Apple devices with Mirai

Type

Local

From

Cartesia

Quantisation

uint8

Precision

float16

Size

8B

Source

Hugging Face Logo

The Llamba models are efficient, high-performance language models designed for edge computing applications as part of Cartesia's Edge library. These recurrent-based models leverage distillation techniques to achieve strong performance across standard benchmarks while maintaining computational efficiency. Available in three sizes (1B, 3B, and 8B parameters), Llamba models are optimized for deployment on resource-constrained devices and support multiple frameworks including PyTorch and MLX for Metal hardware acceleration.

1
Choose framework
2
Run the following command to install Mirai SDK
SPMhttps://github.com/trymirai/uzu-swift
3
Set Mirai API keyGet API Key
4
Apply code
Loading...