This model was converted to MLX format from DataPilot/ArrowCanaria-Llama-8B-RL-v0.1 using mlx-lm version 0.31.0.
Refer to the original model card for more details on the model.
Inference: M3 Ultra - LM Studio MLX v1.4
4bit 120.84tok/s γγ―γγ arrowcanaria-llama-8b-rl-v0.1-mlx@4bit
δ»ζ₯γδΈζ₯γη΄ ζ΄γγγγγ¨γγγγΎγγγγ«οΌπ δ½γγζδΌγγ§γγγγ¨γ―γγγΎγγοΌ
- Downloads last month
- 86
Model size
1B params
Tensor type
BF16
Β·
U32 Β·
Hardware compatibility
Log In to add your hardware
4-bit
Model tree for mlx-community/ArrowCanaria-Llama-8B-RL-v0.1-MLX-4bit
Base model
meta-llama/Llama-3.1-8B Finetuned
meta-llama/Llama-3.1-8B-Instruct Finetuned
tokyotech-llm/Llama-3.1-Swallow-8B-v0.5 Finetuned
DataPilot/ArrowCanaria-Llama-8B-SFT-v0.1 Finetuned
DataPilot/ArrowCanaria-Llama-8B-RL-v0.1