Llamba Collection A family of efficient recurrent language models distilled from Llama-3.x into the Mamba architecture • 6 items • Updated Jul 11, 2025 • 1
Llamba Collection A family of efficient recurrent language models distilled from Llama-3.x into the Mamba architecture • 6 items • Updated Jul 11, 2025 • 1
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention sirluk • Oct 7, 2024 • 71