Article Should We Still Pretrain Encoders with Masked Language Modeling? Nicolas-BZRD • Jul 2, 2025 • 22
trend-cybertron/Llama-Primus-Nemotron-70B-Instruct Text Generation • 71B • Updated Aug 9, 2025 • 1.23k • 16
WhiteRabbitNeo/Llama-3-WhiteRabbitNeo-8B-v2.0 Text Generation • 8B • Updated May 15, 2024 • 409 • 61