WUSH: Near-Optimal Adaptive Transforms for LLM Quantization Paper • 2512.00956 • Published Nov 30, 2025 • 23
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2512.20848 • Published Dec 23, 2025 • 42
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 305
Dataset comparison models Collection 1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 42
EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs Paper • 2403.02775 • Published Mar 5, 2024 • 13
OpenMath Collection A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated about 2 hours ago • 46
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models Paper • 2401.01335 • Published Jan 2, 2024 • 69
Tulu V2 Suite Collection The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" • 19 items • Updated Dec 23, 2025 • 45