view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 lysandre, ArthurZ, cyrilvallez, reach-vb • Dec 1, 2025 • 311
view article Article Why Did MiniMax M2 End Up as a Full Attention Model? MiniMax-AI • Oct 30, 2025 • 80
view article Article Aligning to What? Rethinking Agent Generalization in MiniMax M2 MiniMax-AI • Oct 30, 2025 • 43
view article Article Supercharge your OCR Pipelines with Open Models +5 merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq • Oct 21, 2025 • 313
view article Article Gaia2 and ARE: Empowering the community to study agents +9 clefourrier, gregmialz, mlcu, mortimerp9, XciD, tfrere, evijit, RomainFroger, dheeraj7596, CarolinePascal, upiter • Sep 22, 2025 • 134
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders thomwolf, matthieu-lapeyre • Jul 9, 2025 • 800
view article Article SmolLM3: smol, multilingual, long-context reasoner +21 eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 777
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26, 2025 • 78
Built with Distill blog ❤️ Collection Collection of all interactive blogs built on top of Distill template. To create your own check: https://huggingface.co/spaces/lvwerra/distill-blog-tem • 6 items • Updated Mar 14, 2025 • 2
view article Article Fixing Open LLM Leaderboard with Math-Verify +2 hynky, alozowski, SaylorTwift, clefourrier • Feb 14, 2025 • 31
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 258
view article Article FineWeb2-C: Help Build Better Language Models in Your Language davanstrien • Dec 23, 2024 • 21
IrokoBench Collection a human-translated benchmark dataset for 16 African languages covering three tasks: NLI, MMLU and MGSM • 6 items • Updated May 31, 2024 • 21
view article Article Scaling AI-based Data Processing with Hugging Face + Dask +2 scj13, jrbourbeau, lhoestq, davanstrien • Oct 9, 2024 • 33