view article Article Supercharge your OCR Pipelines with Open Models +5 merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq • Oct 21, 2025 • 310
view article Article FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages davanstrien • Jul 8, 2025 • 35
view article Article SmolLM3: smol, multilingual, long-context reasoner +21 eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 773
view article Article Fixing Open LLM Leaderboard with Math-Verify +2 hynky, alozowski, SaylorTwift, clefourrier • Feb 14, 2025 • 32
view article Article FineWeb2-C: Help Build Better Language Models in Your Language davanstrien • Dec 23, 2024 • 21
view article Article 🇨🇿 BenCzechMark - Can your LLM Understand Czech? +11 mfajcik, hynky, mdocekal, xdolez52, jstetina, Lakoc, popelucha, hales, michal-stefanik, Adamiros, davidadamczyk, JanH, jsedivy • Oct 1, 2024 • 24
view article Article 🇨🇿 BenCzechMark - Can your LLM Understand Czech? +11 mfajcik, hynky, mdocekal, xdolez52, jstetina, Lakoc, popelucha, hales, michal-stefanik, Adamiros, davidadamczyk, JanH, jsedivy • Oct 1, 2024 • 24