Running Featured 85 Distilling 100B+ Models 40x Faster with TRL 📝 85 TRL distillation for 100B+ teachers, 40x faster
Running Agents 431 Reward Bench Leaderboard 📐 431 Explore and compare model scores on RewardBench benchmarks
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 1 day ago • 157
Sutra Pedagogical Datasets Collection High-quality synthetic educational datasets designed for LLM pretraining with structured pedagogical content across 9 knowledge domains. • 7 items • Updated Mar 17 • 5