nvidia/Nemotron-TwoTower-30B-A3B-Base-BF16 Text Generation • 63B • Updated about 21 hours ago • 1.71k • 32
Holistic Data Scheduler for LLM Pre-training via Multi-Objective Reinforcement Learning Paper • 2606.24133 • Published 4 days ago • 7
nvidia/nemotron-3.5-asr-streaming-0.6b Automatic Speech Recognition • 0.6B • Updated about 3 hours ago • 56.4k • • 705