Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning Paper • 2606.04923 • Published 4 days ago • 37
electricsheepasia/asia-owid-deforestation-agriculture-plantations Viewer • Updated 3 days ago • 911 • 35 • 1
Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs Paper • 2605.30611 • Published 10 days ago • 190
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 11 days ago • 420