Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives • Paper • 2601.20833 • Published 21 days ago • 177
WebWorld: A Large-Scale World Model for Web Agent Training • Paper • 2602.14721 • Published 2 days ago • 6
ResearchGym: Evaluating Language Model Agents on Real-World AI Research • Paper • 2602.15112 • Published 2 days ago • 13
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks • Paper • 2602.12670 • Published 6 days ago • 34