Weak-Driven Learning: How Weak Agents make Strong Agents Stronger Paper • 2602.08222 • Published 14 days ago • 261
Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph Paper • 2511.00086 • Published Oct 29, 2025 • 42
Running on CPU Upgrade 586 GAIA Leaderboard 🦾 586 Submit your model answers to GAIA benchmark and view leaderboard
Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance Paper • 2506.06444 • Published Jun 6, 2025 • 73