VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use Paper • 2509.01055 • Published Sep 1, 2025 • 79
AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework Paper • 2510.04206 • Published Oct 5, 2025 • 3
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use Paper • 2510.05592 • Published Oct 7, 2025 • 107
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use Paper • 2602.02160 • Published about 1 month ago • 13
Budget-Aware Tool-Use Enables Effective Agent Scaling Paper • 2511.17006 • Published Nov 21, 2025 • 33