tencent/PlanningBench
HRBench: Benchmarking and Understanding Thinking-Mode Switch Strategies in Hybrid-Reasoning LLMs
Efficient Agentic Reinforcement Learning with On-Policy Intrinsic Knowledge Boundary Enhancement