SPARC-RAG: Adaptive Sequential-Parallel Scaling with Context Management for Retrieval-Augmented Generation Paper • 2602.00083 • Published Jan 22
Rethinking RL for LLM Reasoning: It's Sparse Policy Selection, Not Capability Learning Paper • 2605.06241 • Published 7 days ago • 4
Rethinking RL for LLM Reasoning: It's Sparse Policy Selection, Not Capability Learning Paper • 2605.06241 • Published 7 days ago • 4
LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning Paper • 2512.05325 • Published Dec 5, 2025 • 5
LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning Paper • 2512.05325 • Published Dec 5, 2025 • 5