CLEANER: Self-Purified Trajectories Boost Agentic Reinforcement Learning Paper • 2601.15141 • Published Jan 21 • 2