arxiv:2509.17158
Hugo Laurençon
HugoLaurencon
AI & ML interests
None yet
Recent Activity
liked a Space about 1 hour ago
mishig/jepawiki
upvoted a paper 11 days ago
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information
upvoted a paper 11 days ago
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories