arxiv:2502.04322
Narutatsu Ri
narutatsuri
AI & ML interests
None yet
Recent Activity
updated a dataset 5 days ago
narutatsuri/lrm_safety-artifacts published a dataset 5 days ago
narutatsuri/lrm_safety-artifacts authored a paper 2 months ago
Self-Distillation Zero: Self-Revision Turns Binary Rewards into Dense Supervision