Yang Zequn
bjlfzs
AI & ML interests
None yet
Recent Activity
upvoted a paper 25 days ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation upvoted a paper 2 months ago
ROI-Reasoning: Rational Optimization for Inference via Pre-Computation Meta-Cognition upvoted a paper 5 months ago
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding