Yang Liu
AronYang
AI & ML interests
Deep Learning, Optimization
Recent Activity
upvoted a paper 21 days ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation upvoted a paper 5 months ago
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding liked
a dataset 7 months ago
tencent/AutoCodeBenchmark Organizations
None yet