Mingjie Bi's picture

2 1

Mingjie Bi

mjb95m

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Less is More: Early Stopping Rollout for On-Policy Distillation

liked a dataset 5 months ago

bigai/TongSIM-Asset

upvoted a paper 10 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

View all activity

Organizations

upvoted a paper 5 days ago

Less is More: Early Stopping Rollout for On-Policy Distillation

Paper • 2605.27028 • Published 7 days ago • 11

upvoted a paper 10 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 189