Plyusov's picture

5

Plyusov

daniilplyusov

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 18 days ago

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

upvoted a paper 22 days ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

upvoted a paper 7 months ago

Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success

View all activity

Organizations

None yet

models 1

daniilplyusov/reward_model

Updated Feb 2, 2025

datasets 0

None public yet