Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
5
Plyusov
daniilplyusov
Follow
kefirski's profile picture
1 follower
ยท
2 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
18 days ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
upvoted
a
paper
22 days ago
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare
upvoted
a
paper
7 months ago
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success
View all activity
Organizations
None yet
models
1
daniilplyusov/reward_model
Updated
Feb 2, 2025
datasets
0
None public yet