Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
1
4
39
Michał Wiliński
MWilinski
Follow
lucazsh's profile picture
misovalko's profile picture
mondalsurojit's profile picture
17 followers
·
26 following
https://michal-wilinski.com
inverse_hessian
JanekDev
AI & ML interests
Machine Learning, Reinforcement Learning
Recent Activity
updated
a model
about 4 hours ago
MWilinski/qwen2.5-3b-dpo-irl
published
a model
about 4 hours ago
MWilinski/qwen2.5-3b-dpo-irl
updated
a model
about 4 hours ago
MWilinski/qwen2.5-3b-sft-irl
View all activity
Organizations
MWilinski
's models
6
Sort: Recently updated
MWilinski/qwen2.5-3b-dpo-irl
Updated
about 4 hours ago
MWilinski/qwen2.5-3b-sft-irl
Updated
about 4 hours ago
MWilinski/qwen2.5-3b-gail
Updated
20 days ago
MWilinski/dro-v-qwen3-1.7b-paperlike
Updated
Mar 13
MWilinski/dro-qwen3-1.7b-full-fixed-tau
Updated
Feb 27
MWilinski/dro-qwen3-1.7b-full
Updated
Feb 27