Diversity-Incentivized Exploration for Versatile Reasoning
Zican Hu
huzican
AI & ML interests
None yet
Recent Activity
upvoted a paper about 14 hours ago
Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe updated a model about 1 month ago
huzican/unify_sft_tit-ckpt-3000