Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
9
13
CL Yu
clyu
Follow
21world's profile picture
1 follower
·
4 following
AI & ML interests
None yet
Recent Activity
liked
a model
3 days ago
Qwen/Qwen3-Coder-Next
submitted
a paper
3 days ago
Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training
upvoted
an
article
5 days ago
We Got Claude to Build CUDA Kernels and teach open models!
View all activity
Organizations
clyu
's models
9
Sort: Recently updated
clyu/clip0.28_clipl0.2_vanilla_bsz512_mb128
Updated
Dec 17, 2025
clyu/cliph4_clipl0.5_cumloss_bsz512_mb128
Updated
Dec 17, 2025
clyu/qwen3_14b_rstar_sft_step802
15B
•
Updated
Nov 17, 2025
clyu/mistral12b_skyworkllama8b_grpo_step180
12B
•
Updated
Sep 7, 2025
•
1
clyu/mistral12b_skyworkllama8b_grpo_step160
12B
•
Updated
Sep 7, 2025
•
1
clyu/mistral12b_skyworkllama8b_grpo_step120
12B
•
Updated
Sep 6, 2025
clyu/mistral12b_skyworkllama8b_grpo_step80
12B
•
Updated
Sep 6, 2025
clyu/Mixtral-8x22B-Instruct-v0.1-mcore
Updated
May 23, 2025
clyu/Mixtral-8x7B-Instruct-v0.1-mcore
Updated
Apr 24, 2025