Hugging Face
Peijia Qin
t2ance
2 followers · 3 following
AI & ML interests
None yet
Recent Activity
updated a dataset 5 days ago: t2ance/daj-eval-results
published a dataset 6 days ago: t2ance/daj-eval-results
updated a model 6 days ago: t2ance/CodeRM-GRPO-Selection-8B
Organizations
None yet
t2ance's models (54)
t2ance/CodeRM-GRPO-Selection-8B • 8B • Updated 6 days ago • 40.5k • 1
t2ance/CodeRM-Bilevel-GRPO-4B • 4B • Updated 7 days ago • 85 • 1
t2ance/CodeRM-OnlineGRPO-Selection-8B-Domain-K8s-v2 • Updated 9 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-v13-ThinkingMasked • Updated 9 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-v12-NoThinking • Updated 9 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v11 • Updated 10 days ago • 1
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v9 • Updated 13 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v6 • Updated 13 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v5 • Updated 13 days ago
t2ance/mle-playbooks • Updated 14 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v4 • Updated 14 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v3 • Updated 14 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v2 • Updated 14 days ago
t2ance/CodeRM-SFT-Warmup-Selection-4B-Merged • 4B • Updated 15 days ago • 7.56k
t2ance/sft-4b-onpolicy-rejection-sampling • Updated 15 days ago
t2ance/CodeRM-OnlineGRPO-Selection-8B-Domain-SFT-K8s • Updated 15 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT • Updated 15 days ago
t2ance/CodeRM-SFT-Warmup-Selection-8B-Merged • 8B • Updated 15 days ago • 7.69k
t2ance/CodeRM-SFT-Warmup-Selection-8B • Text Generation • Updated 15 days ago • 12
t2ance/CodeRM-SFT-Warmup-Selection-4B • Text Generation • Updated 15 days ago • 13
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-CrossDomain-SmallMeta • Updated 16 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain • Updated 16 days ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-CrossDomain • Updated 17 days ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Heuristic • Updated 18 days ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Baseline • Updated 19 days ago
t2ance/CodeRM-OnlineGRPO-Selection-8B-Baseline • Updated 22 days ago
t2ance/CodeRM-OnlineGRPO-Selection-2B-Domain • Updated 27 days ago
t2ance/CodeRM-DPO-Selection-Domain-2-7B-Hard-Betty-Test • Updated Mar 6
t2ance/CodeRM-OnlineGRPO-Selection-4B-Instance-Net • Updated Jan 30
t2ance/CodeRM-KTO-Selection-Instance-Table-2-14B-Hard • Updated Jan 29