Hugging Face
0.8 TFLOPS
2
Saksham Loonker
SLoonker
saksham-loonker
AI & ML interests
I am very interested in RL and other post-training methods, as well as building efficient LLMs for resource-constrained settings.
Recent Activity
new activity
about 20 hours ago
SLoonker/RL-Claude-Reasoning-SFT: which claude model?
updated a collection
10 days ago
Small RL Datasets For Training
Organizations
None yet
SLoonker's activity
New activity in SLoonker/RL-Claude-Reasoning-SFT
about 20 hours ago
which claude model?
1
#2 opened 9 days ago by Roman1111111
updated a collection
10 days ago
Small RL Datasets For Training
Collection • 6 items • Updated 10 days ago
New activity in SLoonker/RL-OpenCodeReasoning-DPO
10 days ago
Added License Of Source
#1 opened 10 days ago by SmartAnon
published and updated a dataset
10 days ago
SLoonker/RL-Claude-Creative-Writing-DPO
Updated 10 days ago • 818 • 18
published and updated a dataset
10 days ago
SLoonker/RL-STEM-DPO
Updated 10 days ago • 1.51k • 20
published and updated a dataset
10 days ago
SLoonker/RL-OpenCodeReasoning-DPO
Updated 10 days ago • 2.75k • 15
published and updated a dataset
10 days ago
SLoonker/RL-Claude-Creative-Writing-SFT
Updated 10 days ago • 818 • 28 • 1
published and updated a dataset
10 days ago
SLoonker/RL-Ling-Coding-DPO
Updated 10 days ago • 2.84k • 18
published and updated a dataset
10 days ago
SLoonker/RL-Claude-Reasoning-GRPO-Prompts
Updated 10 days ago • 2.21k • 18