arxiv:2410.01180
Kangda Wei
kangdawei
models 50
kangdawei/MMR-Sigmoid-DAPO-7B
Text Generation • 8B • Updated • 376
kangdawei/MMR-Sigmoid-DR-GRPO-8B
Text Generation • 8B • Updated • 2
kangdawei/MMR-Sigmoid-DAPO-8B
Text Generation • 8B • Updated • 10
kangdawei/MMR-Sigmoid-DAPO
Text Generation • 2B • Updated • 7
kangdawei/MMR-Sigmoid-GRPO-8B
Text Generation • 8B • Updated • 2 • 1
kangdawei/MMR-Sigmoid-GRPO-7B
Text Generation • 8B • Updated • 1
kangdawei/MMR-Sigmoid-DR-GRPO-7B
Text Generation • 8B • Updated • 1
kangdawei/DAPO-8B
Text Generation • 8B • Updated • 4
kangdawei/DAPO-7B
Text Generation • 8B • Updated • 7 • 1
kangdawei/MMR-DAPO-8B
Text Generation • 8B • Updated • 3 • 1
datasets 0
None public yet