Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Runpeng Dai's picture
2 31 2

Runpeng Dai

Leo-Dai
TongZheng1999's profile picture
·

AI & ML interests

None yet

Recent Activity

authored a paper 6 days ago
DeltaRubric: Generative Multimodal Reward Modeling via Joint Planning and Verification
authored a paper 6 days ago
Reinforcing Multimodal Reasoning Against Visual Degradation
authored a paper 6 days ago
G-Zero: Self-Play for Open-Ended Generation from Zero Data
View all activity

Organizations

Killers's profile picture Efficient Reasoning's profile picture Parallel-R1-v2's profile picture

Leo-Dai 's models 17

Leo-Dai/PPO_BL_250_critic

4B • Updated Aug 15, 2025 • 1

Leo-Dai/PPO_BL_200_critic

Updated Aug 15, 2025 • 4

Leo-Dai/PPO_BL_300_actor

Updated Aug 15, 2025

Leo-Dai/PPO_BL_250_actor

Updated Aug 15, 2025

Leo-Dai/PPO_BL_300_critic

Updated Aug 15, 2025

Leo-Dai/GRPO_BL_40

4B • Updated Aug 15, 2025

Leo-Dai/GRPO_BL_30

4B • Updated Aug 15, 2025

Leo-Dai/GRPO_BL_20

4B • Updated Aug 15, 2025

Leo-Dai/GRPO_BL_400

4B • Updated Aug 15, 2025

Leo-Dai/GRPO_BL_10

4B • Updated Aug 15, 2025

Leo-Dai/GRPO_BL_350

4B • Updated Aug 15, 2025

Leo-Dai/GRPO_BL_200

4B • Updated Aug 13, 2025

Leo-Dai/GRPO_BL_150

4B • Updated Aug 13, 2025 • 1

Leo-Dai/GRPO_BL_100

4B • Updated Aug 13, 2025 • 1

Leo-Dai/GRPO_BL_300

4B • Updated Aug 13, 2025 • 1

Leo-Dai/GRPO_BL_250

4B • Updated Aug 13, 2025 • 1

Leo-Dai/GRPO_BL_50

4B • Updated Aug 13, 2025 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs