8 1

姚冠宇

yaogy

AI & ML interests

None yet

Recent Activity

upvoted a paper about 8 hours ago

OpenClaw-RL: Train Any Agent Simply by Talking

upvoted a paper about 19 hours ago

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

updated a model about 2 months ago

yaogy/qwen-7b-visionr1-kl-320

View all activity

Organizations

upvoted a paper about 8 hours ago

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 29 days ago • 150

upvoted a paper about 19 hours ago

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

Paper • 2604.04323 • Published 3 days ago • 24

updated 2 models about 2 months ago

yaogy/qwen-7b-visionr1-kl-320

8B • Updated Feb 20

yaogy/qwen-7b-visionr1-kl-340

8B • Updated Feb 20

published 2 models about 2 months ago

yaogy/qwen-7b-visionr1-kl-340

8B • Updated Feb 20

yaogy/qwen-7b-visionr1-kl-320

8B • Updated Feb 20

updated a model about 2 months ago

yaogy/qwen25-vl-dapo-visionr1

8B • Updated Feb 18

published a model about 2 months ago

yaogy/qwen25-vl-dapo-visionr1

8B • Updated Feb 18

updated a model about 2 months ago

yaogy/qwen25vl-7b-without-text-280

8B • Updated Feb 17

published a model about 2 months ago

yaogy/qwen25vl-7b-without-text-280

8B • Updated Feb 17

updated a model about 2 months ago

yaogy/qwen3b-without-text-320

4B • Updated Feb 17

published a model about 2 months ago

yaogy/qwen3b-without-text-320

4B • Updated Feb 17

upvoted a paper 2 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 110

upvoted a paper 3 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 230

upvoted a paper 4 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 126

updated a model 4 months ago

yaogy/qwen3b-papo

4B • Updated Nov 24, 2025

published a model 4 months ago

yaogy/qwen3b-papo

4B • Updated Nov 24, 2025

updated 2 models 5 months ago

yaogy/partial-text-3b

4B • Updated Nov 24, 2025

yaogy/partial-text-7b

8B • Updated Nov 24, 2025

published a model 5 months ago

yaogy/partial-text-7b

8B • Updated Nov 24, 2025

姚冠宇

AI & ML interests

Recent Activity

Organizations

yaogy's activity