arxiv:2506.02387
Huining Yuan
HuiningYuan
ยท
AI & ML interests
Reinforcement learning, LLM Agents, World models
Recent Activity
upvoted
a
paper
about 14 hours ago
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning
upvoted
a
collection
2 months ago
MARSHAL
upvoted
a
paper
2 months ago
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle