kaikai zhao's picture

7

kaikai zhao

LifeIsSoSolong

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 15 hours ago

Qwen-AgentWorld: Language World Models for General Agents

upvoted a paper about 15 hours ago

TTRL: Test-Time Reinforcement Learning

upvoted a paper about 16 hours ago

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

View all activity

Organizations

None yet

upvoted 2 papers about 15 hours ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 2 days ago • 79

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22, 2025 • 123

upvoted a paper about 16 hours ago

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

Paper • 2606.24530 • Published 2 days ago • 47

upvoted a paper 1 day ago

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

Paper • 2606.23654 • Published 3 days ago • 71

upvoted 2 papers about 1 month ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published May 18 • 30

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published May 13 • 165

upvoted a paper 9 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18, 2025 • 119