孙子轩

matthewonssd5s

AI & ML interests

None yet

Recent Activity

liked a dataset about 2 hours ago

leokswang/20122025-foldclo-02_lerobot_format

liked a dataset 6 days ago

xlangai/ubuntu_osworld_file_cache

liked a dataset 7 days ago

i998979/TransitApp

Organizations

None yet

upvoted a paper 7 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 16 days ago • 192

upvoted a paper 9 days ago

SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution

Paper • 2605.18401 • Published 10 days ago • 126

upvoted a paper 12 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 15 days ago • 269

upvoted a paper 23 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 25 days ago • 165

upvoted a paper 26 days ago

Leveraging Verifier-Based Reinforcement Learning in Image Editing

Paper • 2604.27505 • Published 28 days ago • 57

upvoted a paper about 1 month ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 247

upvoted 4 papers about 2 months ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 630

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 96

When Documents Disagree: Measuring Institutional Variation in Transplant Guidance with Retrieval-Augmented Language Models

Paper • 2603.21460 • Published Mar 23 • 6

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 351

upvoted 3 papers 2 months ago

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 372

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published Mar 17 • 248