Xuying Ning

YennNing

https://xuyingning.com/

AI & ML interests

None yet

Recent Activity

authored a paper 15 days ago

Code as Agent Harness

upvoted a paper 6 days ago

Advancing Creative Physical Intelligence in Large Multimodal Models

Paper • 2605.26396 • Published 11 days ago • 19

upvoted a paper 15 days ago

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

Paper • 2605.21468 • Published 16 days ago • 50

upvoted a paper 16 days ago

Code as Agent Harness

Paper • 2605.18747 • Published 18 days ago • 213

upvoted a paper 20 days ago

Useful Memories Become Faulty When Continuously Updated by LLMs

Paper • 2605.12978 • Published 23 days ago • 18

upvoted a paper 23 days ago

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Paper • 2605.10899 • Published 25 days ago • 78

upvoted a paper about 1 month ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published Apr 30 • 218

upvoted a paper 3 months ago

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 153

upvoted a paper 4 months ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 204

upvoted 2 papers 5 months ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 158

When Reasoning Meets Its Laws

Paper • 2512.17901 • Published Dec 19, 2025 • 62

upvoted a paper 6 months ago

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published Nov 25, 2025 • 128

upvoted a paper 7 months ago

Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph

Paper • 2511.00086 • Published Oct 29, 2025 • 42

upvoted a paper 8 months ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 137

upvoted a paper 11 months ago

MIRIX: Multi-Agent Memory System for LLM-Based Agents

Paper • 2507.07957 • Published Jul 10, 2025 • 80

upvoted a paper 12 months ago

Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance

Paper • 2506.06444 • Published Jun 6, 2025 • 73

upvoted 2 papers about 1 year ago

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30, 2025 • 97

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5, 2025 • 81
