arxiv:2601.20209
Jinyang Wu
Jinyang23
AI & ML interests
large language models, reasoning, agentic rl
Recent Activity
upvoted
a
paper
2 days ago
HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing
upvoted
a
paper
2 days ago
TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents
upvoted
a
paper
2 days ago
SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration
Organizations
None yet