6 18 3

Jianhao Yan

Elliott

ElliottYan

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

upvoted a paper 2 days ago

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

upvoted a paper 3 months ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

View all activity

Organizations

None yet

upvoted a paper about 7 hours ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 2 days ago • 107

upvoted a paper 2 days ago

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

Paper • 2605.10832 • Published 4 days ago • 20

upvoted 2 papers 3 months ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Paper • 2602.11748 • Published Feb 12 • 38

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

submitted a paper to Daily Papers 3 months ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

upvoted a paper 4 months ago

AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

Paper • 2601.18631 • Published Jan 26 • 48

authored a paper 4 months ago

BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search

Paper • 2601.11037 • Published Jan 16 • 17

upvoted 2 papers 6 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30, 2025 • 87

upvoted 2 papers 8 months ago

TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them

Paper • 2509.21117 • Published Sep 25, 2025 • 30

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration

Paper • 2509.14760 • Published Sep 18, 2025 • 53

upvoted 2 papers 11 months ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30, 2025 • 90

IntFold: A Controllable Foundation Model for General and Specialized Biomolecular Structure Prediction

Paper • 2507.02025 • Published Jul 2, 2025 • 36

updated a collection 12 months ago

LUFFY-RL

Collection

9 items • Updated May 30, 2025 • 10

updated a dataset 12 months ago

Elliott/Openr1-Math-48k-Complement

Viewer • Updated May 30, 2025 • 47.9k • 16

published a dataset 12 months ago

Elliott/Openr1-Math-48k-Complement

Viewer • Updated May 30, 2025 • 47.9k • 16

updated a collection 12 months ago

LUFFY-RL

Collection

9 items • Updated May 30, 2025 • 10

updated a model 12 months ago

Elliott/Qwen2.5-Math-7B-SFT-RL

Text Generation • 8B • Updated May 30, 2025 • 3

published a model 12 months ago

Elliott/Qwen2.5-Math-7B-SFT-RL

Text Generation • 8B • Updated May 30, 2025 • 3

updated a collection 12 months ago

LUFFY-RL

Collection

9 items • Updated May 30, 2025 • 10

Jianhao Yan

AI & ML interests

Recent Activity

Organizations

Elliott's activity