2 12

Yuhang Zhou

zyhang1998

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

submitted a paper about 7 hours ago

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

upvoted a paper about 7 hours ago

Conditional Hypothesis Generation for LLM-Based Text Analysis with Researcher-Specified Covariates

View all activity

Organizations

upvoted a paper about 7 hours ago

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

Paper • 2606.01476 • Published 4 days ago • 7

submitted a paper to Daily Papers about 7 hours ago

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

Paper • 2606.01476 • Published 4 days ago • 7

upvoted a paper about 7 hours ago

Conditional Hypothesis Generation for LLM-Based Text Analysis with Researcher-Specified Covariates

Paper • 2606.03029 • Published 2 days ago • 4

upvoted a paper 12 days ago

Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning

Paper • 2605.22642 • Published 14 days ago • 35

upvoted a paper about 2 months ago

LPM 1.0: Video-based Character Performance Model

Paper • 2604.07823 • Published Apr 9 • 80

authored a paper about 2 months ago

Synthetic Sandbox for Training Machine Learning Engineering Agents

Paper • 2604.04872 • Published Apr 6 • 14

submitted a paper to Daily Papers about 2 months ago

Synthetic Sandbox for Training Machine Learning Engineering Agents

Paper • 2604.04872 • Published Apr 6 • 14

authored a paper 5 months ago

Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published Jan 8 • 40

upvoted a paper 5 months ago

Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published Jan 8 • 40

upvoted 2 papers 6 months ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 40

First Frame Is the Place to Go for Video Content Customization

Paper • 2511.15700 • Published Nov 19, 2025 • 54

updated a model 9 months ago

zyhang1998/qwen3_8b_answer_rankmind_rl

8B • Updated Sep 17, 2025 • 1

published a model 9 months ago

zyhang1998/qwen3_8b_answer_rankmind_rl

8B • Updated Sep 17, 2025 • 1

updated a model 9 months ago

zyhang1998/qwen3_8b_code_rankmind_rl

8B • Updated Sep 17, 2025

published a model 9 months ago

zyhang1998/qwen3_8b_code_rankmind_rl

8B • Updated Sep 17, 2025

updated a model 9 months ago

zyhang1998/qwen3_8b_plan_rankmind_rl

8B • Updated Sep 17, 2025

published a model 9 months ago

zyhang1998/qwen3_8b_plan_rankmind_rl

8B • Updated Sep 17, 2025

upvoted a paper 9 months ago

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27, 2025 • 85

updated a model 10 months ago

zyhang1998/gemma3_27b_textonly

27B • Updated Aug 22, 2025

published a model 10 months ago

zyhang1998/gemma3_27b_textonly

27B • Updated Aug 22, 2025

Yuhang Zhou

AI & ML interests

Recent Activity

Organizations

zyhang1998's activity