huangxiao
huangxiao39
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe upvoted a paper 2 months ago
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data upvoted a paper 7 months ago
MR-Align: Meta-Reasoning Informed Factuality Alignment for Large
Reasoning ModelsOrganizations
None yet