arxiv:2508.15763
Zhouqi Hua
ZhouqiHUA
AI & ML interests
reasoning LLM
Recent Activity
upvoted a paper 18 days ago
DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning upvoted a collection 23 days ago
RL and Agents liked
a model 25 days ago
internlm/Intern-S1-Pro Organizations
None yet