arxiv:2603.23483
Huang
Jinfa
AI & ML interests
None yet
Recent Activity
upvoted a paper about 23 hours ago
AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward liked a model about 1 month ago
google/gemma-4-E2B-it upvoted a paper about 1 month ago
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization