arxiv:2504.13055
Xiangyan Liu
xyliu6
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models upvoted a paper 2 days ago
Rethinking the Divergence Regularization in LLM RL upvoted a paper about 2 months ago
ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker AgentsOrganizations
None yet