WendyCheung
WendyCheung
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 19 hours ago
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning upvoted a paper 24 days ago
Prism: Spectral-Aware Block-Sparse Attention upvoted a paper about 1 month ago
TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization Organizations
None yet