HKUSTGZ
university
AI & ML interests
None defined yet.
Papers
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL
TimeRFT: Stimulating Generalizable Time Series Forecasting for TSFMs via Reinforcement Finetuning
HKUSTGZ 's datasets
None public yet