From Web to Pixels: Bringing Agentic Search into Visual Perception Paper • 2605.12497 • Published 7 days ago • 14 • 1
Flow-OPD: On-Policy Distillation for Flow Matching Models Paper • 2605.08063 • Published 11 days ago • 95
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published 13 days ago • 97
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook Paper • 2604.02029 • Published Apr 2 • 151
Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis Paper • 2603.29620 • Published Mar 31 • 46