How and What to Imagine? Visual Thinking in Unified Multimodal Models for Cross-View Spatial Reasoning Paper • 2605.27310 • Published 5 days ago • 18
RiT: Vanilla Diffusion Transformers Suffice in Representation Space Paper • 2605.21981 • Published 10 days ago • 10