Know3D: Prompting 3D Generation with Knowledge from Vision-Language Models Paper • 2603.22782 • Published 6 days ago • 7
The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics Paper • 2603.14375 • Published 15 days ago • 17
RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models Paper • 2603.25502 • Published 4 days ago • 49
One View Is Enough! Monocular Training for In-the-Wild Novel View Generation Paper • 2603.23488 • Published 6 days ago • 4
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing Paper • 2603.12254 • Published 18 days ago • 21
Versatile Editing of Video Content, Actions, and Dynamics without Training Paper • 2603.17989 • Published 12 days ago • 16
EffectErase: Joint Video Object Removal and Insertion for High-Quality Effect Erasing Paper • 2603.19224 • Published 11 days ago • 18
3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model Paper • 2603.18524 • Published 11 days ago • 58
TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning Paper • 2603.12529 • Published 18 days ago • 18
Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models Paper • 2603.15557 • Published 14 days ago • 28