OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data Paper • 2606.13432 • Published 11 days ago • 105
Avatar V: Scaling Video-Reference Avatar Video Generation Paper • 2606.13872 • Published 11 days ago • 9
Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation Paper • 2606.17030 • Published 7 days ago • 25
PermaVid: Consistent Video Generation Across Edits via Disentangled Context Memory Paper • 2606.16449 • Published 7 days ago • 5
Beyond Monolingual Deep Research: Evaluating Agents and Retrievers with Cross-Lingual BrowseComp-Plus Paper • 2606.15345 • Published 9 days ago • 15
ActWorld: From Explorable to Interactive World Model via Action-Aware Memory Paper • 2606.17730 • Published 6 days ago • 8
iMaC: Translating Actions into Motion and Contact Images for Embodied World Models Paper • 2606.09813 • Published 14 days ago • 13
MBench: A Comprehensive Benchmark on Memory Capability for Video World Models Paper • 2606.00793 • Published 14 days ago • 11
World Pilot: Steering Vision-Language-Action Models with World-Action Priors Paper • 2606.12403 • Published 12 days ago • 26
EgoCS-400K: An Egocentric Gameplay Dataset for World Models Paper • 2606.18180 • Published 6 days ago • 15
DreamX-World 1.0: A General-Purpose Interactive World Model Paper • 2606.16993 • Published 7 days ago • 106
BRDFusion: Physics Meets Generation for Urban Scene Inverse Rendering Paper • 2606.17049 • Published 7 days ago • 27
WEAVER, Better, Faster, Longer: An Effective World Model for Robotic Manipulation Paper • 2606.13672 • Published 11 days ago • 4
MoVerse: Real-Time Video World Modeling with Panoramic Gaussian Scaffold Paper • 2606.13376 • Published 11 days ago • 14
SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning Paper • 2606.13673 • Published 11 days ago • 103