OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains Paper • 2606.14702 • Published 15 days ago • 31
Redesign Mixture-of-Experts Routers with Manifold Power Iteration Paper • 2606.12397 • Published 17 days ago • 87
SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer Paper • 2605.30409 • Published 30 days ago • 41
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue Paper • 2605.30993 • Published 29 days ago • 59