Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models Paper • 2603.25716 • Published 5 days ago • 141
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling Paper • 2603.25746 • Published 5 days ago • 128
MambaEye: A Size-Agnostic Visual Encoder with Causal Sequential Processing Paper • 2511.19963 • Published Nov 25, 2025 • 2
SegviGen: Repurposing 3D Generative Model for Part Segmentation Paper • 2603.16869 • Published 14 days ago • 18
prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX Image-Text-to-Text • 4B • Updated Feb 21 • 1.66k • 17
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper • 2603.03143 • Published 28 days ago • 145
Running 3.76k The Ultra-Scale Playbook 🌌 3.76k The ultimate guide to training LLM on large GPU Clusters
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper • 2602.18422 • Published Feb 20 • 30
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 490
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published Jan 30 • 222
Latent Diffusion Model without Variational Autoencoder Paper • 2510.15301 • Published Oct 17, 2025 • 50
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset Paper • 2510.15742 • Published Oct 17, 2025 • 51
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper • 2510.05684 • Published Oct 7, 2025 • 145
Lynx: Towards High-Fidelity Personalized Video Generation Paper • 2509.15496 • Published Sep 19, 2025 • 13
JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching Paper • 2506.23552 • Published Jun 30, 2025 • 10
Running on Zero MCP Featured 322 Chain-of-Zoom 🚀 322 Extreme Super-Resolution via Scale Autoregression