MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents Paper • 2602.02474 • Published 4 days ago • 31
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper • 2602.06028 • Published 1 day ago • 27
RISE-Video: Can Video Generators Decode Implicit World Rules? Paper • 2602.05986 • Published 1 day ago • 25
UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing Paper • 2602.02437 • Published 4 days ago • 74
Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper • 2602.00919 • Published 6 days ago • 217
No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs Paper • 2602.02103 • Published 5 days ago • 64
EgoActor: Grounding Task Planning into Spatial-aware Egocentric Actions for Humanoid Robots via Visual-Language Models Paper • 2602.04515 • Published 3 days ago • 33
Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization Paper • 2602.02958 • Published 4 days ago • 32
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation Paper • 2602.03796 • Published 3 days ago • 49
Future Optical Flow Prediction Improves Robot Control & Video Generation Paper • 2601.10781 • Published 22 days ago • 19
ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models Paper • 2601.11404 • Published 22 days ago • 25
Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization Paper • 2601.12993 • Published 19 days ago • 75
3AM: Segment Anything with Geometric Consistency in Videos Paper • 2601.08831 • Published 24 days ago • 34
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published 23 days ago • 32
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Paper • 2601.10611 • Published 23 days ago • 28
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation Paper • 2601.10061 • Published 23 days ago • 30
RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes Paper • 2601.05249 • Published 29 days ago • 46
3AM: Segment Anything with Geometric Consistency in Videos Paper • 2601.08831 • Published 24 days ago • 34