GQLA: Group-Query Latent Attention for Hardware-Adaptive Large Language Model Decoding Paper • 2605.15250 • Published 5 days ago • 2
Look Before You Leap: Autonomous Exploration for LLM Agents Paper • 2605.16143 • Published 4 days ago • 6
Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design Paper • 2605.15871 • Published 4 days ago • 5
MetaAgent-X: Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning Paper • 2605.14212 • Published 5 days ago • 9
ReactiveGWM: Steering NPC in Reactive Game World Models Paper • 2605.15256 • Published 5 days ago • 24
MMSkills: Towards Multimodal Skills for General Visual Agents Paper • 2605.13527 • Published 5 days ago • 108
Boosting Omni-Modal Language Models: Staged Post-Training with Visually Debiased Evaluation Paper • 2605.12034 • Published 6 days ago • 3
Learning to Communicate Locally for Large-Scale Multi-Agent Pathfinding Paper • 2605.07637 • Published 7 days ago • 18
STALE: Can LLM Agents Know When Their Memories Are No Longer Valid? Paper • 2605.06527 • Published 12 days ago • 42
Beyond Individual Intelligence: Surveying Collaboration, Failure Attribution, and Self-Evolution in LLM-based Multi-Agent Systems Paper • 2605.14892 • Published 5 days ago • 46
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published 5 days ago • 75
MinT: Managed Infrastructure for Training and Serving Millions of LLMs Paper • 2605.13779 • Published 6 days ago • 213
SpecBlock: Block-Iterative Speculative Decoding with Dynamic Tree Drafting Paper • 2605.07243 • Published 11 days ago • 4
Scaling Continual Learning to 300+ Tasks with Bi-Level Routing Mixture-of-Experts Paper • 2602.03473 • Published 11 days ago • 11
MDN: Parallelizing Stepwise Momentum for Delta Linear Attention Paper • 2605.05838 • Published 12 days ago • 5