Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization Paper • 2602.22675 • Published 1 day ago • 14 • 2
AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning Paper • 2602.23258 • Published 1 day ago • 22 • 2
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization Paper • 2602.23008 • Published 1 day ago • 26 • 2
Imagination Helps Visual Reasoning, But Not Yet in Latent Space Paper • 2602.22766 • Published 1 day ago • 31 • 2
MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios Paper • 2602.22638 • Published 1 day ago • 85 • 2
From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models Paper • 2602.22859 • Published 1 day ago • 141 • 2
The Trinity of Consistency as a Defining Principle for General World Models Paper • 2602.23152 • Published 1 day ago • 158 • 3
GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL Paper • 2602.22190 • Published 2 days ago • 13 • 3
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning Paper • 2602.21534 • Published 3 days ago • 21 • 3
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference Paper • 2602.21548 • Published 3 days ago • 21 • 3
Solaris: Building a Multiplayer Video World Model in Minecraft Paper • 2602.22208 • Published 2 days ago • 21 • 3
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation Paper • 2602.12160 • Published 15 days ago • 34 • 4
SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model Paper • 2602.21818 • Published 2 days ago • 45 • 7
MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models Paper • 2602.17602 • Published 8 days ago • 52 • 3
HyTRec: A Hybrid Temporal-Aware Attention Architecture for Long Behavior Sequential Recommendation Paper • 2602.18283 • Published 7 days ago • 52 • 3
LongCLI-Bench: A Preliminary Benchmark and Study for Long-horizon Agentic Programming in Command-Line Interfaces Paper • 2602.14337 • Published 12 days ago • 12 • 3
Query-focused and Memory-aware Reranker for Long Context Processing Paper • 2602.12192 • Published 15 days ago • 49 • 4
ManCAR: Manifold-Constrained Latent Reasoning with Adaptive Test-Time Computation for Sequential Recommendation Paper • 2602.20093 • Published 4 days ago • 28 • 4