Understand and Accelerate Memory Processing Pipeline for Disaggregated LLM Inference Paper • 2603.29002 • Published 5 days ago • 4
Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory Paper • 2604.01007 • Published 2 days ago • 18
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published 2 days ago • 74
Paper Reconstruction Evaluation: Evaluating Presentation and Hallucination in AI-written Papers Paper • 2604.01128 • Published 3 days ago • 10
A Comparative Study in Surgical AI: Datasets, Foundation Models, and Barriers to Med-AGI Paper • 2603.27341 • Published 7 days ago • 7
OptiMer: Optimal Distribution Vector Merging Is Better than Data Mixing for Continual Pre-Training Paper • 2603.28858 • Published 5 days ago • 7
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 6 days ago • 127
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 15 days ago • 310
Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models? Paper • 2603.22582 • Published 12 days ago • 6
Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design Paper • 2603.28376 • Published 5 days ago • 18
Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models Paper • 2603.24844 • Published 10 days ago • 9
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills Paper • 2603.25158 • Published 9 days ago • 47