PANDO: Efficient Multimodal AI Agents via Online Skill Distillation Paper • 2605.24785 • Published 6 days ago • 5
The Model Says Walk: How Surface Heuristics Override Implicit Constraints in LLM Reasoning Paper • 2603.29025 • Published Mar 30 • 13