Running 5 AV Causal Scenario Retrieval Challenge 🔍 5 Explore the AV Causal Scenario Retrieval Challenge
view article Article Prompt Phrasing Shifts Model Performance More Than Tier — and Which LLMs Will Get Your Jokes Wayfinder6 • 4 days ago • 2
Memory is Reconstructed, Not Retrieved: Graph Memory for LLM Agents Paper • 2606.06036 • Published 27 days ago • 75
FastContext: Training Efficient Repository Explorer for Coding Agents Paper • 2606.14066 • Published 19 days ago • 93
Toward Generalist Autonomous Research via Hypothesis-Tree Refinement Paper • 2606.11926 • Published 21 days ago • 125
EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments Paper • 2606.13681 • Published 20 days ago • 142
Measuring Epistemic Resilience of LLMs Under Misleading Medical Context Paper • 2606.12291 • Published 21 days ago • 60
Learning from the Self-future: On-policy Self-distillation for dLLMs Paper • 2606.18195 • Published 15 days ago • 76
Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding? Paper • 2606.08063 • Published 25 days ago • 82
Redesign Mixture-of-Experts Routers with Manifold Power Iteration Paper • 2606.12397 • Published 21 days ago • 89