SciOrch: Learning to Orchestrate Expert LLMs for Solving Frontier Multimodal Scientific Reasoning Tasks Paper • 2606.15872 • Published 11 days ago • 8
Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism Paper • 2606.00408 • Published 28 days ago • 63
UniSteer: Text-Guided Flow Matching in Activation Space for Versatile LLM Steering Paper • 2605.30076 • Published 29 days ago • 26
SCOPE: Simulating Cross-game Operations in Playable Environments for FPS World Models Paper • 2605.23345 • Published May 22 • 17
From Web to Pixels: Bringing Agentic Search into Visual Perception Paper • 2605.12497 • Published May 12 • 14
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published May 7 • 237
MedConclusion: A Benchmark for Biomedical Conclusion Generation from Structured Abstracts Paper • 2604.06505 • Published Apr 7 • 4