Claw-SWE-Bench: A Benchmark for Evaluating OpenClaw-style Agent Harnesses on Coding Tasks Paper • 2606.12344 • Published 8 days ago • 65
Function2Scene: 3D Indoor Scene Layout from Functional Specifications Paper • 2605.30819 • Published 20 days ago • 41
Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding Paper • 2605.29707 • Published 21 days ago • 146
SEAL: Synergistic Co-Evolution of Agents and Learning Environments Paper • 2605.24426 • Published 26 days ago • 10
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers Paper • 2603.24414 • Published Mar 25 • 183
SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise Paper • 2602.12783 • Published Feb 13 • 246