WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces Paper โข 2606.09426 โข Published 6 days ago โข 94