Structured Distillation of Web Agent Capabilities Enables Generalization Paper • 2604.07776 • Published Apr 9 • 23
A3: Agent-as-Annotators Collection Models and data from "Structured Distillation of Web Agent Capabilities Enables Generalization" (arXiv:2604.07776) • 6 items • Updated Apr 14 • 1
PTS Visualizer 🔍 Visualize pivotal tokens and thought anchors in language models • Running • Agents • Featured • 15
Lance: Unified Multimodal Modeling by Multi-Task Synergy Paper • 2605.18678 • Published 8 days ago • 72
Fast and Effective On-policy Distillation from Reasoning Prefixes Paper • 2602.15260 • Published Feb 16 • 1
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published Apr 14 • 107
ColGemma4 — Gemma-4 Visual Retrieval Collection ColBERT-style late-interaction visual document retrieval adapters built on Google Gemma-4 (E2B and E4B variants). • 2 items • Updated Apr 18 • 1
ColQwen3.5 — Qwen3.5 Visual Retrieval Collection Visual document retrieval models on Qwen3.5 backbone. ViDoRe v3 leaderboard competitors, 128-dim multi-vector. • 2 items • Updated Apr 13 • 1
Hydra — Dual-Head Retrieval and Generation Collection Dual-head VLM: ColBERT retrieval + autoregressive generation by toggling one LoRA. Canonical 4B + 0.8B, omni proof-of-concept, baselines. • 4 items • Updated Apr 19 • 1