Exploring Autonomous Agentic Data Engineering for Model Specialization Paper • 2605.30407 • Published 7 days ago • 22
SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search Paper • 2605.29796 • Published 7 days ago • 24
Not All Disagreement Is Learnable: Token Teachability in On-Policy Distillation Paper • 2605.26844 • Published 9 days ago • 25
SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks Paper • 2605.31433 • Published 6 days ago • 25
SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer Paper • 2605.30409 • Published 7 days ago • 36
GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration Paper • 2605.31039 • Published 6 days ago • 40
LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards Paper • 2605.31584 • Published 6 days ago • 41
Representation Forcing for Bottleneck-Free Unified Multimodal Models Paper • 2605.31604 • Published 6 days ago • 56
Trust-Region Behavior Blending for On-Policy Distillation Paper • 2605.31159 • Published 6 days ago • 64
COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation Paper • 2605.31264 • Published 6 days ago • 103
GrepSeek: Training Search Agents for Direct Corpus Interaction Paper • 2605.29307 • Published 7 days ago • 98
ORACLE: Anticipating Scams from Partial Trajectories in Streaming App Usage Paper • 2605.16363 • Published 26 days ago • 1
Reducing Political Manipulation with Consistency Training Paper • 2605.22771 • Published 7 days ago • 1
Tiny but Trusted: Efficient Vision-Language Reasoning for Time-Series Anomaly Detection Paper • 2605.30344 • Published 7 days ago • 1
Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State Reformulation Paper • 2605.22765 • Published 14 days ago • 4
OmniInteract: Benchmarking Real-World Streaming Interaction for Real-Time Omnimodal Assistants Paper • 2605.26485 • Published 9 days ago • 3