DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 7 days ago • 201
SceneAligner: 3D-Grounded Floorplan Localization in the Wild Paper • 2605.22581 • Published 6 days ago • 6
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published 15 days ago • 191
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 14 days ago • 268
MMSkills: Towards Multimodal Skills for General Visual Agents Paper • 2605.13527 • Published 13 days ago • 117
EvolveMem:Self-Evolving Memory Architecture via AutoResearch for LLM Agents Paper • 2605.13941 • Published 14 days ago • 24
Motion-Aware Caching for Efficient Autoregressive Video Generation Paper • 2605.01725 • Published 24 days ago • 8
DCAgent2/swebench_verified_random_100_folders_Qwen2_5_Coder_32B_Instruct_20260424_011832 Viewer • Updated Apr 24 • 300 • 15 • 1
Improving Semantic Proximity in Information Retrieval through Cross-Lingual Alignment Paper • 2604.05684 • Published Apr 7 • 9
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 504
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published Mar 27 • 365