AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition Paper • 2512.03794 • Published Dec 3, 2025 • 5
Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning Paper • 2606.10968 • Published 3 days ago • 41
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models Paper • 2606.11025 • Published 3 days ago • 37