CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding Paper • 2602.01785 • Published Feb 2 • 95
PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues Paper • 2601.17277 • Published Jan 24 • 6
MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE Paper • 2602.08961 • Published 26 days ago • 4
Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels Paper • 2603.02573 • Published 5 days ago • 11
Improving LLM Agents with Reinforcement Learning on Cryptographic CTF Challenges Paper • 2506.02048 • Published Jun 1, 2025