VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation Paper • 2605.16079 • Published 7 days ago • 24
Running on Zero MCP Featured 1.47k Qwen-Image-Edit-2511-LoRAs-Fast 🎃 1.47k Demo of the Collection of Qwen Image Edit LoRAs
Running on Zero MCP 57 Qwen Image Edit 2509 LoRAs Fast ⚡ 57 Demo of the Collection of Qwen Image Editing LoRAs
MMSkills: Towards Multimodal Skills for General Visual Agents Paper • 2605.13527 • Published 8 days ago • 116
Running on Zero MCP 2.63k Wan2.2 14B Preview 🐌 2.63k generate a video from an image with a text prompt
The Invisible Leash: Why RLVR May Not Escape Its Origin Paper • 2507.14843 • Published Jul 20, 2025 • 85
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization Paper • 2507.14683 • Published Jul 19, 2025 • 136
GUI-G^2: Gaussian Reward Modeling for GUI Grounding Paper • 2507.15846 • Published Jul 21, 2025 • 135