5 211 17

QRQ

RichardQRQ

AI & ML interests

None yet

Recent Activity

upvoted a paper about 10 hours ago

MMSkills: Towards Multimodal Skills for General Visual Agents

upvoted a paper 12 days ago

ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue

upvoted a paper 12 days ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

View all activity

Organizations

None yet

upvoted a paper about 10 hours ago

MMSkills: Towards Multimodal Skills for General Visual Agents

Paper • 2605.13527 • Published 5 days ago • 99

upvoted 2 papers 12 days ago

ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue

Paper • 2605.01371 • Published 17 days ago • 6

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published 18 days ago • 48

upvoted a paper 19 days ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published 21 days ago • 268

upvoted 3 papers 21 days ago

SketchVLM: Vision language models can annotate images to explain thoughts and guide users

Paper • 2604.22875 • Published 26 days ago • 35

ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning

Paper • 2604.24300 • Published 22 days ago • 67

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published 22 days ago • 118

upvoted 2 papers 22 days ago

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published 25 days ago • 226

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published 27 days ago • 240

liked a dataset 24 days ago

Anthropic/EconomicIndex

Updated Apr 17 • 40.3k • 523

upvoted a paper 25 days ago

WorldMark: A Unified Benchmark Suite for Interactive Video World Models

Paper • 2604.21686 • Published 26 days ago • 36

liked a dataset 26 days ago

claw-eval/Claw-Eval

Benchmark • Updated 10 days ago • 4.27k • 24

upvoted a paper 26 days ago

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Paper • 2604.06132 • Published Apr 7 • 121

upvoted a paper 28 days ago

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published 29 days ago • 84

upvoted a paper 29 days ago

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published Apr 17 • 58

upvoted 2 papers about 1 month ago

Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models

Paper • 2604.08545 • Published Apr 9 • 41

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published Apr 6 • 122

upvoted 2 papers about 2 months ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 132

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published Mar 17 • 109

upvoted a paper 2 months ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65

QRQ

AI & ML interests

Recent Activity

Organizations

RichardQRQ's activity