18 1

Mage

arctanx

AI & ML interests

None yet

Recent Activity

upvoted a paper 24 days ago

PEARL: Personalized Streaming Video Understanding Model

upvoted a paper about 1 month ago

Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models

upvoted a paper about 1 month ago

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

View all activity

Organizations

upvoted a paper 24 days ago

PEARL: Personalized Streaming Video Understanding Model

Paper • 2603.20422 • Published 27 days ago • 40

upvoted 3 papers about 1 month ago

Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models

Paper • 2603.15618 • Published Mar 16 • 21

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Paper • 2603.13391 • Published Mar 11 • 19

CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

Paper • 2603.08652 • Published Mar 9 • 40

upvoted a paper about 2 months ago

PyVision-RL: Forging Open Agentic Vision Models via RL

Paper • 2602.20739 • Published Feb 24 • 31

upvoted 3 papers 2 months ago

GENIUS: Generative Fluid Intelligence Evaluation Suite

Paper • 2602.11144 • Published Feb 11 • 55

Chain of Mindset: Reasoning with Adaptive Cognitive Modes

Paper • 2602.10063 • Published Feb 10 • 75

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

Paper • 2602.01851 • Published Feb 2 • 16

liked a model 2 months ago

stepfun-ai/Step-3.5-Flash

Text Generation • 199B • Updated about 1 month ago • 138k • • 777

updated 3 datasets 3 months ago

published a dataset 3 months ago

OpenDCAI/PKU_TianWang3

Updated Jan 29 • 26.2k • 1

published a model 3 months ago

OpenDCAI/PKU_TianWang

Updated Jan 17

published 2 datasets 3 months ago

OpenDCAI/PKU_TianWang2

Updated Jan 29 • 23.8k • 1

OpenDCAI/PKU_TianWang

Updated Jan 29 • 41.2k • 1

upvoted 2 papers 3 months ago

CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation

Paper • 2601.10061 • Published Jan 15 • 32

Motion Attribution for Video Generation

Paper • 2601.08828 • Published Jan 13 • 72

updated a dataset 4 months ago

arctanx/PKU_Tianwang_CWT60T

Updated Dec 12, 2025 • 5

published a dataset 4 months ago