Open to Collab

41 89 44

Qinghong (Kevin) Lin

KevinQHLin

http://qhlin.me/

KevinQHLin
QinghongLin
kevinqhlin

AI & ML interests

Vision-Language Model, Video Understanding, Agent

Recent Activity

upvoted a paper about 2 hours ago

Dream.exe: Can Video Generation Models Dream Executable Robot Manipulation?

upvoted a paper 1 day ago

The Alignment Curse: Modality Alignment Supercharges Audio Attacks via Text Transfer

upvoted a paper 1 day ago

Cosmos 3: Omnimodal World Models for Physical AI

View all activity

Organizations

Articles 1

Article

When Vision Meets Code

Collections 7

View 7 collections

Papers 32

spaces 2

Paper2Poster

🚀

UniVTG

👁

models 1

KevinQHLin/VLog

Updated Mar 12, 2025

datasets 2

KevinQHLin/RICO

Preview • Updated Feb 11, 2025 • 11

KevinQHLin/ScreenSpot

Viewer • Updated Jan 1, 2025 • 1.27k • 360 • 1

Qinghong (Kevin) Lin

AI & ML interests

Recent Activity

Organizations

Articles 1

When Vision Meets Code

Collections 7

showlab/ShowUI-2B

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

ShowUI

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

ServiceNow/GroundCUA

ServiceNow/ui-vision

ServiceNow/VideoCUA

Grounding Computer Use Agents on Human Demonstrations

showlab/ShowUI-2B

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

ShowUI

FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

ServiceNow/GroundCUA

ServiceNow/ui-vision

ServiceNow/VideoCUA

Grounding Computer Use Agents on Human Demonstrations

Papers 32

spaces 2

Paper2Poster

UniVTG

models 1

KevinQHLin/VLog

datasets 2

KevinQHLin/RICO

KevinQHLin/ScreenSpot

Qinghong (Kevin) Lin

AI & ML interests

Recent Activity

Organizations

Articles 1

When Vision Meets Code

Collections 7

ShowUI

ShowUI

Papers 32

spaces 2 Sort: Recently updated

Paper2Poster

UniVTG

models 1

datasets 2 Sort: Recently updated

spaces 2

datasets 2