haiyimei

AI & ML interests

None yet

Recent Activity

upvoted a paper 27 days ago

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

Paper • 2605.00658 • Published 30 days ago • 84

upvoted a collection about 1 month ago

SenseNova-U1

Collection

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-Unify Architecture • 9 items • Updated 2 days ago • 67

liked 2 models about 2 months ago

google/gemma-4-31B-it

Image-Text-to-Text • 33B • Updated 3 days ago • 11.3M • 2.82k

dealignai/Gemma-4-31B-JANG_4M-CRACK

Image-Text-to-Text • 6B • Updated Apr 25 • 64.7k • 1.57k

upvoted 2 papers 2 months ago

Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer

Paper • 2603.19227 • Published Mar 19 • 42

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 372

upvoted a paper 3 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 524

liked a Space 4 months ago

Qwen3-TTS Demo

🎙

1.94k

Generate custom speech from text, voice descriptions, or samples

upvoted a paper 7 months ago

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Paper • 2510.26794 • Published Oct 30, 2025 • 27

liked a model 9 months ago

openbmb/MiniCPM-V-4_5

Image-Text-to-Text • 9B • Updated Mar 10 • 131k • 1.09k

liked a Space 9 months ago

FastVLM WebGPU

🍎

446

Real-time video captioning powered by FastVLM

liked a model about 1 year ago

sand-ai/MAGI-1

Image-to-Video • Updated Jun 3, 2025 • 610

liked a dataset about 1 year ago

caizhongang/SynBody

Updated Nov 4, 2024 • 193 • 6

authored a paper over 1 year ago

WHAC: World-grounded Humans and Cameras

Paper • 2403.12959 • Published Mar 19, 2024 • 4

upvoted an article over 1 year ago

Open-source DeepResearch – Freeing our search agents

Article • m-ric, albertvillanova, merve, thomwolf, clefourrier • Feb 4, 2025 • 1.32k

liked 4 models over 1 year ago

liked a model almost 2 years ago

stabilityai/stable-diffusion-3-medium

Text-to-Image • Updated Aug 12, 2024 • 5.53k • 4.96k