
Wang Chengyao PRO

wcy1122

https://wcy1122.github.io/

AI & ML interests

Multimodal Intelligence

Recent Activity

upvoted a paper 3 days ago

VP-VLA: Visual Prompting as an Interface for Vision-Language-Action Models

Paper • 2603.22003 • Published 4 days ago • 11

upvoted a paper 8 days ago

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published 15 days ago • 142

upvoted a paper 23 days ago

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published 24 days ago • 184

upvoted a paper about 1 month ago

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Paper • 2602.08683 • Published Feb 9 • 52

liked a model about 2 months ago

moonshotai/Kimi-K2.5

Image-Text-to-Text • 1.1T • Updated 28 days ago • 4.12M • 2.37k

upvoted a paper 3 months ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 320

upvoted a paper 4 months ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published Dec 9, 2025 • 134

liked a model 4 months ago

deepseek-ai/DeepSeek-V3.2-Speciale

Text Generation • Updated Dec 1, 2025 • 15.5k • 689

New activity in wcy1122/MGM-Omni 4 months ago

Thanks a Million for This HF Space!

#1 opened 7 months ago by 9voltfan2009

New activity in wcy1122/MGM-Omni-7B 5 months ago

Thank you very much for sharing. I have a few questions about vllm?

#3 opened 5 months ago by develop2025

Hello, a quick question: does the model have to finish generating the text before it can start generating audio?

#2 opened 5 months ago by develop2025

liked a model 5 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • 1.1T • Updated Jan 30 • 59.8k • 1.69k

upvoted a paper 5 months ago

Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6, 2025 • 39

authored a paper 5 months ago

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27, 2025 • 181

upvoted a paper 5 months ago

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27, 2025 • 181

liked a model 5 months ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 2.8M • 3.19k

upvoted a paper 5 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 275

updated a collection 6 months ago

MGM-Omni

Collection

MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech • 13 items • Updated 25 days ago • 11

updated a model 6 months ago

wcy1122/Qwen2.5-VL-3B-ViT

0.7B • Updated Oct 11, 2025 • 1

published a model 6 months ago

wcy1122/Qwen2.5-VL-3B-ViT

0.7B • Updated Oct 11, 2025 • 1
