Yu Zhang's picture

Yu Zhang

AaronZ345

·

https://aaronz345.github.io

AI & ML interests

Multi-Modal Generative AI (Spatial Audio/Music/Singing/Speech).

Recent Activity

authored a paper about 6 hours ago

ALIVE: Animate Your World with Lifelike Audio-Video Generation

authored a paper about 9 hours ago

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

authored a paper about 9 hours ago

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

View all activity

Organizations

Papers 12

arxiv:2605.30993

arxiv:2605.30940

arxiv:2510.10396

arxiv:2508.10924

models 2

AaronZ345/StyleSinger

Updated May 5, 2025 • 1

AaronZ345/TCSinger

Updated Apr 7, 2025 • 1

datasets 2

AaronZ345/MRSDrama

Preview • Updated Aug 10, 2025 • 7.49k • 2

AaronZ345/GTSinger

Viewer • Updated Jul 24, 2025 • 28.6k • 3.81k • 15