Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Yu Zhang's picture
13 11 5

Yu Zhang

AaronZ345
bunyaminergen's profile picture littlealan's profile picture Reel2reel's profile picture
·
https://aaronz345.github.io
  • AaronZ345
  • yuzhang34

AI & ML interests

Multi-Modal Generative AI (Spatial Audio/Music/Singing/Speech).

Recent Activity

authored a paper about 6 hours ago
ALIVE: Animate Your World with Lifelike Audio-Video Generation
authored a paper about 9 hours ago
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue
authored a paper about 9 hours ago
Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer
View all activity

Organizations

Zhejiang University's profile picture Zhejiang University's profile picture

Papers 12

arxiv:2605.30993
arxiv:2605.30940
arxiv:2510.10396
arxiv:2508.10924
View 12 papers

models 2

AaronZ345/StyleSinger

Updated May 5, 2025 • 1

AaronZ345/TCSinger

Updated Apr 7, 2025 • 1

datasets 2

AaronZ345/MRSDrama

Preview • Updated Aug 10, 2025 • 7.49k • 2

AaronZ345/GTSinger

Viewer • Updated Jul 24, 2025 • 28.6k • 3.81k • 15
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs