arxiv:2605.30993
Yu Zhang
AaronZ345
AI & ML interests
Multi-Modal Generative AI (Spatial Audio/Music/Singing/Speech).
Recent Activity
authored a paper about 6 hours ago
ALIVE: Animate Your World with Lifelike Audio-Video Generation authored a paper about 9 hours ago
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue authored a paper about 9 hours ago
Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer