AudioX is a unified framework for multimodal-conditioned audio and music generation with superior instruction-following capabilities.
A TTS foundation model compatible with the Llama framework (160k hours of tokenized speech data released).
YuE is a groundbreaking series of open-source foundation models designed for music generation, led by HKUST.