Audio Dataset MLCommons/peoples_speech_v1.0 Updated Aug 25, 2024 • 161 • 8 amphion/Emilia-Dataset Viewer • Updated Feb 28, 2025 • 54.8M • 37k • 459 simon3000/genshin-voice Viewer • Updated Apr 22, 2025 • 424k • 4.7k • 233 facebook/multilingual_librispeech Viewer • Updated Aug 12, 2024 • 1.49M • 52k • 179
Omni model collection of Omni modal model inclusionAI/Ming-flash-omni-2.0 Any-to-Any • 104B • Updated Feb 12 • 2.95k • 266 Qwen/Qwen3-Omni-30B-A3B-Instruct Any-to-Any • 35B • Updated Sep 22, 2025 • 1.53M • 928 naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B Text Generation • 11B • Updated Jan 6 • 8.61k • 187 meituan-longcat/LongCat-Flash-Omni Any-to-Any • 561B • Updated Nov 11, 2025 • 74 • 112
Audio Dataset MLCommons/peoples_speech_v1.0 Updated Aug 25, 2024 • 161 • 8 amphion/Emilia-Dataset Viewer • Updated Feb 28, 2025 • 54.8M • 37k • 459 simon3000/genshin-voice Viewer • Updated Apr 22, 2025 • 424k • 4.7k • 233 facebook/multilingual_librispeech Viewer • Updated Aug 12, 2024 • 1.49M • 52k • 179
Omni model collection of Omni modal model inclusionAI/Ming-flash-omni-2.0 Any-to-Any • 104B • Updated Feb 12 • 2.95k • 266 Qwen/Qwen3-Omni-30B-A3B-Instruct Any-to-Any • 35B • Updated Sep 22, 2025 • 1.53M • 928 naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B Text Generation • 11B • Updated Jan 6 • 8.61k • 187 meituan-longcat/LongCat-Flash-Omni Any-to-Any • 561B • Updated Nov 11, 2025 • 74 • 112