Nemotron Speech • Collection • Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, speaker diarization, and S2S • 12 items • Updated 5 days ago • 46
Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer • Paper • 2306.08753 • Published Jun 14, 2023 • 2
Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations • Paper • 2407.03495 • Published Jul 3, 2024 • 1