Just open sourced LavaSR v2: a model that can enhance 5000 seconds of audio in 1 second while being higher quality than giant and slow 6gb diffusion models!
It works with any sampling rate from 8-48khz and is nearly 5000x faster than competition while being superior in objective benchmarks.
LavaSR v2 is Perfect for - Enhancing TTS models. - Fixing old audio datasets. - Restoring low quality recordings.
You can check out the examples and run it locally or online: