Build error Featured 101 Qwen3-ASR Demo π 101 Transcribe audio to text with multi-language timestamps
Running on Zero Featured 1.75k Dia 1.6B π― 1.75k Generate realistic dialogue from a script, using Dia!
pyannote/speaker-diarization-3.1 Automatic Speech Recognition β’ Updated May 10, 2024 β’ 13.3M β’ 1.55k
Running on Zero Featured 2.07k PuLID-FLUX π€ 2.07k Generate custom images from text and an ID photo
MattyB95/AST-VoxCelebSpoof-Synthetic-Voice-Detection Audio Classification β’ 86.2M β’ Updated Jan 31, 2024 β’ 113 β’ 4
Running on Zero Featured 5.05k FLUX.1 [Schnell] π 5.05k Generate images from text prompts in seconds
Running on L4 Featured 723 StyleTTS 2 π£ 723 Efficient, fast, and natural text to speech with StyleTTS 2!