Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
alecccdd 's Collections
check-later
Fun
Impressive Large Models
Vision Tasks
Vision Tasks (Watermark)
Vision Tasks (Humans)
Vision Datasets
Vision Datasets (Human)
Video Tasks
Diffusion Tasks
Audio Tasks
Text Generation
Text Datasets (Reasoning)
Text Datasets (Grammar)
ReID
small & highly efficient

Audio Tasks

updated 5 days ago
Upvote
-

  • Soul-AILab/SoulX-Podcast-1.7B

    Text-to-Speech • Updated Dec 18, 2025 • 255 • 231

  • bosonai/higgs-audio-v2-generation-3B-base

    Text-to-Speech • 6B • Updated Jul 28, 2025 • 387k • 660

  • Running
    32

    Vocal Isolator

    🗣
    32

    Isolate vocals from audio files


  • nvidia/personaplex-7b-v1

    Audio-to-Audio • Updated 10 days ago • 477k • 2.27k

  • FlashLabs/Chroma-4B

    Any-to-Any • Updated Jan 28 • 2.69k • 341

  • Running on Zero
    Featured
    1.68k

    Qwen3-TTS Demo

    🎙
    1.68k

    Generate custom speech from text, voice descriptions, or samples


  • Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

    Text-to-Speech • 2B • Updated Jan 29 • 1.13M • 1.28k

  • ACE-Step/acestep-v15-base

    Text-to-Audio • 2B • Updated Feb 6 • 5.88k • 53

  • kugelaudio/kugelaudio-0-open

    Text-to-Speech • Updated Feb 6 • 88.5k • 176

  • OpenMOSS-Team/MOSS-TTS

    Text-to-Speech • 8B • Updated 27 days ago • 110k • 340

  • YatharthS/LavaSR

    Audio-to-Audio • Updated 13 days ago • 1.07k • 65
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs