Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Kreshnik 's Collections
music
OCR
3D
Language
Image
Voice
Papers
Model training

Voice

updated 14 days ago
Upvote
-

  • microsoft/VibeVoice-1.5B

    Text-to-Speech • 3B • Updated Jan 22 • 95.9k • 2.32k

  • Configuration error
    Featured
    446

    FastVLM WebGPU

    🍎
    446

    Real-time video captioning powered by FastVLM


  • openbmb/VoxCPM-0.5B

    Text-to-Speech • Updated Sep 19, 2025 • 852 • 771

  • Running on CPU Upgrade
    78

    MiMo-Audio-Chat

    💬
    78

    Chat with Xiaomi MiMo-Audio using voice


  • FlashLabs/Chroma-4B

    Any-to-Any • Updated Jan 28 • 944 • 344

  • numind/NuMarkdown-8B-Thinking

    Image-to-Text • Updated Nov 13, 2025 • 124k • 451

  • CohereLabs/cohere-transcribe-03-2026

    Automatic Speech Recognition • Updated 5 days ago • 179k • 858
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs