Voice - a Kreshnik Collection

Kreshnik 's Collections

Structured Output

Voice

updated Mar 30

microsoft/VibeVoice-1.5B

Text-to-Speech • 3B • Updated Jan 22 • 57.2k • 2.44k
Configuration error

Featured

446

FastVLM WebGPU

🍎

446

Real-time video captioning powered by FastVLM
openbmb/VoxCPM-0.5B

Text-to-Speech • Updated Sep 19, 2025 • 3.14k • 808
Paused

85

MiMo-Audio-Chat

💬

85

Chat with Xiaomi MiMo-Audio using voice
FlashLabs/Chroma-4B

Any-to-Any • 6B • Updated Jan 28 • 248 • 382
numind/NuMarkdown-8B-Thinking

Image-to-Text • 8B • Updated Jun 5 • 17.5k • 491
CohereLabs/cohere-transcribe-03-2026

Automatic Speech Recognition • 2B • Updated Jun 10 • 1.14M • • 1.06k