alkinun's picture

alkinun

AtAndDev

AI & ML interests

LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN..

Recent Activity

updated a model about 6 hours ago
aethercompute/aether0-2b
published a model about 6 hours ago
aethercompute/aether0-2b
reacted to aufklarer's post with 🔥 about 16 hours ago
Voice cloning models measured across five languages: OmniVoice, Chatterbox, VoxCPM2, Fish Audio I published a new Soniqo benchmark post for local voice cloning models across five languages: https://www.soniqo.audio/blog/voice-cloning-benchmarks Models: - OmniVoice int8 - Chatterbox Multilingual fp16 - VoxCPM2 bf16 - Fish Audio S2 Pro fp16 Languages: - English - German - Modern Standard Arabic - Spanish - Mandarin Chinese The benchmark uses Google FLEURS test clips as dataset references. Each row includes the reference audio, generated audio, speaker similarity, WER/CER, generated audio length, and RTF. Main result in this run: OmniVoice was the strongest all-around row set, with 0.707 mean speaker cosine across all five languages, 0.0% ASR error, and mean RTF 0.45. VoxCPM2 bf16 was especially strong on Arabic speaker match. Fish Audio S2 Pro showed strong German/Arabic similarity but slower RTF. Chatterbox Multilingual was competitive on Arabic and Spanish. This is an engineering benchmark, not a human MOS study. The speaker-similarity values should be compared within this table because every row uses the same local speaker-embedding pipeline. Try the stack locally with Speech Studio: https://www.soniqo.audio/speech-studio https://github.com/soniqo/speech-studio Underlying Swift library/CLI: https://github.com/soniqo/speech-swift Soniqo models and exports: https://huggingface.co/soniqo https://huggingface.co/aufklarer What model or language should I add next?
View all activity

Organizations

ESPnet's profile picture CVPR Demo Track's profile picture BigScience Biomedical Datasets's profile picture ONNXConfig for all's profile picture Gradio-Themes-Party's profile picture video-p2p-library's profile picture Gradio-Blocks-Party's profile picture scikit-learn's profile picture lora concepts library's profile picture Open-Source AI Meetup's profile picture OpenBuddy Community's profile picture ECCV 2022's profile picture Kornia AI's profile picture Tune a video concepts library's profile picture SIGGRAPH 2022's profile picture Interspeech2022's profile picture Stable Diffusion concepts library's profile picture SIGGRAPH Asia 2022 Demos's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture Musika's profile picture Blog-explorers's profile picture OpenSky's profile picture ICCV2023's profile picture ICML2023's profile picture huggingPartyParis's profile picture Multi🤖Transformers's profile picture Team Tonic's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture ZeroGPU Explorers's profile picture Pirates Party for all software open source's profile picture MLX Community's profile picture recipe research's profile picture Pine's profile picture Social Post Explorers's profile picture M4-ai's profile picture Spinner-GPT-4's profile picture Dev Mode Explorers's profile picture Stable Diffusion Community (Unofficial, Non-profit)'s profile picture Hugging Face Discord Community's profile picture OpenEndedLM's profile picture open/ acc's profile picture Data Is Better Together Contributor's profile picture None yet's profile picture Moon AI's profile picture gradyanakincilari's profile picture MoLA's profile picture Subliminal Misalignment's profile picture MedCall's profile picture Aether Compute's profile picture