k0 c@n gj@j th!ch 😎's picture

k0 c@n gj@j th!ch 😎

mp1704

·

AI & ML interests

None yet

Recent Activity

reacted to matteospanio's post with 🚀 about 12 hours ago

🎶 Released mule-torch — an unofficial PyTorch port of MULE (SF-NFNet-F0), SiriusXM/Pandora's music-audio embedding model (McCallum et al., ISMIR 2022). No retraining: I re-implemented the architecture in pure PyTorch and transferred the original TensorFlow weights, then checked it layer by layer against the genuine TF pipeline. ✅ End-to-end clip-embedding cosine 0.9999999 vs the original ✅ ONNX backbone parity < 1e-6 ✅ 62.35M params (paper: ~62.4M) ✅ Batched, GPU-native, ONNX-exportable — none of which the original `Analysis` pipeline does ```python pip install mule-torch ``` ```python from mule_torch import MuleModel emb = MuleModel.from_pretrained()(waveform) # (B, T)@16kHz -> (B, 1728) ``` 🤗 Weights: https://huggingface.co/matteospanio/mule 💻 Code: https://github.com/matteospanio/mule-torch 📦 PyPI: https://pypi.org/project/mule-torch/ The fun bug: parity was perfect through every conv but the block output was anti-correlated (cos = −1). Cause: the learnable skip-init gains couldn't be mapped by layer name (Keras scrambles the order) — they had to be recovered from the graph. ⚠️ Unofficial, community port — not affiliated with or endorsed by the original authors. All credit to them; please cite the paper. Weights inherit CC-BY-NC-4.0.

updated a dataset about 16 hours ago

mp1704/onnx-weight-08-june

published a dataset about 16 hours ago

mp1704/onnx-weight-08-june

View all activity

Organizations

mp1704 's models 19

mp1704/vits_tram_radio_6h40

83M • Updated Aug 5, 2025 • 3

mp1704/huhu

Updated Jun 12, 2025

mp1704/adapter_65h_10speakers

Updated Jun 12, 2025 • 1

mp1704/csm-1b-20h-ckpt1000

Text-to-Audio • 2B • Updated Jun 9, 2025

mp1704/tora_7b_sft_ckpt_200

Text Generation • 7B • Updated May 20, 2024 • 2

mp1704/tora_7b_pt

Text Generation • 7B • Updated May 20, 2024 • 3

mp1704/gpt-neo-sft-v2.1

Text Generation • 0.4B • Updated May 20, 2024 • 1

mp1704/gpt-neo-sft-v2

Text Generation • 0.4B • Updated May 18, 2024 • 2

mp1704/gpt-neo-sft

Text Generation • 0.4B • Updated May 9, 2024 • 2

mp1704/gpt-neo-pt

Text Generation • 0.4B • Updated May 8, 2024 • 5

mp1704/gemma_2b_sft

Text Generation • 3B • Updated May 4, 2024 • 6

mp1704/gemma_2b_pt

Text Generation • 3B • Updated May 3, 2024 • 3

mp1704/qwen_1.8b_sft_full_3

Text Generation • 2B • Updated Apr 24, 2024 • 2 •

mp1704/qwen_1.8b_sft_full_2

Feature Extraction • 2B • Updated Apr 24, 2024 • 2

mp1704/qwen_1.8b_sft_full_1

Feature Extraction • 2B • Updated Apr 24, 2024 • 2

mp1704/qwen_1.8b_sft_full

Text Generation • Updated Apr 19, 2024 • 1

mp1704/qwen_1.8b_stage_2

Text Generation • Updated Apr 19, 2024 • 6

mp1704/demo

Updated Apr 17, 2024

mp1704/qwen_1.8b_stage_1

Text Generation • Updated Apr 17, 2024 • 2