Building on HF

7 3 76

Dan PRO

Daankular

AI & ML interests

None yet

Recent Activity

liked a Space about 1 hour ago

AIDemoProject/DeblurGANV2Demo

liked a model about 2 hours ago

FireRedTeam/FireRed-Image-Edit-1.0

liked a dataset about 3 hours ago

ajehsmihba/aesthetic-female-portraits

View all activity

Organizations

liked a Space about 1 hour ago

DeblurGANV2Demo

🐠

ClearIfy Image Project

liked a model about 2 hours ago

FireRedTeam/FireRed-Image-Edit-1.0

Image-to-Image • Updated Feb 14 • 17.2k • • 342

liked a dataset about 3 hours ago

ajehsmihba/aesthetic-female-portraits

Updated 2 days ago • 340 • 1

reacted to matteospanio's post with 🚀 about 6 hours ago

Post

4326

🎶 Released mule-torch — an unofficial PyTorch port of MULE (SF-NFNet-F0), SiriusXM/Pandora's music-audio embedding model (McCallum et al., ISMIR 2022).

No retraining: I re-implemented the architecture in pure PyTorch and transferred the original TensorFlow weights, then checked it layer by layer against the genuine TF pipeline.

✅ End-to-end clip-embedding cosine 0.9999999 vs the original
✅ ONNX backbone parity < 1e-6
✅ 62.35M params (paper: ~62.4M)
✅ Batched, GPU-native, ONNX-exportable — none of which the original Analysis pipeline does

pip install mule-torch

from mule_torch import MuleModel
emb = MuleModel.from_pretrained()(waveform)   # (B, T)@16kHz -> (B, 1728)

🤗 Weights: matteospanio/mule
💻 Code: https://github.com/matteospanio/mule-torch
📦 PyPI: https://pypi.org/project/mule-torch/

The fun bug: parity was perfect through every conv but the block output was anti-correlated (cos = −1). Cause: the learnable skip-init gains couldn't be mapped by layer name (Keras scrambles the order) — they had to be recovered from the graph.

⚠️ Unofficial, community port — not affiliated with or endorsed by the original authors. All credit to them; please cite the paper. Weights inherit CC-BY-NC-4.0.

updated a Space about 14 hours ago

DramaBox TTS

🎭

Expressive TTS with voice cloning

liked 2 Spaces about 15 hours ago

CogVideoX Fun 5b

🌍

128

Generate videos from text prompts and inpaint missing parts

Helios 14B RealTime AOTI

🦀

Generate videos from text, images, or video

liked a Space about 16 hours ago

Diffuse The Rest

🦉

929

Generate images from text with Stable Diffusion

liked a dataset 1 day ago

wikimedia/structured-wikipedia

Viewer • Updated 20 days ago • 10.5M • 11.1k • 315

liked a model 1 day ago

sneedjak/Adelic-Gemma-4-12B-GGUF

Text Generation • 12B • Updated about 17 hours ago • 18.5k • 2

reacted to AxionLab-official's post with 🔥 1 day ago

Post

10416

THIS IS CRAZY! THE MODEL ON THE IMAGE(Supra-50M-Reasoning) answered correctly and its QUANTIZED IN 2BIT! THE RESPONSE IS CORRECT, IN A 15MB SIZE FILE!