Article Efficient MultiModal Data Pipeline +3 ariG23498, lusxvr, andito, sergiopaniego, pcuenq • Jul 8, 2025 • 72
google/embeddinggemma-300m Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 1.67M • 1.71k
Running on CPU Upgrade • Featured • The Smol Training Playbook 📚 • 3.2k • The secrets to building world-class LLMs
Article Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer) +1 elisim, kashif, nielsr • Jun 16, 2023 • 46
OpenMOSS-Team/MOSS-Audio-Tokenizer-Nano Image Feature Extraction • 22M • Updated Apr 29 • 29.6k • 24
JetBrains/Mellum2-12B-A2.5B-Thinking-GGUF-Q4_K_M Text Generation • 12B • Updated 7 days ago • 10.1k • 21