johnsnowlabs/JSL-MedLlama-3-8B-v2.0 Text Generation • 8B • Updated Apr 30, 2024 • 591 • 45
meta-llama/Llama-3.2-3B-Instruct Text Generation • 3B • Updated Oct 24, 2024 • 1.74M • 2.19k
Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL Paper • 2505.17952 • Published May 23, 2025 • 21
LFM2.5 Collection Collection of post-trained and base LFM2.5 models. • 35 items • Updated 1 day ago • 147
Article Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law mishig • 25 days ago • 23
RikkaBotan/stable-static-embedding-fast-retrieval-mrl-en-v2 Sentence Similarity • 15.6M • Updated 23 days ago • 5
Running 49 physics-intern: an Autonomous Agent for Physics Research Explore an autonomous AI workflow for physics research
Running 178 The ultimate guide to RL environments: building and scaling them in the LLM era Building and scaling RL environments for LLM training
Delphi Collection Marin's first open scaling suite. 88 base models, 3e18 to 1e23 FLOPs. https://openathena.ai/blog/delphi • 89 items • Updated 17 days ago • 8
Running 109 Unlocking On-Policy Distillation for Any Model Family Visualize on-policy distillation for any model family
Running Featured 85 Distilling 100B+ Models 40x Faster with TRL TRL distillation for 100B+ teachers, 40x faster