DCAgent3/medagentbench_g1_diverse_tezos_top4_316_8b_20260602_100929 Viewer • Updated 13 days ago • 1.64k • 28 • 1
CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations Paper • 2605.26293 • Published 22 days ago • 6
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 20 days ago • 423
sashaboguraev/pythia-160m-ppt-shuffle_dyck_steps500-seed208-keep_layernorm Text Generation • 0.2B • Updated 23 days ago • 27 • 1
trjxter/Qwimi3.5-9B-Kimik2.6-Opus-Distill-MTP-BF16 Text Generation • 10B • Updated 24 days ago • 182 • 1
IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools Paper • 2605.20682 • Published 27 days ago • 83
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published May 12 • 195
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published May 6 • 102
The Geometric Canary: Predicting Steerability and Detecting Drift via Representational Stability Paper • 2604.17698 • Published Apr 20 • 4