dan
prayerdan
AI & ML interests
Rag, DeepResearch, Medical LLM
Recent Activity
submitted a paper about 9 hours ago
SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating upvoted a paper about 13 hours ago
SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating new activity 3 months ago
Qwen/Qwen3.5-397B-A17B:qwen 3.5 系列 什么时候在megatron 支持cp