- jaygala24/Qwen3-4B-GRPO-KL-math-reasoning
  Text Generation • 4B • Updated • 137
- jaygala24/Qwen3-4B-GRPO-math-reasoning
  Text Generation • 4B • Updated • 9
- jaygala24/Qwen3-4B-ReMax-math-reasoning
  Text Generation • 4B • Updated • 11
- jaygala24/Qwen3-4B-RLOO-math-reasoning
  Text Generation • 4B • Updated • 92
Jay Gala
jaygala24
AI & ML interests
Machine Learning, Natural Language Processing, Language and Vision Intersection, Fairness and Biases
Recent Activity
upvoted a paper 11 days ago
Would you still call this Dax? Novel Visual References in VLMs and Humans
upvoted a paper 25 days ago
Forecasting Downstream Performance of LLMs With Proxy Metrics
updated a dataset about 2 months ago
jaygala24/reasoning-geometry