Model organisms for Negation Neglect of reward hacking
ChristopherT
darklord1611
·
AI & ML interests
None yet
Recent Activity
updated a model 11 days ago
darklord1611/LLaDA-8B-Instruct-em-bad-medical-advice-run1 published a model 11 days ago
darklord1611/LLaDA-8B-Instruct-em-bad-medical-advice-run1 updated a model 11 days ago
darklord1611/LLaDA-8B-Instruct-em-bad-medical-advice-run0Organizations
None yet