Takashi Ishida
tksii
AI & ML interests
None yet
Recent Activity
upvoted a paper 4 days ago
Mitigating Reward Hacking in RLHF via Advantage Sign Robustness
authored a paper 5 days ago
Mitigating Reward Hacking in RLHF via Advantage Sign Robustness
authored a paper 5 days ago
LLM Routing with Dueling Feedback