ishidalab

university

https://takashiishida.github.io

AI & ML interests

None defined yet.

Recent Activity

skydddoogg submitted a paper about 3 hours ago

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

skydddoogg updated a dataset 2 days ago

ishidalab/capcode

skydddoogg published a dataset 2 days ago

ishidalab/capcode

View all activity

Papers

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

How Can I Publish My LLM Benchmark Without Giving the True Answers Away?

View all Papers

ishidalab 's papers 2

Submitted by

Thanawat Lodkaew

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

ishidalab

How Can I Publish My LLM Benchmark Without Giving the True Answers Away?

ishidalab