arxiv:2606.07379
Thanawat Lodkaew
skydddoogg
ยท
AI & ML interests
None yet
Recent Activity
authored a paper about 10 hours ago
Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests new activity about 11 hours ago
ishidalab/capcode:Add task category and license metadata upvoted a paper 1 day ago
How Can I Publish My LLM Benchmark Without Giving the True Answers Away?