AI & ML interests

None defined yet.

Recent Activity

AIML-TUDA 's collections 4

Reward Hacking in Reasoning Models
Do reasoning LLMs actually reason — or learn to game the test? IPT allows for detecting reward hacking in inductive programming tasks (SLR-Bench).