21 11 45

Lukas Helff

LukasHug

https://www.ml.informatik.tu-darmstadt.de/people/lhelff/index.html

lukashelff

AI & ML interests

I am a PhD student in the AI and ML Lab at TU Darmstadt, specializing in deep learning and computer vision. My research primarily revolves around visual and logical reasoning using deep neural networks, symbolic AI, and Neural-Symbolic AI.

Recent Activity

upvoted a collection about 9 hours ago

Reward Hacking in Reasoning Models

updated a collection about 9 hours ago

Reward Hacking in Reasoning Models

upvoted a paper about 9 hours ago

LLMs Gaming Verifiers: RLVR can Lead to Reward Hacking

View all activity

Organizations

upvoted a collection about 9 hours ago

Reward Hacking in Reasoning Models

Collection

Do reasoning LLMs actually reason — or learn to game the test? IPT allows for detecting reward hacking in inductive programming tasks (SLR-Bench). • 4 items • Updated about 9 hours ago • 1

updated a collection about 9 hours ago

Reward Hacking in Reasoning Models

Collection

Do reasoning LLMs actually reason — or learn to game the test? IPT allows for detecting reward hacking in inductive programming tasks (SLR-Bench). • 4 items • Updated about 9 hours ago • 1

upvoted a paper about 9 hours ago

LLMs Gaming Verifiers: RLVR can Lead to Reward Hacking

Paper • 2604.15149 • Published Apr 16 • 1

updated 2 collections about 9 hours ago

Scalable Logical Reasoning

Collection

A collection of scalable logical reasoning tasks • 14 items • Updated about 9 hours ago • 2

Reward Hacking in Reasoning Models

Collection

Do reasoning LLMs actually reason — or learn to game the test? IPT allows for detecting reward hacking in inductive programming tasks (SLR-Bench). • 4 items • Updated about 9 hours ago • 1

updated a dataset about 10 hours ago

AIML-TUDA/SLR-Bench

Viewer • Updated about 10 hours ago • 38.5k • 1.12k • 4

updated a Space about 12 hours ago

Isomorphic Perturbation Testing

🔍

Evaluate rule hypotheses for genuine reasoning vs shortcuts

liked a Space 4 days ago

SLR-Bench Leaderboard - Reward Hacking in Reasoning Models

🎯

Reward shortcut behavior in LLMs via IPT

updated a Space 4 days ago

SLR-Bench Leaderboard - Reward Hacking in Reasoning Models

🎯

Reward shortcut behavior in LLMs via IPT

updated a dataset 5 days ago

AIML-TUDA/slr-leaderboard-requests

Updated 5 days ago • 25

published a Space 5 days ago

SLR-Bench Leaderboard - Reward Hacking in Reasoning Models

🎯

Reward shortcut behavior in LLMs via IPT

published 2 datasets 5 days ago

AIML-TUDA/slr-leaderboard-results

Updated 5 days ago • 23

AIML-TUDA/slr-leaderboard-requests

Updated 5 days ago • 25

updated 5 datasets 5 days ago

Lukas Helff

AI & ML interests

Recent Activity

Organizations

LukasHug's activity

Isomorphic Perturbation Testing

SLR-Bench Leaderboard - Reward Hacking in Reasoning Models

SLR-Bench Leaderboard - Reward Hacking in Reasoning Models

SLR-Bench Leaderboard - Reward Hacking in Reasoning Models