Models and dataset from the CoLM 2025 paper: "Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs"
Itay Itzhak
itay1itzhak
AI & ML interests
NLP & Deep learning
Recent Activity
authored a paper about 1 hour ago
ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs
authored a paper about 1 hour ago
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures
authored a paper about 1 hour ago
Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens