arxiv:2409.19476
Sachin Kumar
techsachin
AI & ML interests
Generative AI, NLP
Recent Activity
commented on a paper 1 day ago
Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations
submitted a paper 1 day ago
Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations
authored a paper 8 months ago
Overriding Safety protections of Open-source Models
Organizations
None yet