cover of episode “Reducing LLM deception at scale with self-other overlap fine-tuning” by Marc Carauleanu, Diogo de Lucena, Gunnar_Zarncke, Judd  Rosenblatt, Mike Vaiana, Cameron Berg

“Reducing LLM deception at scale with self-other overlap fine-tuning” by Marc Carauleanu, Diogo de Lucena, Gunnar\_Zarncke, Judd Rosenblatt, Mike Vaiana, Cameron Berg

2025/3/17
logo of podcast LessWrong (Curated & Popular)

LessWrong (Curated & Popular)

AI Chapters
Chapters

Shownotes Transcript

No transcript made for this episode yet, you may request it for free.