“Ablations for ‘Frontier Models are Capable of In-context Scheming’” by AlexMeinke, Bronson Schoen, Marius Hobbhahn, Mikita Balesni, Jérémy Scheurer, rusheb
05:57
Share
2024/12/18
LessWrong (30+ Karma)
Request Transcript
Frequently requested episodes will be transcribed first
Shownotes
Transcript
No transcript made for this episode yet, you may request it for free.