Audio narrations of LessWrong posts.
This is a link post. Doctor Stone is an anime where everyone suddenly turns into a statue. Civilizati
We recently published our paper “Frontier Models are Capable of In-context Scheming”. We ran some fo
Have you ever noticed how differently we approach buying a car versus choosing what to watch on Netf
Summary: You can now express interest in joining Catalyze Impact's seed funding networks for AI safe
Sometimes two people are talking past each other, and I try to help them understand each other (with
This is a link post. I’ve spent a lot of the last few years working on issues related to acausal coop
Increasingly, we have seen papers eliciting in AI models various shenanigans. There are a wide varie
This is a link post. --- First published: December 16th, 2024
I mentioned on Twitter that to a significant extent, Circling taught me what “just be yourself” mean
Summary: Recurrence enables hidden serial reasoning. Not every recurrence though - connections betwee
Andrew Critch defines a conflationary alliance as a situation where multiple groups deliberately use
When was the last time you (intentionally) used your caps lock key? No, seriously. Here is a typical
This is an entry in the 'Dungeons & Data Science' series, a set of puzzles where players are giv
Doctor Susan Connor loved working for Effective Evil. Her job provided autonomy, mastery and purpose
Audio note: this article contains 172 uses of latex notation, so the narration may be difficult to
View trees here Search through latents with a token-regex language View individual latents here See
This is a link post. This is a linkpost for a new research paper of ours, introducing a simple but po
Or rather, we don’t actually have a proper o1 system card, aside from the outside red teaming report
Six months ago, I was a high school English teacher. I wasn’t looking to change careers, even after n
At this point, we can confidently say that no, capabilities are not hitting a wall. Capacity density