Andyʼs working notes
About these notes
Mechanistic interpretability
i.e. understanding what complex ML models are doing
Chris Olah
Catherine Olsson