Andyʼs working notes

About these notes

Mechanistic interpretability

i.e. understanding what complex ML models are doing

Chris Olah
Catherine Olsson