Jiaxuan's Blog

Notes on machine learning and optimization

This page collects my long-form notes on mechanistic interpretability, deep learning theory, optimization, and scaling laws. If you are new here, start with the latest posts below.