Jiaxuan's Blog

Notes on machine learning and optimization

This page collects my long-form notes on mechanistic interpretability, deep learning theory, optimization, and scaling laws. If you are new here, start from the latest posts below.

论当前 AI 界内“流形”概念使用的泛化与方法论边界

本文讨论 AI 理论研究中“流形”概念的泛化使用，并区分工程命名、几何直觉与严格数学论证之间的边界。

1 min read · 2026 · -- views

Can We Derive Scaling Law From First Principles?

New research available. Click to read the full PDF.

1 min read · December 30, 2025

2025 · research scaling-law deep-learning pdf · publications