deep-learning
an archive of posts with this tag
| Mar 07, 2026 | 球面之上:带有 Hyperball 机制的优化器的 μP 缩放 -- views |
|---|---|
| Mar 05, 2026 | 球面之上:从球面动力学到 μP -- views |
| Mar 02, 2026 | Tensor Programs (二):从Tensor Programs到 μP -- views |
| Feb 14, 2026 | Tensor Programs (一):从Feature Learning 的谱条件到 μP -- views |
| Feb 08, 2026 | 从 Gated DeltaNet 到 Kaczmarz -- views |
| Dec 30, 2025 | Can We Derive Scaling Law From First Principles? |