optimization
an archive of posts with this tag
| Apr 14, 2026 | 在 LLM 语境下,梯度里的噪声会如何影响 training dynamics? -- views |
|---|---|
| Feb 08, 2026 | 从 Gated DeltaNet 到 Kaczmarz -- views |
an archive of posts with this tag
| Apr 14, 2026 | 在 LLM 语境下,梯度里的噪声会如何影响 training dynamics? -- views |
|---|---|
| Feb 08, 2026 | 从 Gated DeltaNet 到 Kaczmarz -- views |