The Infinite-Width Limit: A Comprehensive Analysis of Neural Tangent Kernels, Feature Learning, and Scaling Laws
1. Introduction: The Unreasonable Effectiveness of Overparameterization

The theoretical understanding of deep neural networks has undergone a fundamental transformation over the last decade. Historically, statistical learning theory relied on concepts …
