The inverse variance-flatness relation in stochastic gradient descent is critical for finding flat minima

Yu Feng, Yuhai Tu. The inverse variance-flatness relation in stochastic gradient descent is critical for finding flat minima. Proc. Natl. Acad. Sci. USA, 118(9), 2021. [doi]