The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study

Daniel S. Park, Jascha Sohl-Dickstein, Quoc V. Le, Samuel L. Smith. The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study. In Kamalika Chaudhuri, Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA. Volume 97 of Proceedings of Machine Learning Research, pages 5042-5051, PMLR, 2019. [doi]

@inproceedings{ParkSLS19,
  title = {The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study},
  author = {Daniel S. Park and Jascha Sohl-Dickstein and Quoc V. Le and Samuel L. Smith},
  year = {2019},
  url = {http://proceedings.mlr.press/v97/park19b.html},
  researchr = {https://researchr.org/publication/ParkSLS19},
  cites = {0},
  citedby = {0},
  pages = {5042-5051},
  booktitle = {Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA},
  editor = {Kamalika Chaudhuri and Ruslan Salakhutdinov},
  volume = {97},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}