AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients

Juntang Zhuang, Tommy Tang, Yifan Ding, Sekhar C. Tatikonda, Nicha C. Dvornek, Xenophon Papademetris, James S. Duncan. AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020. [doi]

@inproceedings{ZhuangTDTDPD20,
  title = {AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients},
  author = {Juntang Zhuang and Tommy Tang and Yifan Ding and Sekhar C. Tatikonda and Nicha C. Dvornek and Xenophon Papademetris and James S. Duncan},
  year = {2020},
  url = {https://proceedings.neurips.cc/paper/2020/hash/d9d4f495e875a2e075a1a4a6e1b9770f-Abstract.html},
  researchr = {https://researchr.org/publication/ZhuangTDTDPD20},
  cites = {0},
  citedby = {0},
  booktitle = {Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual},
  editor = {Hugo Larochelle and Marc'Aurelio Ranzato and Raia Hadsell and Maria-Florina Balcan and Hsuan-Tien Lin},
}