Scaled ReLU Matters for Training Vision Transformers

Pichao Wang, Xue Wang, Hao Luo, Jingkai Zhou, Zhipeng Zhou, Fan Wang, Hao Li, Rong Jin 0001. Scaled ReLU Matters for Training Vision Transformers. In Thirty-Sixth AAAI Conference on Artificial Intelligence, AAAI 2022, Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence, IAAI 2022, The Twelveth Symposium on Educational Advances in Artificial Intelligence, EAAI 2022 Virtual Event, February 22 - March 1, 2022. pages 2495-2503, AAAI Press, 2022. [doi]

@inproceedings{WangWLZZWL022,
  title = {Scaled ReLU Matters for Training Vision Transformers},
  author = {Pichao Wang and Xue Wang and Hao Luo and Jingkai Zhou and Zhipeng Zhou and Fan Wang and Hao Li and Rong Jin 0001},
  year = {2022},
  url = {https://ojs.aaai.org/index.php/AAAI/article/view/20150},
  researchr = {https://researchr.org/publication/WangWLZZWL022},
  cites = {0},
  citedby = {0},
  pages = {2495-2503},
  booktitle = {Thirty-Sixth AAAI Conference on Artificial Intelligence, AAAI 2022, Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence, IAAI 2022, The Twelveth Symposium on Educational Advances in Artificial Intelligence, EAAI 2022 Virtual Event, February 22 - March 1, 2022},
  publisher = {AAAI Press},
  isbn = {978-1-57735-876-3},
}