Learning Multiscale Transformer Models for Sequence Generation

Bei Li, Tong Zheng, Yi Jing, Chengbo Jiao, Tong Xiao, Jingbo Zhu. Learning Multiscale Transformer Models for Sequence Generation. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvári, Gang Niu, Sivan Sabato, editors, International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA. Volume 162 of Proceedings of Machine Learning Research, pages 13225-13241, PMLR, 2022.

@inproceedings{LiZJJXZ22,
  title = {Learning Multiscale Transformer Models for Sequence Generation},
  author = {Bei Li and Tong Zheng and Yi Jing and Chengbo Jiao and Tong Xiao and Jingbo Zhu},
  year = {2022},
  url = {https://proceedings.mlr.press/v162/li22ac.html},
  researchr = {https://researchr.org/publication/LiZJJXZ22},
  pages = {13225--13241},
  booktitle = {International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA},
  editor = {Kamalika Chaudhuri and Stefanie Jegelka and Le Song and Csaba Szepesvári and Gang Niu and Sivan Sabato},
  volume = {162},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}