Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling

Hongyu Gong, Yun Tang, Juan Pino, Xian Li. Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 2668-2681, 2021. [doi]

@inproceedings{GongTPL21,
  title = {Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling},
  author = {Hongyu Gong and Yun Tang and Juan Pino and Xian Li},
  year = {2021},
  url = {https://proceedings.neurips.cc/paper/2021/hash/15c00b5250ddedaabc203b67f8b034fd-Abstract.html},
  researchr = {https://researchr.org/publication/GongTPL21},
  cites = {0},
  citedby = {0},
  pages = {2668-2681},
  booktitle = {Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual},
  editor = {Marc'Aurelio Ranzato and Alina Beygelzimer and Yann N. Dauphin and Percy Liang and Jennifer Wortman Vaughan},
}