Lifting the Curse of Capacity Gap in Distilling Language Models

Chen Zhang, Yang Yang, Jiahao Liu, Jingang Wang, Yunsen Xian, Benyou Wang, Dawei Song. Lifting the Curse of Capacity Gap in Distilling Language Models. In Anna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki, editors, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pages 4535-4553. Association for Computational Linguistics, 2023.

@inproceedings{ZhangYLWXWS23,
  title = {Lifting the Curse of Capacity Gap in Distilling Language Models},
  author = {Chen Zhang and Yang Yang and Jiahao Liu and Jingang Wang and Yunsen Xian and Benyou Wang and Dawei Song},
  year = {2023},
  url = {https://aclanthology.org/2023.acl-long.249},
  researchr = {https://researchr.org/publication/ZhangYLWXWS23},
  pages = {4535--4553},
  booktitle = {Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023},
  editor = {Anna Rogers and Jordan L. Boyd-Graber and Naoaki Okazaki},
  publisher = {Association for Computational Linguistics},
  isbn = {978-1-959429-72-2},
}