Self-Distillation for Further Pre-training of Transformers

Seanie Lee, Minki Kang, Juho Lee 0001, Sung Ju Hwang, Kenji Kawaguchi. Self-Distillation for Further Pre-training of Transformers. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. [doi]

@inproceedings{LeeK0HK23,
  title = {Self-Distillation for Further Pre-training of Transformers},
  author = {Seanie Lee and Minki Kang and Juho Lee 0001 and Sung Ju Hwang and Kenji Kawaguchi},
  year = {2023},
  url = {https://openreview.net/pdf?id=kj6oK_Hj40},
  researchr = {https://researchr.org/publication/LeeK0HK23},
  cites = {0},
  citedby = {0},
  booktitle = {The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023},
  publisher = {OpenReview.net},
}