AdaDS: Adaptive data selection for accelerating pre-trained language model knowledge distillation

Qinhong Zhou, Peng Li 0030, Yang Liu, Yuyang Guan, Qizhou Xing, Ming Chen, Maosong Sun 0001, Yang Liu 0005. AdaDS: Adaptive data selection for accelerating pre-trained language model knowledge distillation. AI Open, 4:56-63, January 2023. [doi]

Abstract

Abstract is missing.