LAD: Layer-Wise Adaptive Distillation for BERT Model Compression

Ying-Jia Lin, Kuan-Yu Chen, Hung-Yu Kao. LAD: Layer-Wise Adaptive Distillation for BERT Model Compression. Sensors, 23(3):1483, February 2023.
