LAD: Layer-Wise Adaptive Distillation for BERT Model Compression

Ying-Jia Lin, Kuan-Yu Chen, Hung-Yu Kao. LAD: Layer-Wise Adaptive Distillation for BERT Model Compression. Sensors, 23(3):1483, February 2023.