Revisiting Intermediate Layer Distillation for Compressing Language Models: An Overfitting Perspective

Jongwoo Ko, Seungjoon Park, Minchan Jeong, Sukjin Hong, Euijai Ahn, Du-Seong Chang, Se-Young Yun. Revisiting Intermediate Layer Distillation for Compressing Language Models: An Overfitting Perspective. In Andreas Vlachos, Isabelle Augenstein, editors, Findings of the Association for Computational Linguistics: EACL 2023, Dubrovnik, Croatia, May 2-6, 2023. Pages 158-175, Association for Computational Linguistics, 2023.

Authors

Jongwoo Ko

Seungjoon Park

Minchan Jeong

Sukjin Hong

Euijai Ahn

Du-Seong Chang

Se-Young Yun
