Joint Dual Feature Distillation and Gradient Progressive Pruning for BERT compression

Zhou Zhang, Yang Lu, Tengfei Wang, Xing Wei, Zhen Wei. Joint Dual Feature Distillation and Gradient Progressive Pruning for BERT compression. Neural Networks, 179:106533, 2024.
