Makoto Shing, Kou Misaki, Han Bao, Sho Yokoi, Takuya Akiba. TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models. In The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025. OpenReview.net, 2025.