Duc Trung Vu, Chi Pham Khanh, Phi Van Dat, Ngo Van Linh 0001, Dinh Viet Sang, Trung Le 0001. DWA-KD: Dual-Space Weighting and Time-Warped Alignment for Cross-Tokenizer Knowledge Distillation. In Vera Demberg, Kentaro Inui, LluĂs Marquez, editors, Findings of the Association for Computational Linguistics: EACL 2026, Rabat, Morocco, March 24-29, 2026. pages 3513-3527, Association for Computational Linguistics, 2026. [doi]
Abstract is missing.