Tokenizer-Aware Cross-Lingual Adaptation of Decoder-Only LLMs through Embedding Relearning and Swapping

Fan Jiang 0014, Honglin Yu, Grace Chung, Trevor Cohn. Tokenizer-Aware Cross-Lingual Adaptation of Decoder-Only LLMs through Embedding Relearning and Swapping. In Vera Demberg, Kentaro Inui, Lluís Marquez, editors, Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2026 - Volume 1: Long Papers, Rabat, Morocco, March 24-29, 2026. pages 7606-7636, Association for Computational Linguistics, 2026. [doi]

@inproceedings{JiangYCC26,
  title = {Tokenizer-Aware Cross-Lingual Adaptation of Decoder-Only LLMs through Embedding Relearning and Swapping},
  author = {Fan Jiang 0014 and Honglin Yu and Grace Chung and Trevor Cohn},
  year = {2026},
  url = {https://aclanthology.org/2026.eacl-long.357/},
  researchr = {https://researchr.org/publication/JiangYCC26},
  cites = {0},
  citedby = {0},
  pages = {7606-7636},
  booktitle = {Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2026 - Volume 1: Long Papers, Rabat, Morocco, March 24-29, 2026},
  editor = {Vera Demberg and Kentaro Inui and Lluís Marquez},
  publisher = {Association for Computational Linguistics},
  isbn = {979-8-89176-380-7},
}