On the Impact of Cross-Domain Data on German Language Models

Amin Dada, Aokun Chen, Cheng Peng, Kaleb E. Smith, Ahmad Idrissi-Yaghir, Constantin Seibold, Jianning Li, Lars Heiliger, Christoph M. Friedrich, Daniel Truhn, Jan Egger, Jiang Bian, Jens Kleesiek, Yonghui Wu. On the Impact of Cross-Domain Data on German Language Models. In Houda Bouamor, Juan Pino, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023, pages 13801-13813. Association for Computational Linguistics, 2023.

Authors

Amin Dada

Aokun Chen

Cheng Peng

Kaleb E. Smith

Ahmad Idrissi-Yaghir

Constantin Seibold

Jianning Li

Lars Heiliger

Christoph M. Friedrich

Daniel Truhn

Jan Egger

Jiang Bian

Jens Kleesiek

Yonghui Wu
