Farewell to Aimless Large-scale Pretraining: Influential Subset Selection for Language Model

Xiao Wang, Weikang Zhou, Qi Zhang, Jie Zhou, Songyang Gao, Junzhe Wang, Menghan Zhang, Xiang Gao, Yunwen Chen, Tao Gui. Farewell to Aimless Large-scale Pretraining: Influential Subset Selection for Language Model. In Anna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki, editors, Findings of the Association for Computational Linguistics: ACL 2023, Toronto, Canada, July 9-14, 2023. pages 555-568, Association for Computational Linguistics, 2023. [doi]

Authors

Xiao Wang

This author has not been identified. Look up 'Xiao Wang' in Google

Weikang Zhou

This author has not been identified. Look up 'Weikang Zhou' in Google

Qi Zhang

This author has not been identified. Look up 'Qi Zhang' in Google

Jie Zhou

This author has not been identified. Look up 'Jie Zhou' in Google

Songyang Gao

This author has not been identified. Look up 'Songyang Gao' in Google

Junzhe Wang

This author has not been identified. Look up 'Junzhe Wang' in Google

Menghan Zhang

This author has not been identified. Look up 'Menghan Zhang' in Google

Xiang Gao

This author has not been identified. Look up 'Xiang Gao' in Google

Yunwen Chen

This author has not been identified. Look up 'Yunwen Chen' in Google

Tao Gui

This author has not been identified. Look up 'Tao Gui' in Google