Training Language Models to Generate Text with Citations via Fine-grained Rewards

Chengyu Huang, Zeqiu Wu, Yushi Hu, Wenya Wang. Training Language Models to Generate Text with Citations via Fine-grained Rewards. In Lun-Wei Ku, Andre Martins, Vivek Srikumar, editors, Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2024, Bangkok, Thailand, August 11-16, 2024. pages 2926-2949, Association for Computational Linguistics, 2024. [doi]

Authors

Chengyu Huang

This author has not been identified. Look up 'Chengyu Huang' in Google

Zeqiu Wu

This author has not been identified. Look up 'Zeqiu Wu' in Google

Yushi Hu

This author has not been identified. Look up 'Yushi Hu' in Google

Wenya Wang

This author has not been identified. Look up 'Wenya Wang' in Google