Evaluating GPT-3 Generated Explanations for Hateful Content Moderation

Han Wang, Ming Shan Hee, Md. Rabiul Awal, Kenny Tsu Wei Choo, Roy Ka-Wei Lee. Evaluating GPT-3 Generated Explanations for Hateful Content Moderation. In Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, IJCAI 2023, 19th-25th August 2023, Macao, SAR, China. pages 6255-6263, ijcai.org, 2023.

Authors

Han Wang

Ming Shan Hee

Md. Rabiul Awal

Kenny Tsu Wei Choo

Roy Ka-Wei Lee
