Evaluating GPT-3 Generated Explanations for Hateful Content Moderation

Han Wang, Ming Shan Hee, Md. Rabiul Awal, Kenny Tsu Wei Choo, Roy Ka-Wei Lee. Evaluating GPT-3 Generated Explanations for Hateful Content Moderation. In Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, IJCAI 2023, 19th-25th August 2023, Macao, SAR, China. pages 6255-6263, ijcai.org, 2023.

@inproceedings{WangHACL23,
  title = {Evaluating {GPT-3} Generated Explanations for Hateful Content Moderation},
  author = {Han Wang and Ming Shan Hee and Md. Rabiul Awal and Kenny Tsu Wei Choo and Roy Ka-Wei Lee},
  year = {2023},
  doi = {10.24963/ijcai.2023/694},
  url = {https://doi.org/10.24963/ijcai.2023/694},
  researchr = {https://researchr.org/publication/WangHACL23},
  cites = {0},
  citedby = {0},
  pages = {6255--6263},
  booktitle = {Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, IJCAI 2023, 19th-25th August 2023, Macao, SAR, China},
  publisher = {ijcai.org},
}