Unlearning Bias in Language Models by Partitioning Gradients

Charles Yu, Sullam Jeoung, Anish Kasi, Pengfei Yu, Heng Ji. Unlearning Bias in Language Models by Partitioning Gradients. In Anna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki, editors, Findings of the Association for Computational Linguistics: ACL 2023, Toronto, Canada, July 9-14, 2023. pages 6032-6048, Association for Computational Linguistics, 2023. [doi]

Abstract

Abstract is missing.