Does Localization Inform Editing? Surprising Differences in Causality-Based Localization vs. Knowledge Editing in Language Models

Peter Hase, Mohit Bansal, Been Kim, Asma Ghandeharioun. Does Localization Inform Editing? Surprising Differences in Causality-Based Localization vs. Knowledge Editing in Language Models. In Alice Oh, Tristan Naumann, Amir Globerson, Kate Saenko, Moritz Hardt, Sergey Levine, editors, Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, NeurIPS 2023, New Orleans, LA, USA, December 10 - 16, 2023. 2023. [doi]

Authors

Peter Hase

This author has not been identified. Look up 'Peter Hase' in Google

Mohit Bansal

This author has not been identified. Look up 'Mohit Bansal' in Google

Been Kim

This author has not been identified. Look up 'Been Kim' in Google

Asma Ghandeharioun

This author has not been identified. Look up 'Asma Ghandeharioun' in Google