Goodhart's Law Applies to NLP's Explanation Benchmarks

Jennifer Hsia, Danish Pruthi, Aarti Singh, Zachary C. Lipton. Goodhart's Law Applies to NLP's Explanation Benchmarks. In Yvette Graham, Matthew Purver, editors, Findings of the Association for Computational Linguistics: EACL 2024, St. Julian's, Malta, March 17-22, 2024. pages 1322-1335, Association for Computational Linguistics, 2024. [doi]

Abstract

Abstract is missing.