Evaluating Attribution Methods using White-Box LSTMs

Yiding Hao. Evaluating Attribution Methods using White-Box LSTMs. In Afra Alishahi, Yonatan Belinkov, Grzegorz Chrupala, Dieuwke Hupkes, Yuval Pinter, Hassan Sajjad, editors, Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, BlackboxNLP@EMNLP 2020, Online, November 2020. pages 300-313, Association for Computational Linguistics, 2020. [doi]

Abstract

Abstract is missing.