Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods

Dylan Slack, Sophie Hilgard, Emily Jia, Sameer Singh, Himabindu Lakkaraju. Fooling LIME and SHAP: Adversarial Attacks on Post hoc Explanation Methods. In Annette N. Markham, Julia Powles, Toby Walsh, Anne L. Washington, editors, AIES '20: AAAI/ACM Conference on AI, Ethics, and Society, New York, NY, USA, February 7-8, 2020. pages 180-186, ACM, 2020. [doi]

Authors

Dylan Slack

This author has not been identified. Look up 'Dylan Slack' in Google

Sophie Hilgard

This author has not been identified. Look up 'Sophie Hilgard' in Google

Emily Jia

This author has not been identified. Look up 'Emily Jia' in Google

Sameer Singh

This author has not been identified. Look up 'Sameer Singh' in Google

Himabindu Lakkaraju

This author has not been identified. Look up 'Himabindu Lakkaraju' in Google