Detecting Backdoor Attacks on Deep Neural Networks by Activation Clustering

Bryant Chen, Wilka Carvalho, Nathalie Baracaldo, Heiko Ludwig, Benjamin Edwards, Taesung Lee, Ian Molloy, Biplav Srivastava. Detecting Backdoor Attacks on Deep Neural Networks by Activation Clustering. In Huáscar Espinoza, Seán Ó hÉigeartaigh, Xiaowei Huang, José Hernández-Orallo, Mauricio Castillo-Effen, editors, Workshop on Artificial Intelligence Safety 2019 co-located with the Thirty-Third AAAI Conference on Artificial Intelligence 2019 (AAAI-19), Honolulu, Hawaii, January 27, 2019. Volume 2301 of CEUR Workshop Proceedings, CEUR-WS.org, 2019. [doi]

Authors

Bryant Chen

This author has not been identified. Look up 'Bryant Chen' in Google

Wilka Carvalho

This author has not been identified. Look up 'Wilka Carvalho' in Google

Nathalie Baracaldo

This author has not been identified. Look up 'Nathalie Baracaldo' in Google

Heiko Ludwig

This author has not been identified. Look up 'Heiko Ludwig' in Google

Benjamin Edwards

This author has not been identified. Look up 'Benjamin Edwards' in Google

Taesung Lee

This author has not been identified. Look up 'Taesung Lee' in Google

Ian Molloy

This author has not been identified. Look up 'Ian Molloy' in Google

Biplav Srivastava

This author has not been identified. Look up 'Biplav Srivastava' in Google