What Models Know About Their Attackers: Deriving Attacker Information From Latent Representations - researchr publication

researchr

You are not signed in
Sign in
Sign up

Zhouhang Xie, Jonathan Brophy, Adam Noack, Wencong You, Kalyani Asthana, Carter Perkins, Sabrina Reis, Zayd Hammoudeh, Daniel Lowd, Sameer Singh. What Models Know About Their Attackers: Deriving Attacker Information From Latent Representations. In Jasmijn Bastings, Yonatan Belinkov, Emmanuel Dupoux, Mario Giulianelli, Dieuwke Hupkes, Yuval Pinter, Hassan Sajjad, editors, Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, BlackboxNLP@EMNLP 2021, Punta Cana, Dominican Republic, November 11, 2021. pages 69-78, Association for Computational Linguistics, 2021. [doi]

Abstract is missing.

runs on WebDSL