Training-free Lexical Backdoor Attacks on Language Models

Yujin Huang, Terry Yue Zhuo, Qiongkai Xu, Han Hu, Xingliang Yuan, Chunyang Chen. Training-free Lexical Backdoor Attacks on Language Models. In Ying Ding, Jie Tang, Juan F. Sequeda, Lora Aroyo, Carlos Castillo, Geert-Jan Houben, editors, Proceedings of the ACM Web Conference 2023, WWW 2023, Austin, TX, USA, 30 April 2023 - 4 May 2023, pages 2198-2208. ACM, 2023.
