Visual Adversarial Examples Jailbreak Aligned Large Language Models

Xiangyu Qi, Kaixuan Huang, Ashwinee Panda, Peter Henderson 0002, Mengdi Wang, Prateek Mittal. Visual Adversarial Examples Jailbreak Aligned Large Language Models. In Michael J. Wooldridge, Jennifer G. Dy, Sriraam Natarajan, editors, Thirty-Eigth AAAI Conference on Artificial Intelligence, AAAI 2024, Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence, IAAI 2024, Fourteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2014, February 20-27, 2024, Vancouver, Canada. pages 21527-21536, AAAI Press, 2024. [doi]

Authors

Xiangyu Qi

This author has not been identified. Look up 'Xiangyu Qi' in Google

Kaixuan Huang

This author has not been identified. Look up 'Kaixuan Huang' in Google

Ashwinee Panda

This author has not been identified. Look up 'Ashwinee Panda' in Google

Peter Henderson 0002

This author has not been identified. Look up 'Peter Henderson 0002' in Google

Mengdi Wang

This author has not been identified. Look up 'Mengdi Wang' in Google

Prateek Mittal

This author has not been identified. Look up 'Prateek Mittal' in Google