VisCRA: A Visual Chain Reasoning Attack for Jailbreaking Multimodal Large Language Models

Bingrui Sima, Linhua Cong, Wenxuan Wang, Kun He. VisCRA: A Visual Chain Reasoning Attack for Jailbreaking Multimodal Large Language Models. In Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng, editors, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, EMNLP 2025, Suzhou, China, November 4-9, 2025, pages 6131-6144. Association for Computational Linguistics, 2025.

@inproceedings{SimaCWH25,
  title = {VisCRA: A Visual Chain Reasoning Attack for Jailbreaking Multimodal Large Language Models},
  author = {Bingrui Sima and Linhua Cong and Wenxuan Wang and Kun He},
  year = {2025},
  doi = {10.18653/v1/2025.emnlp-main.312},
  url = {https://doi.org/10.18653/v1/2025.emnlp-main.312},
  researchr = {https://researchr.org/publication/SimaCWH25},
  cites = {0},
  citedby = {0},
  pages = {6131--6144},
  booktitle = {Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, EMNLP 2025, Suzhou, China, November 4-9, 2025},
  editor = {Christos Christodoulopoulos and Tanmoy Chakraborty and Carolyn Rose and Violet Peng},
  publisher = {Association for Computational Linguistics},
  isbn = {979-8-89176-332-6},
}