A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models

Zihao Xu, Yi Liu, Gelei Deng, Yuekang Li, Stjepan Picek. A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models. In Lun-Wei Ku, Andre Martins, Vivek Srikumar, editors, Findings of the Association for Computational Linguistics, ACL 2024, Bangkok, Thailand and virtual meeting, August 11-16, 2024. pages 7432-7449, Association for Computational Linguistics, 2024.

@inproceedings{XuLDLP24,
  title = {A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models},
  author = {Zihao Xu and Yi Liu and Gelei Deng and Yuekang Li and Stjepan Picek},
  year = {2024},
  url = {https://aclanthology.org/2024.findings-acl.443},
  researchr = {https://researchr.org/publication/XuLDLP24},
  cites = {0},
  citedby = {0},
  pages = {7432--7449},
  booktitle = {Findings of the Association for Computational Linguistics, ACL 2024, Bangkok, Thailand and virtual meeting, August 11-16, 2024},
  editor = {Lun-Wei Ku and Andre Martins and Vivek Srikumar},
  publisher = {Association for Computational Linguistics},
  isbn = {979-8-89176-099-8},
}
