Efficient Jailbreak Attack sequences on Large Language Models via Multi-Armed Bandit-based Context switching

Aditya Ramesh, Shivam Bhardwaj, Aditya Saibewar, Manohar Kaul. Efficient Jailbreak Attack sequences on Large Language Models via Multi-Armed Bandit-based Context switching. In The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025. OpenReview.net, 2025. [doi]

@inproceedings{RameshBSK25,
  title = {Efficient Jailbreak Attack sequences on Large Language Models via Multi-Armed Bandit-based Context switching},
  author = {Aditya Ramesh and Shivam Bhardwaj and Aditya Saibewar and Manohar Kaul},
  year = {2025},
  url = {https://openreview.net/forum?id=jCDF7G3LpF},
  researchr = {https://researchr.org/publication/RameshBSK25},
  cites = {0},
  citedby = {0},
  booktitle = {The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025},
  publisher = {OpenReview.net},
}