Jailbreaker in Jail: Moving Target Defense for Large Language Models

Bocheng Chen, Advait Paliwal, Qiben Yan. Jailbreaker in Jail: Moving Target Defense for Large Language Models. In Ning Zhang, Qi Li, editors, Proceedings of the 10th ACM Workshop on Moving Target Defense, MTD 2023, Copenhagen, Denmark, 26 November 2023. Pages 29-32, ACM, 2023.
