Multi-step Jailbreaking Privacy Attacks on ChatGPT

Haoran Li, Dadi Guo, Wei Fan, Mingshi Xu, Jie Huang 0009, Fanpu Meng, Yangqiu Song. Multi-step Jailbreaking Privacy Attacks on ChatGPT. In Houda Bouamor, Juan Pino 0001, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. pages 4138-4153, Association for Computational Linguistics, 2023. [doi]

Authors

Haoran Li

This author has not been identified. Look up 'Haoran Li' in Google

Dadi Guo

This author has not been identified. Look up 'Dadi Guo' in Google

Wei Fan

This author has not been identified. Look up 'Wei Fan' in Google

Mingshi Xu

This author has not been identified. Look up 'Mingshi Xu' in Google

Jie Huang 0009

This author has not been identified. Look up 'Jie Huang 0009' in Google

Fanpu Meng

This author has not been identified. Look up 'Fanpu Meng' in Google

Yangqiu Song

This author has not been identified. Look up 'Yangqiu Song' in Google