Be a Multitude to Itself: A Prompt Evolution Framework for Red Teaming

Rui Li, Peiyi Wang, Jingyuan Ma, Di Zhang, Lei Sha, Zhifang Sui. Be a Multitude to Itself: A Prompt Evolution Framework for Red Teaming. In Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen, editors, Findings of the Association for Computational Linguistics: EMNLP 2024, Miami, Florida, USA, November 12-16, 2024. pages 3287-3301, Association for Computational Linguistics, 2024. [doi]

Authors

Rui Li

This author has not been identified. Look up 'Rui Li' in Google

Peiyi Wang

This author has not been identified. Look up 'Peiyi Wang' in Google

Jingyuan Ma

This author has not been identified. Look up 'Jingyuan Ma' in Google

Di Zhang

This author has not been identified. Look up 'Di Zhang' in Google

Lei Sha

This author has not been identified. Look up 'Lei Sha' in Google

Zhifang Sui

This author has not been identified. Look up 'Zhifang Sui' in Google