Query-Efficient Black-Box Red Teaming via Bayesian Optimization

Deokjae Lee, Junyeong Lee, Jung-Woo Ha, Jin-Hwa Kim, Sang-Woo Lee, Hwaran Lee, Hyun Oh Song. Query-Efficient Black-Box Red Teaming via Bayesian Optimization. In Anna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki, editors, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023. pages 11551-11574, Association for Computational Linguistics, 2023. [doi]

Authors

Deokjae Lee

This author has not been identified. Look up 'Deokjae Lee' in Google

Junyeong Lee

This author has not been identified. Look up 'Junyeong Lee' in Google

Jung-Woo Ha

This author has not been identified. Look up 'Jung-Woo Ha' in Google

Jin-Hwa Kim

This author has not been identified. Look up 'Jin-Hwa Kim' in Google

Sang-Woo Lee

This author has not been identified. Look up 'Sang-Woo Lee' in Google

Hwaran Lee

This author has not been identified. Look up 'Hwaran Lee' in Google

Hyun Oh Song

This author has not been identified. Look up 'Hyun Oh Song' in Google