ASSERT: Automated Safety Scenario Red Teaming for Evaluating the Robustness of Large Language Models

Alex Mei, Sharon Levy, William Yang Wang. ASSERT: Automated Safety Scenario Red Teaming for Evaluating the Robustness of Large Language Models. In Houda Bouamor, Juan Pino, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023, pages 5831-5847. Association for Computational Linguistics, 2023.

Authors

Alex Mei

Sharon Levy

William Yang Wang
