Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models

Boxin Wang, Chejian Xu, Shuohang Wang, Zhe Gan, Yu Cheng 0001, Jianfeng Gao, Ahmed Hassan Awadallah, Bo Li 0026. Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models. In Joaquin Vanschoren, Sai Kit Yeung, editors, Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, NeurIPS Datasets and Benchmarks 2021, December 2021, virtual. 2021. [doi]

Authors

Boxin Wang

This author has not been identified. Look up 'Boxin Wang' in Google

Chejian Xu

This author has not been identified. Look up 'Chejian Xu' in Google

Shuohang Wang

This author has not been identified. Look up 'Shuohang Wang' in Google

Zhe Gan

This author has not been identified. Look up 'Zhe Gan' in Google

Yu Cheng 0001

This author has not been identified. Look up 'Yu Cheng 0001' in Google

Jianfeng Gao

This author has not been identified. Look up 'Jianfeng Gao' in Google

Ahmed Hassan Awadallah

This author has not been identified. Look up 'Ahmed Hassan Awadallah' in Google

Bo Li 0026

This author has not been identified. Look up 'Bo Li 0026' in Google