Adversarial Benchmark Evaluation Rectified by Controlling for Difficulty

Behzad Mehrbakhsh, Fernando Martínez-Plumed, José Hernández-Orallo. Adversarial Benchmark Evaluation Rectified by Controlling for Difficulty. In Kobi Gal, Ann Nowé, Grzegorz J. Nalepa, Roy Fairstein, Roxana Radulescu, editors, ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland - Including 12th Conference on Prestigious Applications of Intelligent Systems (PAIS 2023). Volume 372 of Frontiers in Artificial Intelligence and Applications, pages 1696-1703, IOS Press, 2023. [doi]

Abstract

Abstract is missing.