When More is Less: Incorporating Additional Datasets Can Hurt Performance By Introducing Spurious Correlations

Rhys Compton, Lily H. Zhang, Aahlad Manas Puli, Rajesh Ranganath. When More is Less: Incorporating Additional Datasets Can Hurt Performance By Introducing Spurious Correlations. In Kaivalya Deshpande, Madalina Fiterau, Shalmali Joshi, Zachary C. Lipton, Rajesh Ranganath, IƱigo Urteaga, Serene Yeung, editors, Machine Learning for Healthcare Conference, MLHC 2023, 11-12 August 2023, New York, USA. Volume 219 of Proceedings of Machine Learning Research, pages 110-127, PMLR, 2023. [doi]

Abstract

Abstract is missing.