When More is Less: Incorporating Additional Datasets Can Hurt Performance By Introducing Spurious Correlations

Hyungrok Do, Yuxin Chang, Yoon-Sang Cho, Padhraic Smyth, Judy Zhong. When More is Less: Incorporating Additional Datasets Can Hurt Performance By Introducing Spurious Correlations. In Kaivalya Deshpande, Madalina Fiterau, Shalmali Joshi, Zachary C. Lipton, Rajesh Ranganath, IƱigo Urteaga, Serene Yeung, editors, Machine Learning for Healthcare Conference, MLHC 2023, 11-12 August 2023, New York, USA. Volume 219 of Proceedings of Machine Learning Research, pages 128-149, PMLR, 2023. [doi]

Abstract

Abstract is missing.