When More is Less: Incorporating Additional Datasets Can Hurt Performance By Introducing Spurious Correlations

Hyungrok Do, Yuxin Chang, Yoon-Sang Cho, Padhraic Smyth, Judy Zhong. When More is Less: Incorporating Additional Datasets Can Hurt Performance By Introducing Spurious Correlations. In Kaivalya Deshpande, Madalina Fiterau, Shalmali Joshi, Zachary C. Lipton, Rajesh Ranganath, Iñigo Urteaga, Serene Yeung, editors, Machine Learning for Healthcare Conference, MLHC 2023, 11-12 August 2023, New York, USA. Volume 219 of Proceedings of Machine Learning Research, pages 128-149, PMLR, 2023.

Authors

Hyungrok Do

Yuxin Chang

Yoon-Sang Cho

Padhraic Smyth

Judy Zhong
