When More is Less: Incorporating Additional Datasets Can Hurt Performance By Introducing Spurious Correlations

Hyungrok Do, Yuxin Chang, Yoon-Sang Cho, Padhraic Smyth, Judy Zhong. When More is Less: Incorporating Additional Datasets Can Hurt Performance By Introducing Spurious Correlations. In Kaivalya Deshpande, Madalina Fiterau, Shalmali Joshi, Zachary C. Lipton, Rajesh Ranganath, Iñigo Urteaga, Serene Yeung, editors, Machine Learning for Healthcare Conference, MLHC 2023, 11-12 August 2023, New York, USA. Volume 219 of Proceedings of Machine Learning Research, pages 128-149, PMLR, 2023. [doi]

@inproceedings{DoCCSZ23,
  title = {When More is Less: Incorporating Additional Datasets Can Hurt Performance By Introducing Spurious Correlations},
  author = {Hyungrok Do and Yuxin Chang and Yoon-Sang Cho and Padhraic Smyth and Judy Zhong},
  year = {2023},
  url = {https://proceedings.mlr.press/v219/do23a.html},
  researchr = {https://researchr.org/publication/DoCCSZ23},
  cites = {0},
  citedby = {0},
  pages = {128-149},
  booktitle = {Machine Learning for Healthcare Conference, MLHC 2023, 11-12 August 2023, New York, USA},
  editor = {Kaivalya Deshpande and Madalina Fiterau and Shalmali Joshi and Zachary C. Lipton and Rajesh Ranganath and Iñigo Urteaga and Serene Yeung},
  volume = {219},
  series = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}