Understanding Deep Learning Performance through an Examination of Test Set Difficulty: A Psychometric Case Study

John Lalor, Hao Wu, Tsendsuren Munkhdalai, Hong Yu. Understanding Deep Learning Performance through an Examination of Test Set Difficulty: A Psychometric Case Study. In Ellen Riloff, David Chiang 0001, Julia Hockenmaier, Jun'ichi Tsujii, editors, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018. pages 4711-4716, Association for Computational Linguistics, 2018. [doi]

@inproceedings{LalorWMY18,
  title = {Understanding Deep Learning Performance through an Examination of Test Set Difficulty: A Psychometric Case Study},
  author = {John Lalor and Hao Wu and Tsendsuren Munkhdalai and Hong Yu},
  year = {2018},
  url = {https://aclanthology.info/papers/D18-1500/d18-1500},
  researchr = {https://researchr.org/publication/LalorWMY18},
  cites = {0},
  citedby = {0},
  pages = {4711-4716},
  booktitle = {Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31 - November 4, 2018},
  editor = {Ellen Riloff and David Chiang 0001 and Julia Hockenmaier and Jun'ichi Tsujii},
  publisher = {Association for Computational Linguistics},
  isbn = {978-1-948087-84-1},
}