Linguistically-augmented perplexity-based data selection for language models

Antonio Toral, Pavel Pecina, Longyue Wang, Josef van Genabith. Linguistically-augmented perplexity-based data selection for language models. Computer Speech & Language, 32(1):11-26, 2015.

@article{ToralPWG15,
  title = {Linguistically-augmented perplexity-based data selection for language models},
  author = {Antonio Toral and Pavel Pecina and Longyue Wang and Josef van Genabith},
  year = {2015},
  doi = {10.1016/j.csl.2014.10.002},
  url = {http://dx.doi.org/10.1016/j.csl.2014.10.002},
  researchr = {https://researchr.org/publication/ToralPWG15},
  cites = {0},
  citedby = {0},
  journal = {Computer Speech \& Language},
  volume = {32},
  number = {1},
  pages = {11--26},
}