Selecting relevant text subsets from web-data for building topic specific language models

Abhinav Sethy, Panayiotis P. Georgiou, Shrikanth Narayanan. Selecting relevant text subsets from web-data for building topic specific language models. In Robert C. Moore, Jeff A. Bilmes, Jennifer Chu-Carroll, Mark Sanderson, editors, Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, June 4-9, 2006, New York, New York, USA. The Association for Computational Linguistics, 2006. [doi]

@inproceedings{SethyGN06,
  title = {Selecting relevant text subsets from web-data for building topic specific language models},
  author = {Abhinav Sethy and Panayiotis P. Georgiou and Shrikanth Narayanan},
  year = {2006},
  url = {http://acl.ldc.upenn.edu/N/N06/N06-2037.pdf},
  tags = {data-flow language, modeling language, language modeling, data-flow, domain-specific language},
  researchr = {https://researchr.org/publication/SethyGN06},
  cites = {0},
  citedby = {0},
  booktitle = {Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, June 4-9, 2006, New York, New York, USA},
  editor = {Robert C. Moore and Jeff A. Bilmes and Jennifer Chu-Carroll and Mark Sanderson},
  publisher = {The Association for Computational Linguistics},
}