Size vs. Structure in Training Corpora for Word Embedding Models: Araneum Russicum Maximum and Russian National Corpus

Andrey Kutuzov, Maria Kunilovskaya. Size vs. Structure in Training Corpora for Word Embedding Models: Araneum Russicum Maximum and Russian National Corpus. In Wil M. P. van der Aalst, Dmitry I. Ignatov, Michael Khachay, Sergei O. Kuznetsov, Victor S. Lempitsky, Irina A. Lomazova, Natalia V. Loukachevitch, Amedeo Napoli, Alexander Panchenko, Panos M. Pardalos, Andrey V. Savchenko, Stanley Wasserman, editors, Analysis of Images, Social Networks and Texts - 6th International Conference, AIST 2017, Moscow, Russia, July 27-29, 2017, Revised Selected Papers. Volume 10716 of Lecture Notes in Computer Science, pages 47-58, Springer, 2017.

@inproceedings{KutuzovK17-0,
  title = {Size vs. Structure in Training Corpora for Word Embedding Models: Araneum Russicum Maximum and Russian National Corpus},
  author = {Andrey Kutuzov and Maria Kunilovskaya},
  year = {2017},
  doi = {10.1007/978-3-319-73013-4_5},
  url = {https://doi.org/10.1007/978-3-319-73013-4_5},
  researchr = {https://researchr.org/publication/KutuzovK17-0},
  cites = {0},
  citedby = {0},
  pages = {47--58},
  booktitle = {Analysis of Images, Social Networks and Texts - 6th International Conference, AIST 2017, Moscow, Russia, July 27-29, 2017, Revised Selected Papers},
  editor = {Wil M. P. van der Aalst and Dmitry I. Ignatov and Michael Khachay and Sergei O. Kuznetsov and Victor S. Lempitsky and Irina A. Lomazova and Natalia V. Loukachevitch and Amedeo Napoli and Alexander Panchenko and Panos M. Pardalos and Andrey V. Savchenko and Stanley Wasserman},
  volume = {10716},
  series = {Lecture Notes in Computer Science},
  publisher = {Springer},
  isbn = {978-3-319-73013-4},
}