Lexical Normalization of Spanish Tweets with Preprocessing Rules, Domain-specific Edit Distances, and Language Models

Pablo Ruiz, Montse Cuadros, Thierry Etchegoyhen. Lexical Normalization of Spanish Tweets with Preprocessing Rules, Domain-specific Edit Distances, and Language Models. In Iñaki Alegria, Nora Aranberri, Víctor Fresno, Pablo Gamallo, Lluís Padró, Iñaki San Vicente, Jordi Turmo, Arkaitz Zubiaga, editors, Proceedings of the Tweet Normalization Workshop co-located with 29th Conference of the Spanish Society for Natural Language Processing (SEPLN 2013), Madrid, Spain, September 20th, 2013. Volume 1086 of CEUR Workshop Proceedings, pages 59-63, CEUR-WS.org, 2013. [doi]

Abstract

Abstract is missing.