Introduction to Text Classification: Impact of Stemming and Comparing TF-IDF and Count Vectorization as Feature Extraction Technique

André Wendland, Marco Zenere, Jörg Niemann. Introduction to Text Classification: Impact of Stemming and Comparing TF-IDF and Count Vectorization as Feature Extraction Technique. In Murat Yilmaz, Paul M. Clarke, Richard Messnarz, Michael Reiner, editors, Systems, Software and Services Process Improvement - 28th European Conference, EuroSPI 2021, Krems, Austria, September 1-3, 2021, Proceedings. Volume 1442 of Communications in Computer and Information Science, pages 289-300, Springer, 2021. [doi]

Abstract

Abstract is missing.