How the ExpLSA Approach Impacts the Document Classification Tasks
Abstract
Latent Semantic Analysis (LSA) is a statistical method which can be used to classify texts. This paper proposes a sentence expansion method (ExpLSA) to improve document classification tasks. We propose to study the impact of ExpLSA on the size and on the type of corpora.