Classifying texts through natural language parsing and semantic filtering
Abstract
This paper presents a study in text classification through semantic and syntactic natural language processing. The authors have used a parser for French, SYGFRAN, and applied it to a real project of press articles classification. The results of this research on a corpus of 4, 843 texts containing more than 76, 000 sentences are described. Classification into 37 categories has been obtained through meaning discrimination by semantic filtering techniques, explained in the document.