Terminology Extraction from Log Files

Hassan Saneifar 1, 2, * Stéphane Bonniol 2 Anne Laurent 3 Pascal Poncelet 3 Mathieu Roche 4
* Auteur correspondant
3 TATOO - Fouille de données environnementales
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
4 TEXTE - Exploration et exploitation de données textuelles
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : The log files generated by digital systems can be used in management information systems as the source of important information on the condition of systems. However, log files are not exhaustively exploited in order to extract information. The classical methods of information extraction such as terminology extraction methods are irrelevant to this context because of the specific characteristics of log files like their heterogeneous structure, the special vocabulary and the fact that they do not respect a natural language grammar. In this paper, we introduce our approach Exterlog to extract the terminology from log files. We detail how it deals with the particularity of such textual data.
Type de document :
Communication dans un congrès
DEXA'2009: 20th International Conference on Database and Expert Systems Applications, Aug 2009, Linz, Austria. Springer, pp.769-776, 2009, LNCS. 〈http://www.dexa.org/dexa_cfp〉
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00423940
Contributeur : Hassan Saneifar <>
Soumis le : mardi 13 octobre 2009 - 12:43:11
Dernière modification le : jeudi 24 mai 2018 - 15:59:23

Identifiants

  • HAL Id : lirmm-00423940, version 1

Collections

Citation

Hassan Saneifar, Stéphane Bonniol, Anne Laurent, Pascal Poncelet, Mathieu Roche. Terminology Extraction from Log Files. DEXA'2009: 20th International Conference on Database and Expert Systems Applications, Aug 2009, Linz, Austria. Springer, pp.769-776, 2009, LNCS. 〈http://www.dexa.org/dexa_cfp〉. 〈lirmm-00423940〉

Partager

Métriques

Consultations de la notice

81