Epimining: Using Web News for Influenza Surveillance

Didier Breton 1 Sandra Bringay 2, 3 François Marques 1 Pascal Poncelet 2 Mathieu Roche 4
2 TATOO - Environmental data mining
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
4 TEXTE - Exploration and exploitation of textual data
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract: Epidemiological surveillance is an important issue in public health policy. In this paper, we describe a method based on knowledge extraction from news and on news classification to understand how an epidemic evolves. Descriptive studies are useful for gathering information on the incidence and characteristics of an epidemic. New approaches that exploit mass publication on the web are being developed; they rely either on the analysis of user queries or on the echo that an epidemic may have in the media. In this study, we focus on one particular medium: web news. We propose the Epimining approach, which extracts information from web news (based on pattern search) and classifies these news items finely into various classes (new cases, deaths, and so forth). Experiments conducted on a real corpus (AFP news) showed a precision greater than 94% and an F-measure above 85%.
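The abstract names two steps, pattern-based extraction and classification of news into classes such as new cases and deaths, and reports precision and F-measure. The following is a minimal Python sketch of that general idea, not the authors' implementation: the regular expressions, the class names `new_cases`, `deaths` and `other`, and the function names are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical patterns for illustration only; the paper's actual patterns
# are not given in this record.
PATTERNS = {
    "new_cases": re.compile(
        r"\b(\d[\d,]*)\s+(?:new\s+)?(?:confirmed\s+)?cases?\b", re.IGNORECASE
    ),
    "deaths": re.compile(
        r"\b(\d[\d,]*)\s+(?:deaths?|people\s+(?:have\s+)?died)\b", re.IGNORECASE
    ),
}


def classify(text):
    """Return the set of classes whose pattern matches, or {'other'} if none does."""
    labels = {name for name, pattern in PATTERNS.items() if pattern.search(text)}
    return labels or {"other"}


def extract_counts(text):
    """Return {class: [numbers]} for every figure captured by a pattern."""
    counts = {}
    for name, pattern in PATTERNS.items():
        numbers = [int(m.group(1).replace(",", "")) for m in pattern.finditer(text)]
        if numbers:
            counts[name] = numbers
    return counts


def precision_recall_f1(predicted, gold, label):
    """Per-class precision, recall and F-measure over parallel lists of label sets."""
    tp = sum(1 for p, g in zip(predicted, gold) if label in p and label in g)
    fp = sum(1 for p, g in zip(predicted, gold) if label in p and label not in g)
    fn = sum(1 for p, g in zip(predicted, gold) if label not in p and label in g)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1
```

Under these assumptions, `classify` would assign a news sentence to every class whose pattern matches it, `extract_counts` would pull out the reported figures, and `precision_recall_f1` would score the predicted labels against a manually annotated gold standard. The F-measure here is the usual harmonic mean of precision and recall.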
Document type:
Conference paper
DMHM: Data Mining for Healthcare Management, May 2012, Kuala Lumpur, Malaysia. 3rd Workshop on Data Mining for Healthcare Management, pp.11-21, 2012, 〈http://www-users.cs.umn.edu/~desikan/pakdd2012/dmhm.html〉
https://hal-lirmm.ccsd.cnrs.fr/lirmm-00723582
Contributor: Mathieu Roche
Submitted on: Friday, 10 August 2012 - 22:49:04
Last modified on: Thursday, 24 May 2018 - 15:59:23
Document(s) archived on: Sunday, 11 November 2012 - 02:30:35

File

Breton_DMHM2012_final.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: lirmm-00723582, version 1

Citation

Didier Breton, Sandra Bringay, François Marques, Pascal Poncelet, Mathieu Roche. Epimining: Using Web News for Influenza Surveillance. DMHM: Data Mining for Healthcare Management, May 2012, Kuala Lumpur, Malaysia. 3rd Workshop on Data Mining for Healthcare Management, pp.11-21, 2012, 〈http://www-users.cs.umn.edu/~desikan/pakdd2012/dmhm.html〉. 〈lirmm-00723582〉