Mining Web Data for Epidemiological Surveillance

Didier Breton 1 Sandra Bringay 2, 3 François Marques 1 Pascal Poncelet 2 Mathieu Roche 4
2 ADVANSE - ADVanced Analytics for data SciencE
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
4 TEXTE - Exploration et exploitation de données textuelles
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : Epidemiological surveillance is an important issue of public health policy. In this paper, we describe a method based on knowledge extraction from news and news classification to understand the epidemic evolution. Descriptive studies are useful for gathering information on the incidence and characteristics of an epidemic. New approaches, based on new modes of mass publication through the web, are developed: based on the analysis of user queries or on the echo that an epidemic may have in the media. In this study, we focus on a particular media: web news. We propose the Epimining approach, which allows the extraction of information from web news (based on pattern research) and a fine classification of these news into various classes (new cases, deaths...). The experiments conducted on a real corpora (AFP news) showed a precision greater than 94% and an F-measure above 85%. We also investigate the interest of tacking into account the data collected through social networks such as Twitter to trigger alarms.
Document type :
Book sections
Complete list of metadatas

Cited literature [10 references]  Display  Hide  Download

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00798274
Contributor : Pascal Poncelet <>
Submitted on : Wednesday, March 20, 2019 - 3:41:42 PM
Last modification on : Thursday, June 6, 2019 - 2:39:46 PM
Long-term archiving on : Friday, June 21, 2019 - 9:16:10 PM

File

PAKDD12_Workshop.pdf
Files produced by the author(s)

Identifiers

Citation

Didier Breton, Sandra Bringay, François Marques, Pascal Poncelet, Mathieu Roche. Mining Web Data for Epidemiological Surveillance. Emerging Trends in Knowledge Discovery and Data Mining, LNCS (7769), Springer, pp.11-21, 2013, ⟨10.1007/978-3-642-36778-6_2⟩. ⟨lirmm-00798274⟩

Share

Metrics

Record views

197

Files downloads

43