How to Extract Relevant Knowledge from Tweets?

Tweets exchanged over the Internet are an important source of information even if their characteristics make them difficult to analyze (e.g., a maximum of 140 characters; noisy data). In this paper, we investigate two different problems. The first one is related to the extraction of representative terms from a set of tweets. More precisely we address the following question: are traditional information retrieval measures appropriate when dealing with tweets?. The second problem is related to the evolution of tweets over time for a set of users. With the development of data mining approaches, lots of very efficient methods have been defined to extract patterns hidden in the huge amount of data available. More recently new spatio-temporal data mining approaches have specifically been defined for dealing with the huge amount of moving object data that can be obtained from the improvement in positioning technology. Due to particularity of tweets, the second question we investigate is the following: are spatio-temporal mining algorithms appropriate for better understanding the behavior of communities over time? These two prob- lems are illustrated through real applications concerning both health and political tweets.

Mots clés

Pattern Mining Data Mining Approach Left Party Multidimensional Characteristic Move Object Database

Domaines

Base de données [cs.DB] Web

Fichier principal

Isip2012.pdf (493.59 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Pascal Poncelet : Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00798662

Soumis le : vendredi 29 mars 2013-16:55:54

Dernière modification le : jeudi 16 mai 2024-13:30:46

Archivage à long terme le : dimanche 30 juin 2013-02:55:10

Dates et versions

lirmm-00798662 , version 1 (29-03-2013)

Identifiants

HAL Id : lirmm-00798662 , version 1
DOI : 10.1007/978-3-642-40140-4_12
IRSTEA : PUB00054256

Citer

Flavien Bouillot, Nhat Hai Phan, Nicolas Béchet, Sandra Bringay, Dino Ienco, et al.. How to Extract Relevant Knowledge from Tweets?. International Workshop on Information Search, Integration, and Personalization (ISIP), Oct 2012, Sapporo, Japan. pp.111-120, ⟨10.1007/978-3-642-40140-4_12⟩. ⟨lirmm-00798662⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CIRAD AGROPARISTECH CNRS UNIV-MONTP3 IRSTEA GREYC GREYC-CODAG ADVANSE TEXTE LIRMM COMUE-NORMANDIE AGROPOLIS TETIS MIPS UNIV-MONTPELLIER ENSICAEN UNICAEN INRAE INRAEOCCITANIEMONTPELLIER MATHNUM AMIS

420 Consultations

857 Téléchargements