Sur l'utilisation de LDA en RI Pair à Pair

Sylvie Cazalens 1 Esther Pacitti 2 Sylvie Calabretto 3 Yulian Yang 3
2 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
3 DRIM - Distribution, Recherche d'Information et Mobilité
LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : We revisit the problem of defining a peer-to-peer system for Information Retrieval when each peer’s topic-based profile is obtained using Latent Dirichlet Allocation. This method, defined for a centralized collection, provides a rich representation of the topics and of the doc- uments. We describe two ways of using it in a distributed system and analyze their advantages and drawbacks. Then, we illustrate the use of the obtained topic-based profiles within two systems. The first one is unstructured and uses a gossip-based algorithm to obtain dynamic overlays of topically related peers. This requires defining a similarity between profiles. The second one uses super-peers and maintains a topic-based index of the peers, which is recorded in a distributed Hash table. The keys are derived from the topic-based profiles.
Document type :
Conference papers
Complete list of metadatas

Cited literature [7 references]  Display  Hide  Download

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01088735
Contributor : Esther Pacitti <>
Submitted on : Sunday, June 26, 2016 - 10:30:27 AM
Last modification on : Thursday, February 7, 2019 - 4:52:47 PM
Long-term archiving on : Wednesday, November 9, 2016 - 2:03:16 PM

File

2013_1a_4 Cazalens.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : lirmm-01088735, version 1

Citation

Sylvie Cazalens, Esther Pacitti, Sylvie Calabretto, Yulian Yang. Sur l'utilisation de LDA en RI Pair à Pair. INFORSID, May 2013, Paris, France. ⟨lirmm-01088735⟩

Share

Metrics

Record views

453

Files downloads

182