Where's Charlie: Family Based Heuristics for Peer-to-Peer Schema Integration

John Tranier 1, 2 Renaud Baraër 1 Zohra Bellahsene 3 Maguelonne Teisseire 4
3 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
4 TATOO - Fouille de données environnementales
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : Querying semantically related data sources depends on the ability to map between their schemas. Unfortunately, in most cases matching between schemas is still largely performed manually. As a consequence, semantic integration issues have become a key bottleneck in the deployment of a large scale integration systems (the number of schemas to map is huge). This work deals with automated methods for matching and efficiently generating schema mappings in large scale environments. We propose a level-wise algorithm based on a semantic distance to evaluate similarity between schema nodes. However, in a large scale context, computing the semantic distance for every couple of nodes cannot be done. Thus, we proposed family based heuristics (CHARLIE) in order to efficiently generate mappings. Experiments have shown that our approach is very efficient for large scale integration especially in a super peer based architecture and that it is relevant for real datasets.
Type de document :
Communication dans un congrès
IDEAS: International Database Engineering & Applications Symposium, Jul 2004, Coimbra, Portugal. IEEE, pp.227-235, 2004, 〈10.1109/IDEAS.2004.1319795〉
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00108881
Contributeur : Christine Carvalho de Matos <>
Soumis le : lundi 23 octobre 2006 - 12:57:08
Dernière modification le : jeudi 24 mai 2018 - 15:59:23

Identifiants

Collections

Citation

John Tranier, Renaud Baraër, Zohra Bellahsene, Maguelonne Teisseire. Where's Charlie: Family Based Heuristics for Peer-to-Peer Schema Integration. IDEAS: International Database Engineering & Applications Symposium, Jul 2004, Coimbra, Portugal. IEEE, pp.227-235, 2004, 〈10.1109/IDEAS.2004.1319795〉. 〈lirmm-00108881〉

Partager

Métriques

Consultations de la notice

88