A Selftuning Approach for Improving Composite Schema Matchers

Fabien Duchateau 1 Remi Coletta 2 Zohra Bellahsene 2
2 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : Most schema matching tools are assembled from multiple match algorithms, each employing a particular technique to improve matching accuracy, which makes matching systems extensible and customizable to a specific domain. It has recently been pointed out that the main issue is how to select the most suitable match algorithms to execute for a given domain and how to adjust their many parameters. Current schema matching tools address this by aggregating the results obtained by several match algorithms to improve the quality of the discovered matches. In this article, we present a novel method that replaces this aggregation function and avoids its drawbacks. Unlike other composite matchers, our matching engine uses a decision tree to combine the most appropriate match algorithms. This has two advantages. First, the performance of the system improves, since only a subset of the match algorithms in a large library is executed. Second, the quality of the matches improves, since only the match algorithms most suitable for a given domain are used. Our approach can also learn the most appropriate match algorithms for a given domain by relying on expert feedback, and it can self-tune some parameters, such as thresholds and the performance versus quality ratio.
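To make the idea in the abstract concrete, the following is a minimal sketch of how a decision tree can select match algorithms for a pair of schema element names, so that only the measures on the path taken are ever computed. It is not the authors' implementation: the two similarity measures, the thresholds (0.8 and 0.5), the tree shape, and all names (`Node`, `Leaf`, `decide`, `TREE`) are assumptions chosen for illustration.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import Callable, List, Tuple, Union


def string_similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1] (illustrative measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def token_overlap(a: str, b: str) -> float:
    """Jaccard overlap of underscore-separated tokens (illustrative measure)."""
    ta, tb = set(a.lower().split("_")), set(b.lower().split("_"))
    return len(ta & tb) / len(ta | tb)


@dataclass
class Leaf:
    is_match: bool  # final decision for the pair


@dataclass
class Node:
    name: str                              # label of the match algorithm
    measure: Callable[[str, str], float]   # the match algorithm itself
    threshold: float                       # hypothetical, hand-picked value
    if_low: Union["Node", Leaf]            # branch when score < threshold
    if_high: Union["Node", Leaf]           # branch when score >= threshold


def decide(tree: Union[Node, Leaf], a: str, b: str) -> Tuple[bool, List[str]]:
    """Walk the tree; return the decision and the measures actually run."""
    used: List[str] = []
    node = tree
    while isinstance(node, Node):
        used.append(node.name)
        score = node.measure(a, b)
        node = node.if_high if score >= node.threshold else node.if_low
    return node.is_match, used


# Hypothetical tree: try the string measure first and fall back to the
# token measure only when the first one is inconclusive.
TREE = Node(
    name="string_similarity",
    measure=string_similarity,
    threshold=0.8,
    if_high=Leaf(True),
    if_low=Node(
        name="token_overlap",
        measure=token_overlap,
        threshold=0.5,
        if_high=Leaf(True),
        if_low=Leaf(False),
    ),
)

if __name__ == "__main__":
    for pair in [("customer_name", "customer_name"),
                 ("name_customer", "customer_name"),
                 ("price", "zip")]:
        print(pair, decide(TREE, *pair))
```

In this sketch an identical pair is accepted after a single measure, while a reordered pair such as `name_customer` / `customer_name` needs the second one. This illustrates the performance argument: measures that do not lie on the path taken are never executed. In the approach described above the tree would be learned from expert feedback rather than written by hand.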
Document type : Reports
Cited literature : 25 references

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00271534
Contributor : Fabien Duchateau
Submitted on : Wednesday, April 9, 2008 - 1:33:13 PM
Last modification on : Thursday, February 7, 2019 - 2:24:09 PM
Document(s) archived on : Friday, September 28, 2012 - 12:26:30 PM

Identifiers

  • HAL Id : lirmm-00271534, version 1

Citation

Fabien Duchateau, Remi Coletta, Zohra Bellahsene. A Selftuning Approach for Improving Composite Schema Matchers. RR-08010, 2008. ⟨lirmm-00271534⟩
