Designing a Benchmark for the Assessment of Schema Matching Tools

Abstract : Over the years, many schema matching approaches have been developed to discover correspondences between schemas. Although this task is crucial in data integration, its evaluation, both in terms of matching quality and time performance, is still manually performed. Indeed, there is no common platform which gathers a collection of schema matching datasets to fulfil this goal. Another problem deals with the measuring of the post-match effort, a human cost that schema matching approaches aim at reducing. Consequently, we propose XBenchMatch, a schema matching benchmark with available datasets and new measures to evaluate this manual post-match effort and the quality of integrated schemas. We finally report the results obtained by different approaches, namely COMA++, Similarity Flooding and YAM. We show that such a benchmark is required to understand the advantages and failures of schema matching approaches. Therefore, it could help an end-user to select a schema matching tool which covers his/her needs.
Document type :
Journal articles
Complete list of metadatas

Cited literature [48 references]  Display  Hide  Download

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01061615
Contributor : Zohra Bellahsene <>
Submitted on : Wednesday, January 30, 2019 - 11:27:21 AM
Last modification on : Thursday, February 7, 2019 - 3:56:41 PM

File

OJDB-v1i1n02_Duchateau.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution - NonCommercial - NoDerivatives 4.0 International License

Identifiers

  • HAL Id : lirmm-01061615, version 1

Citation

Fabien Duchateau, Zohra Bellahsene. Designing a Benchmark for the Assessment of Schema Matching Tools. Open Journal of Databases, RonPub, 2014, 1 (1), pp.3-25. ⟨http://www.ronpub.com/publications/OJDB-v1i1n02_Duchateau.pdf⟩. ⟨lirmm-01061615⟩

Share

Metrics

Record views

343

Files downloads

18