Skip to Main content Skip to Navigation
Other publications

New Challenges in Data Integration: Large Scale Automatic Schema Matching

Khalid Saleem 1 Zohra Bellahsene 2
2 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : Today schema matching is a basic task in almost every data intensive distributed application, namely enterprise information integration, collaborating web services, ontology based agents communication, web catalogue integration and schema based P2P database systems. There has been a plethora of algorithms and techniques researched in schema matching and integration for data interoperability. Numerous surveys have been presented in the past to summarize this research. The requirement for extending the previous surveys has been created because of the mushrooming of the dynamic nature of these data intensive applications. Indeed, evolving large scale distributed information systems are further pushing the schema matching research to utilize the processing power not available in the past and directly increasing the industry investment proportion in the matching domain. This article reviews the latest application domains in which schema matching is being utilized. The paper gives a detailed insight about the desiderata for schema matching and integration in the large scale scenarios. Another panorama which is covered by this survey is the shift from manual to automatic schema matching. Finally the paper presents the state of the art in large scale schema matching, classifying the tools and prototypes according to their input, output and execution strategies and algorithms.
Document type :
Other publications
Complete list of metadata

Cited literature [69 references]  Display  Hide  Download
Contributor : Khalid Saleem <>
Submitted on : Wednesday, April 16, 2008 - 3:36:53 AM
Last modification on : Friday, October 23, 2020 - 4:40:14 PM
Long-term archiving on: : Tuesday, September 21, 2010 - 4:10:16 PM


  • HAL Id : lirmm-00170765, version 2



Khalid Saleem, Zohra Bellahsene. New Challenges in Data Integration: Large Scale Automatic Schema Matching. 2007. ⟨lirmm-00170765v2⟩



Record views


Files downloads