Skip to Main content Skip to Navigation

Automatic Extraction of Structurally Coherent Mini-Taxonomies

Khalid Saleem 1 Zohra Bellahsene 2
2 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : In this paper we demonstrate an automatic approach for emergent semantics modeling of ontologies. We follow the collaborative ontology construction method without the direct interaction of domain users, engineers or developers. A very important characteristic of an ontology is its hierarchical structure of concepts. Semantic web is heavily dependent on the XML paradigm, which inherently follows the hierarchical structure. We consider large sets of domain specific schemas as trees and apply frequent sub-tree mining for extracting common hierarchical patterns. Our experiments show that these hierarchical patterns are good enough to represent and describe the concepts of the domain ontology. The technique further demonstrates the construction of the taxonomy of domain ontology. In this regard we consider the largest frequent tree or a tree created by merging the set of largest frequent sub-trees as the taxonomy. We argue in favour of the trustabilty for such a taxonomy and related concepts, since it has been extracted from the schemas being used with in the specified domain.
Document type :
Complete list of metadata
Contributor : Khalid Saleem <>
Submitted on : Sunday, March 30, 2008 - 3:57:40 AM
Last modification on : Wednesday, November 14, 2018 - 2:56:02 PM
Long-term archiving on: : Friday, May 21, 2010 - 1:02:08 AM


  • HAL Id : lirmm-00267982, version 1



Khalid Saleem, Zohra Bellahsene. Automatic Extraction of Structurally Coherent Mini-Taxonomies. 08009, 2008. ⟨lirmm-00267982⟩



Record views


Files downloads