Automatic Extraction of Structurally Coherent Mini-Taxonomies

Khalid Saleem 1 Zohra Bellahsene 2
2 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : In this paper we demonstrate an automatic approach for emergent semantics modeling of ontologies. We follow the collaborative ontology construction method without the direct interaction of domain users, engineers or developers. A very important characteristic of an ontology is its hierarchical structure of concepts. Semantic web is heavily dependent on the XML paradigm, which inherently follows the hierarchical structure. We consider large sets of domain specific schemas as trees and apply frequent sub-tree mining for extracting common hierarchical patterns. Our experiments show that these hierarchical patterns are good enough to represent and describe the concepts of the domain ontology. The technique further demonstrates the construction of the taxonomy of domain ontology. In this regard we consider the largest frequent tree or a tree created by merging the set of largest frequent sub-trees as the taxonomy. We argue in favour of the trustabilty for such a taxonomy and related concepts, since it has been extracted from the schemas being used with in the specified domain.
Type de document :
Rapport
08009, 2008
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00267982
Contributeur : Khalid Saleem <>
Soumis le : dimanche 30 mars 2008 - 03:57:40
Dernière modification le : samedi 27 janvier 2018 - 01:32:14
Document(s) archivé(s) le : vendredi 21 mai 2010 - 01:02:08

Fichier

Identifiants

  • HAL Id : lirmm-00267982, version 1

Collections

Citation

Khalid Saleem, Zohra Bellahsene. Automatic Extraction of Structurally Coherent Mini-Taxonomies. 08009, 2008. 〈lirmm-00267982〉

Partager

Métriques

Consultations de la notice

157

Téléchargements de fichiers

215