Fuzzy Tree Mining: Go Soft on Your Nodes

Abstract : Tree mining consists in discovering the frequent subtrees from a forest of trees. This problem has many application areas. For instance, a huge volume of data available from the Internet is now described by trees (e.g. XML). Still, for several documents dealing with the same topic, this description is not always the same. It is thus necessary to mine a common structure in order to query these documents. Biology is another field where data may be described by means of trees. The problem of mining trees has now been addressed for several years, leading to well-known algorithms. However, these algorithms can hardly deal with real data in a soft manner. Indeed, they consider a subtree as fully included in the super-tree. This means that all the nodes must appear. In this paper, we extend this definition to fuzzy inclusion based on the idea that a tree is included to a certain degree within another one, this fuzzy degree being correlated to the number of matching nodes.
Type de document :
Communication dans un congrès
IFSA: International Fuzzy Systems Association World Congress, Jun 2007, Cancun, Mexico. Springer Berlin / Heidelberg, IFSA'07: 12th International Fuzzy Systems Association World Congress, pp.145-154, 2007, Lecture Notes in Computer Science. 〈http://www.hafsamx.org/ifsa2007/〉. 〈10.1007/978-3-540-72950-1_15〉
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-00272118
Contributeur : Anne Laurent <>
Soumis le : vendredi 11 avril 2008 - 08:33:50
Dernière modification le : jeudi 24 mai 2018 - 15:59:23

Lien texte intégral

Identifiants

Collections

Citation

Federico Del Razo Lopez, Anne Laurent, Pascal Poncelet, Maguelonne Teisseire. Fuzzy Tree Mining: Go Soft on Your Nodes. IFSA: International Fuzzy Systems Association World Congress, Jun 2007, Cancun, Mexico. Springer Berlin / Heidelberg, IFSA'07: 12th International Fuzzy Systems Association World Congress, pp.145-154, 2007, Lecture Notes in Computer Science. 〈http://www.hafsamx.org/ifsa2007/〉. 〈10.1007/978-3-540-72950-1_15〉. 〈lirmm-00272118〉

Partager

Métriques

Consultations de la notice

26