| HAL: lirmm-00373643, version 2 |
| DOI: 10.1007/978-3-642-10745-0_61 |
| Detailed view | Export this paper |
|
|
| IFCS'09: International Federation of Classification Societies Conference, Dresde : Germany (2009) |
|
|
| Available versions: | v1 (2009-04-08) | v2 (2010-08-26) |
|
|
|
|
| Visualising a Text with a Tree Cloud |
|
|
Philippe Gambette 1Jean Véronis 2 |
|
|
| (2009) |
|
|
| Tag clouds have gained popularity over the internet to provide a quick overview of the content of a website or a text. We introduce a new visualisation which displays more information: the tree cloud. Like a word cloud, it shows the most frequent words of the text, where the size reflects the frequency, but the words are arranged on a tree to reflect their semantic proximity according to the text. Such tree clouds help identify the main topics of a document, and even be used for text analysis. We also provide methods to evaluate the quality of the obtained tree cloud, and some key steps of its construction. Our algorithms are implemented in the free software TreeCloud available at http://www.treecloud.org |
|
|
|
|
|
|
|
|
|
|
| 1: | Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier (LIRMM) |
| CNRS : UMR5506 – Université Montpellier II - Sciences et Techniques du Languedoc | |
| 2: | Laboratoire d'informatique Fondamentale de Marseille (LIF) |
| CNRS : UMR6166 – Université de la Méditerranée - Aix-Marseille II – Université de Provence - Aix-Marseille I | |
|
|
|
|
|
|
|
|
| [INFO/ALGCO] |
|
|
|
|
| Subject | : | Computer Science/Document and Text Processing Computer Science/Computation and Language |
|
|
| Information visualisation – tag cloud – semantic proximity – hierarchical clustering – arboricity |
|
|
| Attached file list to this document: | |||||
|
|
|
| lirmm-00373643, version 2 | |
| http://hal-lirmm.ccsd.cnrs.fr/lirmm-00373643 | |
| oai:hal-lirmm.ccsd.cnrs.fr:lirmm-00373643 | |
| From: Philippe Gambette | |
| Submitted on: Saturday, 7 August 2010 22:03:33 | |
| Updated on: Thursday, 26 August 2010 15:25:34 | |