A Text Mining Pipeline for Mining the Quantum Cascade Laser Properties - LIRMM - Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier Accéder directement au contenu
Communication Dans Un Congrès Année : 2023

A Text Mining Pipeline for Mining the Quantum Cascade Laser Properties

Résumé

The development of the Terahertz laser technology in quantum cascade lasers (qcl) has brought about great potential for industrial applications. These lasers are based on the Terahertz electromagnetic waves, in the frequency range from about 100GHz to 10THz. There is need to understand the structure of the laser and its influence on the performance in order to optimize the design process. One way of collating this information is by having ontologies and knowledge bases capturing the various qcl designs and their performance characteristics. Majority of the laser design data is usually contained in scientific literature. The main drawback of such textual data sources is their unstructured nature. The complex nature of the laser design and the varying author language styles poses some level of difficulty in retrieving this information. Owing to this, the existing methods needs improvement in order retrieve the laser information at a high precision(with minimal number of incorrect records extracted) and minimized number of correct records not extracted. In this paper, we tackle this initial challenge by proposing a text mining pipeline for mining the qcl properties by extending the grammar rules of a conditional random field (CRF) based model using a rule-based approach. The properties of interest include: hetero-structure (laser stacking properties), working temperature, lasing frequency, laser thickness and the optical power. We evaluate the pipeline on sample open access journal papers from AIP, OPTICA and IOP Publishers.
Fichier principal
Vignette du fichier
Kerre_et_al_2023.pdf (357.18 Ko) Télécharger le fichier

Dates et versions

lirmm-04292731 , version 1 (17-11-2023)

Licence

Copyright (Tous droits réservés)

Identifiants

Citer

Deperias Kerre, Anne Laurent, Kenneth Maussang, Dickson Owuor. A Text Mining Pipeline for Mining the Quantum Cascade Laser Properties. ADBIS 2023 - 27th European Conference on Advances in Databases and Information Systems, Sep 2023, Barcelona, Spain. pp.393-406, ⟨10.1007/978-3-031-42941-5_34⟩. ⟨lirmm-04292731⟩
51 Consultations
9 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More