Enhanced Functionalities for Annotating and Indexing Clinical Text with the NCBO Annotator+

Andon Tchechmedjiev 1 Amine Abdaoui 2 Vincent Emonet 1 Soumia Melzi 1 Jitendra Jonnagaddala 3 Clement Jonquet 4, 1
1 SMILE - Système Multi-agent, Interaction, Langage, Evolution
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
2 ADVANSE - ADVanced Analytics for data SciencE
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : Summary: Second use of clinical data commonly involves annotating biomedical text with terminologies and ontolo-gies. The National Center for Biomedical Ontology Annotator is a frequently used annotation service, originally de-signed for biomedical data, but not very suitable for clinical text annotation. In order to add new functionalities to the NCBO Annotator without hosting or modifying the original Web service, we have designed a proxy architecture that enables seamless extensions by pre-processing of the input text and parameters, and post processing of the annotations. We have then implemented enhanced functionalities for annotating and indexing free text such as: scoring, detection of context (negation, experiencer, temporality), new output formats, and coarse-grained concept recognition (with UMLS Semantic Groups). In this paper, we present the NCBO Annotator+, a Web service which incorporates these new functionalities as well as a small set of evaluation results for concept recognition and clini-cal context detection on two standard evaluation tasks (Clef eHealth 2017, SemEval 2014). Availability and Implementation: The Annotator+ has been successfully integrated into the SIFR BioPortal platform –an implementation of NCBO BioPortal for French biomedical terminologies and ontologies– to annotate English text. A Web user interface is available for testing and ontology selection (http://bioportal.lirmm.fr/ncbo_annotatorplus); however the Annotator+ is meant to be used through the Web service application programming interface (http://services.bioportal.lirmm.fr/ncbo_annotatorplus). The code is openly availa-ble, and we also provide a Docker packaging to enable easy local deployment to process sensitive (e.g., clinical) data in-house (https://github.com/sifrproject). Contact: andon.tchechmedjiev@lirmm.fr and jonquet@lirmm.fr Supplementary information: Technical details and documentation available online.
Liste complète des métadonnées

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01680514
Contributeur : Clement Jonquet <>
Soumis le : mardi 8 mai 2018 - 00:22:30
Dernière modification le : jeudi 24 mai 2018 - 15:59:25

Fichier

Article_Bioinformatics_2018_NC...
Fichiers éditeurs autorisés sur une archive ouverte

Licence


Distributed under a Creative Commons Paternité 4.0 International License

Identifiants

Collections

Citation

Andon Tchechmedjiev, Amine Abdaoui, Vincent Emonet, Soumia Melzi, Jitendra Jonnagaddala, et al.. Enhanced Functionalities for Annotating and Indexing Clinical Text with the NCBO Annotator+. Bioinformatics, Oxford University Press (OUP), inPress, 〈10.1093/bioinformatics/bty009〉. 〈lirmm-01680514〉

Partager

Métriques

Consultations de la notice

276

Téléchargements de fichiers

10