C. Workflow and S. Extraction, 121 11.2 Solving the Polysemy Detection issue, p.122

W. Methodology-for-polysemy-prediction and .. , 126 11.5 Subgraph created for the term t = Yellow Fever, 129 11.6 Solving the Term Sense Induction 132 11.7 Term Sense Induction Workflow. . . . . . . . . . . . . . . . . . . . 132

S. Linkage, C. Term, and .. , 140 11.10First Semantic Linkage step, 142 12.1 Data sets and Results of the Proposed Methodology. . . . . . . . . 143 12.2 Evaluation of Polysemy Detection 144 12.3 Decision Tree obtained from the Polysemic and Non-polysemic Data set, p.149

N. Albatineh, A. N. Albatineh, and M. Niewiadomska-bugaj, MCS: A Method for Finding the Number of Clusters, Journal of Classification, vol.91, issue.2, pp.184-209, 2011.
DOI : 10.1007/s00357-010-9069-1

M. J. Anderson, A new method for non-parametric multivariate analysis of variance, Austral ecology, vol.26, issue.1, pp.32-46, 2001.

E. H. Anguiano, Efficient large-context dependency parsing and correction with distributional lexical resources, p.115, 2013.
URL : https://hal.archives-ouvertes.fr/tel-00860720

. Arsevska, Exploiting textual source information for epidemiosurveillance, Metadata and Semantics Research, p.83, 2014.
URL : https://hal.archives-ouvertes.fr/lirmm-01184556

H. Assadi-]-assadi, Knowledge acquisition from texts: Using an automatic clustering method based on noun-modifier relationship, Proceedings of the eighth conference on European chapter of the Association for Computational Linguistics, pp.504-506, 1997.

H. Aubin, S. Aubin, and T. Hamon, Improving Term Extraction with Terminological Resources, Proceedings of the 5th International Conference Natural Language Processing, FinTAL'06, pp.380-387, 2006.
DOI : 10.1007/11816508_39

URL : https://hal.archives-ouvertes.fr/hal-00091444

. Aussenac-gilles, Revisiting Ontology Design: A Method Based on Corpus Analysis, Proceedings of the 12th European Workshop on Knowledge Acquisition, Modeling and Management, EKAW'00, pp.172-188, 2000.
DOI : 10.1007/3-540-39967-4_13

N. Aussenac-gilles and D. Bourigault, The th (ic) 2 initiative: Corpus-based thesaurus construction for indexing www documents, Proceedings of the EKAW conference, pp.3-20, 2000.

. Aussenac-gilles, The terminae method and platform for ontology engineering from texts The Netherlands, Proceedings of the 2008 Conference on Ontology Learning and Population: Bridging the Gap Between Text and Knowledge, pp.199-223, 2008.

. Bowker, . Pearson, L. Bowker, and J. Pearson, Working with specialized language: a practical guide to using corpora. Routledge, p.24, 2002.
DOI : 10.4324/9780203469255

. Boyd-graber, . Blei, J. Boyd-graber, and D. Blei, PUTOP, Proceedings of the 4th International Workshop on Semantic Evaluations, SemEval '07, pp.277-281, 2007.
DOI : 10.3115/1621474.1621534

. Boyd-graber, A topic model for word sense disambiguation, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL '07, pp.1024-1033, 2007.

. Brazdil, Metalearning, p.112, 2008.
DOI : 10.1007/978-1-4899-7502-7_543-1

P. Brin, S. Brin, and L. Page, The anatomy of a large-scale hypertextual web search engine. Computer networks and ISDN systems, pp.107-117, 1998.

L. Brody, S. Brody, and M. Lapata, Bayesian word sense induction, Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics on, EACL '09, pp.103-111, 2009.
DOI : 10.3115/1609067.1609078

. Budanitsky, . Hirst, A. Budanitsky, and G. Hirst, Evaluating WordNet-based Measures of Lexical Semantic Relatedness, Computational Linguistics, vol.17, issue.1, pp.13-47, 2006.
DOI : 10.1016/S0022-5371(79)90604-2

. Budanitsky, . Hirst, A. Budanitsky, and G. Hirst, Evaluating WordNet-based Measures of Lexical Semantic Relatedness, Computational Linguistics, vol.17, issue.1, pp.13-47, 2006.
DOI : 10.1016/S0022-5371(79)90604-2

. Buitelaar, A Prot??g?? Plug-In for Ontology Extraction from Text Based on Linguistic Analysis, Proceeding of the Semantic Web: Research and Applications, First European Semantic Web Symposium, ESWS'04, pp.31-44, 2004.
DOI : 10.1007/978-3-540-25956-5_3

. Bunescu, . Mooney, R. C. Bunescu, and R. J. Mooney, A shortest path dependency kernel for relation extraction, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing , HLT '05, pp.724-731, 2005.
DOI : 10.3115/1220575.1220666

. Cai, Improving word sense disambiguation using topic features, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL '07, pp.1015-1023, 2007.

. Cali?ski, . Harabasz, T. Cali?ski, and J. Harabasz, A dendrite method for cluster analysis, Communications in Statistics - Theory and Methods, vol.3, issue.1, pp.1-27, 1974.
DOI : 10.1080/03610927408827101

N. Casellas, Legal ontology engineering: methodologies, modelling trends, and the ontology of professional judicial knowledge, 2011.
DOI : 10.1007/978-94-007-1497-7

. Castiello, Metadata: Characterization of input features for meta-learning, In Modeling Decisions for Artificial Intelligence, pp.457-468, 2005.

. Charlet, Building medical ontologies by terminology extraction from texts: An experiment for the intensive care units, Computers in biology and medicine, pp.36857-870, 2006.
DOI : 10.1016/j.compbiomed.2005.04.012

. Clauset, Structural Inference of Hierarchies in Networks, Statistical network analysis: models, issues, and new directions, pp.1-13, 2007.
DOI : 10.1007/978-3-540-73133-7_1

K. Claveau, V. Claveau, and E. Kijak, Thésaurus distributionnels pour la recherche d'information et vice-versa, CORIA 2015 - Conférence en Recherche d'Infomations et Applications -12th French Information Retrieval Conference, pp.405-420, 2015.

. Claveau, Improving distributional thesauri by exploring the graph of neighbors, Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, pp.709-720, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01027545

. Clouet, . Daille, E. Clouet, and B. Daille, Compound??Terms and??Their??Multi-word??Variants: Case??of??German??and??Russian??Languages, Computational Linguistics and Intelligent Text Processing, pp.68-78, 2014.
DOI : 10.1007/978-3-642-54906-9_6

URL : https://hal.archives-ouvertes.fr/hal-01116119

. Conrado, Exploration of a Rich Feature Set for Automatic Term Extraction, Advances in Artificial Intelligence and Its Applications, pp.342-354, 2013.
DOI : 10.1007/978-3-642-45114-0_28

. Corcho, Law and the semantic web. chapter Building Legal Ontologies with METHONTOLOGY and WebODE, pp.142-157, 2005.

. Crossley, The Development of Polysemy and Frequency Use in English Second Language Speakers, Language Learning, vol.33, issue.1, pp.573-605, 2010.
DOI : 10.1111/j.1467-9922.2010.00568.x

. Cunningham, GATE, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics , ACL '02, pp.168-175, 2002.
DOI : 10.3115/1073083.1073112

M. Curran, J. R. Curran, and M. Moens, Scaling context space, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics , ACL '02, pp.231-238, 2002.
DOI : 10.3115/1073083.1073123

B. Daille, An evaluation of statistical scores for word association, Proceedings of the Tbilisi Symposium on Logic, Language and Computation: Selected Papers, pp.177-188, 1998.

. Daille, Towards automatic extraction of monolingual and bilingual terminology Association for Computational Linguis- tics, Proceedings of the 15th Conference on Computational Linguistics, pp.515-521, 1994.

M. Daille, B. Daille, and E. Morin, French-English Terminology Extraction from Comparable Corpora, Proceedings of the 2nd International Joint Conference Natural Language Processing, IJCNLP'05, pp.707-718, 2005.
DOI : 10.1007/11562214_62

URL : https://hal.archives-ouvertes.fr/hal-00444427

. Darmoni, Multiple Terminologies in a Health Portal: Automatic Indexing and Information Retrieval, In Artificial Intelligence in Medicine, vol.124, issue.Pt 2, pp.255-259, 2009.
DOI : 10.1016/j.ipm.2005.01.003

P. David, S. David, and P. Plante, De la nécessité d'une approche morpho-syntaxique dans l'analyse de textes, Intelligence artificielle et sciences cognitives au Québec, vol.3, issue.3 2, pp.140-154, 1990.

B. Davies, D. L. Davies, and D. W. Bouldin, A cluster separation measure. Pattern Analysis and Machine Intelligence, IEEE Transactions on, issue.2, pp.224-227, 1979.

. Decadt, Gambl: Genetic algorithm optimization of memorybased wsd, Proceedings of the 3rd Interna-tional Workshop on the Evaluation of Systems for the Semantic Analysis of Text (SENSEVAL-3), pp.108-112, 2004.

. Dehkordi, A novel hybrid structure for clustering, Advances in Computer Science and Engineering, pp.888-891, 2009.

. Déjean, H. Gaussier-]-déjean, and E. Gaussier, Une nouvelle approche à l'extraction de lexiques bilingues à partir de corpus comparables, p.24, 2002.

. Deléger, Translating medical terminologies through word alignment in parallel text corpora, Journal of Biomedical Informatics, vol.42, issue.4, pp.692-701, 2009.
DOI : 10.1016/j.jbi.2009.03.002

M. Di, N. Marco, A. Navigli, and R. , Clustering web search results with maximum spanning trees, Proceedings of the 12th International Conference on Artificial Intelligence Around Man and Beyond, AI*IA'11, pp.201-212, 2011.

D. Marco, D. Navigli, A. Marco, and R. Navigli, Clustering and Diversifying Web Search Results with Graph-Based Word Sense Induction, Computational Linguistics, vol.40, issue.1, pp.709-754, 2013.
DOI : 10.1023/B:MACH.0000027785.44527.d6

. Dixit, Design of an Automatic Ontology Construction Mechanism Using Semantic Analysis of the Documents, 2012 Fourth International Conference on Computational Intelligence and Communication Networks, pp.611-616, 2012.
DOI : 10.1109/CICN.2012.89

L. Dobrov, B. Dobrov, and N. Loukachevitch, Multiple evidence for term extraction in broad domains, Proceeding of Recent Advances in Natural Language Processing, pp.710-715, 2011.

. Doing-harris, Automated concept and relationship extraction for the semi-automated ontology management (SEAM) system, Journal of Biomedical Semantics, vol.9, issue.1, pp.15-114, 2015.
DOI : 10.1186/s13326-015-0011-7

S. Dongen, Performance criteria for graph clustering and markov cluster experiments, 2000.

N. Faure, D. Faure, and C. Nedellec, Knowledge acquisition of predicate argument structures from technical texts using Machine Learning: the system Asium, Proceedings of the 11th European Workshop on Knowledge Acquisition, Modeling and Management, EKAW'99, pp.329-334, 1999.
DOI : 10.1007/3-540-48775-1_22

. Faure, Acquisition of semantic knowledge using machine learning methods: The system" asium, 1998.

N. Faure, D. Faure, and C. Nédellec, A corpus-based conceptual clustering method for verb frames and ontology acquisition, LREC workshop on, pp.5-12, 1998.

. Fernández-lópez, Methontology: from ontological art towards ontological engineering, 1997.

O. Ferret, Testing semantic similarity measures for extracting synonyms from a corpus, LREC, pp.3338-3343, 2010.

O. Ferret, Combining bootstrapping and feature selection for improving a distributional thesaurus, ECAI, pp.336-341, 2012.

O. Ferret, Identifying bad semantic neighbors for improving distributional thesauri, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp.561-571, 2013.

V. Golub, . Loan, G. H. Golub, and C. F. Van-loan, Matrix computations, p.96, 1989.

. Gómez-pérez, Ontological Engineering: With Examples from the Areas of Knowledge Management, e-Commerce and the Semantic Web, Advanced Information and Knowledge Processing, 2007.

A. D. Gordon, Classification, (chapman & hall/crc monographs on statistics & applied probability), 1999.

G. Grefenstette, Explorations in Automatic Thesaurus Discovery, p.115, 1994.
DOI : 10.1007/978-1-4615-2710-7

C. Grozea, Finding optimal parameter settings for high performance word sense disambiguation, Proceedings of the 3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (SENSEVAL-3), pp.125-128, 2004.

T. R. Gruber, A translation approach to portable ontology specifications. Knowledge acquisition, pp.199-220, 1993.

. Habert, Symbolic word clustering for medium-size corpora Parameter optimization for machine-learning of word sense disambiguation, Proceedings of the 16th Conference on Computational Linguistics, pp.311-325, 1996.

. Huang, Unsupervised word sense induction using rival penalized competitive learning, Engineering Applications of Artificial Intelligence, vol.41, issue.C 2, pp.41166-174, 2015.
DOI : 10.1016/j.engappai.2015.02.004

L. Hubert, L. J. Hubert, and J. R. Levin, A general statistical framework for assessing categorical clustering in free recall., Psychological Bulletin, vol.83, issue.6, pp.1072-109, 1976.
DOI : 10.1037/0033-2909.83.6.1072

. Hullman, Content, Context, and Critique, Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, CSCW '15, pp.1170-1175, 2015.
DOI : 10.1145/2675133.2675207

E. Ide, N. Ide, and T. Erjavec, Automatic sense tagging using parallel corpora, Natural Language Pacific Rim Symposium (artificial intelligence), NLPRS '01, p.99, 2001.

C. Jacquemin, A symbolic and surgical acquisition of terms through variation, Connectionist, Statistical, and Symbolic Approaches to Learning for Natural Language Processing, pp.425-438, 1996.
DOI : 10.1007/3-540-60925-3_64

. Javed, Modeling inter-camera space???time and appearance relationships for tracking across non-overlapping views, Computer Vision and Image Understanding, vol.109, issue.2, pp.146-162, 2008.
DOI : 10.1016/j.cviu.2007.01.003

. Jayapandian, Domain Ontology As Conceptual Model for Big Data Management: Application in Biomedical Informatics, Conceptual Modeling, pp.144-157, 2014.
DOI : 10.1007/978-3-319-12206-9_12

. Ji, Chinese terminology extraction using window-based contextual information A text clustering system based on k-means type subspace clustering and ontology, Proceedings [Jing et al, pp.91-103, 2006.

J. , L. , G. H. Langley, and P. , Estimating continuous distributions in bayesian classifiers, Proceedings of the Eleventh conference on Uncertainty in artificial intelligence, pp.338-345, 1995.

. Jonquet, Ncbo annotator: Semantic annotation of biomedical data, 8th International Semantic Web Conference, Poster and Demonstration Session, ISWC'09, p.77, 2009.

. Kageura, . Umino, K. Kageura, and B. Umino, Methods of automatic term recognition: A review, Terminology International Journal of Theoretical and Applied Issues in Specialized Communication, vol.3, issue.2, pp.259-289, 1996.
DOI : 10.1075/term.3.2.03kag

N. Kambhatla, Combining lexical, syntactic, and semantic features with maximum entropy models for extracting relations, Proceedings of the ACL 2004 on Interactive poster and demonstration sessions -, p.114, 2004.
DOI : 10.3115/1219044.1219066

. Kinnunen, Comparison of clustering methods: A case study of text-independent speaker modeling, Pattern Recognition Letters, vol.32, issue.13, pp.1604-1617, 2011.
DOI : 10.1016/j.patrec.2011.06.023

M. Klapaftis, I. P. Klapaftis, and S. Manandhar, Word sense induction using graphs of collocations The Netherlands, The Netherlands, Proceedings of the 2008 Conference on ECAI 2008: 18th European Conference on Artificial Intelligence, ECAI '08, pp.298-302, 2008.

H. T. Ng, An empirical evaluation of knowledge sources and learning algorithms for word sense disambiguation, Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, pp.41-48, 2002.

. Lemke, Metalearning: a survey of trends and technologies, Artificial Intelligence Review, vol.3, issue.2, pp.117-130, 2013.
DOI : 10.1007/s10462-013-9406-y

. Lemke, Dynamic combination of forecasts generated by diversification procedures applied to forecasting of airline cancellations, 2009 IEEE Symposium on Computational Intelligence for Financial Engineering, p.112, 2009.
DOI : 10.1109/CIFER.2009.4937507

. Liang, Determining the number of clusters using information entropy for mixed data, Pattern Recognition, vol.45, issue.6, pp.2251-2265, 2012.
DOI : 10.1016/j.patcog.2011.12.017

D. Lin, Automatic retrieval and clustering of similar words, Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics of ACL-COLING '98, pp.768-774, 1998.

D. Lin, Automatic retrieval and clustering of similar words, Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, pp.768-774, 1998.

. Lin, D. Lin, and P. Pantel, DIRT @SBT@discovery of inference rules from text, Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '01, pp.323-328, 2001.
DOI : 10.1145/502512.502559

. Liu, Effectiveness of Lexico-syntactic Pattern Matching for Ontology Enrichment with Clinical Documents, Methods of Information in Medicine, vol.50, issue.5, pp.397-114, 2011.
DOI : 10.3414/ME10-01-0020

]. Clouet and E. , Processing of Compound Terms: Segmentation, Translation and Variation. Theses, p.22, 2014.
URL : https://hal.archives-ouvertes.fr/tel-01116104

. Lossio-ventura, Conversations reconstruction in the social web, Proceedings of the 21st international conference companion on World Wide Web, WWW '12 Companion, p.12, 2012.
DOI : 10.1145/2187980.2188133

. Lossio-ventura, Biomedical terminology extraction: A new combination of statistical and web mining approaches, Proceedings of the Journées internationales d'Analyse statistique des Données Textuelles, pp.14-78, 2014.
URL : https://hal.archives-ouvertes.fr/lirmm-01056598

. Lossio-ventura, BIOTEX: A system for biomedical terminology extraction , ranking, and validation, Proceedings of the 13th International Semantic Web Conference, Posters & Demonstrations Track, ISWC'14, pp.157-160, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01136531

. Lossio-ventura, Biomedical term extraction: overview and a new methodology, Information Retrieval Journal, vol.14, issue.1, pp.59-99, 2015.
DOI : 10.1007/s10791-015-9262-2

URL : https://hal.archives-ouvertes.fr/lirmm-01274539

. Lossio-ventura, Automatic biomedical term polysemy detection, Proceedings of the 10th International Language Resources and Evaluation Conference, LREC'2016, p.page, 2016.

. Lossio-ventura, Communication overload management through social interactions clustering, Proceedings of the 31st Annual ACM Symposium on Applied Computing, SAC '16, p.139, 2016.
DOI : 10.1145/2851613.2851984

URL : https://hal.archives-ouvertes.fr/lirmm-01362442

. Lossio-ventura, A way to automatically enrich biomedical ontologies, Proceedings of the 19th International Conference on Extending Database Technology , EDBT'2016, p.135, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01274540

. Lundqvist, Ontology supported competency system, International Journal of Knowledge and Learning, vol.7, issue.3/4, pp.3-4197, 2011.
DOI : 10.1504/IJKL.2011.044539

. Lv, . Zhai, Y. Lv, and C. Zhai, Adaptive term frequency normalization for BM25, Proceedings of the 20th ACM international conference on Information and knowledge management, CIKM '11, pp.1985-1988, 2011.
DOI : 10.1145/2063576.2063871

. Lv, . Zhai, Y. Lv, and C. Zhai, When documents are very long, BM25 fails! In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp.1103-1104, 2011.

S. Madden, From Databases to Big Data, IEEE Internet Computing, vol.16, issue.3, pp.4-6, 2012.
DOI : 10.1109/MIC.2012.50

S. Maedche, A. Maedche, and S. Staab, Discovering conceptual relations from text, Proceedings of the 14th European Conference on Artificial Intelligence, p.27, 2000.

. Maedche, A. Volz-]-maedche, and R. Volz, The ontology extraction & maintenance framework text-to-onto, Proceedings of the Workshop on Integrating Data Mining and Knowledge Management the 2001 IEEE International Conference on Data Mining, Workshop-ICDM'01, pp.1-12, 2001.

. Manandhar, SemEval-2010 task 14, Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions, DEW '09, pp.63-68, 2010.
DOI : 10.3115/1621969.1621990

S. Manning, C. D. Manning, and H. Schütze, Foundations of statistical natural language processing, 1999.

. Marelli, SemEval-2014 Task 1: Evaluation of Compositional Distributional Semantic Models on Full Sentences through Semantic Relatedness and Textual Entailment, Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), p.113, 2014.
DOI : 10.3115/v1/S14-2001

F. Marriott, Practical Problems in a Method of Cluster Analysis, Biometrics, vol.27, issue.3, pp.501-514, 1971.
DOI : 10.2307/2528592

M. Ishizuka, Keyword extraction from a single document using word co-occurrence statistical information, International Journal on Artificial Intelligence Tools, vol.13, issue.01 2, pp.157-169, 2004.

D. Maudsley, A Theory of Meta-learning and Principles of Facilitation : an Organismic Perspective. Thesis, 1979.

. Maynard, SPRAT: a tool for automatic semantic pattern-based ontology population, International Conference for Digital Libraries and the Semantic Web, pp.2-10, 2009.

B. T. Mcinnes, An unsupervised vector approach to biomedical term disambiguation, Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies Student Research Workshop, HLT '08, pp.49-54, 2008.
DOI : 10.3115/1564154.1564165

P. Mcinnes, B. T. Mcinnes, and T. Pedersen, Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text, Journal of Biomedical Informatics, vol.46, issue.6, pp.1116-1124, 2013.
DOI : 10.1016/j.jbi.2013.08.008

. Mcinnes, Knowledge-based method for determining the meaning of ambiguous biomedical terms using information content measures of similarity American Medical Informatics Association, AMIA Annual Symposium Proceedings, pp.895-106, 2011.

S. Mcinnes, B. T. Mcinnes, and M. Stevenson, Determining the difficulty of Word Sense Disambiguation, Journal of Biomedical Informatics, vol.47, pp.83-90, 2014.
DOI : 10.1016/j.jbi.2013.09.009

. Medelyan, Humancompetitive tagging using automatic keyphrase extraction, Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pp.1318-1327, 2009.

. Medelyan, O. Witten-]-medelyan, and I. H. Witten, Thesaurus based automatic keyphrase indexing, Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries , JCDL '06, pp.296-297, 2006.
DOI : 10.1145/1141753.1141819

F. Mihalcea, R. Mihalcea, and E. Faruque, SenseLearner, Proceedings of the ACL 2005 on Interactive poster and demonstration sessions , ACL '05, pp.155-158, 2004.
DOI : 10.3115/1225753.1225767

C. Milligan, G. W. Milligan, and M. C. Cooper, An examination of procedures for determining the number of clusters in a data set, Psychometrika, vol.77, issue.2, pp.159-179, 1985.
DOI : 10.1007/BF02294245

B. Mirkin, Choosing the number of clusters, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol.125, issue.1, pp.252-260, 2011.
DOI : 10.1002/widm.15

T. Mondary, Construction d'ontologies à partir de textes. L'apport de l'analyse de concepts formels. Theses, 2011.
URL : https://hal.archives-ouvertes.fr/tel-00596825

. Moon, Challenges and Practical Approaches with Word Sense Disambiguation of Acronyms and Abbreviations in the Clinical Domain, Healthcare Informatics Research, vol.21, issue.1, pp.35-42, 2015.
DOI : 10.4258/hir.2015.21.1.35

P. Morin, E. Morin, and E. Prochasson, Bilingual lexicon extraction from comparable corpora enhanced with parallel corpora, Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web, pp.27-34, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00608475

H. Morris, J. Morris, and G. Hirst, Non-classical lexical semantic relations, Proceedings of the HLT-NAACL Workshop on Computational Lexical Semantics, CLS '04, pp.46-51, 2004.
DOI : 10.3115/1596431.1596438

D. Murdoch, T. B. Murdoch, and A. S. Detsky, The Inevitable Application of Big Data to Health Care, JAMA, vol.309, issue.13, pp.3091351-1352, 2013.
DOI : 10.1001/jama.2013.393

M. Nakagawa, H. Nakagawa, and T. Mori, A simple but powerful automatic term extraction method, COLING-02 on COMPUTERM 2002 second international workshop on computational terminology -, pp.1-7, 2002.
DOI : 10.3115/1118771.1118778

R. Navigli, Word sense disambiguation, ACM Computing Surveys, vol.41, issue.2, pp.10-105, 2009.
DOI : 10.1145/1459352.1459355

R. Navigli, A quick tour of word sense disambiguation, induction and related approaches Theory and practice of computer science, SOFSEM 2012 Research and Development in Information Retrieval, SIGIR'09, pp.115-129, 2012.

. Noy, BioPortal: ontologies and integrated data resources at the click of a mouse, Nucleic Acids Research, vol.37, issue.Web Server, pp.170-173, 2009.
DOI : 10.1093/nar/gkp440

URL : https://hal.archives-ouvertes.fr/hal-00492020

. Nzali, Construction d'un vocabulaire patient/médecin dédié au cancer du sein à partir des médias sociaux, Actes de 25es journées francophones d'Ingénierie des Connaissances, p.82, 2015.

. Opsahl, Node centrality in weighted networks: Generalizing degree and shortest paths, Social Networks, vol.32, issue.3, pp.245-251, 2010.
DOI : 10.1016/j.socnet.2010.03.006

. Padró, Nothing like Good Old Frequency: Studying Context Filters for Distributional Thesauri, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.419-424, 2014.
DOI : 10.3115/v1/D14-1047

. Page, The pagerank citation ranking: Bringing order to the web, p.27, 1999.

. Pantel, Web-scale distributional similarity and entity set expansion, Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing Volume 2, EMNLP '09, pp.938-947, 2009.
DOI : 10.3115/1699571.1699635

L. Pantel, P. Pantel, and D. Lin, Discovering word senses from text, Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '02, pp.613-619, 2002.
DOI : 10.1145/775047.775138

T. Pedersen, UMND2, Proceedings of the 4th International Workshop on Semantic Evaluations, SemEval '07, pp.394-397, 2007.
DOI : 10.3115/1621474.1621561

T. Pedersen, Duluth-wsi: Senseclusters applied to the sense induction task of semeval-2, Proceedings of the 5th International Workshop on Semantic Evaluation, pp.363-366, 2010.

. Pérez, Neon methodology for building ontology networks: Ontology specification, Methodology, pp.1-18, 2008.

. Pfahringer, Tell me who can learn you and i can tell you who you are: Landmarking various learning algorithms, Proceedings of the Seventeenth International Conference on Machine Learning, ICML2000, pp.743-750, 2000.

. Pinto, UPV-SI, Proceedings of the 4th International Workshop on Semantic Evaluations, SemEval '07, pp.430-433, 2007.
DOI : 10.3115/1621474.1621570

. Pinto, Diligent: Towards a fine-grained methodology for distributed, loosely-controlled and evolving, Proceedings of the 16th European Conference on Artificial Intelligence, p.393, 2004.

J. C. Platt, Advances in kernel methods. chapter Fast training of support vector machines using sequential minimal optimization, pp.185-208, 1999.

C. Polajnar, T. Polajnar, and S. Clark, Improving Distributional Semantic Vectors through Context Selection and Normalisation, Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, pp.230-238, 2014.
DOI : 10.3115/v1/E14-1025

N. Ponzetto, S. P. Ponzetto, and R. Navigli, Knowledge-rich word sense disambiguation rivaling supervised systems, Proceedings of the 48th annual meeting of the association for computational linguistics, pp.1522-1531, 2010.

P. Purandare, A. Purandare, and T. Pedersen, Word sense discrimination by clustering contexts in vector and similarity spaces, Proceedings of the Conference on Computational Natural Language Learning, p.97, 2004.

. Qian, Exploiting constituent dependencies for tree kernel-based semantic relation extraction, Proceedings of the 22nd International Conference on Computational Linguistics, COLING '08, pp.697-704, 2008.
DOI : 10.3115/1599081.1599169

J. R. Quinlan, C4.5: programs for machine learning, p.131, 1993.

. Qureshi, Shorttext domain specific key terms/phrases extraction using an n-gram model with wikipedia, Proceedings of the 21st ACM International Conference on Information and Knowledge Management, CIKM'12, pp.2515-2518, 2012.

. Rada, Development and application of a metric on semantic nets. Systems, Man and Cybernetics, IEEE Transactions on, vol.19, issue.1, pp.17-30, 1989.

. Rao, Entity Linking: Finding Extracted Entities in a Knowledge Base, Multi-source, multilingual information extraction and summarization, pp.93-115, 2013.
DOI : 10.1007/978-3-642-28569-1_5

. Ratkowsky, D. Lance-]-ratkowsky, and G. Lance, A criterion for determining the number of groups in a classification, Australian Computer Journal, vol.10, issue.3, pp.115-117, 1978.

H. Ravichandran, D. Ravichandran, and E. Hovy, Learning surface text patterns for a Question Answering system, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics , ACL '02, pp.41-47, 2002.
DOI : 10.3115/1073083.1073092

. Rebholz-schuhmann, Text processing through Web services: calling Whatizit, Bioinformatics, vol.24, issue.2, pp.296-298, 2008.
DOI : 10.1093/bioinformatics/btm557

. Reif, Dataset generation for meta-learning, 35th German Conference on Artificial Intelligence, 2012.

. Reif, Meta2-features: Providing meta-learners more information, 35th German Conference on Artificial Intelligence, p.112, 2012.

. Reif, Automatic classifier selection for non-experts, Pattern Analysis and Applications, vol.8, issue.7, pp.83-96, 2014.
DOI : 10.1007/s10044-012-0280-z

. Robertson, Okapi at TREC-7: automatic ad hoc, filtering, vlc and interactive track, pp.253-264, 1999.

F. Roche, M. Roche, and S. Fortuno, La fouille de textes au service de la documentation, Arabesques, vol.76, pp.13-14, 2014.
URL : https://hal.archives-ouvertes.fr/lirmm-01071877

. Roche, Extraction automatique des mots-clés à partir de publications scientifiques pour l'indexation et l'ouverture des données en agronomie, Cahiers Agricultures, p.83, 2015.

. Roche, EXIT: Un système itératif pour l'extraction de la terminologie du domaine à partir de corpus spécialisés, 7th International Conference on Statistical Analysis of Textual Data, pp.946-956, 2004.

M. Roche and V. Prince, A web-mining approach to disambiguate biomedical acronym expansions, Informatica, vol.34, issue.2, 2010.
URL : https://hal.archives-ouvertes.fr/lirmm-00487536

T. M. Roche, Traitement automatique des données hétérogènes liées à l'aménagement des territoires, Proceedings of Association de Science Régionale de Langue Française, p.83, 2015.

. Rose, Automatic Keyword Extraction from Individual Documents, Text Mining: Theory and Applications, pp.1-20, 2010.
DOI : 10.1002/9780470689646.ch1

F. Rousseau and M. Vazirgiannis, Main Core Retention on Graph-of-Words for Single-Document Keyword Extraction, Advances in Information Retrieval, pp.382-393, 2015.
DOI : 10.1007/978-3-319-16354-3_42

P. J. Rousseeuw, Silhouettes: A graphical aid to the interpretation and validation of cluster analysis, Journal of Computational and Applied Mathematics, vol.20, pp.53-65, 1987.
DOI : 10.1016/0377-0427(87)90125-7

. Rychlý, P. Kilgarriff-]-rychlý, and A. Kilgarriff, An efficient algorithm for building a distributional thesaurus (and other Sketch Engine developments), Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, ACL '07, pp.41-44, 2007.
DOI : 10.3115/1557769.1557783

. Salton, G. Buckley-]-salton, and C. Buckley, Term-weighting approaches in automatic text retrieval. Information processing & management, pp.513-523, 1988.

M. Sánchez, D. Sánchez, and A. Moreno, Learning non-taxonomic relationships from web documents for domain ontology construction, Data & Knowledge Engineering, vol.64, issue.3, pp.600-623, 2008.
DOI : 10.1016/j.datak.2007.10.001

W. S. Sarle, Cubic clustering criterion. SAS Institute, p.109, 1983.

P. Savova, G. Savova, and T. Pedersen, Resolving ambiguities in biomedical text with unsupervised clustering approaches, pp.97-98, 2005.

. Savova, Word sense disambiguation across two domains: Biomedical literature and clinical notes, Journal of Biomedical Informatics, vol.41, issue.6, pp.411088-1100, 2008.
DOI : 10.1016/j.jbi.2008.02.003

. Schuemie, Word Sense Disambiguation in the Biomedical Domain: An Overview, Journal of Computational Biology, vol.12, issue.5, pp.554-565, 2005.
DOI : 10.1089/cmb.2005.12.554

H. Schütze, Automatic word sense discrimination, Computational linguistics, vol.24, issue.1, pp.97-123, 1998.

V. Sclano, F. Sclano, and P. Velardi, TermExtractor: a Web Application to Learn the Shared Terminology of Emergent Web Communities, Enterprise Interoperability II, pp.287-290, 2007.
DOI : 10.1007/978-1-84628-858-6_32

S. Scott, A. J. Scott, and M. J. Symons, Clustering Methods Based on Likelihood Ratio Criteria, Biometrics, vol.27, issue.2, pp.387-397, 1971.
DOI : 10.2307/2529003

A. Séguéla, P. Séguéla, and N. Aussenac-gilles, Extraction de relations sémantiques entre termes et enrichissement de modèles du domaine, Conférence ingénierie des connaissances, pp.79-88, 1999.

S. Sharoff, In the Garden and in the Jungle, Genres on the Web, pp.149-166, 2011.
DOI : 10.1007/978-90-481-9178-9_7

. Shen, Entity linking with a knowledge base: Issues, techniques, and solutions. Knowledge and Data Engineering, IEEE Transactions on, vol.27, issue.2, pp.443-460, 2015.

. Singhal, Pivoted document length normalization, Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '96, pp.21-29, 1996.
DOI : 10.1145/243199.243206

F. Smadja, Retrieving collocations from text: Xtract, Computation Linguistic, vol.19, issue.1, pp.143-177, 1993.

[. Jones and K. , Synonymy and Semantic Clas- sification, p.115, 1986.

. Spasic, FlexiTerm: a flexible term recognition method, Journal of Biomedical Semantics, vol.4, issue.1, 2013.
DOI : 10.3163/1536-5050.97.4.009

. Staab, Knowledge processes and ontologies, IEEE Intelligent Systems, vol.16, issue.1, pp.26-34, 2001.
DOI : 10.1109/5254.912382

. Stevenson, Disambiguation of biomedical text using diverse sources of information, S7. 2 citations, pp.89-106, 2008.
DOI : 10.1186/1471-2105-9-S11-S7

P. Stoykova, V. Stoykova, and E. Petkova, Automatic extraction of mathematical terms for precalculus, Procedia Technology, vol.1, pp.464-468, 2012.
DOI : 10.1016/j.protcy.2012.02.102

. Suchanek, Combining linguistic and statistical analysis to extract relations from web documents, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '06, pp.712-717, 2006.
DOI : 10.1145/1150402.1150492

. Sy, User centered and ontology based information retrieval system for life sciences, BMC Bioinformatics, vol.13, issue.Suppl 1, pp.13-17, 2012.
DOI : 10.1038/ng.81

URL : https://hal.archives-ouvertes.fr/inserm-00662993

. Tamura, Bilingual lexicon extraction from comparable corpora using label propagation, Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL'12, pp.24-36, 2012.

. Tang, Statistical word sense aware topic models, Soft Computing, vol.101, issue.suppl. 1, pp.1-15, 2014.
DOI : 10.1007/s00500-014-1372-z

. Teh, Hierarchical Dirichlet Processes, Journal of the American Statistical Association, vol.101, issue.476, pp.1566-1581, 2006.
DOI : 10.1198/016214506000000302

L. Tian, Y. Tian, and D. Lo, A comparative study on the effectiveness of part-of-speech tagging techniques on bug reports, 2015 IEEE 22nd International Conference on Software Analysis, Evolution, and Reengineering (SANER), pp.570-574, 2015.
DOI : 10.1109/SANER.2015.7081879

. Tibshirani, Estimating the number of clusters in a data set via the gap statistic, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.63, issue.2, pp.411-423, 2001.
DOI : 10.1111/1467-9868.00293

J. Vanschoren, Understanding machine learning performance with experiment databases. lirias. kuleuven. be, no, p.113, 2010.

. Velardi, A taxonomy learning method and its application to characterize a scientific web community . Knowledge and Data Engineering, IEEE Transactions on, vol.19, issue.2 2, pp.180-191, 2007.

J. Véronis, HyperLex: lexical cartography for information retrieval, Computer Speech & Language, vol.18, issue.3, pp.223-252, 2004.
DOI : 10.1016/j.csl.2004.05.002

D. Vilalta, R. Vilalta, and Y. Drissi, A characterization of difficult problems in classification, Proceedings of the 6th European conference on principles and practice of knowledge discovery in databases, pp.85-91, 2009.

. Wang, Medical synonym extraction with concept space models, 2015.

. Wang, A sense-topic model for word sense induction with unsupervised data enrichment, Transactions of the Association for Computational Linguistics, vol.3, issue.2, pp.59-71, 2015.

D. Widdows, D. Widdows, and B. Dorow, A graph model for unsupervised lexical acquisition, Proceedings of the 19th international conference on Computational linguistics -, pp.1-7, 2002.
DOI : 10.3115/1072228.1072342

M. Yan, Methods of determining the number of clusters in a data set and a new clustering criterion, p.108, 2005.

. Yao, Structured relation discovery using generative models, Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp.1456-1466, 2011.

V. Yao, . Durme, X. Yao, and B. Van-durme, Nonparametric bayesian word sense induction, Proceedings of TextGraphs-6, 2011.

G. Zadeh, R. B. Zadeh, and A. Goel, Dimension independent similarity computation, Journal of Machine Learning Research, vol.14, issue.1, pp.1605-1626, 2013.

T. Zesch and I. Gurevych, Wisdom of crowds versus wisdom of linguists ??? measuring the semantic relatedness of words, Natural Language Engineering, vol.17, issue.01, pp.25-59, 2010.
DOI : 10.3115/1219840.1219887