E. Agirre and P. Edmonds, Word Sense Disambiguation: Algorithms and Applications, 2007.
DOI : 10.1007/978-1-4020-4809-8

URL : https://hal.archives-ouvertes.fr/artxibo-00080512

E. Agirre and A. Soroa, Semeval-2007 task 02, Proceedings of the 4th International Workshop on Semantic Evaluations, SemEval '07, pp.7-12, 2007.
DOI : 10.3115/1621474.1621476

E. Agirre and A. Soroa, UBC-AS, Proceedings of the 4th International Workshop on Semantic Evaluations, SemEval '07, pp.346-349, 2007.
DOI : 10.3115/1621474.1621549

H. Al-Mubaid and P. Chen, Biomedical term disambiguation: An application to gene-protein name disambiguation, Proceedings of the Third International Conference on Information Technology: New Generations, ITNG '06, pp.606-612, 2006.

H. Al-Mubaid and S. Gungu, A learning-based approach for biomedical word sense disambiguation, The Scientific World Journal, 2012.

L. Albano, D. Beneventano, and S. Bergamaschi, Word Sense Induction with Multilingual Features Representation, 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), pp.343-349, 2014.
DOI : 10.1109/WI-IAT.2014.117

L. Albano, D. Beneventano, and S. Bergamaschi, Multilingual Word Sense Induction to Improve Web Search Result Clustering, Proceedings of the 24th International Conference on World Wide Web, WWW '15 Companion, pp.835-839, 2015.
DOI : 10.1109/TPAMI.2009.36

A. N. Albatineh and M. Niewiadomska-Bugaj, MCS: A Method for Finding the Number of Clusters, Journal of Classification, vol.91, issue.2, pp.184-209, 2011.
DOI : 10.1016/S0165-0114(96)00157-1

M. J. Anderson, A new method for non-parametric multivariate analysis of variance, Austral ecology, vol.26, issue.1, pp.32-46, 2001.

T. Baldwin, Y. Li, B. Alexe, and I. R. Stanoi, Automatic term ambiguity detection, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp.804-809, 2013.

W. Blair and B. Smith, Nursing documentation: Frameworks and barriers, Contemporary Nurse, vol.41, issue.2, pp.160-168, 2012.
DOI : 10.5172/conu.2012.41.2.160

D. M. Blei, A. Y. Ng, and M. I. Jordan, Latent Dirichlet allocation, Journal of Machine Learning Research, vol.3, pp.993-1022, 2003.

J. G. Booth, G. Casella, and J. P. Hobert, Clustering using objective functions and stochastic search, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.37, issue.1, pp.119-139, 2008.
DOI : 10.1017/CBO9780511805967

S. Bordag, Word sense induction: Triplet-based clustering and automatic evaluation, Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics, EACL'06, pp.137-144, 2006.

S. Brody and M. Lapata, Bayesian word sense induction, Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL '09, pp.103-111, 2009.
DOI : 10.3115/1609067.1609078

T. Caliński and J. Harabasz, A dendrite method for cluster analysis, Communications in Statistics - Theory and Methods, vol.3, issue.1, pp.1-27, 1974.
DOI : 10.1080/03610927408827101

J. Camacho-Collados, M. T. Pilehvar, and R. Navigli, NASARI: Integrating explicit knowledge and corpus statistics for a multilingual representation of concepts and entities, Artificial Intelligence, vol.240, pp.36-64, 2016.
DOI : 10.1016/j.artint.2016.07.005

R. Chasin, A. Rumshisky, O. Uzuner, and P. Szolovits, Word sense disambiguation in the clinical domain: a comparison of knowledge-rich and knowledge-poor unsupervised methods, Journal of the American Medical Informatics Association, vol.24, issue.22, pp.842-849, 2014.
DOI : 10.1198/016214506000000302

P. Chen, W. Ding, C. Bowes, and D. Brown, A fully unsupervised word sense disambiguation method using dependency knowledge, Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL '09, pp.28-36, 2009.
DOI : 10.3115/1620754.1620759

D. K. Choe and E. Charniak, Naive Bayes word sense induction, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, EMNLP '13, pp.1433-1437, 2013.

J. J. Cimino, Auditing the Unified Medical Language System with Semantic Methods, Journal of the American Medical Informatics Association, vol.30, issue.1, pp.41-51, 1998.
DOI : 10.1016/0169-2607(94)90020-5

J. J. Cimino, Battling Scylla and Charybdis: the search for redundancy and ambiguity in the 2001 UMLS Metathesaurus, Proceedings of the AMIA Symposium, p.120, American Medical Informatics Association, 2001.

P. Cook, J. H. Lau, D. McCarthy, and T. Baldwin, Novel word-sense identification, Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers, pp.1624-1635, 2014.

D. L. Davies and D. W. Bouldin, A cluster separation measure, IEEE Transactions on Pattern Analysis and Machine Intelligence, issue.2, pp.224-227, 1979.

M. Y. Dehkordi, R. Boostani, and M. Tahmasebi, A novel hybrid structure for clustering, Advances in Computer Science and Engineering, pp.888-891, 2009.

B. Dorow and D. Widdows, Discovering corpus-specific word senses, Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics, EACL '03, pp.79-82, 2003.
DOI : 10.3115/1067737.1067753

W. Duan, M. Song, and A. Yates, Fast max-margin clustering for unsupervised word sense disambiguation in biomedical texts, BMC Bioinformatics, vol.10, issue.Suppl 3, 2009.
DOI : 10.1186/1471-2105-10-S3-S4

R. O. Duda and P. E. Hart, Pattern classification and scene analysis, 1973.

L. Frermann and M. Lapata, A Bayesian model of diachronic meaning change, Transactions of the Association for Computational Linguistics, pp.31-45, 2016.

A. D. Gordon, Classification, Chapman & Hall/CRC Monographs on Statistics & Applied Probability, 1999.

M. Halkidi and M. Vazirgiannis, Clustering validity assessment: finding the optimal partitioning of a data set, Proceedings 2001 IEEE International Conference on Data Mining, pp.187-194, 2001.
DOI : 10.1109/ICDM.2001.989517

M. Halkidi, M. Vazirgiannis, and Y. Batistakis, Quality Scheme Assessment in the Clustering Process, Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery, PKDD '00, pp.265-276, 2000.
DOI : 10.1007/3-540-45372-5_26

Y. Huang, X. Shi, J. Su, Y. Chen, and G. Huang, Unsupervised word sense induction using rival penalized competitive learning, Engineering Applications of Artificial Intelligence, vol.41, issue.C, pp.166-174, 2015.
DOI : 10.1016/j.engappai.2015.02.004

N. Ide and T. Erjavec, Automatic sense tagging using parallel corpora, Natural Language Pacific Rim Symposium (artificial intelligence), NLPRS '01, 2001.

O. Javed, K. Shafique, Z. Rasheed, and M. Shah, Modeling inter-camera space-time and appearance relationships for tracking across non-overlapping views, Computer Vision and Image Understanding, vol.109, issue.2, pp.146-162, 2008.
DOI : 10.1016/j.cviu.2007.01.003

A. Jimeno-Yepes, Higher order features and recurrent neural networks based on long-short term memory nodes in supervised biomedical word sense disambiguation, 1604.

A. J. Jimeno-Yepes, B. T. McInnes, and A. R. Aronson, Exploiting MeSH indexing in MEDLINE to generate a data set for word sense disambiguation, BMC Bioinformatics, vol.12, issue.1, p.223, 2011.
DOI : 10.1016/j.ijmedinf.2005.03.013

I. P. Klapaftis and S. Manandhar, Word sense induction using graphs of collocations, Proceedings of the 2008 Conference on ECAI 2008: 18th European Conference on Artificial Intelligence, ECAI '08, pp.298-302, 2008.

I. P. Klapaftis and S. Manandhar, Word sense induction & disambiguation using hierarchical random graphs, Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp.745-755, 2010.

A. Kolesnikov, E. Trichina, and T. Kauranne, Estimating the number of clusters in a numerical data set via quantization error modeling, Pattern Recognition, vol.48, issue.3, pp.941-952, 2015.
DOI : 10.1016/j.patcog.2014.09.017

M. Köper and S. Schulte im Walde, A rank-based distance measure to detect polysemy and to determine salient vector-space features for German prepositions, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC'14, European Language Resources Association (ELRA), pp.4459-4466, 2014.

I. Korkontzelos and S. Manandhar, Uoy: Graphs of unambiguous vertices for word sense induction and disambiguation, Proceedings of the 5th International Workshop on Semantic Evaluation, SemEval '10, pp.355-358, 2010.

W. J. Krzanowski and Y. Lai, A Criterion for Determining the Number of Groups in a Data Set Using Sum-of-Squares Clustering, Biometrics, vol.44, issue.1, pp.23-34, 1988.
DOI : 10.2307/2531893

J. H. Lau, P. Cook, D. McCarthy, D. Newman, and T. Baldwin, Word sense induction for novel sense detection, Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pp.591-601, 2012.

Y. K. Lee and H. T. Ng, An empirical evaluation of knowledge sources and learning algorithms for word sense disambiguation, Proceedings of the ACL-02 conference on Empirical methods in natural language processing, EMNLP '02, pp.41-48, 2002.
DOI : 10.3115/1118693.1118699

J. Liang, X. Zhao, D. Li, F. Cao, and C. Dang, Determining the number of clusters using information entropy for mixed data, Pattern Recognition, vol.45, issue.6, pp.2251-2265, 2012.
DOI : 10.1016/j.patcog.2011.12.017

D. Lin, Automatic retrieval and clustering of similar words, Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics of ACL-COLING '98, pp.768-774, 1998.

J. A. Lossio-Ventura, C. Jonquet, M. Roche, and M. Teisseire, BIOTEX: A system for biomedical terminology extraction, ranking, and validation, Proceedings of the 13th International Semantic Web Conference, Posters & Demonstrations Track, ISWC'14, pp.157-160, 2014.
URL : https://hal.archives-ouvertes.fr/lirmm-01112894

J. A. Lossio-Ventura, C. Jonquet, M. Roche, and M. Teisseire, Automatic biomedical term polysemy detection, Proceedings of the Tenth International Conference on Language Resources and Evaluation, LREC'2016, pp.1684-1688, 2016.

J. A. Lossio-Ventura, C. Jonquet, M. Roche, and M. Teisseire, Biomedical term extraction: overview and a new methodology, Information Retrieval Journal, vol.14, issue.1, pp.59-99, 2016.
DOI : 10.1109/NLPKE.2010.5587809

URL : https://hal.archives-ouvertes.fr/lirmm-01274539

J. A. Lossio-Ventura, C. Jonquet, M. Roche, and M. Teisseire, A way to automatically enrich biomedical ontologies, Proceedings of the 19th International Conference on Extending Database Technology, EDBT'2016, pp.676-677, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01274540

S. Manandhar, I. P. Klapaftis, D. Dligach, and S. S. Pradhan, SemEval-2010 task 14, Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions, DEW '09, pp.63-68, 2010.
DOI : 10.3115/1621969.1621990

URL : http://dl.acm.org/ft_gateway.cfm?id=1621990&type=pdf

D. McCarthy, M. Apidianaki, and K. Erk, Word Sense Clustering and Clusterability, Computational Linguistics, vol.42, issue.2, pp.245-275, 2016.
DOI : 10.3115/1621474.1621518

URL : https://hal.archives-ouvertes.fr/hal-01838502

G. W. Milligan and M. C. Cooper, An examination of procedures for determining the number of clusters in a data set, Psychometrika, vol.77, issue.2, pp.159-179, 1985.
DOI : 10.1080/00223980.1963.9916640

B. Mirkin, Choosing the number of clusters, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, vol.125, issue.1, pp.252-260, 2011.
DOI : 10.1007/s00214-009-0614-0

R. Navigli, A quick tour of word sense disambiguation, induction and related approaches, SOFSEM 2012: Theory and Practice of Computer Science, pp.115-129, 2012.

R. Navigli and G. Crisafulli, Inducing word senses to improve web search result clustering, Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp.116-126, 2010.

R. Navigli and D. Vannella, Semeval-2013 task 11: Word sense induction and disambiguation within an end-user application, pp.167-174, 2013.

Z. Niu, D. Ji, and C. Tan, I2R, Proceedings of the 4th International Workshop on Semantic Evaluations, SemEval '07, pp.177-182, 2007.
DOI : 10.3115/1621474.1621511

URL : http://dl.acm.org/ft_gateway.cfm?id=1621511&type=pdf

T. Noh, S. Park, and S. Lee, Unsupervised word sense disambiguation in biomedical texts with co-occurrence network and graph kernel, Proceedings of the ACM fourth international workshop on Data and text mining in biomedical informatics, DTMBIO '10, pp.61-64, 2010.
DOI : 10.1145/1871871.1871883

S. V. Pakhomov, G. Finley, R. McEwan, Y. Wang, and G. B. Melton, Corpus domain effects on distributional semantic modeling of medical terms, Bioinformatics, vol.32, issue.23, pp.3635-3644, 2016.
DOI : 10.1093/bioinformatics/bts129

URL : http://europepmc.org/articles/pmc5181540?pdf=render

P. Pantel and D. Lin, Discovering word senses from text, Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '02, pp.613-619, 2002.
DOI : 10.1145/775047.775138

URL : http://www.cse.unsw.edu.au/~qzhang/papers/p31.pdf

T. Pedersen, UMND2, Proceedings of the 4th International Workshop on Semantic Evaluations, SemEval '07, pp.394-397, 2007.
DOI : 10.3115/1621474.1621561

URL : http://dl.acm.org/ft_gateway.cfm?id=1621561&type=pdf

T. Pedersen, Duluth-WSI: SenseClusters applied to the sense induction task of SemEval-2, Proceedings of the 5th International Workshop on Semantic Evaluation, pp.363-366, 2010.

T. Pedersen and R. Bruce, Distinguishing word senses in untagged text, Second Conference on Empirical Methods in Natural Language Processing, EMNLP '97, pp.197-207, 1997.

D. Pinto, P. Rosso, and H. Jimenez-Salazar, UPV-SI, Proceedings of the 4th International Workshop on Semantic Evaluations, SemEval '07, pp.430-433, 2007.
DOI : 10.3115/1621474.1621570

URL : http://dl.acm.org/ft_gateway.cfm?id=1621570&type=pdf

A. Purandare and T. Pedersen, SenseClusters, Demonstration Papers at HLT-NAACL 2004, HLT-NAACL '04, pp.26-29, 2004.
DOI : 10.3115/1614025.1614033

URL : http://dl.acm.org/ft_gateway.cfm?id=1614033&type=pdf

A. Purandare and T. Pedersen, Word sense discrimination by clustering contexts in vector and similarity spaces, Proceedings of the Conference on Computational Natural Language Learning, 2004.

P. J. Rousseeuw, Silhouettes: A graphical aid to the interpretation and validation of cluster analysis, Journal of Computational and Applied Mathematics, vol.20, pp.53-65, 1987.
DOI : 10.1016/0377-0427(87)90125-7

A. K. Sabbir, A. Jimeno-Yepes, and R. Kavuluru, Knowledge-based biomedical word sense disambiguation with neural concept embeddings and distant supervision, 1610.

G. Savova and T. Pedersen, Resolving ambiguities in biomedical text with unsupervised clustering approaches, 2005.

H. Schütze, Dimensions of meaning, Proceedings Supercomputing '92, pp.787-796, 1992.
DOI : 10.1109/SUPERC.1992.236684

H. Schütze, Automatic word sense discrimination, Computational Linguistics, vol.24, issue.1, pp.97-123, 1998.

A. K. Sehgal, P. Srinivasan, and O. Bodenreider, Gene terms and English words: An ambiguous mix, Proc. of the ACM SIGIR Workshop on Search and Discovery for Bioinformatics, 2004.

M. Stevenson and Y. Guo, Disambiguation in the biomedical domain: The role of ambiguity type, Journal of Biomedical Informatics, vol.43, issue.6, pp.972-981, 2010.
DOI : 10.1016/j.jbi.2010.08.009

G. Tang, Y. Xia, J. Sun, M. Zhang, and T. F. Zheng, Statistical word sense aware topic models, Soft Computing, vol.101, issue.suppl. 1, pp.1-15, 2014.
DOI : 10.1109/ICDM.2007.86

Y. W. Teh, M. I. Jordan, M. J. Beal, and D. M. Blei, Hierarchical Dirichlet Processes, Journal of the American Statistical Association, vol.101, issue.476, pp.1566-1581, 2006.
DOI : 10.1198/016214506000000302

G. Udani, S. Dave, A. Davis, and T. Sibley, Noun sense induction using web search results, Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '05, pp.657-658, 2005.
DOI : 10.1145/1076034.1076176

T. Van de Cruys and M. Apidianaki, Latent semantic word sense induction and disambiguation, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp.1476-1485, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00607672

S. van Dongen, Graph clustering by flow simulation, 2000.

J. Véronis, HyperLex: lexical cartography for information retrieval, Computer Speech & Language, vol.18, issue.3, pp.223-252, 2004.
DOI : 10.1016/j.csl.2004.05.002

J. Wang, M. Bansal, K. Gimpel, B. Ziebart, and C. Yu, A sense-topic model for word sense induction with unsupervised data enrichment, Transactions of the Association for Computational Linguistics, vol.3, pp.59-71, 2015.

Y. Wang, K. Zheng, H. Xu, and Q. Mei, Clinical word sense disambiguation with interactive search and classification, AMIA Annual Symposium Proceedings, p.2062, 2016.

D. Widdows and B. Dorow, A graph model for unsupervised lexical acquisition, Proceedings of the 19th international conference on Computational linguistics, pp.1-7, 2002.
DOI : 10.3115/1072228.1072342

H. Xu, M. Markatou, R. Dimova, H. Liu, and C. Friedman, Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues, BMC Bioinformatics, vol.7, issue.1, p.334, 2006.
DOI : 10.1186/1471-2105-7-334

M. Yan, Methods of determining the number of clusters in a data set and a new clustering criterion, 2005.

X. Yao and B. Van Durme, Nonparametric Bayesian word sense induction, Proceedings of TextGraphs-6: Graph-based Methods for Natural Language Processing, pp.10-14, 2011.

H. Yu, Z. Liu, and G. Wang, An automatic method to determine the number of clusters using decision-theoretic rough set, International Journal of Approximate Reasoning, vol.55, issue.1, pp.101-115, 2014.
DOI : 10.1016/j.ijar.2013.03.018

X. Zhu, J. Fan, D. M. Baorto, C. Weng, and J. J. Cimino, A review of auditing methods applied to the content of controlled biomedical terminologies, Journal of Biomedical Informatics, vol.42, issue.3, pp.413-425, 2009.
DOI : 10.1016/j.jbi.2009.03.003