C. Ieva, A. Gotlieb, S. Kaci, and N. Lazaar, Discovering program topoi through clustering, Proceedings of the Thirty-Second IAAI Conference on Innovative Applications of Artificial Intelligence, 2018.
URL : https://hal.archives-ouvertes.fr/lirmm-01790874

C. Ieva, A. Gotlieb, S. Kaci, and N. Lazaar, Discovering program topoi via hierarchical agglomerative clustering, IEEE Transactions on Reliability, vol.67, issue.3, pp.758-770, 2018.

A. Clauset, M. E. J. Newman, and C. Moore, Finding community structure in very large networks, Physical Review E, vol.70, 2004.

R. Di Cosmo and S. Zacchiroli, Software heritage: Why and how to preserve software source code, Proc. of the 14th International Conference on Digital Preservation (iPRES'17), 2017.
URL : https://hal.archives-ouvertes.fr/hal-01590958

W. Zhao, L. Zhang, Y. Liu, J. Sun, and F. Yang, SNIAFL: towards a static non-interactive approach to feature location, Proc. of the 26th International Conference on Software Engineering (ICSE'04), pp.293-303, 2004.

A. Kuhn, S. Ducasse, and T. Gîrba, Semantic clustering: Identifying topics in source code, Information and Software Technology, vol.49, issue.3, pp.230-243, 2007.

E. Linstead, P. Rigor, S. Bajracharya, C. Lopes, and P. Baldi, Mining concepts from code with probabilistic topic models, Proc. of the IEEE Automated Software Engineering Conference (ASE'07), p.461, 2007.

H. Dumitru, M. Gibiec, N. Hariri, J. Cleland-Huang, B. Mobasher et al., On-demand feature recommendations derived from mining public product descriptions, Proc. of the IEEE International Conference on Software Engineering (ICSE'11), pp.181-190, 2011.

C. McMillan, N. Hariri, D. Poshyvanyk, and J. Cleland-Huang, Recommending Source Code for Use in Rapid Software Prototypes, Proc. of the IEEE International Conference on Software Engineering (ICSE'12), pp.848-858, 2012.

S. Grant, J. R. Cordy, and D. B. Skillicorn, Using heuristics to estimate an appropriate number of latent topics in source code analysis, Science of Computer Programming, vol.78, issue.9, pp.1663-1678, 2013.

S. L. Abebe and P. Tonella, Extraction of domain concepts from the source code, Science of Computer Programming, vol.98, pp.680-706, 2015.

J. Rubin and M. Chechik, A survey of feature location techniques, Domain Engineering, pp.29-58, 2013.

P. W. McBurney, C. Liu, and C. McMillan, Automated feature discovery via sentence selection and source code summarization, Journal of Software: Evolution and Process, vol.28, issue.2, pp.120-145, 2016.

U. Alon, M. Zilberstein, O. Levy, and E. Yahav, code2vec: Learning distributed representations of code, CoRR, 2018.

L. Kaufman and P. J. Rousseeuw, Finding Groups in Data: An Introduction to Cluster Analysis, 1990.

P. W. Foltz, W. Kintsch, and T. K. Landauer, The Measurement of Textual Coherence with Latent Semantic Analysis, Discourse Processes, vol.25, pp.285-307, 1998.

K. Chen and V. Rajlich, Case study of feature location using dependence graph, Proc. of the 8th International Workshop on Program Comprehension (IWPC'00), pp.241-247, 2000.

A. Marcus and S. Haiduc, Text Retrieval Approaches for Concept Location in Source Code, pp.126-158, 2013.