F. Bouillot, P. Poncelet, and M. Roche, Mesurer la proximité entre corpus par de nouveaux méta-descripteurs, in Proceedings of the Conférence francophone en Recherche d'Information et Applications, to appear, 2015.

F. Bouillot, P. Poncelet, and M. Roche, Classification of Small Datasets: Why Using Class-Based Weighting Measures?, Proceedings of the 21st International Symposium on Methodologies for Intelligent Systems, 2014.
DOI : 10.1007/978-3-319-08326-1_35

URL : https://hal.archives-ouvertes.fr/lirmm-01054900

F. Bouillot, P. Poncelet, and M. Roche, De nouvelles pondérations adaptées à la classification de petits volumes de données textuelles, Actes des 14ièmes Journées Francophone "Extraction et Gestion des Connaissances", 2014.

F. Bouillot, P. Hai, N. Béchet, S. Bringay, D. Ienco et al., How to Extract Relevant Knowledge from Tweets?, Springer CCIS (Communications in Computer and Information Science), post proceedings of ISIP 2012 (International Workshop on Information Search, Integration and Personalization), 2013.
DOI : 10.1007/978-3-642-40140-4_12

URL : https://hal.archives-ouvertes.fr/lirmm-00798662

F. Bouillot, O. Gout, P. Magnier, C. Pénin, P. Poncelet et al., Vers un outil de cartographie : qui est l'expert ?, Actes des 13ièmes Journées Francophone "Extraction et Gestion des Connaissances", demo paper, 2013.
URL : https://hal.archives-ouvertes.fr/lirmm-00798073

F. Bouillot, P. Poncelet, M. Roche, D. Ienco, S. Matwin et al., French presidential elections, Proceedings of the first edition workshop on Politics, elections and data, PLEAD '12, 2012.
DOI : 10.1145/2389661.2389669

URL : https://hal.archives-ouvertes.fr/lirmm-00801028

F. Bouillot, P. Poncelet, and M. Roche, How and Why Exploit Tweet's Location Information ?, Proceedings of the 15th International Conference on Geographic Information Science (AGILE'12), 2012.

S. Bringay, N. Béchet, F. Bouillot, P. Poncelet, M. Roche et al., Towards an On-Line Analysis of Tweets Processing, Proceedings of the 22nd International Conference on Database and Expert Systems Applications, 2011.

URL : https://hal.archives-ouvertes.fr/hal-00636285

S. Bringay, N. Béchet, F. Bouillot, P. Poncelet, M. Roche et al., Analyse de gazouillis en ligne, Actes 7ièmes Journées Francophones sur les Entrepôts de Données et l'Analyse en ligne, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00828003

A. Aizawa, An information-theoretic perspective of tf-idf measures, Inf. Process. Manage., vol.39, Jan., pp.45-65, 2003.

S. M. Ali and S. D. Silvey, A General Class of Coefficients of Divergence of One Distribution from Another, Journal of the Royal Statistical Society. Series B, pp.131-142, 1966.

S. Ali and K. A. Smith-Miles, A meta-learning approach to automatic kernel selection for support vector machines, Neurocomputing, vol.70, issue.1-3, pp.173-186, 2006.
DOI : 10.1016/j.neucom.2006.03.004

S. Ali and K. Smith-Miles, On optimal degree selection for polynomial kernel with support vector machines : theoretical and empirical investigations, Int. J. Know.-Based Intell. Eng. Syst., issue.11, pp.1-18, 2007.

S. Ali and K. A. Smith, Matching SVM kernel's suitability to data characteristics using tree by fuzzy c-means clustering, in Design and application of hybrid intelligent systems, pp.553-562, 2003.

S. Ali and K. A. Smith, On learning algorithm selection for classification, Applied Soft Computing, vol.6, issue.2, pp.119-138, 2006.
DOI : 10.1016/j.asoc.2004.12.002

R. Basili, M. Cammisa, and A. Moschitti, A semantic kernel to classify texts with very few training examples, Informatica (Slovenia), vol.30, pp.163-172, 2006.

J. Baxter, A Bayesian/information theoretic model of learning to learn via multiple task sampling, Machine Learning, vol.28, issue.1, pp.7-39, 1997.
DOI : 10.1023/A:1007327622663

H. Bensusan and C. Giraud-Carrier, Casa Batlló is in Passeig de Gràcia or how landmark performances can describe tasks, in Proceedings of the ECML-00 Workshop on Meta-Learning : Building Automatic Advice Strategies for Model Selection and Method Combination, pp.29-46, 2000.

H. Bensusan, God doesn't always shave with Occam's razor - Learning when and how to prune, Proceedings of the 10th European Conference on Machine Learning, pp.119-124, 1998.
DOI : 10.1007/BFb0026680

H. Bensusan and A. Kalousis, Estimating the predictive accuracy of a classifier, in Proceedings of the 12th European Conference on Machine Learning, pp.25-36, 2001.

H. Berrer, I. Paterson, and J. Keller, Evaluation of machine-learning algorithm ranking advisors, Proceedings of the PKDD-2000 Workshop on Data Mining, Decision Support, Meta-Learning and ILP : Forum for Practical Problem Presentation and Prospective Solutions, 2000.

N. Bhatt, A. Thakkar, and A. Ganatra, A survey and current research challenges in meta learning approaches based on dataset characteristics, International Journal of Soft Computing and Engineering, issue.2, pp.234-247, 2012.

N. Bhatt, A. Thakkar, A. Ganatra, and N. Bhatt, Ranking of Classifiers based on Dataset Characteristics using Active Meta Learning, International Journal of Computer Applications, vol.69, issue.20, pp.31-36, 2013.
DOI : 10.5120/12089-8269

P. B. Brazdil, C. Soares, and J. P. da Costa, Ranking learning algorithms : using IBL and meta-learning on accuracy and time results, Machine Learning, vol.50, issue.3, pp.251-277, 2003.
DOI : 10.1023/A:1021713901879

P. Brazdil, C. G. Giraud-Carrier, C. Soares, and R. Vilalta, Metalearning - Applications to Data Mining, 2009.

C. E. Brodley, Recursive automatic bias selection for classifier construction, Machine Learning, vol.20, issue.1-2, pp.63-94, 1995.
DOI : 10.1007/BF00993475

C. E. Brodley and P. Smyth, Applying classification algorithms in practice, Statistics and Computing, pp.45-56, 1997.

C. Buckley, Automatic query expansion using SMART : TREC 3, Proceedings of the Third Text REtrieval Conference, pp.69-80, 1994.

Z. Cataltepe and E. Aygun, An improvement of centroid-based classification algorithm for text classification, in Data Engineering Workshop, IEEE 23rd International Conference on, pp.952-956, 2007.

A. Cayci, S. Eibe, E. Menasalvas, and Y. Saygin, Bayesian Networks to Predict Data Mining Algorithm Behavior in Ubiquitous Computing Environments, in Proceedings of the 2010 International Conference on Analysis of Social Media and Ubiquitous Data, pp.119-141, 2011.
DOI : 10.1007/978-3-642-23599-3_7

S. Cha, Comprehensive survey on distance/similarity measures between probability density functions, International Journal of Mathematical Models and Methods in Applied Sciences, issue.1, pp.300-307, 2007.

O. Chapelle, B. Schölkopf, and A. Zien, Semi-supervised learning, 2006.
DOI : 10.7551/mitpress/9780262033589.001.0001

O. Chapelle, V. Vapnik, O. Bousquet, and S. Mukherjee, Choosing multiple parameters for support vector machines, Machine Learning, pp.131-159, 2002.

M. Chen, X. Jin, and D. Shen, Short text classification improved by learning multi-granularity topics, in Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence - Volume Three, pp.1776-1781, 2011.

W. T. Chuang, A. Tiyyagura, J. Yang, and G. Giuffrida, A Fast Algorithm for Hierarchical Text Classification, DaWaK 2000 : Proceedings of the Second International Conference on Data Warehousing and Knowledge Discovery, pp.409-418, 2000.
DOI : 10.1007/3-540-44466-1_41

N. Cristianini and J. Shawe-Taylor, An Introduction to Support Vector Machines : And Other Kernel-based Learning Methods, 2000.
DOI : 10.1017/CBO9780511801389

S. Deerwester, S. T. Dumais, G. W. Furnas, T. K. Landauer, and R. Harshman, Indexing by latent semantic analysis, Journal of the American Society for Information Science, vol.41, issue.6, pp.391-407, 1990.
DOI : 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9

Z. Deng, S. Tang, D. Yang, M. Zhang, X. Wu et al., A Linear Text Classification Algorithm Based on Category Relevance Factors, in Digital Libraries : People, Knowledge, and Technology, vol.2555, p.88, 2002.
DOI : 10.1007/3-540-36227-4_9

R. Engels and C. Theusinger, Using a data metric for preprocessing advice for data mining applications, in ECAI, pp.430-434, 1998.

J. R. Firth, A synopsis of linguistic theory 1930-55, in Studies in Linguistic Analysis (special volume of the Philological Society), vol.1952-59, 1957.

G. Forman and I. Cohen, Learning from Little: Comparison of Classifiers Given Little Training, Proceedings of the 8th European Conference on Principles and Practice of Knowledge Discovery in Databases, pp.161-172, 2004.
DOI : 10.1007/978-3-540-30116-5_17

J. V. Frasch, A. Lodwich, F. Shafait, and T. M. Breuel, A Bayes-true data generator for evaluation of supervised and unsupervised learning methods, Pattern Recogn., pp.1523-1531, 2011.

K. Furdík, J. Paralič, and G. Tutoky, Meta-learning method for automatic selection of algorithms for text classification, Proc. of the Central European Conference on Information and Intelligent Systems (CECIIS 2008), pp.24-26, 2008.

J. Gama and P. Brazdil, Characterization of classification algorithms, Proceedings of the 7th Portuguese Conference on Artificial Intelligence : Progress in Artificial Intelligence, pp.189-200, 1995.
DOI : 10.1007/3-540-60428-6_16

C. Giraud-Carrier, R. Vilalta, and P. Brazdil, Introduction to the Special Issue on Meta-Learning, Machine Learning, vol.54, pp.187-193, 2004.
DOI : 10.1023/B:MACH.0000015878.60765.42

D. E. Goldberg, Genetic Algorithms in Search, Optimization and Machine Learning, 1st ed., 1989.

T. A. Gomes, R. B. Prudêncio, C. Soares, A. L. Rossi, and A. C. Carvalho, Combining meta-learning and search techniques to select parameters for support vector machines, Neurocomputing, vol.75, issue.1, pp.75-78, 2012.
DOI : 10.1016/j.neucom.2011.07.005

H. Guan, J. Zhou, and M. Guo, A class-feature-centroid classifier for text categorization, Proceedings of the 18th international conference on World wide web, WWW '09, pp.201-210, 2009.
DOI : 10.1145/1526709.1526737

E. Han and G. Karypis, Centroid-Based Document Classification: Analysis and Experimental Results, in Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery, pp.424-431, 2000.
DOI : 10.1007/3-540-45372-5_46

E. Han, G. Karypis, and V. Kumar, Text Categorization Using Weight Adjusted k-Nearest Neighbor Classification, Proceedings of the 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp.53-65, 2001.
DOI : 10.1007/3-540-45357-1_9

T. Hertz, A. B. Hillel, and D. Weinshall, Learning a kernel function for classification with small training samples, Proceedings of the 23rd international conference on Machine learning , ICML '06, pp.401-408, 2006.
DOI : 10.1145/1143844.1143895

M. Hilario, P. Nguyen, H. Do, A. Woznica, and A. Kalousis, Ontology-based meta-mining of knowledge discovery workflows, in Meta-Learning in Computational Intelligence, pp.273-315, 2011.

X. Hu, N. Sun, C. Zhang, and T. Chua, Exploiting internal and external semantics for the clustering of short texts using world knowledge, Proceeding of the 18th ACM conference on Information and knowledge management, CIKM '09, pp.919-928, 2009.
DOI : 10.1145/1645953.1646071

D. Hull, Improving Text Retrieval for the Routing Problem using Latent Semantic Indexing, Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp.282-291, 1994.
DOI : 10.1007/978-1-4471-2099-5_29

D. J. Ittner, D. D. Lewis, and D. D. Ahn, Text categorization of low quality images, Proceedings of SDAIR-95, 4th Annual Symposium on Document Analysis and Information Retrieval, pp.301-315, 1995.

N. Jankowski, W. Duch, and K. Grabczewski, Meta-Learning in Computational Intelligence, p.358, 2011.
DOI : 10.1007/978-3-642-20980-2

T. Joachims, A probabilistic analysis of the Rocchio algorithm with TFIDF for text categorization, in Proceedings of the Fourteenth International Conference on Machine Learning, pp.143-151, 1997.

T. Joachims, Text categorization with support vector machines : learning with many relevant features, in Proceedings of the 10th European Conference on Machine Learning, pp.137-142, 1998.

K. S. Jones, S. Walker, and S. E. Robertson, A probabilistic model of information retrieval: development and comparative experiments, Information Processing & Management, vol.36, issue.6, pp.779-808, 2000.
DOI : 10.1016/S0306-4573(00)00015-7

K. S. Jones, A STATISTICAL INTERPRETATION OF TERM SPECIFICITY AND ITS APPLICATION IN RETRIEVAL, Journal of Documentation, vol.28, issue.1, pp.11-21, 1972.
DOI : 10.1108/eb026526

A. Kalousis, J. Gama, and M. Hilario, On Data and Algorithms: Understanding Inductive Performance, Machine Learning, vol.54, issue.3, pp.275-312, 2004.
DOI : 10.1023/B:MACH.0000015882.38031.85

A. Kalousis and M. Hilario, Feature Selection for Meta-learning, Proceedings of the 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp.222-233, 2001.
DOI : 10.1007/3-540-45357-1_26

A. Kalousis and T. Theoharis, Noemon : design, implementation and performance results of an intelligent assistant for classifier selection, Intelligent Data Analysis, pp.319-337, 1999.

A. M. Kaptein, Meta-classifier approach to reliable text classification, technical report, 2005.

C. Köpf, C. Taylor, and J. Keller, Meta-analysis : from data characterisation for meta-learning to meta-regression, in Proceedings of the PKDD-00 workshop on data mining, decision support, meta-learning and ILP, Citeseer, 2000.

P. Kuba, P. Brazdil, C. Soares, and A. Woznica, Exploiting sampling and meta-learning for parameter setting for support vector machines, Proc. of Workshop Learning and Data Mining associated with Iberamia 2002, VIII Iberoamerican Conference on Artificial Intelligence, pp.209-216, 2002.

W. Lam and K. Lai, A meta-learning approach for text categorization, Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '01, pp.303-309, 2001.
DOI : 10.1145/383952.384011

M. Lan, S. Sung, H. Low, and C. Tan, A comparative study on term weighting schemes for text categorization, in Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on, pp.546-551, 2005.

M. Lan, C. L. Tan, J. Su, and Y. Lu, Supervised and traditional term weighting methods for automatic text categorization, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, p.721, 2009.

R. Leite and P. Brazdil, Improving progressive sampling via meta-learning on learning curves, Machine Learning : ECML 2004, vol.3201, pp.250-261, 2004.

R. Leite and P. Brazdil, Active testing strategy to predict the best classification algorithm via sampling and metalearning, in Proceedings of the 2010 Conference on ECAI 2010 : 19th European Conference on Artificial Intelligence, pp.309-314, 2010.

R. Leite, P. Brazdil, and J. Vanschoren, Selecting Classification Algorithms with Active Testing, Proceedings of the 8th International Conference on Machine Learning and Data Mining in Pattern Recognition, pp.117-131, 2012.
DOI : 10.1007/978-3-642-31537-4_10

E. Leopold and J. Kindermann, Text categorization with support vector machines. How to represent texts in input space ?, Machine Learning, vol.46, issue.1/3, pp.423-444, 2002.
DOI : 10.1023/A:1012491419635

V. Lertnattee and C. Leuviphan, Using class frequency for improving centroid-based text classification, ACEEE International Journal on Information Technology, issue.2, 2012.

V. Lertnattee and T. Theeramunkong, Combining homogeneous classifiers for centroid-based text classification, Proceedings ISCC 2002 Seventh International Symposium on Computers and Communications, pp.1034-1039, 2002.
DOI : 10.1109/ISCC.2002.1021799

V. Lertnattee and T. Theeramunkong, Effect of term distributions on centroid-based text categorization, Information Sciences, vol.158, pp.89-115, 2004.
DOI : 10.1016/j.ins.2003.07.007

D. D. Lewis, Naive (Bayes) at forty: The independence assumption in information retrieval, in Proceedings of the 10th European Conference on Machine Learning, pp.4-15, 1998.
DOI : 10.1007/BFb0026666

F. Lin and W. W. Cohen, Semi-Supervised Classification of Network Data Using Very Few Labels, 2010 International Conference on Advances in Social Networks Analysis and Mining, pp.192-199, 2010.
DOI : 10.1109/ASONAM.2010.19

Y. Lin, J. Jiang, and S. Lee, A Similarity Measure for Text Classification and Clustering, IEEE Transactions on Knowledge and Data Engineering, vol.26, issue.7, pp.99-100, 2013.
DOI : 10.1109/TKDE.2013.19

G. Linden, B. Smith, and J. York, Amazon.com recommendations : item-to-item collaborative filtering, IEEE Internet Computing, pp.76-80, 2003.

C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval, 2008.
DOI : 10.1017/CBO9780511809071

S. Martin, J. Liermann, and H. Ney, Algorithms for bigram and trigram word clustering, Speech Commun., vol.24, Apr., pp.19-37, 1998.

J. Martineau, T. Finin, A. Joshi, and S. Patel, Improving binary classification on text problems using differential word features, Proceeding of the 18th ACM conference on Information and knowledge management, CIKM '09, pp.2019-2024, 2009.
DOI : 10.1145/1645953.1646291

A. McCallum and K. Nigam, A comparison of event models for naive Bayes text classification, in AAAI-98 workshop on learning for text categorization, pp.41-48, 1998.

C. J. Merz, Dynamical selection of learning algorithms, in Learning from Data, pp.281-290, 1996.

G. A. Miller, WordNet: a lexical database for English, Communications of the ACM, vol.38, issue.11, pp.39-41, 1995.
DOI : 10.1145/219717.219748

P. B. De-miranda, R. B. Prudêncio, A. C. De-carvalho, and C. Soares, Combining a multi-objective optimization approach with meta-learning for SVM parameter selection, 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp.2909-2914, 2012.
DOI : 10.1109/ICSMC.2012.6378235

M. M. Molina, J. M. Luna, C. Romero, and S. Ventura, Meta-learning approach for automatic parameter tuning : a case study with educational datasets, Proceedings of the 5th International Conference on Educational Data Mining, pp.180-183, 2012.

S. Moran, Y. He, and K. Liu, Choosing the best Bayesian classifier : an empirical study, IAENG International Journal of Computer Science, vol.36, pp.322-331, 2009.

T. Mori, Information gain ratio as term weight, Proceedings of the 19th international conference on Computational linguistics, pp.1-7, 2002.
DOI : 10.3115/1072228.1072246

A. Moschitti and R. Basili, Complex Linguistic Features for Text Classification: A Comprehensive Study, in Proceedings of the 26th European Conference on Information Retrieval, pp.181-196, 2004.
DOI : 10.1007/978-3-540-24752-4_14

D. Nakache and E. Metais, Evaluation : nouvelle approche avec juges, in INFORSID'05 XXIIIe congrès, pp.555-570, 2005.

P. Nguyen, J. Wang, M. Hilario, and A. Kalousis, Learning heterogeneous similarity measures for hybrid-recommendations in meta-mining, CoRR, abs/1210.

R. Pavón, F. Díaz, R. Laza, and V. Luzón, Automatic parameter tuning with a Bayesian case-based reasoning system. A case of study, Expert Systems with Applications, vol.36, issue.2, pp.3407-3420, 2009.
DOI : 10.1016/j.eswa.2008.02.044

M. Pechenizkiy, Data mining strategy selection via empirical and constructive induction, in Databases and Applications, pp.59-64, 2005.

X. Phan, L. Nguyen, and S. Horiguchi, Learning to classify short and sparse text & web with hidden topics from large-scale data collections, Proceeding of the 17th international conference on World Wide Web , WWW '08, pp.91-100, 2008.
DOI : 10.1145/1367497.1367510

J. C. Platt, Fast training of support vector machines using sequential minimal optimization, in Advances in kernel methods, pp.185-208, 1999.

R. B. Prudêncio and T. B. Ludermir, Selective generation of training examples in active meta-learning, International Journal of Hybrid Intelligent Systems, vol.5, issue.2, pp.59-70, 2008.
DOI : 10.3233/HIS-2008-5202

R. B. Prudêncio and T. B. Ludermir, Combining Uncertainty Sampling methods for supporting the generation of meta-examples, Information Sciences, vol.196, pp.1-14, 2012.
DOI : 10.1016/j.ins.2012.02.003

R. B. Prudêncio, T. B. Ludermir, and F. de A. T. de Carvalho, A Modal Symbolic Classifier for selecting time series models, Pattern Recognition Letters, vol.25, issue.8, pp.911-921, 2004.
DOI : 10.1016/j.patrec.2004.02.004

R. B. Prudêncio and T. B. Ludermir, Combining uncertainty sampling methods for active meta-learning, in ISDA, pp.220-225, 2009.

J. R. Quinlan, C4.5 : programs for machine learning, 1993.

M. Reif, A comprehensive dataset for evaluating approaches of various meta-learning tasks, in Proceedings of the First International Conference on Pattern Recognition Applications and Methods, 2012.

M. Reif, F. Shafait, and A. Dengel, Meta-learning for evolutionary parameter optimization of classifiers, Machine Learning, vol.4, issue.1, pp.357-380, 2012.
DOI : 10.1007/s10994-012-5286-7

P. Resnick, N. Iacovou, M. Suchak, P. Bergstrom, and J. Riedl, GroupLens, Proceedings of the 1994 ACM conference on Computer supported cooperative work, CSCW '94, pp.175-186, 1994.
DOI : 10.1145/192844.192905

J. R. Rice, The Algorithm Selection Problem, pp.65-118, 1976.
DOI : 10.1016/S0065-2458(08)60520-3

S. E. Robertson and K. S. Jones, Relevance weighting of search terms, Journal of the American Society for Information Science, vol.27, issue.3, pp.129-146, 1976.
DOI : 10.1002/asi.4630270302

S. Robertson, Understanding inverse document frequency: on theoretical arguments for IDF, Journal of Documentation, vol.60, issue.5, 2004.
DOI : 10.1108/00220410410560582

J. J. Rocchio, Relevance feedback in information retrieval, in The Smart retrieval system - experiments in automatic document processing, pp.313-323, 1971.

D. J. Rogers and T. T. Tanimoto, A Computer Program for Classifying Plants, Science, vol.132, Oct., pp.1115-1118, 1960.

G. Salton, The SMART Retrieval System. Experiments in Automatic Document Processing, 1971.

G. Salton, A. Wong, and C. S. Yang, A vector space model for automatic indexing, Communications of the ACM, vol.18, issue.11, pp.613-620, 1975.
DOI : 10.1145/361219.361220

G. Salton, Automatic Text Processing : The Transformation, Analysis, and Retrieval of Information by Computer, 1989.

G. Salton and C. Buckley, Term-weighting approaches in automatic text retrieval, Information Processing & Management, pp.513-523, 1988.

G. Salton and M. J. McGill, Introduction to Modern Information Retrieval, 1986.

C. Schaffer, A Conservation Law for Generalization Performance, pp.259-265, 1994.
DOI : 10.1016/B978-1-55860-335-6.50039-8

K. Schneider, Techniques for Improving the Performance of Naive Bayes for Text Classification, in Proceedings of the 6th International Conference on Computational Linguistics and Intelligent Text Processing, pp.682-693, 2005.
DOI : 10.1007/978-3-540-30586-6_76

S. Segrera, J. Pinho, and M. Moreno, Information-theoretic measures for meta-learning, in Hybrid Artificial Intelligence Systems, pp.458-465, 2008.

F. Serban, J. Vanschoren, J. Kietz, and A. Bernstein, A survey of intelligent assistants for data analysis, ACM Computing Surveys, vol.45, issue.3, article 31, pp.1-35.
DOI : 10.1145/2480741.2480748

S. Shankar and G. Karypis, Weight adjustment schemes for a centroid based classifier, 2000.

C. E. Shannon, A mathematical theory of communication, ACM SIGMOBILE Mobile Computing and Communications Review, issue.5, pp.3-55, 2001.

J. W. Shavlik, R. J. Mooney, and G. G. Towell, Symbolic and neural learning algorithms : an experimental comparison, Machine Learning, pp.111-143, 1991.

Z. Shevked and L. Dakovski, Learning and classification with prime implicants applied to medical data diagnosis, Proceedings of the 2007 international conference on Computer systems and technologies , CompSysTech '07, pp.1-103, 2007.
DOI : 10.1145/1330598.1330708

A. F. Smeaton, Using NLP or NLP Resources for Information Retrieval Tasks, Natural Language Information Retrieval, pp.99-111, 1997.
DOI : 10.1007/978-94-017-2388-6_4

C. Soares and P. B. Brazdil, Selecting parameters of SVM using meta-learning and kernel matrix-based meta-features, in Proceedings of the 2006 ACM Symposium on Applied Computing, pp.564-568, 2006.

C. Soares, P. B. Brazdil, and P. Kuba, A Meta-Learning Method to Select the Kernel Width in Support Vector Regression, Machine Learning, vol.54, pp.195-209, 2004.
DOI : 10.1023/B:MACH.0000015879.28004.9b

C. Soares and P. Brazdil, Zoomed Ranking: Selection of Classification Algorithms Based on Relevant Performance Information, in Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery, pp.126-135, 2000.
DOI : 10.1007/3-540-45372-5_13

S. Y. Sohn, Meta analysis of classification algorithms for pattern recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.21, issue.11, pp.1137-1144, 1999.
DOI : 10.1109/34.809107

P. Soucy and G. W. Mineau, Beyond TFIDF weighting for text categorization in the vector space model, in Proceedings of the 19th International Joint Conference on Artificial Intelligence, pp.1130-1135, 2005.

M. C. De-souto, R. B. Prudêncio, R. G. Soares, D. S. De-araujo, I. G. Costa et al., Ranking and selecting clustering algorithms using a meta-learning approach, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), pp.3729-3735, 2008.
DOI : 10.1109/IJCNN.2008.4634333

M. Spiliopoulou, A. Kalousis, L. C. Faulstich, and T. Theoharis, Noemon : an intelligent assistant for classifier selection, pp.90-97, 1998.

B. Srivastava and A. Mediratta, Domain-dependent parameter selection of search-based algorithms compatible with user performance criteria, in AAAI, pp.1386-1391, 2005.

A. Sun, Short text classification using very few words, Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval, SIGIR '12, pp.1145-1146, 2012.
DOI : 10.1145/2348283.2348511

Q. Sun and B. Pfahringer, Pairwise meta-rules for better meta-learning-based algorithm ranking, Machine Learning, vol.93, pp.141-161, 2013.

V. Tam, A. Santoso, and R. Setiono, A comparative study of centroid-based, neighborhood-based and statistical approaches for effective document categorization, in Proceedings of the 16th International Conference on Pattern Recognition (ICPR'02), vol.4, pp.235-238, 2002.

S. Tan, Large margin DragPushing strategy for centroid text categorization, Expert Systems with Applications, vol.33, July, pp.215-220, 2007.

S. Tan, An improved centroid classifier for text categorization, Expert Systems with Applications, pp.279-285, 2008.

D. M. Tax and R. P. Duin, Characterizing one-class datasets, Proceedings of the Sixteenth Annual Symposium of the Pattern Recognition Association of South Africa, pp.21-26, 2005.

T. Theeramunkong and V. Lertnattee, Improving centroid-based text classification using term-distribution-based weighting system and clustering, in Proceedings of ISCIT-01, 2nd International Symposium on Communication and Information Technology, pp.1167-1182, 2001.

L. Todorovski, P. Brazdil, and C. Soares, Report on the experiments with feature selection in meta-level learning, in Proceedings of the PKDD-00 workshop on data mining, decision support, meta-learning and ILP : forum for practical problem presentation and prospective solutions, 2000.

P. D. Turney, Similarity of semantic relations, Comput. Linguist., vol.32, Sept., pp.379-416, 2006.

P. D. Turney and M. L. Littman, Measuring praise and criticism, ACM Transactions on Information Systems, vol.21, issue.4, pp.315-346, 2003.
DOI : 10.1145/944012.944013

P. D. Turney and P. Pantel, From frequency to meaning : vector space models of semantics, J. Artif. Int. Res., vol.37, pp.141-188, 2010.

R. Vilalta and Y. Drissi, A perspective view and survey of meta-learning, Artificial Intelligence Review, vol.18, issue.2, pp.77-95, 2002.
DOI : 10.1023/A:1019956318069

E. M. Voorhees, Using WordNet to disambiguate word senses for text retrieval, Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval , SIGIR '93, pp.171-180, 1993.
DOI : 10.1145/160688.160715

M. Wajeed and T. Adilakshmi, Different similarity measures for text classification using KNN, 2011 2nd International Conference on Computer and Communication Technology (ICCCT-2011), pp.41-45, 2011.
DOI : 10.1109/ICCCT.2011.6075188

S. M. Weiss and I. Kapouleas, An empirical comparison of pattern recognition, neural nets, and machine learning classification methods, in Proceedings of the 11th International Joint Conference on Artificial Intelligence, pp.781-787, 1989.

S. M. Weiss and C. A. Kulikowski, Computer Systems That Learn : Classification and Prediction Methods from Statistics, Neural Nets, Machine Learning, and Expert Systems, 1991.

E. B. Wilson, Probable Inference, the Law of Succession, and Statistical Inference, Journal of the American Statistical Association, vol.22, issue.158, pp.209-212, 1927.
DOI : 10.1080/01621459.1927.10502953

I. H. Witten and E. Frank, Data mining, ACM SIGMOD Record, vol.31, issue.1, 2000.
DOI : 10.1145/507338.507355

D. H. Wolpert, The Supervised Learning No-Free-Lunch Theorems, Proc. 6th Online World Conference on Soft Computing in Industrial Applications, pp.25-42, 2001.
DOI : 10.1007/978-1-4471-0123-9_3

D. H. Wolpert and W. G. Macready, No free lunch theorems for search, 1995.

D. H. Wolpert and W. G. Macready, No free lunch theorems for optimization, IEEE Transactions on Evolutionary Computation, issue.1, pp.67-82, 1997.

H. Wu and G. Salton, A comparison of search term weighting : term relevance vs. inverse document frequency, in Proceedings of the 4th Annual International ACM SIGIR Conference on Information Storage and Retrieval : Theoretical Issues in Information Retrieval, pp.30-39, 1981.

Y. Wu and D. W. Oard, Bilingual topic aspect classification with a few training examples, Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '08, pp.203-210, 2008.
DOI : 10.1145/1390334.1390371

J. Xu and W. B. Croft, Corpus-based stemming using cooccurrence of word variants, ACM Transactions on Information Systems, vol.16, issue.1, pp.61-81, 1998.
DOI : 10.1145/267954.267957

Y. Yang and J. O. Pedersen, A comparative study on feature selection in text categorization, in Proceedings of the Fourteenth International Conference on Machine Learning, pp.412-420, 1997.

S. Zelikovitz, W. W. Cohen, and H. Hirsh, Extending WHIRL with background knowledge for improved text classification, Information Retrieval, vol.10, issue.1, pp.35-67, 2007.
DOI : 10.1007/s10791-006-9004-6

X. Zhang, T. Wang, X. Liang, F. Ao, and Y. Li, A class-based feature weighting method for text classification, Journal of Computational Information Systems, issue.8, pp.965-972, 2012.

X. Zhu, Semi-supervised learning literature survey, p.10, 2005.

J. Zobel and A. Moffat, Exploring the similarity space, pp.18-34, 1998.
DOI : 10.1145/281250.281256