Generalizing the Wilcoxon rank-sum test for interval data

Julien Perolat; Ines Couso; Kevin Loquin; Olivier Strauss

doi:10.1016/j.ijar.2014.08.001

Article Dans Une Revue International Journal of Approximate Reasoning Année : 2015

Generalizing the Wilcoxon rank-sum test for interval data

(1) , (2) , (3) , (4)

1
2
3
4

Julien Perolat

Fonction : Auteur
PersonId : 14403
IdHAL : julien-perolat
IdRef : 137378181

Sequential Learning

Ines Couso

Fonction : Auteur
PersonId : 1235140
ORCID : 0000-0002-1675-6203

Universidad de Oviedo = University of Oviedo

Kevin Loquin

Fonction : Auteur
PersonId : 836321

Laboratoire Traitement et Communication de l'Information

Olivier Strauss

Fonction : Auteur
PersonId : 21713
IdHAL : olivier-strauss
ORCID : 0000-0003-4485-772X
IdRef : 068902891

Image & Interaction

Résumé

Here we propose an adaption of Wilcoxon's two-sample rank-sum test to interval data. This adaption is interval-valued: it computes the minimum and maximum values of the statistic when we rank the set of all feasible samples (all joint samples compatible with the initial set-valued information). We prove that these bounds can be explicitly computed using a very low computational cost algorithm. Interpreting this generalized test is straightforward: if the obtained interval-valued p-value is on one side of the significance level, we will be able to make a decision (reject/no reject). Otherwise, we will conclude that our information is too vague to lead to a clear decision. Our method is also applicable to quantized data: in the presence of quantized information, the joint sample may contain a high proportion of draws, which can prevent the test from drawing a clear conclusion. According to the usual convention, when there are ties, the ranks for the observations in a tie are taken to be the average of the ranks for those observations. This convention can lead to wrong conclusions. Here, we consider the family of all possible rank permutations, such that a sample containing ties will not just be associated with a single value, but rather with a collection of values for the Wilcoxon's rank-sum statistic, with each one of them being associated with a different p-value. When the impact of quantization is too high to lead to a clear decision, our test provides an interval-valued p-value that includes the chosen significance level. It indicates that there is no clear conclusion according to this test. Two different experiments exemplify the properties of the generalized test: the first one illustrates its ability to avoid wrong decisions in the presence of quantized data. The second one shows the performance of the generalized test when used with interval data.

Mots clés

Statistical hypothesis test Interval-valued imprecise data Bipolar decision

Domaines

Traitement du signal et de l'image [eess.SP]

Olivier Strauss : Connectez-vous pour contacter le contributeur

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01278071

Soumis le : mardi 23 février 2016-15:51:37

Dernière modification le : lundi 25 novembre 2024-14:19:44

Dates et versions

lirmm-01278071 , version 1 (23-02-2016)

Identifiants

HAL Id : lirmm-01278071 , version 1
DOI : 10.1016/j.ijar.2014.08.001

Citer

Julien Perolat, Ines Couso, Kevin Loquin, Olivier Strauss. Generalizing the Wilcoxon rank-sum test for interval data. International Journal of Approximate Reasoning, 2015, 56, pp.108-121. ⟨10.1016/j.ijar.2014.08.001⟩. ⟨lirmm-01278071⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA PARISTECH LIRMM ICARLIRMM CRISTAL INRIA2 CRISTAL-SEQUEL MIPS UNIV-MONTPELLIER UNIV-LILLE LTCI INSTITUT-MINES-TELECOM

318 Consultations

0 Téléchargements

Generalizing the Wilcoxon rank-sum test for interval data

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager