Joint distribution of rank statistics considering the location and scale parameters and its power study
Journal of Statistical Distributions and Applications volume 1, Article number: 6 (2014)
Abstract
The ranking method used for testing the equivalence of two distributions has been studied for decades and is widely adopted for its simplicity. However, due to the complexity of calculations, the power of the test is either estimated by a normal approximation or found when an appropriate alternative is given. Here, via the finite Markov chain imbedding technique, we establish the marginal and joint distributions of the rank statistics considering the shift and scale parameters, separately and simultaneously, under two different continuous distribution functions. Furthermore, the procedures of distribution equivalence tests and their power functions are discussed. Numerical results of a joint distribution of rank statistics under the standard normal distribution and the powers for a sequence of alternative normal distributions with means from −20 to 20 and standard deviations from 1 to 9 and their reciprocals are presented. In addition, we discuss the powers of the rank statistics under the Lehmann alternatives.
2010 Mathematics Subject Classification
Primary 62G07; Secondary 62G10
1 Introduction
Suppose that observations X_1,…,X_m and Y_1,…,Y_n are drawn from the cumulative distribution functions F and G, respectively. Two major topics in hypothesis testing are testing the equivalence of the centers and testing the equivalence of the dispersions of the two populations of interest. The hypotheses are stated, for some θ ≠ 0,
which is known as the shift alternative and, for some σ≠1,
Wilcoxon (1945) proposed the ranking method for testing the significance of the difference between two population means, also known as the Wilcoxon rank-sum test, and defined a statistic W_Y as the sum of the ranks of the y's in the combined and ordered sequence of x's and y's, equivalent to
Mann and Whitney (1947) introduced an elaboration of the ranking test, proposed the statistic U_X, and proved that the limiting distribution of the test statistic U_X is
as m and n go to infinity in any arbitrary manner where
and
with
where X,X′ and Y,Y′ are independently distributed, X,X′ with the distribution F, and Y,Y′ with the distribution G. Intuitively, the power for the right-sided test can be found as
where c is the value such that
Over the years, there have been studies on finding the exact or approximate power for the rank-sum test. By choosing an appropriate alternative distribution function, Shieh et al. (2006) derived the exact power for the uniform, normal, double exponential and exponential shift models. Rosner and Glynn (2009) discussed power against the family of alternatives of the form
where the underlying distributions F_X and F_Y are normal. Collings and Hamilton (1988) presented a bootstrap method to find the empirical distribution functions in order to approximate the power against the shift alternative. Lehmann (1953) derived the power function as
where s_j is the rank of y_j in the combined samples for the alternative hypothesis of
where k is a positive integer. However, Lehmann (1998) pointed out that the power function of the rank-sum test, Equation (2), was only qualitative, since the probabilities in Equation (1) are considerably complicated to evaluate numerically when F and G are continuous distributions with F≠G.
As the rank-sum test is widely adopted for testing the center differences of two distributions, it is natural to study the efficiency of a rank-sum test for variability (Ansari and Bradley 1960). For decades, studies have focused on proposing new definitions of the rank statistic and using the methods of Chernoff and Savage to show the relative efficiency of the proposed statistic to the F-test, see for example Mood (1954), Siegel and Tukey (1960), Ansari and Bradley (1960), and Klotz (1962). Ansari and Bradley (1960) mentioned that if the means of the X and Y samples cannot be considered equal, differences in location have a severe impact on all the tests of dispersion. Klotz (1962) showed the power of a rank test can be found by integrating the joint density of X and Y samples over that part of the m+n dimensional space defined by the alternative orderings which lie in the critical region of the test, for which conditions are very strict.
Our approach aims at relaxing some of the conditions required for finding the distribution of the proposed rank statistic. We systematically imbed the random vector U_n into a Markov chain to induce the marginal and joint distributions of the rank statistics considering the shift and scale parameters, respectively, under any form of two distribution functions. A joint distribution of rank statistics, to the best of our knowledge, has not been studied in the literature. The main strength of the finite Markov chain imbedding (FMCI) approach is that it derives the distribution of the rank statistic without imposing such conditions. Therefore, under the null hypothesis of F=G, we are able to identify a proper critical region and, under the alternative, the power of the test can be determined naturally. The distribution of the random vector U_n, which is independent of the form of the distribution function F, is also demonstrated under the null hypothesis of distribution equivalence.
The main contributions of this paper are as follows. In Section 2.1, we introduce the procedures of deriving the distribution of the rank statistic considering the shift parameter and its power function by using FMCI. The procedures are general and can be applied to either two identical distribution functions of interest or two different continuous density functions. In Section 2.2, we address the steps for finding the distribution of the rank statistic considering the scale parameter and its power function. In Section 2.3, we retrieve the joint distribution of the rank statistics considering the location and scale parameters simultaneously as well as its power function. Numerical results of a joint distribution and some powers of the rank statistics against shift parameter and scale parameter, individually and simultaneously, are presented in Section 3. We also discuss the powers of the rank statistics under the Lehmann alternatives. We end this paper with a short conclusion in Section 4.
2 Methods
2.1 Distributions of the rank statistic in the shift case
Let {X_1,…,X_m} and {Y_1,…,Y_n} be two independent samples from the continuous cumulative distribution functions F(x) and G(x−θ), respectively. Given x={x_1,…,x_m}, where x[i] is the ith smallest number in the sample, we have
for i=1,2,…,m+1 where x[0]=−∞ and x[m+1]=∞. Therefore, we define the sampling distribution of Y in the (m+1) intervals as
Given m, for t=1,2,…,n, let
where u_i(t) is the number of y's in the interval [x[i−1], x[i]) among y_1,…,y_t. For each u_n=(u_1(n),…,u_{m+1}(n)), we have a corresponding rank-sum of the y's in the combined sample
Theorem 1
The statistic R_l is equivalent to the statistic W_Y introduced by Wilcoxon (1945).
Proof
Let
The rank statistic W_Y, the sum of the ranks of the y observations, can be determined by
The first summation of the first term in Equation (5) can be interpreted as the number of y observations larger than x[i], which is u_{i+1}(n)+⋯+u_{m+1}(n) in our expression. It is not difficult to see that u_1(n)+⋯+u_{m+1}(n) equals n, the size of the y sample. Therefore, the equation can be rewritten as
It is then easy to see that W_Y = R_l.
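The equivalence in Theorem 1 can be checked numerically. The sketch below assumes that Equation (4) takes the form R_l = n(n+1)/2 + Σ (i−1) u_i(n), with i running from 1 to m+1, which agrees with the minimum and maximum ranks quoted in the proof of Theorem 3. The function names and the sample values are hypothetical and serve only as an illustration.

```python
from bisect import bisect_right

def interval_counts(x, y):
    """Return u = (u_1, ..., u_{m+1}); u_i counts the y's in [x[i-1], x[i])."""
    xs = sorted(x)
    u = [0] * (len(xs) + 1)
    for value in y:
        # bisect_right counts the x's <= value, which is the 0-based interval index.
        u[bisect_right(xs, value)] += 1
    return u

def rank_sum_from_counts(u):
    """Assumed form of R_l: n(n+1)/2 + sum over i of (i-1) * u_i, i = 1..m+1."""
    n = sum(u)
    # enumerate() starts at 0, so its index already equals (i - 1).
    return n * (n + 1) // 2 + sum(i * ui for i, ui in enumerate(u))

def wilcoxon_rank_sum(x, y):
    """W_Y: sum of the ranks of the y's in the combined ordered sample (no ties)."""
    combined = sorted([(v, "x") for v in x] + [(v, "y") for v in y])
    return sum(rank for rank, (_, label) in enumerate(combined, start=1)
               if label == "y")

# Hypothetical data with m = 5 and n = 7.
x = [0.3, -1.2, 2.1, 0.9, -0.4]
y = [1.5, -2.0, 0.1, 0.5, 3.3, -0.9, 1.0]
u = interval_counts(x, y)
print(u, rank_sum_from_counts(u), wilcoxon_rank_sum(x, y))
```

Both routes give the same rank sum for these data, since each y in the ith interval is preceded by exactly i−1 of the x's.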
Next, we demonstrate that for two random samples from the same population, the distribution of the random vector U_n is independent of the form of the distribution function.
Theorem 2
Distribution-free property of U n .
Proof
We know the joint PDF of the ordered sample of x′s is given by
and, when F=G, the conditional probability of the random vector U n given X=(x1,x2,…,x m ) is
where x[0]=−∞ and x[m+1]=∞. By taking the expected value of the conditional probability, we have
Using a change of variables, it is clear that the random variables F(x[1]),…,F(x[m]) have a Dirichlet distribution with parameters u_1(n)+1, u_2(n)+1, …, u_{m+1}(n)+1. Therefore, we have
which is independent of the distribution function.
This is why the random vector U_n is distribution-free under the null hypothesis. However, U_n follows the discrete uniform distribution, with mass one over the number of possible outcomes of U_n, only when F=G; in that case its distribution can be found by traditional combinatorial analysis. Unfortunately, when F≠G, we cannot establish the distribution of U_n through Equation (7), because solving the multiple integral in Equation (8) is tedious even for a conveniently chosen alternative distribution function and difficult otherwise. To our understanding, the problem of finding the power of the test remains unsolved in most cases. To overcome this, we bring in the finite Markov chain imbedding approach.
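As an illustration of the combinatorial route available when F=G, the following sketch enumerates every outcome of U_n, gives each the same mass, and collects the exact null distribution of the rank sum. It assumes the rank sum is n(n+1)/2 + Σ (i−1) u_i(n); the helper names are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations
from math import comb

def compositions(n, parts):
    """Yield every tuple of `parts` non-negative integers summing to n."""
    # Stars and bars: choose the positions of the parts-1 bars among n+parts-1 slots.
    for bars in combinations(range(n + parts - 1), parts - 1):
        prev, u = -1, []
        for b in bars:
            u.append(b - prev - 1)      # stars between consecutive bars
            prev = b
        u.append(n + parts - 2 - prev)  # stars after the last bar
        yield tuple(u)

def null_pmf_rank_sum(m, n):
    """Exact pmf of the rank sum under F = G, using the uniform law of U_n."""
    weight = Fraction(1, comb(m + n, n))   # one over the number of outcomes
    pmf = Counter()
    for u in compositions(n, m + 1):
        r = n * (n + 1) // 2 + sum(i * ui for i, ui in enumerate(u))
        pmf[r] += weight
    return dict(pmf)

pmf = null_pmf_rank_sum(5, 7)
print(len(pmf), sum(pmf.values()))
```

The number of outcomes is C(m+n, n), the same as the number of ways to pick which n of the m+n ranks belong to the y's, so this enumeration agrees with the classical Wilcoxon null distribution.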
Let Ω_t, t=0,1,…,n, be the state space, which has
possible states, let Γ_n={0,1,…,n} be an index set, and let {Z_t : t∈Γ_n} be a non-homogeneous Markov chain on the state space Ω_t. As a transition probability matrix M_t for this chain, t=1,…,n, consider
where
and p_i is defined in Equation (3).
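A minimal sketch of the imbedded chain, conditional on a fixed x sample, is given below. It assumes that p_i is the probability that a single y falls in [x[i−1], x[i]), that is, p_i = G(x[i]−θ) − G(x[i−1]−θ), and that each transition moves from a state u_{t−1} to u_{t−1} plus one unit in coordinate i with probability p_i. The choice of a standard normal G, the function names, and the dictionary representation (instead of explicit matrices M_t) are assumptions made for illustration.

```python
from collections import defaultdict
from math import erf, sqrt

def normal_cdf(z):
    """Standard normal cdf, used here as an assumed choice of G."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))

def interval_probs(x, theta=0.0, cdf=normal_cdf):
    """p_i = G(x[i] - theta) - G(x[i-1] - theta), with x[0] = -inf, x[m+1] = +inf."""
    xs = sorted(x)
    cuts = [0.0] + [cdf(v - theta) for v in xs] + [1.0]
    return [cuts[i + 1] - cuts[i] for i in range(len(xs) + 1)]

def imbedded_chain(p, n):
    """Propagate the law of the chain for t = 1..n; returns {state u_n: probability}."""
    dist = {tuple([0] * len(p)): 1.0}   # the chain starts in the all-zero state
    for _ in range(n):
        nxt = defaultdict(float)
        for u, prob in dist.items():
            for i, p_i in enumerate(p):  # the next y lands in interval i
                v = u[:i] + (u[i] + 1,) + u[i + 1:]
                nxt[v] += prob * p_i
        dist = dict(nxt)
    return dist

def conditional_rank_pmf(dist):
    """Partition the final states by their rank sum and add up the probabilities."""
    pmf = defaultdict(float)
    for u, prob in dist.items():
        n = sum(u)
        pmf[n * (n + 1) // 2 + sum(i * ui for i, ui in enumerate(u))] += prob
    return dict(pmf)

x = [-0.5, 0.2, 1.1]                    # hypothetical x sample, m = 3
p = interval_probs(x, theta=0.5)
dist = imbedded_chain(p, 4)             # n = 4 observations of y
print(len(dist), conditional_rank_pmf(dist))
```

The number of states after n steps equals C(m+n, n), which is why the approach becomes expensive as the sample sizes grow.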
Theorem 3
R_l(U_n|X) is finite Markov chain imbeddable, and
where the unit row vector corresponds to state u_n, ξ (=P(Z_0=1)=1) is the initial probability, and M_t, t=1,…,n, are the transition probability matrices of the imbedded Markov chain defined on the state space Ω_t.
Proof
For each u n =(u1(n),⋯,um+1(n)) in the state space Ω n , we have a corresponding rank R l as shown in Equation (4). Intuitively, the minimum rank r l s is n(n+1)/2 and the maximum rank r l b is n(2m+n+1)/2. In accordance with the possible values of the rank R l , we define a finite partition {C r :r=r l s ,…,r l b } such that
where is a unit row vector corresponding to state U n , we then obtain the conditional probability of the rank R l .
Then, the Law of Large Numbers is used to determine the probability of U n for any continuous F and G
where X i is the ith sample of size m from the distribution function F. It is easy to see that
To test
for some θ≠0, the power function is approximated by
where
Note that the form of the alternative hypothesis depends on the purpose of the test. The critical region needs only a slight modification if a one-sided test is adopted.
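The averaging step and the resulting power approximation can be sketched as follows. This is not the state-space construction itself: because the y's are conditionally independent given x, the conditional law of the rank sum is obtained here by an n-fold convolution, a shortcut used only to keep the example short. The normal shift model, the level 0.10, the number of replications, and all function names are assumptions.

```python
import random
from itertools import combinations
from math import comb, erf, sqrt

def normal_cdf(z):
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))

def null_pmf(m, n):
    """Exact null pmf of the rank sum: each set of n ranks is equally likely."""
    pmf, w = {}, 1.0 / comb(m + n, n)
    for ranks in combinations(range(1, m + n + 1), n):
        pmf[sum(ranks)] = pmf.get(sum(ranks), 0.0) + w
    return pmf

def two_sided_region(pmf, alpha):
    """Largest symmetric two-tailed region with null probability at most alpha."""
    values = sorted(pmf)
    region, size = set(), 0.0
    for low, high in zip(values, reversed(values)):
        if low >= high or size + pmf[low] + pmf[high] > alpha:
            break
        region.update((low, high))
        size += pmf[low] + pmf[high]
    return region, size

def conditional_pmf(x, n, theta):
    """pmf of the rank sum given x when Y ~ N(theta, 1), by n-fold convolution."""
    xs = sorted(x)
    cuts = [0.0] + [normal_cdf(v - theta) for v in xs] + [1.0]
    p = [cuts[i + 1] - cuts[i] for i in range(len(xs) + 1)]
    excess = [1.0]                      # law of the sum of (i-1) over the y's so far
    for _ in range(n):
        new = [0.0] * (len(excess) + len(p) - 1)
        for a, pa in enumerate(excess):
            for b, pb in enumerate(p):
                new[a + b] += pa * pb
        excess = new
    base = n * (n + 1) // 2
    return {base + k: q for k, q in enumerate(excess)}

def power(m, n, theta, region, reps=2000, seed=0):
    """Average the conditional rejection probability over simulated x samples."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(reps):
        x = [rng.gauss(0.0, 1.0) for _ in range(m)]
        pmf = conditional_pmf(x, n, theta)
        total += sum(q for r, q in pmf.items() if r in region)
    return total / reps

m, n = 5, 7
region, size = two_sided_region(null_pmf(m, n), alpha=0.10)
print(size, power(m, n, 0.0, region), power(m, n, 1.5, region))
```

With θ=0 the averaged rejection probability should be close to the exact size of the region, which gives a simple sanity check on the simulation.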
2.2 Distributions of the rank statistic in the scale case
We studied the distribution and the power function of the rank statistic R_l considering a shift in location. Now, the distribution and the power function of the rank statistic considering the scale parameter will be addressed. For this purpose, we consider F(x)=G(xσ^{−1}) and state the null and alternative hypotheses as
To do so, we begin with the procedure of finding the distribution of the rank statistic considering the scale parameter, denoted R_s, through the random vector U_n. The array of ranks is given by
if m+n is even, and
if m+n is odd. We first introduce how to determine the rank-sum of the y observations in the combined samples, R_s, with respect to
where u_i(n) is the number of y observations belonging to [x[i−1], x[i]). Let med(x,y) be the median of the combined x's and y's, and suppose that it belongs to [x[i], x[i+1]); it then breaks U_n into two parts. If m+n is odd and med(x,y)=x[i], then
is a 1×i vector and
is a 1×(m+1−i) vector. The second possible case is, if m+n is odd and , then , a row vector with length i+1, has the form
and , a row vector with length m+1−i, is given by
The third possible case is, if m+n is even and x[i] is the smallest number larger than m e d(x,y), the vectors are now defined as
and
The last possibility is, if m+n is even, is the smallest number larger than m e d(x,y). The vectors are now defined as
and
Let n− be the length of the vector and n+ be the length of the vector .
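A scale rank sum of this kind can be computed directly from the combined sample. The sketch below assumes that the array of ranks is the Ansari and Bradley (1960) scoring, which rises from 1 at both extremes to a peak at the median and differs between even and odd m+n; this is an assumption about the rank array, and the function names and data are hypothetical.

```python
def scale_scores(total):
    """Scores for ranks 1..total: 1, 2, ..., peak, ..., 2, 1 (assumed scoring)."""
    # For even totals the peak value total/2 appears twice; for odd totals the
    # single middle rank receives (total + 1) / 2.
    return [min(r, total + 1 - r) for r in range(1, total + 1)]

def scale_rank_sum(x, y):
    """Sum of the scale scores attached to the y's in the combined ordered sample."""
    combined = sorted([(v, "x") for v in x] + [(v, "y") for v in y])
    scores = scale_scores(len(combined))
    return sum(s for s, (_, label) in zip(scores, combined) if label == "y")

# Hypothetical data with m = 5 and n = 7 (no ties).
x = [0.3, -1.2, 2.1, 0.9, -0.4]
y = [1.5, -2.0, 0.1, 0.5, 3.3, -0.9, 1.0]
print(scale_scores(6), scale_scores(7), scale_rank_sum(x, y))
```

Under this scoring, a y sample that is less dispersed than the x sample concentrates near the median and produces a large rank sum, while a more dispersed y sample produces a small one.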
Theorem 4
R_s(U_n|X) is finite Markov chain imbeddable, and
where the unit row vector corresponds to state U_n, ξ (=P(Z_0=1)=1) is the initial probability, and M_t, t=1,…,n, are the transition probability matrices of the imbedded Markov chain defined on the state space Ω_t.
Proof
For each U n in the state space Ω n , we have a corresponding
The smallest possible value of R s (U n ) is
and the largest possible value is
In accordance with Equation (11), we use the possible values of R_s as the rule for the partition. The rest of the proof follows along the same lines as that of Theorem 3 and is omitted here.
Similarly, we apply the LLN to conclude that
which establishes the distribution of R s .
Through FMCI we, again, successfully retrieved the distribution of R s under selected alternative distributions, for which the procedures are similar to those in the previous section. In addition, it is quite intuitive to approximate the power function by
where
2.3 Joint distributions of the rank statistics in the shift and scale case
We have derived the marginal distributions of R_l and R_s in terms of U_n, which yields the following theorem.
Theorem 5
(R_l(U_n|X), R_s(U_n|X)) is finite Markov chain imbeddable, and
where the unit row vector corresponds to state u_n, ξ (=P(Z_0=1)=1) is the initial probability, and M_t, t=1,…,n, are the transition probability matrices of the imbedded Markov chain defined on the state space Ω_t.
Proof
By Equations (4) and (11), each u_n in the state space Ω_n has corresponding values of R_l and R_s. The combinations of the values of R_l and R_s are used as the criterion for the partition. The rest of the proof follows along the same lines as that of Theorem 3.
The joint distribution of the ranks considering both the location and scale parameters, which can be determined through our algorithm, has not yet been studied in the literature. Our result allows us to test the homogeneity of the distribution functions, F(x)=G((x−θ)σ^{−1}). We state the hypotheses as follows
Also we are able to identify a proper critical region under the null hypothesis and discuss its power when F≠G. For example, a rectangular critical region can be
where r1l, r2l, r1s and r2s are the critical values such that
or an elliptic critical region
for some positive constants a and b such that
According to the above defined rejection region, the power of the test can be found as
or
Note that, unless we have a conjecture about the values of θ and σ, we tend to use a two-sided test. However, with knowledge of the center and shape of the distribution of interest, a sectorial critical region is a better choice; an example is demonstrated in the numerical studies.
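To make the two shapes of critical region concrete, the sketch below enumerates the exact joint null distribution of the two rank sums for small samples and evaluates the null probability of a rectangular and an elliptic region. Several ingredients are assumptions: the scale scores follow the Ansari and Bradley pattern, the ellipse is centred at the null means, and the critical values and half-axes are hypothetical numbers chosen only for illustration.

```python
from fractions import Fraction
from itertools import combinations
from math import comb

def joint_null_pmf(m, n):
    """Exact joint pmf of (R_l, R_s) under F = G by enumerating the y-rank sets."""
    total = m + n
    w = Fraction(1, comb(total, n))
    pmf = {}
    for ranks in combinations(range(1, total + 1), n):
        r_l = sum(ranks)
        r_s = sum(min(r, total + 1 - r) for r in ranks)   # assumed scale scores
        pmf[(r_l, r_s)] = pmf.get((r_l, r_s), 0) + w
    return pmf

def marginal(pmf, axis):
    """Marginal pmf of R_l (axis 0) or R_s (axis 1)."""
    out = {}
    for key, p in pmf.items():
        out[key[axis]] = out.get(key[axis], 0) + p
    return out

def region_size(pmf, reject):
    """Null probability of the set of (r_l, r_s) pairs where reject(...) is True."""
    return sum(p for (r_l, r_s), p in pmf.items() if reject(r_l, r_s))

m, n = 5, 7
pmf = joint_null_pmf(m, n)
mean_l = sum(r * p for r, p in marginal(pmf, 0).items())
mean_s = sum(r * p for r, p in marginal(pmf, 1).items())

# Hypothetical critical values and half-axes, not taken from the paper.
def rect(r_l, r_s):
    return r_l <= 34 or r_l >= 57 or r_s <= 20 or r_s >= 29

def ellipse(r_l, r_s):
    return ((r_l - mean_l) / 12) ** 2 + ((r_s - mean_s) / 5) ** 2 > 1

print(float(region_size(pmf, rect)), float(region_size(pmf, ellipse)))
```

In practice the constants would be tuned until the region reaches the desired level; the power then follows by summing the joint distribution under the alternative over the same region.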
3 Numerical results and discussion
3.1 A joint distribution of R l and R s
Let {X_1,…,X_5}∼N(0,1) and {Y_1,…,Y_7}∼N(θ,σ). Figure 1 gives the joint distribution of the random variables R_l and R_s under the null hypothesis of θ=0 and σ=1. The marginal distributions of R_l and R_s can be easily established from their joint distribution. Figure 1 also shows that the two random variables R_l and R_s are dependent. We construct two critical regions, as shown in Figure 2, according to their joint distribution. The area outside the yellow region in Figure 2 is the selected rectangular critical region C_{0.1738}, and the area outside the red shaded region is the elliptic one, C′_{0.1738}.
3.2 Powers for a joint test using R l and R s
The alternative of interest is stated in the preceding section (see Equation (14)). The power functions of the test statistics R_l and R_s for a sequence of normally distributed populations with θ from −20 to 20 in increments of 0.5 and σ from 1 to 10 in increments of 1, together with the reciprocals of these σ values, under the two types of critical regions are provided in Figures 3 and 4. We adopt a two-sided test because of the selected values of the parameters. If a one-sided test is adopted, the critical region from the previous step should be slightly modified before the powers are calculated. The two critical regions perform roughly equally well, as shown in Figures 3 and 4. Figure 5 presents the performance of the two critical regions under various parameter settings. Figures 5(a) and (b) show that, given a standard deviation of 1 or a mean of 0, the powers of the rectangular and elliptic critical regions are high and similar. However, when the variation of the alternative population decreases (σ=1/10) or increases (σ=10), the elliptic critical region performs better than the rectangular one, as shown in Figures 5(c) and (d). Therefore, we suggest that an elliptic rejection region be used when testing the equivalence of two distributions.
Next, we consider the problem of determining an optimum rank test. To conduct a test of distribution equivalence, we can use either R_l or R_s as the test statistic. As mentioned earlier, the marginal distribution of R_l or R_s can be easily established from their joint distribution. Figures 6 and 7 provide the power functions for the test statistics R_l and R_s, respectively, at the 17.38% level of significance. Figure 7 shows that the rank test against the scale parameter is badly affected by the centre of the alternative population, as observed earlier by Ansari and Bradley (1960). Comparing Figures 6 and 7 with Figure 4, the joint test appears much more reliable than either R_l or R_s alone for distribution equivalence tests. A joint test for distribution equivalence would therefore seem to be the better option under most circumstances.
3.3 Lehmann alternatives
Considering the one-sided alternative F(x;θ,σ) > G(x;θ,σ), Lehmann (1953) proposed a test of H_0: F(x;θ,σ) = G(x;θ,σ) against H_a: F(x;θ,σ)^k = G(x;θ,σ), which is known as the family of Lehmann alternatives. Note that F(x;θ,σ)^k is the cumulative distribution function of max_{1≤i≤k}(X_i) when X_i ∼ F and that, under the alternative hypothesis, G(x;θ,σ) is stochastically larger than F(x;θ,σ). First of all, we know
Therefore, the larger R_l is, the stronger the evidence against the null hypothesis. As for the variation of the distribution itself, the mass of the density is compressed toward larger values; therefore, in most cases, we have Var(X_k) < Var(X). We then propose to reject the null hypothesis when R_s is large. For example, given F ∼ U(0,1) and G = F^k, it is easy to see
and
for all k. We first find the marginal and joint distributions of the ranks R_l and R_s in order to define critical regions for R_l and R_s individually and simultaneously. Given the properties of the mean and variance of the alternative distribution, as shown in Equations (17), (18) and (19), we define the critical regions with care. Table 1 provides powers for the tests when the hypothesized distribution is uniform, standard normal, Student's t with 3 degrees of freedom, or exponential, for several settings of the sample sizes m and n, and for k = 2, 3, 6. Clearly, a joint test considering both R_l and R_s is better suited to testing the equality of distributions than tests considering only one of the rank statistics.
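For the uniform example, the moments of the Lehmann alternative can be written down exactly, since G = F^k with F ∼ U(0,1) has density k x^(k−1) on (0,1). The sketch below computes the mean k/(k+1) and the variance k/(k+2) − (k/(k+1))^2 with exact fractions and draws from G by inverse transform; the function names are hypothetical, and the values of k mirror those used in Table 1.

```python
import random
from fractions import Fraction

def max_uniform_moments(k):
    """Exact mean and variance of max(U_1, ..., U_k), U_i ~ U(0,1), with cdf x**k."""
    mean = Fraction(k, k + 1)       # integral of x * k x^(k-1) over (0, 1)
    second = Fraction(k, k + 2)     # integral of x^2 * k x^(k-1) over (0, 1)
    return mean, second - mean ** 2

def draw_lehmann(rng, k):
    """Inverse-transform draw from G = F^k with F = U(0,1): G^{-1}(v) = v**(1/k)."""
    return rng.random() ** (1.0 / k)

for k in (2, 3, 6):
    mean, var = max_uniform_moments(k)
    # Compare with the U(0,1) mean 1/2 and variance 1/12.
    print(k, mean, var, mean > Fraction(1, 2), var < Fraction(1, 12))

rng = random.Random(1)
sample = [draw_lehmann(rng, 3) for _ in range(20000)]
print(sum(sample) / len(sample))
```

For every k ≥ 2 the mean exceeds 1/2 and the variance falls below 1/12, which is consistent with rejecting for large values of both rank statistics.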
4 Conclusion
Our proposed algorithm provides a solution for finding the power of distribution equivalence tests considering the shift and scale parameters, separately and simultaneously. Numerical studies show that a joint test should be adopted for testing the homogeneity of distributions, including under Lehmann alternatives. Also, an elliptic critical region is a better choice than a rectangular one for a joint test. In practice, it is reasonable to assume neither normality nor equal means or variances of the distributions of interest. However, our algorithm depends heavily on computing resources, as the number of possible states in Ω_n grows rapidly when the sample sizes increase. Therefore, we can, so far, only target small sample sizes in our work.
References
Ansari AR, Bradley RA: Rank-Sum Tests for Dispersions. Ann. Math. Stat 1960, 31: 1174–1189. 10.1214/aoms/1177705688
Collings BJ, Hamilton MA: Estimating the power of the two-sample Wilcoxon Test for location shift. Biometrics 1988, 44: 847–860. 10.2307/2531596
Klotz J: Nonparametric test for scale. Ann. Math. Stat 1962, 33: 498–512. 10.1214/aoms/1177704576
Lehmann EL: The power for rank tests. Ann. Math. Stat 1953, 24: 23–43. 10.1214/aoms/1177729080
Lehmann EL: Nonparametrics: Statistical Methods Based on Ranks. Prentice-Hall, New Jersey; 1998.
Mann HB, Whitney DR: On a test of whether one of two random variables is stochastically larger than the other. Ann. Math. Stat 1947, 18: 50–60. 10.1214/aoms/1177730491
Mood AM: On the asymptotic efficiency of certain nonparametric two-sample tests. Ann. Math. Stat 1954, 25: 514–522. 10.1214/aoms/1177728719
Rosner B, Glynn RJ: Power and sample size estimation for the Wilcoxon rank sum test with application to comparisons of C statistics from alternative prediction models. Biometrics 2009, 65: 188–197. 10.1111/j.1541-0420.2008.01062.x
Shieh G, Jan SL, Randles RH: On power and sample size determinations for the Wilcoxon-Mann-Whitney test. Nonparametric Stat 2006, 18: 33–43. 10.1080/10485250500473099
Siegel S, Tukey JW: A nonparametric sum of ranks procedure for relative spread in unpaired samples. J. Am. Stat. Assoc 1960, 55: 429–445. 10.1080/01621459.1960.10482073
Wilcoxon F: Individual comparisons by ranking methods. Biometrics 1945, 1: 80–83. 10.2307/3001968
Acknowledgments
The author would like to thank James C. Fu and an anonymous referee, whose comments led to significant improvements of this manuscript.
Competing interests
The author declares that she has no competing interests.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Lee, WC. Joint distribution of rank statistics considering the location and scale parameters and its power study. J Stat Distrib App 1, 6 (2014). https://doi.org/10.1186/2195-5832-1-6