The joint density of a Sarmanov–Lee distribution is given by

h(x,y) = f(x) g(y) [1 + ω θ1(x) θ2(y)],   (5)

where f and g are the densities of the marginals F and G, ω is a real parameter, while θ1 and θ2 are measurable functions satisfying the condition

∫ θ1(x) f(x) dx = ∫ θ2(y) g(y) dy = 0

(together with 1 + ω θ1(x) θ2(y) ≥ 0 for all x, y), which serves to ensure that h is a bona fide joint density.
When the marginals F and G are absolutely continuous, setting θ1 = 1 - 2F and θ2 = 1 - 2G shows that FGM is a special case of the Sarmanov–Lee.
Shubina and Lee (2004) showed that the maximum positive correlation of (5) is attained by concentrating all the mass on the (NE–SW) quadrants {(x,y) : (x − x0)(y − y0) ≥ 0}, and the maximum negative correlation by concentrating it on the (NW–SE) quadrants {(x,y) : (x − x0)(y − y0) ≤ 0}, for some real numbers x0 and y0.
The improvement in correlation is substantial: for uniform marginals, the maximum correlation is 3/4, as opposed to 1/3 for the plain FGM and 0.434 for the (k = 2) iterated FGM.
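These numbers are easy to check. As a quick numerical sketch (not from the paper), the FGM density with uniform marginals, h(x,y) = 1 + λ(1 − 2x)(1 − 2y) on [0,1]², yields ρ = λ/3, so the maximum over |λ| ≤ 1 is the 1/3 quoted above:

```python
# Numerical check: for the FGM density with uniform marginals,
# h(x, y) = 1 + lam * (1 - 2x) * (1 - 2y), the correlation equals lam / 3,
# so the maximum correlation over |lam| <= 1 is 1/3.
def fgm_correlation(lam, m=300):
    # Midpoint rule on an m x m grid over the unit square.
    pts = [(i + 0.5) / m for i in range(m)]
    exy = 0.0
    for x in pts:
        for y in pts:
            h = 1.0 + lam * (1.0 - 2.0 * x) * (1.0 - 2.0 * y)
            exy += x * y * h
    exy /= m * m
    cov = exy - 0.25            # E[X] = E[Y] = 1/2
    return cov / (1.0 / 12.0)   # Var(X) = Var(Y) = 1/12 for U(0, 1)

print(fgm_correlation(1.0))     # close to 1/3
```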
To further extend the Sarmanov–Lee family (5), a ‘generalized’ Sarmanov–Lee was proposed by Bairamov et al. (2001),

h(x,y) = f(x) g(y) [1 + α T(F(x), G(y))],   (6)

where the product function θ1(x) θ2(y) in (5) is replaced by T(F(x), G(y)), in which T is an integrable bivariate function on [0,1]², satisfying

∫_0^1 T(u,v) du = 0 for all v ∈ [0,1], and ∫_0^1 T(u,v) dv = 0 for all u ∈ [0,1],   (7)

and the parameter α satisfies

−1 / sup{T(u,v) : (u,v) ∈ D+} ≤ α ≤ −1 / inf{T(u,v) : (u,v) ∈ D−},   (8)

where D+ = {(u,v) : T(u,v) > 0} and D− = {(u,v) : T(u,v) < 0}. Again, the constraints (7) and (8) together guarantee that the function h in (6) is a bona fide bivariate density with marginal densities f and g.
The new class (6) is so rich that it contains members arbitrarily close to H+ and H−, the two extremal Fréchet–Hoeffding distributions in (1). It is thus flexible enough to accommodate nearly the maximum positive (negative) correlation. Some simple examples help demonstrate the idea (more can be found in Lin and Huang 2011).
Example 1.
The generalized Sarmanov–Lee density (3 × 3) with uniform marginals. Consider the partitioning of the unit square into 3 × 3 = 9 subsquares, induced by the lines x, y = 1/3, 2/3. Let the function T be defined by

T(u,v) = 3 Σ_{i=1}^3 1{(i−1)/3 < u ≤ i/3, (i−1)/3 < v ≤ i/3} − 1,  (u,v) ∈ [0,1]²,

and take α = 1 in (6). Then the joint density becomes

h(x,y) = 3 if (x,y) lies in one of the three diagonal subsquares ((i−1)/3, i/3]², i = 1, 2, 3, and h(x,y) = 0 otherwise.

Its correlation is ρ(h) = 8/9, which exceeds the maximum, ρmax = 3/4, of the plain Sarmanov–Lee (with uniform marginals).
Example 2.
The generalized Sarmanov–Lee density (n × n) with uniform marginals. For n = 2, 3, 4, …, let

T_n(u,v) = n Σ_{i=1}^n 1{(i−1)/n < u ≤ i/n, (i−1)/n < v ≤ i/n} − 1,  (u,v) ∈ [0,1]²,

and take α = 1, so that the joint density h_n equals n on the n diagonal subsquares ((i−1)/n, i/n]² and 0 elsewhere. We have the correlation ρ(h_n) = 1 − n^{-2} → 1 as n → ∞.
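A quick numerical check of Examples 1 and 2 (a sketch, not code from the paper): with uniform marginals, E(X_nY_n) = n Σ a_i², where a_i = (2i − 1)/(2n²) is the integral of u over the i-th subinterval, and the correlation comes out to 1 − n^{-2}:

```python
# Correlation of the n x n diagonal-block density with uniform marginals:
# h_n = n on each diagonal subsquare ((i-1)/n, i/n]^2 and 0 elsewhere.
def rho_uniform(n):
    # a_i = integral of u over ((i-1)/n, i/n] = (2i - 1) / (2 n^2)
    a = [(2 * i - 1) / (2.0 * n * n) for i in range(1, n + 1)]
    exy = n * sum(ai * ai for ai in a)   # E[X_n Y_n]
    cov = exy - 0.25                     # E[X] = E[Y] = 1/2
    return cov / (1.0 / 12.0)            # Var = 1/12

print(rho_uniform(3))   # 8/9, as in Example 1
```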
Example 3.
The generalized Sarmanov–Lee density (n × n) with exponential marginals. Let

h_n(x,y) = n e^{-(x+y)} Σ_{i=1}^n 1{ln(n/(n−i+1)) < x ≤ ln(n/(n−i)), ln(n/(n−i+1)) < y ≤ ln(n/(n−i))},  x, y > 0,

where ln(n/0) ≡ ∞. Our calculations show that ρ(h_n) ≈ 1 − 1.08/n → 1 as n → ∞.
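The approximation ρ(h_n) ≈ 1 − 1.08/n can be probed numerically (a sketch; the exact antiderivative of the exponential quantile −ln(1 − u) is our device for the block integrals):

```python
import math

# Correlation of the n x n diagonal-block density with standard
# exponential marginals.  The block integrals of the quantile
# q(u) = -ln(1 - u) come from the antiderivative
# A(u) = (1 - u) ln(1 - u) - (1 - u), with A(1) = 0.
def rho_exponential(n):
    def A(u):
        return 0.0 if u >= 1.0 else (1.0 - u) * math.log(1.0 - u) - (1.0 - u)
    exy = n * sum((A(i / n) - A((i - 1) / n)) ** 2 for i in range(1, n + 1))
    return exy - 1.0   # mean = variance = 1 for Exp(1)

for n in (10, 100, 1000):
    print(n, rho_exponential(n), n * (1.0 - rho_exponential(n)))
# the last column settles near 1.08
```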
The convergence to the maximal correlation is not limited to uniform or exponential marginals; it holds in much more general settings. The following Theorems 1 through 3 are successively stronger. Only Theorem 3 will be proved, since the others appeared in Lin and Huang (2011).
Let F and G be arbitrary distributions (not necessarily identical, nor ‘of the same type’) with densities F′ = f and G′ = g. Define the joint density of (X_n, Y_n) by

h_n(x,y) = n f(x) g(y) Σ_{i=1}^n 1{(i−1)/n < F(x) ≤ i/n, (i−1)/n < G(y) ≤ i/n},   (9)

where x and y range over (ℓ_F, r_F) and (ℓ_G, r_G), in which ℓ_F = inf{x : F(x) > 0} and r_F = sup{x : F(x) < 1} are the left and right extremities of the distribution F, respectively (and similarly for G). It qualifies as a generalized Sarmanov–Lee bivariate density (with marginal densities f and g).
Theorem 1.
For F = G, (9) becomes

h_n(x,y) = n f(x) f(y) Σ_{i=1}^n 1{(i−1)/n < F(x) ≤ i/n, (i−1)/n < F(y) ≤ i/n}.

If F has a finite variance σ², then the correlation is

ρ(h_n) = (n Σ_{i=1}^n a_i² − μ²) / σ²,  where a_i = ∫_{(i−1)/n}^{i/n} F^{-1}(u) du and μ = E(X),

which converges to ρ(H+), which is 1 in this case, as n → ∞.
Theorem 2.
If in (9), X and Y satisfy any of the following conditions:
(i) F^{-1} or G^{-1} is uniformly continuous on (0,1);
(ii) a ≤ X, a′ ≤ Y a.s.;
(iii) X ≤ b, Y ≤ b′ a.s.;
(iv) X ≥ a, Y ≤ b a.s. and F^{-1}, G^{-1} have continuous derivatives,
where a, b, a′, b′ ∈ R, then ρ(h_n) converges to ρ(H+) as n → ∞.
Remarks 1.
Theorem 1 follows immediately from a general result, which is interesting in its own right: For any X ∼ F with finite E(X²),

lim_{n→∞} n Σ_{i=1}^n (∫_{(i−1)/n}^{i/n} F^{-1}(u) du)² = E(X²).
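The general result of Remarks 1, namely n Σ_{i=1}^n (∫_{(i−1)/n}^{i/n} F^{-1}(u) du)² → E(X²), is easy to illustrate numerically. A sketch with a power-function marginal F(x) = x² on [0,1] (our choice, not the paper's), for which F^{-1}(u) = √u and E(X²) = 1/2:

```python
# Illustration of  n * sum_i a_i^2 -> E(X^2), where a_i is the integral
# of F^{-1}(u) over ((i-1)/n, i/n].  Here F(x) = x^2 on [0, 1], so
# F^{-1}(u) = sqrt(u), with antiderivative (2/3) u^{3/2}, and E(X^2) = 1/2.
def block_sum(n):
    A = lambda u: (2.0 / 3.0) * u ** 1.5
    return n * sum((A(i / n) - A((i - 1) / n)) ** 2 for i in range(1, n + 1))

for n in (10, 100, 1000):
    print(n, block_sum(n))   # approaches E(X^2) = 1/2
```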
The proofs of Remarks 1 and Theorem 2 are based on the following two lemmas.
Lemma 1.
(Chebyshev’s inequality for integrals.) Let f1, f2 : (a,b) → R be both increasing or both decreasing, and let p : (a,b) → (0,∞) be an integrable function. Then

∫_a^b f1(x) f2(x) p(x) dx · ∫_a^b p(x) dx ≥ ∫_a^b f1(x) p(x) dx · ∫_a^b f2(x) p(x) dx,

provided that all integrals exist and are finite.
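A numerical sanity check of Lemma 1 (a sketch with arbitrarily chosen f1, f2, p on (0,1); none of these particular functions come from the paper):

```python
import math

# Chebyshev's inequality for integrals on (a, b) = (0, 1):
# f1, f2 both increasing, p a positive integrable weight.
def integral(func, m=2000):
    # midpoint rule on (0, 1)
    return sum(func((i + 0.5) / m) for i in range(m)) / m

f1 = lambda x: x ** 2                         # increasing on (0, 1)
f2 = lambda x: math.exp(x)                    # increasing on (0, 1)
p = lambda x: 1.0 + math.sin(3.0 * x) ** 2    # positive weight

lhs = integral(lambda x: f1(x) * f2(x) * p(x)) * integral(p)
rhs = integral(lambda x: f1(x) * p(x)) * integral(lambda x: f2(x) * p(x))
print(lhs >= rhs)   # True
```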
Remarks 2.
An extension of this inequality can be rephrased in probabilistic terms: Let X be any random variable and let f1, f2 be both increasing or both decreasing, then the covariance Cov(f1(X), f2(X)) = E [f1(X)f2(X)] - E [f1(X)]E [f2(X)] is non-negative, provided that the expectations E[f1(X)], E [f2(X)] and E [f1(X)f2(X)] exist.
Lemma 2.
(Euler–Maclaurin summation formula.) Let m < n be positive integers and let f be a real-valued function on [m,n]. Then we have
(i) if f has a continuous derivative on [m,n],

Σ_{k=m}^n f(k) = ∫_m^n f(x) dx + (f(m) + f(n))/2 + ∫_m^n (x − ⌊x⌋ − 1/2) f′(x) dx;

(ii) if f has a continuous derivative of order 4,

Σ_{k=m}^n f(k) = ∫_m^n f(x) dx + (f(m) + f(n))/2 + (f′(n) − f′(m))/12 + R,

where the remainder R = (1/24) ∫_m^n f^{(4)}(x) h(x) dx, in which h(x) = B4 − B4(x − ⌊x⌋), B4(x) = x⁴ − 2x³ + x² − 1/30 is the Bernoulli polynomial, and B4 = B4(0) = −1/30 the Bernoulli number.
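Part (i) can be verified exactly for a concrete f (a sketch; the choice f(x) = 1/x and the closed-form correction integral are ours):

```python
import math

# Euler-Maclaurin, first order, for f(x) = 1/x on [m, n]:
# sum_{k=m}^{n} f(k) = int_m^n f + (f(m) + f(n)) / 2
#                      + int_m^n (x - floor(x) - 1/2) f'(x) dx.
m_, n_ = 1, 50
lhs = sum(1.0 / k for k in range(m_, n_ + 1))

# The correction integral, computed in closed form on each [k, k+1]
# (f'(x) = -1/x^2; the piecewise antiderivative is exact):
corr = sum(-math.log((k + 1) / k) + (k + 0.5) / (k * (k + 1))
           for k in range(m_, n_))

rhs = math.log(n_ / m_) + (1.0 / m_ + 1.0 / n_) / 2.0 + corr
print(abs(lhs - rhs))   # ~0 (exact identity, up to rounding)
```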
Theorem 3.
For arbitrary marginals F and G with densities F′ = f and G′ = g, we have (i) the distribution H_n of (9) converges weakly to H+ as n → ∞, and (ii) the correlation of H_n converges to that of H+ as n → ∞, provided that F and G have finite variances.
Proof.
Note that the support of (9) is contained in the region bounded by the two curves G(y) = F(x) − 1/n and G(y) = F(x) + 1/n. Therefore, for any (x,y) in the region G(y) > F(x), we have Pr(X_n < x, Y_n > y) = 0 for all large n (namely, n ≥ (G(y) − F(x))^{-1}). This implies that H_n(x,y) = Pr(X_n ≤ x, Y_n ≤ y) = Pr(X_n ≤ x) − Pr(X_n ≤ x, Y_n > y) = Pr(X_n ≤ x) = F(x) = min{F(x), G(y)} = H+(x,y) for all n ≥ (G(y) − F(x))^{-1}. Likewise, for each (x,y) in the region G(y) < F(x), we have H_n(x,y) = G(y) = min{F(x), G(y)} = H+(x,y) for all large n. Finally, for each (x,y) with G(y) = F(x), both H_n(x,y) and H+(x,y) lie in the same interval [(i−1)/n, i/n], where i = ⌈n F(x)⌉, so that |H_n(x,y) − H+(x,y)| ≤ 1/n for all n. All together, we see that in all three cases, H_n(x,y) → H+(x,y) as n → ∞. This proves part (i). Part (ii) follows from the next lemma, which can be proved by using Hölder’s inequality (see, e.g., the proof of Theorem 4 in Dou et al. 2013). □
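For uniform marginals the case analysis is easy to see concretely: H_n(x,y) is a sum of rectangle masses over the diagonal blocks, and it agrees with H+(x,y) = min(x,y) exactly once n is large relative to 1/|F(x) − G(y)| (a sketch under the block form of (9)):

```python
# H_n(x, y) for uniform marginals: mass n on each diagonal block,
# so H_n(x, y) = n * sum_i overlap([0,x], I_i) * overlap([0,y], I_i),
# where I_i = ((i-1)/n, i/n].
def H_n(x, y, n):
    total = 0.0
    for i in range(1, n + 1):
        lo, hi = (i - 1) / n, i / n
        dx = max(0.0, min(x, hi) - lo)
        dy = max(0.0, min(y, hi) - lo)
        total += n * dx * dy
    return total

x, y = 0.3, 0.38
for n in (2, 5, 13):
    print(n, H_n(x, y, n))   # approaches H+(x, y) = min(x, y) = 0.3
```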
Lemma 3.
Let (X_n, Y_n) ∼ H_n be a sequence of bivariate random variables with X_n ∼ F and Y_n ∼ G for all n ≥ 1. Assume that (X_n, Y_n) converges in distribution to (X0, Y0) ∼ H0 as n tends to infinity. If, in addition, E[|X_n|^{p+q}] < ∞ and E[|Y_n|^{p+q}] < ∞ for some positive integers p and q, then

lim_{n→∞} E[X_n^p Y_n^q] = E[X0^p Y0^q].
Remarks 3.
We wondered whether the convergence in Theorem 3(ii) is monotone. Indeed, ρ(H_m) ≥ ρ(H_n) if m is a multiple of n (say, m = kn). To see this, write from (9)

E(X_m Y_m) = m Σ_{j=1}^m a_j b_j,  where a_j = ∫_{(j−1)/m}^{j/m} F^{-1}(u) du and b_j = ∫_{(j−1)/m}^{j/m} G^{-1}(u) du.

Grouping the m = kn subintervals into n consecutive blocks of k, we have, for each i = 1, …, n,

k Σ_{j=(i−1)k+1}^{ik} a_j b_j ≥ (Σ_{j=(i−1)k+1}^{ik} a_j)(Σ_{j=(i−1)k+1}^{ik} b_j)

by Chebyshev’s sum inequality (see, e.g., Dou et al. 2013, Lemma 1), because both a_j and b_j are increasing in j. Summing over i (and noting that the a_j within the i-th block sum to ∫_{(i−1)/n}^{i/n} F^{-1}(u) du, and similarly for the b_j) gives E(X_m Y_m) ≥ E(X_n Y_n), and hence ρ(H_m) ≥ ρ(H_n). That the sequence {ρ(H_n)} fails to be monotonically increasing in n can be seen from the following counterexample. Let f = g, with f(x) = 1/6 if 0 ≤ a ≤ |x| ≤ a + 3, and f(x) = 0 otherwise. We have ρ(H_3) < ρ(H_2) for all sufficiently large a.
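The counterexample can be explored numerically. In the sketch below (our computation, not the paper's), the quantile of the stated density is q(u) = 6u − a − 3 for u ≤ 1/2 and q(u) = 6u − 3 + a for u > 1/2; the inequality ρ(H_3) < ρ(H_2) turns out to hold for larger a (the crossover in this computation is near a ≈ 0.72) and to reverse for small a:

```python
# rho(H_n) for the symmetric two-interval density f = 1/6 on a <= |x| <= a+3.
# Quantile: q(u) = 6u - a - 3 for u <= 1/2, q(u) = 6u - 3 + a for u > 1/2.
def rho(n, a, samples=20000):
    def q(u):
        return 6.0 * u - a - 3.0 if u <= 0.5 else 6.0 * u - 3.0 + a
    block = []
    for i in range(n):
        # midpoint approximation of the integral of q over (i/n, (i+1)/n]
        s = sum(q((i + (j + 0.5) / samples) / n) for j in range(samples))
        block.append(s / (n * samples))
    exy = n * sum(b * b for b in block)   # E[X_n Y_n]
    ex2 = a * a + 3.0 * a + 3.0           # E[X^2]; the mean is 0 by symmetry
    return exy / ex2

print(rho(2, 2.0), rho(3, 2.0))   # rho(H_3) < rho(H_2) here
print(rho(2, 0.1), rho(3, 0.1))   # but rho(H_3) > rho(H_2) here
```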
For certain marginals, the rate of convergence in Theorem 3(ii) can be determined (Lin and Huang 2011, and Theorem 4 below). The calculation is based on the following lemma; note also that the correlation of two random variables is location-scale invariant.
Lemma 4.
For positive integer n > 1, we have, as n → ∞: (i) Σ_{k=1}^n k ln k = (n²/2 + n/2 + 1/12) ln n − n²/4 + R1(n), where R1(n) = O(1); (ii) Σ_{k=1}^n k²(ln k)² admits an analogous expansion with remainder R2(n) = O(1); (iii) Σ_{k=1}^n k(k+1)(ln k) ln(k+1) admits an analogous expansion with remainder R3(n) = O(1).
Remarks 4.
Recall that the Euler constant γ = lim_{n→∞} (Σ_{k=1}^n 1/k − ln n) ≈ 0.57722. Interestingly, each of the three remainder terms in Lemma 4 also converges to a real constant as n → ∞, say R_i = lim_{n→∞} R_i(n), i = 1, 2, 3. In fact, R1 = ln A ≈ 0.24875, where A ≈ 1.28243 is the so-called Glaisher–Kinkelin constant, and ln A can be represented as

ln A = 1/4 + (1/24) ∫_1^∞ f1^{(4)}(x) h(x) dx,

where f1(x) = x ln x and h(x) = B4 − B4(x − ⌊x⌋). Similarly, R2 and R3 admit analogous integral representations in terms of f2(x) = x²(ln x)² and f3(x) = x(x + 1)(ln x) ln(x + 1).
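The expansion in Lemma 4(i) and the limit R1 = ln A can be checked directly (a sketch based on the classical definition of the Glaisher–Kinkelin constant):

```python
import math

# R1(n) = sum_{k=1}^{n} k ln k - [(n^2/2 + n/2 + 1/12) ln n - n^2/4],
# which converges to ln A ~ 0.24875 (A ~ 1.28243, Glaisher-Kinkelin).
def R1(n):
    s = math.fsum(k * math.log(k) for k in range(1, n + 1))
    main = (n * n / 2.0 + n / 2.0 + 1.0 / 12.0) * math.log(n) - n * n / 4.0
    return s - main

for n in (10, 100, 10000):
    print(n, R1(n))   # tends to 0.24875...
```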
Theorem 4.
(i) If F is uniform and G is a power-function distribution, then the convergence rate of ρ(h_n) in (9) is 1/n² as n → ∞. (ii) If F = U(0,1) and G is exponential, then the convergence rate of ρ(h_n) is (ln n)/n² as n → ∞. (iii) If F = U(0,1) and G is logistic, then the convergence rate of ρ(h_n) is (ln n)/n² as n → ∞; more precisely, ρ(h_n) = 3/π − π^{-1}(ln n)/n² + O(n^{-2}) as n → ∞. (iv) If F = G is exponential, then ρ(h_n) = 1 + O(n^{-1}) as n → ∞.
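Part (ii) can be probed numerically. Under the block density (9) with F = U(0,1) and G = Exp(1), ρ(H+) = Corr(U, −ln(1 − U)) = √3/2, and the gap ρ(H+) − ρ(h_n) shrinks at rate (ln n)/n² (a sketch using exact block integrals):

```python
import math

# rho(h_n) for F = U(0, 1) and G = Exp(1):
# uniform block integrals a_i = (2i - 1) / (2 n^2), and exponential
# block integrals from A(u) = (1 - u) ln(1 - u) - (1 - u), A(1) = 0.
def rho_unif_exp(n):
    def A(u):
        return 0.0 if u >= 1.0 else (1.0 - u) * math.log(1.0 - u) - (1.0 - u)
    exy = n * sum((2 * i - 1) / (2.0 * n * n) * (A(i / n) - A((i - 1) / n))
                  for i in range(1, n + 1))
    cov = exy - 0.5                       # E[X] = 1/2, E[Y] = 1
    return cov / math.sqrt(1.0 / 12.0)    # sd(X) = 1/sqrt(12), sd(Y) = 1

target = math.sqrt(3.0) / 2.0             # rho(H+) = sqrt(3)/2
for n in (10, 100, 400):
    print(n, target - rho_unif_exp(n))    # gap shrinks like (ln n) / n^2
```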
It is interesting to characterize the FGM and Sarmanov–Lee distributions by minimizing the χ² divergence. Let h be the joint density of X and Y with marginal densities f = F′ and g = G′. Define the χ² divergence (distance) between the joint density h and the product density fg (of independent random variables) by

χ²(h, fg) = ∫_{S_F} ∫_{S_G} [h(x,y) − f(x) g(y)]² / [f(x) g(y)] dy dx,   (10)

where S_F and S_G are the supports of F and G, respectively.
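For intuition about (10): for the FGM family with uniform marginals, h − fg = λ(1 − 2x)(1 − 2y), so the divergence factorizes and equals λ²(1/3)² = λ²/9 (a sketch, not a computation from the paper):

```python
# chi^2 divergence between the FGM density with uniform marginals and the
# independence density: (h - fg)^2 / (fg) = lam^2 (1 - 2x)^2 (1 - 2y)^2,
# which factorizes; each factor integrates to 1/3, so the divergence is lam^2/9.
def chi2_fgm(lam, m=2000):
    s = sum((1.0 - 2.0 * (i + 0.5) / m) ** 2 for i in range(m)) / m
    return lam * lam * s * s

print(chi2_fgm(1.0))   # close to 1/9
```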
Nelsen (1994) obtained a characterization of the FGM distributions by minimizing the χ² divergence (10). Huang and Lin (2011) extended Nelsen’s (1994) result to the case of Sarmanov–Lee distributions. For i = 1, 2, consider the functions θ_i satisfying
(11)
where h_i ∈ (0,1]. Then we have the following.
Theorem 5.
Among all absolutely continuous bivariate distributions with marginal densities f = F′ and g = G′, the one whose joint density is closest to the product density of independent random variables (in the sense of minimizing the χ² divergence), subject to the constraints given in (11), is the Sarmanov–Lee distribution with joint density of the form (5).
The Sarmanov–Lee distribution and its generalization have been used in actuarial science, financial markets, electrical engineering and quantum statistical mechanics (see the references in Lin and Huang 2011). For example, Hernández-Bastida and Fernández-Sánchez (2012) applied a Sarmanov–Lee family to the Bayes premium in a collective risk model. In the analysis of longitudinal data, Cole et al. (1995) used a Sarmanov–Lee bivariate distribution for transition probabilities in a two-state Markov model and developed an empirical Bayes estimation methodology. Recently, Pelican and Vernic (2013a; 2013b) studied the parameter estimation problems for the Sarmanov–Lee distribution.