The beta Marshall-Olkin family of distributions

Alizadeh, Morad; Cordeiro, Gauss M.; Brito, Edleide de; B. Demétrio, Clarice Garcia

doi:10.1186/s40488-015-0027-7

Methodology
Open access
Published: 01 July 2015

Journal of Statistical Distributions and Applications volume 2, Article number: 4 (2015)

Abstract

We study general mathematical properties of a new generator of continuous distributions with three extra shape parameters called the beta Marshall-Olkin family. We present some special models and investigate the asymptotes and shapes. The new density function can be expressed as a mixture of exponentiated densities based on the same baseline distribution. We derive a power series for its quantile function. Explicit expressions for the ordinary and incomplete moments, quantile and generating functions, Bonferroni and Lorenz curves, Shannon and Rényi entropies and order statistics, which hold for any baseline model, are determined. We discuss the estimation of the model parameters by maximum likelihood and illustrate the flexibility of the family by means of two applications to real data.

PACS 02.50.Ng, 02.50.Cw, 02.50.-r

Mathematics Subject Classification (2010) 62E10, 60E05, 62P99

Introduction

Recently, several attempts have been made to define new families that extend well-known distributions and at the same time provide great flexibility in modelling data in practice. Accordingly, several classes that generate new distributions by adding one or more parameters have been proposed in the statistical literature. Some well-known generators are: the Marshall-Olkin generated (MO-G) family by Marshall and Olkin (1997), the beta-G by Eugene et al. (2002), the Kumaraswamy-G (Kw-G for short) by Cordeiro and Castro (2011), the McDonald-G (Mc-G) by Alexander et al. (2012), the gamma-G by Zografos and Balakrishnan (2009), the transformed-transformer (T-X) by Alzaatreh et al. (2013), the Weibull-G by Bourguignon et al. (2014) and the exponentiated half-logistic by Cordeiro et al. (2014).

Let r(t) be the probability density function (pdf) of a random variable T∈[d,e] for −∞≤d<e<∞ and let W[G(x)] be a function of the cumulative distribution function (cdf) of a random variable X such that W[G(x)] satisfies the following conditions:

$$ \left\{ \begin{array}{ll} (i) &W\left[G(x)\right]\in [d,e],\\ (ii) & W\left[G(x)\right] \textrm{is differentiable and monotonically non-decreasing, and}\\ (iii)& W\left[G(x)\right] \rightarrow d\,\,\, \text{as}\,\,\, x \rightarrow -\infty \,\text{and}\, W\left[G(x)\right] \rightarrow e \,\,\,\text{as}\,\,\, x \rightarrow \infty.\\ \end{array} \right. $$

((1))

Alzaatreh et al. (2013) defined the T-X family cdf by

$$ F(x)=\int_{d}^{W[G(x)]}\, r(t)\,dt, $$

((2))

where W[G(x)] satisfies the conditions (1). The pdf corresponding to (2) is given by

$$ f(x)=\left\{\frac{d}{dx}\, W[G(x)]\right\} \, r\left\{\,W[G(x)]\right\}. $$

In this paper, we propose a new wider class of continuous distributions called the beta Marshall-Olkin (BMO) family by taking $W[G(x)]=\frac {G(x;\boldsymbol {\xi })}{c+(1-c)G(x;\boldsymbol {\xi })}$ and $r(t)=\frac {1}{B(a,b)}t^{a-1}(1-t)^{b-1},\,\,\,0<t<1,a,b,c>0$. Its cdf is given by

$$\begin{array}{@{}rcl@{}} F(x;a,b,c,\boldsymbol{\xi})=I_{\frac{G(x;\boldsymbol{\xi})}{c+(1-c)G(x;\boldsymbol{\xi})}}(a,b), \end{array} $$

((3))

where $I_{x}(a,b)=B(a,b)^{-1}{\int _{0}^{x}}t^{a-1}\,(1-t)^{b-1}dt$ denotes the incomplete beta function ratio, G(x;ξ) is the baseline cdf depending on a parameter vector ξ and a>0, b>0 and c>0 are three additional shape parameters. For each baseline G, the BMO-G distribution is defined by the cdf (3). Equation (3) includes as special cases the beta-G, Marshall-Olkin-G (MOG), exponentiated Marshall-Olkin-G (EMOG) and exponentiated classes listed in Table 1.

Table 1 Some special models

This paper is organized as follows. In Section 2, we provide a physical interpretation of the BMO-G family. Three special cases of this family are defined in Section 3. In Section 4, the shape of the density and hazard rate functions are described analytically. Some useful expansions are derived in Section 5. In Section 6, we obtain a power series for the BMO-G quantile function (qf). In Section 7, we propose explicit expressions for the ordinary and incomplete moments using the qf expansion. The generating function and mean deviations are derived in Sections 8 and 9, respectively. General expressions for the Rényi and Shannon entropies are presented in Section 10. The order statistics are investigated in Section 11. Estimation of the model parameters by maximum likelihood is performed in Section 12. Applications to two real data sets illustrate the performance of the new family in Section 13. The paper is concluded in Section 14.

The new density

The density function corresponding to (3) is given by

$$ f(x;a,b,c,\boldsymbol{\xi})=\frac{c^{b}g(x;\boldsymbol{\xi})G(x;\boldsymbol{\xi})^{a-1}\overline{G}(x;\boldsymbol{\xi})^{b-1}} {B(a,b)\left[c+(1-c)G(x;\boldsymbol{\xi})\right]^{a+b}}, $$

((4))

where g(x;ξ) is the baseline pdf. Equation (4) will be most tractable when G(x;ξ) and g(x;ξ) have simple analytic expressions. Hereafter, a random variable X with density function (4) is denoted by X∼BMO-G(a,b,c,ξ). Further, we sometimes omit the dependence on the parameter vector ξ and simply write G(x)=G(x;ξ).
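The step from the T-X form of the pdf to (4) can be made explicit. The following short derivation uses only the definitions of W[G(x)] and r(t) given in the Introduction; the shorthand $D(x)=c+(1-c)G(x)$ is introduced here purely for convenience and is not part of the original notation.

$$\begin{aligned} W&=\frac{G(x)}{D(x)},\qquad 1-W=\frac{c\,\overline{G}(x)}{D(x)},\qquad \frac{dW}{dx}=\frac{g(x)\,D(x)-(1-c)\,g(x)\,G(x)}{D(x)^{2}}=\frac{c\,g(x)}{D(x)^{2}},\\ f(x)&=\frac{dW}{dx}\,r(W)=\frac{c\,g(x)}{D(x)^{2}}\cdot\frac{1}{B(a,b)}\left[\frac{G(x)}{D(x)}\right]^{a-1}\left[\frac{c\,\overline{G}(x)}{D(x)}\right]^{b-1}=\frac{c^{b}\,g(x)\,G(x)^{a-1}\,\overline{G}(x)^{b-1}}{B(a,b)\,D(x)^{a+b}}. \end{aligned} $$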

The hazard rate function (hrf) of X becomes

$$ \tau(x;a,b,c,\boldsymbol{\xi})=\frac{c^{b}g(x;\boldsymbol{\xi})G(x;\boldsymbol{\xi})^{a-1}\bar{G}(x;\boldsymbol{\xi})^{b-1}}{B(a,b)\left[c+(1-c)G(x;\boldsymbol{\xi})\right]^{a+b}\left[1-I_{\frac{G(x;\boldsymbol{\xi})}{c+(1-c)G(x;\boldsymbol{\xi})}}(a,b)\right]}. $$

((5))

The BMO family is easily simulated by inverting (3) as follows: if V has a beta distribution with positive parameters a and b, the solution of the nonlinear equation

$$ x_{q}=G^{-1}\left\{\frac{c\,V}{1-(1-c)V};\boldsymbol{\xi} \right\} $$

has the density function (4).
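The inversion step above can be sketched in a few lines of Python. This is a minimal illustration rather than a reference implementation: it assumes a Weibull baseline with cdf G(x)=1−exp[−(αx)^β] (as in Section 3.2), and the function names `weibull_quantile`, `bmo_transform` and `sample_bmo_weibull` are hypothetical names chosen for this example. Only the Python standard library is used.

```python
import math
import random


def weibull_quantile(u, alpha, beta):
    """Baseline Weibull qf for G(x) = 1 - exp[-(alpha*x)**beta]."""
    return (-math.log1p(-u)) ** (1.0 / beta) / alpha


def bmo_transform(v, c):
    """Map a beta variate v in (0, 1) to the baseline probability c*v / (1 - (1-c)*v)."""
    return c * v / (1.0 - (1.0 - c) * v)


def sample_bmo_weibull(n, a, b, c, alpha, beta, seed=None):
    """Draw n BMO-W variates: V ~ beta(a, b), then x = G^{-1}(c*V / (1 - (1-c)*V))."""
    rng = random.Random(seed)
    return [
        weibull_quantile(bmo_transform(rng.betavariate(a, b), c), alpha, beta)
        for _ in range(n)
    ]


if __name__ == "__main__":
    draws = sample_bmo_weibull(5, a=2.0, b=1.5, c=0.5, alpha=1.0, beta=2.0, seed=42)
    print(draws)
```

Because the Weibull qf is available in closed form, no nonlinear solver is needed here; for a baseline without an explicit qf, the last step would have to be replaced by a numerical root search.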

The basic motivations for using the BMO family in practice are: (i) to make the kurtosis more flexible compared to the baseline model; (ii) to produce a skewness for symmetrical distributions; (iii) to construct heavy-tailed distributions that are not longer-tailed for modeling real data; (iv) to generate distributions with symmetric, left-skewed, right-skewed and reversed-J shapes; (v) to define special models with all types of the hrf; (vi) to generate a large number of special distributions such as those presented in Table 1; and (vii) to provide consistently better fits than other generated models under the same baseline distribution. A simple example of (ii): the normal distribution is symmetric, but the beta Marshall-Olkin normal (BMO-N) becomes skewed. The fact (vii) is well-demonstrated by fitting the BMO-N and beta Marshall-Olkin Weibull (BMO-W) distributions to two real data sets in Section 13. However, we expect that there are other contexts in which the BMO special models can produce worse fits than other generated distributions. Clearly, the results in Section 13 indicate that the new family is a very competitive class to other known generators with at most three extra shape parameters.

Some special BMO distributions

The BMO-G density function (4) allows for greater flexibility of its tails and can be widely applied in many areas. The new family extends several widely-known distributions in the literature. Here, we present a few of its many special models.

3.1 The BMO-N distribution

The BMO-N pdf is obtained from (4) by taking the normal N(μ,σ²) as the parent distribution, where ξ=(μ,σ). Since the baseline pdf is $g(x;\boldsymbol{\xi})=\sigma^{-1}\phi\left(\frac{x-\mu}{\sigma}\right)$, we obtain

$$\begin{array}{@{}rcl@{}} f(x;a, b, c, \mu, \sigma) =\frac{c^{b}\,\phi\left(\frac{x-\mu}{\sigma}\right)\,\left[\Phi\left(\frac{x-\mu}{\sigma}\right)\right]^{a-1}\, \left[1-\Phi\left(\frac{x-\mu}{\sigma}\right)\right]^{b-1}}{\sigma\,B(a,b)\left[c+(1-c)\Phi\left(\frac{x-\mu}{\sigma}\right)\right]^{a+b}}, \end{array} $$

((6))

where x∈ℝ, μ∈ℝ is a location parameter, σ>0 is a scale parameter, and ϕ(·) and Φ(·) are the pdf and cdf of the standard normal distribution, respectively. The standard BMO-N density is obtained when μ=0 and σ=1. For a=b=c=1, it reduces to the normal density. Plots of (6) for some parameter values are displayed in Fig. 1.

3.2 The BMO-W distribution

Let G(x;ξ)=1− exp[−(α x)^β] be the Weibull cdf with scale parameter α>0 and shape parameter β>0, where ξ=(α,β). The BMO-W pdf (for x>0) reduces to

$$\begin{array}{@{}rcl@{}} f(x;a, b, c, \alpha, \beta) =\frac{c^{b}\beta\,\alpha^{\beta}\,x^{\beta-1}\exp[-b(\alpha\,x)^{\beta}]\left[1-\exp[-(\alpha\,x)^{\beta}]\right]^{a-1}}{ B(a,b)\left[c+(1-c)[1-\exp[-(\alpha\,x)^{\beta}]]\right]^{a+b}}. \end{array} $$

The Weibull pdf (with parameters α and β) is a special case for a=b=c=1. Some possible shapes of the BMO-W pdf and hrf are displayed in Fig. 2.
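As a quick sanity check on the BMO-W density, the sketch below evaluates it on the log scale and verifies numerically that it integrates to one and that it collapses to the Weibull pdf when a=b=c=1. The function names and the plain midpoint rule are choices made for this illustration only; they assume the Weibull parameterization G(x)=1−exp[−(αx)^β] stated above.

```python
import math


def bmo_weibull_pdf(x, a, b, c, alpha, beta):
    """BMO-W density from (4) with G(x) = 1 - exp[-(alpha*x)**beta], built on the log scale."""
    if x <= 0.0:
        return 0.0
    t = (alpha * x) ** beta
    G = -math.expm1(-t)  # baseline cdf, computed stably for small t
    log_g = math.log(beta) + beta * math.log(alpha) + (beta - 1.0) * math.log(x) - t
    log_B = math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)
    log_f = (
        b * math.log(c) - log_B + log_g
        + (a - 1.0) * math.log(G)  # G(x)^(a-1)
        - (b - 1.0) * t            # survival^(b-1), since log survival = -t
        - (a + b) * math.log(c + (1.0 - c) * G)
    )
    return math.exp(log_f)


def midpoint_integral(f, lower, upper, n=20000):
    """Composite midpoint rule on [lower, upper] with n equal cells."""
    h = (upper - lower) / n
    return h * sum(f(lower + (k + 0.5) * h) for k in range(n))


if __name__ == "__main__":
    total = midpoint_integral(
        lambda x: bmo_weibull_pdf(x, 2.0, 1.5, 0.5, 1.0, 2.0), 0.0, 6.0
    )
    print(total)  # should be very close to 1
```

Working with logarithms avoids overflow in the factor B(a,b) and underflow in the survival term for large x.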

3.3 The Beta Marshall-Olkin gamma (BMO-Ga) distribution

The gamma cumulative distribution (for x>0) with shape parameter α>0 and scale parameter β>0, ξ=(α,β), is given by

$$\begin{array}{@{}rcl@{}} G(x; \boldsymbol{\xi})=\frac{\gamma(\alpha, x/\beta)}{\Gamma(\alpha)}=\gamma_{1}(\alpha, x/\beta), \end{array} $$

where $\Gamma (p)=\int _{0}^{\infty }w^{p-1}\,\mathrm {e}^{-w}dw$ is the gamma function and $\gamma (\alpha, z)={\int _{0}^{z}}w^{\alpha -1}\,\mathrm {e}^{-w}dw$ is the incomplete gamma function. The BMO-Ga pdf (for x>0) becomes

$$\begin{array}{@{}rcl@{}} f(x;a,b,c,\alpha,\beta) = \frac{c^{b}\,x^{\alpha-1}\,\mathrm{e}^{-x/\beta}\,\left[\gamma_{1}(\alpha, x/\beta)\right]^{a-1}\, \left[1-\gamma_{1}(\alpha, x/\beta)\right]^{b-1}}{\beta^{\alpha}\,\Gamma(\alpha)\,B(a, b)\,\left[c+(1-c)\,\gamma_{1}(\alpha, x / \beta)\right]^{a+b}}. \end{array} $$

For c=1, we obtain the beta gamma distribution. The beta Marshall-Olkin exponential (BMO-E) distribution corresponds to α=1. Figure 3 displays some BMO-Ga pdf’s and hrf’s.

Asymptotics and shapes

Corollary 1.

The asymptotics of Eqs. (3), (4) and (5) as G(x)→0 are given by

$$\begin{array}{@{}rcl@{}} &&F(x)\sim \frac{G(x)^{a}}{a\,c^{a}\,B(a,b)}\,\,\,\,\,\,\text{as}\,\,\,\,\,G(x)\rightarrow 0, \\ &&f(x)\sim \frac{g(x)\,G(x)^{a-1}}{c^{a}\,B(a,b)}\,\,\,\,\,\,\text{as}\,\,\,\,\, G(x)\rightarrow 0,\\ &&\tau(x)\sim \frac{g(x)\,G(x)^{a-1}}{c^{a}\,B(a,b)} \,\,\,\,\,\text{as}\,\,\,\,\,G(x)\rightarrow 0. \end{array} $$

Corollary 2.

The asymptotics of Eqs. (3), (4) and (5) as x→∞ are given by

$$\begin{array}{@{}rcl@{}} &&1-F(x)\sim \frac{c^{b}\,\bar{G}(x)^{b}}{b\,B(a,b)} \,\,\,\,\,\text{as}\,\,\, x\rightarrow \infty,\\ &&f(x)\sim \frac{c^{b}\,g(x)\,\bar{G}(x)^{b-1}}{B(a,b)}\,\,\,\,\,\text{as}\,\,\, x\rightarrow \infty,\\ &&\tau(x)\sim \frac{b\,g(x)}{\bar{G}(x)}\,\,\,\,\,\text{as}\,\,\, x\rightarrow \infty. \end{array} $$

The shapes of the density and hazard rate functions can be described analytically. The critical points of the BMO-G density function are the roots of the equation:

$$\begin{array}{*{20}l} \frac{d\log[f(x)]}{dx}=\frac{g'(x)}{g(x)}+\frac{(a-1)\,g(x)}{G(x)}+\frac{(1-b)g(x)}{\bar{G}(x)}+\frac{(c-1)(a+b)g(x)}{c+(1-c)G(x)}=0. \end{array} $$

((7))

There may be more than one root to (7). If $x=x_{0}$ is a root of (7), then it corresponds to a local maximum, a local minimum or a point of inflexion depending on whether $\lambda(x_{0})<0$, $\lambda(x_{0})>0$ or $\lambda(x_{0})=0$, where $\lambda (x)=\frac {d^{2}\log [f(x)] }{d x^{2}}$ is given by

$$\begin{aligned} \lambda(x)&=\frac{g^{\prime\prime}(x)g(x)-[g'(x)]^{2}}{g(x)^{2}}+(a-1)\frac{g'(x)G(x)-g(x)^{2}}{G(x)^{2}}+(1-b)\frac{g'(x)\bar{G}(x)+g(x)^{2}}{\bar{G}(x)^{2}}\\ &\quad+\frac{(c-1)(a+b)g'(x)}{c+(1-c)G(x)}+\frac{(c-1)^{2}(a+b)g(x)^{2}}{\left[c+(1-c)G(x)\right]^{2}}. \end{aligned} $$

The critical points of τ(x) are the roots of the equation:

$$\begin{array}{*{20}l} &\frac{d\log[\tau(x)] }{d x}=\frac{g'(x)}{g(x)}+\frac{(a-1)\,g(x)}{G(x)}+\frac{(1-b)g(x)}{\bar{G}(x)}+\frac{(c-1)(a+b)g(x)}{c+(1-c)G(x)}\\ &+\frac{c^{b}g(x)G(x)^{a-1}\bar{G}(x)^{b-1}}{B(a,b)\left[c+(1-c)G(x)\right]^{a+b}\left[1-I_{\frac{G(x)}{c+(1-c)G(x)}}(a,b)\right]}=0. \end{array} $$

((8))

There may be more than one root to (8). If $x=x_{0}$ is a root of (8), then it corresponds to a local maximum, a local minimum or a point of inflexion depending on whether $\varsigma(x_{0})<0$, $\varsigma(x_{0})>0$ or $\varsigma(x_{0})=0$, where $\varsigma (x)=\frac {d^{2}\log [\tau (x)] }{d x^{2}}$ is given by

$$\begin{array}{@{}rcl@{}} \begin{aligned} \varsigma(x)&=\frac{g^{\prime\prime}(x)g(x)-[g'(x)]^{2}}{g(x)^{2}}+(a-1)\frac{g'(x)G(x)-g(x)^{2}}{G(x)^{2}}+(1-b)\frac{g'(x)\bar{G}(x)+g(x)^{2}}{\bar{G}(x)^{2}}\\ &\quad+\frac{(c-1)(a+b)g'(x)}{c+(1-c)G(x)}+\frac{(c-1)^{2}(a+b)g(x)^{2}}{\left[c+(1-c)G(x)\right]^{2}}\\ &\quad+\frac{c^{b}g'(x)G(x)^{a-1}\bar{G}(x)^{b-1}}{B(a,b)\left[c+(1-c)G(x)\right]^{a+b}\left[1-I_{\frac{G(x)}{c+(1-c)G(x)}}(a,b)\right]}\\ &\quad+\frac{c^{b}\,(a-1)g(x)^{2}G(x)^{a-2}\bar{G}(x)^{b-1}}{B(a,b)\left[c+(1-c)G(x)\right]^{a+b}\left[1-I_{\frac{G(x)}{c+(1-c)G(x)}}(a,b)\right]}\\ &\quad-\frac{c^{b}\,(b-1)g(x)^{2}G(x)^{a-1}\bar{G}(x)^{b-2}}{B(a,b)\left[c+(1-c)G(x)\right]^{a+b}\left[1-I_{\frac{G(x)}{c+(1-c)G(x)}}(a,b)\right]}\\ &\quad+\frac{c^{b}\,(c-1)(a+b)g(x)^{2}G(x)^{a-1}\bar{G}(x)^{b-1}}{B(a,b)\left[c+(1-c)G(x)\right]^{a+b+1}\left[1-I_{\frac{G(x)}{c+(1-c)G(x)}}(a,b)\right]}\\ &\quad+\left\{\frac{c^{b}g(x)G(x)^{a-1}\bar{G}(x)^{b-1}}{B(a,b)\left[c+(1-c)G(x)\right]^{a+b}\left[1-I_{\frac{G(x)}{c+(1-c)G(x)}}(a,b)\right]}\right\}^{2}.\\ \end{aligned} \end{array} $$

Useful representation

By using the generalized binomial expansion, we can prove that the cdf (3) of X admits the expansion

$$\begin{array}{@{}rcl@{}} F(x;a,b,c,\boldsymbol{\xi})=\sum_{i,j,l=0}^{\infty}\sum_{k=0}^{l}\frac{(-1)^{i+l+k}(1-c)^{i}\left(\begin{array}{cc}{b-1}\\{i}\end{array}\right) \left(\begin{array}{cc}{-a-i}\\{j}\end{array}\right)\left(\begin{array}{cc}{a+i+j}\\{l}\end{array}\right)\left(\begin{array}{cc}{l}\\{k}\end{array}\right)}{c^{a+i+j}\,B(a,b)(a+i)} \,G(x)^{k}. \end{array} $$

By interchanging the order of summation over l and k, we can write

$$ F(x;a,b,c,\boldsymbol{\xi})=\sum_{i,j,k=0}^{\infty}\sum_{l=k}^{\infty}\frac{(-1)^{i+l+k}(1-c)^{i}\left(\begin{array}{cc}{b-1}\\{i}\end{array}\right)\left(\begin{array}{cc}{-a-i}\\{j}\end{array}\right)\left(\begin{array}{cc}{a+i+j}\\{l}\end{array}\right)\left(\begin{array}{cc}{l}\\{k}\end{array}\right)}{c^{a+i+j}\,B(a,b)(a+i)} \,G(x)^{k}, $$

and then

$$ F(x;a,b,c,\boldsymbol{\xi})=\sum_{k=0}^{\infty}\beta_{k}\,G(x)^{k}, $$

where (for k≥0)

$$ \beta_{k}=\sum_{i,j=0}^{\infty}\sum_{l=k}^{\infty}\frac{(-1)^{i+l+k}(1-c)^{i}\left(\begin{array}{cc}{b-1}\\{i}\end{array}\right)\left(\begin{array}{cc}{-a-i}\\{j}\end{array}\right)\left(\begin{array}{cc}{a+i+j}\\{l}\end{array}\right) \left(\begin{array}{cc}{l}\\{k}\end{array}\right)}{c^{a+i+j}\,B(a,b)(a+i)}. $$

((9))

The density function of X can be expressed as a mixture of exp-G densities

$$ f(x;a,b,c,\boldsymbol{\xi})=\sum_{k=0}^{\infty}{\beta_{k+1}\,h_{k+1}(x)}, $$

((10))

where $h_{k+1}(x)=(k+1)\,g(x;\boldsymbol{\xi})\,G(x;\boldsymbol{\xi})^{k}$ denotes the exp-G density function with power parameter k+1.

Thus, some mathematical properties of the new model can be derived from those exp-G properties. For example, the ordinary and incomplete moments and moment generating function (mgf) of X can be obtained from those quantities of the exp-G distribution.

The formulae derived throughout the paper can be easily handled in most symbolic computation platforms such as Maple, Mathematica and Matlab. These platforms can deal with analytic expressions of formidable size and complexity. Established explicit expressions to calculate statistical measures can be more efficient than computing them directly by numerical integration. For most practical purposes, the infinite upper limit in these sums can be replaced by a large positive integer such as 20 or 30.

Quantile power series

The qf of X, say $x=Q(u)=F^{-1}(u)$, can be obtained by inverting (3). Let $z=Q_{a,b}(u)$ be the beta qf. Then,

$$\begin{array}{@{}rcl@{}} x=Q(u)=Q_{G}\left\{\frac{c\,\,Q_{a,b}(u)}{1-(1-c)Q_{a,b}(u)}\right\}. \end{array} $$

((11))

Some expansions for $Q_{a,b}(u)$ are available on the Wolfram website¹, such as

$$\begin{array}{@{}rcl@{}} z=Q_{a,b}(u)=\sum_{i=0}^{\infty} e_{i}\,u^{i/a}, \end{array} $$

where $e_{i}=[a\,B(a,b)]^{i/a}\,d_{i}$ and $d_{0}=0$, $d_{1}=1$, $d_{2}=(b-1)/(a+1)$,

$$d_{3}=\frac{(b-1)\,(a^{2} +3 a b- a+5 b-4)}{2(a+1)^{2}(a+2)},$$

$$\begin{array}{@{}rcl@{}} d_{4}&=&(b-1)[a^{4}+(6b-1)a^{3} + (b+2)(8b-5)a^{2} +(33b^{2}-30b+4)a\\ &+&b(31b-47)+18]/[3(a+1)^{3}(a+2)(a+3)],\ldots \end{array} $$

The effects of the shape parameters a, b and c on the skewness and kurtosis of X can be based on quantile measures. The shortcomings of the classical kurtosis measure are well-known. The Bowley skewness (Kenney and Keeping 1962) is one of the earliest skewness measures defined by the average of the quartiles minus the median, divided by half the interquartile range, namely

$$B=\frac{Q\left(\frac{3}{4}\right)+Q\left(\frac{1}{4}\right)-2Q\left(\frac{1}{2}\right)}{Q\left(\frac{3}{4}\right)-Q\left(\frac{1}{4}\right)}. $$

Since only the middle two quartiles are considered and the outer two quartiles are ignored, this adds robustness to the measure. The Moors kurtosis (Moors 1988) is based on octiles

$$M=\frac{Q\left(\frac{3}{8}\right)-Q\left(\frac{1}{8}\right)+Q\left(\frac{7}{8}\right)-Q\left(\frac{5}{8}\right)}{Q\left(\frac{6}{8}\right)-Q\left(\frac{2}{8}\right)}. $$

These measures are less sensitive to outliers and they exist even for distributions without moments.
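The two quantile-based measures are straightforward to compute once a qf is available. The sketch below is a hedged illustration: to stay within the Python standard library it uses the special case a=b=1, for which the beta qf is the identity and (11) reduces to the Marshall-Olkin Weibull qf. The helper names are hypothetical. For general a and b one would plug in a numerical beta quantile function instead (for example `scipy.stats.beta.ppf`, if SciPy is available).

```python
import math


def mow_quantile(u, c, alpha, beta):
    """qf (11) with a = b = 1 and Weibull baseline G(x) = 1 - exp[-(alpha*x)**beta]."""
    p = c * u / (1.0 - (1.0 - c) * u)
    return (-math.log1p(-p)) ** (1.0 / beta) / alpha


def bowley_skewness(Q):
    """Bowley skewness B computed from a quantile function Q."""
    return (Q(0.75) + Q(0.25) - 2.0 * Q(0.5)) / (Q(0.75) - Q(0.25))


def moors_kurtosis(Q):
    """Moors kurtosis M computed from the octiles of a quantile function Q."""
    return (Q(3 / 8) - Q(1 / 8) + Q(7 / 8) - Q(5 / 8)) / (Q(6 / 8) - Q(2 / 8))


if __name__ == "__main__":
    for c in (0.2, 1.0, 5.0):
        Q = lambda u, c=c: mow_quantile(u, c, alpha=1.0, beta=2.0)
        print(c, bowley_skewness(Q), moors_kurtosis(Q))
```

Passing the qf as a callable keeps the two measures independent of the particular BMO-G member being studied.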

Moments

We assume that Y is a random variable having the baseline cdf G(x). The moments of X can be determined from the (r,k)th probability weighted moment (PWM) of Y defined by

$$\begin{array}{@{}rcl@{}} \omega_{r,k}=\mathrm{E}[Y^{r}\,G(Y)^{k}]=\int_{-\infty}^{\infty} x^{r}\,G(x)^{k}\,g(x)dx. \end{array} $$

The PWMs are used to derive estimators of the parameters and quantiles of generalized distributions. The moment method of estimation is formulated by equating the population and sample PWMs. These moments have low variance and no severe biases, and they compare favorably with estimators obtained by maximum likelihood. However, the maximum likelihood method is adopted in Section 12, since the BMO-G parameters are easier to estimate with the computer routines available in widely known software. The maximum likelihood estimators (MLEs) enjoy desirable properties and can be used for constructing confidence intervals and also for test statistics.

We can write from Eq. 10

$$\begin{array}{*{20}l} \mu_{r}^{\prime}=\mathrm{E}(X^{r})=\sum_{k=0}^{\infty} (k+1)\,\beta_{k+1}\,\omega_{r,k}, \end{array} $$

((12))

where $\omega_{r,k}=\int_{0}^{1} Q_{G}(u)^{r}\,u^{k}\,du$ can be computed at least numerically from any baseline qf.

Thus, the moments of any BMO-G distribution can be expressed as an infinite weighted sum of the baseline PWMs. We now provide the PWMs for three distributions discussed in Section 3. For the BMO-N and BMO-Ga distributions introduced in Sections 3.1 and 3.3, the quantities $\omega_{r,k}$ can be expressed in terms of the Lauricella functions of type A (see Exton 1978; Trott 2006) defined by

$$\begin{array}{*{20}l} &F_{A}^{(n)}(a;b_{1},\ldots,b_{n};c_{1},\ldots,c_{n};x_{1},\ldots,x_{n}) =\\ &\sum_{m_{1}=0}^{\infty}\ldots\sum_{m_{n}=0}^{\infty} \frac{(a)_{m_{1}+\ldots +m_{n}}(b_{1})_{m_{1}}\ldots (b_{n})_{m_{n}}}{(c_{1})_{m_{1}}\ldots (c_{n})_{m_{n}}}\frac{x_{1}^{m_{1}}\ldots x_{n}^{m_{n}}}{m_{1}!\ldots m_{n}!}, \end{array} $$

where (a)_i=a(a+1)…(a+i−1) is the ascending factorial (with the convention that (a)₀=1).

In fact, Cordeiro and Nadarajah (2011) determined $\omega_{r,k}$ for the standard normal distribution as

$$\begin{array}{*{20}l} \omega_{r,k}&=2^{r/2}\,\pi^{-(k+1/2)}\,\sum_{\substack{l=0\\(r+k-l)\,\text{even}}}^{k}\,\binom{k}{l}\, 2^{-l}\,\pi^{l}\,\Gamma\left(\frac{r+k-l+1}{2}\right)\\ &\quad \times F_{A}^{(k-l)}\left(\frac{r+k-l+1}{2};\frac{1}{2},\ldots,\frac{1}{2};\frac{3}{2},\ldots,\frac{3}{2};-1,\ldots,-1\right). \end{array} $$

This equation holds when r+k−l is even and it vanishes when r+k−l is odd. So, any BMO-N moment can be expressed as an infinite weighted linear combination of Lauricella functions of type A.

For the gamma distribution, the quantities $\omega_{r,k}$ can be expressed from Eq. (9) of Cordeiro and Nadarajah (2011) as

$$\begin{array}{*{20}l} \omega_{r,k} = \frac{\Gamma(r+(k+1)\alpha)}{\alpha^{k}\,\beta^{r}\,\Gamma(\alpha)^{k+1}}\,\, F_{A}^{(k)}(r+(k+1)\alpha;\alpha,\ldots,\alpha;\alpha+1,\ldots,\alpha+1;-1,\ldots,-1). \end{array} $$

As the last example, for the BMO-W distribution discussed in Section 3.2, with baseline cdf G(x)=1−exp[−(αx)^β], the quantities $\omega_{r,k}$ reduce to

$$\begin{array}{@{}rcl@{}} \omega_{r,k}=\frac{\Gamma(r/\beta+1)}{\alpha^{r}}\,\,\sum_{s=0}^{k}\frac{(-1)^{s}}{(s+1)^{r/\beta+1}}\,\left(\begin{array}{cc}{k}\\{s}\end{array}\right). \end{array} $$
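The Weibull PWM can be cross-checked against the integral form $\omega_{r,k}=\int_{0}^{1}Q_{G}(u)^{r}u^{k}du$ given after (12). The sketch below is an illustration under the Section 3.2 parameterization G(x)=1−exp[−(αx)^β], for which the substitution t=(αx)^β makes the scale enter as α^{−r}; the function names and the midpoint rule are choices made for this example only.

```python
import math


def weibull_pwm_closed(r, k, alpha, beta):
    """Closed-form PWM E[Y^r G(Y)^k] for G(x) = 1 - exp[-(alpha*x)**beta]."""
    s_sum = sum(
        (-1) ** s * math.comb(k, s) / (s + 1) ** (r / beta + 1.0)
        for s in range(k + 1)
    )
    return math.gamma(r / beta + 1.0) / alpha ** r * s_sum


def weibull_pwm_numeric(r, k, alpha, beta, n=200000):
    """Midpoint-rule approximation of the integral of Q_G(u)^r * u^k over (0, 1)."""
    h = 1.0 / n
    total = 0.0
    for m in range(n):
        u = (m + 0.5) * h
        q = (-math.log1p(-u)) ** (1.0 / beta) / alpha  # Weibull qf
        total += q ** r * u ** k
    return total * h


if __name__ == "__main__":
    print(weibull_pwm_closed(2, 3, 2.0, 1.5))
    print(weibull_pwm_numeric(2, 3, 2.0, 1.5))
```

The integrand has a mild logarithmic singularity at u=1, so the midpoint rule converges slowly there; the two values should nevertheless agree to several decimal places.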

Some important questions in economics are answered by knowing the mean and the shape of a distribution. Incomplete moments of an income distribution form natural building blocks for measuring inequality: for example, the Lorenz and Bonferroni curves depend upon the incomplete moments of the income distribution.

The nth incomplete moment of X is defined by $m_{n}(y)=\int _{-\infty }^{y}x^{n}\,f(x)dx$. So, from Eq. (10), $m_{n}(y)$ follows as

$$\begin{array}{@{}rcl@{}} m_{n}(y)= \sum_{k=0}^{\infty}\,(k+1)\,\beta_{k+1}\,\int_{0}^{G(y;\,\boldsymbol{\xi})}\,Q_{G}(u)^{n}\, u^{k}\,du. \end{array} $$

((13))

The integral in (13) can be computed at least numerically for most baseline distributions.

Generating function

We provide two formulae for the mgf $M(s)=\mathrm{E}(\mathrm{e}^{sX})$ of X. The first formula for M(s) comes from Eq. (10) as

$$\begin{array}{@{}rcl@{}} M(s)=\sum_{k=0}^{\infty} \beta_{k+1}\,M_{k+1}(s), \end{array} $$

((14))

where $M_{k+1}(s)$ is the exp-G generating function with power parameter k+1.

The second formula for M(s) follows in terms of the baseline qf as

$$\begin{array}{@{}rcl@{}} M(s)=\sum_{k=0}^{\infty} (k+1)\,\,\beta_{k+1}\,\rho_{k}(s), \end{array} $$

((15))

where the quantity $\rho _{k}(s)={\int _{0}^{1}}\exp \left [s\,Q_{G}(u)\right ] u^{k} d u$ can be computed numerically. Equations (14) and (15) are the main results of this section.

Mean deviations

The mean deviations about the mean ($\delta _{1}=\mathrm {E}(|X-\mu ^{\prime }_{1}|)$) and about the median ($\delta_{2}=\mathrm{E}(|X-M|)$) of X can be expressed as

$$\begin{array}{@{}rcl@{}} \delta_{1}=2 \mu^{\prime}_{1}\,F\left(\mu^{\prime}_{1}\right)-2 m_{1}\left(\mu^{\prime}_{1}\right) \qquad\,\text{and}\qquad\,\delta_{2}=\mu^{\prime}_{1}-2 m_{1}(M), \end{array} $$

((16))

respectively, where M=Q(0.5) is the median of X, $\mu ^{\prime }_{1}=\mathrm {E}(X)$ comes from Eq. (12), $F(\mu ^{\prime }_{1})$ is easily calculated from Eq. (3) and $m_{1}(z)=\int _{-\infty }^{z} x\,f(x) dx$ is the first incomplete moment.

Now, we provide two alternative ways to compute $\delta_{1}$ and $\delta_{2}$. A general equation for $m_{1}(z)$ can be derived from Eq. (10) as

$$\begin{array}{@{}rcl@{}} m_{1}(z)= \sum_{k=0}^{\infty} \beta_{k+1}\,J_{k+1}(z), \end{array} $$

((17))

where $J_{k+1}(z)=\int _{-\infty }^{z} x\,h_{k+1}(x)dx$.

Equation (17) is the basic quantity to compute the mean deviations in Eq. (16). A simple application of it refers to the BMO-W model (Section 3.2). The exponentiated Weibull density function (for x>0) with power parameter k+1, shape parameter α and scale parameter β (note that the roles of α and β are interchanged here relative to Section 3.2), is given by

$$\begin{array}{@{}rcl@{}} h_{k+1}(x)=(k+1)\,\alpha\,\beta^{\alpha}\,x^{\alpha-1}\,\exp\left\{-(\beta x)^{\alpha}\right\}\,\left[1-\exp\left\{-(\beta x)^{\alpha}\right\}\right]^{k}, \end{array} $$

and then

$$\begin{array}{@{}rcl@{}} {J_{k+1}(z)=\alpha\,(k+1)\,\beta^{\alpha}\,\,\sum_{r=0}^{\infty}(-1)^{r}\,{k \choose r}\, {\int_{0}^{z}} x^{\alpha}\,\exp\left\{-(r+1)(\beta x)^{\alpha}\right\} dx}. \end{array} $$

Using the incomplete gamma function, the last integral reduces to

$$\begin{array}{@{}rcl@{}} J_{k+1}(z)=\beta^{-1}\,\sum_{r=0}^{\infty} \frac{(-1)^{r}\,(k+1)\,{k \choose r}}{(r+1)^{1+\alpha^{-1}}}\, \gamma\left(1+\alpha^{-1},(r+1)(\beta z)^{\alpha}\right). \end{array} $$

A second general formula for $m_{1}(z)$ can be derived by setting u=G(x) in Eq. (17)

$$\begin{array}{@{}rcl@{}} m_{1}(z)=\sum_{k=0}^{\infty} (k+1)\,\beta_{k+1}\,\,T_{k}(z), \end{array} $$

where $T_{k}(z)=\int _{0}^{G(z)}Q_{G}(u)\,u^{k} du$.

The main application of the first incomplete moment refers to the Bonferroni and Lorenz curves that are very useful in economics, reliability, demography, insurance and medicine. For a given probability π, applications of these equations can be addressed to obtain these curves defined by $B(\pi)=m_{1}(q)/(\pi \,\mu ^{\prime }_{1})$ and $L(\pi)=m_{1}(q)/\mu ^{\prime }_{1}$, respectively, where q=Q(π) is calculated from the parent qf in (11). In Fig. 4, we plot the measures B and L of the BMO-N and BMO-W distributions. The plots indicate the variability of these measures on the shape parameters.

Entropies

An entropy is a measure of variation or uncertainty of a random variable X. Two popular entropy measures are the Rényi and Shannon entropies (Rényi 1961; Shannon 1951). The Rényi entropy of a random variable with pdf f(x) is defined by

$$\begin{array}{@{}rcl@{}} I_{R}(\gamma)=\frac{1}{1-\gamma}\log\left(\int_{0}^{\infty} f^{\gamma} (x) dx\right), \end{array} $$

for γ>0 and γ≠1. The Shannon entropy of a random variable X is defined by E{− log[f(X)]}. It is the special case of the Rényi entropy when γ ↑1. Direct calculation yields

$$\begin{array}{@{}rcl@{}} \text{E}\left\lbrace-\log\left[f (X)\right] \right\rbrace &=& -\log \left[\frac{c^{b}}{B(a,b)} \right]-\text{E} \left\lbrace \log\left[g(X; \boldsymbol{\xi})\right]\right\rbrace +(1-a)\text{E} \left\lbrace \log\left[G(X; \boldsymbol{\xi})\right]\right\rbrace \\ &+&(1-b)\text{E} \left\lbrace \log\left[\bar{G}(X; \boldsymbol{\xi})\right]\right\rbrace +(a+b) \text{E} \left\lbrace \log\left[c+(1-c)G(X; \boldsymbol{\xi}) \right]\right\rbrace. \end{array} $$

First, let

$$A(a_{1},a_{2},a_{3};c)={\int_{0}^{1}}\,\frac{u^{a_{1}}\,(1-u)^{a_{2}}}{[c+(1-c)u]^{a_{3}}}du. $$

Using the generalized binomial expansion, we obtain

$$\begin{array}{@{}rcl@{}} A(a_{1},a_{2},a_{3};c)=\sum_{i=0}^{\infty}\,(1-c)^{i}\,c^{-a_{3}-i}\,{-a_{3}\choose i}\,B(a_{1}+i+1,a_{2}+1). \end{array} $$
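The series for A(a₁,a₂,a₃;c) can be compared with direct numerical integration of its defining integral. The sketch below is illustrative only; the function names, the truncation at 200 terms and the midpoint rule are assumptions of this example. One point worth keeping in mind: the underlying binomial expansion of [1+((1−c)/c)u]^{−a₃} on u∈(0,1) relies on |1−c|/c<1, so the truncated series should only be trusted for c>1/2.

```python
import math


def beta_fn(p, q):
    """Complete beta function B(p, q) via log-gamma."""
    return math.exp(math.lgamma(p) + math.lgamma(q) - math.lgamma(p + q))


def A_series(a1, a2, a3, c, terms=200):
    """Truncated series for A(a1, a2, a3; c); intended for c > 1/2."""
    total = 0.0
    binom = 1.0  # generalized binomial coefficient C(-a3, 0)
    for i in range(terms):
        total += (1.0 - c) ** i * c ** (-a3 - i) * binom * beta_fn(a1 + i + 1.0, a2 + 1.0)
        binom *= (-a3 - i) / (i + 1.0)  # C(-a3, i+1) from C(-a3, i)
    return total


def A_numeric(a1, a2, a3, c, n=20000):
    """Midpoint-rule approximation of the defining integral over (0, 1)."""
    h = 1.0 / n
    total = 0.0
    for m in range(n):
        u = (m + 0.5) * h
        total += u ** a1 * (1.0 - u) ** a2 / (c + (1.0 - c) * u) ** a3
    return total * h


if __name__ == "__main__":
    print(A_series(1.5, 0.5, 3.0, 0.8), A_numeric(1.5, 0.5, 3.0, 0.8))
```

For c=1 only the first term of the series survives and A reduces to B(a₁+1,a₂+1), which gives a simple exact check.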

After some algebraic manipulations, we have the following proposition.

Proposition 1.

Let X be a random variable with pdf (4). Then,

$$\text{E}\left\{\log\left[ G(X;\boldsymbol{\xi})\right] \right\}=\frac{c^{b}}{B(a,b)}\,\,\frac{\partial}{\partial t}A(a+t-1,b-1,a+b;c)\bigg{|}_{t=0} $$

$$\text{E}\left\{\log\left[ \overline{G}(X;\boldsymbol{\xi})\right] \right\}=\frac{c^{b}}{B(a,b)}\,\,\frac{\partial}{\partial t}A(a-1,b+t-1,a+b;c)\bigg{|}_{t=0} $$

$$\text{E}\left\{\log\left[c+(1-c) G(X;\boldsymbol{\xi})\right] \right\}=\frac{c^{b}}{B(a,b)}\,\,\frac{\partial}{\partial t}A(a-1,b-1,a+b-t;c)\bigg{|}_{t=0}. $$

The simplest formula for the entropy of X is given by

$$\begin{array}{@{}rcl@{}} \text{E}\left\lbrace -\log[f(X)]\right\rbrace &=&-\log\left[\frac{c^{b}}{B(a,b)}\right]-\text{E}\left\lbrace \log[g(X; \boldsymbol{\xi})]\right\rbrace \\ &+&\frac{(1-a)\,c^{b}}{B(a,b)}\,\,\frac{\partial}{\partial t}A(a+t-1,b-1,a+b;c)\bigg{|}_{t=0}\\ &+&\frac{(1-b)\,c^{b}}{B(a,b)}\,\,\frac{\partial}{\partial t}A(a-1,b+t-1,a+b;c)\bigg{|}_{t=0}\\ &+&\frac{(a+b)\,c^{b}}{B(a,b)}\,\,\frac{\partial}{\partial t}A(a-1,b-1,a+b-t;c)\bigg{|}_{t=0}. \end{array} $$

After some algebra, we obtain an alternative expression for $I_{R}(\gamma)$

$$\begin{array}{*{20}l} I_{R}(\gamma)&={\frac{\gamma}{1-\gamma}\log\left[\frac{c^{b}}{B(a,b)}\right]+\frac{1}{1-\gamma}\log\left\{\sum_{i,j=0}^{\infty}w_{i,j}^{\star}\,I(\gamma,a,i+j)\right\}}, \end{array} $$

where

$$w_{i,j}^{\star}=(-1)^{j}\,(1-c)^{i}\,c^{-\gamma(a+b)-i}\,{-\gamma(a+b) \choose i}\,{\gamma(b-1) \choose j} $$

and $I(\gamma,a,i+j)=\int _{0}^{\infty }\,g(x)^{\gamma }\,G(x)^{\gamma (a-1)+i+j}\,dx$.

Order statistics

Order statistics make their appearance in many areas of statistical theory and practice. Suppose $X_{1},\ldots,X_{n}$ is a random sample from any BMO-G distribution. Let $X_{i:n}$ denote the ith order statistic. The pdf of $X_{i:n}$ can be expressed as

$$\begin{array}{@{}rcl@{}} f_{i:n}(x)=\,K\,f(x)\,F^{i-1}(x)\,\left\{1-F(x)\right\}\text{} ^{n-i}=K\, \sum_{j=0}^{n-i} (-1)^{j}\,{n - i \choose j}\,f(x)\,F(x)^{j+i-1}, \end{array} $$

where K=n!/[(i−1)! (n−i)!].

We can demonstrate that the density function of the ith order statistic of any BMO-G distribution can be expressed as

$$\begin{array}{@{}rcl@{}} f_{i:n}(x)=\sum_{r,k=0}^{\infty}\,m_{r,k}\,h_{r+k+1}(x), \end{array} $$

((18))

where $h_{r+k+1}(x)$ denotes the exp-G density function with parameter r+k+1,

$$\begin{array}{@{}rcl@{}} m_{r,k}=\frac{n!\,(r+1)\,\beta_{r+1}}{(i-1)!\,(r+k+1)}\sum_{j=0}^{n-i}\,\frac{(-1)^{j}\,f_{j+i-1,k}}{(n-i-j)!\,j!}, \end{array} $$

$\beta_{r}$ is given by (9) and the quantities $f_{j+i-1,k}$ can be determined by $f_{j+i-1,0}=\beta _{0}^{j+i-1}$ and recursively (for k≥1)

$$\begin{array}{@{}rcl@{}} f_{j+i-1,k}=\left(k\,\beta_{0} \right)^{-1}\sum_{m=1}^{k}[m\,(j+i)-k]\,\beta_{m}\,f_{j+i-1,k-m}. \end{array} $$
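The recursion above is the standard rule for raising a power series to a power: with p=j+i−1, the quantities f_{p,k} are the coefficients of (Σ_m β_m z^m)^p. The sketch below implements it for a finite coefficient list and is meant only as an illustration; the function name is hypothetical, and the recursion as written requires β₀≠0 because it divides by β₀.

```python
def power_series_coeffs(beta, p, kmax):
    """Coefficients c_0..c_kmax of (sum_m beta[m] * z**m) ** p via the recursion.

    c_0 = beta[0]**p and, for k >= 1,
    c_k = (k * beta[0])**(-1) * sum_{m=1..k} [m*(p+1) - k] * beta[m] * c_{k-m}.
    """
    if beta[0] == 0:
        raise ValueError("the recursion divides by beta[0], which must be nonzero")
    coeffs = [float(beta[0]) ** p]
    for k in range(1, kmax + 1):
        s = 0.0
        for m in range(1, k + 1):
            b_m = beta[m] if m < len(beta) else 0.0  # treat missing terms as zero
            s += (m * (p + 1) - k) * b_m * coeffs[k - m]
        coeffs.append(s / (k * beta[0]))
    return coeffs


if __name__ == "__main__":
    # (2 + z + 3 z^2)^3 expanded term by term
    print(power_series_coeffs([2.0, 1.0, 3.0], 3, 6))
```

Each coefficient costs O(k) operations, so the first K coefficients are obtained in O(K²) time without forming any intermediate products of series.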

We can obtain the ordinary and incomplete moments, generating function and mean deviations of the BMO-G order statistics from Eq. (18) and some properties of the exp-G model.

Estimation

Here, we determine the MLEs of the model parameters of the new family from complete samples only. Let $x_{1},\ldots,x_{n}$ be observed values from the BMO-G distribution with parameters a, b, c and ξ. Let Θ=(a,b,c,ξ)^⊤ be the (r+3)×1 parameter vector, where r is the dimension of ξ. The total log-likelihood function for Θ is given by

$$\begin{array}{@{}rcl@{}} \ell &=& \ell(\Theta) = {n\,b\,\log(c)}-n\,\log[B(a,b)]+\sum_{i=1}^{n}\log[g(x_{i},\boldsymbol{\xi})] \\ &+& (a-1)\sum_{i=1}^{n}\log\left\{G(x_{i},\boldsymbol{\xi})\right\}+(b-1) \sum_{i=1}^{n}\log[\bar{G}(x_{i},\boldsymbol{\xi})]\\ &-& (a+b)\sum_{i=1}^{n}\log\left[c+(1-c)G(x_{i},\boldsymbol{\xi})\right]. \end{array} $$

((19))

Numerical maximization of (19) can be performed by using the RS method (Rigby and Stasinopoulos 2005) which is available in the gamlss package (R Development Core Team 2013), SAS (Proc NLMixed) or the Ox program (sub-routine MaxBFGS) (see Doornik 2007) or by solving the nonlinear likelihood equations obtained by differentiating (19). Let $U_{n}(\Theta)=(\partial\ell_{n}/\partial a,\partial\ell_{n}/\partial b,\partial\ell_{n}/\partial c,\partial\ell_{n}/\partial\boldsymbol{\xi})^{\top}$ be the score function, whose components are

$$ \begin{aligned} U_{a}&=\frac{\partial \ell}{\partial a}= -n\psi(a)+n\psi(a+b)+\sum_{i=1}^{n}\log\left\{G(x_{i},\boldsymbol{\xi})\right\}-\sum_{i=1}^{n}\log\left[c+(1-c)G(x_{i},\boldsymbol{\xi})\right],\\ U_{b}&=\frac{\partial \ell}{\partial b} ={n\,\log(c)}-n\psi(b)+n\psi(a+b)+\sum_{i=1}^{n}\log\left\{\bar{G}(x_{i},\boldsymbol{\xi})\right\}\\&\quad-\sum_{i=1}^{n}\log\left[c+(1-c)G(x_{i},\boldsymbol{\xi})\right],\\ U_{c}&=\frac{\partial \ell}{\partial c}={\frac{n\,b}{c}}-(a+b)\sum_{i=1}^{n}\frac{\bar{G}(x_{i},\boldsymbol{\xi})}{c+(1-c)G(x_{i},\boldsymbol{\xi})} \end{aligned} $$

and

$$\begin{aligned} U_{\boldsymbol{\xi}}&=\frac{\partial \ell}{\partial \boldsymbol{\xi}}=\sum_{i=1}^{n} \frac{g^{(\xi)}(x_{i}; \boldsymbol{\xi})}{g(x_{i}; \boldsymbol{\xi})}+(a-1)\sum_{i=1}^{n} \frac{G^{(\xi)}(x_{i}; \boldsymbol{\xi})}{G(x_{i}; \boldsymbol{\xi})}+(1-b)\sum_{i=1}^{n} \frac{G^{(\xi)}(x_{i}; \boldsymbol{\xi})}{\bar{G}(x_{i}; \boldsymbol{\xi})}\\ &\quad+(c-1)(a+b)\sum_{i=1}^{n} \frac{G^{(\xi)}(x_{i}; \boldsymbol{\xi})}{c+(1-c)G(x_{i};\boldsymbol{\xi})}, \end{aligned} $$

where $h^{(\xi)}(\cdot)$ means the derivative of the function h with respect to ξ. Setting these equations to zero, $U_{a}=U_{b}=U_{c}=U_{\boldsymbol{\xi}}=0$, and solving them simultaneously yields the MLE $\widehat {\Theta }$ of Θ.
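The log-likelihood (19) and one of its score components can be coded directly. The sketch below is a hedged illustration for the Weibull baseline G(x)=1−exp[−(αx)^β] of Section 3.2; the function names are hypothetical, and only the score with respect to c is implemented because it needs nothing beyond the Python standard library (the scores for a and b additionally require the digamma function). A central finite difference of the log-likelihood provides a simple check of the analytic score.

```python
import math


def bmo_weibull_loglik(data, a, b, c, alpha, beta):
    """Log-likelihood (19) for the BMO-W model with G(x) = 1 - exp[-(alpha*x)**beta]."""
    n = len(data)
    log_B = math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)
    ll = n * b * math.log(c) - n * log_B
    for x in data:
        t = (alpha * x) ** beta
        G = -math.expm1(-t)  # baseline cdf at x
        log_g = math.log(beta) + beta * math.log(alpha) + (beta - 1.0) * math.log(x) - t
        ll += log_g + (a - 1.0) * math.log(G) - (b - 1.0) * t  # log survival = -t
        ll -= (a + b) * math.log(c + (1.0 - c) * G)
    return ll


def score_c(data, a, b, c, alpha, beta):
    """Analytic score U_c = n*b/c - (a+b) * sum of survival / [c + (1-c)*G]."""
    total = 0.0
    for x in data:
        t = (alpha * x) ** beta
        G = -math.expm1(-t)
        total += (1.0 - G) / (c + (1.0 - c) * G)
    return len(data) * b / c - (a + b) * total


if __name__ == "__main__":
    sample = [0.3, 0.8, 1.1, 1.9, 2.4]  # made-up values, for illustration only
    theta = dict(a=1.7, b=0.9, c=1.4, alpha=0.8, beta=1.6)
    h = 1e-6
    up = bmo_weibull_loglik(sample, 1.7, 0.9, 1.4 + h, 0.8, 1.6)
    down = bmo_weibull_loglik(sample, 1.7, 0.9, 1.4 - h, 0.8, 1.6)
    print((up - down) / (2 * h), score_c(sample, **theta))
```

In practice the negative of this log-likelihood would be handed to a general-purpose optimizer; comparing analytic and finite-difference scores beforehand is a cheap way to catch sign errors.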

For interval estimation of the model parameters, the observed information matrix J(Θ) is required. Its elements are $-U_{rs}$, where $U_{rs}=\partial^{2}\ell/\partial r\,\partial s$ (for r,s=a,b,c,ξ) can be computed numerically. Under standard regularity conditions (Cox and Hinkley 1979), we can approximate the distribution of $(\widehat {\Theta }-\Theta)$ by the multivariate normal $N_{r+3}(0,J(\Theta)^{-1})$ distribution, where r is the number of parameters of the baseline distribution.

We can compute the maximum values of the unrestricted and restricted log-likelihoods to construct likelihood ratio (LR) statistics for testing some sub-models of the BMO-G distribution. For example, we may use LR statistics to check if the fit using the BMO-W distribution is statistically “superior” to the fits using the BW, MOW, EW, EE and Weibull distributions for a given data set.
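The LR computation can be sketched as follows. The statistic is w = 2(ℓ_full − ℓ_restricted), compared with a chi-square distribution whose degrees of freedom equal the number of restrictions (for instance, 1 for BW, obtained with c=1, and 3 for the Weibull model, obtained with a=b=c=1). The chi-square tail probability uses the closed forms available for integer degrees of freedom. The function names and the two log-likelihood values are hypothetical and are not taken from the paper's tables.

```python
import math

def chi2_sf(w, df):
    """Upper-tail probability P(chi-square with integer df > w)."""
    if w <= 0.0:
        return 1.0
    x = 0.5 * w
    if df % 2 == 0:
        s, q = 1.0, math.exp(-x)                # tail for df = 2
    else:
        s, q = 0.5, math.erfc(math.sqrt(x))     # tail for df = 1
    # Recurrence Q(s + 1, x) = Q(s, x) + x^s e^{-x} / Gamma(s + 1).
    while s < 0.5 * df:
        q += math.exp(s * math.log(x) - x - math.lgamma(s + 1.0))
        s += 1.0
    return q

def lr_test(loglik_full, loglik_restricted, df):
    """Return the LR statistic and its asymptotic p-value."""
    w = 2.0 * (loglik_full - loglik_restricted)
    return w, chi2_sf(w, df)

# Hypothetical maximized log-likelihoods: full model versus a sub-model with c = 1.
w, p_value = lr_test(-128.0, -133.0, df=1)
print(w, p_value)
```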

Censoring is often encountered in lifetime data and reliability studies. Suppose that the lifetimes are independently distributed and independent of the censoring mechanism, and that censoring is random and noninformative. For right-censored lifetime data, we observe x_i=min(X_i,C_i) and δ_i=I(X_i≤C_i) for i=1,…,n, where X_i is the lifetime and C_i is the censoring time of the ith individual, so that δ_i=1 if x_i is an observed time to event and δ_i=0 if it is right censored. The censored likelihood L(Θ) for the model parameters is

$$ L(\Theta) \propto \prod_{i=1}^{n}\,[f(x_{i}; a,b,c, \boldsymbol{\xi})]^{\delta_{i}}\,\,[S(x_{i}; a,b,c, \boldsymbol{\xi})]^{1-\delta_{i}}, \tag{20} $$

where S(x;a,b,c,ξ)=1−F(x;a,b,c,ξ) is the survival function obtained from (3) and f(x;a,b,c,ξ) is given by (4). We maximize the likelihood (20) in the same way as described before.
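The censored log-likelihood can be sketched in plain Python as below. The sketch assumes that the cdf in (3) is the regularized incomplete beta function evaluated at G/[c+(1−c)G], which is the form consistent with the score components given earlier, so that S(x)=I_{cḠ/[c+(1−c)G]}(b,a). It also assumes the Weibull baseline G(x)=1−exp{−(αx)^β}. The incomplete beta function is evaluated with a standard continued fraction; all names and data values are hypothetical.

```python
import math

def _betacf(a, b, x, max_iter=300, eps=1e-14):
    """Continued fraction for the incomplete beta function (modified Lentz method)."""
    tiny = 1e-300
    c = 1.0
    d = 1.0 - (a + b) * x / (a + 1.0)
    d = 1.0 / (d if abs(d) > tiny else tiny)
    h = d
    for m in range(1, max_iter + 1):
        even = m * (b - m) * x / ((a + 2 * m - 1.0) * (a + 2 * m))
        odd = -(a + m) * (a + b + m) * x / ((a + 2 * m) * (a + 2 * m + 1.0))
        for num in (even, odd):
            d = 1.0 + num * d
            d = 1.0 / (d if abs(d) > tiny else tiny)
            c = 1.0 + num / c
            c = c if abs(c) > tiny else tiny
            delta = d * c
            h *= delta
        if abs(delta - 1.0) < eps:
            break
    return h

def reg_inc_beta(a, b, x):
    """Regularized incomplete beta function I_x(a, b)."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    front = math.exp(math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
                     + a * math.log(x) + b * math.log1p(-x))
    if x < (a + 1.0) / (a + b + 2.0):
        return front * _betacf(a, b, x) / a
    return 1.0 - front * _betacf(b, a, 1.0 - x) / b

def censored_loglik(a, b, c, alpha, beta, times, events):
    """Log of (20): density terms for events, survival terms for censored times."""
    log_beta_fn = math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)
    total = 0.0
    for x, delta in zip(times, events):
        z = (alpha * x) ** beta
        G = -math.expm1(-z)
        D = c + (1.0 - c) * G
        if delta == 1:
            log_g = math.log(beta) + beta * math.log(alpha) + (beta - 1.0) * math.log(x) - z
            total += (b * math.log(c) - log_beta_fn + log_g + (a - 1.0) * math.log(G)
                      - (b - 1.0) * z - (a + b) * math.log(D))
        else:
            # S(x) = 1 - I_u(a, b) = I_{1-u}(b, a), with 1 - u = c (1 - G) / D.
            total += math.log(reg_inc_beta(b, a, c * math.exp(-z) / D))
    return total

# Hypothetical lifetimes X_i and censoring times C_i.
lifetimes = [1.2, 0.7, 2.5, 3.1, 0.9, 1.8]
censoring = [2.0, 2.0, 2.0, 2.0, 0.5, 2.0]
times = [min(x, cns) for x, cns in zip(lifetimes, censoring)]
events = [1 if x <= cns else 0 for x, cns in zip(lifetimes, censoring)]
print(censored_loglik(1.5, 0.8, 2.0, 0.5, 1.7, times, events))
```

Working with I_{1−u}(b,a) rather than 1−I_u(a,b) avoids cancellation when the survival probability is small.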

Empirical illustration

We illustrate the flexibility of the BMO-W and BMO-N distributions by means of two real data sets. Similar investigations could be performed for other BMO distributions. We have chosen these distributions because of the popularity of their baseline distributions. The computations were performed in R version 3.0.0 (package bbmle). The log-likelihood was maximized by the BFGS method with analytical derivatives, and the algorithm converged for all fitted models.

13.1 Illustration 1: Failure time data

We next consider the data studied by Murthy et al. (2004), which represent failure times for a particular windshield device. The windshield on a large aircraft is a complex piece of equipment, composed basically of several layers of material, including a very strong outer skin with a heated layer just beneath it, all laminated under high temperature and pressure. Failures of these items are not structural failures. Instead, they typically involve damage or delamination of the nonstructural outer ply or failure of the heating system. These failures do not result in damage to the aircraft but do result in replacement of the windshield. We compare the fits of the BMO-W distribution, its special models (W, EW, BW, MOW and EMOW) and the following distributions: the Kumaraswamy Weibull (Kw-W) model with pdf given by

$$f_{\text{Kw-W}}(x)=a\,b\,\beta\,\alpha^{\beta}\,x^{\beta-1}\,\mathrm{e}^{-(\alpha\,x)^{\beta}}\,[1-\mathrm{e}^{-(\alpha\,x)^{\beta}}]^{a-1}\, \left\{1-\left[1-\mathrm{e}^{-(\alpha\,x)^{\beta}}\right]^{a}\right\}^{b-1}, $$

the McDonald Weibull (McW) model with pdf given by

$$\begin{aligned} f_{\text{McW}}(x) =&\,\frac{c\,\beta\,\alpha^{\beta}}{B(a,b)}\,x^{\beta-1}\,\exp\{-(\alpha\,x)^{\beta}\}\,\left[1-\exp\{-(\alpha\,x)^{\beta}\}\right]^{a\,c-1} \\&\times \left\{1-\left[1-\exp\{-(\alpha\,x)^{\beta}\}\right]^{c}\right\}^{b-1} \end{aligned} $$

and the Libby-Novick beta Weibull (LNB-W) model with pdf given by

$$ f_{\text{LNB-W}}(x) = \frac{K\,\beta\,\alpha^{\beta}\, x^{\beta-1}\,\exp\{-(\alpha\,x)^{\beta}\}\,\left[1-\exp\left\{-(\alpha\,x)^{\beta}\right\}\right]^{a-1}\,\exp\left\{-(b-1)\,(\alpha\,x)^{\beta}\right\}} {\left\{1-(1-c)\,\left[1-\exp\left\{-(\alpha\,x)^{\beta}\right\}\right]\right\}^{a+b}}, $$

where K=c^a/B(a,b), a>0, b>0, c>0, α>0, β>0 and x>0.
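For readers who want to evaluate these three comparison densities, a plain-Python sketch follows. It transcribes the Kw-W, McW and LNB-W formulas above and checks by a midpoint rule that each integrates to approximately one. The function names, the parameter values and the integration range are illustrative assumptions.

```python
import math

def weibull_terms(x, alpha, beta):
    """Return (g, G, z) for the Weibull baseline with G(x) = 1 - exp{-(alpha*x)^beta}."""
    z = (alpha * x) ** beta
    g = beta * alpha ** beta * x ** (beta - 1.0) * math.exp(-z)
    return g, -math.expm1(-z), z

def beta_fn(a, b):
    """Beta function B(a, b)."""
    return math.exp(math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b))

def kw_w_pdf(x, a, b, alpha, beta):
    g, G, _ = weibull_terms(x, alpha, beta)
    return a * b * g * G ** (a - 1.0) * (1.0 - G ** a) ** (b - 1.0)

def mc_w_pdf(x, a, b, c, alpha, beta):
    g, G, _ = weibull_terms(x, alpha, beta)
    return c * g * G ** (a * c - 1.0) * (1.0 - G ** c) ** (b - 1.0) / beta_fn(a, b)

def lnb_w_pdf(x, a, b, c, alpha, beta):
    g, G, z = weibull_terms(x, alpha, beta)
    K = c ** a / beta_fn(a, b)
    return (K * g * G ** (a - 1.0) * math.exp(-(b - 1.0) * z)
            / (1.0 - (1.0 - c) * G) ** (a + b))

def midpoint_integral(f, lower, upper, steps=20000):
    """Midpoint-rule approximation of the integral of f over [lower, upper]."""
    h = (upper - lower) / steps
    return h * sum(f(lower + (i + 0.5) * h) for i in range(steps))

# Illustrative parameter values (not estimates from the paper).
a, b, c, alpha, beta = 2.0, 3.0, 1.5, 1.0, 1.5
totals = {
    "Kw-W": midpoint_integral(lambda x: kw_w_pdf(x, a, b, alpha, beta), 0.0, 10.0),
    "McW": midpoint_integral(lambda x: mc_w_pdf(x, a, b, c, alpha, beta), 0.0, 10.0),
    "LNB-W": midpoint_integral(lambda x: lnb_w_pdf(x, a, b, c, alpha, beta), 0.0, 10.0),
}
print(totals)
```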

In Table 2, the MLEs and their standard errors (SEs) (in parentheses) of the parameters from nine fitted models and the Akaike Information Criterion (AIC), Consistent Akaike Information Criterion (CAIC) and Bayesian Information Criterion (BIC) values are presented. According to the lowest values of the AIC and CAIC statistics, the BMO-W model could be chosen as the best model among the nine fitted models. Formal tests for the extra shape parameters in the BMO-W distribution can be performed based on LR statistics. The results for comparing the models to the current data are given in Table 3. The rejection of the null models is significant for the five LR tests. So, we have evidence of the potential need for the three parameters of the BMO-W distribution for the current data.
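The information criteria can be computed from the maximized log-likelihood as in the short sketch below, using the usual definitions AIC = −2ℓ + 2k and BIC = −2ℓ + k log n, where k is the number of estimated parameters and n the sample size. CAIC is left out because its definition differs between authors and this section does not state which one is used. The numerical inputs are hypothetical and are not values from Table 2.

```python
import math

def aic(loglik, k):
    """Akaike Information Criterion: -2*loglik + 2*k."""
    return -2.0 * loglik + 2.0 * k

def bic(loglik, k, n):
    """Bayesian Information Criterion: -2*loglik + k*log(n)."""
    return -2.0 * loglik + k * math.log(n)

# Hypothetical fit: five parameters (a, b, c, alpha, beta) and n = 50 observations.
loglik_hat, k, n = -100.0, 5, 50
print(aic(loglik_hat, k), bic(loglik_hat, k, n))
```

Lower values indicate a better trade-off between fit and number of parameters, which is how the models in the table are ranked.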

Table 2 MLEs (SEs in parentheses) for some fitted models to the failure time data and the AIC, CAIC and BIC values


Table 3 LR tests


The plots of the fitted BMO-W pdf and of the four fitted pdfs discussed before are displayed in Fig. 5. They indicate that the BMO-W distribution provides a better fit to these data compared to the other models. So, this distribution can be considered a very competitive model to the LNB-W distribution.

13.2 Illustration 2: Plasma ferritin data

Here, we consider the data discussed by Weisberg (2014, Section 6.4) on 202 athletes, collected at the Australian Institute of Sport. The variable evaluated in this study is the plasma ferritin concentration. These data were analyzed recently by Cordeiro et al. (2014) using the Libby-Novick beta normal (LNB-N) distribution with density function given by

$$ f(x)=\frac{K\,\phi\left(\frac{x-\mu}{\sigma}\right)\,\left[\Phi\left(\frac{x-\mu}{\sigma}\right)\right]^{a-1}\, \left[1-\Phi\left(\frac{x-\mu}{\sigma}\right)\right]^{b-1}}{\sigma\,\left[1-(1-c)\Phi\left(\frac{x-\mu}{\sigma}\right)\right]^{a+b}}, \tag{21} $$

where x∈ℝ, μ∈ℝ is a location parameter, σ>0 is a scale parameter, a, b and c are positive shape parameters and ϕ(·) and Φ(·) are the pdf and cdf of the standard normal distribution, respectively.
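A small sketch of the density (21) in plain Python is given below, taking K=c^a/B(a,b) as in the LNB-W model of the first illustration (an assumption, since K is not restated for this density). It checks that the density reduces to the normal pdf when a=b=c=1 and that it integrates to approximately one for one illustrative parameter choice. Function names and parameter values are hypothetical.

```python
import math

def std_normal_pdf(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)

def lnb_n_pdf(x, mu, sigma, a, b, c):
    """LNB-N density as in (21), with K = c^a / B(a, b) assumed."""
    z = (x - mu) / sigma
    Phi = 0.5 * math.erfc(-z / math.sqrt(2.0))       # standard normal cdf
    upper = 0.5 * math.erfc(z / math.sqrt(2.0))      # 1 - Phi, computed without cancellation
    K = c ** a * math.exp(math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b))
    return (K * std_normal_pdf(z) * Phi ** (a - 1.0) * upper ** (b - 1.0)
            / (sigma * (1.0 - (1.0 - c) * Phi) ** (a + b)))

def midpoint_integral(f, lower, upper, steps=20000):
    """Midpoint-rule approximation of the integral of f over [lower, upper]."""
    h = (upper - lower) / steps
    return h * sum(f(lower + (i + 0.5) * h) for i in range(steps))

# Illustrative parameter values (not estimates from the paper).
mu, sigma, a, b, c = 50.0, 20.0, 2.0, 1.5, 0.7
total = midpoint_integral(lambda x: lnb_n_pdf(x, mu, sigma, a, b, c),
                          mu - 10.0 * sigma, mu + 10.0 * sigma)
print(total)
```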

In Table 4, the MLEs and their SEs (in parentheses) of the parameters from nine fitted models and the AIC, CAIC and BIC values are presented. According to the lowest values of these statistics, the BMO-N model could be chosen as the best model among the nine fitted models. Formal tests for the extra shape parameters in the BMO-N distribution can be performed based on LR statistics. The results for comparing the models to the current data are given in Table 5. The rejection of the null models is significant for the five LR tests. So, we have clear evidence of the need for the three parameters of the BMO-N distribution when modeling data of this type. The plots of the fitted BMO-N pdf and of the four fitted pdfs discussed before are displayed in Fig. 6. They indicate that the BMO-N distribution provides the best fit to these data compared to the other models. Finally, the proposed distribution can be considered a very competitive model to the LNB-N distribution.

Table 4 MLEs (SEs in parentheses) for some fitted models to the plasma ferritin data and the AIC, CAIC and BIC values


Table 5 LR tests


Concluding remarks

We define a new class of models, named the beta Marshall-Olkin-G (BMO-G) family of distributions, by adding three shape parameters, which generalizes some well-known distributions in the statistical literature such as the normal, Weibull and beta distributions. We provide a mathematical treatment of the proposed family including expansions for the density function, ordinary and incomplete moments and generating function. The BMO-G density function can be expressed as a mixture of exponentiated density functions. This property is important to obtain several other results. We derive a power series for the quantile function. Our formulas related to the BMO-G model are manageable, and with the use of modern computer resources with analytic and numerical capabilities, they may turn into adequate tools for applied statisticians. Some special models are explored. The estimation of the model parameters is carried out by the method of maximum likelihood. Finally, we fit some special models in the new family to two real data sets to demonstrate their potential.

Endnote

¹ http://functions.wolfram.com/06.23.06.0004.01

References

Alexander, C, Cordeiro, GM, Ortega, EMM, Sarabia, JM: Generalized beta-generated distributions. Comput. Stat. Data Anal. 56, 1880–1897 (2012).
Alzaatreh, A, Lee, C, Famoye, F: A new method for generating families of continuous distributions. METRON. 71, 63–79 (2013).
Bourguignon, M, Silva, RB, Cordeiro, GM: The Weibull-G family of probability distributions. J. Data Sci. 12, 53–68 (2014).
Cordeiro, GM, Alizadeh, M, Ortega, EMM: The exponentiated half-logistic family of distributions: Properties and applications. J. Probab. Stat. 2014, 1–21 (2014).
Cordeiro, GM, Castro, M: A new family of generalized distributions. J. Stat. Comput. Simul. 81, 883–898 (2011).
Cordeiro, GM, Nadarajah, S: Closed-form expressions for moments of a class of beta generalized distributions. Braz. J. Probab. Stat. 25, 14–33 (2011).
Cox, DR, Hinkley, DV: Theoretical statistics. Chapman and Hall, London (1979).
Doornik, J: Ox 5: An Object-Oriented Matrix Language. Timberlake Consultants Press, London (2007).
Eugene, N, Lee, C, Famoye, F: Beta-normal distribution and its applications. Commun. Stat. Theory Methods. 31, 497–512 (2002).
Exton, H: Handbook of Hypergeometric Integrals: Theory, Applications, Tables, Computer Programs. Halsted Press, New York (1978).
Gupta, RC, Gupta, PL, Gupta, RD: Modeling failure time data by Lehman alternatives. Commun. Stat. Theory Methods. 27, 887–904 (1998).
Gupta, RC, Gupta, RD: Proportional reversed hazard rate model and its applications. J. Stat. Planning Inference. 137, 3525–3536 (2007).
Jayakumar, K, Mathew, T: On a generalization to Marshall–Olkin scheme and its application to Burr type XII distribution. Stat. Papers. 49, 421–439 (2008).
Jones, MC: Families of distributions arising from distributions of order statistics. Test. 13, 1–43 (2004).
Kenney, JF, Keeping, ES: Mathematics of Statistics, Part 1. 3rd edition. Van Nostrand, New Jersey (1962).
Marshall, AW, Olkin, I: A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika. 84, 641–652 (1997).
Moors, JJA: A quantile alternative for kurtosis. The Statistician. 37, 25–32 (1988).
Murthy, DNP, Xie, M, Jiang, R: Weibull models. Wiley, New Jersey (2004).
R Development Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2013).
Rényi, A: On measures of entropy and information. In: Proceedings of the 4th Berkeley Symposium on Mathematical Statistics and Probability, volume I. University of California Press, Berkeley (1961).
Rigby, RA, Stasinopoulos, DM: Generalized additive models for location, scale and shape (with discussion). Appl. Stat. 54, 507–554 (2005).
Shannon, CE: Prediction and entropy of printed English. Bell Syst. Tech. J. 30, 50–64 (1951).
Trott, M: The Mathematica Guidebook for Symbolics. With 1 DVD-ROM (Windows, Macintosh and UNIX). Springer, New York (2006).
Weisberg, S: Applied linear regression. 3rd edition. Wiley, New York (2014).
Zografos, K, Balakrishnan, N: On families of beta- and generalized gamma-generated distributions and associated inference. Stat. Methodol. 6, 344–362 (2009).

Author information

Authors and Affiliations

Department of Statistics, Persian Gulf University of Bushehr, Bushehr, Iran
Morad Alizadeh
Departamento de Estatística, Universidade Federal de Pernambuco, Recife, Pernambuco, Brazil
Gauss M. Cordeiro
Departamento de Estatística, Universidade Federal da Bahia, Salvador, Bahia, Brazil
Edleide de Brito
Departamento de Ciências Exatas, Universidade de São Paulo, Piracicaba, São Paulo, Brazil
Clarice Garcia B. Demétrio

Corresponding author

Correspondence to Edleide de Brito.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

MA developed the mathematics of the paper and participated in drafting the manuscript. EB developed Sections 3 and 13, produced the article figures and participated in drafting the manuscript. GMC and CGBD were advisors and reviewed all the work from the initial idea, through preparation of the manuscript until the final version. All authors read and approved the final manuscript.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Alizadeh, M., Cordeiro, G.M., Brito, E.d. et al. The beta Marshall-Olkin family of distributions. J Stat Distrib App 2, 4 (2015). https://doi.org/10.1186/s40488-015-0027-7

Received: 06 January 2015
Accepted: 10 June 2015
Published: 01 July 2015
DOI: https://doi.org/10.1186/s40488-015-0027-7
