# The beta Marshall-Olkin family of distributions

*Journal of Statistical Distributions and Applications*
**volume 2**, Article number: 4 (2015)

## Abstract

We study general mathematical properties of a new generator of continuous distributions with three extra shape parameters called the *beta Marshall-Olkin* family. We present some special models and investigate the asymptotes and shapes. The new density function can be expressed as a mixture of exponentiated densities based on the same baseline distribution. We derive a power series for its quantile function. Explicit expressions for the ordinary and incomplete moments, quantile and generating functions, Bonferroni and Lorenz curves, Shannon and Rényi entropies and order statistics, which hold for any baseline model, are determined. We discuss the estimation of the model parameters by maximum likelihood and illustrate the flexibility of the family by means of two applications to real data.

**PACS** 02.50.Ng, 02.50.Cw, 02.50.-r

**Mathematics Subject Classification (2010)** 62E10, 60E05, 62P99

## Introduction

Recently, several attempts have been made to define new families that extend well-known distributions and at the same time provide greater flexibility in modelling data in practice. Accordingly, several classes that add one or more parameters to a baseline distribution have been proposed in the statistical literature. Some well-known generators are: the Marshall-Olkin generated (MO-G) by Marshall and Olkin (1997), the beta-G by Eugene et al. (2002), the Kumaraswamy-G (Kw-G for short) by Cordeiro and Castro (2011), the McDonald-G (Mc-G) by Alexander et al. (2012), the gamma-G by Zografos and Balakrishnan (2009), the transformer (T-X) by Alzaatreh et al. (2013), the Weibull-G by Bourguignon et al. (2014) and the exponentiated half-logistic by Cordeiro et al. (2014).

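Let *r*(*t*) be the probability density function (pdf) of a random variable *T*∈[*d*,*e*] for −*∞*≤*d*<*e*<*∞* and let *W*[*G*(*x*)] be a function of the cumulative distribution function (cdf) of a random variable *X* such that *W*[*G*(*x*)] satisfies the following conditions:

\[
\begin{cases}
\text{(i)} & W[G(x)]\in [d,e],\\
\text{(ii)} & W[G(x)] \text{ is differentiable and monotonically non-decreasing},\\
\text{(iii)} & W[G(x)]\rightarrow d \text{ as } x\rightarrow -\infty \text{ and } W[G(x)]\rightarrow e \text{ as } x\rightarrow \infty .
\end{cases}
\tag{1}
\]

Alzaatreh et al. (2013) defined the T-X family cdf by

\[
F(x)=\int _{d}^{W[G(x)]} r(t)\,dt, \tag{2}
\]

where *W*[*G*(*x*)] satisfies the conditions (1). The pdf corresponding to (2) is given by

\[
f(x)=\left \{\frac {d}{dx}W[G(x)]\right \}\,r\left \{W[G(x)]\right \}.
\]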

In this paper, we propose a new wider class of continuous distributions called the *beta Marshall-Olkin* (BMO) family by taking \(W[G(x)]=\frac {G(x;\boldsymbol {\xi })}{c+(1-c)G(x;\boldsymbol {\xi })}\) and \(r(t)=\frac {1}{B(a,b)}t^{a-1}(1-t)^{b-1},\,\,\,0<t<1,a,b,c>0\). Its cdf is given by

\[
F(x;a,b,c,\boldsymbol {\xi })=\frac {1}{B(a,b)}\int _{0}^{\frac {G(x;\boldsymbol {\xi })}{c+(1-c)G(x;\boldsymbol {\xi })}} t^{a-1}(1-t)^{b-1}dt = I_{\frac {G(x;\boldsymbol {\xi })}{c+(1-c)G(x;\boldsymbol {\xi })}}(a,b), \tag{3}
\]

where \(I_{x}(a,b)=B(a,b)^{-1}{\int _{0}^{x}}t^{a-1}\,(1-t)^{b-1}dt\) denotes the incomplete beta function ratio, *G*(*x*; **ξ**) is the baseline cdf depending on a parameter vector **ξ**, and *a*>0, *b*>0 and *c*>0 are three additional shape parameters. For each baseline G, the BMO-G distribution is defined by the cdf (3). Equation (3) includes as special cases the beta-G (*c*=1), Marshall-Olkin-G (MOG) (*a*=*b*=1), exponentiated Marshall-Olkin-G (EMOG) (*b*=1) and exponentiated (*b*=*c*=1) classes, as listed in Table 1.

This paper is organized as follows. In Section 2, we provide a physical interpretation of the BMO-G family. Three special cases of this family are defined in Section 3. In Section 4, the shapes of the density and hazard rate functions are described analytically. Some useful expansions are derived in Section 5. In Section 6, we obtain a power series for the BMO-G quantile function (qf). In Section 7, we propose explicit expressions for the ordinary and incomplete moments using the qf expansion. The generating function and mean deviations are derived in Sections 8 and 9, respectively. General expressions for the Rényi and Shannon entropies are presented in Section 10. The order statistics are investigated in Section 11. Estimation of the model parameters by maximum likelihood is performed in Section 12. Applications to two real data sets illustrate the performance of the new family in Section 13. The paper is concluded in Section 14.

## The new density

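The density function corresponding to (3) is given by

\[
f(x;a,b,c,\boldsymbol {\xi })=\frac {c^{b}\,g(x;\boldsymbol {\xi })\,G(x;\boldsymbol {\xi })^{a-1}\,[1-G(x;\boldsymbol {\xi })]^{b-1}}{B(a,b)\,[c+(1-c)G(x;\boldsymbol {\xi })]^{a+b}}, \tag{4}
\]

where *g*(*x*; **ξ**) is the baseline pdf. Equation (4) will be most tractable when *G*(*x*; **ξ**) and *g*(*x*; **ξ**) have simple analytic expressions. Hereafter, a random variable *X* with density function (4) is denoted by *X*∼BMO-G(*a*,*b*,*c*,**ξ**). Further, we sometimes omit the dependence on the parameter vector **ξ** and simply write *G*(*x*)=*G*(*x*; **ξ**).

The hazard rate function (hrf) of *X* becomes

\[
\tau (x;a,b,c,\boldsymbol {\xi })=\frac {c^{b}\,g(x;\boldsymbol {\xi })\,G(x;\boldsymbol {\xi })^{a-1}\,[1-G(x;\boldsymbol {\xi })]^{b-1}}{B(a,b)\,[c+(1-c)G(x;\boldsymbol {\xi })]^{a+b}\left \{1-I_{\frac {G(x;\boldsymbol {\xi })}{c+(1-c)G(x;\boldsymbol {\xi })}}(a,b)\right \}}. \tag{5}
\]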

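The BMO family is easily simulated by inverting (3) as follows: if *V* has a beta distribution with positive parameters *a* and *b*, the solution of the nonlinear equation

\[
x=Q_{G}\left (\frac {c\,V}{1-(1-c)\,V}\right ),
\]

where \(Q_{G}=G^{-1}\) is the baseline quantile function, has the density function (4).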
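The inversion recipe can be sketched numerically. The following Python fragment is only an illustration under stated assumptions: it relies on NumPy and SciPy, uses a Weibull baseline with arbitrary shape values, and the helper names (`mo_transform`, `bmo_cdf`, `bmo_pdf`, `bmo_rvs`) are hypothetical, not taken from the paper. The density is written directly from the beta density evaluated at *W*[*G*(*x*)] times the derivative of *W*.

```python
import numpy as np
from scipy import integrate, special, stats


def mo_transform(G, c):
    """Marshall-Olkin transform W[G] = G / (c + (1 - c) G)."""
    return G / (c + (1.0 - c) * G)


def bmo_cdf(x, a, b, c, base):
    """BMO-G cdf: incomplete beta ratio I_W(a, b) evaluated at W[G(x)]."""
    return special.betainc(a, b, mo_transform(base.cdf(x), c))


def bmo_pdf(x, a, b, c, base):
    """BMO-G density, assembled on the log scale for numerical stability."""
    log_f = (b * np.log(c) + base.logpdf(x)
             + (a - 1.0) * base.logcdf(x) + (b - 1.0) * base.logsf(x)
             - (a + b) * np.log(c + (1.0 - c) * base.cdf(x))
             - special.betaln(a, b))
    return np.exp(log_f)


def bmo_rvs(size, a, b, c, base, rng):
    """Inversion sampler: V ~ Beta(a, b), then solve W[G(x)] = V for x."""
    v = rng.beta(a, b, size=size)
    u = c * v / (1.0 - (1.0 - c) * v)      # baseline probability with W[u] = v
    return base.ppf(u)


# Illustrative check: Weibull baseline (shape 2, scale 1), arbitrary a, b, c.
base = stats.weibull_min(c=2.0)
a, b, c = 2.0, 1.5, 0.5
total_mass, _ = integrate.quad(bmo_pdf, 0.0, np.inf, args=(a, b, c, base))
sample = bmo_rvs(20_000, a, b, c, base, np.random.default_rng(1))
ks = stats.kstest(sample, lambda x: bmo_cdf(x, a, b, c, base))
```

Here `total_mass` should be close to one, and the Kolmogorov-Smirnov statistic `ks.statistic` compares the simulated sample with the cdf; a small value suggests that the sampler and the cdf agree.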

The basic motivations for using the BMO family in practice are: (i) to make the kurtosis more flexible compared to the baseline model; (ii) to produce skewness for symmetrical distributions; (iii) to construct heavy-tailed distributions that are not longer-tailed for modeling real data; (iv) to generate distributions with symmetric, left-skewed, right-skewed and reversed-J shapes; (v) to define special models with all types of hrf; (vi) to generate a large number of special distributions such as those presented in Table 1; and (vii) to provide consistently better fits than other generated models under the same baseline distribution. A simple example of (ii): the normal distribution is symmetric, but the beta Marshall-Olkin normal (BMO-N) distribution can be skewed. Point (vii) is illustrated by fitting the BMO-N and beta Marshall-Olkin Weibull (BMO-W) distributions to two real data sets in Section 13. However, we expect that there are other contexts in which the BMO special models produce worse fits than other generated distributions. The results in Section 13 indicate that the new family is very competitive with other known generators having at most three extra shape parameters.

## Some special BMO distributions

The BMO-G density function (4) allows for greater flexibility of its tails and can be widely applied in many areas. The new family extends several widely-known distributions in the literature. Here, we present a few of its many special models.

### 3.1 The BMO-N distribution

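The BMO-N pdf is obtained from (4) by taking the normal *N*(*μ*,*σ*^{2}) as the parent distribution, where **ξ**=(*μ*,*σ*). Then,

\[
f(x)=\frac {c^{b}\,\phi \left (\frac {x-\mu }{\sigma }\right )\,\Phi \left (\frac {x-\mu }{\sigma }\right )^{a-1}\left [1-\Phi \left (\frac {x-\mu }{\sigma }\right )\right ]^{b-1}}{\sigma \,B(a,b)\left [c+(1-c)\,\Phi \left (\frac {x-\mu }{\sigma }\right )\right ]^{a+b}}, \tag{6}
\]

where *x*∈ℝ, *μ*∈ℝ is a location parameter, *σ*>0 is a scale parameter, and *ϕ*(·) and *Φ*(·) are the pdf and cdf of the standard normal distribution, respectively. The standard BMO-N density is obtained when *μ*=0 and *σ*=1. For *a*=*b*=*c*=1, it reduces to the normal density. Plots of (6) for some parameter values are displayed in Fig. 1.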

### 3.2 The BMO-W distribution

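Let *G*(*x*; **ξ**)=1−exp[−(*αx*)^{β}] be the Weibull cdf with scale parameter *α*>0 and shape parameter *β*>0, where **ξ**=(*α*,*β*). The BMO-W pdf (for *x*>0) reduces to

\[
f(x)=\frac {c^{b}\,\beta \,\alpha ^{\beta }\,x^{\beta -1}\exp [-b(\alpha x)^{\beta }]\left \{1-\exp [-(\alpha x)^{\beta }]\right \}^{a-1}}{B(a,b)\left \{1-(1-c)\exp [-(\alpha x)^{\beta }]\right \}^{a+b}}.
\]

The Weibull pdf (with parameters *α* and *β*) is a special case for *a*=*b*=*c*=1. Some possible shapes of the BMO-W pdf and hrf are displayed in Fig. 2.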

### 3.3 The Beta Marshall-Olkin gamma (BMO-Ga) distribution

The gamma cumulative distribution function (for *x*>0) with shape parameter *α*>0 and scale parameter *β*>0, **ξ**=(*α*,*β*), is given by

where \(\Gamma (p)=\int _{0}^{\infty }w^{p-1}\,\mathrm {e}^{-w}dw\) is the gamma function and \(\gamma (\alpha, z)={\int _{0}^{z}}w^{\alpha -1}\,\mathrm {e}^{-w}dw\) is the incomplete gamma function. The BMO-Ga pdf (for *x*>0) becomes

For *c*=1, we obtain the beta gamma (BG) distribution. The beta Marshall-Olkin exponential (BMO-E) distribution corresponds to *α*=1. Figure 3 displays some BMO-Ga pdfs and hrfs.

## Asymptotics and shapes

### Corollary 1.

The asymptotics of Eqs. (3), (4) and (5) as *G*(*x*)→0 are given by

### Corollary 2.

The asymptotics of Eqs. (3), (4) and (5) as *x*→*∞* are given by

The shapes of the density and hazard rate functions can be described analytically. The critical points of the BMO-G density function are the roots of the equation:

There may be more than one root to (7). If *x*=*x*_{0} is a root of (7) then it corresponds to a local maximum, a local minimum or a point of inflexion depending on whether *λ*(*x*_{0})<0, *λ*(*x*_{0})>0 or *λ*(*x*_{0})=0, where \(\lambda (x)=\frac {d^{2}\log [f(x)] }{d x^{2}}\) is given by

The critical points of *τ*(*x*) are the roots of the equation:

There may be more than one root to (8). If *x*=*x*_{0} is a root of (8) then it corresponds to a local maximum, a local minimum or a point of inflexion depending on whether *ς*(*x*_{0})<0, *ς*(*x*_{0})>0 or *ς*(*x*_{0})=0, where \(\varsigma (x)=\frac {d^{2}\log [\tau (x)] }{d x^{2}}\) is given by

## Useful representation

By using the generalized binomial expansion, we can prove that the cdf (3) of *X* admits the expansion

By exchanging the indices *l* and *k* in the sum symbol, we can write

and then

where (for *k*≥0)

The density function of *X* can be expressed as a mixture of exp-G densities

where *h*_{k+1}(*x*)=(*k*+1) *g*(*x*; **ξ**) *G*^{k}(*x*; **ξ**) denotes the exp-G density function with power parameter *k*+1.

Thus, some mathematical properties of the new model can be derived from those exp-G properties. For example, the ordinary and incomplete moments and moment generating function (mgf) of *X* can be obtained from those quantities of the exp-G distribution.

The formulae derived throughout the paper can be easily handled in most symbolic computation platforms such as Maple, Mathematica and Matlab. These platforms can deal with analytic expressions of formidable size and complexity. Established explicit expressions to calculate statistical measures can be more efficient than computing them directly by numerical integration. The infinite limit in these sums can be replaced by a large positive integer such as 20 or 30 for most practical purposes.

## Quantile power series

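The qf of *X*, say *x*=*Q*(*u*)=*F*^{−1}(*u*), can be obtained by inverting (3). Let *z*=*Q*_{a,b}(*u*) be the beta qf. Then,

\[
Q(u)=Q_{G}\left (\frac {c\,z}{1-(1-c)\,z}\right ),\qquad z=Q_{a,b}(u), \tag{11}
\]

where \(Q_{G}=G^{-1}\) is the baseline qf.

It is possible to obtain some expansions for *Q*_{a,b}(*u*) from the Wolfram website^{1} such as

\[
z=Q_{a,b}(u)=\sum _{i=0}^{\infty } e_{i}\,u^{i/a},
\]

where *e*_{i}=[*a* *B*(*a*,*b*)]^{i/a} *d*_{i} and *d*_{0}=0, *d*_{1}=1, *d*_{2}=(*b*−1)/(*a*+1), …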

The effects of the shape parameters *a*, *b* and *c* on the skewness and kurtosis of *X* can be studied using quantile measures. The shortcomings of the classical kurtosis measure are well-known. The Bowley skewness (Kenney and Keeping 1962) is one of the earliest skewness measures, defined by the average of the quartiles minus the median, divided by half the interquartile range, namely

\[
B=\frac {Q(3/4)+Q(1/4)-2\,Q(1/2)}{Q(3/4)-Q(1/4)}.
\]

Since only the middle two quartiles are considered and the outer two quartiles are ignored, this adds robustness to the measure. The Moors kurtosis (Moors 1988) is based on octiles

\[
M=\frac {Q(3/8)-Q(1/8)+Q(7/8)-Q(5/8)}{Q(6/8)-Q(2/8)}.
\]

These measures are less sensitive to outliers and they exist even for distributions without moments.
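As a rough illustration of how these quantile measures respond to the shape parameters, the sketch below evaluates them for a normal baseline. It assumes SciPy is available; the function names and the parameter values are illustrative choices, and the BMO-G quantile is obtained by composing the beta quantile with the baseline quantile.

```python
from scipy import stats


def bmo_ppf(u, a, b, c, base):
    """BMO-G quantile: beta quantile z, then baseline quantile at c z / [1 - (1 - c) z]."""
    z = stats.beta.ppf(u, a, b)
    return base.ppf(c * z / (1.0 - (1.0 - c) * z))


def bowley_skewness(ppf):
    """[Q(3/4) + Q(1/4) - 2 Q(1/2)] / [Q(3/4) - Q(1/4)]."""
    q1, q2, q3 = ppf(0.25), ppf(0.50), ppf(0.75)
    return (q3 + q1 - 2.0 * q2) / (q3 - q1)


def moors_kurtosis(ppf):
    """[Q(3/8) - Q(1/8) + Q(7/8) - Q(5/8)] / [Q(6/8) - Q(2/8)]."""
    o = {k: ppf(k / 8.0) for k in range(1, 8)}   # octiles Q(1/8), ..., Q(7/8)
    return (o[3] - o[1] + o[7] - o[5]) / (o[6] - o[2])


base = stats.norm()


def normal_case(u):
    """a = b = c = 1 recovers the plain normal quantile."""
    return bmo_ppf(u, 1.0, 1.0, 1.0, base)


def skewed_case(u):
    """Moving c away from 1 breaks the symmetry of the baseline."""
    return bmo_ppf(u, 1.0, 1.0, 0.5, base)


skew_normal, kurt_normal = bowley_skewness(normal_case), moors_kurtosis(normal_case)
skew_bmo, kurt_bmo = bowley_skewness(skewed_case), moors_kurtosis(skewed_case)
```

For the plain normal case the Bowley skewness is zero up to rounding, whereas the second parameter setting gives a non-zero value, in line with motivation (ii) of the family.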

## Moments

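We assume that *Y* is a random variable having the baseline cdf *G*(*x*). The moments of *X* can be determined from the (*r*,*k*)th probability weighted moment (PWM) of *Y* defined by

\[
\omega _{r,k}=\mathrm {E}\left [Y^{r}\,G(Y)^{k}\right ]=\int _{-\infty }^{\infty } x^{r}\,G(x)^{k}\,g(x)\,dx.
\]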

The PWMs are used to derive estimators of the parameters and quantiles of generalized distributions. The moment method of estimation is formulated by equating the population and sample PWMs. These estimators have low variance and no severe biases, and they compare favorably with estimators obtained by maximum likelihood. However, the maximum likelihood method is adopted in Section 12 because it is easier to estimate the BMO-G parameters with the computer routines available in widely known software. The maximum likelihood estimators (MLEs) enjoy desirable properties and can be used for constructing confidence intervals and also for test statistics.

We can write from Eq. 10

where \(\omega _{r,k}={\int _{0}^{1}} Q_{G}(u)^{r}\,u^{k} d u\) can be computed at least numerically from any baseline qf.
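The numerical evaluation of \(\omega _{r,k}\) from a baseline qf can be sketched as follows. This is a minimal illustration assuming SciPy's adaptive quadrature and a frozen SciPy distribution as the baseline; the function name `baseline_pwm` is hypothetical. The unit exponential is used only because its PWMs are known in closed form.

```python
from scipy import integrate, stats


def baseline_pwm(r, k, base):
    """omega_{r,k}: integral over (0, 1) of Q_G(u)^r u^k du for a frozen SciPy baseline."""
    value, _ = integrate.quad(lambda u: base.ppf(u) ** r * u ** k, 0.0, 1.0)
    return value


expon = stats.expon()                    # unit exponential, Q_G(u) = -log(1 - u)
omega_10 = baseline_pwm(1, 0, expon)     # equals E(Y) = 1
omega_11 = baseline_pwm(1, 1, expon)     # equals E[Y G(Y)] = 3/4
```

The same function applies to any baseline with a computable quantile function, although heavy-tailed baselines may need a check that the integral is finite.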

Thus, the moments of any BMO-G distribution can be expressed as an infinite weighted sum of the baseline PWMs. We now provide the PWMs for the three distributions discussed in Section 3. For the BMO-N and BMO-Ga distributions introduced in Sections 3.1 and 3.3, the quantities *ω*_{r,k} can be expressed in terms of the Lauricella functions of type A (see Exton 1978; Trott 2006) defined by

where (*a*)_{i}=*a*(*a*+1)…(*a*+*i*−1) is the ascending factorial (with the convention that (*a*)_{0}=1).

In fact, Cordeiro and Nadarajah (2011) determined *ω*_{r,k} for the standard normal distribution as

This equation holds when *r*+*k*−*l* is even and it vanishes when *r*+*k*−*l* is odd. So, any BMO-N moment can be expressed as an infinite weighted linear combination of Lauricella functions of type A.

For the gamma distribution, the quantities *ω*_{r,k} can be expressed from Eq. (9) of Cordeiro and Nadarajah (2011) as

As the last example, for the BMO-W distribution discussed in Section 3.2, the quantities *ω*_{r,k} reduce to

Some important questions in economics are answered by knowing the mean and the shape of a distribution. Incomplete moments of an income distribution form natural building blocks for measuring inequality: for example, the Lorenz and Bonferroni curves depend upon the incomplete moments of the income distribution.

The *n*th incomplete moment of *X* is defined by \(m_{n}(y)=\int _{-\infty }^{y}x^{n}\,f(x)dx\). So, *m*_{n}(*y*) follows as

The integral in (13) can be computed at least numerically for most baseline distributions.

## Generating function

We provide two formulae for the mgf *M*(*s*)=E(e^{sX}) of *X*. The first formula for *M*(*s*) comes from Eq. (10) as

where *M*_{k+1}(*s*) is the exp-G generating function with power parameter *k*+1.

The second formula for *M*(*s*) follows in terms of the baseline qf as

where the quantity \(\rho _{k}(s)={\int _{0}^{1}}\exp \left [s\,Q_{G}(u)\right ] u^{k} d u\) can be computed numerically. Equations (14) and (15) are the main results of this section.

## Mean deviations

The mean deviations about the mean (\(\delta _{1}=\mathrm {E}(|X-\mu ^{\prime }_{1}|)\)) and about the median (*δ*_{2}=E(|*X*−*M*|)) of *X* can be expressed as

\[
\delta _{1}=2\mu ^{\prime }_{1}\,F(\mu ^{\prime }_{1})-2\,m_{1}(\mu ^{\prime }_{1})\quad \text {and}\quad \delta _{2}=\mu ^{\prime }_{1}-2\,m_{1}(M), \tag{16}
\]

respectively, where *M*=*Q*(0.5) is the median of *X*, \(\mu ^{\prime }_{1}=\mathrm {E}(X)\) comes from Eq. (12), \(F(\mu ^{\prime }_{1})\) is easily calculated from Eq. (3) and \(m_{1}(z)=\int _{-\infty }^{z} x\,f(x) dx\) is the first incomplete moment.

Now, we provide two alternative ways to compute *δ*_{1} and *δ*_{2}. A general equation for *m*_{1}(*z*) can be derived from Eq. (10) as

where \(J_{k+1}(z)=\int _{-\infty }^{z} x\,h_{k+1}(x)dx\).

Equation (17) is the basic quantity to compute the mean deviations in Eq. (16). A simple application of it refers to the BMO-W model (Section 3.2). The exponentiated Weibull density function (for *x*>0) with power parameter *k*+1, scale parameter *α* and shape parameter *β*, is given by

and then

Using the incomplete gamma function, the last integral reduces to

A second general formula for *m*_{1}(*z*) can be derived by setting *u*=*G*(*x*) in Eq. (17)

where \(T_{k}(z)=\int _{0}^{G(z)}Q_{G}(u)\,u^{k} du\).

The main application of the first incomplete moment refers to the Bonferroni and Lorenz curves, which are very useful in economics, reliability, demography, insurance and medicine. For a given probability *π*, these curves are defined by \(B(\pi)=m_{1}(q)/(\pi \,\mu ^{\prime }_{1})\) and \(L(\pi)=m_{1}(q)/\mu ^{\prime }_{1}\), respectively, where *q*=*Q*(*π*) is calculated from the parent qf in (11), and they can be obtained from the expressions for *m*_{1}(*z*) given above. In Fig. 4, we plot the measures *B* and *L* of the BMO-N and BMO-W distributions. The plots indicate how these measures vary with the shape parameters.
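A numerical route to these curves, given here only as a sketch, uses the identity \(m_{1}(Q(\pi))=\int _{0}^{\pi } Q(u)\,du\) instead of the series expressions. It assumes SciPy, a positive random variable with finite mean, and hypothetical helper names; the unit exponential case (*a*=*b*=*c*=1) serves as a check because its Lorenz curve is \(\pi +(1-\pi)\log (1-\pi)\).

```python
import numpy as np
from scipy import integrate, stats


def bmo_ppf(u, a, b, c, base):
    """BMO-G quantile: beta quantile z, then baseline quantile at c z / [1 - (1 - c) z]."""
    z = stats.beta.ppf(u, a, b)
    return base.ppf(c * z / (1.0 - (1.0 - c) * z))


def lorenz_bonferroni(pi, a, b, c, base):
    """Return (L(pi), B(pi)) with m_1(Q(pi)) computed as the integral of Q over (0, pi)."""
    qf = lambda u: bmo_ppf(u, a, b, c, base)
    mean, _ = integrate.quad(qf, 0.0, 1.0)        # first ordinary moment
    partial, _ = integrate.quad(qf, 0.0, pi)      # first incomplete moment at Q(pi)
    lorenz = partial / mean
    return lorenz, lorenz / pi


# Check against the exponential closed form, then an arbitrary BMO-W setting.
L_exp, B_exp = lorenz_bonferroni(0.5, 1.0, 1.0, 1.0, stats.expon())
L_bmow, B_bmow = lorenz_bonferroni(0.5, 2.0, 1.5, 0.5, stats.weibull_min(c=2.0))
closed_form = 0.5 + 0.5 * np.log(0.5)
```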

## Entropies

An entropy is a measure of variation or uncertainty of a random variable *X*. Two popular entropy measures are the Rényi and Shannon entropies (Rényi 1961; Shannon 1951). The Rényi entropy of a random variable with pdf *f*(*x*) is defined by

\[
I_{R}(\gamma)=\frac {1}{1-\gamma }\log \left (\int _{-\infty }^{\infty } f^{\gamma }(x)\,dx\right)
\]

for *γ*>0 and *γ*≠1. The Shannon entropy of a random variable *X* is defined by *E*{− log[*f*(*X*)]}. It is the special case of the Rényi entropy when *γ*↑1. Direct calculation yields

First, let

Using the generalized binomial expansion, we obtain

After some algebraic manipulations, we have the following proposition.

### Proposition 1.

Let *X* be a random variable with pdf (4). Then,

The simplest formula for the entropy of *X* is given by

After some algebra, we obtain an alternative expression for *I*_{R}(*γ*)

where

and \(I(\gamma,a,j)=\int _{0}^{\infty }\,g(x)^{\gamma }\,G(x)^{\gamma (a-1)+j}\,dx\).
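As a numerical cross-check on series expressions of this kind, the Rényi entropy can also be computed by direct quadrature of \(f^{\gamma }\). The sketch below assumes SciPy and uses hypothetical function names; it is verified against the known Rényi entropy of the standard normal, \(\tfrac {1}{2}\log (2\pi)+\log (\gamma)/[2(\gamma -1)]\), which is the BMO-N case *a*=*b*=*c*=1.

```python
import numpy as np
from scipy import integrate, special, stats


def bmo_logpdf(x, a, b, c, base):
    """Log of the BMO-G density for a frozen SciPy baseline distribution."""
    return (b * np.log(c) + base.logpdf(x)
            + (a - 1.0) * base.logcdf(x) + (b - 1.0) * base.logsf(x)
            - (a + b) * np.log(c + (1.0 - c) * base.cdf(x))
            - special.betaln(a, b))


def renyi_entropy(gamma, a, b, c, base, lower=-np.inf, upper=np.inf):
    """(1 - gamma)^{-1} log of the integral of f(x)^gamma over the support."""
    integrand = lambda x: np.exp(gamma * bmo_logpdf(x, a, b, c, base))
    integral, _ = integrate.quad(integrand, lower, upper)
    return np.log(integral) / (1.0 - gamma)


normal_value = renyi_entropy(2.0, 1.0, 1.0, 1.0, stats.norm())
normal_exact = 0.5 * np.log(2.0 * np.pi) + 0.5 * np.log(2.0)
bmon_value = renyi_entropy(2.0, 2.0, 1.5, 0.5, stats.norm())   # arbitrary BMO-N shapes
```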

## Order statistics

Order statistics make their appearance in many areas of statistical theory and practice. Suppose *X*_{1},…,*X*_{n} is a random sample from any BMO-G distribution. Let *X*_{i:n} denote the *i*th order statistic. The pdf of *X*_{i:n} can be expressed as

\[
f_{i:n}(x)=K\,f(x)\,F^{i-1}(x)\,\{1-F(x)\}^{n-i},
\]

where *K*=*n*!/[(*i*−1)! (*n*−*i*)!].

We can demonstrate that the density function of the *i*th order statistic of any BMO-G distribution can be expressed as

where *h*_{r+k+1}(*x*) denotes the exp-G density function with parameter *r*+*k*+1, *β*_{r} is given by (9) and the quantities *f*_{j+i−1,k} can be determined by \(f_{j+i-1,0}=\beta _{0}^{j+i-1}\) and recursively (for *k*≥1)

We can obtain the ordinary and incomplete moments, generating function and mean deviations of the BMO-G order statistics from Eq. (18) and some properties of the exp-G model.

## Estimation

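Here, we determine the MLEs of the model parameters of the new family from complete samples only. Let *x*_{1},…,*x*_{n} be observed values from the BMO-G distribution with parameters *a*, *b*, *c* and **ξ**. Let *Θ*=(*a*,*b*,*c*,**ξ**)^{⊤} be the (*r*+3)×1 parameter vector, where *r* is the number of baseline parameters. The total log-likelihood function for *Θ* is given by

\[
\ell _{n}(\Theta)=n\,b\log c-n\log B(a,b)+\sum _{i=1}^{n}\log g(x_{i};\boldsymbol {\xi })+(a-1)\sum _{i=1}^{n}\log G(x_{i};\boldsymbol {\xi })+(b-1)\sum _{i=1}^{n}\log [1-G(x_{i};\boldsymbol {\xi })]-(a+b)\sum _{i=1}^{n}\log [c+(1-c)G(x_{i};\boldsymbol {\xi })]. \tag{19}
\]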

Numerical maximization of (19) can be performed by using the RS method (Rigby and Stasinopoulos 2005), which is available in the gamlss package (R Development Core Team 2013), SAS (Proc NLMixed) or the Ox program (sub-routine MaxBFGS) (see Doornik 2007), or by solving the nonlinear likelihood equations obtained by differentiating (19). Let \(U_{n}(\Theta)=(\partial \ell _{n}/\partial a,\partial \ell _{n}/\partial b,\partial \ell _{n}/\partial c,\partial \ell _{n}/\partial \boldsymbol {\xi })^{\top }\) be the score function, whose components are

and

where *h*^{(ξ)}(·) means the derivative of the function *h* with respect to *ξ*. Setting these equations to zero, *U*_{a}=*U*_{b}=*U*_{c}=*U*_{ξ}=**0**, and solving them simultaneously yields the MLE \(\widehat {\Theta }\) of *Θ*.
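A minimal numerical sketch of this maximization for the BMO-W model is given below. It is not the routine used in the paper: it assumes NumPy and SciPy, substitutes the derivative-free Nelder-Mead method for BFGS with analytical derivatives, works on log-parameters to keep all five parameters positive, and uses synthetic Weibull data rather than either real data set. The function name `bmow_negloglik` is hypothetical.

```python
import numpy as np
from scipy import optimize, special


def bmow_negloglik(theta, x):
    """Negative BMO-W log-likelihood; theta = log(a, b, c, alpha, beta)."""
    with np.errstate(all="ignore"):
        a, b, c, alpha, beta = np.exp(theta)
        t = (alpha * x) ** beta                       # (alpha x)^beta
        G = -np.expm1(-t)                             # Weibull cdf
        log_g = np.log(beta) + beta * np.log(alpha) + (beta - 1.0) * np.log(x) - t
        ll = (b * np.log(c) - special.betaln(a, b) + log_g
              + (a - 1.0) * np.log(G) + (b - 1.0) * (-t)   # log[1 - G(x)] = -t
              - (a + b) * np.log(c + (1.0 - c) * G))
        total = np.sum(ll)
    # Non-finite values are mapped to +inf so the simplex search moves away.
    return -total if np.isfinite(total) else np.inf


rng = np.random.default_rng(0)
x = rng.weibull(2.0, size=200)            # synthetic data: Weibull, i.e. a = b = c = 1
theta0 = np.zeros(5)                      # start at a = b = c = alpha = beta = 1
fit = optimize.minimize(bmow_negloglik, theta0, args=(x,), method="Nelder-Mead",
                        options={"maxiter": 4000, "xatol": 1e-6, "fatol": 1e-8})
a_hat, b_hat, c_hat, alpha_hat, beta_hat = np.exp(fit.x)
```

With five parameters the likelihood surface can be flat in some directions, so in practice it seems prudent to try several starting values and compare the attained values of `fit.fun`.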

For interval estimation of the model parameters, the observed information matrix *J*(*Θ*) is required; its elements *U*_{rs}=*∂*^{2}*ℓ*/*∂r* *∂s* (for *r*,*s*=*a*,*b*,*c*,**ξ**) can be computed numerically. Under standard regularity conditions (Cox and Hinkley 1979), we can approximate the distribution of \((\widehat {\Theta }-\Theta)\) by the multivariate normal *N*_{r+3}(0, *J*(*Θ*)^{−1}) distribution, where *r* is the number of parameters of the baseline distribution.

We can compute the maximum values of the unrestricted and restricted log-likelihoods to construct likelihood ratio (LR) statistics for testing some sub-models of the BMO-G distribution. For example, we may use LR statistics to check if the fit using the BMO-W distribution is statistically “superior” to the fits using the BW, MOW, EW, EE and Weibull distributions for a given data set.

Censoring is often encountered with lifetime data and in reliability studies. Suppose that the lifetimes are independently distributed and independent of the censoring mechanism, and that censoring is random and noninformative. For right-censored lifetime data, we observe *x*_{i}=min(*X*_{i},*C*_{i}) and *δ*_{i}=*I*(*X*_{i}≤*C*_{i}), so that *δ*_{i}=1 if *X*_{i} is a time to event and *δ*_{i}=0 if it is right censored, where *X*_{i} is the lifetime and *C*_{i} is the censoring time for the *i*th individual, *i*=1,…,*n*. The censored likelihood *L*(*Θ*) for the model parameters is

\[
L(\Theta)=\prod _{i=1}^{n}\left [f(x_{i};a,b,c,\boldsymbol {\xi })\right ]^{\delta _{i}}\left [S(x_{i};a,b,c,\boldsymbol {\xi })\right ]^{1-\delta _{i}}, \tag{20}
\]

where *S*(*x*;*a*,*b*,*c*,**ξ**)=1−*F*(*x*;*a*,*b*,*c*,**ξ**) is the survival function obtained from (3) and *f*(*x*;*a*,*b*,*c*,**ξ**) is given by (4). We maximize the likelihood (20) in the same way as described before.
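The logarithm of the censored likelihood can be sketched for the BMO-W model as follows. The fragment assumes NumPy and SciPy; the function name, the toy observations and the censoring indicators are invented for illustration. The survival function is evaluated through the identity \(1-I_{w}(a,b)=I_{1-w}(b,a)\), which avoids subtracting nearly equal numbers in the right tail.

```python
import numpy as np
from scipy import special


def bmow_censored_loglik(params, x, delta):
    """Log of the right-censored likelihood for the BMO-W model.

    params = (a, b, c, alpha, beta); delta[i] = 1 marks an observed failure
    and delta[i] = 0 a right-censored time.
    """
    a, b, c, alpha, beta = params
    t = (alpha * x) ** beta
    G, S = -np.expm1(-t), np.exp(-t)             # Weibull cdf and survival function
    den = c + (1.0 - c) * G
    log_f = (b * np.log(c) - special.betaln(a, b)
             + np.log(beta) + beta * np.log(alpha) + (beta - 1.0) * np.log(x) - t
             + (a - 1.0) * np.log(G) + (b - 1.0) * (-t) - (a + b) * np.log(den))
    # 1 - I_W(a, b) = I_{1 - W}(b, a), with 1 - W = c S / den.
    log_surv = np.log(special.betainc(b, a, c * S / den))
    return np.sum(delta * log_f + (1.0 - delta) * log_surv)


# Toy data (hypothetical): two observed failures and two censored times.
x = np.array([0.5, 1.0, 1.5, 2.0])
delta = np.array([1.0, 0.0, 1.0, 0.0])
weibull_case = bmow_censored_loglik((1.0, 1.0, 1.0, 1.0, 2.0), x, delta)
bmow_case = bmow_censored_loglik((2.0, 1.5, 0.5, 1.0, 2.0), x, delta)
```

For *a*=*b*=*c*=1 the value reduces to the usual censored Weibull log-likelihood, which gives a simple consistency check before passing the negated function to a numerical optimizer.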

## Empirical illustration

We illustrate the flexibility of the BMO-W and BMO-N distributions by means of two real data sets. Similar investigations could be performed for other BMO distributions. We have chosen these distributions because of the popularity of their baseline distributions. The computations are performed using the software R version 3.0.0 (package **bbmle**). The maximization follows the BFGS method with analytical derivatives. The algorithm used to estimate the model parameters converged for all current models.

### 13.1 Illustration 1: Failure time data

We next consider the data studied by (Murthy et al. 2004), which represent failure times for a particular windshield device. The windshield on a large aircraft is a complex piece of equipment, comprised basically of several layers of material, including a very strong outer skin with a heated layer just beneath it, all laminated under high temperature and pressure. Failures of these items are not structural failures. Instead, they typically involve damage or delamination of the nonstructural outer ply or failure of the heating system. These failures do not result in damage to the aircraft but do result in replacement of the windshield. We compare the results of the fits of the BMO-W distribution, its special models (W, EW, BW, MOW and EMOW) and the following distributions: the Kumaraswamy Weibull (Kw-W) model with pdf given by

the McDonald Weibull (McW) model with pdf given by

and the Libby-Novick beta Weibull (LNB-W) model with pdf given by

where *K*=*c*^{a}/*B*(*a*,*b*), *a*>0, *b*>0, *c*>0, *α*>0, *β*>0 and *x*>0.

In Table 2, the MLEs and their standard errors (SEs) (in parentheses) of the parameters from nine fitted models and the Akaike Information Criterion (AIC), Consistent Akaike Information Criterion (CAIC) and Bayesian Information Criterion (BIC) values are presented. According to the lowest values of the AIC and CAIC statistics, the BMO-W model could be chosen as the best model among the nine fitted models. Formal tests for the extra shape parameters in the BMO-W distribution can be performed based on LR statistics. The results for comparing the models to the current data are given in Table 3. The rejection of the null models is significant for the five LR tests. So, we have evidence of the potential need for the three parameters of the BMO-W distribution for the current data.

The plots of the fitted BMO-W pdf and of the four fitted pdfs discussed before are displayed in Fig. 5. They indicate that the BMO-W distribution provides a better fit to these data compared to the other models. So, this distribution can be considered a very competitive model to the LNB-W distribution.

### 13.2 Illustration 2: Plasma ferritin data

Here, we consider the data discussed by Weisberg (2014, Section 6.4), which were collected on 202 athletes at the Australian Institute of Sport. The variable evaluated in this study is the plasma ferritin concentration. These data were analyzed recently by Cordeiro et al. (2014) using the Libby-Novick beta normal (LNB-N) distribution with density function given by

where *x*∈ℝ, *μ*∈ℝ is a location parameter, *σ*>0 is a scale parameter, *a*, *b* and *c* are positive shape parameters and *ϕ*(·) and *Φ*(·) are the pdf and cdf of the standard normal distribution, respectively.

In Table 4, the MLEs and their SEs (in parentheses) of the parameters from nine fitted models and the AIC, CAIC and BIC values are presented. According to the lowest values of these statistics, the BMO-N model could be chosen as the best model among the nine fitted models. Formal tests for the extra shape parameters in the BMO-N distribution can be performed based on LR statistics. The results for comparing the models to the current data are given in Table 5. The rejection of the null models is significant for the five LR tests. So, we have clear evidence for the three parameters of the BMO-N distribution when modeling data of this type. The plots of the fitted BMO-N pdf and of the four fitted pdfs discussed before are displayed in Fig. 6. They indicate that the BMO-N distribution provides the best fit to these data compared to the other models. Finally, the proposed distribution can be considered a very competitive model to the LNB-N distribution.

## Concluding remarks

We define a new class of models, named the *beta Marshall-Olkin-G* (BMO-G) family of distributions, by adding three shape parameters, which generalizes some well-known distributions in the statistical literature such as the normal, Weibull and beta distributions. We provide a mathematical treatment of the proposed family including expansions for the density function, ordinary and incomplete moments and generating function. The BMO-G density function can be expressed as a mixture of exponentiated density functions. This property is important to obtain several other results. We derive a power series for the quantile function. Our formulas related to the BMO-G model are manageable, and with the use of modern computer resources with analytic and numerical capabilities, they may turn into adequate tools for applied statisticians. Some special models are explored. The estimation of the model parameters is carried out by the method of maximum likelihood. Finally, we fit some special models in the new family to two real data sets to demonstrate their potential.

## Endnote

^{1} http://functions.wolfram.com/06.23.06.0004.01

## References

Alexander, C, Cordeiro, GM, Ortega, EMM, Sarabia, JM: Generalized beta-generated distributions. Comput. Stat. Data Anal. 56, 1880–1897 (2012).

Alzaatreh, A, Lee, C, Famoye, F: A new method for generating families of continuous distributions. METRON. 71, 63–79 (2013).

Bourguignon, M, Silva, RB, Cordeiro, GM: The Weibull-G family of probability distributions. J. Data Sci. 12, 53–68 (2014).

Cordeiro, GM, Alizadeh, M, Ortega, EMM: The exponentiated half-logistic family of distributions: Properties and applications. J. Probab. Stat. 2014, 1–21 (2014).

Cordeiro, GM, Castro, M: A new family of generalized distributions. J. Stat. Comput. Simul. 81, 883–898 (2011).

Cordeiro, GM, Nadarajah, S: Closed-form expressions for moments of a class of beta generalized distributions. Braz. J. Probab. Stat. 25, 14–33 (2011).

Cox, DR, Hinkley, DV: Theoretical statistics. Chapman and Hall, London (1979).

Doornik, J: Ox 5: An Object-Oriented Matrix Language. Timberlake Consultants Press, London (2007).

Eugene, N, Lee, C, Famoye, F: Beta-normal distribution and its applications. Commun. Stat. Theory Methods. 31, 497–512 (2002).

Exton, H: Handbook of Hypergeometric Integrals: Theory, Applications, Tables, Computer Programs. Halsted Press, New York (1978).

Gupta, RC, Gupta, PL, Gupta, RD: Modeling failure time data by Lehman alternatives. Commun. Stat. Theory Methods. 27, 887–904 (1998).

Gupta, RC, Gupta, RD: Proportional reversed hazard rate model and its applications. J. Stat. Planning Inference. 137, 3525–3536 (2007).

Jayakumar, K, Mathew, T: On a generalization to Marshall–Olkin scheme and its application to Burr type XII distribution. Stat. Papers. 49, 421–439 (2008).

Jones, MC: Families of distributions arising from distributions of order statistics. Test. 13, 1–43 (2004).

Kenney, JF, Keeping, ES: Mathematics of Statistics, Part 1. 3rd edition, Van Nostrand, New Jersey (1962).

Marshall, AW, Olkin, I: A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika. 84, 641–652 (1997).

Moors, JJA: A quantile alternative for kurtosis. The Statistician. 37, 25–32 (1988).

Murthy, DNP, Xie, M, Jiang, R: Weibull models. New Jersey (2004).

R Development Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2013).

Rényi, A: On measures of entropy and information. In: *Proceedings of the 4th Berkeley Symposium on Mathematical Statistics and Probability*, volume I. University of California Press, Berkeley (1961).

Rigby, RA, Stasinopoulos, DM: Generalized additive models for location, scale and shape (with discussion). Appl. Stat. 54, 507–554 (2005).

Shannon, CE: Prediction and entropy of printed English. Bell Syst. Tech. J. 30, 50–64 (1951).

Trott, M: The Mathematica Guidebook for Symbolics. With 1 DVD-ROM (Windows, Macintosh and UNIX). Springer, New York (2006).

Weisberg, S: Applied linear regression. 3rd edition. Wiley, New York (2014).

Zografos, K, Balakrishnan, N: On families of beta- and generalized gamma-generated distributions and associated inference. Stat. Methodol. 6, 344–362 (2009).

## Additional information

### Competing interests

The authors declare that they have no competing interests.

### Authors’ contributions

MA developed the mathematics of the paper and participated in drafting the manuscript. EB developed Sections 3 and 13, produced the article figures and participated in drafting the manuscript. GMC and CGBD were advisors and reviewed all the work from the initial idea, through preparation of the manuscript until the final version. All authors read and approved the final manuscript.

## About this article

### Cite this article

Alizadeh, M., Cordeiro, G.M., Brito, E.d. *et al.* The beta Marshall-Olkin family of distributions. *J Stat Distrib App* **2**, 4 (2015). doi:10.1186/s40488-015-0027-7

### Keywords

- Generated family
- Marshall-Olkin family
- Maximum likelihood
- Moment
- Order statistic
- Quantile function
- Rényi entropy