The Marshall-Olkin extended Weibull family of distributions

Santos-Neto, Manoel; Bourguignon, Marcelo; Zea, Luz M; Nascimento, Abraão DC; Cordeiro, Gauss M

doi:10.1186/2195-5832-1-9

Research
Open access
Published: 16 June 2014

The Marshall-Olkin extended Weibull family of distributions

Manoel Santos-Neto¹,
Marcelo Bourguignon²,
Luz M Zea³,
Abraão DC Nascimento⁴ &
…
Gauss M Cordeiro⁴

Journal of Statistical Distributions and Applications volume 1, Article number: 9 (2014) Cite this article

4890 Accesses
12 Citations
Metrics details

Abstract

We introduce a new class of models called the Marshall-Olkin extended Weibull family of distributions based on the work by Marshall and Olkin (Biometrika 84:641–652, 1997). The proposed family includes as special cases several models studied in the literature such as the Marshall-Olkin Weibull, Marshall-Olkin Lomax, Marshal-Olkin Fréchet and Marshall-Olkin Burr XII distributions, among others. It defines at least twenty-one special models and thirteen of them are new ones. We study some of its structural properties including moments, generating function, mean deviations and entropy. We obtain the density function of the order statistics and their moments. Special distributions are investigated in some details. We derive two classes of entropy and one class of divergence measures which can be interpreted as new goodness-of-fit quantities. The method of maximum likelihood for estimating the model parameters is discussed for uncensored and multi-censored data. We perform a simulation study using Markov Chain Monte Carlo method in order to establish the accuracy of these estimators. The usefulness of the new family is illustrated by means of two real data sets.

Mathematics Subject Classification (2010)

60E05; 62F03; 62F10; 62P10

1 Introduction

The Weibull distribution has assumed a prominent position as statistical model for data from reliability, engineering and biological studies (McCool 2012). This model has been exaustively used for describing hazard rates – an important quantity of survival analysis. In the context of monotone hazard rates, some results from the literature suggest that the Weibull law is a reasonable choice due to its negatively and positively skewed density shapes. However, this distribution is not a good model for describing phenomenon with non-monotone failure rates, which can be found on data from applications in reliability and biological studies. Thus, extended forms of the Weibull model have been sought in many applied areas. As a solution for this issue, the inclusion of additional parameters to a well-defined distribution has been indicated as a good methodology for providing more flexible new classes of distributions.

Marshall and Olkin (1997) derived an important method of including an extra shape parameter to a given baseline model thus defining an extended distribution. The Marshall and Olkin (“ $ℳ O$ ” for short) transformation furnishes a wide range of behaviors with respect to the baseline distribution. The geometrical and inferential properties associated with the generated distribution depend on the values of the extra parameter. These characteristics provide more flexibility to the $ℳ O$ generated distributions. Considering the proportional odds model, Sankaran and Jayakumar (2008) presented a detailed discussion about the physical interpretation of the $ℳ O$ family.

This family has a relationship with the odds ratio associated with the baseline distribution. Let X be a distributed $ℳ O$ random variable which describes the lifetime relative to each individual in the population with a vector of p-covariates z=(z₁,…,z_p)^⊤, where (·)^⊤ denotes the transposition operator. Then, the cumulative distribution function (cdf) of X is given by

\bar{F} (x; z) = \frac{k (z) \bar{G} (x)}{1 - [1 - k (z)] \bar{G} (x)},

(1)

where k(z)=λ_G(x)/λ_F(x ; z) is a non-negative function such that z is independent of the time x, λ_F(x ; z) is the proportional odds model [for a discussion about such modeling, see Sankaran and Jayakumar (2008)] and $λ_{G} (x) = G (x) / \bar{G} (x)$ represents an arbitrary odds for the baseline distribution.

In this paper, we consider k(z)=δ. Before, however, it is important to highlight two important properties of the $ℳ O$ transformation: (i) the stability and (ii) geometric extreme stability (Marshall and Olkin 1997). In other words, the $ℳ O$ distribution possesses a stability property in the sense that if the method is applied twice, it returns to the same distribution. In addition, the following stochastic behavior can also be verified: let {X₁,…,X_N} be a random sample from the population random variable equipped with the survival function (1) at k(z)=δ. Suppose that N has the geometric distribution with probability p and that this quantity is independent of X_i, for i=1,…,N. Then, U=m i n(X₁,…,X_N) and V=m a x(X₁,…,X_N) are random variables having survival functions (1) such that k(z) can be equal to p and p⁻¹, respectively, i.e., the $ℳ O$ transform satisfies the geometric extreme stability property.

Due to these advantages, many papers have employed the $ℳ O$ transformation. In Marshall and Olkin work, the exponential and Weibull distributions were generalized. Subsequently, the $ℳ O$ extension was applied to several well-known distributions: Weibull (Ghitany et al.2005, Zhang and Xie 2007), Pareto (Ghitany 2005), gamma (Ristić et al.2007), Lomax (Ghitany et al.2007) and linear failure-rate (Ghitany and Kotz 2007) distributions. More recently, general results have been addressed by Barreto-Souza et al. (2013) and Cordeiro and Lemonte (2013). In this paper, we aim to apply the $ℳ O$ generator to the extended Weibull ( $E W$ ) class of distributions to obtain a new more flexible family to describe reliability data. The proposed family can also be applied to other fields including business, environment, informatics and medicine in the same way as it was originally done with the Birnbaum-Saunders and other lifetime distributions.

Let $\bar{G} (x) = 1 - G (x)$ and g(x)=d G(x)/d x be the survival and density functions of a continuous random variable Y with baseline cdf G(x). Then, the $ℳ O$ extended distribution has survival function given by

\bar{F} (x; δ) = \frac{δ \bar{G} (x)}{1 - \bar{δ} \bar{G} (x)} = \frac{δ \bar{G} (x)}{G (x) + δ \bar{G} (x)}, x \in X \subseteq R, δ > 0,

(2)

where $\bar{δ} = 1 - δ.$

Clearly, δ=1 implies $\bar{F} (x) = \bar{G} (x)$ . The family (2) has probability density function (pdf) given by

f (x; δ) = \frac{δg (x)}{{[1 - \bar{δ} \bar{G} (x)]}^{2}}, x \in X \subseteq R, δ > 0 .

Its hazard rate function (hrf) becomes

τ (x; δ) = \frac{g (x)}{\bar{G} (x) [1 - \bar{δ} \bar{G} (x)]}, x \in X \subseteq R, δ > 0 .

Further, the class of extended Weibull ( $E W$ ) distributions pioneered by Gurvich et al. (1997) has achieved a prominent position in lifetime models. Its cdf is given by

G (x; α, ξ) = 1 - exp [- αH (x; ξ)], x \in D \subseteq R_{+}, α > 0,

(3)

where H(x;ξ) is a non-negative monotonically increasing function which depends on the parameter vector ξ. The corresponding pdf is given by

g (x; α, ξ) = α exp [- αH (x; ξ)] h (x; ξ),

(4)

where h(x;ξ) is the derivative of H(x;ξ).

Different expressions for H(x;ξ) in Equation (3) define important models such as:

(i)
H(x;ξ)=x gives the exponential distribution;
(ii)
H(x;ξ)=x ² leads to the Rayleigh (Burr type-X) distribution;
(iii)
H(x;ξ)= log(x/k) leads to the Pareto distribution;
(iv)
H(x;ξ)=β ⁻¹[ exp(β x)−1] gives the Gompertz distribution.

In this paper, we derive a new family of distributions by compounding the $ℳ O$ and $E W$ classes. We define a new generated family in order to provide a “better fit” in certain practical situations. The compounding procedure follows by taking the $E W$ class (3) as the baseline model in Equation (2). The Marshall-Olkin extended Weibull ( $ℳ O E W$ ) family of distributions contains some special models as those listed in Table 1 with the corresponding H(·;·) and h(·;·) functions and the parameter vectors.

Table 1 Special models and the corresponding functions H ( x ; ξ ) and h ( x ; ξ )

Full size table

The paper unfolds as follows. Section 2 presents the cdf and pdf of the proposed distribution and some expansions for the density function. The main statistical properties of the new family are derived in Section 3 including the moments, moment generating function (mgf) and incomplete moments, quantile function (qf), random number generator, skewness and kurtosis measures, order statistics, mean deviations and average lifetime functions. In Section 4, we derive four measures of information theory: Shannon and Rényi entropies, cross entropy and Kullback-Leibler divergence. The maximum likelihood method to estimate the model parameters is adopted in Section 5. Two special models are studied in some details in Section 6. We perform a simulation study using Monte Carlo’s experiments in order to assess the accuracy of the maximum likelihood estimators (MLEs) in Section 7.1 and two applications to real data in Section 7.2. Conclusions and some future lines of research are addressed in Section 8.

2 The $ℳ O E W$ family

The cdf of the new family of distributions is given by

F (x; δ, α, ξ) = \frac{1 - exp [- αH (x; ξ)]}{1 - \bar{δ} exp [- αH (x; ξ)]}, x \in D,

(5)

where α>0 and δ>0. Using (5), we can express its survival function as

\bar{F} (x; δ, α, ξ) = \frac{δ exp [- αH (x; ξ)]}{1 - \bar{δ} exp [- αH (x; ξ)]}, x \in D

(6)

and the associated hrf reduces to

τ (x; δ, α, ξ) = \frac{α h (x; ξ)}{1 - \bar{δ} exp [- αH (x; ξ)]}, x \in D.

(7)

The corresponding pdf is given by

f (x; δ, α, ξ) = \frac{δ α h (x; ξ) exp [- αH (x; ξ)]}{{1 - \bar{δ} exp [- αH (x; ξ)]}^{2}},

(8)

where H(x;ξ) can be any special distribution listed in Table 1.

Hereafter, let X be a random variable having the $ℳ O E W$ pdf (8) with parameters δ,α and ξ, say $X \sim ℳ O E W (δ, α, ξ)$ . Equation (8) extends several distributions which have been studied in the literature.

The $ℳ O$ Pareto (Ghitany 2005) is obtained by taking H(x; ξ)= log(x/k)(x≥k). Further, for H(x; ξ)=x^γ we obtain the $ℳ O$ Weibull (Ghitany et al. 2005, Zhang and Xie 2007). The $ℳ O$ Lomax (Ghitany et al. 2007) and $ℳ O$ log-logistic are derived from (8) by taking H(x; ξ)= log(1+x^c) with c=1 and H(x; ξ)= log(1+x^c) with α=1, respectively. For H(x; ξ)=a x+b x²/2 and α=1, Equation (8) reduces to the $ℳ O$ linear failure rate (Ghitany and Kotz 2007). In the same way, for H(x; ξ)= log(1+x^c), we have the $ℳ O$ Burr XII (Jayakumar and Mathew 2008). Finally, we obtain the $ℳ O$ Fréchet (Krishna et al. 2013) from Equation (8) by setting H(x; ξ)=x^−γ. Table 1 displays some useful quantities and corresponding parameter vectors for special distributions.

A general approximate goodness-of-fit test for the null hypothesis H₀:X₁,…,X_n with X_i following F(x;θ), where the form of F is known but the p-vector θ=(δ,α,ξ)^⊤ is unknown, was proposed by Chen and Balakrishnan (1995). This method is based on the Cramér-von Mises (CM) and Anderson-Darling (AD) statistics and, in general, the smaller the values of these statistics, the better the fit. In this paper, such methodology is applied to provide goodness-of-fit tests for the distributions under study.

Some results in the following sections can be obtained numerically in any software such as MAPLE (Garvan 2002), MATLAB (Sigmon and Davis 2002), MATHEMATICA (Wolfram 2003), Ox (Doornik 2007) and R (R Development Core Team 2009). The Ox (for academic purposes) and R are freely available at http://www.doornik.com and http://www.r-project.org, respectively. The results can be computed by taking in the sums a large positive integer value in place of ∞.

2.1 Expansions for the density function

For any positive real number a, and for |z|<1, we have the generalized binomial expansion

{(1 - z)}^{- a} = \sum_{k = 0}^{\infty} \frac{{(a)}_{k}}{k!} z^{k},

(9)

where (a)_k=Γ(a+k)/Γ(a)=a(a+1)…(a+k−1) is the ascending factorial and Γ(·) is the gamma function. Applying (9) to (8), for 0<δ<1, gives

f (x; δ, α, ξ) = \sum_{j = 0}^{\infty} η_{j} g (x; (j + 1) α, ξ),

(10)

where $η_{j} = δ \bar{δ} j^{}$ and g(x;(j+1)α,ξ) denotes the $E W$ density function with parameters (j+1)α and ξ. Otherwise, for δ>1, after some algebra, we can express (8) as

f (x; δ, α, ξ) = \frac{g (x; α, ξ)}{δ {\{1 - (1 - 1 / δ) [1 - exp (- αH (x; ξ))]\}}^{2}} .

(11)

In this case, we can verify that |(1−1/δ)[1− exp(−α H(x;ξ))]|<1. Then, applying twice the expansion (9) in Equation (11), we obtain

f (x; δ, α, ξ) = \sum_{j = 0}^{\infty} ν_{j} g (x; (j + 1) α, ξ),

(12)

where

ν_{j} = ν_{j} (δ) = \frac{{(- 1)}^{j}}{δ (j + 1)!} \sum_{k = j}^{\infty} (k + 1)! {(1 - 1 / δ)}^{k} .

We can verify that $\sum_{j = 0}^{\infty} η_{j} = \sum_{j = 0}^{\infty} ν_{j} = 1$ . Then, the $ℳ O E W$ density function can be expressed as an infinite linear combination of $E W$ densities. Equations (10) and (12) have the same form except for the coefficients η j′s in (10) and ν j′s in (12). They depend only on the generator parameter δ. For simplicity, we can write

f (x; δ, α, ξ) = \sum_{j = 0}^{\infty} w_{j} g (x; (j + 1) α, ξ),

(13)

where

w_{j} = \{\begin{array}{l} η_{j}, & if 0 < δ < 1, \\ ν_{j}, & if δ > 1, \end{array}

and η_j and ν_j are given by (10) and (12), respectively. Thus, some mathematical properties of (13) can be obtained directly from those $E W$ properties. For example, the ordinary, incomplete, inverse and factorial moments and the mgf of X follow immediately from those quantities of the $E W$ distribution.

3 General properties

3.1 Moments, generating function and incomplete moments

The n th ordinary moment of X can be obtained from (13) as

\begin{array}{lcr} E (X^{n}) & = & \sum_{j = 0}^{\infty} w_{j} E (Y_{j}^{n}), \end{array}

where from now on $Y_{j} \sim E W ((j + 1) α, ξ)$ denotes a random variable having the $E W$ density function g(y;(j+1)α,ξ).

The mgf and the k th incomplete moment of X follow from (13) as

M_{X} (t) = E (e^{tX}) = \sum_{j = 0}^{\infty} w_{j} M_{j} (t)

and

\begin{array}{lcr} T_{k} (z) = \sum_{j = 0}^{\infty} w_{j} T_{k}^{(j)} (z), \end{array}

(14)

where M_j(t) is the mgf of Y_j and $T_{k}^{(j)} (z) = \int_{- \infty}^{z} x^{k} g (x; (j + 1) α, ξ) d x$ comes directly from the $E W$ model.

3.2 Quantile function and random number generator

The qf of X follows by inverting (5) and it can be expressed in terms of H⁻¹(·) as

Q (u) = H^{- 1} (\frac{1}{α} log (\frac{1 - \bar{δ} u}{1 - u}), ξ) .

(15)

In Table 2, we provide the function H⁻¹(x;ξ) for some special models.

Table 2 The H ⁻¹ ( x ; ξ ) function

Full size table

Hence, the generator for X can be given by the algorithm:

The $ℳ O E W$ distributions can be very useful in modeling lifetime data and practitioners may be interested in fitting one of these models. We provide a script using the R language to generate the density, distribution function, hrf, qf, random numbers, Anderson-Darling test, Cramer-von Mises test and likelihood ratio (LR) tests. This script can be be obtained from the authors upon requested.

3.3 Mean deviations

The mean deviations of X about the mean and the median are given by

δ_{1} = \int_{D} | x - μ | f (x; δ, α, ξ) d x and δ_{2} = \int_{D} | x - M | f (x; δ, α, ξ) d x,

respectively, where μ=E(X) denotes the mean and M=M e d i a n(X) the median. The median follows from the nonlinear equation F(M;δ,α,ξ)=1/2. So, these quantities reduce to

δ_{1} = 2 μ F (μ; δ, α, ξ) - 2 T_{1} (μ) and δ_{2} = μ - 2 T_{1} (M),

where T₁(z) is the first incomplete moment of X obtained from (14) as

T_{1} (z) = \sum_{j = 0}^{\infty} w_{j} T_{1}^{(j)} (z),

and $T_{1}^{(j)} (z) = \int_{- \infty}^{z} x g (x; (j + 1) α, ξ) d x$ is the first incomplete moment of Y_j.

An important application of the mean deviations is related to the Bonferroni and Lorenz curves. These curves are useful in economics, reliability, demography, medicine and other fields. For a given probability p, they are defined by B(p)=T₁(q)/(p μ) and L(p)=T₁(q)/μ, respectively, where q=Q(p) is the qf of X given by (15) at u=p.

3.4 Average lifetime and mean residual lifetime functions

The average lifetime is given by

\begin{array}{lcr} t_{m} = \int_{0}^{\infty} [1 - F (x; δ, α, ξ)] d x = \sum_{j = 0}^{\infty} w_{j} \int_{0}^{\infty} \bar{G} (x; (j + 1) α, ξ) d x. \end{array}

In fields such as actuarial sciences, survival studies and reliability theory, the mean residual lifetime has been of much interest; see, for a survey, Guess and Proschan (1988). Given that there was no failure prior to x₀, the residual life is the period from time x₀ until the time of failure. The mean residual lifetime is given by

\begin{array}{lcr} m (x_{0}; δ, α, ξ) & = & E (X - x_{0} | X \geq x_{0}; δ, α, ξ) = \int_{{x : x > x_{0}}} \frac{(x - x_{0}) f (x; δ, α, ξ)}{Pr (X > x_{0})} d x \\ = & {[Pr (X > x_{0})]}^{- 1} \int_{0}^{\infty} y f (x_{0} + y; δ, α, ξ) d y \\ = & {[\bar{F} (x_{0}; δ, α, ξ)]}^{- 1} \sum_{j = 0}^{\infty} w_{j} \int_{0}^{\infty} y g (x_{0} + y; (j + 1) α, ξ) d y. \end{array}

The last integral can be computed from the baseline $E W$ distribution. Further, m(x₀;δ,α,ξ)→E(X) as x₀→0.

4 Information theory measures

The seminal idea about information theory was pioneered by Hartley (1928), who defined a logarithmic measure of information for communication. Subsequently, Shannon (1948) formalized this idea by defining the entropy and mutual information concepts. The relative entropy notion (which would later be called divergence) was proposed by Kullback and Leibler (1951). The Kullback-Leibler’s measure can be understood like a comparison criterion between two distributions. In this section, we derive two classes of entropy measures and one class of divergence measures which can be understood as new goodness-of-fit quantities such those discussed by Seghouane and Amari (2007). All these measures are defined for one element or between two elements in the $ℳ O E W$ family.

4.1 Rényi entropy

The Rényi entropy of X with pdf (8) is given by

H_{R}^{s} (X) = \frac{1}{1 - s} log (\int_{D} f {(x; δ, α, ξ)}^{s} d x),

where s∈(0,1)∪(1,∞).

It is a difficult problem to obtain $H_{R}^{s} (X)$ in closed-form for the $ℳ O E W$ family. So, we derive an expansion for this quantity.

By using (9), f(x;δ,α,ξ)^s can be expanded as

f {(x; δ, α, ξ)}^{s} = \sum_{j = 0}^{\infty} w_{j}^{'} exp [- (j + s) αH (x; ξ)] h {(x; ξ)}^{s},

(16)

where

w_{j}^{'} = \{\begin{array}{l} η_{j}^{'} (α, δ) = \frac{α^{s} δ^{s} {(2 s)}_{j} {\bar{δ}}^{j}}{j!}, & for 0 < δ < 1, \\ ν_{j}^{'} (α, δ) = \frac{α^{s} δ^{- s}}{j!} \sum_{k = 0}^{\infty} \frac{{(2 s)}_{k} {(k)}_{j}}{k!} {(1 - 1 / δ)}^{k}, & for δ > 1 . \end{array}

The proof of this expansion is given in Appendix 8.

Finally, based on Equation (16), the Rényi entropy can be expressed as

H_{R}^{s} (X) = \frac{1}{1 - s} log \{\sum_{j = 0}^{\infty} w_{j}^{'} \int_{D} exp [- (j + s) αH (x; ξ)] h {(x; ξ)}^{s} d x\} .

An advantage of this expansion is its dependence of an integral which has closed-form for some $E W$ distributions.

4.2 Shannon entropy

The Shannon entropy of X is given by

H_{S} (X) = E_{X} \{- log [f (X; δ, α, ξ)]\},

where the log-likelihood function corresponding to one observation follows from (8) as

\begin{align} log [f (x; δ, α, ξ)] = log (δα) + log [h (x; ξ)] - αH (x; ξ) - 2 log \{1 - \bar{δ} exp [- αH (x; ξ)]\} . \end{align}

Thus, it can be reduced to

H_{S} (X) = - log (αδ) + 2 E \{log [1 - \bar{δ} \bar{G} (X; ξ)]\} - E \{log [h (X; ξ)]\} + α E [H (X; ξ)] .

4.3 Cross entropy and Kullback-Leibler divergence and distance

Let X and Y be two random variables with common support $R_{+}$ whose densities are f_X(x;θ₁) and f_Y(y;θ₂), respectively. Cover and Thomas (1991) defined the cross entropy as

C_{X} (Y) = E_{X} \{- log [f_{Y} (X; θ_{2})]\} = - \int_{0}^{\infty} f_{X} (z; θ_{1}) log [f_{Y} (z; θ_{2})] d z.

We consider that $X \sim ℳ O E W (δ_{x}, α_{x}, ξ_{x})$ and $Y \sim ℳ O E W (δ_{y}, α_{y}, ξ_{y})$ . After some algebraic manipulations, we obtain

\begin{align} C_{X} (Y) & = - \int_{D} f_{X} (z; δ_{x}, α_{x}, ξ_{x}) log [f_{Y} (z; δ_{y}, α_{y}, ξ_{y})] d z \\ = - log (δ_{y} α_{y}) - E_{X} \{log [h (X; ξ_{y})]\} + α_{y} E_{X} [H (X; ξ_{y})] \\ + 2 E_{X} \{log [1 - \bar{δ} \bar{G} (X; ξ_{y})]\} . \end{align}

(17)

An important measure in information theory is the Kullback-Leibler divergence given by

D (X | | Y) = C_{X} (Y) - H_{S} (X) = E_{X} \{log [\frac{f_{X} (X; δ_{x}, α_{x}, ξ_{x})}{f_{Y} (X; δ_{y}, α_{y}, ξ_{y})}]\} .

(18)

Applying (4.2) and (17) in Equation (18) gives

\begin{align} D (X | | Y) = & log (\frac{δ_{x} α_{x}}{δ_{y} α_{y}}) + E_{X} \{log [\frac{h (X; ξ_{x})}{h (X; ξ_{y})}]\} + 2 E_{X} \{log [\frac{1 - \bar{δ} \bar{G} (X; ξ_{y})}{1 - \bar{δ} \bar{G} (X; ξ_{x})}]\} \\ + α_{y} E_{X} [H (X; ξ_{y})] - α_{x} E_{X} [H (X; ξ_{x})] . \end{align}

(19)

According to Cover and Thomas (1991), the Kullback-Leibler measure D(X||Y) is the quantification of the error considering that the Y model is true when the data follow the X distribution. For example, this measure has been proposed as essential parts of test statistics, which has seen strongly applied to contexts of radar synthetic aperture image processing in both univariate (Nascimento et al. 2010) and polarimetric (or multivariate) (Nascimento et al. 2014) perspectives.

In order to work with measures that satisfied the non-negativity, symmetry and definiteness properties, Nascimento et al. (2010) considered the symmetrization of (19)

\begin{align} d_{KL} (X, Y) & = \frac{1}{2} [D (X | | Y) + D (Y | | X)] \\ = \int_{D} \underset{\equiv IntegrandKL (x, y)}{\underset{⏟}{(f_{X} (x; δ_{x}, α_{x}, ξ_{x}) - f_{Y} (x; δ_{y}, α_{y}, ξ_{y})) log (\frac{f_{X} (x; δ_{x}, α_{x}, ξ_{x})}{f_{Y} (x; δ_{x}, α_{x}, ξ_{x})})}} d x, \end{align}

which is given by

\begin{array}{l} 2 d_{KL} (X, Y) & = α_{y} \{E_{X} [H (X; ξ_{y})] - E_{Y} [H (Y; ξ_{y})]\} + α_{x} \{E_{Y} [H (Y; ξ_{x})] - E_{X} [H (X; ξ_{x})]\} \\ + E_{X} \{log [\frac{h (X; ξ_{x})}{h (X; ξ_{y})}]\} + E_{Y} \{log [\frac{h (Y; ξ_{y})}{h (Y; ξ_{x})}]\} \\ + 2 E_{X} \{log [\frac{1 - \bar{δ} \bar{G} (X; ξ_{y})}{1 - \bar{δ} \bar{G} (X; ξ_{x})}]\} + 2 E_{Y} \{log [\frac{1 - \bar{δ} \bar{G} (Y; ξ_{x})}{1 - \bar{δ} \bar{G} (Y; ξ_{y})}]\} . \end{array}

(20)

Although this measure does not satisfy the triangle inequality, it is usually called the Kullback-Leibler distance (Jensen-Shannon divergence). The new measure can be used to answer questions like “how could one quantify the difference in selecting the Phani model with three parameters as the baseline distribution instead of the Weibull Kies distribution which has four parameters?”.

As an illustration for (20), we initially consider two distinct elements of the generated special model from the specifications: H(x;β)=β⁻¹[ exp(β x)−1] and h(x;β)= exp(β x) in (8). This model will be presented with more details in future sections and its parametric space is represented by the vector (δ,α,β). Suppose that we are interested in quantifying the influence of a nuisance degree ε in the parameter α over the distance between two distinct elements, (2,1,3) and (2,1+ε,3), at such parametric space. Figure 1(a) displays the integrand of (20) for ε=0.1, 1, 2 and 4 for which the distances (or areas) associated with d_KL(X,Y) are 6.50×10⁻³, 3.56×10⁻¹, 9.46×10⁻¹ and 2.25, respectively. It is notable that d_KL(X,Y) takes smaller values for more closer points (or, equivalently, for more closer fits) and, therefore, (20) consists of new goodness-of-fit measures. In Figures 1(b) and 1(c), we show the influence of η=α/β on d_KL([δ,α,β],[δ,α,β+ε]) (for β=δ=3 and α∈{1,3,9}) and of δ on d_KL([δ,α,β],[δ+ε,α,β]) (for β=α=3 and δ∈{3,4,5}). For all cases, the contamination ε takes values in the interval (−2.9,2.9).

5 Estimation

Here, we present a general procedure for estimating the $ℳ O E W$ parameters from one observed sample and from multi-censored data. Additionally, we provide a discussion about how one can test the significance of additional parameter at the proposed class. Let x₁,…,x_n be a sample of size n from X. The log-likelihood function for the vector of parameters θ=(δ,α,ξ^⊤)^⊤ can be expressed as

\begin{align} ℓ (θ) = n log (δα) + \sum_{i = 1}^{n} log [h (x_{i}; ξ)] - α \sum_{i = 1}^{n} H (x_{i}; ξ) - 2 \sum_{i = 1}^{n} log \{1 - \bar{δ} exp [- αH (x_{i}; ξ)]\} . \end{align}

From the above log-likelihood, the components of the score vector, $U (θ) = {(U_{δ}, U_{α}, U_{ξ}^{⊤})}^{⊤}$ , are given by

\begin{align} U_{δ} (θ) = \frac{∂ℓ (θ)}{∂δ} = & \frac{n}{δ} - 2 \sum_{i = 1}^{n} \frac{exp [- αH (x_{i}; ξ)]}{1 - \bar{δ} exp [- αH (x_{i}; ξ)]}, \\ U_{α} (θ) = \frac{∂ℓ (θ)}{∂α} = & \frac{n}{α} - \sum_{i = 1}^{n} H (x_{i}; ξ) - 2 \bar{δ} \sum_{i = 1}^{n} \frac{H (x_{i}; ξ) exp [- αH (x_{i}; ξ)]}{1 - \bar{δ} exp [- αH (x_{i}; ξ)]} and \\ U_{ξ_{k}} (θ) = \frac{∂ℓ (θ)}{\partial ξ_{k}} = & \sum_{i = 1}^{n} \frac{1}{h (x_{i}; ξ)} \frac{∂h (x_{i}; ξ)}{\partial ξ_{k}} - α \sum_{i = 1}^{n} \frac{∂H (x_{i}; ξ)}{\partial ξ_{k}} \\ - 2 \bar{δ} α \sum_{i = 1}^{n} \frac{∂H (x_{i}; ξ)}{\partial ξ_{k}} \frac{exp [- αH (x_{i}; ξ)]}{1 - \bar{δ} exp [- αH (x_{i}; ξ)]} . \end{align}

Finally, the partitioned observed information matrix for the $ℳ O E W$ family is

whose elements are

\begin{array}{l} U_{δδ} (θ) = - n δ^{- 2}, U_{δα} (θ) = 2 \sum_{i = 1}^{n} \frac{H (x_{i}; ξ) exp [- αH (x_{i}; ξ)]}{{\{1 - \bar{δ} exp [- αH (x_{i}; ξ)]\}}^{2}}, \\ U_{δ ξ_{k}} (θ) = 2 α \sum_{i = 1}^{n} \frac{∂H (x_{i}; ξ)}{\partial ξ_{k}} \frac{exp [- αH (x_{i}; ξ)]}{{\{1 - \bar{δ} exp [- αH (x_{i}; ξ)]\}}^{2}}, \\ U_{αα} (θ) = - \frac{n}{α^{2}} + 2 \bar{δ} \sum_{i = 1}^{n} \frac{H {(x_{i}; ξ)}^{2} exp [- αH (x_{i}; ξ)]}{{\{1 - \bar{δ} exp [- αH (x_{i}; ξ)]\}}^{2}}, \\ U_{α ξ_{k}} (θ) = - 2 \bar{δ} \sum_{i = 1}^{n} \frac{∂H (x_{i}; ξ)}{\partial ξ_{k}} \frac{exp [- αH (x_{i}; ξ)]}{1 - \bar{δ} exp [- αH (x_{i}; ξ)]} [1 - \frac{αH (x_{i}; ξ)}{1 - \bar{δ} exp [- αH (x_{i}; ξ)]}] \\ + \sum_{i = 1}^{n} \frac{∂H (x_{i}; ξ)}{\partial ξ_{k}} and \\ U_{ξ_{k} ξ_{j}} (θ) = \sum_{i = 1}^{n} \frac{1}{h (x_{i}; ξ)} [\frac{\partial^{2} h (x_{i}; ξ)}{\partial ξ_{k} ξ_{j}} - \frac{1}{h (x_{i}; ξ)} \frac{∂h (x_{i}; ξ)}{\partial ξ_{k}} \frac{∂h (x_{i}; ξ)}{\partial ξ_{j}}] - α \sum_{i = 1}^{n} \frac{\partial^{2} H (x_{i}; ξ)}{\partial ξ_{k} ξ_{j}} \\ - 2 α \bar{δ} \sum_{i = 1}^{n} \frac{exp [- αH (x_{i}; ξ)]}{1 - \bar{δ} exp [- αH (x_{i}; ξ)]} [\frac{\partial^{2} H (x_{i}; ξ)}{\partial ξ_{k} ξ_{j}} - \frac{∂H (x_{i}; ξ)}{\partial ξ_{k}} \frac{αH (x_{i}; ξ)}{1 - \bar{δ} exp [- αH (x_{i}; ξ)]}] . \end{array}

When some standard regularity conditions are satisfied (Cox and Hinkley 1974), one can verify that $\sqrt{n} ({[\hat{α}, \hat{δ}, \hat{ξ}]}^{⊤} - {[α, δ, ξ]}^{⊤})$ converges in distribution to the multivariate $N_{p + 2} (0, K {([α, δ, ξ])}^{- 1})$ distribution, where p denotes the dimension of ξ and $K ([α, δ, ξ])$ is the expected information matrix for which the limit identity ${lim}_{n \to \infty} J_{n} ([α, δ, ξ]) = K ([α, δ, ξ])$ is satisfied. Based on this result, one can compute confidence regions for the $ℳ O E W$ parameters. Such regions can be used as decision criteria in several practical situations.

For checking if δ is statistically different from one, i.e. for testing the null hypothesis H₀:δ=1 against H₁:δ≠1, we use the LR statistic given by $LR = 2 \{ℓ (\hat{θ}) - ℓ (\tilde{θ})\}$ , where $\hat{θ}$ is the vector of unrestricted MLEs under H₁ and $\tilde{θ}$ is the vector of restricted MLEs under H₀. Under the null hypothesis, the limiting distribution of LR is a $χ_{1}^{2}$ distribution. If the test statistic exceeds the upper 100(1−α)% quantile of the $χ_{1}^{2}$ distribution, then we reject the null hypothesis.

Censored data occur very frequently in lifetime data analysis. Some mechanisms of censoring are identified in the literature as, for example, types I and II censoring (Lawless 2003). Here, we consider the general case of multi-censored data: there are n=n₀+n₁+n₂ subjects of which n₀ is known to have failed at the times $x_{1}, \dots, x_{n_{0}}$ , n₁ is known to have failed in the interval [ s_i−1,s_i], i=1,…,n₁, and n₂ survived to a time r_i, i=1,…,n₂, but not observed any longer. Note that type I censoring and type II censoring are contained as particular cases of multi-censoring. The log-likelihood function of θ=(δ,α,ξ^⊤)^⊤ for this multi-censoring data reduces to

\begin{array}{lcr} ℓ (θ) & = & n_{0} log (δα) + \sum_{i = 1}^{n_{0}} log [h (x_{i}; ξ)] - α \sum_{i = 1}^{n_{0}} H (x_{i}; ξ) - 2 \sum_{i = 1}^{n_{0}} log \{1 - \bar{δ} exp [- αH (x_{i}; ξ)]\} \\ + \sum_{i = 1}^{n_{1}} log \{\frac{1 - exp [- α H (s_{i}; ξ)]}{1 - \bar{δ} exp [- α H (s_{i}; ξ)]} - \frac{1 - exp [- α H (s_{i - 1}; ξ)]}{1 - \bar{δ} exp [- α H (s_{i - 1}; ξ)]}\} \\ + n_{2} log (δ) - α \sum_{i = 1}^{n_{2}} H (r_{i}; ξ) - 2 \sum_{i = 1}^{n_{2}} log \{1 - \bar{δ} exp [- αH (r_{i}; ξ)]\} . \end{array}

(21)

The score functions and the observed information matrix corresponding to (21) is too complicated to be presented here.

6 Two special models

In this section, we study two special $ℳ O E W$ models, namely the Marshall-Olkin modified Weibull ( $ℳ O ℳ W$ ) and Marshall-Olkin Gompertz ( $ℳ O G$ ) distributions. We provide plots of the density and hazard rate functions for some parameters to illustrate the flexibility of these distributions.

6.1 The $ℳ O ℳ W$ model

For H(x;λ,γ)=x^γ exp(λ x) and h(x;λ,γ)=x^γ−1 exp(λ x)(γ+λ x), we obtain the $ℳ O ℳ W$ distribution. Its density function is given by

\begin{align} f (x; α, δ, λ, γ) = δα (γ + λx) x^{γ - 1} \frac{exp [λx - α x^{γ} exp (λx)]}{{\{1 - \bar{δ} exp [- α x^{γ} exp (λx)]\}}^{2}}, x > 0, \end{align}

where λ,γ≥0. If δ=1, it leads to the special case of the modified Weibull ( $ℳ W$ ) distribution (Lai et al.2003). In addition, when λ=0, it gives the Weibull distribution. Its cdf and hrf are given by

F (x; α, δ, λ, γ) = \frac{1 - exp [- α x^{γ} exp (λx)]}{1 - \bar{δ} exp [- α x^{γ} exp (λx)]}

and

τ (x; α, δ, λ, γ) = \frac{α x^{γ - 1} exp (λx) (γ + λx)}{1 - \bar{δ} exp [- α x^{γ} exp (λx)]},

respectively. In Figures 2(a), 2(b), 2(c) and 2(d), we note some different shapes of the $ℳ O ℳ W$ pdf. Further, Figures 3(a), 3(b), 3(c) and 3(d) display plots of the $ℳ O ℳ W$ hrf, which can have increasing, decreasing, non-monotone and bathtub forms.

The r th raw moment of the $ℳ O ℳ W$ distribution comes from (13) as

E (X^{r}) = \sum_{j = 1}^{\infty} w_{j} μ_{r} (j),

(22)

where $μ_{r} (j) = \int_{0}^{\infty} x^{r} g (x; (j + 1) α, γ, λ)) d x$ denotes the r th raw moment of the $ℳ W$ distribution with parameters (j+1)α,γ and λ. Carrasco et al. (2008) determined an infinite representation for μ_r(j) given by

μ_{r} (j) = \sum_{i_{1}, \dots, i_{r} = 1}^{\infty} \frac{A_{i_{1}, \dots, i_{r}} Γ (s_{r} / γ + 1)}{{[(j + 1) α]}^{s_{r} / γ}},

(23)

where

A_{i_{1}, \dots, i_{r}} = a_{i_{1}}, \dots, a_{i_{r}} and s_{r} = i_{1}, \dots, i_{r},

and

a_{i} = \frac{{(- 1)}^{i + 1} i^{i - 2}}{(i - 1)!} {(\frac{λ}{γ})}^{i - 1} .

Hence, the $ℳ O ℳ W$ moments can be obtained directly from (22) and (23).

Let x₁,…,x_n be a sample of size n from $X \sim ℳ O ℳ W (α, δ, λ, γ)$ . The log-likelihood function for the vector of parameters θ=(α,δ,λ,γ)^⊤ can be expressed as

\begin{array}{lcr} ℓ (θ) & = & n log (δα) + \sum_{i = 1}^{n} log (γ + λ x_{i}) + (γ - 1) \sum_{i = 1}^{n} log (x_{i}) + λ \sum_{i = 1}^{n} x_{i} - α \sum_{i = 1}^{n} x_{i}^{λ} exp (λ x_{i}) \\ - 2 \sum_{i = 1}^{n} log (1 - \bar{δ} exp [- α x_{i}^{γ} exp (λ x_{i})]) . \end{array}

6.2 The $ℳ O G$ model

For H(x;β)=β⁻¹[ exp(β x)−1] and h(x;β)= exp(β x), we obtain the $ℳ O G$ distribution. Its pdf is given by

\begin{align} f (x; α, δ, β) = \frac{δα exp \{βx - α / β [exp (βx) - 1]\}}{{\{1 - \bar{δ} exp \{- α / β [exp (βx) - 1]\}\}}^{2}}, x > 0, \end{align}

where −∞<β<∞. For δ=1, it follows the Gompertz distribution as a special case. The $ℳ O G$ model is a special case of the Marshall-Olkin Makeham distribution (EL-Bassiouny and Abdo 2009). The cdf and hrf of the $ℳ O G$ distribution are given by

F (x; α, δ, β) = \frac{1 - exp \{- α / β [exp (βx) - 1]\}}{1 - \bar{δ} exp \{- α / β [exp (βx) - 1]\}}

and

τ (x; α, δ, β) = \frac{α exp (βx)}{1 - \bar{δ} exp \{- α / β [exp (βx) - 1]\}} .

Figures 4(a), 4(b) and 4(c) display some plots of the density functions for some values of α, δ and β. The hrf of the Gompertz distribution is increasing (β>0) and decreasing (β<0). Besides these two forms, Figures 5(a), 5(b) and 5(c) indicate that the $ℳ O G$ hrf can be bathtub shaped.

From Equation (15), the $ℳ O G$ qf becomes

Q (u) = β^{- 1} log [\frac{β}{α} log (\frac{1 - \bar{δ} u}{1 - u}) + 1] .

Let x₁,…,x_n be a sample of size n from the $ℳ O G$ model. The log-likelihood function for the vector of parameters θ=(δ,α,β)^⊤ can be expressed as

\begin{array}{l} ℓ (θ) & = n log (δα) + β \sum_{i = 1}^{n} x_{i} - \frac{α}{β} \sum_{i = 1}^{n} [exp (β x_{i}) - 1] \\ - 2 \sum_{i = 1}^{n} log (1 - \bar{δ} exp \{- α [exp (β x_{i}) - 1] / β\}) . \end{array}

7 Simulation and applications

This section is divided in two parts. First, we perform a simulation study in order to assess the performance of the MLEs on some points at the parametric space of one of the special models. Second, an application to real data provides evidence in favor of one distribution in the $ℳ O E W$ class.

7.1 Simulation study

We present a simulation study by means of Monte Carlo’s experiments in order to assess the performance of the MLEs described in Section 5. To that end, we work with the $ℳ O G$ distribution. One of advantages of this model is that its cdf has tractable analytical form. This fact implies in a simple random number generation (RNG) determined by the $ℳ O G$ qf given in Section 6.2. The $ℳ O G$ generator is illustrated in Figure 6.

The simulation study is conducted in order to quantify the influence of η=α/β over the estimation of the extra parameter δ. It is known that η>1 gives the Gompertz distribution which presents mode at zero or, for η<1, having their modes at x^∗=β⁻¹ [1 − log(η)]. An initial discussion using the Kullback-Leibler distance derived in Section 4.3 points out that increasing the contamination (or the bias of the estimates) can affect the quality of fit.

In this study, the following scenarios are taken into account. For the sample size n=50,100,150,200, we adopt as the true parameters the following cases:

Scenario η<1: (α,β)=(1,2) and δ∈{0.3,1,4};
Scenario η=1: (α,β)=(2,2) and δ∈{0.3,1,4};
Scenario η>1: (α,β)=(4,2) and δ∈{0.3,1,4}.

Also, we use 10,000 Monte Carlo’s replications and, at each one of them, we quantify (i) the average of the MLEs and (ii) the mean square error (MSEs).

Table 3 gives the results of the simulation study. In general, the MLEs present smaller values of the biases and MSEs when the sample size increases. It is important to highlight the following atypical case: for the MLEs of α at the scenarios (α,δ,β)∈{(1,4,2),(2,1,2),(4,0.3,2),(4,1,2)} and of δ at (4,0.3,2), the associated biases do not have an inverse monotonic relationship with sample sizes, as expected.However, based on the fact that their MSEs tend to zero, we can expect that there exists a sample size n₀ such that biases of the MLEs decrease when the sample sizes increase from n₀.

Table 3 Performance of the MLEs for the $ℳ O G$ distribution

Full size table

The results provide evidence that the scenarios under the condition η>1 yield a hard estimation (having larger variation ranges of the MSEs than those obtained for the cases when η<1) for α and β parameters, and that the MLEs present smaller values of the MSEs under such conditions. Figure 7 illustrates the above behavior for the cases δ∈{0.3,0.8,1,2,4} and n=200. In summary, the scenario with less numerical problems is (η,δ)=(2,0.1), whereas that one which requires more attention for estimating the $ℳ O G$ parameters is (η,δ)=(0.5,4).

7.2 Applications

Here, the usefulness of the $ℳ O E W$ distribution is illustrated by means of two real data sets.

7.2.1 Uncensored data

Here, we compare the fits of some special models of the $ℳ O E W$ family using a real data set. The estimation of the model parameters is performed by the maximum likelihood method discussed in Section 5. We use the maxLik function of the maxLik package in R language. In this function, if the argument “method” is not specified, a suitable method is selected automatically. For this application, we use the Newton-Raphson method. The data represent the percentage of body fat determined by underwater weighing for 250 men. For more details about the data see http://lib.stat.cmu.edu/datasets/bodyfat.

Table 4 provides some descriptive measures. They suggest an empirical distribution which is slightly asymmetric and platykurtic.

Table 4 Descriptive statistics

Full size table

We compare the classical models and generalized models within the $ℳ O$ family. The null hypothesis H₀:δ=1 is tested against H₁:δ≠1 using the LR statistic. The comparisons are presented in Table 5. For the $ℳ O W$ and $ℳ O E P$ models, one cannot say that the parameter δ is statistically different from one at the 10% significance level. Based on this result, we fit the , exponential power ( $E P$ ), $ℳ O G$ and Marshall-Olkin flexible Weibull extension ( $ℳ O F W E$ ) models to the current data (see Table 1). These models are compared with two other three-parameter models, namely: the modified Weibull ( $ℳ W$ ) and generalized Birnbaum-Saunders ( $G ℬ S$ ) (Owen 2006) distributions. The $G ℬ S$ density is given by

f (x; ϕ, η, κ) = \frac{1}{ϕ \sqrt{2 πη} x^{κ}} (1 - κ + \frac{ηκ}{x}) exp [- \frac{1}{2 ϕ^{2}} \frac{{(x - η)}^{2}}{η x^{2 κ}}], x > 0 .

Table 5 Comparison of fitted models using the LR test

Full size table

In Table 6, we present the MLEs (standard errors in parentheses) of the parameters of the fitted $ℳ O F W E$ , $ℳ O G$ , $E P$ , , $ℳ W$ and $G ℬ S$ distributions. Also, we provide the goodness-of-fit measures (p-values in parentheses). Thus, these values indicate that the null models are strongly rejected for the $ℳ O F W E$ and $ℳ O G$ distributions, since the associated p-values are much lower than 0.001.

Table 6 MLEs and goodness-of-fit statistics

Full size table

Table 7 gives the values of the Akaike information criterion (AIC), Bayesian information criterion (BIC), consistent Akaike information criterion (CAIC) and Hannan-Quinn information criterion (HQIC). Since the values of the AIC, CAIC and HQIC are smaller for the $ℳ O F W E$ distribution compared to those values of the other fitted models. Thus, this new distribution seems to be a very competitive model to explain the current data.

Table 7 Statistics AIC, BIC, CAIC and HQIC

Full size table

Figures 8(a) and 8(b) display the estimated density and survival functions of the $ℳ O F W E$ distribution. The plots confirm the excellent fit of this distribution to the data. Figure 8(c) shows that the estimated $ℳ O F W E$ hrf is an increasing curve.

7.2.2 Censored data

Now, we consider a set of remission times from 137 cancer patients [Lee and Wang (2003), pag. 231]. Lee and Wang (2003) showed that the log-logistic ( $ℒ ℒ$ ) model provides a good fit to the data. Ghitany et al. (2005) compared the fits of the $ℳ O W$ and models to these data. Now, we present a more detailed study by comparing the fitted , $ℒ ℒ$ , $E P$ , $ℳ O W$ , Marshall-Olkin log-logistic ( $ℳ O ℒ ℒ$ ), $ℳ O E P$ and $G ℬ S$ models to these data. The functions H(x;γ,c)= log(1+γ x^c) and h(x;γ,c)=γ c x^c−1/(1+γ x^c) are associated with the $ℒ ℒ$ model.

The hypothesis that the underlying distribution is (or $E P$ ) versus the alternative hypothesis that the distribution is the $ℳ O W$ (or $ℳ O E P$ ) is rejected with p-value = 0.0055 (or p-value = <0.0001). Further, the hypothesis test that the underlying distribution is $ℒ ℒ$ versus the $ℳ O ℒ ℒ$ distribution yields the p-value =1.0000. Thus, we compare the $ℳ O W$ , $ℳ O E P$ , $ℒ ℒ$ and $G ℬ S$ models to determine which model gives the best fit to the current data.

Table 8 lists the MLEs (and corresponding standard errors in parentheses) of the parameters and the values of the AD and CM statistics (their p-values in parentheses). The figures in this table, specially the p-values, suggest that the $ℳ O W$ distribution yields a better fit to these data than the other three distributions.

Table 8 MLEs and goodness-of-fit statistics

Full size table

Table 9 lists the values of the AIC, BIC, CAIC and HQIC statistics. The figures in this table indicate that there is a competitiveness among the $ℳ O W$ , $ℳ O E P$ and $ℒ ℒ$ models. However, if we observe the Figures 9(a), 9(b) and 9(c), we note that the $ℳ O W$ and $ℳ O E P$ models present better fits to the current data.

Table 9 Statistics AIC, BIC, CAIC and HQIC

Full size table

Figure 9(d) really shows that the $ℳ O W$ and $ℳ O E P$ distributions present good fits to the current data. We can conclude that the $ℳ O W$ and $ℳ O E P$ distributions are excellent alternatives to explain this data set.

8 Conclusion

In this paper, the Marshall-Olkin extended Weibull family of distributions is proposed and some of its mathematical properties are studied. The maximum likelihood procedure is used for estimating the model parameters. Two special models in the family are described with some details. In order to assess the performance of the maximum likelihood estimates, a simulation study is performed by means of Monte Carlo experiments. Special models of the proposed family are compared (through goodness-of-fit measures) with other well-known lifetime models by means of two real data sets. The proposed model outperforms classical lifetime models to these data.

Appendix: An expansion for f(x;δ,α,ξ)F(x;δ,α,ξ)^c

Here, we obtain an expansion for the quantity f(x;δ,α,ξ)F(x;δ,α,ξ)^c. First, we consider an expansion for F(x;δ,α,ξ)^c. Based on (5), the power of the cdf can be expressed as

F {(x; δ, α, ξ)}^{c} = \underset{\equiv A}{\underset{⏟}{{1 - exp [- αH (x; ξ)]}^{c}}} \underset{\equiv B}{\underset{⏟}{{1 - \bar{δ} exp [- αH (x; ξ)]}^{- c}}} .

Applying expansion (9), we have

A = \sum_{k = 0}^{\infty} {(- 1)}^{k} (\binom{c}{k}) exp [- kαH (x; ξ)] .

Now, we expand the quantity B. Equation (9) under the restriction δ<1 (implying that $\bar{δ} exp [- αH (x; ξ)] < 1$ ) yields

B = \sum_{j = 0}^{\infty} \frac{{(c)}_{j}}{j!} {\bar{δ}}^{j} exp [- jαH (x; ξ)] .

Moreover, it is clear that δ=1 implies B=1. Finally, for δ>1 (i.e., ${1 - \bar{δ} exp [- αH (x; ξ)]} > 1$ ), the quantity B can be rewritten as

B = {\{1 - [1 - {1 - \bar{δ} exp [- αH (x; ξ)]}^{- 1}]\}}^{c} .

Using the binomial expansion, we have

B = \sum_{j = 0}^{\infty} {(- 1)}^{j} (\binom{c}{j}) {[1 - {\{1 - \bar{δ} exp [- αH (x; ξ)]\}}^{- 1}]}^{j} .

Thus,

\begin{align} F {(x; δ, α, ξ)}^{c} & = I_{(δ < 1)} \sum_{j, k = 0}^{\infty} {(- 1)}^{k} \frac{{(c)}_{j}}{j!} (\binom{c}{k}) {\bar{δ}}^{j} exp [- (j + k) αH (x; ξ)] \\ + I_{(δ = 1)} \sum_{k = 0}^{\infty} {(- 1)}^{k} (\binom{c}{k}) exp [- kαH (x; ξ)] \\ + I_{(δ > 1)} \sum_{j, k = 0}^{\infty} {(- 1)}^{j + k} (\binom{c}{k}) (\binom{c}{j}) exp [- kαH (x; ξ)] \\ \times {[1 - {1 - \bar{δ} exp [- αH (x; ξ)]}^{- 1}]}^{j} . \end{align}

Hence, based on Equation (13), the following expansion holds

\begin{align} f (x; δ, α, ξ) F {(x; δ, α, ξ)}^{c} & = (\sum_{v = 0}^{\infty} w_{v} g (x; (v + 1) α, ξ)) F {(x; δ, α, ξ)}^{c} = I_{(δ < 1)} \sum_{j, k, v = 0}^{\infty} {(- 1)}^{k} \\ \times w_{v} \frac{{(c)}_{j}}{j!} (\binom{c}{k}) {\bar{δ}}^{j} exp [- (j + k) αH (x; ξ)] g (x; (v + 1) α, ξ) \\ + I_{(δ = 1)} \sum_{k, v = 0}^{\infty} {(- 1)}^{k} w_{v} (\binom{c}{k}) exp [- kαH (x; ξ)] g (x; (v + 1) α, ξ) \\ + I_{(δ > 1)} \sum_{j, k, v = 0}^{\infty} {(- 1)}^{j + k} w_{v} (\binom{c}{k}) (\binom{c}{j}) exp [- kαH (x; ξ)] \\ \times {[1 - {1 - \bar{δ} exp [- αH (x; ξ)]}^{- 1}]}^{j} g (x; (v + 1) α, ξ) . \end{align}

(24)

References

Bain LJ: Analysis for the linear failure-rate life-testing distribution. Technometrics 16: 551–559.
Article MathSciNet Google Scholar
Barreto-Souza W, Lemonte AJ, Cordeiro GM: General results for the Marshall and Olkin’s family of distributions. An. Acad. Bras. Cienc 85: 3–21.
Article MathSciNet Google Scholar
Bebbington M, Lai CD, Zitikis R: A flexible Weibull extension. Reliability Eng. Syst. Saf 92: 719–726.
Article Google Scholar
Carrasco JMF, Ortega EMM, Cordeiro GM: A generalized modified Weibull distribution for lifetime modeling. Comput. Stat. Data Anal 53: 450–462.
Article MathSciNet Google Scholar
Chen Z: A new two-parameter lifetime distribution with bathtub shape or increasing failure rate function. Stat. Probability Lett 49: 155–161.
Article MathSciNet Google Scholar
Chen G, Balakrishnan N: A general purpose approximate goodness-of-fit test. J. Qual. Technol 27: 154–161.
Google Scholar
Cordeiro GM, Lemonte AJ: On the Marshall-Olkin extended Weibull distribution. Stat. Paper 54: 333–353.
Article MathSciNet Google Scholar
Cover TM, Thomas JA: Elements of Information Theory. John Wiley & Sons, New York;
Book Google Scholar
Cox DR, Hinkley DV: Theoretical Statistics. Chapman and Hall, London;
Book Google Scholar
Doornik J: Ox 5: object-oriented matrix programming language. Timberlake Consultants, London; (2007)
Google Scholar
EL-Bassiouny AH, Abdo NF: Reliability properties of extended makeham distributions. Comput. Methods Sci. Technol 15: 143–149. (2009)
Article Google Scholar
Fisk PR: The graduation of income distributions. Econometrica 29: 171–185.
Article Google Scholar
Fréchet M: Sur la loi de probabilite de l’écart maximum.́. Ann. Soc. Polon. Math 6: 93–93. (1927)
Google Scholar
Garvan F: The Maple Book. Chapman and Hall/CRC, London; (2002)
Google Scholar
Gompertz B: On the nature of the function expressive of the law of human mortality and on the new model of determining the value of life contingencies. Philos. Trans. R. Soc. Lond 115: 513–585. (1825)
Article Google Scholar
Guess F, Proschan F: Mean residual life: Theory and applications. In Handbook of Statistics, vol. 7. Edited by: Krishnaiah PR, Rao CR. Elsevier; http://dx.doi.org/10.1016/S0169-7161(88)07014-2
Google Scholar
Ghitany ME: Marshall-Olkin extended Pareto distribution and its application. Int. J. Appl. Math 18: 17–31.(2005)
MathSciNet Google Scholar
Ghitany ME, Kotz S: Reliability properties of extended linear failure-rate distributions. Probability Eng. Informational Sci 21: 441–450. (2007)
Article MathSciNet Google Scholar
Ghitany ME, AL-Hussaini EK, AL-Jarallah: Marshall-Olkin extended Weibull distribution and its application to Censored data. J. Appl. Stat 32: 1025–1034. (2005)
Article MathSciNet Google Scholar
Ghitany ME, AL-Awadhi FA, Alkhalfan LA: Marshall-Olkin extended Lomax distribution and its applications to censored data. Comm. Stat. Theor. Meth 36: 1855–1866. (2007)
Article MathSciNet Google Scholar
Gurvich M, DiBenedetto A, Ranade S: A new statistical distribution for characterizing the random strength of brittle materials. J. Mater. Sci 32: 2559–2564. (1997)
Article Google Scholar
Hartley RVLL: Transmission of information. Bell Syst. Techn. J 7: 535–563. (1928)
Article Google Scholar
Jayakumar K, Mathew T: On a generalization to Marshall-Olkin scheme and its application to Burr type XII distribution. Stat. Paper 49: 421–439. (2008)
Article MathSciNet Google Scholar
Johnson NL, Kotz S, Balakrishnan N: Continuous Univariate Distributions. Wiley, New York; (1994)
Google Scholar
Kies JA: The Strength of Glass, NRL Report 5093. Naval Research Lab., Washington, DC (1958)
Google Scholar
Krishna E, Jose KK, Ristić M: Applications of Marshal-Olkin Fréchet distribution. Comm. Stat. Simulat. Comput 42: 76–89. (2013)
MathSciNet Google Scholar
Kullback S, Leibler RA: On information and sufficiency. Ann. Math. Stat 22: 79–86. 1951
Article MathSciNet Google Scholar
Lai CD, Xie M, Murthy DNP: A modified Weibull distribution. Trans. Reliab 52: 33–37. 2003
Article Google Scholar
Lawless JF: Statistical Models and Methods for Lifetime Data. Wiley, New York; 2003
Google Scholar
Lee ET, Wang JW: Statistical Methods for Survival Data Analysis. Wiley, New York; 2003
Book Google Scholar
Lomax KS: Business failures; another example of the analysis of failure data. J. Am. Stat. Assoc 49: 847–852. 1954
Article Google Scholar
Marshall A, Olkin I: A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 84: 641–652. 1997
Article MathSciNet Google Scholar
McCool JI: Using the Weibull Distribution: Reliability, Modeling and Inference. John Wiley & Sons, New Jersey; 2012
Book Google Scholar
Nadarajah S, Kotz S: On some recent modifications of Weibull distribution. IEEE Trans. Reliab 54: 561–562. 2005
Article Google Scholar
Nascimento ADC, Cintra RJ, Frery AC: Hypothesis testing in speckled data with stochastic distances. IEEE Trans. Geosci. Remote Sensing 48: 373–385. 2010
Article Google Scholar
Nascimento ADC, Horta MM, Frery AC, Cintra RJ: Comparing edge detection methods based on stochastic entropies and distances for PolSAR imagery. IEEE J. Selected Topics Appl. Earth Observations Remote Sensing 7: 648–663. 2014
Article Google Scholar
Nikulin M, Haghighi F: A chi-squared test for the generalized power Weibull family for the head-and-neck cancer censored data. J. Math. Sci 133: 1333–1341. 2006
Article MathSciNet Google Scholar
Owen WJ: A new three-parameter extension to the Birnbaum-Saunders distribution. IEEE Trans. Reliab 55: 475–479. 2006
Article Google Scholar
Pham H: A vtub-shaped hazard rate function with applications to system safety. Int. J. Reliab. Appl 3: 1–16. 2002
Google Scholar
Phani KK: A new modified Weibull distribution function. Commun. Am. Ceramic Soc 70: 182–184. 1987
Google Scholar
R Development Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna; 2009
Google Scholar
Rayleigh JWS: On the resultant of a large number of vibrations of the same pitch and of arbitrary phase. Phil. Mag 10: 73–78. 1880
Article Google Scholar
Ristić MM, Jose KK, Ancy J: A Marshall-Olkin gamma distribution and minification process. STARS: Stress Anxiety Res. Soc 11: 107–117. 2007
Google Scholar
Rodriguez N: A guide to the Burr type XII distributions. Biometrika 64: 129–134. 1977
Article MathSciNet Google Scholar
Sankaran PG, Jayakumar K: On proportional odds model. Stat. Paper 49: 779–789. 2008
Article MathSciNet Google Scholar
Seghouane A-K, Amari S-I: The AIC criterion and symmetrizing the Kullback–Leibler divergence. IEEE Trans. Neural Netw 18: 97–106. 2007
Article Google Scholar
Shannon CE: A mathematical theory of communication. Bell Syst. Techn. J 27: 379–423. 1948
Article MathSciNet Google Scholar
Sigmon K, Davis TA: MATLAB Primer. Chapman and Hall/CRC, London; 2002
Google Scholar
Smith RM, Bain LJ: An exponential power life testing distribution. Comm. Stat. Theor. Meth 4: 469–481. 1975
Article Google Scholar
Wolfram S: The Mathematica Book. Wolfram Media, Cambridge; 2003
Google Scholar
Xie M, Lai D: Reliability analysis using additive Weibull model with bathtub-shaped failure rate function. Reliab. Eng. Syst. Saf 52: 87–93. 1995
Article Google Scholar
Xie M, Tang Y, Goh TN: A modified Weibull extension with bathtub-shaped failure rate function. Reliab. Eng. Syst. Saf 76: 279–285. 2002
Article Google Scholar
Zhang T, Xie M: Failure data analysis with extended Weibull distribution. Comm. Stat. Simulat. Comput 36: 579–592. 2007
Article MathSciNet Google Scholar

Download references

Acknowledgements

The authors gratefully acknowledge financial support from CAPES and CNPq. The authors are also grateful to three referees and an associate editor for helpful comments and suggestions.

Author information

Authors and Affiliations

Departamento de Estatística, Universidade Federal de Campina Grande, Bodocongó, 58429-970, Campina Grande, PB, Brazil
Manoel Santos-Neto
Departamento de Estatística, Universidade Federal do Piauí, Ininga, 64049-550, Teresina, PI, Brazil
Marcelo Bourguignon
Departamento de Estatística, Universidade Federal do Rio Grande do Norte, Campus Universitário Lagoa Nova, 59078-970, Natal, RN, Brazil
Luz M Zea
Departamento de Estatística, Universidade Federal de Pernambuco, Cidade Universitária, 50740-540, Recife, PE, Brazil
Abraão DC Nascimento & Gauss M Cordeiro

Authors

Manoel Santos-Neto
View author publications
You can also search for this author in PubMed Google Scholar
Marcelo Bourguignon
View author publications
You can also search for this author in PubMed Google Scholar
Luz M Zea
View author publications
You can also search for this author in PubMed Google Scholar
Abraão DC Nascimento
View author publications
You can also search for this author in PubMed Google Scholar
Gauss M Cordeiro
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Manoel Santos-Neto or Marcelo Bourguignon.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

The authors MS-N, MB, LMZ, ADCN and GMC proposed a new class of models named the Marshall-Olkin extended Weibull distributions and investigated some of its structural properties including ordinary and incomplete moments, generating and quantile functions, mean deviations, information theory measures and some types of entropies. Two special models were discussed and the estimation of the family model parameters was performed by maximum likelihood. They provided a simulation study and two applications to real data. All authors read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Santos-Neto, M., Bourguignon, M., Zea, L.M. et al. The Marshall-Olkin extended Weibull family of distributions. J Stat Distrib App 1, 9 (2014). https://doi.org/10.1186/2195-5832-1-9

Download citation

Received: 28 November 2013
Accepted: 22 April 2014
Published: 16 June 2014
DOI: https://doi.org/10.1186/2195-5832-1-9

The Marshall-Olkin extended Weibull family of distributions

Abstract

Mathematics Subject Classification (2010)

1 Introduction

2 The ℳOEW family

2.1 Expansions for the density function

3 General properties

3.1 Moments, generating function and incomplete moments

3.2 Quantile function and random number generator

3.3 Mean deviations

3.4 Average lifetime and mean residual lifetime functions

4 Information theory measures

4.1 Rényi entropy

4.2 Shannon entropy

4.3 Cross entropy and Kullback-Leibler divergence and distance

5 Estimation

6 Two special models

6.1 The ℳOℳW model

6.2 The ℳOG model

7 Simulation and applications

7.1 Simulation study

7.2 Applications

7.2.1 Uncensored data

7.2.2 Censored data

8 Conclusion

Appendix: An expansion for f(x;δ,α,ξ)F(x;δ,α,ξ)c

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Additional information

Competing interests

Authors’ contributions

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

2 The $ℳ O E W$ family

6.1 The $ℳ O ℳ W$ model

6.2 The $ℳ O G$ model

Appendix: An expansion for f(x;δ,α,ξ)F(x;δ,α,ξ)^c