 Research
 Open Access
 Published:
The gamma extended Weibull distribution
Journal of Statistical Distributions and Applications volume 3, Article number: 7 (2016)
Abstract
The twoparameter Weibull has been the most popular distribution for modeling lifetime data. We propose a fourparameter gamma extended Weibull model, which generalizes the Weibull and extended Weibull distributions, among several other models. We obtain explicit expressions for the ordinary and incomplete moments, generating and quantile functions and mean deviations. We employ the method of maximum likelihood for estimating the model parameters. We propose a loggamma extended Weibull regression model with censored data. The applicability of the new models is well justified by means of two real data sets.
Introduction
There are hundreds of continuous univariate distributions and recent developments focus on constructing general distributions from classic ones. Many distributions have been used as models to make inferences about a population based on a set of empirical data from that population. Determining an adequate model to make inferences is a very important problem. The Weibull distribution is commonly used for modeling systems with monotone failure rates. However, the data sets in reliability analysis, especially over the lifecycle of the product, can involve high initial failure rates and eventual high failure rates due to aging and wear out, indicating a bathtub failure rate. The distributions that allow only monotone failure rates might not be adequate for modeling populations giving rise to such data. The major weakness of the Weibull distribution is its inability to accommodate nonmonotone hazard rates, which has led to new generalizations of this distribution. One of the first extensions allowing for nonmonotone hazard rates, including the bathtub shaped hazard rate function (hrf), is the exponentiated Weibull (ExpW) (Mudholkar and Srivastava 1993; Mudholkar et al. 1995; and Mudholkar et al. 1996) distribution. It has been well established in the literature that the ExpW distribution provides significantly better fits than the wellknown exponential, gamma, Weibull and lognormal distributions. In the last paper, the authors presented a more flexible threeparameter extended Weibull (EW) model. Further, Shao et al. (2004) used this distribution to study flood frequency and Hao and Singh (2008) described some of its applications in hydrology. In this paper, we propose a further generalization by taking the EW distribution as the baseline model.
The threeparameter EW distribution is defined by the probability density function (pdf) and cumulative distribution function (cdf) (Mudholkar et al. 1996)
and
respectively, where λ>0 is a scale parameter and α≥0 and β>0 are shape parameters. The support of the EW distribution is \(\mathbb {R}^{+}\). The forms of the pdf and cdf when α goes to zero tend to those ones when α=0. Clearly, the cdf (2) extends the Weibull cdf and this fact justifies the terminology EW model. Due to the shape parameter α, more flexibility can be incorporated in model (1), which is very useful for lifetime data. The survival function associated to (1) is S _{ λ,α,β }(x)=1−G _{ λ,α,β }(x) for α>0 and S _{ λ,β }(x)=1−G _{ λ,β }(x) for α=0.
A family of univariate distributions generated by gamma random variables was proposed by Zografos and Balakrishnan (2009) and Ristic and Balakrishnan (2012). For any baseline cdf G(x), \(x\in \mathbb {R}\), they defined the gammaG (denoted with the prefix “GG” for short) model with an extra parameter a>0 by the pdf and cdf
and
respectively, where g(x)=d G(x)/d x, \(\Gamma (a) = \int _{0}^{\infty }t^{a1}\,{\mathrm {e}}^{t}dt\) is the gamma function, \(\gamma (a,z) = {\int _{0}^{z}} t^{a1}\,{\mathrm {e}}^{t}dt\) denotes the incomplete gamma function and γ _{1}(a,z)=γ(a,z)/Γ(a) is the incomplete gamma function ratio.
The GG model has the same parameters of the parent G distribution plus one extra shape parameter a>0. Each new GG distribution can be determined from a specified G model. For a=1, the G distribution is a basic exemplar with a continuous crossover towards cases with different shapes (for example, a particular combination of skewness and kurtosis).
We introduce a new fourparameter model called the “gamma extended Weibull” (“GEW”) distribution, which contains as special models some distributions such as the EW distribution. In fact, this model represents only a basic exemplar of the GEW distribution. We also study some of its mathematical properties. The paper is outlined as follows. In Section 2, we define the GEW distribution and provide some of its special cases. Further, two useful expansions for its density and cumulative distributions are derived in Section 3. In Section 4, we obtain its quantile function (qf). The ordinary moments and mean deviations are investigated in Section 5. Maximum likelihood estimation of the model parameters and some inferential tools are discussed in Section 6. In Section 7, we propose the loggamma extended Weibull regression model, which can be applied for lifetime analysis. The importance of the new models is shown empirically by means of two real data sets in Section 8. Some conclusions are offered in Section 9.
The GEW distribution
By taking the pdf (1) and cdf (2) of the EW distribution with scale parameter λ>0 and shape parameters α≥0 and β>0, the pdf and cdf of the GEW distribution can be obtained from Eqs. (3) and (4) (for x>0) as
where τ=(α,β,λ). Clearly, if \(\alpha \rightarrow 0\), the first expressions in (5) and (6) tend to the second ones in these equations. For α=0, the GEW model is identical to the generalized gamma distribution pioneered by Stacy (1962). Hereafter, a random variable X having pdf (5) is denoted by X∼GEW(τ,a). Evidently, the density function (5) does not involve any complicated function and the EW distribution arises as the basic exemplar for a=1. It is a positive point for the current generalization. The GEW model has several submodels: the EW when a=1, gamma Weibull (GW) when α=0, gamma extended exponential (GEE) when β=1, extended exponential (EE) when β=a=1, gamma exponential (GE) when α=0 and β=1, Weibull (W) when α=0 and a=1, and exponential when α=0 and a=β=1. We show empirically the flexibility of the GEW distribution by comparing it with some of its submodels by means of a real data set in Section 8. Plots of the density function and hrf of X for selected parameter values are displayed in Figs. 1 and 2, respectively.
The new distribution is easily simulated as follows: if V is a gamma random variable with parameter a, then
has the GEW(τ,a) distribution. This generating scheme is straightforward because of the existence of fast generators for gamma random variables.
Useful expansions
Expansions for (5) and (6) can be derived using the concept of exponentiated distributions. The exponentiated extended Weibull (EEW) distribution is defined by raising the cdf (2) to a power a>0 and the associated random variable is denoted by Y∼EEW(τ,a). The cdf and pdf of Y are given by
and
respectively. The properties of some exponentiated distributions have been investigated by several authors, see Mudholkar and Srivastava (1993) and Mudholkar et al. (1995) for exponentiated Weibull (ExpW), Gupta et al. (1998) for exponentiated Pareto, Gupta and Kundu (2001) for exponentiated exponential (ExpE) and Nadarajah and Gupta (2007) for exponentiated gamma (ExpG) distributions. More recently, Cordeiro et al. (2011) studied these properties for the exponentiated generalized gamma (ExpGG) distribution.
The following formula holds (http://functions.wolfram.com/ElementaryFunctions/Log/06/01/04/03/)
where a>0 is a real parameter and the constants p _{ j,k } can be determined recursively by
for k=1,2,… and p _{ j,0}=1. Further, for any real parameter a>0, we define
and then the first equation in (5) can be expressed as
where \(h_{a+k}(x)=(a+k)\,\lambda \,\beta \,x^{\beta 1}\,\left (1+\alpha \lambda \,x^{\beta }\right)^{1/\alpha 1}\,\left [1\left (1+\alpha \lambda \,x^{\beta }\right)^{\frac {1}{\alpha }}\right ]^{a+k1}\) denotes the EEW(τ,a+k) density function. So, the GEW density function is a linear mixture of EEW densities.
Similarly, we can derive an expansion for the GW density when α=0. Using Eq. (5) for α=0, we obtain the same expression in (8), but the function h _{ a+k }(x) denotes, when α=0, the GEW(τ,a+k) pdf given by the second equation of (7).
Inserting (7) in Eq. (8), we obtain
where \(e_{j}= \sum ^{\infty }_{k=0}(1)^{j}\,(a+k)\,b_{k}\,\binom {a+k1}{j}\), \(g_{\lambda ^{\star },\alpha ^{\star },\beta }(x)\phantom {\dot {i}\!}\) denotes the EW pdf with parameters λ ^{⋆}=(j+1)λ, α ^{∗}=α/(j+1) and β and \(g_{\lambda ^{\star },\beta }(x)\phantom {\dot {i}\!}\) denotes the Weibull pdf with parameters λ ^{⋆} and β.
Equation (9) is the main result of this section. It reveals that the GEW density function is a linear mixture of EW (when α>0) and Weibull (when α=0) densities. So, several GEW structural properties can be obtained from those properties of the EW and Weibull distributions.
Quantile function
First, we consider the general case α>0. The GEW qf, say Q(u)=F ^{−1}(u), can be expressed in terms of the EW qf, say Q _{ EW }(·). Inverting F(x)=u given by (6), the qf of X (for 0<u<1) follows as
where Q ^{−1}(a,1−u) is the inverse function of Q(a,z)=1−γ _{1}(a,1−u). A power series for it is given in the Wolfram website as
where a _{0}=0, a _{1}=Γ(a+1)^{1/a}, a _{2}=Γ(a+1)^{2/a}/(a+1), etc. Quantities of interest can be obtained from (10) by substituting appropriate values for u.
Further, the EW qf is given by
Moments and mean deviations
Some of the most important features and characteristics of a distribution can be studied through moments. We can write from Eq. (8)
where \(e_{j}^{\star }=(j+1)\,e_{j}\), B(a,b)=Γ(a)Γ(b)/Γ(a+b) is the beta function and \(\Gamma (a)=\int _{0}^{\infty }\,z^{a1}{\mathrm {e}}^{z}dz\) is the gamma function.
Established algebraic expansions to determine \(\mu _{r}^{\prime }\) can be more efficient than computing these moments directly by numerical integration of (5), which can be prone to rounding errors among others. The skewness γ _{1} and kurtosis γ _{2} of X are easily obtained from the ordinary moments using wellknown formulae. Figure 3 displays some plots for the skewness and kurtosis of the GEW model.
For lifetime models, it is of interest to know the rth incomplete moment of X defined by \(T_{r}(y)={\int _{0}^{y}}\,x^{r}\,f(x)dx\). Based on (9), we obtain (for α>0)
where \(\rho (y;r,p,q)={\int _{0}^{y}}\,x^{r}\,(1+p\,x)^{q1} dx\) for r,p,q>0. Using Maple, this integral can be determined as
where _{2} F _{1} is the hypergeometric function defined by
and
We obtain the incomplete moments by combining (13) and the expression for ρ(y;r,p,q).
For α=0, the quantity T _{ r }(y) reduces to
An important application of T _{1}(y) refers to the mean deviations about the mean and about the median, which can be used as measures of spread in a population. They are given by \(\delta _{1}=E\left (X\mu _{1}^{\prime }\right)=2\mu _{1}^{\prime }\,F\left (\mu _{1}^{\prime }\right)2\,T_{1}\left (\mu _{1}^{\prime }\right)\) and \(\delta _{2}=E(Xm)=\mu _{1}^{\prime }2\,T_{1}(m)\), respectively, where the mean \(\mu _{1}^{\prime }\) is determined from (12) and T _{1}(·) comes from (13) with r=1 as
Here, ρ(y;1,p,q) become
Further, for α=0, we have
and then T _{1}(q) reduces to
For a given probability π, both equations for T _{1}(·) can be used to determine the Lorenz and Bonferroni curves defined by \(L(\pi)=T_{1}(q)/\mu _{1}^{\prime }\) and \(B(\pi)=T_{1}(q)/\left (\pi \,\mu _{1}^{\prime }\right)\), respectively, where q=Q _{ GEW }(π) comes from Eq. (10) (see Fig. 4).
Estimation
The maximum likelihood method is used for estimating the parameters of the GEW model. We determine the maximum likelihood estimates (MLEs) from complete samples only. Let x _{1},…,x _{ n } be a sample of size n from the GEW (α,β,λ,a) distribution. The loglikelihood function for the vector of parameters θ=(α,λ,β,a)^{T} is given by \(l({\boldsymbol {\theta }})=\sum _{i=1}^{n}\log [f(x_{i})]=\sum _{i=1}^{n} l_{i}({\boldsymbol {\theta }})\), where
The expression for l(θ) when α=0 gives the loglikelihood function for the GG distribution, a submodel of the GEW distribution. Next, we maximize \(l({\boldsymbol {\theta }})=\sum _{i=1}^{n} l_{i}({\boldsymbol {\theta }})\) for α>0.
The components of the score vector U(θ) are given by \(\sum _{i=1}^{n} \partial l_{i}({\boldsymbol {\theta }})/\partial \theta _{j}\), j=1,…,4, where the θ _{ j }’s denote the parameters α, β, λ or a. The elements ∂ l _{ i }(θ)/∂ θ _{ j }, j=1,…,4, are given below
where ψ(·) is the digamma function.
Setting these equations to zero and solving them simultaneously yield the MLEs of the four parameters. They can be solved numerically by using the Rlanguage or any iterative methods such as the NR (NewtonRaphson), BFGS (BroydenFletcherGoldfarbShanno), BHHH (BerndtHallHallHausman), NM (NelderMead), SANN (SimulatedAnnealing) and LimitedMemory QuasiNewton code for BoundConstrained Optimization (LBFGSB). Regarding the initial values for the parameters, several authors (Ugray et al. 2007; Glover 1998; Dixon and Szegö 1978; Varadhan and Gilbert 2009) suggest to find local maxima starting from widely varying starting points and then pick the maximum out of these, i.e., they suggest the use of multiple starting points (commonly referred to as multistart optimization). We followed their advice in this paper. As a useful tool we suggest the use of the R rotine multiStart of R package BB (for solving and optimizing largescale nonlinear systems) for dealing with multiple starting points to obtain multiple solutions and to test sensitivity to starting values.
For interval estimation of the model parameters, we require the 4×4 total observed information matrix J(θ). The elements of J(θ)={J _{ rs }}, where r,s∈{α,β,λ,a}, can be obtained from the authors upon request. The multivariate normal \(N_{4}\left (0,J(\widehat {\boldsymbol {\theta }})^{1}\right)\) distribution, where \(J(\widehat {\boldsymbol {\theta }})^{1}\) is the inverse observed matrix evaluated at \({\boldsymbol {\theta }}=\widehat {\boldsymbol {\theta }}\), can be used to construct approximate confidence intervals for the parameters.
The likelihood ratio (LR) statistic is useful for comparing the GEW distribution with some of its special models. We can evaluate the maximum values of the unrestricted and restricted loglikelihoods to obtain LR statistics for testing some of its submodels. In any case, hypothesis tests of the type H _{0}: ψ=ψ _{0} versus H: ψ≠ψ _{0}, where ψ is a vector formed with some components of θ and ψ _{0} is a specified vector, can be performed using LR statistics.
The LGEW regression model
If X is a random variable having the GEW density function (5), then Y=log(X) has the loggamma extended Weibull (LGEW) distribution. The density function of Y, parameterized in terms of β=σ ^{−1} and λ=e^{−μ/σ}, is given by
where \(y \in \mathbb {R}\), a>0 and α>0 are shape parameters, σ>0 and \(\mu \in \mathbb {R}\). If Y is a random variable having density function (14), we write Y∼LGEW(a,α,μ,σ). Plots of the density function of Y for selected parameter values are displayed in Fig. 5. They reveal great flexibility for different values of the shape parameters a and α.
Thus
The corresponding survival function of Y is given by
The pdf of the standardized random variable Z=(Y−μ)/σ is given by
The special case α=0 and a=1 refers to the logWeibull (LW) (or extremevalue) distribution and, for α=0, we obtain the loggammaWeibull (LGW) model.
In many practical applications, the lifetimes x _{ i } are affected by explanatory variables such as the cholesterol level, blood pressure and many others. Let v _{ i }=(v _{ i1},…,v _{ ip })^{T} be the explanatory variable vector associated with the ith response variable y _{ i } for i=1,…,n. Consider a sample (y _{1},v _{1}),…,(y _{ n },v _{ n }) of n independent observations, where each random response is defined by \(y_{i}=\min \{\log (x_{i}),\log (c_{i})\}\), and \(\log (x_{i})\) and \(\log (c_{i})\) are the loglifetime and logcensoring, respectively. We consider noninformative censoring such that the observed lifetimes and censoring times are independent.
We construct a linear regression model for the response variable y _{ i } based on the LGEW distribution given by
where the random error z _{ i } has the density function (15), \({\boldsymbol {\beta }}=\left (\beta _{1},\ldots,\beta _{p}\right)^{T}\), σ>0, a>0 and α>0 are unknown scalar parameters and v _{ i } is the vector of explanatory variables modeling the location parameter \(\mu _{i}=\textbf {v}_{i}^{T} {\boldsymbol {\beta }}\). Hence, the location parameter vector \({\boldsymbol {\mu }}=\left (\mu _{1},\ldots,\mu _{n}\right)^{T}\) of the LGEW model has a linear structure μ=v ^{T} β, where \(\textbf {v}=\left (\textbf {v}_{1},\ldots,\textbf {v}_{n}\right)^{T}\) is a known model matrix. The logWeibull (or the extreme value) regression model is defined by (16) with α=0 and a=1.
Let F and C be the sets of individuals for which y _{ i } is the loglifetime or logcensoring, respectively. The total loglikelihood function for the model parameters \({\boldsymbol {\theta }}=\left (a,\alpha,\sigma,{\boldsymbol {\beta }}^{T}\right)^{T}\) can be obtained from Eqs. (15) and (16) as
where q is the observed number of failures and \(z_{i}=\left (y_{i}\textbf {v}_{i}^{T} {\boldsymbol {\beta }}\right)/\sigma \). The MLE \(\widehat {{\boldsymbol {\theta }}}\) of θ can be obtained by maximizing the loglikelihood function (17). We use the procedure NLMixed in SAS to evaluate the estimate \(\widehat {{\boldsymbol {\theta }}}\). Initial values for β and σ are taken from the fit of the LW regression model with α=0 and a=1.
From the fitted model (16), the survival function for y _{ i } can be estimated by
Under general regularity conditions, the asymptotic distribution of \((\widehat {\boldsymbol {\theta }}{\boldsymbol {\theta }})\) is multivariate normal \(N_{p+3}\left (0,K({\boldsymbol {\theta }}\right)^{1})\), where K(θ) is the (p+3)×(p+3) expected information matrix. The asymptotic covariance matrix K(θ)^{−1} of \(\widehat {{\boldsymbol {\theta }}}\) can be estimated by \(\widehat {{\boldsymbol {\theta }}}^{1}\) and then the inference on the parameter vector θ can be based on the normal approximation \(N_{p+3}\left (0,J(\widehat {{\boldsymbol {\theta }}})^{1}\right)\) for \(\widehat {{\boldsymbol {\theta }}}\). In fact, an 100(1−η)% asymptotic confidence interval for each parameter θ _{ r } is given by
where \(\widehat {J}^{r,r}\) represents the rth diagonal element of \(J(\widehat {{\boldsymbol {\theta }}})^{1}\) and z _{ η/2} is the quantile 1−η/2 of the standard normal distribution. The LR statistic can be used to discriminate between the LGEW and LW regression models since they are nested models. In this case, the hypotheses to be tested are H _{0}: (a,α)^{T}=(1,0)^{T} versus H _{1}: H _{0} i s n o t t r u e, and the LR statistic reduces to \(w=2\left \{l(\widehat {{\boldsymbol {\theta }}})l(\widetilde {{\boldsymbol {\theta }}})\right \}\), where \(\widetilde {{\boldsymbol {\theta }}}\) is the MLE of θ under H _{0}. The null hypothesis is rejected if \(w>\chi _{2,1\eta }^{2}\), where \(\chi _{2,1\eta }^{2}\) is the quantile of the chisquare distribution with two degrees of freedom.
Applications
8.1 The GEW model
In the first application, we use the warp break data for six types of weaving warps discussed by Tippett (1950, p. 105). We describe his experiment: “The results of a weaving experiment was conducted in a factory. There were 6 lots of warp yarn labelled respectively AL, AM, etc. They were spun from two growths of cotton, A and B, and each cotton was spun to three twists (i.e., the number of turns in the yarn per inch): low (L), medium (M), and high (H). The combination of these three factors give 6 kinds of yarn, which are the experimental treatments. From each yarn were prepared 9 warps (a warp is a quantity of warp yarn that goes into one loom as a unit), and, as a loom came available in the course of events, a warp chosen at random from the 54 was assigned to it, until ultimately all 54 were disposed of. More than one warp was woven in some looms, but that did not affect the randomness of the distribution. The number of warp threads that broke during the weaving of each warp was counted and expressed as a rate of so many breaks per unit of warp.”
We analyze the warp breakage rates for individual warps disregarding the factors. First, we fit the GEW model and some of its submodels to these data by the method of maximum likelihood. Afterwards, we compare the GEW model with some fourparameter competitive models.
The GEW model, its submodels and some competitors
We compare the GEW model with some of its submodels and we also compare it with some competitor models: the Burr XII geometric (BXIIG) (Silva and Cordeiro 2015), the beta lognormal (BLN) (Castellares et al. 2013) and the exponentiated WeibullPoisson (EWP) (Mahmoudi and Sepahdar 2013) distributions. The MLEs of the models parameters and the Akaike Information Criterion (AIC) statistic for some models fitted to the data are listed in Table 1. In such table the parameters used for the competitor models are the same in the cited references. However, for the BXIIG we included the scale parameter s that was not considered in Silva and Cordeiro (2015). Even though it is expected that the GEW model will provide a better fit than its submodels, we can verify whether or not there are significant differences. Some model comparison is given in Table 2.
The histogram of the data and the plots of the fitted densities for some models are displayed in Fig. 6. They indicate that the new distribution provides a better fit than other submodels. A comparison of the new distribution with four submodels using LR statistics is performed in Table 2. These statistics indicate that the new distribution is the most adequate model to describe the data.
In the case of the competitor models, the AIC values suggest that the GEW model is quite comparable to the BLN and EWP models (being slightly superior) and it is a bit better than the BXIIG model. Figure 7 displays the histogram of the data and the fitted GEW, BXIIG, BLN and EWP densities. The last comments about the AIC are in agreement with the general notion that the GEW density provides the best fit for the current data, although all of them are very close.
The required numerical evaluations are implemented by using an R script (subroutine nlminb that can be found at https://cran.rproject.org ). The nlminb() is a derivativefree method for function minimization as it requires only the function to be evaluated. Other R routine choices such as the optimx package (Nash and Varadhan, 2011) provided similar results. The R rotine multiStart of R package BB is also used in this work. The data set warpbreaks is available in the R data frame.
Some estimation issues
We now discuss some estimation issues related to the GEW distribution. The GEW density is given by (5) and, when \(\alpha \rightarrow 0\), the first expression in (5) tends to the second one. For α=0, the GEW distribution is identical to the generalized gamma distribution. Therefore, according to Theorem 2.1 in Mäkeläinen et al. (1981), the GEW density is not constant on the boundary since
This fact implies that the MLE \(\hat {{\boldsymbol {\theta }}}\) of θ=(α,λ,β,a) of the GEW model may not be unique.
As mentioned before, in order to handle the case of multiple solutions we adopt different initial points, say k, in the maximization procedure and use the estimate which maximizes the loglikelihood among these k values. For the current data sets, we have always been able to find a solution.
As mentioned above, the GEW distribution has different expressions depending on whether α>0 or α=0. In order to study possible estimation difficulties for this model when the true value of α is either zero or around it, we simulate data from the gamma generalized distribution, i.e., by taking α=0 for some choices of the parameters a, λ and β and then use the GEW model in the estimation procedure. We note that the GEW model is able to estimate all the parameter values correctly. As expected, for larger samples, the estimated values tend to be closer to the true values. Of course, for any similar problem, the program may fail to converge for some choices of initial values.
We also perform some additional analyses in order to evaluate the GEW robustness. We simulate two different data sets from the betaWeibull distribution (which does not belong to the GEW family) and use them to fit the GEW distribution. The resulting fits (not shown here) indicate that the GEW distribution is very robust. In fact, the estimated GEW densities capture the main aspects of the betaWeibull generated data in the sense that they are able to correctly locate the mode of the histograms obtained from the generated data, inflection points and other data characteristics.
Regarding the impact of the sample size on the estimated values of the parameters, a simulation study is conducted in which we generate GEW data considering (α,λ,β,a)=(0.895137,0.000026,9.792369,23.609396) as the true parameter values. We simulate data by taking k=500 replicates for each of the sample sizes n=50,100,500,1000,5000 and n=10,000. For a given sample size and k=500 estimated values we evaluate the average of those estimated vectors and the mean squared errors (MSEs). We can conclude that the estimated expected vector does approach the true vector, but the MSE decreases slowly.
8.2 The LGEW model
We now illustrate some of the ideas and methodology of regression models for the LGEW model using the data set from a twoarm clinical trial considered earlier by Efron (1988) and Mudholkar et al. (1996). Efron observed that the empirical hazard functions for both samples start near zero, suggesting an initial highrisk period in the beginning, a decline for a while, and then stabilization after about one year. Specifically, Efron’s data from a head and neck cancer clinical trial consist of survival times of 51 patients in arm A who were given radiation therapy and 45 patients in arm B who were given radiation plus chemotherapy. Nine patients in arm A and 14 patients in arm B were lost to followup and were regarded as censored. In this paper, we consider only one predictor: (v1): twoArm (Arm A=0, Arm B=1).
We fit the LGEW regression model
where the errors z _{1},…,z _{96} are independent random variables with density function (15).
An alternative approach for modeling these data can be provided by the LW distribution. There are various extensions of this lifetime distribution; see, for example, the logbeta Weibull (LBW) (Cordeiro et al. 2013) and Kumaraswamylogistic (KwL) (Santana et al. 2012) distributions, among others. The LBW and KwL density functions are given below:

LBW distribution
$$\begin{array}{@{}rcl@{}} f(y)=\frac{1}{\sigma B(a,b)}\exp\left\{\left(\frac{y\mu}{\sigma}\right)b\exp\left(\frac{y\mu}{\sigma}\right) \right\} \left\{1\exp\left[\exp\left(\frac{y\mu}{\sigma}\right)\right]\right\}^{a1}, \end{array} $$where \(\infty <y<\infty \), σ>0 and \(\infty <\mu <\infty \). See, for example, more details and properties in Cordeiro et al. (2013) and Ortega et al. (2015).

KwL distribution
$$\begin{array}{@{}rcl@{}} f(y)&=&\frac{a\,b}{\sigma}\exp\left[a\left(\frac{y\mu}{\sigma}\right)\right] \left[1+\exp\left(\frac{y\mu}{\sigma}\right)\right]^{(a+1)} \\&&\times\left\{1\left[1\frac{1}{1+\exp\left(\frac{y\mu}{\sigma}\right)}\right]^{a}\right\}^{b1}, \end{array} $$where \(\infty <y<\infty \), σ>0 and \(\infty <\mu <\infty \). Some applications of the KwL distribution are discussed in Santana et al. (2012) and Nadarajah et al. (2012).
Taking μ=β _{0}+β _{1} v _{ i1} for the location parameter, σ is the dispersion parameter and a and b are shape parameters.
Table 3 lists the MLEs of the parameters for the fitted LGEW, LGW, LW, LBW and KwL regression models to the current data using the NLMixed procedure in SAS. Initial values for β and σ are taken from the fitted LW regression model with α=0 and a=1.
A comparison of the new distribution with two of its submodels using LR statistics is performed in Table 4. These statistics indicate that the LGEW regression model is the most adequate model to explain the data.
The LGEW model involves an extra parameter, which gives it more flexibility to fit the data. We note from the fitted LGEW regression model that the dummy variable v _{1} is not significant at 5 %. We note that there is no significant difference between “Arm A” and “Arm B” clinical trial for the survival times. The LGEW regression model outperforms the other models irrespective of the criteria and it can be used effectively in the analysis of these data.
A graphical comparison of the fitted LGEW, LGW and LW models (see Fig. 8) indicates that the LGEW model provides a superior fit. The curves displayed in Fig. 8 represent the empirical survival function and the estimated survival functions obtained from Eq. (18). Finally, we conclude that the LGEW regression model provides a good fit to these data.
Conclusions
We introduce a new model named the gamma extended Weibull (GEW) distribution and study some of its structural properties. It generalizes some important distributions in the literature and provides means of its continuous extension to still more complex situations. The new model contains several distributions as special models including the extended Weibull (Mudholkar et al. 1996), gamma Weibull (Zografos and Balakrishnan 2009) and generalized gamma (Stacy 1962). We derive explicit expressions for the density function, ordinary and incomplete moments, quantile function and mean deviations. The model parameters are estimated by maximum likelihood. The usefulness of the GEW distribution is illustrated by means of an application to real data, where we show empirically that it gives a better fit than some of its submodels. We also propose the loggamma extended Weibull (LGEW) regression model, which has greater flexibility as shown by means of an application to real data.
References
Castellares, F, Montenegro, LC, Cordeiro, GM: The beta lognormal distribution. J. Stat. Comput. Simul. 83(2), 203–228 (2013).
Cordeiro, GM, Nadarajah, S, Ortega, EMM: General results for the beta Weibull distribution. J. Stat. Comput. Sim. 83, 1082–1114 (2013).
Cordeiro, GM, Ortega, EMM, Silva, GO: The exponentiated generalized gamma distribution with application to lifetime data. J. Stat. Comput. Simul. 81, 827–842 (2011).
Dixon, LCW, Szegö, GP: The Global Optimization Problem: an Introduction. In: Dixon, LCW, Szegö, GP (eds.)Towards Global Optimisation, Vol. 2, pp. 1–15. North Holland, Amsterdam (1978).
Efron, B: Logistic regression, survival analysis, and the KaplanMeier curve. J. Am. Stat. Assoc. 83, 414–425 (1988).
Glover, F: A template for scatter search and path relinking. In: Hao, JK, Lutton, E, Ronald, E, Schoenauer, M, Snyers, D (eds.)Artificial Evolution. Lecture Notes in Computer Science, 1363, pp. 13–54. Springer, Berlin/Heidelberg (1998).
Gupta, RC, Gupta, RD, Gupta, PL: Modeling failure time data by Lehman alternatives. Commun. Stat. Theor. M. 27, 887–904 (1998).
Gupta, RD, Kundu, D: Exponentiated exponential family: an alternative to gamma and Weibull distributions. Biometrical J. 43, 117–130 (2001).
Hao, Z, Singh, VP: Entropybased parameter estimation for extended Burr XII distribution. Stoch. Env. Res. Risk A. 23, 1113–1122 (2008).
Mahmoudi, E, Sepahdar, A: Exponentiated WeibullPoisson distribution: Model, Properties and Applications. Math. Comput. Simul. 92, 76–97 (2013).
Mäkeläinen, T, Schmidt, K, Styan, GPH: On the existence and uniqueness of the maximum likelihood estimate of a vVectorvalued parameter in fixedsize. Ann. Stat. 9, 758–767 (1981).
Mudholkar, GS, Srivastava, DK: Exponentiated Weibull family for analyzing bathtub failurerate data. IEEE T. Reliab. 42, 299–302 (1993).
Mudholkar, GS, Srivastava, DK, Freimer, M: The exponentiated Weibull family: a reanalysis of the busmotorfailure data. Technometrics. 37, 436–445 (1995).
Mudholkar, GS, Srivastava, DK, Kollia, GD: A generalization of the Weibull distribution with application to the analysis of survival data. J. Am. Stat. Assoc. 91, 1575–1583 (1996).
Nadarajah, S, Cordeiro, GM, Ortega, EMM: General results for the KumaraswamyG distribution. J. Stat. Comput. Sim. 82, 951–979 (2012).
Nadarajah, S, Gupta, AK: The exponentiated gamma distribution with application to drought data. Calcutta Stat. Assoc. 59, 29–54 (2007).
Nash, JC, Varadhan, R: Unifying optimization algorithms to aid software system users: optimx for R. J. Stat. Software. 43, 1–14 (2011).
Ortega, EMM, Cordeiro, GM, Campelo, AK, Kattam, MW, Cancho, VG: A power series beta Weibull regression model for predicting breast carcinoma. Stat. Med. 34, 1366–1388 (2015).
Ristic, MM, Balakrishnan, N: The gamma exponentiated exponential distribution. J. Stat. Comput. Simul. 82, 1191–1206 (2012).
Santana, TVF, Ortega, EMM, Cordeiro, GM, Silva, GO: The KumaraswamyLogLogistic Distribution. J. Stat. Theory Appl. 11, 265–291 (2012).
Shao, Q, Wong, H, Xia, J, WaiCheung, I: Models for extremes using the extended threeparameter Burr XII system with application to flood frequency analysis. Hydrolog. Sci. J. 49, 685–702 (2004).
Silva, RB, Cordeiro, GM: The Burr XII power series distributions: A new compounding family. Stat. Braz. J. Probab. Stat. 29(3), 565–589 (2015).
Stacy, EW: A generalization of the gamma distribution. Ann. Math. Stat. 33, 1187–1192 (1962).
Tippett, LHC: Technological Applications of Statistics. Wiley, New York (1950).
Ugray, Z, Lasdon, L, Plummer, J, Glover, F, Kelly, J, Marti, R: Scatter search and local NLP solvers: a multistart framework for global optimization. INFORMS J. Comput. 19(3), 328–340 (2007).
Varadhan, R, Gilbert, PD: BB: An R package for solving a large system of nonlinear equations and for optimizing a highdimensional nonlinear objective function. J. Stat. Softw. 32, 4 (2009).
Zografos, K, Balakrishnan, N: On families of beta and generalized gammagenerated distributions and associated inference. Statist. Method. 6, 344–362 (2009).
Author information
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
GMC proposed the model, wrote sections 1, 2, 4, 5, 9 (1, 2 and 4 jointly with MCSL, and 5 jointly with AEG) and drafted the manuscript. MCSL wrote and obtained the results in sections 1 to 4 (1, 2 and 4 jointly with GMC). AEG wrote and obtained the results in sections 5 (jointly with GMC), 6 and 8.1 (jointly with CQS). CQS wrote and obtained the results in Sections 6 and 8.1 (jointly with AEG), and the plots in Fig. 1 to 4, 6 and 7. EMMO proposed the log gamma extended Weibull regression model described in section 7, wrote and obtained the results in that section and in section 8.2, and also the plots in Fig. 5 and 8.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Received
Accepted
Published
DOI
Keywords
 Extended Weibull distribution
 Gamma extended Weibull distribution
 Generating function
 Maximum likelihood estimation
 Mean deviation
 Quantile function
MSC
 62E10
 62E20
 62F10