Generalized logistic distribution and its regression model

Aljarrah, Mohammad A.; Famoye, Felix; Lee, Carl

doi:10.1186/s40488-020-00107-8

Research
Open access
Published: 07 September 2020

Generalized logistic distribution and its regression model

Mohammad A. Aljarrah¹,
Felix Famoye² &
Carl Lee²

Journal of Statistical Distributions and Applications volume 7, Article number: 7 (2020) Cite this article

14k Accesses
7 Citations
Metrics details

Abstract

A new generalized asymmetric logistic distribution is defined. In some cases, existing three parameter distributions provide poor fit to heavy tailed data sets. The proposed new distribution consists of only three parameters and is shown to fit a much wider range of heavy left and right tailed data when compared with various existing distributions. The new generalized distribution has logistic, maximum and minimum Gumbel distributions as sub-models. Some properties of the new distribution including mode, skewness, kurtosis, hazard function, and moments are studied. We propose the method of maximum likelihood to estimate the parameters and assess the finite sample size performance of the method. A generalized logistic regression model, based on the new distribution, is presented. Logistic-log-logistic regression, Weibull-extreme value regression and log-Fréchet regression are special cases of the generalized logistic regression model. The model is applied to fit failure time of a new insulation technique and the survival of a heart transplant study.

Introduction

The use of logistic distribution in various disciplines can be found in (Johnson et al. 1995) and the references therein. The logistic distribution has the cumulative distribution function (CDF) defined as

$$ F(x)={\left(1+\exp \left(-\frac{x-\mu }{\sigma}\right)\right)}^{-1},-\infty <x,\mu <\infty, \sigma >0. $$

(1)

Note that the logistic distribution is the limiting distribution of the average of largest and smallest values of random samples of size n from a symmetric distribution of exponential type (Gumbel 1958).

The CDF of the standard logistic distribution is F(y) = (1 + e^−y)⁻¹, − ∞ < y < ∞. The standard logistic density function with kurtosis 4.2 is symmetric about zero, and is more peaked and has heavier tails than the normal density function. These properties make logistic distribution a popular choice for fitting symmetric non-normal data.

The first type of extreme value distribution is commonly known as the Gumbel-type distribution due to Gumbel (1958), who made several significant contributions to the extreme value analysis and practical applications of extreme value statistics in distributions of human lifetimes, radioactive emissions, and flood analysis (see, e.g., Johnson et al. 1995). Gumbel used the distribution to model the maximum and minimum values of samples from various distributions. The CDFs of maximum and minimum Gumbel distributions are defined, respectively, as

$$ {F}_{Gu-\max}\left(x;\mu, \sigma \right)=\exp \left\{-\exp \left(-\frac{x-\mu }{\sigma}\right)\right\},-\infty <x<\infty, -\infty <\mu <\infty, \sigma >0, $$

(2)

$$ {F}_{Gu-\min}\left(x;\mu, \sigma \right)=1-\exp \left\{-\exp \left(\frac{x-\mu }{\sigma}\right)\right\},-\infty <x<\infty, -\infty <\mu <\infty, \sigma >0, $$

(3)

where μ and σ are location and scale parameters, respectively. Gumbel distribution is good to fit skewed data while logistic distribution is for symmetric data. It is interesting to note that there is a relation between these two distributions. If X ~ Gumbel(μ_X, σ) and Y ~ Gumbel(μ_Y, σ), then, (X − Y) ∼ logistic(μ_X − μ_Y, σ).

In order to improve the goodness of fit of the logistic and Gumbel distributions, many generalizations of these distributions have been studied in the literature. For example, Prentice (1976) proposed logistic type IV to model binomial regression data. Stukel (1988) proposed logistic regression model. Balakrishnan and Leung (1988) proposed three types of generalized logistic distribution. Johnson et al. (1995) summarized several generalizations of the logistic distribution. Wahed and Ali (2001) proposed the skew logistic distribution (SLD). An extension of SLD was presented and studied by Nadarajah (2009) by introducing a scale parameter. Gupta and Kundu (2010) defined two generalizations of logistic distribution, namely the skew logistic using the skew normal method proposed by Azzalini (1985) and defined the Type-II logistic distribution as a member of the proportional reversed hazard family with the baseline distribution as the logistic distribution. The T-X framework proposed by Alzaatreh et al. (2013), which was further expanded by Aljarrah et al. (2014) are two general methods that have been applied to derive various generalization of distributions, including logistic distribution. Recently, Ghosh and Alzaatreh (2018) defined the exponentiated-exponential logistic (EEL) distribution as a generalization of the logistic distribution and various properties were studied by the authors.

Similar to the logistic distribution, several generalizations of the Gumbel distribution have appeared in the literature. For a review of generalizations of the Gumbel extreme value distribution, one may refer to Pinheiro and Ferrari (2016).

There is already a long list of literatures for generalized logistic and Gumbel distributions. Why are we developing yet another family of generalized logistic distributions? As pointed out by Johnson et al. (1994, p. 15) “For most practical purposes, it is sufficient to use four parameters. There is no doubt that at least three parameters are needed; for some purposes this is enough.” The main motivation is to develop highly flexible three-parameter distributions that can fit wide range of right and left skewed data. The method proposed here has several advantages that are not available among the existing generalizations:

(a)
The method proposed is not to develop a single generalized logistic distribution, it can be applied to generate different families of generalized logistic distributions. A generalized normal distribution using similar technique was studied in Aljarrah et al. (2019), which was shown to be a much more flexible distribution than the skewed normal proposed by Azzalini (1985) and its generalizations.
(b)
A member of the family of the generalized logistic distributions, the exponential-logistic {Generalized Weibull} distribution (E-L {GW}) is defined and studied in detail in this article. This distribution has three parameters: location, scale and a shape parameter. As shown in the article, the E-L {GW} distribution is a generalization of both logistic and Gumbel distributions.
(c)
The E-L {GW} is shown to be more flexible than existing generalizations of logistic and Gumbel distributions in two ways: (i) It fits very well left and right skewed data. Existing generalized logistic or Gumbel distributions can fit heavy right-skewed data, but not able to fit heavy left-skewed data. (ii) It fits very well data with a wider range of skewness and kurtosis when compared with existing generalizations such as skew logistic (Gupta and Kundu 2010), beta-logistic distribution (Nassar and Elmasry 2012), generalized logistic distribution (Ghosh and Alzaatreh 2018), generalized Gumbel (Cooray 2010), as well as skew normal (Azzalini 1985) and its five-parameter generalized distribution (Choudhury and Abdul 2011).
(d)
The generalized regression model derived by assuming the response follows E-L {GW} distribution is a very flexible model that takes logistic-log-logistic regression, Weibull-extreme value regression and log-Fréchet regression as special cases.

In Section 2, we define the E-L {GW} distribution. Some properties of the E-L {GW} distribution including the shapes of the probability density function (PDF) and hazard function, and quantile function are studied. An expression for the moment, properties of the hazard function, and the relationship between the mean, variance, skewness, kurtosis and the shape parameter are investigated in Section 3. In Section 4, the method of maximum likelihood is presented for estimating the parameters of the distribution, and a simulation study is performed to assess the small sample performance of the method. In Section 5, a generalized logistic regression model based on E-L {GW} distribution is developed. In Section 6, applications to several real data sets are given to demonstrate the flexibility and usefulness of the new distribution and its regression model. Summary and conclusions are given in Section 7.

The exponential-logistic {generalized Weibull} (E-L {GW}) distribution

Let the random variable R be a standard logistic distribution. Using a shape parameter ξ > 0, location parameter − ∞ < μ < ∞, scale reflection parameter σ ≠ 0, and following the technique that Aljarrah et al. (2019) used to define the combined exponential-normal {GW} distribution, we define the combined E-R {GW} family as

$$ {F}_X(x)=0.5+\operatorname{sgn}\left(\sigma \right)\left(0.5-\exp \left\{-\left({\left({\overline{F}}_R\left(\frac{x-\mu }{\sigma}\right)\right)}^{-\xi }-1\right)/\xi \right\}\right), $$

(4)

where sgn(σ) is the sign of the parameter σ. Note that the CDF defined in (4) reduces to $ {F}_R\left(\frac{x-\mu }{\mid \sigma \mid}\right) $ distribution as ξ → 0. The corresponding PDF to (4) is given by

$$ {f}_X(x)=\frac{f_R\left(\frac{x-\mu }{\sigma}\right)}{\mid \sigma \mid {\left({\overline{F}}_R\left(\frac{x-\mu }{\sigma}\right)\right)}^{\xi +1}}\exp \left(-\frac{{\left({\overline{F}}_R\left(\frac{x-\mu }{\sigma}\right)\right)}^{-\xi }-1}{\xi}\right). $$

(5)

The E-L {GW} distribution can be defined from Eq. (4) by letting R be the logistic random variable as follows:

Definition (E-L {GW} distribution): The CDF and PDF of the E-L {GW} distribution are defined, respectively, as

$$ {F}_X(x)=0.5+\operatorname{sgn}\left(\sigma \right)\left(0.5-\exp \left\{\left(1-{\left(1+\exp \left(\frac{x-\mu }{\sigma}\right)\right)}^{\xi}\right)/\xi \right\}\right), $$

(6)

and

$$ {f}_X(x)=\frac{1}{\mid \sigma \mid}\exp \left(\frac{x-\mu }{\sigma}\right){\left(1+\exp \left(\frac{x-\mu }{\sigma}\right)\right)}^{\xi -1}\exp \left\{\left(1-{\left(1+\exp \left(\frac{x-\mu }{\sigma}\right)\right)}^{\xi}\right)/\xi \right\},-\infty <x,\mu <\infty, \sigma \ne 0,\xi >0. $$

(7)

Note the E-L {GW} is derived as a generalization of the symmetric logistic distribution for fitting highly skewed data. This provides a good comparison of performance when comparing with various existing three-parameter distributions. The following Corollary presents some special sub-models.

Corollary 1: The PDF of E ‐ L{GW}(μ, σ, ξ) in (7) reduces to the following sub-models:

a)
When ξ → 0, the PDF in (7) reduces to a logistic distribution in (1).
b)
When ξ = 1 and σ < 0, the PDF in (7) reduces to the PDF of maximum Gumbel distribution in (2) with location and scale parameters μ and ∣σ∣, respectively.
c)
When ξ = 1 and σ > 0, the PDF in (7) reduces to the PDF of minimum Gumbel distribution in (3) with location and scale parameters μ and σ, respectively.

Proof: a) $ \underset{\xi \to 0}{\lim }{f}_X(x)=\frac{1}{\mid \sigma \mid}\exp \left(\frac{x-\mu }{\mid \sigma \mid}\right)/{\left(1+\exp \left(\frac{x-\mu }{\mid \sigma \mid}\right)\right)}^2,\kern0.37em \mathrm{that}\ \mathrm{is}\kern0.37em X\sim \mathrm{Logistic}\left(\mu, |\sigma |\right) $. The cases (b) and (c) are obtained directly by substituting ξ = 1 in (7). □

Quantile functions are useful for generating pseudo-random numbers from a probability distribution. Proposition 1 gives the quantile function for the E-L {GW} distribution.

Proposition 1: The quantile function for the E-L {GW} distribution is given by

$$ {Q}_X(u)=\mu +\sigma \log \left({\left\{1-\xi \log \left(\frac{1}{2}-\operatorname{sgn}\left(\sigma \right)\left(u-\frac{1}{2}\right)\right)\right\}}^{1/\xi }-1\right),u\in \left(0,1\right). $$

(8)

Proof: By setting F_X(Q_X(u)) = u in Eq. (6) and solving for Q_X(u) in terms of u, the quantile function in (8) is obtained. □

Proposition 2:

a)
If T is a standard exponential random variable, then X = μ+ σ log((1 + ξT)^1/ξ − 1) follows the E-L {GW} (μ, σ, ξ) distribution in Eq. (6).
b)
If X~ E-L {GW} (μ, σ, ξ), then (2μ − X)~ E-L {GW} (μ, −σ, ξ).

Proof: Using the CDF method, the results in (a) and (b) follow. □

The hazard rate function (HRF) of the E-L {GW} distribution is obtained after using the CDF in (6) and PDF in (7), and it is given by

$$ h(x)=\left\{\begin{array}{l}\frac{1}{\sigma}\exp \left(\frac{x-\mu }{\sigma}\right){\left\{\exp \left(\frac{x-\mu }{\sigma}\right)+1\right\}}^{\xi -1},\kern10em \sigma >0,\\ {}\frac{\exp \left(\frac{x-\mu }{\sigma}\right){\left\{\exp \left(\frac{x-\mu }{\sigma}\right)+1\right\}}^{\xi -1}}{\mid \sigma \mid \left\{\exp \left(\frac{1}{\xi}\left[{\left\{\exp \left(\frac{x-\mu }{\sigma}\right)+1\right\}}^{\xi }-1\right]\right)-1\right\}},\kern5.3em \sigma <0.\end{array}\right. $$

(9)

Figures 1 and 2 show the plots of PDF and HRF for E-L {GW} distribution. The PDF can be positively or negatively skewed, while the HRF shows increasing with J shape, increasing with S shape, and increasing-decreasing shapes. The graphs in Fig. 1 indicate that the distribution tends to be symmetric as ξ → 0, skewed to the left when σ > 0, and skewed to the right when σ < 0. When the sign of parameter σ is changed, the curve of the PDF is reflected about the line x = 0. Also as ξ increases, the mode decreases when σ > 0, and as ξ increases, the mode increases when σ < 0. The graphs in Fig. 2 show the hazard function in (9) is increasing when σ > 0. When σ < 0, the hazard function increases or first constant, increases and then decreases.

Properties of exponential-logistic {generalized Weibull} distribution

In this section, some properties of the E-L {GW} distribution are studied. These properties include, mode, shape property of the HRF, moments and moment generating function.

Mode:

Theorem 1: The E-L {GW} distribution is unimodal. The mode is at the point x_∗ = μ whenever ξ = { 0, 1}. Otherwise the mode is at the point x_∗ = μ + σ log(u_∗), where u_∗ satisfies the equation

$$ \xi u+1=u{\left(u+1\right)}^{\xi },u>0. $$

(10)

Proof: See Appendix.

Corollary 2: The HRF is increasing whenever σ > 0, and asymptotic to the line y = 1/ ∣ σ∣ as x → ∞ whenever σ < 0.

Proof: See Appendix.

It is noteworthy to mention that the graphs in Fig. 2 are consistent with the above results and the asymptotic feature of the curves in Corollary 2.

Moments: The moments are valuable for describing and identifying distribution properties such as the center, variance, skewness and kurtosis. In order to derive the moments of E-L {GW}, we first provide a series expansion of PDF of E-R {GW} in Eq. (5), by applying the exponential series, as follows.

$$ {f}_X(x)=\frac{f_R\left(\frac{x-\mu }{\sigma}\right)\exp \left(1/\xi \right)}{\mid \sigma \mid}\sum \limits_{i=0}^{\infty}\frac{{\left(-1\right)}^i{\left({\overline{F}}_R\left(\frac{x-\mu }{\sigma}\right)\right)}^{-\left(\xi i+\xi +1\right)}}{i!{\xi}^i}. $$

By applying negative binomial series expansion $ {\left(1-x\right)}^{-r}=\sum \limits_{j=0}^{\infty}\frac{\Gamma \left(r+j\right)}{\Gamma \left(j+1\right)\Gamma (r)}{x}^j $, ∣x ∣ < 1 on $ {\left({\overline{F}}_R\left(\left(x-\mu \right)/\sigma \right)\right)}^{-\left(\xi i+\xi +1\right)} $, we get

$ {f}_X(x)=\frac{f_R\left(\frac{x-\mu }{\sigma}\right)\exp \left(1/\xi \right)}{\mid \sigma \mid}\sum \limits_{i=0}^{\infty}\sum \limits_{j=0}^{\infty}\frac{{\left(-1\right)}^i\Gamma \left(\xi i+\xi +j+1\right)}{\xi^ii!\Gamma \left(j+1\right)\Gamma \left(\xi i+\xi +1\right)}{\left({F}_R\left(\frac{x-\mu }{\sigma}\right)\right)}^j $, which can be written as

$$ {f}_X(x)=\sum \limits_{i=0}^{\infty}\sum \limits_{j=0}^{\infty}\frac{\omega_{i,j}}{\mid \sigma \mid }{k}_{\left(j+1\right)}\left(\left(x-\mu \right)/\sigma \Big)\right). $$

(11)

where

$$ {\omega}_{i,j}=\frac{{\left(-1\right)}^i\exp \left(1/\xi \right)\Gamma \left(\xi i+\xi +j+1\right)}{\xi^ii!\Gamma \left(j+2\right)\Gamma \left(\xi i+\xi +1\right)}, $$

(12)

and k_(j + 1)(x) = (j + 1)f_R(x)(F_R(x))^j denotes the PDF of exponentiated R random variable with power parameter j + 1.

Theorem 2: The n^th absolute moment of the E-L {GW} distribution exists for any μ, σ ≠ 0, ξ > 0 and satisfies the inequality

$$ E\left({\left|X\right|}^n\right)\le {e}^{-1}{\left(1+\xi \right)}^{1+1/\xi}\sum \limits_{i=0}^n\left(\begin{array}{l}n\\ {}i\end{array}\right){\left|\mu \right|}^{n-i}{\left|\sigma \right|}^iE\left({\left|L\right|}^i\right), $$

(13)

where L is a standard logistic random variable.

Proof: See Appendix.

Moments of E-L {GW} as a series expression is given in the following theorem.

Theorem 3: The r^th moment, E(X^r), of the E-L {GW} distribution is given by

$$ E\left({X}^r\right)=\sum \limits_{n=0}^r\sum \limits_{i=0}^{\infty}\sum \limits_{j=0}^{\infty}\left(\begin{array}{c}r\\ {}n\end{array}\right){\mu}^{r-n}{\sigma}^n{\omega}_{i,j}E\left({L}_{j+1}^n\right), $$

(14)

where ω_{i, j} is defined in (12) and $ E\left({L}_{i+1}^n\right) $ is the n^th moment of the exponentiated logistic distribution with power parameter j + 1 and given by Ali et al. (2007) as

$$ E\left({L}_{j+1}^n\right)=\left(j+1\right)n!\left(\sum \limits_{k=0}^{\infty}\frac{{}_{\left(-j-2\right)}P_k}{k!{\left(k+1\right)}^{n+1}}+{\left(-1\right)}^n\sum \limits_{k=0}^{\infty}\frac{{}_{\left(-j-2\right)}P_k}{k!{\left(k+j+1\right)}^{n+1}}\right). $$

Proof: See Appendix.

Proposition 3: Suppose X has the PDF in (6), then the moment generating function (MGF) of X is given by

$$ {M}_X(t)={e}^{\mu t+1/\xi}\sum \limits_{i=0}^{\infty}\frac{\Gamma \left(\sigma t+1\right){\left(-1\right)}^i}{\Gamma \left(\sigma t-i+1\right)\Gamma \left(i+1\right)}{\xi}^{\left(\sigma t-i\right)/\xi}\Gamma \left(\left(\sigma t-i\right)/\xi +1,1/\xi \right), $$

(15)

where

$$ \left\{\begin{array}{l}t\in \left(-\infty, 1/|\sigma |\right),\kern3.1em \operatorname{}\xi \ge 1,\sigma <0\\ {}t\in \left(-1/\sigma, \infty \right),\kern3.6em \operatorname{}\xi \ge 1,\sigma >0\;\\ {}t\in \left(-1/|\sigma |,1/|\sigma |\right),\operatorname{}\kern1.8em \xi <1,\sigma \ne 0.\end{array}\right. $$

Proof: See Appendix.

In Fig. 3, the mean and variance of E-L {GW} distribution are plotted in terms of the parameter ξ for μ = 0 and σ = {1, −1}. Figure 3(a) shows that when σ > 0, the mean decreases as ξ increases, When σ < 0, the mean increases as ξ increases. Also, Fig. 3(b) shows that the variance decreases as ξ increases.

In Fig. 4, we plot the skewness and kurtosis of E-L {GW} distribution in terms of the parameters ξ when μ = 0 and σ = {1, −1}. Figure 4(a) shows that when σ > 0, the skewness decreases as ξ increases and the E-L {GW} distribution is left skewed, and when σ < 0, the skewness increases as ξ increases and the E-L {GW} distribution is right skewed. The distribution is symmetric as ξ → 0. We note that the degree of skewness of the E-L {GW} distribution is measured by ξ, and the parameter σ plays two roles: characterizing the scale property and determining left skewed (σ > 0) or right skewed (σ < 0). Figure 4(b) shows the kurtosis increases as ξ increases, and it is not affected by σ.

The flexibility of the E-L {GW} is compared with skew normal (SN) (Azzalini 1985), extended skew generalized normal (ESGN) (Choudhury and Abdul 2011), generalized normal (GN) (Aljarrah et al. 2019), beta-generalized logistic (BGL) (Nassar and Elmasry 2012), proportional reversed hazard logistic (PRHL) (Gupta and Kundu 2010), generalized Gumbel (GG) (Cooray 2010) and EEL (Ghosh and Alzaatreh 2018). Table 1 summarizes the ranges of skewness and kurtosis of these distributions. It is shown that the E-L {GW} fits the widest range of skewness and kurtosis with the exception that the PRHL can fit platykurtic distributions.

Table 1 A comparison of skewness and kurtosis of some generalized logistic and Gumbel distributions

Full size table

Estimation and simulation

Estimation

In this subsection, we discuss the maximum likelihood estimation method for the parameters of E-L {GW} distribution. Let x₁, x₂, …, x_n be a random sample from E-L {GW} distribution with parameters θ = (ξ, μ, σ)^t, the log-likelihood function is given by

$$ \mathrm{\ell}\left(\boldsymbol{\theta}, \boldsymbol{x}\right)=-n\log \mid \sigma \mid +\sum \limits_{i=1}^n\left(\frac{x_i-\mu }{\sigma}\right)+\left(\xi -1\right)\sum \limits_{i=1}^n\log \left[1+\exp \left(\frac{x_i-\mu }{\sigma}\right)\right]+\frac{n}{\xi }-\frac{1}{\xi}\sum \limits_{i=1}^n{\left[1+\exp \left(\frac{x_i-\mu }{\sigma}\right)\right]}^{\xi }. $$

Letting z_i = exp((x_i − μ)/σ), the score function of the distribution parameters is given by U_n(θ) = (∂ℓ/∂ξ, ∂ℓ/∂μ, ∂ℓ/∂σ), where

$$ \frac{\mathrm{\partial \ell }}{\partial \xi }=-\frac{1}{\xi}\sum \limits_{i=1}^n\left[{\left({z}_i+1\right)}^{\xi}\log \left({z}_i+1\right)\right]+\frac{1}{\xi^2}\sum \limits_{i=1}^n{\left[{z}_i+1\right]}^{\xi }-\frac{n}{\xi^2}+\sum \limits_{i=1}^n\log \left({z}_i+1\right), $$

(16)

$$ \frac{\mathrm{\partial \ell }}{\partial \mu }=\frac{1}{\sigma}\sum \limits_{i=1}^n\left[{z}_i{\left({z}_i+1\right)}^{\xi -1}\right]-\frac{1}{\sigma}\sum \limits_{i=1}^n\frac{\xi {z}_i+1}{z_i+1}, $$

(17)

$$ \frac{\mathrm{\partial \ell }}{\partial \sigma }=\frac{1}{\sigma}\sum \limits_{i=1}^n\left[{z}_i\log {z}_i{\left({z}_i+1\right)}^{\xi -1}\right]-\sum \limits_{i=1}^n\frac{\left(1+\xi \log {z}_i\right){z}_i+1+\log {z}_i}{\sigma \left({z}_i+1\right)}. $$

(18)

The maximum likelihood estimates (MLEs) of the parameters can be obtained by solving the nonlinear Eqs. (16), (17) and (18). The initial values of μ and σ are taken to be the mean and ± standard deviation of the data respectively. The initial value of σ is taken as s (or -s) if the data is skewed left (or right). The initial value of ξ is taken to be 1.

Simulation

A simulation study is conducted to explore the performance of the MLE for the parameters of the E-L {GW} distribution. Many combinations of the parameters of the E-L {GW} model, namely, highly, moderately, and weakly left (or right) skewed, are considered and represent all possible shapes of the model. Different sample sizes n = {50, 100, 200, 500, 1000} are also considered. The MLE of the parameters ξ, μ and σ are computed for 200 repetitions in order to calculate the bias and the standard deviation (SD) for each set of parameter combinations and sample size. Table 2 shows the results of the simulation, and Figs. 5 and 6 present the illustrations. The results show that the bias and SD decrease as the sample size increases. The estimated PDF curve also moves closer to the actual curve with the increase in the sample size. These results indicate that the MLE method can be used to estimate the parameters of the E-L {GW} distribution.

Table 2 Bias and SD of the parameter estimates using MLE method

Full size table

Generalized logistic regression model based on E-L {GW}

In this section, we propose a generalized logistic regression model by assuming the response Y follows E-L {GW} distribution. If the variable of interest is non-negative such as survival time, T, then the response Y is defined as log(T). In the following, we derive a generalized logistic regression model for modeling life-time data. Univariate survival functions and censored data regression problems can be estimated using parametric models for covariate effects. Parametric models produce precise estimates of the quantities of interest when they provide a good fit to the lifetime data set. The reason is that these estimates are based on few parameters in this way. On the basis of the E-L {GW} distribution, the following regression model is considered:

$$ {y}_i={\boldsymbol{v}}_i^T\boldsymbol{\beta} +\sigma {z}_i,i=1,\dots, n, $$

(19)

where the response variable y_i = log(t_i) is the logarithm of the survival time t_i, β = (β₀, β₁, …, β_p)^T, and σ ≠ 0 are unknown parameters. Each y_i has a covariate vector $ {\boldsymbol{v}}_i^T=\left(1,{v}_{i1},\dots, {v}_{ip}\right) $ that models the linear predictor $ {\mu}_i={\boldsymbol{v}}_i^T\boldsymbol{\beta} $. The random error z_i has the E-L {GW} density (7). The shape parameter ξ can be treated as a nuisance parameter, which may be tested against special cases of the E-L {GW} distribution. It can also be modeled with a vector of covariates $ {\xi}_i=\exp \left({\boldsymbol{v}}_i^T\boldsymbol{\gamma} \right) $ that depends on the covariate vector $ {\boldsymbol{v}}_i^T $ and parameter vector γ = (γ₀, γ₁, …, γ_p)^T. The corresponding survival function is

$$ S\left({y}_i|\mu \left(\boldsymbol{v}\right),\sigma, \xi \left(\boldsymbol{v}\right)\right)=0.5-\operatorname{sgn}\left(\sigma \right)\left(0.5-\exp \left\{\left(1-{\left(1+\exp \left(\frac{y_i-{\boldsymbol{v}}_i^T\boldsymbol{\beta}}{\sigma}\right)\right)}^{\exp \left({\boldsymbol{v}}_i^T\boldsymbol{\gamma} \right)}\right)/\exp \left({\boldsymbol{v}}_i^T\boldsymbol{\gamma} \right)\right\}\right). $$

(20)

The corresponding PDF to the survival function in (20) is given by

$$ f\left({y}_i\right)=\frac{1}{\mid \sigma \mid}\exp \left(\frac{y_i-{\boldsymbol{v}}_i^T\boldsymbol{\beta}}{\sigma}\right){\left(1+\exp \left(\frac{y_i-{\boldsymbol{v}}_i^T\boldsymbol{\beta}}{\sigma}\right)\right)}^{\exp \left({\boldsymbol{v}}_i^T\boldsymbol{\gamma} \right)-1}\exp \left\{\left(1-{\left(1+\exp \left(\frac{y_i-{\boldsymbol{v}}_i^T\boldsymbol{\beta}}{\sigma}\right)\right)}^{\exp \left({\boldsymbol{v}}_i^T\boldsymbol{\gamma} \right)}\right)/\exp \left({\boldsymbol{v}}_i^T\boldsymbol{\gamma} \right)\right\}. $$

(21)

The generalized logistic regression model consists of many popular regression models as nested models. Some special regression models are as follows:

1.
Logistic-log-logistic regression model: this model is obtained as a special case from (20) when γ₁ = γ₁ = … = γ_p = 0 and γ₀ → − ∞ (or ξ → 0). The survival function is

$$ S(y)={\left\{1+\exp \left(\frac{y-{\boldsymbol{v}}^T\boldsymbol{\beta}}{\mid \sigma \mid}\right)\right\}}^{-1}, $$

which is the logistic-log-logistic regression model, Lawless (2003, p. 303).

2.
Weibull-extreme value regression model: this model is obtained as a special case from (20) when γ₀ = γ₁ = … = γ_p = 0 (or ξ = 1), and σ > 0. The survival function is

$$ S(y)=\exp \left\{-\exp \left(\frac{y-{\boldsymbol{v}}^T\boldsymbol{\beta}}{\sigma}\right)\right\}, $$

which is the classical Weibull regression model, Lawless (2003, p. 296).

3.
Log-Fréchet regression model: this model is obtained as a special case from (20) when γ₀ = γ₁ = … = γ_p = 0 (or ξ = 1), and σ < 0. The survival function is

$$ S(y)=1-\exp \left\{\exp \left(-\frac{y-{\boldsymbol{v}}^T\boldsymbol{\beta}}{\mid \sigma \mid}\right)\right\}, $$

which is the log-Fréchet regression model (Alamoudi et al. 2017).

A sample (y₁, v₁), …, (y_n, v_n) of n independent observations is considered, where each random response is defined by y_i = min {log(t_i), log(c_i)}, where c_i is the censoring time. We assume non-informative censoring and independent observed lifetimes and censoring times. Let Ω and C denote the sets of individuals for which y_i is the log-lifetime and log-censoring respectively. The total log-likelihood function for the model parameters θ = (σ, β^T, γ^T)^T is given as

$$ \mathrm{\ell}\left(\boldsymbol{\theta} \right)=\sum \limits_{i\in \Omega}\log \left[f\left({y}_i\right)\right]+\sum \limits_{i\in C}\log \left[S\left({y}_i\right)\right], $$

(22)

where S(y_i) is the survival function in (20) and f(y_i) is the PDF of S(y_i) in (21). The MLE $ \hat{\boldsymbol{\theta}} $ of the parameter vector θ = (σ, β^T, γ^T)^T of the E-L {GW} regression model can be obtained by maximizing the log-likelihood function in (22).

Applications

In this section, we apply the E-L {GW} distribution to fit two skewed data and apply the generalized logistic regression to model two censored lifetime data. For the first two data sets, the fits of the E-L {GW} distribution are compared with those of other recent generalizations of logistic and Gumbel distributions, namely, the EEL distribution by Ghosh and Alzaatreh (2018), PRHL distribution by Gupta and Kundu (2010), GG by Cooray (2010), and transmuted extreme value (TEV) by Aryal and Tsokos (2009). Maximum likelihood method is used to estimate the model parameters in these applications.

The fitted distributions are compared by using the Akaike information criterion (AIC) and Kolmogorov-Smirnov (KS) statistic and its p-value. Data have a good fit when the values of AIC and KS are small, and the p-value of KS is large. The plots of the fitted PDFs of some models are demonstrated for visual comparison. Table 3 gives the descriptive statistics of the two data sets. For the third and fourth applications, the generalized logistic regression models are compared with some nested sub-models. The goodness of fits are compared using AIC, the corrected AIC (AICC), and Bayesian information criterion (BIC) statistics. The estimation process is straightforward, and the R programming language is used for the first two data sets, while SAS programming language is used for the third and fourth data sets.

Table 3 The summary statistics of the data sets

Full size table

Adiponectin data

The data consist of 116 measurements of Adiponectin from Patrício et al. (2018). The data set is fitted to the E-L {GW} model presented in Section 2 and EEL, PRHL, GG, and TEV distributions. Table 4 indicates that the p-values of KS statistics of the distributions provide adequate fit to the data. While the five distributions all have three parameters, E-L {GW} provides the best fit to the data set. Therefore, the E-L {GW} distribution is a better alternate distribution to EEL, PRHL, GG, and TEV distributions. The large skewness and kurtosis of the sample data in Table 3 and the wide range of theoretical skewness and kurtosis in Table 1 suggest that E-L {GW} should fit better than other comparable distributions. Figure 7 shows the estimated PDFs of the fitted distributions.

Table 4 MLEs, their standard errors (SEs) (in parentheses) and goodness of fit measures for the Adiponectin’s data set

Full size table

Turbocharger data

This data set contains the time to failure (10³ h) of turbocharger of a type of engine from Xu et al. (2003). These data were studied by Alzaatreh et al. (2016) and Cordeiro et al. (2019) using Weibull-gamma {log-logistic} and odd Lomax-Lomax distributions, respectively. For this data set, we fit E-L {GW}, EEL, PRHL, GG, and TEV models. The sample data is slightly left-skewed and slightly flatter than normal. It is anticipated that all distributions should fit properly. Table 5 shows all models fit the data set properly, while E-L {GW} has a better fit according to the p-values of the KS test statistics. As noticed, the shape parameter estimates of the four distributions that fit better to the data are not statistically significant. This is not surprising since the degree of left-skewness is minor. However, without shape parameter, symmetric distributions do not fit the data properly. Figure 8 shows the fitted models to the turbocharger data set.

Table 5 MLEs, their SEs (in parentheses) and goodness of fit measures for the turbocharger data set

Full size table

Generalized logistic regression model applied to censored class-H insulation data

The data are hours to failure of 40 motorettes with a new Class-H insulation run at 190 °C, 220 °C, 240 °C, and 260 °C by Nelson (2004). Midway between the inspection time when the failure is found, and the time of the previous inspection is considered the failure time. The test aims to estimate the median life of such insulation at its design temperature of 180 °C. A median life of over 20,000 h is desired. The data consist of (n = 40) observations (observed or right censored). The censoring indicator is 0 for censoring and 1 for observed. Each motorette is assigned one of the four test stress levels (10 motorettes in each level). Seven motorettes (1 in level 220, 1 in level 240, and 5 in level 260) are lost to follow-up and considered censored. The response variable y_i = log(t_i) is the logarithm of failure times (hours) t_i or the logarithm of the censoring time c_i, and the covariate v_i refers to the test stress levels (190, 220, 240, and 260).

The data are analyzed to determine the relationship between y and the level of test stress (v). The following regression model is considered:

$$ {y}_i={\beta}_0+{\beta}_1{v}_i^{\ast }+\sigma {z}_i, $$

where $ {v}_i^{\ast }=\left({v}_i-180\right) $ is the centered stress level obtained by subtracting the design stress value 180, and y_i follows the E-L {GW} distribution in (21) with the shape parameter $ {\xi}_i=\exp \left({\gamma}_0+{\gamma}_1{v}_i^{\ast}\right) $ for i = 1, …, 40. The model parameters in these applications are estimated by maximum likelihood method. Table 6 indicates that the AIC, AICC, and BIC statistic values of the E-L {GW} regression model are smaller than those of the other fitted models. The estimates β₁ and γ₁ are significant at the 5% level, and the levels of test stress have significant differences. The likelihood ratio (LR) statistic is used to compare the E-L {GW} regression model with some nested models. As shown in Table 6, the E-L {GW} model gives better fit to these data than the other nested models. Table 7 shows the LR statistics and the corresponding p-values. The implication of the results in Table 7 is that the E-L {GW} outperforms all the sub-models. Thus, one should use the E-L {GW} regression model to analyze the data.

Table 6 MLEs of the parameters (SE in parentheses), p-values bellow SE, and goodness of fit measures for the Class-H Insulation Data

Full size table

Table 7 LR statistics for the Class-H Insulation Data

Full size table

Generalized logistic regression model applied to censored heart transplant data

The data consist of n = 103 heart transplant patients of which 69 patients received transplants and 34 did not. The data were from Crowley and Hu (1977) and reported by Kalbfleisch and Prentice (2002). The data can be used to assess the effect of transplantation on patients’ survival. The response variable y_i = log(t_i) is the logarithm of survival time in days (the time from the enrollment until death or until the study ended). The covariates are v_i1 (age in years at acceptance) and v_i2 (transplant status: 1 = transplanted, 0 = not transplanted). The survival status or censoring indicator is 0 for alive and 1 for dead. Thus, the data are analyzed to investigate the relationship between survival time and the covariates age and transplant status. The following regression model is considered:

$$ {y}_i={\beta}_0+{\beta}_1{v}_{i1}+{\beta}_2{v}_{i2}+\sigma {z}_i, $$

where y_i follows the E-L {GW} distribution in (21) with the shape parameter ξ_i = exp(γ₀ + γ₁v_i1) for i = 1, …, 103. The model parameters in these applications are estimated by maximum likelihood method. Table 8 indicates that the AIC, AICC, and BIC statistic values of the E-L {GW} regression model are smaller than those of the other fitted models. The estimates β₁, β₂, and γ₁ are significant at the 5% level, and the status of transplant have significant differences. The LR statistic is used to compare the E-L {GW} regression model with some nested models. Table 9 shows the LR statistics and the corresponding p-values. As shown in Table 8, the E-L {GW} model gives the best goodness of fit statistic among all models.

Table 8 MLEs of the parameters (SEs in parentheses), p-values bellow SE and goodness of fit measures for the Heart transplant data set

Full size table

Table 9 LR statistics for the Heart transplant data set

Full size table

Summary and conclusions

The logistic and Gumbel (maximum and minimum) distributions have been widely studied, and many generalizations have been considered to model real-life applications. We propose a new generalization for the logistic and Gumbel distributions called the generalized exponential-logistic distribution. We study the structural properties of this new distribution and the relationships between the parameters and the mean, variance, skewness, and kurtosis. With only three parameters, the E-L {GW} can fit data with a very wide range of skewness (left and right) and kurtosis. The proposed method for developing generalized distributions has a high potential for practitioners. A generalized logistic regression model based on the E-L {GW} distribution is developed. Some existing regression models are sub-models, which makes the generalized logistic regression model a good choice for modeling a wide variety of response variables. Four real data sets are applied to illustrate the usefulness of the new distribution and its regression for fitting skewed data. The applications suggest that these generalized logistic and Gumbel distributions can fit highly skewed data sets effectively.

Availability of data and materials

Interested readers can contact the first author.

Abbreviations

AIC:: Akaike information criterion
AICC:: Corrected AIC
BGL:: Beta-generalized logistic
BIC:: Bayesian information criterion
CDF:: Cumulative distribution function
EEL:: Exponentiated-exponential logistic
E-L {GW}:: Exponential-logistic {Generalized Weibull}
ESGN:: Extended skew generalized normal
GG:: Generalized Gumbel
GN:: Generalized normal
HRF:: Hazard rate function
KS:: Kolmogorov-Smirnov
LR:: Likelihood ratio
MGF:: Moment generating function
MLEs:: Maximum likelihood estimates
PDF:: Probability density function
PRHL:: Proportional reversed hazard logistic
SD:: Standard deviation
SEs:: Standard errors
SLD:: Skew logistic distribution
SN:: Skew normal
TEV:: Transmuted extreme value

References

Alamoudi, H.H., Mousa, S.A., Baharith, L.A.: Estimation and application in log-Fréchet regression model using censored data. Int. J. Adv. Stat. Probability. 5(1), 23–31 (2017)
Article Google Scholar
Ali, M.M., Pal, M., Woo, J.: Some Exponentiated distributions. Korean Commun. Stat. 14(1), 93–109 (2007)
Google Scholar
Aljarrah, M.A., Famoye, F., Lee, C.: A new generalized normal distribution: properties and applications. Commun. Stat. Theory Methods. 48(18), 4474–4491 (2019)
Article MathSciNet Google Scholar
Aljarrah, M.A., Lee, C., Famoye, F.: On generating T-X family of distributions using quantile functions. J. Stat. Distrib. Appl. 1, 2 (2014)
Article Google Scholar
Alzaatreh, A., Lee, C., Famoye, F.: A new method for generating families of continuous distributions. Metron. 71(1), 63–79 (2013)
Article MathSciNet Google Scholar
Alzaatreh, A., Lee, C., Famoye, F.: Family of generalized gamma distributions: properties and applications. Hacettepe J. Math. Stat. 45, 869–886 (2016)
MathSciNet MATH Google Scholar
Aryal, R., Tsokos, P.: On the transmuted extreme value distribution with application. Nonlin. Anal. 71(12), 1401–1407 (2009)
Article MathSciNet Google Scholar
Azzalini, A.: A class of distributions which includes the normal ones. Scand. J. Stat. 12, 171–178 (1985)
MathSciNet MATH Google Scholar
Balakrishnan, N., Leung, M.Y.: Order statistics from the type I generalized logistic distribution. Commun. Stat. Simul. Comput. 17(1), 25–50 (1988)
Article Google Scholar
Choudhury, K., Abdul, M.M.: Extended skew generalized normal distribution. Metron. 69, 265–278 (2011)
Article MathSciNet Google Scholar
Cooray, K.: Generalized Gumbel distribution. J. Appl. Stat. 37(1), 171–179 (2010)
Article MathSciNet Google Scholar
Cordeiro, G.M., Afify, A.Z., Ortega, E.M.M., Suzuki, A.K., Mead, M.E.: The odd Lomax generator of distributions: properties, estimation and applications. J. Comput. Appl. Math. 347, 222–237 (2019)
Article MathSciNet Google Scholar
Crowley, J., Hu, M.: Covariance analysis of heart transplant data. J. Am. Stat. Assoc. 72, 27–36 (1977)
Article Google Scholar
Ghosh, I., Alzaatreh, A.: A new class of generalized logistic distribution. Commun. Stat. Theory Methods. 47(9), 2043–2055 (2018)
Article MathSciNet Google Scholar
Gradshteyn, I.S., Ryzhik, I.M.: Table of Integrals, Series, and Products, 6th edn. Academic Press, San Diego (2000)
MATH Google Scholar
Gumbel, E.J.: Statistics of Extremes. Columbia University Press, New York (1958)
Book Google Scholar
Gupta, R.D., Kundu, D.: Generalized logistic distributions. J. Appl. Stat. Sci. 18, 51–66 (2010)
MathSciNet Google Scholar
Johnson, N.L., Kotz, S., Balakrishnan, N.: Continuous Univariate Distributions: Vol. 1, 2nd edn. John Wiley and Sons, New York (1994)
MATH Google Scholar
Johnson, N.L., Kotz, S., Balakrishnan, N.: Continuous Univariate Distributions: Vol. 2, 2nd edn. Wiley, New York (1995)
MATH Google Scholar
Kalbfleisch, J.D., Prentice, R.L.: The Statistical Analysis of Failure Time Data, 2nd edn. Wiley, New York (2002)
Book Google Scholar
Lawless, J.F.: Statistical Models and Methods for Lifetime Data, 2nd edn. Wiley, Hoboken New York (2003)
MATH Google Scholar
Nadarajah, S.: The skew logistic distribution. Asta Adv. Stat. Anal. 93, 187–203 (2009)
Article MathSciNet Google Scholar
Nassar, M.M., Elmasry, A.: A study of generalized logistic distributions. J. Egypt. Math. Soc. 20(2), 126–133 (2012)
Article MathSciNet Google Scholar
Nelson, W.B.: Accelerated testing: statistical models, test plans, and data analyses. Wiley, New York (2004)
Google Scholar
Patrício, M., Pereira, J., Crisóstomo, J., Matafome, P., Gomes, M., Seiça, R., Caramelo, F.: Using Resistin, glucose, age and BMI to predict the presence of breast cancer. BMC Cancer. 18, 29 (2018). https://doi.org/10.1186/s12885-017-3877-1
Article Google Scholar
Pinheiro, E.C., Ferrari, S.L.: A comparative review of generalizations of the Gumbel extreme value distribution with an application to wind speed data. J. Stat. Comput. Simul. 86(11), 2241–2261 (2016)
Article MathSciNet Google Scholar
Prentice, R.L.: A generalization of the Probit and Logit methods for dose response curves. Biometrics. 32(4), 761–768 (1976)
Article MathSciNet Google Scholar
Stukel, T.: Generalized logistic models. J. Am. Stat. Assoc. 83(402), 426–431 (1988)
Article MathSciNet Google Scholar
Wahed, A.S., Ali, M.M.: The skew-logistic distribution. J. Stat. Res. 35, 71–80 (2001)
MathSciNet Google Scholar
Xu, K., Xie, M., Tang, L.C., Ho, S.L.: Application of neural networks in forecasting engine systems reliability. Appl. Soft Comput. 2(4), 255–268 (2003)
Article Google Scholar

Download references

Acknowledgements

The authors are very grateful to the handling Editor and the two anonymous reviewers for various constructive comments and suggestions that have greatly improved the presentation of the paper.

Funding

There is no funding support for the research work.

Author information

Authors and Affiliations

Department of Mathematics, Tafila Technical University, Tafila, 66110, Jordan
Mohammad A. Aljarrah
Department of Statistics, Actuarial & Data Sciences, Central Michigan University, Mt. Pleasant, MI, 48859, USA
Felix Famoye & Carl Lee

Authors

Mohammad A. Aljarrah
View author publications
You can also search for this author in PubMed Google Scholar
Felix Famoye
View author publications
You can also search for this author in PubMed Google Scholar
Carl Lee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The authors, viz. MAA, FF and CL with the consultation of each other carried out this work and drafted the manuscript together. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Mohammad A. Aljarrah.

Ethics declarations

Competing interests

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Proof of Theorem 1

The derivative of f_X(x) in (7) is given by

$$ {f}_X^{\prime }(x)=-\frac{1}{\sigma \mid \sigma \mid }{\left(\exp \left(\frac{x-\mu }{\sigma}\right)+1\right)}^{\xi -2}\exp \left(\left(\frac{x-\mu }{\sigma}\right)+\left\{1-{\left(\exp \left(\frac{x-\mu }{\sigma}\right)+1\right)}^{\xi }-1\right\}/\xi \right)w(x), $$

where $ w(x)=\exp \left(\frac{x-\mu }{\sigma}\right){\left(\exp \left(\frac{x-\mu }{\sigma}\right)+1\right)}^{\xi }-\xi \exp \left(\frac{x-\mu }{\sigma}\right)-1,-\infty <x<\infty $. By setting w(x) to zero and replacing $ \exp \left(\frac{x-\mu }{\sigma}\right) $ by u, we obtain (10). If ξ = { 0, 1}, then from (10) the mode is at u = 1, equivalently, x = μ. When ξ ≠ { 0, 1}, then the curve on the right hand side of (10), k(u) = u(u + 1)^ξ is convex in u (k^″(u) > 0 for all u > 0). Therefore, the curve k(u) and the line ξu + 1 on the left hand side of (10) can intersect at most twice. This means w(x) = 0 has at most two solutions, and so is f^′(x) = 0. Now, since $ \underset{x\to -\infty }{\lim }{f}_X(x)=\underset{x\to \infty }{\lim }{f}_X(x)=0 $, then f_X(x) has exactly one mode. Note that if we assume that f_X(x) has two modes (or more), then w(x) = 0 will have three solutions (two modes and local minimum). This is a contradiction with w(x) = 0 has at most two solutions, therefore, f_X(x) is unimodal. □

Proof of Corollary 2

When σ > 0, the derivative of the hazard function in (9) is given by

$$ {h}^{\prime }(x)=\frac{1}{\sigma^2}\exp \left(\frac{x-\mu }{\sigma}\right)\left\{\xi \exp \left(\frac{x-\mu }{\sigma}\right)+1\right\}{\left\{\exp \left(\frac{x-\mu }{\sigma}\right)+1\right\}}^{\xi -2}. $$

(23)

From (23), h^′(x) ≥ 0 for all − ∞ < x < ∞, therefore, h(x) is increasing whenever σ > 0. When σ < 0 and by using L’Hopital’s rule, we find that

$ \underset{x\to \infty }{\lim }h(x)=\underset{x\to \infty }{\lim}\frac{\frac{1}{\sigma}\exp \left(\frac{x-\mu }{\sigma}\right){\left\{\exp \left(\frac{x-\mu }{\sigma}\right)+1\right\}}^{\xi -1}+\left(\xi -1\right)\exp \left(2\frac{x-\mu }{\sigma}\right){\left\{\exp \left(\frac{x-\mu }{\sigma}\right)+1\right\}}^{\xi -2}}{\frac{1}{\sigma}\mid \sigma \mid \exp \left(\frac{x-\mu }{\sigma}\right){\left\{\exp \left(\frac{x-\mu }{\sigma}\right)+1\right\}}^{\xi -1}\exp \left(\frac{1}{\xi}\left[{\left\{\exp \left(\frac{x-\mu }{\sigma}\right)+1\right\}}^{\xi }-1\right]\right)}=\frac{1}{\mid \sigma \mid } $. □

Proof of Theorem 2

Let Z = (X − μ)/σ, and using binomial expansion, yields

$$ E\left({\left|X\right|}^n\right)\le \sum \limits_{i=0}^n\left(\begin{array}{l}n\\ {}i\end{array}\right){\left|\mu \right|}^{n-i}{\left|\sigma \right|}^iE{\left|Z\right|}^i, $$

(24)

where Z is E-L {GW} random variable with μ = 0 and σ = 1.

Now, using definition, we have

$$ E\left({\left|Z\right|}^i\right)=\underset{-\infty }{\overset{\infty }{\int }}{\left|z\right|}^i\exp (z){\left(1+\exp (z)\right)}^{\xi -1}\exp \left\{-\left[{\left(1+\exp (z)\right)}^{\xi }-1\right]/\xi \right\} dz, $$

$$ \kern2.04em =\underset{-\infty }{\overset{\infty }{\int }}{\left|\mathrm{z}\right|}^i\frac{\exp \left(\mathrm{z}\right)}{{\left(1+\exp \left(\mathrm{z}\right)\right)}^2}g\left(\mathrm{z}\right) dz, $$

(25)

where g(z) = (1 + exp(z))^ξ + 1 exp {−[(1 + exp(z))^ξ − 1]/ξ}. By using the elementary calculus, we find that $ \underset{-\infty <z<\infty }{\sup}\left\{g(z)\right\}={e}^{-1}{\left(1+\xi \right)}^{1/\xi +1} $. From (25) we obtain,

$$ E\left({\left|\mathrm{Z}\right|}^i\right)\le {e}^{-1}{\left(1+\xi \right)}^{1/\xi +1}E\left({\left|L\right|}^i\right), $$

(26)

where $ E\left({\left|L\right|}^i\right)=\underset{-\infty }{\overset{\infty }{\int }}{\left|\mathrm{z}\right|}^i\frac{\exp \left(\mathrm{z}\right)}{{\left(1+\exp \left(\mathrm{z}\right)\right)}^2} dz $ is the i^th absolute moment of standard logistic distribution.

Using (26) in (24), the result in (13) is obtained. □

Proof of Theorem 3

Let Z = (X − μ)/σ. We have

$$ E\left({X}^r\right)=\sum \limits_{n=0}^r\left(\begin{array}{c}r\\ {}n\end{array}\right){\mu}^{r-n}{\sigma}^nE\left({Z}^n\right). $$

Using Eq. (11), the moments E(Zⁿ) are obtained as

$$ E\left({Z}^n\right)=\sum \limits_{i=0}^{\infty}\sum \limits_{j=0}^{\infty }{\omega}_{i,j}E\left({L}_{j+1}^n\right). $$

(27)

Therefore, the result in (14) is obtained from (27) directly. □

Proof of Proposition 3

Let Z = (X − μ)/σ, then the MGF of Z can be written as

$$ {M}_Z(t)={\int}_{-\infty}^{\infty}\exp \left( zt+z\right){\left(1+\exp \left(\mathrm{z}\right)\right)}^{\xi -1}\exp \left(-\left[{\left(1+\exp \left(\mathrm{z}\right)\right)}^{\xi }-1\right]/\xi \right) dz. $$

(28)

On setting u = [(1 + exp(z))^ξ − 1]/ξ in (28), we obtain

$$ {M}_Z(t)={\int}_0^{\infty }{\left({\left(1+\xi u\right)}^{1/\xi }-1\right)}^t\exp \left(-u\right) du. $$

(29)

Using the generalized binomial theorem $ {\left(x+y\right)}^{\alpha }=\sum \limits_{i=0}^{\infty}\frac{\Gamma \left(\alpha +1\right)}{\Gamma \left(\alpha -i+1\right)\Gamma \left(i+1\right)}{x}^i{y}^{\alpha -i},\mid x\mid <\mid y\mid $, (29) can be written as

$$ {M}_Z(t)=\sum \limits_{i=0}^{\infty}\frac{\Gamma \left(t+1\right){\left(-1\right)}^i}{\Gamma \left(t-i+1\right)\Gamma \left(i+1\right)}{\int}_0^{\infty }{\left(1+\xi u\right)}^{\left(t-i\right)/\xi}\exp \left(-u\right) du. $$

By using formula (3.382–4) in Gradshteyn and Ryzhik (2000), we obtain

$$ {M}_Z(t)=\sum \limits_{i=0}^{\infty}\frac{\Gamma \left(t+1\right){\left(-1\right)}^i}{\Gamma \left(t-i+1\right)\Gamma \left(i+1\right)}{\xi}^{\left(t-i\right)/\xi}\exp \left(1/\xi \right)\Gamma \left(\left(t-i\right)/\xi +1,1/\xi \right). $$

(30)

Now, the MGF of the X = μ + σZ is defined as

$$ {M}_X(t)=E\left(\exp (Xt)\right)=\exp \left(\mu t\right){M}_Z\left(\sigma t\right). $$

(31)

Using (31) with (30), the result in (15) is obtained.

Note that the values of the augment t that makes (15) exist can be obtained directly from (29) by noting that u < ((1 + ξu)^1/ξ − 1) < e^u when u > 0, 0 < ξ < 1, and 0 < ((1 + ξu)^1/ξ − 1) < u when u > 0, ξ ≥ 1. □

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Aljarrah, M.A., Famoye, F. & Lee, C. Generalized logistic distribution and its regression model. J Stat Distrib App 7, 7 (2020). https://doi.org/10.1186/s40488-020-00107-8

Download citation

Received: 11 March 2020
Accepted: 06 August 2020
Published: 07 September 2020
DOI: https://doi.org/10.1186/s40488-020-00107-8

Keywords

2010 Mathematics subject classification

62E15, 62F10, 62 J12, 62P10

Generalized logistic distribution and its regression model

Abstract

Introduction

The exponential-logistic {generalized Weibull} (E-L {GW}) distribution

Properties of exponential-logistic {generalized Weibull} distribution

Estimation and simulation

Estimation

Simulation

Generalized logistic regression model based on E-L {GW}

Applications

Adiponectin data

Turbocharger data

Generalized logistic regression model applied to censored class-H insulation data

Generalized logistic regression model applied to censored heart transplant data

Summary and conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s Note

Appendix

Appendix

Proof of Theorem 1

Proof of Corollary 2

Proof of Theorem 2

Proof of Theorem 3

Proof of Proposition 3

Rights and permissions

About this article

Cite this article

Share this article

Keywords

2010 Mathematics subject classification