Skip to main content

The Kumaraswamy transmuted Pareto distribution


In this work, a new five-parameter Kumaraswamy transmuted Pareto (KwTP) distribution is introduced and studied. We discuss various mathematical and statistical properties of the distribution including obtaining expressions for the moments, quantiles, mean deviations, skewness, kurtosis, reliability and order statistics. The estimation of the model parameters is performed by the method of maximum likelihood. We compare the distribution with few other distributions to show its versatility in modeling data with heavy tail.


Furtherance to the work by Eugene et al. (2002), who proposed and defined the beta-generated class of distributions for a continuous random variable, derived from the logit of the beta random variable, many statistical distributions have been proposed and studied by numerous authors. According to Eugene et al. (2002), suppose X is a random variable with cumulative distribution function (CDF) F(x), the CDF for the beta-generated family is obtained by applying the inverse probability transformation to the beta density function. The CDF for the beta-generated family of distributions is given by

$$ G(x)=\frac{1}{B(u,v)}\int_{0}^{F(x)}t^{u-1}(1-t)^{v-1}\,dt,\,\,0<u, v < \infty, $$

where B(u,v)=Γ(u)Γ(v)/Γ(u+v). The corresponding probability density function (PDF) is given by

$$ g(x)=\frac{1}{B(u,v)}[F(x)]^{u-1}[1-F(x)]^{v-1}\left[\frac{d}{dx}F(x)\right]. $$

A closely related generalized distribution similar to using the beta random variable as the baseline distribution as defined in Eqs. (1) and (2), is the work due to Cordeiro and de Castro (2011), where the authors combined the works of Eugene et al. (2002) and Jones (2009) by replacing the baseline distribution with the Kumaraswamy (Kw) distribution to construct a new class of Kw-G distributions. See also Bourguignon et al. (2013), Cordeiro et al. (2010), and Elbatal (2013). The work by Shams (2013) also introduced and studied another type of generalization of the Kumaraswamy distribution.

A new approach of constructing and generalizing statistical distributions was proposed by Shaw and Buckley (2007,2009). Their concept is defined by transmutation maps that are functional composition of the cumulative distribution function of one distribution with the quantile function of another distribution. One of the maps is referred to as the Quadratic Rank Transmutation Map (QRTM). Specifically, given two distributions with a common sample space with CDFs F 1 and F 2, we can define a pair of general rank transmutation map

$$G_{R_{12}}(u)=F_{2}\left(F_{1}^{-1}(u)\right), ~~G_{R_{21}}(u)=F_{1}\left(F_{2}^{-1}(u)\right), $$

where the pair of maps \(\{G_{R_{12}}(u)\), \(G_{R_{21}}(u)\}\) takes the unit interval I [0,1] into itself, and under suitable assumptions are mutual inverses, satisfying the conditions G ij (0)=0 and G ij (1)=1. A useful additional condition requires that the rank transmutation maps are monotone and continuously differentiable, otherwise transmuted density may be discontinuous.

In this paper, we use the transmutation map approach outlined in Shaw and Buckley (2007), and also defined in Shaw and Buckley (2009). This approach considered for |λ|≤1,

$$G_{R_{12}}(u)=u+\lambda u (1-u). $$

The consequence of the above results in the relationship

$$\begin{array}{*{20}l} F_{2}(x)=(1+\lambda)F_{1}(x)-\lambda F_{1}(x)^{2}, \end{array} $$

which on differentiation yields,

$$ f_{2}(x)=f_{1}(x)[(1+\lambda)-2\lambda F_{1}(x)], $$

where f 1(x) and f 2(x) are the corresponding PDFs associated with F 1(x) and F 2(x) respectively.

We will use the above formulation for a pair of distributions F(x) and G(x), where G(x) is a submodel or the baseline distribution of F(x). By a formal definition, a random variable X is said to have a transmuted probability distribution with CDF F(x) if

$$\begin{array}{*{20}l} F(x) = (1+\lambda) G(x)-\lambda G(x)^{2}, \qquad |\lambda|\leq 1 \end{array} $$

The corresponding PDF of the transmuted probability distribution is

$$\begin{array}{*{20}l} f(x) = g(x)(1+\lambda -2\lambda G(x)). \end{array} $$

Note that when λ=0, the above relation reduces to the distribution (G(x)) of the baseline random variable.

Following the work by Aryal and Tsokos (2009) on the transmuted extreme value distribution, a number of transmuted family of distributions have been proposed and discussed by many authors in the literature. A few of them are the work by Aryal and Tsokos (2011): transmuted Weibull distribution; Khan and King (2013): transmuted modified Weibull distribution; Ashour and Eltehiwy (2013): transmuted Lomax distribution; Hady and Ebraheim (2014): exponentiated transmuted Weibull distribution; Hussian (2014): transmuted exponentiated gamma distribution; Merovci and Puka (2014): transmuted Pareto distribution; Owokolo et al. (2015): transmuted exponential distribution, and Khan et al. (2016): transmuted Kumaraswamy distribution; Afify et al. (2014): transmuted complementary Weibull geometric distribution; Yousof et al. (2015): the transmuted exponentiated generalized-G family of distributions, and a host of many others. Tahir and Cordeiro (2016) provided a comprehensive list of contributed works on transmuted distributions. This paper is outlined as follows. In Section 2, we introduce the Kumaraswamy transmuted Pareto distribution. Some of its mathematical properties are discussed in Sections 3, 4, 5, 6, 7, 8 and 9. Section 10 discusses the estimation of the parameters of the distribution by the method of maximum likelihood. We provide the application of the distribution to two real life data in Section 11. In Section 12 we have simulation, and concluding remarks in Section 13.

The Kumaraswamy transmuted Pareto (KwTP) distribution

Following the transmutation map by Shaw and Buckley (2007) as defined in Eq. (3), we combine the Kumaraswamy distribution and the transmuted Pareto to form what we refer to as the Kumaraswamy transmuted Pareto (KwTP) distribution. Without loss of generality, we give a brief description of these component distributions in what follows.

2.1 Component distributions

2.1.1 The Kumaraswamy-Kw distribution

Kumaraswamy (1980) proposed and discussed a probability distribution for handling double-bounded random processes with varied hydrological applications. Let T be a random variable with the Kumaraswamy’s distribution. The PDF and CDF are defined, respectively, as

$$f_{T}(t)=a b t^{a-1}(1-t^{a})^{b-1}, \quad 0<t<1, $$


$$ F_{T}(t)=1-(1-t^{a})^{b}, \quad 0<t<1, $$

where both a>0 and b>0 are shape parameters. A generalized form of this distribution is obtained when t is replaced by the CDF G(t) of another random variable, to have what is referred to as the Kumaraswamy-G (Kw-G) distribution. The beta and Kumaraswamy distributions share similar properties. For example, the Kumaraswamy distribution, also referred to as the minimax distribution, is unimodal, uniantimodal, increasing, decreasing or constant depending on the values of its parameters. A more detailed description, background, genesis, and properties of Kumaraswamy distribution are outlined in Jones (2009), where the author highlighted several advantages of the Kumaraswamy distribution over the beta distribution, namely; its simple normalizing constant, simple explicit formulas for the distribution and quantile functions, and simple random variate generation procedure. There have been many contributions to the theory and applications of the Kumaraswamy distribution in the literature. See for example the works by Cordeiro et al. (2010): Kumaraswamy Weibull; Cordeiro et al. (2012): Kumaraswamy Gumbel; Pascoa et al. (2011): Kumaraswamy generalized gamma; Saulo (2012): Kumaraswamy Birnbaum-Saunders; Akinsete et al. (2014): Kumaraswamy geometric, and a host of many others.

2.1.2 The Pareto and the transmuted Pareto (TP) distributions

The Pareto distribution was named after a sociologist Vilfredo Pareto. It is used in modelling the distribution of incomes and other financial variables, and in the description of social and other phenomena. Many distributions have been derived using the Pareto distribution. A few examples are the gamma-Pareto distribution by Alzaatreh et al. (2012), the beta exponentiated Pareto distribution by Zea et al. (2012), and the beta-Pareto distribution by Akinsete et al. (2008).

The PDF and CDF of the Pareto distribution are given respectively by

$$r(x) = \frac{k\theta^{k}}{x^{k+1}}, $$


$$ R(x) = 1-(\theta/x)^{k}, ~~ k>0, ~~ \theta>0, ~~ x\geq \theta. $$

Using the quadratic rank transmutation map, Merovci and Puka (2014) generalized the Pareto distribution to obtain what the authors called the transmuted Pareto distribution. They provided a comprehensive description of the mathematical properties of the distribution and its application in modeling real life data. Formally, a random variable X is said to have the transmuted Pareto distribution with scale parameter θ>0, shape parameter α>0, and the transmuted parameter λ (|λ|≤1), if its PDF is given by

$$ g(x;\alpha, \theta, \lambda)=\frac{\alpha \theta^{\alpha}}{x^{\alpha+1}}\bigg[1-\lambda+2\lambda \left(\frac{\theta}{x}\right)^{\alpha}\bigg], $$

where θ is the (necessarily positive) minimum possible value of X. The corresponding CDF of the transmuted Pareto distribution is given by

$$ G(x;\alpha, \theta, \lambda)=\bigg[1-\left(\frac{\theta}{x}\right)^{\alpha}\bigg]\bigg[1+\lambda\left(\frac{\theta}{x}\right)^{\alpha}\bigg]. $$

The distribution in Eq. (5) becomes the Pareto when λ=0.

2.2 The KwTP distribution

By substituting Eq. (5) into Eq. (4), the cumulative distribution function of the five-parameter Kumaraswamy transmuted Pareto (KwTP) distribution is given by

$$\begin{array}{@{}rcl@{}} F(x;\alpha, \theta, \lambda, a, b)&=&1-[1- G(x;\alpha, \theta, \lambda)^{a}]^{b} \\ &=&1-\Bigg\{1-\bigg[1-\left(\frac{\theta}{x}\right)^{\alpha}\bigg]^{a}\bigg[1+\lambda\left(\frac{\theta}{x}\right)^{\alpha}\bigg]^{a}\Bigg\}^{b}, \end{array} $$

where x>θ, α>0, a>0, and b>0 are shape parameters, θ>0 is the scale parameter, while |λ|≤1 is the transmuted parameter. The corresponding probability density function of the KwTP distribution may be expressed as

$$\begin{array}{@{}rcl@{}} f(x;\alpha, \theta, \lambda, a, b)&=&ab g(x)G(x)^{a-1}[1-G(x)^{a}]^{b-1} \\ &=& \frac{ab\alpha \theta^{\alpha}}{x^{\alpha+1}}\bigg[1-\lambda+2\lambda \left(\frac{\theta}{x}\right)^{\alpha}\bigg] \Bigg\{\bigg[1-\left(\frac{\theta}{x}\right)^{\alpha}\bigg]\bigg[1+\lambda\left(\frac{\theta}{x}\right)^{\alpha}\bigg] \Bigg\}^{a-1} \\ {}&&\times \Bigg\{1-\bigg[1-\left(\frac{\theta}{x}\right)^{\alpha}\bigg]^{a}\bigg[1+\lambda\left(\frac{\theta}{x}\right)^{\alpha}\bigg]^{a}\Bigg\}^{b-1}. \end{array} $$

Figure 1 shows the graphs of the PDF of KwTP distribution for selected values of the parameters. The graphs indicate that KwTP is unimodal, and may be used in modeling different phenomena exhibited by real world data. Figure 2 also shows the graphs of the CDF for various values of the parameters.

Fig. 1

PDF of KwTP distribution for selected values of the parameters

Fig. 2

CDF of KwTP distribution for selected values of the parameters

Special cases of KwTP

Some special submodels may be obtained from KwTP by varying the values of the parameters. Here is a list of few these submodels.

  • When λ=0, KwTP reduces to KwP distribution by Bourguignon et al. (2013) with PDF

    $$\begin{array}{*{20}l} f(x;\alpha, \theta, a, b)= \frac{ab\alpha \theta^{\alpha}}{x^{\alpha+1}} \times \Bigg\{\bigg[1-\left(\frac{\theta}{x}\right)^{\alpha}\bigg] \Bigg\}^{a-1} \Bigg\{1-\bigg[1-\left(\frac{\theta}{x}\right)^{\alpha}\bigg]^{a}\Bigg\}^{b-1}. \end{array} $$
  • When a=b=1, KwTP reduces to the transmuted Pareto distribution by Merovci and Puka (2014) with PDF

    $$\begin{array}{*{20}l} f(x;\alpha, \theta, \lambda)=\frac{\alpha \theta^{\alpha}}{x^{\alpha+1}}\bigg[1-\lambda+2\lambda \left(\frac{\theta}{x}\right)^{\alpha}\bigg]. \end{array} $$
  • By setting b=1 and λ=0, the KwTP reduces to the exponentiated Pareto distribution, defined by Nadarajah (2005), with PDF

    $$\begin{array}{*{20}l} f(x;\alpha, \theta, a) = \frac{a\alpha \theta^{\alpha}}{x^{\alpha+1}} \times \Bigg\{\bigg[1-\left(\frac{\theta}{x}\right)^{\alpha}\bigg] \Bigg\}^{a-1}. \end{array} $$
  • When a=1=b and λ=0, KwTP reduces to the classical Pareto distribution with PDF

    $$\begin{array}{*{20}l} f(x;\alpha, \theta) = \frac{\alpha \theta^{\alpha}}{x^{\alpha+1}}. \end{array} $$

Mixture representation

In this section, we provide alternative and useful expressions for the PDF and the CDF of KwTP distribution. Let X be a random variable having the KwTP density (7).

For any b>0 real non-integer and |z|<1, consider the power series

$$\begin{array}{*{20}l} \left(1- z\right)^{b-1}= \sum_{k=0}^{\infty} \frac{(-1)^{k} \Gamma(b)}{k! \Gamma(b-k)}z^{k}. \end{array} $$

As mentioned by Afify et al. (2016), applying the power series (8) to Eq. (7), we can write the PDF as a mixture of exponentiated-transmuted Pareto with the power parameter (k+1)a as

$$\begin{array}{*{20}l} f(x)= \sum_{k=0}^{\infty} v_{k} g(x) G(x)^{(k+1)a-1}, \end{array} $$


$$\begin{array}{*{20}l} v_{k}= \frac{(-1)^{k} ab \Gamma(b)}{k! \Gamma(b-k)}=ab\binom{b-1}{k}(-1)^{k}, \end{array} $$

and g(x) and G(x) are the PDF and CDF of the transmuted Pareto distribution respectively.

The PDF (9) can be expressed as a mixture of exp-G densities

$$\begin{array}{*{20}l} f(x)= \sum_{k=0}^{\infty} w_{k} \gamma_{k} g(x) G(x)^{\gamma_{k}-1}, \end{array} $$

where \(w_{k}=\frac {v_{k}}{(k+1)a}\), and γ k =(k+1)a. By integrating (10), we get a similar mixture representation for the CDF

$$\begin{array}{*{20}l} F(x)= \sum_{k=0}^{\infty} w_{k} G(x)^{(k+1)a}, \end{array} $$

where G(x) is the CDF of the transmuted Pareto distribution with power parameter (k+1)a. Using the binomial expansions of the expressions

$$\begin{array}{@{}rcl@{}} \left[\!1-\left(\!1-(\theta/x)^{\alpha}\!\right) \left(\!1+\lambda (\theta/x)^{\alpha}\!\right) \!\right]^{b-1}, \quad \left(\!1-(\theta/x)^{\alpha}\!\right)^{(k+1)a-1}~~ \text{and} \quad \left(1+\lambda (\theta/x)^{\alpha}\right)^{(k+1)a-1}, \end{array} $$

the PDF of the KwTP distribution can be expressed as

$$\begin{array}{*{20}l} f(x)&= \frac{ab\alpha \theta^{\alpha}}{x^{\alpha+1}}\bigg[1-\lambda+2\lambda \left(\frac{\theta}{x}\right)^{\alpha}\bigg] \times \Bigg\{\bigg[1-\left(\frac{\theta}{x}\right)^{\alpha}\bigg]\bigg[1+\lambda\left(\frac{\theta}{x}\right)^{\alpha}\bigg] \Bigg\}^{a-1} \\ &\quad\times \Bigg\{1-\bigg[1-\left(\frac{\theta}{x}\right)^{\alpha}\bigg]^{a}\bigg[1+\lambda\left(\frac{\theta}{x}\right)^{\alpha}\bigg]^{a}\Bigg\}^{b-1} \\ &= \sum_{k=0}^{\infty} \frac{ab\alpha \theta^{\alpha}}{x^{\alpha+1}}{b-1\choose k}(-1)^{k}\bigg[1-\lambda+2\lambda \left(\frac{\theta}{x}\right)^{\alpha}\bigg] \\ &\quad\times \bigg[1-\left(\frac{\theta}{x}\right)^{\alpha}\bigg]^{(k+1)a-1}\bigg[1+\lambda\left(\frac{\theta}{x}\right)^{\alpha}\bigg]^{(k+1)a-1} \\ &=\sum_{k=0}^{\infty} \frac{ab\alpha \theta^{\alpha}}{x^{\alpha+1}}{b-1\choose k}(-1)^{k} \bigg[1-\lambda+2\lambda \left(\frac{\theta}{x}\right)^{\alpha}\bigg] \\ &\quad\times \sum_{i=0}^{\infty} \sum_{j=0}^{\infty}{(k+1)a-1\choose i}{(k+1)a-1\choose j}(-1)^{i}\lambda^{j}\left(\frac{\theta}{x}\right)^{\alpha(i+j)}, \end{array} $$

where α>0, a>0, b>0, θ>0, |λ|≤1 and x>θ. Let

$$w^{k}_{ij}={(k+1)a-1\choose i}{(k+1)a-1\choose j}(-1)^{i}\lambda^{j}. $$

By setting i+j=m, and \(w_{k,m}^{*}=\sum \limits _{i,j:i+j=m}^{} w^{k}_{ij}= \sum \limits _{i=0}^{m}\sum \limits _{j=0}^{m-i} w^{k}_{ij}\), we can further express the PDF of the KwTP in the form

$$\begin{array}{*{20}l} f(x)= \frac{ab\alpha \theta^{\alpha}}{x^{\alpha+1}}\bigg[1-\lambda+2\lambda \left(\frac{\theta}{x}\right)^{\alpha}\bigg] \sum_{m=0}^{\infty}u_{m} \left(\frac{\theta}{x}\right)^{\alpha m}, \end{array} $$


$$\begin{array}{*{20}l} u_{m}=\sum_{k=0}^{\infty}w_{k,m}^{*}{b-1\choose k}(-1)^{k}. \end{array} $$

In particular,

$$u_{0}=\sum_{k=0}^{\infty}w_{k,0}^{*}{b-1\choose k}(-1)^{k}, ~~\text{where}~~ w_{k,0}^{*}=\sum_{i=0}^{0} \sum_{j=0}^{0-i}w_{ij}^{k}.$$

That is, m=0 i=0,j=0. From \(w_{ij}^{k}={(k+1)a-1\choose i}{(k+1)a-1\choose j}(-1)^{i}\lambda ^{j}\) and i=0,j=0, we have w 0,0=1. Thus, \(w_{k,0}^{*}=1\), and hence, \(u_{0}=\sum \limits _{k=0}^{\infty }{b-1\choose k}(-1)^{k}.\)

Similarly, using (11) we can express F(x) as

$$\begin{array}{*{20}l} F(x)&= \sum_{k=0}^{\infty} w_{k} G(x)^{(k+1)a} \\ &= \sum_{k=0}^{\infty} w_{k} \left[1-\left(\frac{\theta}{x}\right)^{\alpha}\right]^{(k+1)a} \times \left[1+\lambda\left(\frac{\theta}{x}\right)^{\alpha}\right]^{(k+1)a} \\ &=\sum_{k=0}^{\infty} \sum_{i=0}^{\infty} \sum_{j=0}^{\infty}w_{k}(-1)^{i}\lambda^{j}{(k+1)a\choose i}{(k+1)a\choose j}\left(\frac{\theta}{x}\right)^{\alpha(i+j)}. \end{array} $$

Let \(v^{k}_{ij}={(k+1)a\choose i}{(k+1)a\choose j}(-1)^{i}\lambda ^{j}\), where in particular \(v^{k}_{00}=1\).

Setting i+j=m, \(v_{k,m}^{*}=\sum \limits _{i,j:i+j=m}^{}v^{k}_{ij}= \sum \limits _{i=0}^{m}\sum \limits _{j=0}^{m-i}v^{k}_{ij}\), \(v=\left (\frac {\theta }{x}\right)^{\alpha }\), and

$$\begin{array}{*{20}l} \rho_{m} =\sum_{k=0}^{\infty}v_{k,m}^{*}w_{k}, \end{array} $$

we can express the CDF of the KwTP as,

$$\begin{array}{*{20}l} F(x)&=\sum_{m=0}^{\infty}\sum_{k=0}^{\infty}v_{k,m}^{*} w_{k}\left(\frac{\theta}{x}\right)^{\alpha m} \\ &=\sum_{m=0}^{\infty} \left(\frac{\theta}{x}\right)^{\alpha m} \rho_{m} \\ &=\sum_{m=0}^{\infty} \rho_{m}v^{m}, \end{array} $$

where ρ 0 may be expressed as

$$\begin{array}{*{20}l} \rho_{0}&=\sum_{k=0}^{\infty}v^{*}_{k,0}w_{k}=\sum_{k=0}^{\infty}v^{k}_{00}w_{k}=\sum_{k=0}^{\infty}w_{k} \\ &=\sum_{k=0}^{\infty}\frac{v_{k}}{a(k+1)}=\sum_{k=0}^{\infty}(-1)^{k}\frac{ab{b-1\choose k}}{a(k+1)} \\ &=b\sum_{k=0}^{\infty}\frac{(-1)^{k}}{k+1}{b-1\choose k}. \end{array} $$

Quantile function

The quantile function of a distribution is the real solution of F(x q )=q for 0≤q≤1. It is defined as the inverse function of the CDF of a random variable X. Using Eq. (6), the quantiles (x q ) of KwTP distribution may be expressed in the form

$$\begin{array}{*{20}l} x_{q}=F^{-1}(q)=\theta\left(\frac{\lambda -1+\sqrt{\left(1+\lambda \right)^{2}-4 \lambda\left(\left\{1-(1-q)^{\frac{1}{b}}\right\}^{\frac{1}{a}}\right)}}{2\lambda}\right)^{-\frac{1}{\alpha}}. \end{array} $$

For specified values of parameters α,θ,λ,a and b, and a set of random number u(0,1), we can generate a random variable X having the KwTP distribution (6), and satisfying

$$\begin{array}{*{20}l} X=Q_{u}=F^{-1}(u)=\theta\left(\frac{\lambda -1+\sqrt{\left(1+\lambda \right)^{2}-4 \lambda\left(\left\{1-(1-u)^{\frac{1}{b}}\right\}^{\frac{1}{a}}\right)}}{2\lambda}\right)^{-\frac{1}{\alpha}}. \end{array} $$

In general, the quantile function enables one to find the relationship between one random variable and another random variable. For example, if Y is a random variable with CDF G(y), we can show that XKw T P(x|α,θ,λ,a,b) if

$$\begin{array}{*{20}l} X=\theta\left(\frac{2\lambda}{\lambda -1+\sqrt{\left(1+\lambda \right)^{2}-4 \lambda\left(\left\{1-(G(y))^{\frac{1}{b}}\right\}^{\frac{1}{a}}\right)}}\right)^{\frac{1}{\alpha}}. \end{array} $$

Moment functions

We define in this section, various moment expressions for the KwTP distribution. These consist of the raw moments and their related functions, e.g., skewness and kurtosis, and the moment generating function. By definition, the rth-moment of the KwTP random variable is defined by

$$\begin{array}{*{20}l} E(X^{r})&=\mu_{r}^{'}(\alpha, \theta,\lambda,a,b) = \int_{\theta}^{\infty} x^{r} f(x) dx \\ &=ab \alpha \theta^{\alpha}\left[(1-\lambda)\sum_{m=0}^{\infty}u_{m} \theta^{\alpha m} \int_{\theta}^{\infty}x^{r-1-\alpha-\alpha m} dx+2\lambda \sum_{m=0}^{\infty}u_{m} \theta^{\alpha +\alpha m} \right. \\ &\quad\times \left.\int_{\theta}^{\infty} x^{r-1-2\alpha-\alpha m} dx {\vphantom{\sum\limits_{\theta}^{\infty}}}\right] \\ &=ab \alpha \theta^{\alpha}\Bigg\{(1-\lambda)\sum_{m=0}^{\infty}u_{m} \theta^{\alpha m}\bigg[\frac{x^{r-\alpha-\alpha m }}{r-\alpha-\alpha m}\bigg]\Bigg|_{\theta}^{\infty} \\ & \quad+2\lambda \sum_{m=0}^{\infty}u_{m} \theta^{\alpha +\alpha m} \bigg[\frac{x^{r-2\alpha-\alpha m }}{r-2\alpha -\alpha m}\bigg]\Bigg|_{\theta}^{\infty}\Bigg\} \\ &=(1-\lambda)ab \alpha \theta^{r}\sum_{m=0}^{\infty}\frac{u_{m}}{\alpha m+\alpha-r} + 2\lambda ab \alpha \theta^{r} \sum_{m=0}^{\infty}\frac{u_{m}}{\alpha m+2\alpha-r} \\ &= ab \alpha \theta^{r} \sum_{m=0}^{\infty} u_{m} \left[\frac{1-\lambda}{\alpha +\alpha m -r}+\frac{2\lambda}{2\alpha +\alpha m -r}\right],~~~r<\alpha. \end{array} $$

In particular if λ=0,a=b=1, we have the Pareto distribution. In this case, we must have m=0,u 0=1, so that the moment expression becomes

$$\begin{array}{*{20}l} E(X^{r})&=\mu_{r}^{'}(\alpha, \theta,0,1,1) = \frac{\alpha\theta^{r}}{\alpha-r},~ r<\alpha. \end{array} $$

It is interesting to note that the mean of X is

$$\begin{array}{*{20}l} E(X)&=\mu_{1}^{'}(\alpha, \theta,0,1,1) = \frac{\alpha \theta }{\alpha-1}, ~\alpha >1. \end{array} $$

By similar procedure, the moment generating function is defined as follows:

$$\begin{array}{*{20}l} M_{X}(t) &=\int_{\theta}^{\infty}e^{tx}f(x)dx \\ &=ab\alpha \theta^{\alpha}\left[(1-\lambda)\sum_{m=0}^{\infty}u_{m} \theta^{\alpha m} \int_{\theta}^{\infty}e^{tx}x^{-\alpha-\alpha m-1} dx+2\lambda \sum_{m=0}^{\infty}u_{m} \theta^{\alpha +\alpha m}\right. \\ &\quad\left.\times \int_{\theta}^{\infty}e^{tx}x^{-\alpha m-2\alpha -1} dx {\vphantom{\sum_{m=0}^{\infty}u_{m}}}\right] \\ &= ab \alpha \bigg[(1-\lambda)\sum_{m=0}^{\infty}u_{m} (-t\theta)^{\alpha+\alpha m} \Gamma(-\alpha -\alpha m, -t \theta)\\ &\quad+2\lambda \sum_{m=0}^{\infty}u_{m} (-t\theta)^{2 \alpha+\alpha m}\Gamma(-2\alpha -\alpha m, -t \theta) \bigg], \\ &=ab \alpha \sum_{m=0}^{\infty}u_{m} (-t\theta)^{\alpha+\alpha m}\big[(1-\lambda)\Gamma(-\alpha -\alpha m, -t \theta)\\ &\quad+2\lambda (-t\theta)^{\alpha }\Gamma(-2\alpha -\alpha m, -t \theta) \big], \qquad t<0, \end{array} $$

where Γ(.,.) denotes the upper incomplete gamma function. That is, \(\Gamma (s,\theta)=\int _{\theta }^{\infty } e^{-t} t^{s-1} dt\).

The variance, skewness and kurtosis of the KwTP distribution can be calculated from (18) using the relations given below.

$$\begin{array}{@{}rcl@{}} Variance(X) &=&E(X^{2})-[E(X)]^{2},~\alpha>2 \\ Skewness(X) &=&\frac{E(X^{3})-3E(X)E(X^{2})+2E^{3}(X)}{Var^{3/2}(X)},~\alpha >3 \\ Kurtosis(X) &=&\frac{E(X^{4})-4E(X)E(X^{3})+6E(X^{2})E^{2}(X)-3E^{4}(X)}{ Var^{2}(X)}, ~\alpha >4. \end{array} $$

We provide in Table 1, the numerical measures of the median, mean, variance, skewness and kurtosis of the KwTP distribution for selected values of the parameters, to illustrate the effects of the parameters on these measures. We had arbitrarily fixed the value of parameter θ at 0.5.

Table 1 Median, mean, variance, skewness, kurtosis for selected values of the parameters

The values of the statistics; mean, median, variance, skewness and kurtosis increase or decrease according to the changes in parameter values. For example, these statistics decrease as α or b increases, while other parameters are held constant. We also observe that each statistic in the set consisting of the mean, median and variance increases by increasing the value of parameter a, while each statistic in the set decreases as λ increases. On the other hand, the skewness and kurtosis both decrease for increasing value of a, while the values in this set both increase, and then decrease, for increasing value of λ. The effects of parameters a and b on the mean, variance, skewness and kurtosis for given values of λ,θ and α are displayed in Figs. 3 and 4 respectively.

Fig. 3

Plots of mean and variance of KwTP distribution

Fig. 4

Plots of skewness and kurtosis of KwTP distribution

In addition to the traditional measures of skewness and kurtosis, we also obtain the Bowley’s measure of skewness, introduced by Bowley (1920), and the Moors’ measure of kurtosis, introduced by Moors (1988). These measures are quartile alternatives to the traditional skewness and kurtosis, and are more robust estimation of these measures, see Kenney and Keeping (1962).

  • Bowley skewness (B sk ): By definition, the Bowley’s measure of skewness is expressed as

    $$ B_{sk} =\frac{Q_{3}+Q_{1}-2Q_{2}}{Q_{3}-Q_{1}}= \frac{Q_{0.75}-2Q_{0.5}+Q_{0.25}}{Q_{0.75}-Q_{0.25}}. $$
  • Moors kurtosis M kur : This is defined as

    $$M_{kur}=\frac{(E_{7}-E_{5})+(E_{3}-E_{1})}{E_{6} - E_{2}}= \frac{Q_{0.875}-Q_{0.625}+Q_{0.375}-Q_{0.125}}{Q_{0.75}-Q_{0.25}}, $$

where Q i is the i th quartile for i=1,2,3, and E i =F −1(i/8), i=1,2,,7 represents the i th octile.

The graphs of the Bowley skewness and Moors kurtosis for different values of a and b, with λ=0.2,α=2 and θ=0.1, are shown in Fig. 5. The graphs show that both Bowley skewness and Moors kurtosis depend on the choice of the parameters.

Fig. 5

Bowley skewness (left) and Moors kurtosis (right) for KwTP distribution as a function of the parameters a and b

Mean deviation

The deviation from the mean, in the case of symmetric distributions, or deviation from the median, in the case of skewed distributions, can be used as a measure of spread in a population. Given that the random variable X has the KwTP distribution. Let μ=E(X) be the mean and M be the median of X. The mean deviation about the mean D(μ) and the mean deviation about the median D(M) can be expressed respectively as

$$\begin{array}{*{20}l} D(\mu)&=\int_{\theta}^{\infty}|x-\mu|f(x) dx=\int_{\theta}^{\mu}(\mu-x)f(x) dx + \int_{\mu}^{\infty}(x-\mu)f(x) dx \\ &=2\mu F(\mu)-2J(\mu), \end{array} $$


$$\begin{array}{*{20}l} D(M)&=\int_{\theta}^{\infty}|x-M|f(x) dx=\int_{\theta}^{M}(M-x)f(x) dx + \int_{M}^{\infty}(x-M)f(x) dx \\ &=\mu-2J(M), \end{array} $$

where F(.) is the CDF of the KwTP distribution, and \(J(t)=\int _{\theta }^{t}x f(x)dx.\)

We compute J(t) as follows:

$$\begin{array}{*{20}l} J(t)&=\int_{\theta}^{t} x f(x) dx \\ &=ab \alpha \theta^{\alpha}\bigg[(1-\lambda)\sum_{m=0}^{\infty}u_{m} \theta^{\alpha m} \int_{\theta}^{t}x^{-\alpha-\alpha m} dx+2\lambda \sum_{m=0}^{\infty}u_{m} \theta^{\alpha +\alpha m} \\ &\quad\times \int_{\theta}^{t} x^{-2\alpha-\alpha m} dx \bigg] \\ &=ab\alpha \bigg[(1-\lambda)\sum_{m=0}^{\infty}u_{m} \theta^{\alpha+\alpha m} \left(\frac{t^{-\alpha-\alpha m+1}}{1-\alpha-\alpha m}+\frac{\theta^{-\alpha-\alpha m+1}}{\alpha+\alpha m-1} \right) \\ &\quad+2\lambda \sum_{m=0}^{\infty}u_{m} \theta^{2\alpha +\alpha m} \left(\frac{t^{-2 \alpha-\alpha m+1}}{1-2\alpha-\alpha m}+\frac{\theta^{-2\alpha-\alpha m+1}}{2\alpha+\alpha m-1} \right) \bigg] \\ &=ab \alpha \sum_{m=0}^{\infty}u_{m} \bigg[ \frac{(1-\lambda)t}{1-\alpha-\alpha m}\left(\frac{\theta}{t}\right)^{\alpha}+ \frac{(1-\lambda)\theta}{\alpha+\alpha m-1}+\frac{2\lambda t}{1-2\alpha-\alpha m}\left(\frac{\theta}{t}\right)^{2\alpha+\alpha m} \\ &\quad+\frac{2\lambda\theta}{2 \alpha+\alpha m-1}\bigg]. \end{array} $$

Using Eq. (21), we can write appropriate expressions for J(μ) and J(M). Combining these with the expression for F(μ), it is easy to obtain the expressions for D(μ) and D(M) in Eqs. (19) and (20) respectively.


According to Ashour and Eltehiwy (2013), the reliability function, also called the survival function, is the characteristic of an explanatory variable that maps a set of events, usually associated with mortality or failure of some system onto time. In other words, the reliability function R(t) is the probability of an item not failing prior to some time t, and is defined by R(t)=1−F(t). The reliability function of the KwTP distribution is

$$\begin{array}{*{20}l} R(t&)= 1-F(t) \\ &=\bigg\{1-\bigg[1-\left(\frac{\theta}{t}\right)^{\alpha}\bigg]^{a}\bigg[1+\lambda\left(\frac{\theta}{t}\right)^{\alpha}\bigg]^{a}\bigg\}^{b}. \end{array} $$

The other characteristic of interest of a random variable is the hazard rate function h(t) which is also known as instantaneous failure rate. The hazard function of a random variable X with PDF f(x) and associated CDF F(x) is defined as

$$ h(t)=\frac{f(t)}{1-F(t)}. $$

Substituting the PDF and CDF for the KwTP distribution into the above expression, we have

$$\begin{array}{*{20}l} h(t)&=\frac{f(t)}{1-F(t)} \\ &=\frac{ab \alpha \theta^{\alpha}\left[1-\lambda+2\lambda \left(\frac{\theta}{t}\right)^{\alpha} \right]\left[1-\left(\frac{\theta}{t}\right)^{\alpha}\right]^{a-1}\left[1+\lambda \left(\frac{\theta}{t}\right)^{\alpha}\right]^{a-1}}{t^{\alpha+1}\left\{1-\left[1-\left(\frac{\theta}{t}\right)^{\alpha}\right]^{a}\left[1+\lambda \left(\frac{\theta}{t}\right)^{\alpha}\right]^{a}\right\}}. \end{array} $$

The flexibility of KwTP distribution to model reliability data is illustrated by varying shapes of the hazard rate function in Fig. 6.

Fig. 6

Hazard rate function of KwTP for selected values of α,θ,λ,a and b

The following lemma shows the limiting behavior of the hazard function:

Lemma 1

If h(t) is the hazard function of the Kumaraswamy transmuted Pareto distribution, then

$$\lim\limits_{t\rightarrow \theta} h(t) = \begin{cases} 0, & a >1 \\ \frac{ab \alpha(1+\lambda)}{\theta}, & a =1\\ \infty, & a <1, \end{cases} $$


$$\lim\limits_{t\rightarrow \infty} h(t) = 0. $$


First note that

$$\lim\limits_{t\rightarrow \theta} G(t) =\lim\limits_{t\rightarrow \theta} \bigg[1-\left(\frac{\theta}{t}\right)^{\alpha}\bigg]\bigg[1+\lambda\left(\frac{\theta}{t}\right)^{\alpha}\bigg] = 0, $$


$$\lim\limits_{t\rightarrow \theta} g(t) =\lim\limits_{t\rightarrow \theta}\frac{\alpha \theta^{\alpha}}{t^{\alpha+1}}\bigg[1-\lambda+2\lambda \left(\frac{\theta}{t}\right)^{\alpha}\bigg] = \frac{\alpha(1+\lambda)}{\theta}. $$

Then we have

$$\begin{array}{*{20}l} \lim\limits_{t\rightarrow \theta}h(t)&=\lim\limits_{t\rightarrow \theta} \frac{f(t)}{1-F(t)}=\lim\limits_{t\rightarrow \theta}\frac{ab g(t)G(t)^{a-1}}{1-G(t)^{a}} \\ &=\begin{cases} 0, & \text{if } a >1 \\ \frac{ab\alpha(1+\lambda)}{\theta}, & \text{if } a =1\\ \infty, & \text{if } a<1. \end{cases} \end{array} $$

For the second part, we already know that \(\lim \limits _{t\rightarrow \infty } G(t) =1 \), \(\lim \limits _{t\rightarrow \infty } g(t) = 0\), \(\lim \limits _{t\rightarrow \infty } F(t) = 1\), and \(\lim \limits _{t\rightarrow \infty } f(t) = 0\). Then using the L’Hospital’s rule, we have

$$\begin{array}{*{20}l} \lim\limits_{t\rightarrow \infty}h(t)&=\lim\limits_{t\rightarrow \infty}\frac{ab g(t) G(t)^{a-1}}{1-G(t)^{a}}\\ &=\lim\limits_{t\rightarrow \infty}\frac{ab g(t)}{G(t)^{1-a}-G(t)} \\ &=\lim\limits_{t\rightarrow \infty}\frac{ab g'(t) }{(1-a)G(t)^{-a}g(t)-g(t)}\\ &=\lim\limits_{t\rightarrow \infty}\frac{ab}{[(1-a)G(t)^{-a}-1]} \times \lim\limits_{t\rightarrow \infty}\frac{g'(t)}{g(t)}\\ &=-b\lim\limits_{t\rightarrow \infty}\frac{g'(t)}{g(t)}=0. \end{array} $$

Order statistics

Let X 1,X 2,……,X n be a simple random sample from KwTP (x;α,θ,λ,a,b) with CDF (6) and PDF (7). Also let X (1),X (2),……,X (n) denote the order statistics from this sample. The PDF f (i:n)(x) of the i-th order statistics is given by

$$\begin{array}{*{20}l} f_{i:n}(x)=\frac{1}{B(i, n-i+1)}[F(x)]^{i-1} [1-F(x)]^{n-i} f(x), \end{array} $$

and the CDF is given by

$$\begin{array}{*{20}l} F_{i:n}(x)&=\sum_{k=i}^{n} {n \choose k} [F(x)]^{k} [1-F(x)]^{n-k} = \int_{0}^{F(x)}\frac{1}{B(i, n-i+1)} t^{i-1}(1-t)^{n-i}dt. \end{array} $$

Using Eqs. (12), (13), (14), (15) and binomial expansions, Eq. (22) becomes

$$\begin{array}{*{20}l} f_{i:n}(x)&=\frac{1}{B(i, n-i+1)}f(x)\sum_{s=0}^{n-i}(-1)^{s}{n-i \choose s}F(x)^{i+s-1} \\ &=\frac{1}{B(i,n-i+1)}\left(\frac{ab\alpha \theta^{\alpha}}{x^{\alpha+1}}\left[1-\lambda+2\lambda \left(\frac{\theta}{x}\right)^{\alpha}\right] \sum_{l=0}^{\infty}u_{l} \left(\frac{\theta}{x}\right)^{\alpha l}\right) \\ & \quad\times \sum_{s=0}^{n-i}(-1)^{s} {n-i \choose s} \times \left(\sum_{m=0}^{\infty}\rho_{m}v^{m} \right)^{i+s-1}. \end{array} $$

From Gradshteyn and Ryzhik (2000, sec. 0.314), for any positive integer r,

$$\begin{array}{*{20}l} \bigg(\sum_{m=0}^{\infty}\rho_{m}v^{m}\bigg)^{r}=\sum_{m=0}^{\infty}d_{r,m} v^{m}, \end{array} $$

where the coefficients d r,m , for m=1,2,3,...., can be determined from the recurrence equation

$$\begin{array}{*{20}l} d_{r,m}=(m\rho_{0})^{-1}\sum_{q=1}^{m} \{q(r+1)-m\}\rho_{q} d_{r,m-q} \end{array} $$

and \(d_{r,0}=(\rho _{0}^{*})^{r}\).

We can obtain d r,m from d r,0, , d r,m−1 and, therefore, from ρ 0,ρ 1, , ρ m .

Hence (23) becomes

$$\begin{array}{*{20}l} f_{i:n}(x)&= \frac{ab\alpha \theta^{\alpha}\left(1-\lambda+2\lambda v\right) }{x^{\alpha +1}B(i,n-i+1)}\sum_{l=0}^{\infty} u_{l} v^{l} \\ & \quad\times\sum_{s=0}^{n-i}(-1)^{s} {n-i \choose s} \times \bigg(\sum_{m=0}^{\infty} d_{i+s-1, m} v^{m} \bigg) \\ &= \frac{ab\alpha \theta^{\alpha}\left(1-\lambda+2\lambda v\right)}{x^{\alpha +1}B(i,n-i+1)}\sum_{l=0}^{\infty}u_{l} v^{l}\sum_{m=0}^{\infty}c_{i}(m) v^{m} \\ &= \frac{ab\alpha \theta^{\alpha}\left(1-\lambda+2\lambda v\right)}{x^{\alpha +1}B(i,n-i+1)}\sum_{l=0}^{\infty}\sum_{m=0}^{\infty}u_{l} c_{i}(m) v^{l+m}, \end{array} $$


$$\begin{array}{*{20}l} v= \bigg(\frac{\theta}{x}\bigg)^{\alpha},~~ c_{i}(m)=\sum_{s=0}^{n-i}(-1)^{s} {n-i \choose s} d_{i+s-1,m}, \end{array} $$
$$\begin{array}{*{20}l} d_{i+s-1,m}&=(m\rho_{0})^{-1} \sum_{q=1}^{m}\big[q(i+s)-m\big]\rho_{q}d_{i+s-1,m-q} \end{array} $$

and d i+s−1,0=(ρ 0)i+s−1, where ρ 0 is defined in Eq. (16).

Parameter estimation

In this section we estimate the parameters of the Kumaraswamy transmuted Pareto distribution by the method of maximum likelihood estimation.

Let X 1,X 2,……,X n be a random sample from the Kumaraswamy transmuted Pareto distribution with observed values x 1,x 2,……,x n , and γ = (α,θ,λ,a,b)T be the vector of the model parameters. The log-likelihood function for γ can be written as

$$\begin{array}{@{}rcl@{}} l({\boldsymbol \gamma})&=& \log L(x;\alpha,\theta, \lambda, a,b) \\ &=& n \log a+n \log b + n\log \alpha + n\alpha \log \theta -(\alpha+1)\sum_{i=1}^{n}\log\left(x_{i}\right) \\ &+&\sum_{i=1}^{n}\log\left(1-\lambda+2\lambda\left(\frac{\theta}{x_{i}}\right)^{\alpha}\right) \\ &+&(a-1)\left[\sum_{i=1}^{n}\log\left(1-\left(\frac{\theta}{x_{i}}\right)^{\alpha}\right)+\sum_{i=1}^{n}\log\left(1+\lambda\left(\frac{\theta}{x_{i}}\right)^{\alpha}\right)\right] \\ &+&(b-1)\sum_{i=1}^{n}\log \Bigg\{1-\left[1-\left(\frac{\theta}{x_{i}}\right)^{\alpha}\right]^{a}\bigg[1+\lambda\left(\frac{\theta}{x_{i}}\right)^{\alpha}\bigg]^{a} \Bigg\}. \end{array} $$

Since x(θ,), the maximum likelihood estimator of θ is the first order statistic X (1). Following the normal routine of parameter estimation for the maximum likelihood estimation of α,λ,a and b, we differentiate Eq. (24) with respect to α,λ,a and b to obtain the score vector \(\left (\frac {\partial l}{\partial \alpha },\frac {\partial l}{\partial \lambda }, \frac {\partial l}{\partial a}, \frac {\partial l}{\partial b}\right)^{T}\). The elements of score vectors are

$$\begin{array}{*{20}l} \frac{\partial l}{\partial \alpha}&= \frac{n}{\alpha}+ n\log \theta -\sum_{i=1}^{n}\log\left(x_{i}\right)+ \sum_{i=1}^{n} \frac{2 \lambda \left(\frac{\theta}{x_{i}}\right)^{\alpha}.\ln\left(\frac{\theta}{x_{i}}\right)}{\left(1-\lambda+2\lambda\left(\frac{\theta}{x_{i}}\right)^{\alpha}\right)} \\ &\quad-(a-1)\left[ \sum_{i=1}^{n} \frac{\left(\frac{\theta}{x_{i}}\right)^{\alpha}.\ln\left(\frac{\theta}{x_{i}}\right) }{\left(1-\left(\frac{\theta}{x_{i}}\right)^{\alpha}\right)}- \sum_{i=1}^{n} \frac{\lambda \left(\frac{\theta}{x_{i}}\right)^{\alpha}\ln\left(\frac{\theta}{x_{i}}\right) }{\left(1+\lambda\left(\frac{\theta}{x_{i}}\right)^{\alpha}\right)} \right] \\ &\quad-a(b-1)\sum_{i=1}^{n}\frac{G^{a-1}(x_{i};\alpha,\theta,\lambda) \left(\frac{\theta}{x_{i}}\right)^{\alpha}\bigg[1-\lambda+2\lambda\left(\frac{\theta}{x_{i}}\right)^{\alpha}\bigg]\ln\left(\frac{\theta}{x_{i}} \right)}{1-G^{a}(x_{i};\alpha,\theta,\lambda)}, \end{array} $$
$$\begin{array}{*{20}l} \frac{\partial l}{\partial \lambda} &=\sum_{i=1}^{n} \frac{2\left(\frac{\theta}{x_{i}}\right)^{\alpha}-1} {1-\lambda+2\lambda\left(\frac{\theta}{x_{i}}\right)^{\alpha}}+(a-1)\sum_{i=1}^{n} \frac{\left(\frac{\theta}{x_{i}}\right)^{\alpha}} {1+\lambda\left(\frac{\theta}{x_{i}}\right)^{\alpha}} \\ &\quad-a(b-1) \sum_{i=1}^{n} \frac{\left(\frac{\theta}{x_{i}}\right)^{\alpha}\left[1-\left(\frac{\theta}{x_{i}}\right)^{\alpha} \right]^{a} \left[1+\lambda\left(\frac{\theta}{x_{i}}\right)^{\alpha}\right]^{a-1}}{1-G^{a}(x_{i};\alpha,\theta,\lambda)}, \end{array} $$
$$\begin{array}{*{20}l} \frac{\partial l}{\partial a} &=\frac{n}{a}+\left[\sum_{i=1}^{n}\log\left(1-\left(\frac{\theta}{x_{i}}\right)^{\alpha}\right)+\sum_{i=1}^{n}\log\left(1+\lambda\left(\frac{\theta} {x_{i}}\right)^{\alpha}\right)\right] \\ &\quad-(b-1)\sum_{i=1}^{n} \frac{G^{a}(x_{i};\alpha,\theta,\lambda)\times \log \{G^{a}(x_{i};\alpha,\theta,\lambda)\}}{1-G^{a}(x_{i};\alpha,\theta,\lambda)}, \end{array} $$


$$\begin{array}{@{}rcl@{}} \frac{\partial l}{\partial b}&=& \frac{n}{b}+ \sum_{i=1}^{n}\log \left\{1-G^{a}(x_{i};\alpha,\theta,\lambda) \right\}, \end{array} $$

where G(x i ;α,θ,λ) is the PDF of the transmuted Pareto distribution.

The maximum likelihood estimates \(\hat {\alpha }, \hat {\lambda }, \hat {a},\hat {b}\) of the unknown parameters α,λ,a,b respectively, can be obtained by setting Eqs. (25) - (28) equal zero and solving for the parameters. We can use numerical methods such as the quasi-Newton algorithm to numerically optimize the log-likelihood function given in Eq. (24), to get the maximum likelihood estimates of the parameters α,λ,a,b. To compute the standard error and the asymptotic confidence interval, we use the usual large sample approximation in which the maximum likelihood estimators for γ can be treated as being approximately normal. This will require the computations of the second order derivatives of Eq. (24) with respect to the vector of parameters.

This procedure will result in

$$\begin{array}{@{}rcl@{}} \left(\begin{array}{l} \hat{\alpha}\\ \hat{\lambda}\\ \hat{a} \\ \hat{b} \end{array}\right) & \sim & Normal \left[\left(\begin{array}{l} {\alpha}\\ {\lambda}\\ {a} \\ {b} \end{array}\right),\left(\begin{array}{llll} \hat{V}_{\alpha \alpha}&\hat{V}_{\alpha \lambda}&\hat{V}_{\alpha a}&\hat{V}_{\alpha b}\\ \hat{V}_{\lambda \alpha}&\hat{V}_{\lambda \lambda}&\hat{V}_{\lambda a}&\hat{V}_{\lambda b}\\ \hat{V}_{a \alpha}&\hat{V}_{a \lambda}&\hat{V}_{a a}&\hat{V}_{a b}\\ \hat{V}_{b \alpha}&\hat{V}_{b \lambda}&\hat{V}_{b a}&\hat{V}_{b b}\\ \end{array}\right)\right]\\ \end{array} $$

with \({\hat {V}_{u,v}= \left. V_{(u,v))}\right |_{(u,v)=(\hat {u},\hat {v})}}\), and the asymptotic variance-covariance matrix of the MLEs is,

$$\begin{array}{@{}rcl@{}} \left(\begin{array}{llll} \hat{V}_{\alpha \alpha}&\hat{V}_{\alpha \lambda}&\hat{V}_{\alpha a}&\hat{V}_{\alpha b}\\ \hat{V}_{\lambda \alpha}&\hat{V}_{\lambda \lambda}&\hat{V}_{\lambda a}&\hat{V}_{\lambda b}\\ \hat{V}_{a \alpha}&\hat{V}_{a \lambda}&\hat{V}_{a a}&\hat{V}_{a b}\\ \hat{V}_{b \alpha}&\hat{V}_{b \lambda}&\hat{V}_{b a}&\hat{V}_{b b}\\ \end{array}\right) = -E \left(\begin{array}{llll} V_{\alpha \alpha}&V_{\alpha \lambda}&V_{\alpha a}&V_{\alpha b}\\ {V_{\lambda \alpha}} & V_{\lambda \lambda}&V_{\lambda a}&V_{\lambda b}\\ {V_{a\alpha}}&{V_{a \lambda}} &V_{a a}&V_{a b}\\ {V_{b\alpha}}&{V_{b \lambda}} &V_{b a}&V_{b b} \end{array}\right) ^{-1} \end{array} $$

where entries are obtained from

$$V_{\alpha \alpha}=\frac{\partial^{2} l}{\partial \alpha^{2}}, ~\qquad V_{\alpha \lambda}=\frac{\partial^{2} l}{\partial \alpha \partial \lambda},\qquad V_{\alpha a}=\frac{\partial^{2} l}{\partial \alpha \partial a},\qquad V_{\alpha b}=\frac{\partial^{2} l}{\partial \alpha \partial b} $$
$$V_{\lambda \alpha}=\frac{\partial^{2} l}{\partial \lambda \partial \alpha}, \qquad V_{\lambda \lambda}=\frac{\partial^{2} l}{\partial \lambda^{2}},~\qquad V_{\lambda a}=\frac{\partial^{2} l}{\partial \lambda \partial a},\qquad V_{\lambda b}=\frac{\partial^{2} l}{\partial \lambda \partial b} $$
$$V_{a \alpha}=\frac{\partial^{2} l}{\partial a \partial \alpha}, \qquad V_{a \lambda}=\frac{\partial^{2} l}{\partial a \partial \lambda},\qquad V_{a a}=\frac{\partial^{2} l}{\partial a^{2}},~~\qquad V_{a b}=\frac{\partial^{2} l}{\partial a \partial b} $$
$$V_{b \alpha}=\frac{\partial^{2} l}{\partial b \partial \alpha}, \qquad V_{b \lambda}=\frac{\partial^{2} l}{\partial b \partial \lambda},~\qquad V_{b a}=\frac{\partial^{2} l}{\partial b \partial a},~~\qquad V_{b b}=\frac{\partial^{2} l}{ \partial b^{2} } $$

and l is the log-likelihood function given in (24). Approximate 100(1−ϕ)% two sided confidence intervals for α,λ,a and b are, respectively, given by

$$\hat{\alpha}\pm z_{\frac{\phi}{2}}\sqrt{\hat{V}_{\alpha \alpha}}~~,\qquad \hat{\lambda}\pm z_{\frac{\phi}{2}}\sqrt{\hat{V}_{\lambda \lambda}}~~,\qquad \hat{a}\pm z_{\frac{\phi}{2}}\sqrt{\hat{V}_{aa}}~~,\qquad \text{and} \qquad \hat{b}\pm z_{\frac{\phi}{2}}\sqrt{\hat{V}_{bb}}~~, $$

where z ϕ is the upper ϕ−th percentile of the standard normal distribution.

Applications of KwTP

In this section we apply the KwTP to two data sets. These are exceedances of flood peaks (in m 3/s) of the Wheaton River near Carcross in Yukon Territory, Canada, and the Norwegian fire insurance claims data.

11.1 The exceedances of flood peaks (in m 3/s) of the Wheaton River near Carcross in Yukon Territory, Canada

The data are the exceedances of flood peaks (in m 3/s) of the Wheaton River near Carcross in Yukon Territory, Canada. The data consists of 72 exceedances for the years 1958-1984. These data have been analyzed by many authors including Choulakian and Stephens (2001), Nadarajah (2005), Akinsete et al. (2008), Bourguignon et al. (2013), Merovci and Puka (2014), and Chhetri et al. (2017), among others. A summary of the descriptive statistics of the data set is given in Table 2. The analysis of these data with the KwTP is comapared with the results from few commmonly used distributions in the literature. All required computations are carried out using the AdequacyModel script of R-package (Marinho et al. (2016)). In particular we compare the KwTP with the Kumaraswamy Pareto distribution (KwP), the transmuted Pareto distribution (TP), exponentiated Pareto distribution (EP), beta transmuted Pareto distribution (BTP), beta Pareto distribution (BP), and the Pareto distribution (P). Using the method of maximum likelihood estimation, we estimate the distribution parameters. The goodness of fit measures, including the log-likelihood function (−), Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), Consistent Akaike Information Criterion (CAIC), and Hannan-Quinn Information Criteria (HQIC) are obtained for the fitted models.

Table 2 Descriptive statistics for the Wheaton river data

Table 3 lists the values the MLEs and their standard errors, whereas the values of \(-\widehat {\ell }\), AIC, CAIC, HQIC, BIC and Kolmogorov (K-S) statistic are given in Table 4. From Table 4, it is noted that the KwTP distribution has the lowest values for the \(-\widehat {\ell }\), AIC, CAIC, HQIC, BIC, and K-S statistic. One can employ the likelihood ratio statistic to test the superiority of the KwTP distribution over the other distributions. The plots comparing the KwTP distribution with other distributions are given in Figs. 7, 8 and 9. The values in Table 4, and the plots, all indicate that the KwTP distribution is more superior and fits the data more adequately than the other comparative distributions.

Fig. 7

Fitted CDF of the Wheaton river data

Fig. 8

QQ-plots of the Wheaton river data

Fig. 9

Fitted PDF of the Wheaton river data

Table 3 Estimated parameters and their standard errors for the Wheaton river data
Table 4 The AIC, CAIC, BIC, HQIC and K-S test statistic of the Wheaton river data

11.2 The Norwegian fire insurance data (in kr)

We apply the KwTP to the Norwegian fire insurance claims data for the years 1988 and 1990. Several authors have analyzed these data. See for example, Mdziniso and Cooray (2017) and related references in the paper. The 1988 Norwegian fire claims data consist of 827 fire insurance losses in thousand Norwegian krones, ranging from 500 to 465,365 thousand Norwegian krones. The 1990 Norwegian fire data consist of 628 fire insurance losses in thousand Norwegian krones, ranging from 500 to 78,537 thousand Norwegian krones. Data for both years are highly positively skewed. Using these data, Mdziniso and Cooray (2017) compared the performances of the Pareto (P), the 3-paramater generalized Pareto (GP), the odd-Pareto (OP) and its extension (OP ), the 3-parameter Burr, the exponentiated Pareto (EP), the exponentiated odd Pareto (EOP) and the odd generalized Pareto (OGP) distributions.

We provide in Table 5, the parameter estimation of the insurance data for both 1988 and 1990. Comparing the performance of KwTP with the distributions listed in Table 4 of Mdziniso and Cooray (2017) for the 1988 data, we see from the values of the AIC, A-D, and K-S, that KwTP displays superiority over the P, GP, OP, OP , Burr, and EP distributions. It also performs better than the OGP judging from their K-S values. The same pattern of performances of KwTP and the distributions listed in Table 5 of the paper are observed in the case for the 1990 data. With the exception of the exponentiated odd Pareto (EOP) distribution, the KwTP is a better fit for these sets of data.

Table 5 Norwegian fire insurance claims: Estimated values for KwTP

Simulation study

In this section we perform some simulation study to assess the reliability of the MLEs. An ideal technique for simulating random numbers from the KwTP distribution is the inversion method. For fixed selected combinations of α,λ,a and b in Eq. (17), we generate samples of sizes n=5,10,20,50,100,200,500 and 1000 from the KwTP distribution. We repeated the simulations N=1,000 times and calculated the mean estimates, the root mean square errors (RMSEs) and the mean absolute errors (MAEs). The empirical results obtained using R are given in Table 6. Observe that RMSEs decay as the sample size increases, while the maximum likelihood estimates get closer to their true values.

Table 6 Simulation from KwTP of the MLE for α=2,λ=0.1,a=3, and b=2.5


We have proposed in this article, a new distribution that is being referred to as the Kumaraswamy transmuted Pareto (KwTP). The transmuted Pareto distribution is used as a baseline distribution in the Kumaraswamy distribution to construct the KwTP distribution. Many mathematical and statistical properties and special cases of the KwTP are obtained. The estimation of the model parameters is performed by the maximum likelihood method. We compare the distribution with few other distributions in modeling two real datasets. Various statistics indicate that KwTP better fits the Wheaton river data set than other comparative distributions, and majority of its competitive distributions for the Norwegian insurance claims data. We conducted a simulation study to assess the performance of the maximum likelihood estimation procedure for estimating the parameters of the KwTP distribution. It is expected that the KwTP distribution will serve as a better alternative in modeling data sets exhibiting the extreme value properties.


  1. Afify, AZ, Cordeiro, GM, Yousof, HM, Alzaatreh, A, Nofal, ZM: The Kumaraswamy transmuted-g family of distributions: Properties and applications. J. Data Sci. 14, 245–270 (2016).

    Google Scholar 

  2. Afify, AZ, Nofal, ZM, Butt, NS: Transmuted complementary Weibull geometric distribution. Pak. J. Stat. Oper. Res.4, 435–454 (2014).

    MathSciNet  Article  Google Scholar 

  3. Akinsete, A, Famoye, F, Lee, C: The beta-Pareto distributions. Statistics. 42(6), 547–563 (2008).

    MathSciNet  Article  MATH  Google Scholar 

  4. Akinsete, AA, Famoye, F, Lee, C: The Kumaraswamy - geometric distribution. J. Stat. Distrib. Appl. 1, 1–21 (2014).

    Article  MATH  Google Scholar 

  5. Aryal, GR, Tsokos, CP: On the transmuted extreme value distribution with application. Nonlinear Anal. Theory Methods and Appl. 7, 1401–1407 (2009).

    MathSciNet  Article  MATH  Google Scholar 

  6. Aryal, GR, Tsokos, CP: Transmuted Weibull distribution: A generalization of the Weibull probability distribution. Eur. J. Pur. Appl. Math. 2(2), 89–102 (2011).

    MathSciNet  MATH  Google Scholar 

  7. Ashour, SK, Eltehiwy, MA: Transmuted Lomax distribution. Am. J. Appl. Math. Stat. 1(6), 121–127 (2013).

    Article  Google Scholar 

  8. Bourguignon, MB, Silva, RB, Zea, LM, Cordeiro, GM: The Kumaraswamy Pareto distribution. J. Stat. Theory Appl. 12(2), 129–144 (2013).

    MathSciNet  Article  Google Scholar 

  9. Bowley, AL: Elements of Statistics. 4th edn. Charles Scribner, New York, NY (1920).

    Google Scholar 

  10. Chhetri, S, Long, H, Aryal, G: The beta transmuted Pareto distribution: Theory and applications. J. Stat. Appl. Prob. 6(2), 243–258 (2017).

    Article  Google Scholar 

  11. Choulakian, V, Stephens, MA: Goodness-of-fit for the generalized Pareto distribution. Technometrics. 43, 478–484 (2001).

    MathSciNet  Article  Google Scholar 

  12. Cordeiro, GM, de Castro, M: A new family of generalized distributions. J. Stat. Comput. Simul. 81, 883–898 (2011).

    MathSciNet  Article  MATH  Google Scholar 

  13. Cordeiro, GM, Ortega, EMM, Nadarajah, S: The Kumaraswamy Weibull distribution with application to failure data. J. Frankl. Inst. 347, 1399–1429 (2010).

    MathSciNet  Article  MATH  Google Scholar 

  14. de Pascoa, MAR, Ortega, EMM, Cordeiro, GM: The Kumaraswamy generalized gamma distribution with application in survival analysis. Stat Methodol. 8, 411–433 (2011).

    MathSciNet  Article  MATH  Google Scholar 

  15. Elbatal, I: The Kumaraswamy exponentiated Pareto distribution. Econ. Qual. Control. 28(1), 1–8 (2013).

    MathSciNet  Article  MATH  Google Scholar 

  16. Eugene, N, Lee, C, Famoye, F: The beta-normal distribution and its applications. Commun. Stat. Theory Methods. 31, 497–512 (2002).

    MathSciNet  Article  MATH  Google Scholar 

  17. Gradshteyn, IS, Ryzhik, IM: Table of Integrals, Series, and Products. 3rd edn. Academic Press, New York (2000).

    Google Scholar 

  18. Hady, AE, Ebraheim, N: Exponentiated transmuted Weibull distribution: A generalization of the Weibull distribution. Int. Sch. Sci. Res. Innov. 8(6), 903–911 (2014).

    Google Scholar 

  19. Hussian, MA: Transmuted exponentiated gamma distribution: A generalization of the exponentiated gamma probability distribution. Appl. Math. Sci. 8(27), 1297–1310 (2014).

    MathSciNet  Google Scholar 

  20. Jones, MC: Kumaraswamy’s distribution: A beta-type distribution with some tractability advantages. Stat Methodol. 6(1), 70–81 (2009).

    MathSciNet  Article  MATH  Google Scholar 

  21. Kenny, JF, Keeping, ES: Mathematics of Statistics. 3rd edn. D. Van Nostrand Co., New Jersey (1962).

    Google Scholar 

  22. Khan, MS, King, R: Transmuted modified Weibull distribution: A generalization of the modified Weibull probability distribution. Eur. J. Pur. Appl. Math. 6(1), 66–88 (2013).

    MathSciNet  MATH  Google Scholar 

  23. Khan, MS, King, R, Hudson, IL: Transmuted Kumaraswamy distribution. Stat. Transit. 17(2), 183–210 (2016).

    Google Scholar 

  24. Kumaraswamy, P: A generalized probability density function for double-bounded random processes. Hydrology. 46, 79–88 (1980).

    Article  Google Scholar 

  25. Marinho, PRD, Bourguignon, M, Dias, CRB: R Package - Adequacymodel (2016).

  26. Mdziniso, NC, Cooray, K: Odd Pareto families of distributions for modelling loss payment data. Scand. Actuar. J.1–22 (2017).

  27. Merovci, F, Puka, L: Transmuted Pareto distribution. ProbStat Forum. 07, 1–11 (2014).

    MathSciNet  MATH  Google Scholar 

  28. Moors, JJA: A quantile alternative for kurtosis. J. R. Stat. Series D Stat. 37(1), 25–32 (1988).

    Google Scholar 

  29. Nadarajah, S: Exponentiated Pareto distribution. Stat. J. Theor. Appl. Stat. 39(3), 255–260 (2005).

    MathSciNet  MATH  Google Scholar 

  30. Owokolo, EA, Oguntade, PE, Adejumo, AO: Performance rating of the transmuted exponential distribution: an analytical approach. SpringerPlus. 4, 818 (2015). doi:10.1186/s40064-015-1590-6.

    Article  Google Scholar 

  31. Saulo, H: The Kumaraswamy Birnbaum-Saunders distribution. J. Stat. Theory Pract. 6(4), 745–759 (2012).

    MathSciNet  Article  Google Scholar 

  32. Shams, TM: The Kumaraswamy-generalized Lomax distribution. Middle East J. Sci. Res. 17(5), 641–646 (2013).

    Google Scholar 

  33. Shaw, WT, Buckley, IRC: The alchemy of probability distributions: beyond Gram-Charlier & Cornish-Fisher expansions, and skew-normal or kurtotic-normal distributions. Conference on Computational Finance, IMA, 6670 (2007). Research Report.

  34. Shaw, WT, Buckley, IRC: The alchemy of probability distributions: beyond Gram-Charlier expansions, and a skew-kurtotic-normal distribution from a rank transmutation map. Conference on Computational Finance, IMA, 0901–0434 (2009). Research Report.

  35. Tahir, MH, Cordeiro, GM: Compounding of distributions: a survey and new generalized classes. J. Stat. Distrib. Appl. 3, 183–210 (2016).

    Article  MATH  Google Scholar 

  36. Yousof, HM, Afify, AZ, Alizadeh, M, Butt, NS, Hamedani, GG, Ali, MM: The transmuted exponentiated generalized-g family of distributions. Pak. J. Stat. Oper. Res.4, 441–464 (2015).

    MathSciNet  Article  Google Scholar 

Download references


The authors would like to thank the Editor and the anonymous reviewers for their valuable comments and suggestions which have greatly improved the quality of the paper.

Author information




The authors, SBC, AAA, GA and HL with the consultation of each other carried out this work and drafted the manuscript together. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Alfred A. Akinsete.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Chhetri, S., Akinsete, A., Aryal, G. et al. The Kumaraswamy transmuted Pareto distribution. J Stat Distrib App 4, 11 (2017).

Download citation


  • Kumaraswamy distribution
  • Quadratic rank transmutation map (QRTM)
  • Pareto distribution
  • Hazard function
  • Maximum likelihood estimation

AMS Subject Classification

  • 60E05
  • 62E15
  • 62H12