Extended Conway-Maxwell-Poisson distribution and its properties and applications

Chakraborty, Subrata; Imoto, Tomoaki

doi:10.1186/s40488-016-0044-1

Research
Open access
Published: 25 February 2016


Journal of Statistical Distributions and Applications volume 3, Article number: 5 (2016)


Abstract

A new four parameter extended Conway-Maxwell-Poisson (ECOMP) distribution is proposed, which unifies the recently proposed COM-Poisson type negative binomial (COM-NB) distribution [Chakraborty, S. and Ong, S. H. (2014): A COM-type Generalization of the Negative Binomial Distribution, Accepted in Communications in Statistics-Theory and Methods] and the generalized COM-Poisson (GCOMP) distribution [Imoto, T. (2014): A generalized Conway-Maxwell-Poisson distribution which includes the negative binomial distribution, Applied Mathematics and Computation, 247, 824–834]. The additional parameter allows this distribution to have a longer (shorter) tail compared to COM-NB and GCOMP. The proposed distribution can be formulated as an exponential combination of the negative binomial and COM-Poisson distributions, arises from a queuing system with state dependent arrival and service rates, and belongs to the exponential family when one of the parameters is treated as a nuisance parameter. Important distributional, reliability and stochastic ordering properties, along with asymptotic approximations for the normalizing constant and the mean of this distribution, are investigated. A method of parameter estimation and three comparative data fitting applications are also discussed.

1 Introduction

Recently, two new generalizations of the well known COM-Poisson distribution (Conway and Maxwell 1962) were proposed: one by Chakraborty and Ong (2014), known as the COM-negative binomial distribution, and the other by Imoto (2014), referred to as the generalized COM-Poisson distribution. In this section we briefly introduce these two distributions along with a hypergeometric type series which is used in the sequel.

COM-Poisson type negative binomial distribution: Chakraborty and Ong (2014) proposed a new COM-Poisson type generalization of the negative binomial distribution that includes, among others, the COM-Poisson and negative binomial (pages 208–250, Chapter 5, Johnson et al. 2005) distributions as particular cases and the Bernoulli (page 108, Chapter 3, Johnson et al. 2005) and COM-Poisson distributions as limiting cases. This distribution is log-concave and flexible enough to model under-, equi- and over-dispersed count data.

A random variable (rv) X is said to follow the COM-Poisson type negative binomial distribution with parameters (ν, p, α) [COM-NB(ν, p, α)] if its pmf is given by

$$ P\left(X=k\right)={\left(\nu \right)}_k\kern0.24em {p}^k/\left\{{\left(k!\right)}^{\alpha }{}_1H_{\alpha -1}\left(\nu;\;1;p\right)\right\},\kern0.24em k=0,\;1,\;2,\cdots $$

(1)

$$ \mathrm{Where}\kern2em {}_1H_{\alpha -1}\left(\nu;\;1;p\right)={\displaystyle \sum_{k=0}^{\infty }{\left(\nu \right)}_k\kern0.24em {p}^k/\;{\left(k\;!\right)}^{\alpha }} $$

(2)

The distribution is defined in the parameter space

$$ {\Theta}_{COM-NB}=\left\{\nu >0,\;p>0,\;\alpha >1\right\}\cup \left\{\nu >0,\;0<p<1,\;\alpha =1\right\}. $$

When α is a positive integer, $ {}_1H_{\alpha -1}\left(\nu;\;1;p\right) $ can be expressed as a particular case of the generalized hypergeometric series $ {}_mF_n\left({a}_1,{a}_2,\cdots, {a}_m;{b}_1,{b}_2,\cdots, {b}_n;z\right)={\displaystyle \sum_{k=0}^{\infty}\frac{{\left({a}_1\right)}_k{\left({a}_2\right)}_k\cdots {\left({a}_m\right)}_k}{\;{\left({b}_1\right)}_k{\left({b}_2\right)}_k\cdots {\left({b}_n\right)}_k}\;\frac{z^k}{k\;!}} $, namely as $ {}_1F_{\alpha -1}\left(\nu;\;1,\;1,\cdots,\;1;p\right) $.

Generalized COM-Poisson distribution: Imoto (2014) proposed another generalization where an rv X is said to follow the GCOM-Poisson distribution with parameters (v, p, β) that is GCOMP (v, p, β) if its pmf is given by

$$ P\left(X=k\right)=\frac{{\left\{\Gamma \left(\nu +k\right)\right\}}^{\beta }}{C\left(\beta, \nu, p\right)}\frac{p^k}{k!} $$

(3)

$$ \mathrm{Where}\kern2.5em C\left(\beta, \nu, p\right)={\displaystyle \sum_{k=0}^{\infty}\frac{{\left\{\Gamma \left(\nu +k\right)\right\}}^{\beta }}{k!}{p}^k} $$

(4)

The distribution is defined in the parameter space

$$ {\Theta}_{GCOMP}=\left\{\nu >0,\;p>0,\;\beta <1\right\}\cup \left\{\nu >0,\;0<p<1,\;\beta =1\right\}. $$

A hypergeometric type series: We introduce the series

$$ {}_mS_{\alpha}^{\beta}\left({a}_1,{a}_2,\cdots, {a}_m;b;p\right)={\displaystyle \sum_{k=0}^{\infty}\frac{{\left\{{\left({a}_1\right)}_k\right\}}^{\beta }{\left({a}_2\right)}_k\cdots {\left({a}_m\right)}_k}{\;{\left\{{(b)}_k\right\}}^{\alpha }}\;\frac{p^k}{k\;!}}\kern0.24em , $$

where (a)_k = a(a + 1) ⋯ (a + k − 1) = Γ(a + k)/Γ(a) is Pochhammer’s notation (see Johnson et al. 2005, chapter 1, page 2). The series converges (i) for any finite p if β + m − 2 < α, or (ii) for |p| < 1 if β + m − 2 = α. For α, β and m all positive integers, it reduces to a particular case of the generalized hypergeometric function $ {}_{\beta +m-1}F_{\alpha}\left({a}_1,{a}_1,\cdots, {a}_1,{a}_2,\cdots, {a}_m;b,b,\cdots, b;p\right) $. With this notation we have

$$ {\displaystyle \sum_{k=0}^{\infty }{\left\{{\left(\nu \right)}_k\right\}}^{\beta}\kern0.24em {p}^k/\;{\left(k\;!\right)}^{\alpha }}={}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right)={}_{\beta }F_{\alpha -1}\left(\nu;\;1,\;1,\cdots,\;1;p\right) $$

(5)

Some important special cases of $ {}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right) $ are

i. $ {}_1S_{\alpha -1}^1\left(\nu;\;1;p\right)={}_1H_{\alpha -1}\left(\nu;\;1;p\right) $ [Chakraborty and Ong, 2014]
ii. $ {}_1S_0^{\beta}\left(\nu;\;1;p\right)=C\left(\beta,\;\nu,\;p\right)/{\left(\Gamma \nu \right)}^{\beta } $ [Imoto 2014]
iii. $ {}_1S_0^1\left(\nu;\;1;p\right)={\left(1-p\right)}^{-\nu } $ [geometric series]
iv. $ {}_1S_{\alpha -1}^{\beta}\left(1;\;1;p\right)=Z\left(p,\alpha -\beta \right) $ [Conway and Maxwell 1962]
v. $ {}_1S_{\gamma}^{\gamma}\left(1;\;1;p\right)= \exp (p) $

Some important limiting cases of $ {}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right) $ are
vi. $ \underset{\alpha \to \infty }{ \lim}\kern0.24em {}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right)=1+{\nu}^{\beta }p $.
vii. $ \underset{\nu \to \infty }{ \lim}\kern0.24em {}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right)={\displaystyle \sum_{k=0}^{\infty }{\lambda}^k/{\left(k!\right)}^{\alpha }}=Z\left(\lambda, \alpha \right) $, where ν^β p = λ is held fixed at a finite positive value.
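The special and limiting cases of the series can be checked numerically. The following is a minimal Python sketch, assuming a plain truncation of the series at a fixed number of terms; the function names `log_term` and `series_s` and the cut-off `terms` are illustrative choices and are not part of the original article.

```python
import math

def log_term(k, nu, p, alpha, beta):
    # log of {(nu)_k}^beta * p^k / (k!)^alpha, via log-gamma for numerical stability
    log_pochhammer = math.lgamma(nu + k) - math.lgamma(nu)
    return beta * log_pochhammer + k * math.log(p) - alpha * math.lgamma(k + 1)

def series_s(nu, p, alpha, beta, terms=500):
    # Truncated value of 1S^beta_{alpha-1}(nu; 1; p); `terms` is an arbitrary cut-off.
    return sum(math.exp(log_term(k, nu, p, alpha, beta)) for k in range(terms))

binomial_case = series_s(2.5, 0.3, 1.0, 1.0)       # case iii: (1 - p)^(-nu)
exponential_case = series_s(1.0, 2.0, 3.0, 2.0)    # case v: exp(p)
large_alpha_case = series_s(2.0, 0.5, 200.0, 2.0)  # case vi: 1 + nu^beta * p
```

Working with logarithms of the terms avoids overflow of the factorials; terms that underflow simply contribute zero to the sum.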

In the present article we propose a natural four parameter extension of the COM-Poisson distribution which includes the recently introduced COM-NB and GCOM-Poisson distributions as special cases. This new distribution with additional parameters is more flexible in terms of tail length and dispersion index. The definition of the proposed distribution along with some of its important distributional properties is presented in Section 2. Reliability and stochastic ordering results are discussed in Section 3. In Section 4 we present applications of the proposed distribution by considering three real life data sets. Concluding remarks are provided in Section 5, which is followed by an appendix containing the proofs of the results and propositions in the article.

2 Extended COM-Poisson (ECOMP) distribution

Here we introduce a new distribution that unifies both the COM-NB and GCOMP distributions.

Definition 1. An rv X is said to follow the extended COM-Poisson distribution with parameters (v, p, α, β) [ECOMP (v, p, α, β)] iff its pmf is given by

$$ P\left(X=k\right)=\frac{{\left\{{\left(\nu \right)}_k\right\}}^{\beta }}{{}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right)\;}\frac{p^k}{{\left(k!\right)}^{\alpha }}=\frac{{\left\{\Gamma \left(\nu +k\right)\right\}}^{\beta }}{{\left(\Gamma \nu \right)}^{\beta }{}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right)}\frac{p^k}{{\left(k!\right)}^{\alpha }} $$

(6)

The distribution is defined in the parameter space

$$ {\Theta}_{E-COM}=\left\{\nu \ge 0,\;p>0,\alpha >\beta \right\}\cup \left\{\nu >0,\;0<p<1,\;\alpha =\beta \right\}. $$

It may be noted that unlike in the COM-NB distribution where the parameter α ≥ 1 and in the GCOMP distribution where the parameter β ≤ 1, in the ECOMP distribution these two parameters can be either positive or negative with the restriction of α ≥ β.

Particular cases: The ECOMP (ν, p, α, β) distribution reduces to COM-NB (ν, p, α) for β = 1, to GCOMP (ν, p, β) for α = 1, to COMP (p, α − β) for ν = 1, to COMP (p, α) for β = 0, to Poisson (p) for ν = 1, α = β + 1, also to Poisson (p) for β = 0, α = 1, to NB (ν, p) for α = β = 1 and to a new generalization of NB(NGNB) distribution when α = β = γ with pmf

$$ P\left(X=k\right)={\left(\begin{array}{l}\nu +k-1\\ {}\kern0.96em k\end{array}\right)}^{\gamma }{p}^k/{}_1S_{\gamma -1}^{\gamma}\left(\nu;\;1;p\right) $$

(7)

For 0 < ν ≤ 1, the distribution in (7) is log-convex as will be seen in proposition 4 in the Section 2.7.
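Two of the particular cases listed above can be verified numerically. The sketch below, written under the assumption that truncating the support at a few hundred points is accurate enough for the chosen parameter values, evaluates pmf (6) and compares it with the negative binomial and Poisson pmfs. The helper name `ecomp_pmf` and all parameter values are hypothetical.

```python
import math

def ecomp_pmf(nu, p, alpha, beta, terms=400):
    # Log-weights of pmf (6), normalised over a truncated support of `terms` points.
    logw = [beta * (math.lgamma(nu + k) - math.lgamma(nu))
            + k * math.log(p) - alpha * math.lgamma(k + 1)
            for k in range(terms)]
    top = max(logw)
    w = [math.exp(v - top) for v in logw]
    total = sum(w)
    return [v / total for v in w]

# alpha = beta = 1 should give NB(nu, p): C(nu + k - 1, k) p^k (1 - p)^nu
nu, p = 2.5, 0.3
nb = [math.exp(math.lgamma(nu + k) - math.lgamma(nu) - math.lgamma(k + 1)
               + k * math.log(p) + nu * math.log(1 - p)) for k in range(10)]
ecomp_as_nb = ecomp_pmf(nu, p, 1.0, 1.0)[:10]

# nu = 1 and alpha = beta + 1 should give Poisson(p)
lam = 2.0
poisson = [math.exp(-lam + k * math.log(lam) - math.lgamma(k + 1)) for k in range(10)]
ecomp_as_poisson = ecomp_pmf(1.0, lam, 2.7, 1.7)[:10]
```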

2.1 Shape of the pmf

The plots of the pmf of the ECOMP(ν, p, α, β) distribution for different values of the parameters in Fig. 1 show that the distribution is very flexible: it can be non-increasing with mode at zero, have a unique non-zero mode, have two modes, or be bimodal with one mode always at zero.

2.2 Approximations of the normalizing constant

2.2.1 Approximation using truncation of the series

The normalizing constant $ {}_1S_{\alpha -1}^{\beta}\left(\nu; 1;p\right) $ of the ECOMP(ν, p, α, β) distribution has no closed form and involves an infinite series. Therefore, we need approximations of this constant to compute the pmf and moments of the distribution numerically.

A simple approximation is to truncate the series, that is

$$ {}_1S_{\alpha -1,m}^{\beta}\left(\nu;\;1;p\right)={\displaystyle \sum_{k=0}^m\frac{{\left\{{\left(\nu \right)}_k\right\}}^{\beta }}{{\left(k\;!\right)}^{\alpha }}\kern0.24em {p}^k,} $$

(8)

where m is an integer chosen such that ε_m = (ν + m + 1)^β p/m^α < 1. The relative truncation error is then given by the expression R_m(ν, p, α, β) $ =\left\{{}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right)-{}_1S_{\alpha -1,m}^{\beta}\left(\nu;\;1;p\right)\;\right\}/{}_1S_{\alpha -1,m}^{\beta}\left(\nu;\;1;p\right)\;. $ The relative error of the pmf is then given by {P_m(k) − P(k)}/P(k), where P(k) is given by the right hand side (r.h.s.) of equation (6) in Section 2 and P_m(k) is given by the r.h.s. of (6) with $ {}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right) $ replaced by $ {}_1S_{\alpha -1,m}^{\beta}\left(\nu;\;1;p\right) $. The upper bound of the relative truncation error is then found to be

$$ {R}_m\left(\nu,\;p,\alpha, \beta \right)<\frac{{\left\{{\left(\nu \right)}_{m+1}\right\}}^{\beta }{p}^{m+1}}{{\left\{\left(m+1\right)!\right\}}^{\alpha }{}_1S_{\alpha -1,m}^{\beta}\left(\nu;\;1;p\right)}{\displaystyle \sum_{k=0}^{\infty }{\varepsilon}_m^k}=\frac{{\left\{{\left(\nu \right)}_{m+1}\right\}}^{\beta }{p}^{m+1}}{\left(1-{\varepsilon}_m\right){\left\{\left(m+1\right)!\right\}}^{\alpha }{}_1S_{\alpha -1,m}^{\beta}\left(\nu;\;1;p\right)} $$

For α − β ≥ 1, this truncated approximation works well because ε_m = O(1/m), so the truncation point m need not be large. However, for 0 < α − β < 1 and p > 1, the truncation point becomes too large for the approximation to be computed. For example, when ν = 1.5, p = 3, α = 3.1, β = 3, m has to be over 50,000, which is not practicable. To avoid this difficulty it is useful to restrict the parameter p such that p < 1 when α − β → 0. For example, with the restriction p < 10^{α − β}, we see the relative truncation error R₅₀(1.5, 3, 3.1, 3) < 0.001.
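A geometric-series bound of this kind can be sketched in Python as follows. This is an illustration under stated assumptions rather than the authors' implementation: the geometric ratio is taken to be the ratio of consecutive terms at k = m + 1, which dominates all later ratios only when those ratios decrease in k (as they do for the parameter values used here), and a much longer partial sum stands in for the full series. All function names are hypothetical.

```python
import math

def term(k, nu, p, alpha, beta):
    # k-th term {(nu)_k}^beta p^k / (k!)^alpha of the normalizing series
    return math.exp(beta * (math.lgamma(nu + k) - math.lgamma(nu))
                    + k * math.log(p) - alpha * math.lgamma(k + 1))

def truncated_sum(m, nu, p, alpha, beta):
    # Partial sum (8) with terms k = 0, ..., m
    return sum(term(k, nu, p, alpha, beta) for k in range(m + 1))

def relative_error_bound(m, nu, p, alpha, beta):
    # Assumption: geometric ratio = term ratio at k = m + 1 (valid when ratios decrease in k)
    eps = p * (nu + m + 1) ** beta / (m + 2) ** alpha
    if eps >= 1:
        raise ValueError("m is too small: geometric ratio is not below 1")
    first_dropped = term(m + 1, nu, p, alpha, beta)
    return first_dropped / ((1 - eps) * truncated_sum(m, nu, p, alpha, beta))

nu, p, alpha, beta, m = 2.0, 2.0, 3.0, 2.0, 10
bound = relative_error_bound(m, nu, p, alpha, beta)
reference = truncated_sum(200, nu, p, alpha, beta)  # stand-in for the full series
s_m = truncated_sum(m, nu, p, alpha, beta)
actual = (reference - s_m) / s_m
```

For these values α − β = 1, so a truncation point as small as m = 10 already gives a small relative error, in line with the remark that the truncation works well when α − β ≥ 1.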

2.2.2 Asymptotic approximation of the normalizing constant using the Laplace’s method

It is also useful to consider an asymptotic approximation formula of the normalizing constant $ {}_1S_{\alpha -1}^{\beta}\left(\nu, 1,p\right) $. The approximation formula by the Laplace’s method (Bleistein and Handelsman 1986, Ch 8.3, pages 331–340) is given by

$$ {}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right)\approx \frac{p^{\left\{1-\alpha +\left(2\nu -1\right)\beta \right\}/\left\{2\left(\alpha -\beta \right)\right\}} \exp \left\{\left(\alpha -\beta \right){p}^{1/\left(\alpha -\beta \right)}\right\}}{{\left(2\pi \right)}^{\left(\alpha -\beta -1\right)/2}\sqrt{\alpha -\beta }{\left\{\Gamma \left(\nu \right)\right\}}^{\beta }} $$

(9)

This formula reduces to the asymptotic formula by Minka et al. (2003) when ν = 1 or β = 0 and that by Imoto (2014) when α = 1. The proof and numerical investigation about the formula (9) are given in Appendix A.1.
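The quality of approximation (9) can be examined against a long truncated sum. The Python sketch below is illustrative only; it assumes that 2000 terms are enough for the chosen parameter values, and the function names are not from the article. The second test case uses ν = 2, α = 2, β = 1, for which the series equals (1 + p)e^p exactly while (9) gives p e^p, so the relative error is 1/(1 + p).

```python
import math

def log_series(nu, p, alpha, beta, terms=2000):
    # Logarithm of the truncated normalizing constant, computed by log-sum-exp
    logs = [beta * (math.lgamma(nu + k) - math.lgamma(nu))
            + k * math.log(p) - alpha * math.lgamma(k + 1) for k in range(terms)]
    top = max(logs)
    return top + math.log(sum(math.exp(v - top) for v in logs))

def log_laplace(nu, p, alpha, beta):
    # Logarithm of the right-hand side of (9)
    d = alpha - beta
    exponent = (1 - alpha + (2 * nu - 1) * beta) / (2 * d)
    return (exponent * math.log(p) + d * p ** (1 / d)
            - 0.5 * (d - 1) * math.log(2 * math.pi)
            - 0.5 * math.log(d) - beta * math.lgamma(nu))

def relative_error(nu, p, alpha, beta):
    return abs(math.exp(log_laplace(nu, p, alpha, beta)
                        - log_series(nu, p, alpha, beta)) - 1)

err_comp = relative_error(1.0, 50.0, 2.0, 0.0)  # COM-Poisson case (beta = 0)
err_nu2 = relative_error(2.0, 50.0, 2.0, 1.0)   # exact series is (1 + p) e^p
```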

2.3 Recurrence relation for probabilities

The ECOMP (ν, p, α, β) pmf has a simple recurrence relation given by

$$ \frac{P\left(X=k+1\right)}{P\left(X=k\right)}=\frac{p{\left(\nu +k\right)}^{\beta }}{{\left(k+1\right)}^{\alpha }}\Rightarrow {\left(k+1\right)}^{\alpha }P\left(X=k+1\right)=p{\left(\nu +k\right)}^{\beta }P\left(X=k\right) $$

(10)

with $ P\left(X=0\right)={\left[{}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right)\;\right]}^{-1} $. This is useful for the computation of the probabilities. Further, comparing the ratio in (10) with the corresponding ratios of the other two distributions, we can see that the ECOMP(ν, p, α, β) distribution has a longer (shorter) tail than the COM-NB(ν, p, α) distribution for β > (<) 1 and a longer (shorter) tail than the GCOMP(ν, p, β) distribution for α < (>) 1.
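Recurrence (10) translates directly into code. The following Python sketch builds unnormalised weights starting from 1, sums them to obtain a truncated normalizing constant, and divides; the truncation point `kmax` and the function name are assumptions made for illustration.

```python
def ecomp_pmf_recurrence(nu, p, alpha, beta, kmax):
    # Unnormalised weights from (10): w[k+1] = w[k] * p * (nu + k)^beta / (k + 1)^alpha
    weights = [1.0]
    for k in range(kmax):
        weights.append(weights[-1] * p * (nu + k) ** beta / (k + 1) ** alpha)
    total = sum(weights)  # truncated normalizing constant, so P(X = 0) = 1 / total
    return [w / total for w in weights]

probs = ecomp_pmf_recurrence(2.0, 2.0, 3.0, 2.0, kmax=100)
ratio_3_to_4 = probs[4] / probs[3]   # should equal p (nu + 3)^beta / 4^alpha
mode = probs.index(max(probs))
```

The recurrence avoids evaluating gamma functions altogether, which makes it convenient when the whole pmf up to some point is needed.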

2.4 Exponential family

The pmf in (6) can also be expressed as

$$ P\left(X=k\right)= \exp \left[\beta\;\log \Gamma \left(\nu +k\right)-\beta\;\log \Gamma \left(\nu \right)-\alpha\;\log k!+k\; \log p- \log {}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right)\;\right] $$

(11)

which immediately implies that the ECOMP (ν, p, α, β) distribution belongs to the exponential family with parameters (log p, α, β) when ν is a nuisance parameter or when its value is given.

2.5 Index of dispersion

The pmf of the ECOMP (ν, p, α, β) distribution in (6) can be seen as a weighted Poisson (p) distribution with weight function $ w(x)={\left\{\Gamma \left(\nu +x\right)\right\}}^{\beta }/{\left\{\Gamma \left(1+x\right)\right\}}^{\alpha -1} $. As such it is over (under) dispersed if w(x) is log-convex (log-concave), that is, if $ \frac{d^2}{d{x}^2} \log \left[w(x)\right]\ge \left(\le \right)\;0 $ [see theorem 4 of Kokonendji et al. 2008]

$$ \begin{array}{l}\Rightarrow \beta \frac{d^2}{d{x}^2} \log \Gamma \left(\nu +x\right)+\left(1-\alpha \right)\frac{d^2}{d{x}^2} \log \Gamma \left(1+x\right)\ge \left(\le \right)\;0\\ {}\Rightarrow \beta {\displaystyle \sum_{k\ge 0}\frac{1}{{\left(\nu +x+k\right)}^2}}-\left(\alpha -1\right){\displaystyle \sum_{k\ge 0}\frac{1}{{\left(x+1+k\right)}^2}}\ge \left(\le \right)\;0\end{array} $$

[On using result 6.4.10 page 260 from Abramowitz and Stegun, 1970].

Hence, ECOMP (ν, p, α, β) is over dispersed (i) if α < 1, β ≥ 0 for all v (ii) if {α ≥ 1, β > 0} or {α < 1, β < 0} when {0 < ν ≤ 1, β ≤ α ≤ β + 1} or {ν > 1, α ≤ 1} and under dispersed (i) if α ≥ 1, β < 0 for all v (ii) for {α ≥ 1, β > 0} or {α < 1, β < 0} if {0 < ν ≤ 1, α ≥ β + 1} or {ν > 1, α ≥ 1}.
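The two unconditional cases (i) can be illustrated by computing the variance-to-mean ratio numerically. The Python sketch below assumes that a truncated support of 1000 points is adequate for the chosen, purely hypothetical, parameter values.

```python
import math

def dispersion_index(nu, p, alpha, beta, terms=1000):
    # Variance-to-mean ratio of ECOMP(nu, p, alpha, beta) on a truncated support
    logw = [beta * (math.lgamma(nu + k) - math.lgamma(nu))
            + k * math.log(p) - alpha * math.lgamma(k + 1) for k in range(terms)]
    top = max(logw)
    w = [math.exp(v - top) for v in logw]
    total = sum(w)
    mean = sum(k * v for k, v in enumerate(w)) / total
    second = sum(k * k * v for k, v in enumerate(w)) / total
    return (second - mean ** 2) / mean

over_dispersed = dispersion_index(2.0, 0.5, 0.8, 0.3)    # alpha < 1, beta >= 0
under_dispersed = dispersion_index(2.0, 3.0, 1.5, -0.5)  # alpha >= 1, beta < 0
```

An index above one indicates over-dispersion relative to the Poisson distribution and an index below one indicates under-dispersion.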

As particular cases of the above result, when β = 1 we can see that the COM-NB (ν, p, α) distribution is always over dispersed for {0 < ν ≤ 1, 1 ≤ α < 2} or {ν > 1, α = 1} and under dispersed compared to the COMP distribution for {0 < ν ≤ 1, α ≥ 2}. Similarly, when α = 1, the GCOMP (ν, p, β) distribution is seen to be over dispersed for 0 < β ≤ 1 and under dispersed for β < 0. When ν = 1, we derive that COMP (p, α − β) is over dispersed for α − β < 1, under dispersed for α − β > 1 and equi-dispersed when α − β = 1. Finally, the new generalized NB distribution with pmf (7) is over dispersed when γ = 1 (which is when it reduces to the negative binomial distribution) and under dispersed if γ > 1.

It can also be checked that ECOMP (v, p, α, β) is over (under) dispersed for α ≥ β > (≤)0 w.r.t. COM-NB (v, p, α) and w.r.t. GCOM-Poisson (v, p, β) it is over (under) dispersed for β ≤ α < 1 (1 < β ≤ α).

2.6 Different formulations of ECOMP (v, p, α, β)

Two different formulations of the proposed distribution are presented in this section.

2.6.1 ECOMP (v, p, α, β) as a distribution from a queuing set up

Like the COM-Poisson distribution, the ECOMP (ν, p, α, β) distribution can also be derived as the probability of the system being in the k-th state for a queuing system with state dependent service and arrival rates.

Consider a single server queuing system with state dependent arrival rate λ_k = (ν + k)^β λ and state dependent service rate μ_k = k^α μ, where "state dependent" means dependent on the system state and the k-th state means that there are k units in the system. Here 1/μ and 1/λ are respectively the normal mean service and mean arrival time for a unit when that unit is the only one in the system; α and β are the pressure coefficients, reflecting the degree to which the service and arrival rates of the system are affected by the system state.

Proposition 1. Under the above set up, where the arrival rate and the service rate increase exponentially as the queue lengthens (i.e. as k increases), the probability of the system being in the k-th state is ECOMP (ν, p, α, β).

Proof: See Appendix B.1
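The queuing formulation can be checked numerically through the balance equations of the birth-death process. The Python sketch below takes p = λ/μ, which is what the detailed balance relation π_{k+1} μ_{k+1} = π_k λ_k gives when compared with recurrence (10); the base rates and the truncation of the state space are hypothetical choices made only for this illustration.

```python
def ecomp_probs(nu, p, alpha, beta, kmax):
    # ECOMP probabilities on 0..kmax from the recurrence for successive ratios
    weights = [1.0]
    for k in range(kmax):
        weights.append(weights[-1] * p * (nu + k) ** beta / (k + 1) ** alpha)
    total = sum(weights)
    return [w / total for w in weights]

lam, mu = 1.2, 0.8                 # hypothetical base arrival and service rates
nu, alpha, beta = 2.0, 2.0, 1.0
pi = ecomp_probs(nu, lam / mu, alpha, beta, kmax=60)

def arrival(k):
    return lam * (nu + k) ** beta  # state dependent arrival rate

def service(k):
    return mu * k ** alpha         # state dependent service rate (zero when k = 0)

# Global balance in state k: outflow equals inflow from the two neighbouring states
residuals = [pi[k] * (arrival(k) + service(k))
             - (pi[k - 1] * arrival(k - 1) if k > 0 else 0.0)
             - pi[k + 1] * service(k + 1)
             for k in range(60)]
max_residual = max(abs(r) for r in residuals)
```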

2.6.2 ECOMP (v, p, α, β) as exponential combination formulation

The general form of the exponential combination of two pmfs say f ₁(x; θ ₁) and f ₂(x; θ ₂) is given by (Atkinson 1970)

$$ {\left\{{f}_1\left(x;{\theta}_1\right)\right\}}^{\beta }{\left\{{f}_2\left(x;{\theta}_2\right)\right\}}^{1-\beta }/{\displaystyle \sum {f}_1{\left(x;{\theta}_1\right)}^{\beta }{f}_2{\left(x;{\theta}_2\right)}^{1-\beta }} $$

This way of combining pmfs was suggested by Cox (1961, 1962) for embedding two hypotheses (β = 1, i.e. the distribution is f₁, and β = 0, i.e. the distribution is f₂) in a general model of which both are special cases. Inferences about β are made in the usual way, and testing the hypothesis that the value of β is zero or one is equivalent to testing for departures from one model in the direction of the other.

Proposition 2. The ECOMP (ν, p, α, β) distribution is an exponential combination of the NB (ν, λ) and COM-Poisson (μ, θ) distributions, with λ^β μ^{1 − β} = p and α = θ(1 − β) + β.

Proof: See Appendix B.2.
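The parameter mapping in Proposition 2 can be confirmed numerically by forming the exponential combination on a finite support and comparing it with the ECOMP pmf normalised over the same support. This Python sketch uses hypothetical parameter values and an arbitrary support size K.

```python
import math

def normalise_logs(logs):
    # Turn log-weights into probabilities on the finite support
    top = max(logs)
    w = [math.exp(v - top) for v in logs]
    total = sum(w)
    return [v / total for v in w]

K = 80
nu, lam, mu, theta, beta = 2.0, 0.4, 1.5, 2.0, 0.3  # hypothetical values

# Log-kernels of NB(nu, lam) and COM-Poisson(mu, theta); constants cancel on normalising
log_nb = [math.lgamma(nu + k) - math.lgamma(nu) - math.lgamma(k + 1)
          + k * math.log(lam) for k in range(K)]
log_comp = [k * math.log(mu) - theta * math.lgamma(k + 1) for k in range(K)]
combination = normalise_logs([beta * a + (1 - beta) * b
                              for a, b in zip(log_nb, log_comp)])

# ECOMP with the parameters stated in Proposition 2
p = lam ** beta * mu ** (1 - beta)
alpha = theta * (1 - beta) + beta
ecomp = normalise_logs([beta * (math.lgamma(nu + k) - math.lgamma(nu))
                        + k * math.log(p) - alpha * math.lgamma(k + 1)
                        for k in range(K)])
max_gap = max(abs(a - b) for a, b in zip(combination, ecomp))
```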

From the above formulations it is clear that for ECOMP (v, p, α, β), β close to zero will indicate departure from COM-Poisson towards NB, while β close to one will indicate the reverse. Thus ECOMP (v, p, α, β) can also be regarded as a natural extension of COM-Poisson, and negative binomial distributions.

2.7 Log-concavity and modality

Proposition 3. The ECOMP (v, p, α, β) has a log-concave pmf when {ν > 1, p > 0, α ≥ β}

Proof: See Appendix B.3.

From the above result the corresponding results of COM-NB (v, p, α) and GCOMP (v, p, β) can be obtained as particular cases. That is COM-NB (v, p, α) is log-concave when {ν > 1, p > 0, α ≥ 1} and GCOMP (v, p, β) is log-concave when {ν > 1, p > 0, β ≤ 1}.

The following two important results follow as consequences of log-concavity:

If {ν ≥ 1, p > 0, α > β} the ECOMP (v, p, α, β) distribution is

➢ a strongly unimodal distribution
➢ has an increasing failure rate function

Using the recurrence relation of the probabilities in (10) it is observed that the ECOMP (ν, p, α, β) has

(i)
a non increasing pmf with a unique mode at X = 0 if ν ^β p < 1,

e.g. ν = 2, α = 3, β = 2, p should be less than 0.25 to have unique mode at X = 0.
(ii)
a unique mode at X = k if k ^α/(ν + k − 1)^β < p < (k + 1)^α/(ν + k)^β

e.g. ν = 2, α = 3, β = 2, p should be between 1.6875 and 2.560 to have unique mode at X = 3.
(iii)
two modes at X = k and X = k − 1 if (ν + k − 1)^β p = k ^α. In particular the two modes are at X = 0 and X = 1 if ν ^β p = 1.

e.g. ν = 2, α = 3, β = 2, p should be equal to 4.408 to have two modes at X = 5 and X = 6.
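The three numerical examples above follow directly from conditions (i) to (iii) and can be reproduced with a few lines of Python. The function names are illustrative only.

```python
def mode_interval(k, nu, alpha, beta):
    # Open p-interval giving a unique mode at X = k (k >= 1), from condition (ii)
    lower = k ** alpha / (nu + k - 1) ** beta
    upper = (k + 1) ** alpha / (nu + k) ** beta
    return lower, upper

def two_mode_p(k, nu, alpha, beta):
    # Value of p giving equal probabilities at X = k - 1 and X = k, from condition (iii)
    return k ** alpha / (nu + k - 1) ** beta

nu, alpha, beta = 2, 3, 2
zero_mode_threshold = 1 / nu ** beta          # condition (i): mode at 0 when p is below this
interval_k3 = mode_interval(3, nu, alpha, beta)
p_modes_5_6 = two_mode_p(6, nu, alpha, beta)  # modes at X = 5 and X = 6
```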

Graphical illustrations of the above three examples are presented in the first plots of Fig. 1. It is interesting to note that the distribution may also be bimodal with one of the modes always at zero, as shown in the last two plots in Fig. 1.

Proposition 4. ECOMP (v, p, α, β) has a log-convex pmf for {0 < ν ≤ 1, α = β}

Proof. See Appendix B.4.

The following important results follow as consequences of log-convexity:

If {ν ≤ 1, p > 0, α = β} the ECOMP (v, p, α, β) distribution with pmf in (7)

➢ is an infinitely divisible distribution (see Warde and Katti 1971), hence a discrete compound Poisson distribution (see page 409 of Gómez-Déniz et al. 2011)
➢ has a decreasing failure rate function, hence an increasing mean residual life function
➢ has an upper bound for the variance given by p ν^β (using the result on page 410 of Gómez-Déniz et al. 2011)

2.8 Moments

The r-th factorial moment E(X^[r]) = μ^[r] of the ECOMP (ν, p, α, β) distribution is given by

$$ \begin{array}{l}{\mu}^{\left[r\right]}=\frac{{\left\{{\left(\nu \right)}_r\right\}}^{\beta}\;{p}^r}{{\left(r\;!\right)}^{\alpha -1}}\;\frac{{}_1S_{\alpha -1}^{\beta}\left(\nu +r,r+1,p\right)}{{}_1S_{\alpha -1}^{\beta}\left(\nu, 1,p\right)}\\ {}=\frac{{\left\{{\left(\nu \right)}_r\right\}}^{\beta}\;{p}^r}{{\left(r\;!\right)}^{\alpha -1}}\frac{{}_{\beta }F_{\alpha -1}\left(\nu +r; \kern0.24em r+1,\;r+1,\cdots,\;r+1;\kern0.24em p\right)}{{}_{\beta }F_{\alpha -1}\left(\nu; \kern0.24em 1,\;1,\cdots,\;1;\kern0.24em p\right)},\end{array} $$

where the second expression in terms of hypergeometric function is for the case when α, β are both positive integers.

Since the ECOMP (ν, p, α, β) distribution is a member of the exponential family (see Section 2.4), the mean is given by differentiating the logarithm of the normalizing constant with respect to log p. Hence an asymptotic approximation for the mean is obtained by differentiating the logarithm of the function (9) with respect to log p as

$$ {p}^{\raisebox{1ex}{$1$}\!\left/ \!\raisebox{-1ex}{$\left(\alpha -\beta \right)$}\right.}+\frac{1-\alpha +\left(2\nu -1\right)\beta }{2\left(\alpha -\beta \right)}. $$

(12)

This function approximates the mean of the ECOMP (v, p, α, β) distribution for large p and small |α − β|, where it is difficult to compute the approximation by truncation. A numerical illustration of this asymptotic approximation is presented in the Appendix A.2.
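Approximation (12) can be compared with a mean computed from a long truncated sum. The Python sketch below uses ν = 2, α = 2, β = 1, a case chosen for illustration because the exact mean is then p + p/(1 + p) while (12) gives p + 1; the function names and the number of terms are assumptions.

```python
import math

def ecomp_mean(nu, p, alpha, beta, terms=2000):
    # Mean of ECOMP(nu, p, alpha, beta) on a truncated support
    logw = [beta * (math.lgamma(nu + k) - math.lgamma(nu))
            + k * math.log(p) - alpha * math.lgamma(k + 1) for k in range(terms)]
    top = max(logw)
    w = [math.exp(v - top) for v in logw]
    return sum(k * v for k, v in enumerate(w)) / sum(w)

def mean_approx(nu, p, alpha, beta):
    # Asymptotic approximation (12)
    d = alpha - beta
    return p ** (1 / d) + (1 - alpha + (2 * nu - 1) * beta) / (2 * d)

numerical = ecomp_mean(2.0, 50.0, 2.0, 1.0)
approximate = mean_approx(2.0, 50.0, 2.0, 1.0)
```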

3 Reliability characteristics and stochastic ordering

3.1 Survival and failure rate functions

The survival function is given by

$$ \begin{array}{c}\hfill S(t)=1-P\left(X<t\right)=1-\frac{1}{{}_1S_{\alpha -1}^{\beta}\left(\nu;\;1; \kern0.24em p\right)}{\displaystyle \sum_{k=0}^{t-1}\frac{{\left\{{\left(\nu \right)}_k\right\}}^{\beta }{p}^k}{{\left(k!\right)}^{\alpha }}}\hfill \\ {}\hfill =1-\frac{1}{{}_{\beta }F_{\alpha -1}\left(\nu;\;1,\;1,\cdots,\;1;p\right)}{\displaystyle \sum_{k=0}^{t-1}\frac{{\left\{{\left(\nu \right)}_k\right\}}^{\beta }{p}^k}{{\left(k!\right)}^{\alpha }}}\hfill \end{array}. $$

Alternatively, S(t) can also be expressed as

$$ S(t)=\frac{{\left\{{\left(\nu \right)}_t\right\}}^{\beta}\;{p}^t}{{\left(t\;!\right)}^{\alpha }}\frac{{}_2S_{\alpha}^{\beta}\left(\nu +t,1;\kern0.24em t+1;\kern0.24em p\right)}{{}_1S_{\alpha -1}^{\beta}\left(\nu;\;1; \kern0.24em p\right)}=\frac{{\left\{{\left(\nu \right)}_t\right\}}^{\beta}\;{p}^t}{{\left(t\;!\right)}^{\alpha }}\;\frac{{}_{\beta +1}F_{\alpha}\left(\nu +t,\;1;\kern0.24em t+1,\;t+1,\cdots,\;t+1;\kern0.24em p\right)}{{}_{\beta }F_{\alpha -1}\left(\nu; \kern0.24em 1,\;1,\cdots,\;1;\kern0.24em p\right)}. $$

The failure rate function is given by

$$ r(t)=\frac{P\left(X=t\right)}{P\left(X\ge t\right)}=\frac{1}{{}_2S_{\alpha}^{\beta}\left(\nu +t,1;\kern0.24em t+1;\kern0.24em p\right)}=\frac{1}{{}_{\beta +1}F_{\alpha}\left(\nu +t,\;1;\kern0.24em t+1,\;t+1,\cdots,\;t+1;\kern0.24em p\right)}, $$

where the second expression in terms of hypergeometric function is for the case when α, β are positive integers.
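The survival and failure rate functions can also be evaluated directly from the probabilities, without the hypergeometric-type series. The Python sketch below accumulates the tail sums from the far end of a truncated support, which avoids the cancellation in 1 − P(X < t); the parameter values are hypothetical and lie in the log-concave region, so the failure rate is expected to increase.

```python
def ecomp_probs(nu, p, alpha, beta, kmax):
    # ECOMP probabilities on 0..kmax from the recurrence for successive ratios
    weights = [1.0]
    for k in range(kmax):
        weights.append(weights[-1] * p * (nu + k) ** beta / (k + 1) ** alpha)
    total = sum(weights)
    return [w / total for w in weights]

probs = ecomp_probs(2.0, 2.0, 3.0, 2.0, kmax=150)

# tail[t] = P(X >= t), built backwards so small tail probabilities stay accurate
tail = [0.0] * (len(probs) + 1)
for k in range(len(probs) - 1, -1, -1):
    tail[k] = tail[k + 1] + probs[k]

survival = tail[:16]                                     # S(t) for t = 0, ..., 15
failure_rate = [probs[t] / tail[t] for t in range(16)]   # r(t) = P(X = t) / P(X >= t)
```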

3.2 Stochastic orderings

An rv X with pmf P(X = n) is said to be smaller than another rv Y with pmf P(Y = n) in the likelihood ratio order, that is X ≤_lr Y, if P(Y = n)/P(X = n) increases in n over the union of the supports of X and Y. Again, X ≤_lr Y implies that X is smaller than Y in the hazard rate order and subsequently in the mean residual life (MRL) order (see Gupta et al. 2014).

Theorem 1. X ~ ECOMP (ν, p, α, β) is smaller than Y ~ COM-NB (ν, p, α) in the likelihood ratio order i.e. X ≤ _lr Y when β < 1.

Proof: If X ~ ECOMP (v, p, α, β) and Y ~ COM-NB (v, p, α), then

$$ \frac{P\left(Y=n\right)}{P\left(X=n\right)}={\left\{{\left(\nu \right)}_n\right\}}^{1-\beta}\;\frac{{}_1S_{\alpha -1}^{\beta}\left(\nu, 1,p\right)}{{}_1S_{\alpha -1}^1\left(\nu, 1,p\right)}. $$

This is clearly increasing in n as β < 1 (Definition 1.C.1 of Chapter 1, Shaked and Shanthikumar 2007 and Gupta et al. 2014). Hence the result is proved.

As an implication of theorem 1, we get X ≤ _hr Y ⇒ X ≤ _MRL Y, for β < 1.

Theorem 2. X ~ ECOMP (v, p, α, β) is smaller than Y ~ GCOMP (v, p, β) in the likelihood ratio order i.e. X ≤_lr Y when α > 1.

Proof: If X ~ ECOMP (ν, p, α, β) and Y ~ GCOMP (ν, p, β), then

$$ \frac{P\left(Y=n\right)}{P\left(X=n\right)}={\left(n\;!\right)}^{\alpha -1}\;\frac{{}_1S_{\alpha -1}^{\beta}\left(\nu, 1,p\right)}{{}_1S_0^{\beta}\left(\nu, 1,p\right)}. $$

This is clearly increasing in n as α > 1 (Definition 1.C.1 of Chapter 1, Shaked and Shanthikumar 2007 and Gupta et al. 2014). Hence the result is proved.

As an implication of theorem 2, we get X ≤ _hr Y ⇒ X ≤ _MRL Y, for α > 1.
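Both likelihood ratio orderings can be illustrated numerically by treating COM-NB as the β = 1 case and GCOMP as the α = 1 case of the ECOMP pmf. The Python sketch below works with log-probabilities on a truncated support; the parameter values are hypothetical and satisfy β < 1 and α > 1.

```python
import math

def log_pmf(nu, p, alpha, beta, terms=300):
    # Log-probabilities of ECOMP(nu, p, alpha, beta) on a truncated support
    logw = [beta * (math.lgamma(nu + k) - math.lgamma(nu))
            + k * math.log(p) - alpha * math.lgamma(k + 1) for k in range(terms)]
    top = max(logw)
    log_norm = top + math.log(sum(math.exp(v - top) for v in logw))
    return [v - log_norm for v in logw]

nu, p, alpha, beta = 2.0, 0.5, 1.5, 0.6
x = log_pmf(nu, p, alpha, beta)        # X ~ ECOMP(nu, p, alpha, beta)
y_comnb = log_pmf(nu, p, alpha, 1.0)   # Y ~ COM-NB(nu, p, alpha)
y_gcomp = log_pmf(nu, p, 1.0, beta)    # Y ~ GCOMP(nu, p, beta)

# log of P(Y = n) / P(X = n) for n = 0, ..., 19
log_ratio_thm1 = [y_comnb[n] - x[n] for n in range(20)]
log_ratio_thm2 = [y_gcomp[n] - x[n] for n in range(20)]
```

In the second comparison the ratio is proportional to (n!)^{α − 1}, so its values at n = 0 and n = 1 coincide and the ratio is non-decreasing rather than strictly increasing at that first step.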

4 Numerical examples

To fit the proposed distribution, we have to estimate the parameters (v, p, α, β) in (6). The maximum likelihood (ML) estimation is often used for fitting to real data, but the log likelihood function of the proposed distribution

$$ \mathrm{L}\left(\nu, p,\;\alpha,\;\beta \right)=\beta {\displaystyle \sum_{i=0}^k{f}_i \log {\left(\nu \right)}_i}-\alpha {\displaystyle \sum_{i=0}^k{f}_i \log i}!+ \log p{\displaystyle \sum_{i=0}^ki\;{f}_i-N \log {}_1S_{\alpha -1}^{\beta}\left(\nu, 1,p\right)} $$

(13)

where f_i is the observed frequency of the i-th observed value (event), $ N={\displaystyle \sum_{i=0}^k{f}_i} $, and k is the highest observed value, has several local maximum points for some datasets, or the likelihood equations do not always have a unique solution. Therefore, we use profile likelihood estimation. We first consider the maximum likelihood estimation with the parameter ν fixed and find the maximum point $ \left({\widehat{p}}_{\nu },{\widehat{\alpha}}_{\nu },{\widehat{\beta}}_{\nu}\right) $ of the function (13). The maximum point $ \left({\widehat{p}}_{\nu },{\widehat{\alpha}}_{\nu },{\widehat{\beta}}_{\nu}\right) $ is uniquely determined because the proposed distribution belongs to the exponential family when ν is fixed. For finding $ \left({\widehat{p}}_{\nu },{\widehat{\alpha}}_{\nu },{\widehat{\beta}}_{\nu}\right) $ computationally, it is convenient to use some initial values. Simple initial values can be obtained as follows. Putting c_x = P(X = x + 1)/P(X = x) and d_x = log(c_{x+1}/c_x), where X is the rv following the ECOMP (ν, p, α, β) distribution, we have the equation

$$ {A}_x(v)\;\left(\begin{array}{c}\hfill \alpha \hfill \\ {}\hfill \beta \hfill \end{array}\right)=\left(\begin{array}{c}\hfill {d}_x\hfill \\ {}\hfill {d}_{x+1}\hfill \end{array}\right),\kern0.5em \mathrm{where}\kern0.5em {A}_x(v)=\left(\begin{array}{cc}\hfill \log \frac{x+1}{x+2}\hfill & \hfill \log \frac{v+x+1}{v+x}\hfill \\ {}\hfill \log \frac{x+2}{x+3}\hfill & \hfill \log \frac{v+x+2}{v+x+1}\hfill \end{array}\right). $$

For given v, we choose the integer k such that |A _k(v)| ≠ 0 and put

$$ \left(\begin{array}{c}\hfill {s}_{1,k}(v)\hfill \\ {}\hfill {s}_{2,k}(v)\hfill \end{array}\right)={A}_k{(v)}^{-1}\left(\begin{array}{c}\hfill {d}_k\hfill \\ {}\hfill {d}_{k+1}\hfill \end{array}\right) $$

where P(X = x) is substituted with f _x in d _x. Then we can obtain the initial values $ \left({\overset{\sim }{p}}_k(v),{\overset{\sim }{\alpha}}_k(v),{\overset{\sim }{\beta}}_k(v)\right) $ for (p, α, β) as

$$ {\overset{\sim }{\alpha}}_k(v)=\left\{\begin{array}{cc}\hfill {s}_{1,k}(v)\hfill & \hfill {s}_{1,k}(v)>{s}_{2,k}(v)\hfill \\ {}\hfill {s}_{2,k}(v)\hfill & \hfill otherwise\hfill \end{array}\right.,\ {\overset{\sim }{\beta}}_k(v)={s}_{2,k}(v)\kern0.5em \mathrm{and}\kern0.75em {\overset{\sim }{p}}_l(v)=\frac{{\left(l+1\right)}^{\overset{\sim }{\alpha_k(v)}}}{{\left(v+l\right)}^{\overset{\sim }{\beta_k(v)}}}, $$

where l is the lowest observed value (e.g. l = 0 for neither censored nor truncated data). These values are available even for the truncated version of ECOMP (v, p, α, β) distribution. Then by studying the behavior of $ \mathrm{L}\left(\nu, {\widehat{p}}_{\nu },{\widehat{\alpha}}_{\nu },{\widehat{\beta}}_{\nu}\right) $ with v varying, we find the range of v where the function will give the global maximum. For the range, the maximum point of the function (13) gives the ML estimates $ \left(\widehat{\nu},\widehat{p},\widehat{\alpha},\widehat{\beta}\right) $.
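The initial values for α and β amount to solving a 2 × 2 linear system. The Python sketch below reads d_x as log(c_{x+1}/c_x), the form consistent with the matrix A_x(ν), and solves the system by Cramer's rule. As a sanity check it is fed exact ECOMP probabilities in place of observed frequencies, which is a synthetic test and not data from the article; all names and values are hypothetical.

```python
import math

def initial_alpha_beta(freq, nu, x):
    # c_j = f_{j+1} / f_j estimates P(X = j + 1) / P(X = j)
    c = [freq[j + 1] / freq[j] for j in (x, x + 1, x + 2)]
    d0, d1 = math.log(c[1] / c[0]), math.log(c[2] / c[1])
    # Entries of A_x(nu)
    a11, a12 = math.log((x + 1) / (x + 2)), math.log((nu + x + 1) / (nu + x))
    a21, a22 = math.log((x + 2) / (x + 3)), math.log((nu + x + 2) / (nu + x + 1))
    det = a11 * a22 - a12 * a21
    if det == 0:
        raise ValueError("A_x(nu) is singular; choose another x")
    s1 = (d0 * a22 - a12 * d1) / det
    s2 = (a11 * d1 - a21 * d0) / det
    alpha0 = s1 if s1 > s2 else s2   # rule for the initial alpha given in the text
    beta0 = s2
    return alpha0, beta0

# Synthetic check: exact ECOMP(2, 1.3, 1.7, 0.6) probabilities used as "frequencies"
nu, p, alpha, beta = 2.0, 1.3, 1.7, 0.6
weights = [1.0]
for k in range(5):
    weights.append(weights[-1] * p * (nu + k) ** beta / (k + 1) ** alpha)
alpha0, beta0 = initial_alpha_beta(weights, nu, x=0)
```

With real frequencies the recovered values are only rough starting points, since the empirical ratios are noisy.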

By using this method, we fit the proposed distribution to three datasets and compare it with NB (r, p), COMP (θ, p), COM-NB (ν, p, α) and GCOMP (ν, p, β). Simultaneously, we fit the Delaporte distribution, which is derived from the convolution of an NB (r, p) and a Poisson (λ) rv, and some mixed Poisson distributions: mixing with the generalized gamma distribution of Agarwal and Kalla (1996) with parameters (δ, m, α, n), mixing with the generalized inverse Gaussian gamma distribution of Jorgensen (1982) with parameters (χ, η, ω, λ), and mixing with the generalized exponential distribution of Ong and Lee (1986) with parameters (v, a, d, β). These distributions are derived as generalized negative binomial distributions and are used for long-tailed count data. The detailed studies are given in Gupta and Ong (2005). Here we show only the best fitting distribution among these distributions in Gupta and Ong (2005). The performances of the various distributions are compared using the χ² goodness of fit and the Akaike Information Criterion (AIC). Following Burnham and Anderson (2004) we look at the difference Δ_i = AIC_i − AIC_min, where AIC_min is the minimum of the AIC values of all the fitted models and AIC_i is that of the i-th model. According to Burnham and Anderson (2004), models having Δ_i ≤ 2 have substantial support (evidence) and those with Δ_i ≥ 4 have considerably less support. For computing the χ² goodness of fit statistics we group the cells whose expected number is less than 5 such that the expected number of each grouped cell is not less than 5.

4.1 Corbet’s Malayan butterflies

The first example is the frequency distribution of Corbet’s Malayan Butterfly with zeros (Corbet 1942). Corbet caught altogether 620 species, but he also estimated that the total butterfly fauna of the area contained 924 species, so that 304 species were missing from the collection and are treated as count zero. In this dataset, the counts greater than 24 are grouped as 25+, so we use the log-likelihood function of the form

$$ {\displaystyle \sum_{i=0}^{24}{f}_i} \log P\left(X=i\right)+{f}_{25} \log P\left(X\ge 25\right), $$

where X is the rv of the fitted distribution.
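A right-censored log-likelihood of this form can be sketched in Python as follows. The frequencies are made-up toy numbers used only to exercise the function and are not the butterfly data; the last cell plays the role of the grouped "25+" class. The check uses ν = 1 and α = β = 1, for which the ECOMP distribution is geometric.

```python
import math

def ecomp_probs(nu, p, alpha, beta, kmax=500):
    # ECOMP probabilities on 0..kmax from the recurrence for successive ratios
    weights = [1.0]
    for k in range(kmax):
        weights.append(weights[-1] * p * (nu + k) ** beta / (k + 1) ** alpha)
    total = sum(weights)
    return [w / total for w in weights]

def censored_loglik(freq, nu, p, alpha, beta):
    # freq[0..c-1] count exact values 0..c-1; freq[c] counts observations >= c
    probs = ecomp_probs(nu, p, alpha, beta)
    c = len(freq) - 1
    tail = sum(probs[c:])                      # P(X >= c)
    ll = sum(f * math.log(probs[i]) for i, f in enumerate(freq[:c]) if f > 0)
    if freq[c] > 0:
        ll += freq[c] * math.log(tail)
    return ll

toy_freq = [6, 3, 1]   # made-up counts for X = 0, X = 1 and X >= 2
value = censored_loglik(toy_freq, 1.0, 0.4, 1.0, 1.0)
expected = 6 * math.log(0.6) + 3 * math.log(0.24) + math.log(0.16)
```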

Comparing the performance of the distributions presented in Table 1, we see that the Delaporte distribution gives the best fit, marginally better than the ECOMP distribution in terms of AIC and χ² goodness of fit, but the value of Δ_i suggests that the ECOMP distribution also has substantial support (evidence) for the data. Both of these distributions fit the counts 0 and 1 and the tail part 25+ much better than the rest.

Table 1 Distribution of Corbet’s Malayan Butterfly with zeros (Corbet 1942)


Moreover, it can be observed that the ML estimate $ \widehat{\alpha} $ of the COM-NB distribution and the ML estimate $ \widehat{\beta} $ of the GCOMP distribution show that these two distributions reduce to the negative binomial distribution, while the proposed ECOMP distribution does not seem to reduce to the negative binomial distribution. Indeed, the likelihood ratio test of H₀: negative binomial distribution (α = β = 1) against H₁: ECOMP distribution (α ≠ 1 or β ≠ 1) rejects the negative binomial distribution (p-value 0.001). So the ECOMP distribution brings a substantial improvement in fitting this data set over both the COM-NB and GCOMP distributions.

4.2 The spots in southern pine beetle

The second example is the frequency data of the number of spots (k) in southern pine beetle, Dendroctonus frontalis Zimmermann (Coleoptera: Scolytidae), in Southeast Texas (Lin 1985). Table 2 shows the fitting results, where Poi-GE denotes the mixed Poisson distribution with the generalized exponential distribution. From the χ² goodness of fit and AIC, the GCOMP distribution gives the best fit among the fitted distributions. However, the value of Δ_i suggests that the ECOMP distribution fits the data equally well. From the estimated parameters of the ECOMP distribution, we can see that the fitted ECOMP distribution reduces to the new generalization of the NB distribution given in equation (7) with estimated parameters $ \widehat{\nu}=0.002,\;\widehat{p}=0.69,\;\widehat{\gamma}=0.28 $. Further, by virtue of proposition 2 in Section 2.6.2, we can conclude that the fitted ECOMP distribution reduces to an exponential combination of NB (0.003, λ) and Geometric (μ) in the ratio 0.28:0.72, where λ and μ can be calculated using the formula given in Section 2.6.2.

Table 2 The number of spots in southern pine beetle (Lin 1985)


4.3 Borrowing library books

The third example shows the number of books that were borrowed k times (k ≥ 1) from the long loan collection at Sussex University over the period of a year (Burrell and Cane 1982). To fit this dataset, we consider the zero-truncation of each distribution. Table 3 shows the fitting results. From the χ² goodness of fit and AIC, the zero-truncated ECOMP distribution gives the best fit among the fitted distributions. The values of Δ_i suggest that the COM-NB distribution also has good support (evidence), while the rest of the models have considerably less support from the data. Here we interpret the size of the queue in Section 2.6.1 as the popularity of books. Then, from the estimated parameters of the ECOMP distribution, we see that new interest is hard to increase, but popularity is hard to decrease for a book that is borrowed many times. This might be because the more times a book is borrowed, the fewer opportunities remain to borrow it.
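Zero-truncation simply conditions on X ≥ 1, that is, P(X = k | X ≥ 1) = P(X = k)/(1 − P(X = 0)) for k ≥ 1. The sketch below shows the operation on a generic pmf stored as a list; a Poisson pmf is used purely as a stand-in, and the helper names are illustrative assumptions.

```python
import math


def poisson_pmf(lam, n_terms=100):
    """Stand-in base distribution for the illustration (truncated Poisson pmf)."""
    return [
        math.exp(-lam + k * math.log(lam) - math.lgamma(k + 1))
        for k in range(n_terms)
    ]


def zero_truncate(pmf):
    """Condition a count pmf on X >= 1: drop the mass at zero and rescale."""
    p0 = pmf[0]
    if p0 >= 1.0:
        raise ValueError("all mass is at zero; truncation is undefined")
    return [0.0] + [q / (1.0 - p0) for q in pmf[1:]]


truncated = zero_truncate(poisson_pmf(1.5))
```

The same `zero_truncate` step applies unchanged to any of the fitted count distributions once its pmf has been computed.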

Table 3 The number of books that were borrowed k times (Burrell and Cane 1982)


5 Concluding remarks

The extended Conway-Maxwell-Poisson distribution proposed here unifies the COM-NB and GCOMP distributions, which were recently introduced to add more flexibility to the COM-Poisson distribution. With its additional parameter, the proposed distribution is more flexible in terms of tail behavior and dispersion level. Further, it also arises from a queuing theory setup and as an exponential combination of the negative binomial and COM-Poisson distributions, and it has many interesting properties. It is therefore envisaged that the ECOMP distribution has potential for modeling a wide variety of count data.

References

Abramowitz, M., Stegun, I.A.: Handbook of Mathematical Functions, 9th printing. Dover, New York (1970)
Agarwal, S.K., Kalla, S.L.: A generalized gamma distribution and its application in reliability. Commun. Stat. Theory Methods 25(1), 201–210 (1996)
Atkinson, A.C.: A method for discriminating between models. J. R. Stat. Soc. Series B (Methodological) 32(3), 323–353 (1970)
Bleistein, N., Handelsman, R.A.: Asymptotic expansions of integrals. Dover, New York (1986)
Burnham, K.P., Anderson, D.R.: Multimodel inference: understanding AIC and BIC in model selection. Sociol. Methods Res. 33(2), 261–304 (2004)
Burrell, Q.L., Cane, V.R.: The analysis of library data. J. R. Stat. Soc. Series A 145, 439–471 (1982)
Chakraborty, S., Ong, S.H.: A COM-type generalization of the negative binomial distribution. Accepted in April 2014 (available online since 07 November 2015), to appear in Communications in Statistics-Theory and Methods
Conway, R.W., Maxwell, W.L.: A queueing model with state dependent service rates. J. Ind. Eng. 12, 132–136 (1962)
Corbet, A.S.: The distribution of butterflies in the Malay peninsula. Proc. R. Entomol. Soc. London, Series A, General Entomology 16, 101–116 (1942)
Cox, D.R.: Tests of separate families of hypotheses. Proc. 4th Berkeley Symp. 1, 105–123 (1961)
Cox, D.R.: Further results on tests of separate families of hypotheses. J. R. Stat. Soc. Series B 24, 406–424 (1962)
Gómez-Déniz, E., María Sarabia, J., Calderín-Ojeda, E.: A new discrete distribution with actuarial applications. Insur. Math. Econ. 48, 406–412 (2011)
Gupta, R.C., Ong, S.H.: Analysis of long-tailed count data by Poisson mixtures. Commun. Stat. Theory Methods 34, 557–574 (2005)
Gupta, P.L., Gupta, R.C., Tripathi, R.C.: On the monotonic properties of discrete failure rates. J. Stat. Plan. Inference 65, 255–268 (1997)
Gupta, R.C., Sim, S.Z., Ong, S.H.: Analysis of discrete data by Conway-Maxwell Poisson distribution. AStA Adv. Stat. Anal. 98, 327–343 (2014)
Imoto, T.: A generalized Conway-Maxwell-Poisson distribution which includes the negative binomial distribution. Appl. Math. Comput. 247, 824–834 (2014)
Johnson, N.L., Kemp, A.W., Kotz, S.: Univariate discrete distributions. Wiley, New York (2005)
Jorgensen, B.: Statistical properties of the generalized inverse Gaussian distribution. Lecture Notes in Statistics, Springer-Verlag, New York (1982)
Kokonendji, C.C., Mizère, D., Balakrishnan, N.: Connections of the Poisson weight function to overdispersion and underdispersion. J. Stat. Plan. Inference 138, 1287–1296 (2008)
Lin, S-K.: Characterization of lightning as a disturbance to the forest ecosystem in East Texas. M.Sc. thesis. Texas A & M University, College Station (1985)
Minka, T.P., Shmueli, G., Kadane, J.B., Borle, S., Boatwright, P.: Computing with the COM-Poisson distribution. Technical Report 776, Department of Statistics, Carnegie Mellon University, http://repository.cmu.edu/cgi/viewcontent.cgi?article=1174&context=statistics (2003)
Ong, S.H., Lee, P.A.: On a generalized non-central negative binomial distribution. Commun. Stat. Theory Methods 15, 1065–1079 (1986)
Shaked, M., Shanthikumar, J.G.: Stochastic orders. Springer Verlag, New York (2007)
Warde, W.D., Katti, S.K.: Infinite divisibility of discrete distributions II. Ann. Math. Stat. 42(3), 1088–1090 (1971)


Acknowledgments

The corresponding author, Prof. Subrata Chakraborty, would like to thank the Editors-in-Chief, Prof. Felix Famoye and Prof. Carl Lee, for the invitation to write a paper for this esteemed Journal. Both authors acknowledge the comments and suggestions of the editor and both reviewers, which led to substantial improvement in the presentation of the work.

Author information

Authors and Affiliations

Department of Statistics, Dibrugarh University, Dibrugarh, 786004, Assam, India
Subrata Chakraborty
The Institute of Statistical Mathematics, 10-3 Midori-cho, Tachikawa, Tokyo, 190-8562, Japan
Tomoaki Imoto


Corresponding author

Correspondence to Subrata Chakraborty.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

SC conceptually developed the proposed distribution with the related mathematical results of the paper and drafted the manuscript. TI developed Sections 2.2, 2.8 and 4 of the manuscript. Both authors read and approved the final manuscript.

Appendix

A. Approximations of the normalizing constant $ {}_1S_{\alpha -1}^{\beta}\left(\nu; 1;p\right) $ and the mean

A.1 Asymptotic approximation of the normalizing constant $ {}_1S_{\alpha -1}^{\beta}\left(\nu; 1;p\right) $ using the Laplace’s method

Defining $ i=\sqrt{-1} $, we have the identity for non-negative integers n and k

$$ \frac{1}{2\pi }{\displaystyle \underset{-\pi }{\overset{+\pi }{\int }} \exp \left\{{e}^{iz}-izn\right\}}{e}^{-izk}dz=\frac{1}{\left(n+k\right)!}. $$

This leads to the identities

$$ \begin{array}{l}\frac{1}{2\pi }{\displaystyle \underset{-\pi }{\overset{+\pi }{\int }} \exp \left\{{e}^{iz}\right\}{}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p{e}^{-iz}\right)\;}dz={}_1S_{\alpha}^{\beta}\left(\nu;\;1;p\right)\kern0.5em \mathrm{and}\\ {}\frac{1}{2\pi }{\displaystyle \underset{-\pi }{\overset{+\pi }{\int }} \exp \left\{{e}^{iz}-iz\left(\nu -1\right)\right\}{\left\{\Gamma \left(\nu \right)\right\}}^{\beta }{}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p{e}^{-iz}\right)\;}dz={\left\{\Gamma \left(\nu \right)\right\}}^{\beta -1}{}_1S_{\alpha -1}^{\beta -1}\left(\nu;\;1;p\right)\end{array} $$

From these two identities, starting from $ {}_1S_0^0\left(\nu;\;1;p\right)={e}^p $ and applying the first identity α − 1 times and the second identity − β times, we get the formula for integer values α ≥ 1 and β ≤ 0

$$ \begin{array}{l}{}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right){\left\{\Gamma \left(\nu \right)\right\}}^{\beta}\\ {}=\frac{1}{{\left(2\pi \right)}^{\alpha -\beta -1}}{\displaystyle \underset{-\pi }{\overset{+\pi }{\int }}\cdots {\displaystyle \underset{-\pi }{\overset{+\pi }{\int }} \exp \left\{{\displaystyle \sum_{l=1}^{\alpha -\beta -1}{e}^{i{z}_l}}+p{e}^{-{\sum}_{l=1}^{\alpha -\beta -1}i{z}_l}-\left(\nu -1\right){\displaystyle \sum_{l=\alpha}^{\alpha -\beta -1}i{z}_l}\right\}\;}d{z}_1\cdots }d{z}_{\alpha -\beta -1}\end{array} $$

Changing the variables iz_l = ix_l + log p/(α − β) and then applying Laplace’s method for the approximation of multiple integrals, we obtain the formula (9).
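The single-integral identity that starts this derivation, (1/2π)∫ exp{e^{iz} − izn} e^{−izk} dz = 1/(n + k)!, can be checked numerically. The sketch below uses an equally spaced (trapezoidal) rule over one period, which is very accurate for smooth periodic integrands; the function name and the number of quadrature points are assumptions for illustration.

```python
import cmath
import math


def contour_identity_lhs(n, k, n_points=256):
    """Equally spaced quadrature of
    (1 / 2*pi) * integral_{-pi}^{pi} exp(e^{iz} - i*z*n) * e^{-i*z*k} dz."""
    total = 0j
    for j in range(n_points):
        z = -math.pi + 2.0 * math.pi * j / n_points
        total += cmath.exp(cmath.exp(1j * z) - 1j * z * n) * cmath.exp(-1j * z * k)
    # (1 / 2*pi) * (2*pi / n_points) * sum = sum / n_points
    return total / n_points


value = contour_identity_lhs(2, 3)  # compare with 1 / (2 + 3)!
```

The imaginary part of `value` should vanish up to rounding error, and the real part should agree with 1/5!.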

Formula (9) has been derived for integer values α ≥ 1 and − β ≥ 0, but numerical studies suggest that it also holds for 0 < α − β < 1 and p > 1, where it is difficult to compute $ {}_1S_{\alpha -1}^{\beta}\left(\nu;\;1;p\right) $ by the truncated approximation (8). Table 4 gives the percentage errors $ 100\left\{{}_1\overset{\sim }{S}_{\alpha -1}^{\beta}\left(\nu;\;1;p\right)-{}_1S_{\alpha -1,m}^{\beta}\left(\nu;\;1;p\right)\;\right\}/{}_1S_{\alpha -1,m}^{\beta}\left(\nu;\;1;p\right) $, with m = 18000 such that $ R_m\left(\nu, p,\alpha, \beta \right)<{10}^{-28} $, where $ {}_1\overset{\sim }{S}_{\alpha -1}^{\beta}\left(\nu;\;1;p\right) $ is the r.h.s. of formula (9).

Table 4 The percentage errors for approximation (9) with β = 2.5


A.2 Asymptotic approximation of the mean

A numerical illustration of the performance of the asymptotic approximation formula of mean in equation (12) is provided in Table 5.

Table 5 The percentage errors for mean with β = 2.5 using the approximation formula in equation (12)


B. Proof of the propositions

B.1 Proof of proposition 1

Following Conway and Maxwell (1962), the differential-difference equations of the system are given by

$$ {P}_0\left(t+\Delta \right)=\left(1-\lambda\;{\nu}^{\beta}\;\Delta \right){P}_0(t)+\mu\;\Delta {P}_1(t) $$

(14)

and

$$ {P}_k\left(t+\Delta \right)=\left(1-\lambda\;{\left(\nu +k\right)}^{\beta}\Delta -\mu\;{k}^{\alpha}\;\Delta \right){P}_k(t)+\lambda\;{\left(\nu +k-1\right)}^{\beta}\;\Delta\;{P}_{k-1}(t)+\mu\;{\left(k+1\right)}^{\alpha}\;\Delta {P}_{k+1}(t),\kern0.5em k=1,2,\cdots $$

(15)

Let λ/μ = p. Then from (14) and (15) we get

$$ \begin{array}{l}\left\{{P}_0\left(t+\Delta \right)-{P}_0(t)\right\}/\Delta =-\mu\;p\;{\nu}^{\beta}\;{P}_0(t)+\mu\;{P}_1(t)\kern0.5em \mathrm{and}\\ {}\left\{{P}_k\left(t+\Delta \right)-{P}_k(t)\right\}/\Delta =\mu\;p\;{\left(\nu +k-1\right)}^{\beta}\;{P}_{k-1}(t)-\mu\;\left(p\;{\left(\nu +k\right)}^{\beta }+{k}^{\alpha}\right){P}_k(t)\\ {}+\mu\;{\left(k+1\right)}^{\alpha}\;{P}_{k+1}(t),\kern0.5em k=1,2,\cdots \end{array} $$

Now as Δ → 0 we get

$$ \begin{array}{l}{P_0}^{/}(t)=-\mu\;p\;{\nu}^{\beta}\;{P}_0(t)+\mu\;{P}_1(t)\kern0.5em \mathrm{and}\\ {}{P_k}^{/}(t)=\mu\;p\;{\left(\nu +k-1\right)}^{\beta}\;{P}_{k-1}(t)-\mu\;\left(p\;{\left(\nu +k\right)}^{\beta }+{k}^{\alpha}\right){P}_k(t)+\mu\;{\left(k+1\right)}^{\alpha}\;{P}_{k+1}(t),\kern0.5em k=1,2,\cdots \end{array} $$

Assuming a steady state (i.e. $ {P}_k^{/}(t)=0 $ for all k) we get

$$ \begin{array}{l}{P}_1(t)=p\;{\nu}^{\beta}\;{P}_0(t)\kern0.5em \mathrm{and}\\ {}{P}_{k+1}(t)=\frac{k^{\alpha }}{{\left(k+1\right)}^{\alpha }}\;{P}_k(t)+\frac{{\left(\nu +k\right)}^{\beta }}{{\left(k+1\right)}^{\alpha }}p\;{P}_k(t)-\frac{{\left(\nu +k-1\right)}^{\beta }}{{\left(k+1\right)}^{\alpha }}p\;{P}_{k-1}(t),\kern0.5em k=1,2,\cdots \end{array} $$

Putting k = 1 we get

$$ \begin{array}{c}\hfill {P}_2(t)=\frac{1}{2^{\alpha }}\;{P}_1(t)+\frac{{\left(\nu +1\right)}^{\beta }}{2^{\alpha }}p\;{P}_1(t)-\frac{\nu^{\beta }}{2^{\alpha }}p\;{P}_0(t)\hfill \\ {}\hfill =\frac{1}{2^{\alpha }}\;{\nu}^{\beta}\;p\;{P}_0(t)+\frac{{\left(\nu +1\right)}^{\beta }}{2^{\alpha }}{\nu}^{\beta}\;{p}^2\;{P}_0(t)-\frac{\nu^{\beta }}{2^{\alpha }}p\;{P}_0(t)\hfill \\ {}\hfill =\frac{{\left\{\nu \left(\nu +1\right)\right\}}^{\beta }}{{\left(2!\right)}^{\alpha }}{p}^2\;{P}_0(t)=\frac{{\left\{{\left(\nu \right)}_2\right\}}^{\beta }}{{\left(2!\right)}^{\alpha }}{p}^2\;{P}_0(t)\hfill \end{array} $$

Similarly, for k = 2 we get

$$ \begin{array}{c}\hfill {P}_3(t)=\left(\frac{2^{\alpha }}{3^{\alpha }}+\frac{{\left(\nu +2\right)}^{\beta }}{3^{\alpha }}p\;\right){P}_2(t)-\frac{{\left(\nu +1\right)}^{\beta}\;p}{3^{\alpha }}\;{P}_1(t)\hfill \\ {}\hfill =\left(\frac{2^{\alpha }}{3^{\alpha }}+\frac{{\left(\nu +2\right)}^{\beta }}{3^{\alpha }}p\;\right)\frac{{\left\{\nu \left(\nu +1\right)\right\}}^{\beta }}{{\left(2!\right)}^{\alpha }}{p}^2\;{P}_0(t)-\frac{{\left(\nu +1\right)}^{\beta}\;p}{3^{\alpha }}\;{\nu}^{\beta}\;p\;{P}_0(t)\hfill \\ {}\hfill =\frac{{\left\{\nu \left(\nu +1\right)\left(\nu +2\right)\right\}}^{\beta }}{{\left(3!\right)}^{\alpha }}{p}^3\;{P}_0(t)\hfill \end{array} $$

In general, $ {P}_k(t)=\frac{{\left\{{\left(\nu \right)}_k\right\}}^{\beta }}{{\left(k!\right)}^{\alpha }}{p}^k\;{P}_0(t) $, where $ {P}_0(t)=1/{\displaystyle \sum_{i=0}^{\infty}\left\{\frac{{\left\{{\left(\nu \right)}_i\right\}}^{\beta }}{{\left(i!\right)}^{\alpha }}{p}^i\right\}} $.

Since we have assumed a steady state (i.e. $ {P}_k^{/}(t)=0 $ for all k) P _k(t) can be replaced by P _k.
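The closed form can be checked numerically against the steady-state balance equations, that is, the differential-difference equations above with the derivatives set to zero and the common factor μ divided out. The sketch below evaluates the residual of each balance equation using the unnormalized weights {(ν)_k}^β p^k/(k!)^α; the function names and the parameter values in the usage note are arbitrary assumptions.

```python
import math


def ecomp_weight(k, nu, p, alpha, beta):
    """Unnormalized steady-state probability {(nu)_k}^beta * p^k / (k!)^alpha."""
    pochhammer = math.exp(math.lgamma(nu + k) - math.lgamma(nu))
    return pochhammer ** beta * p ** k / math.factorial(k) ** alpha


def balance_residual(k, nu, p, alpha, beta):
    """Residual of the k-th steady-state balance equation (should be ~0)."""
    def w(j):
        return ecomp_weight(j, nu, p, alpha, beta)

    if k == 0:
        return -p * nu ** beta * w(0) + w(1)
    return (
        p * (nu + k - 1) ** beta * w(k - 1)
        - (p * (nu + k) ** beta + k ** alpha) * w(k)
        + (k + 1) ** alpha * w(k + 1)
    )
```

For instance, with ν = 1.5, p = 0.7, α = 1.3 and β = 0.6, the residuals for k = 0, 1, 2, … are zero up to floating-point rounding.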

B.2 Proof of proposition 2

The probability function resulting from the exponential combination of NB (v,λ) and COM-Poisson (μ, θ) is given by

$$ \begin{array}{l}{\left\{\frac{{\left(\nu \right)}_k}{k!}{\lambda}^k\right\}}^{\beta}\kern0.28em {\left\{\frac{\mu^k}{{\left(k!\right)}^{\theta }}\right\}}^{1-\beta }/{\displaystyle \sum_{i\ge 0}{\left\{\frac{{\left(\nu \right)}_i}{i!}{\lambda}^i\right\}}^{\beta}\kern0.28em {\left\{\frac{\mu^i}{{\left(i!\right)}^{\theta }}\right\}}^{1-\beta }}\kern0.28em \\ {}=\frac{{\left\{{\left(\nu \right)}_k\right\}}^{\beta }{\left\{{\lambda}^{\beta }{\mu}^{1-\beta}\right\}}^k}{{\left(k!\right)}^{\theta \left(1-\beta \right)+\beta }}/{\displaystyle \sum_{i\ge 0}\frac{{\left\{{\left(\nu \right)}_i\right\}}^{\beta }{\left\{{\lambda}^{\beta }{\mu}^{1-\beta}\right\}}^i}{{\left(i!\right)}^{\theta \left(1-\beta \right)+\beta }}}=\frac{{\left\{{\left(\nu \right)}_k\right\}}^{\beta }{p}^k}{{\left(k!\right)}^{\alpha }}/{\displaystyle \sum_{i\ge 0}\frac{{\left\{{\left(\nu \right)}_i\right\}}^{\beta }{p}^i}{{\left(i!\right)}^{\alpha }}},\end{array} $$

substituting $ {\lambda}^{\beta }{\mu}^{1-\beta }=p $ and $ \alpha =\theta \left(1-\beta \right)+\beta $.

This is the pmf of ECOMP (v, p, α, β).
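A quick numerical check of this identity is sketched below: the normalized product of the NB (ν, λ) kernel raised to β and the COM-Poisson (μ, θ) kernel raised to 1 − β is compared with the ECOMP pmf at p = λ^β μ^(1−β) and α = θ(1 − β) + β. Both series are truncated at `n_terms` terms, and the function names and parameter values are assumptions chosen only for illustration.

```python
import math


def _normalize(log_weights):
    """Exponentiate log-weights stably and rescale them to sum to one."""
    shift = max(log_weights)
    w = [math.exp(v - shift) for v in log_weights]
    total = sum(w)
    return [v / total for v in w]


def exp_combination_pmf(nu, lam, mu, theta, beta, n_terms=200):
    """Exponential combination of the NB(nu, lam) and COM-Poisson(mu, theta) kernels."""
    logw = []
    for k in range(n_terms):
        log_nb = (math.lgamma(nu + k) - math.lgamma(nu)
                  - math.lgamma(k + 1) + k * math.log(lam))
        log_comp = k * math.log(mu) - theta * math.lgamma(k + 1)
        logw.append(beta * log_nb + (1.0 - beta) * log_comp)
    return _normalize(logw)


def ecomp_pmf(nu, p, alpha, beta, n_terms=200):
    """Truncated ECOMP pmf with weights {(nu)_k}^beta * p^k / (k!)^alpha."""
    logw = [
        beta * (math.lgamma(nu + k) - math.lgamma(nu))
        + k * math.log(p) - alpha * math.lgamma(k + 1)
        for k in range(n_terms)
    ]
    return _normalize(logw)


nu, lam, mu, theta, beta = 2.0, 0.4, 1.5, 1.2, 0.3
p = lam ** beta * mu ** (1.0 - beta)
alpha = theta * (1.0 - beta) + beta
max_diff = max(
    abs(a - b)
    for a, b in zip(exp_combination_pmf(nu, lam, mu, theta, beta),
                    ecomp_pmf(nu, p, alpha, beta))
)
```

Here `max_diff` should be at the level of floating-point rounding, in line with the algebra of the proof.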

B.3 Proof of proposition 3

For a distribution to be log-concave we must have (see Gupta et al. 1997)

$$ \Delta\;\eta (t)=P\left(t+1\right)/P(t)-P\left(t+2\right)/P\left(t+1\right)>0. $$

For ECOMP (v, p, α, β), $ \Delta\;\eta (t)=p\frac{{\left(\nu +t\right)}^{\beta }{\left(t+2\right)}^{\alpha }-{\left(\nu +t+1\right)}^{\beta }{\left(t+1\right)}^{\alpha }}{{\left(t+1\right)}^{\alpha }{\left(t+2\right)}^{\alpha }} $

$$ \begin{array}{l}\mathrm{Now}\kern0.62em p\frac{{\left(\nu +t\right)}^{\beta }{\left(t+2\right)}^{\alpha }-{\left(\nu +t+1\right)}^{\beta }{\left(t+1\right)}^{\alpha }}{{\left(t+1\right)}^{\alpha }{\left(t+2\right)}^{\alpha }}>0\\ {}\iff {\left(\nu +t\right)}^{\beta }{\left(t+2\right)}^{\alpha }-{\left(\nu +t+1\right)}^{\beta }{\left(t+1\right)}^{\alpha }>0\kern1em \mathrm{since}\kern1em {\left(t+1\right)}^{\alpha }{\left(t+2\right)}^{\alpha }>0,\;p>0.\\ {}\mathrm{But}\ \mathrm{for}\kern1em \nu >1,\kern0.5em \left(t+1\right)/\left(t+2\right)<\left(\nu +t\right)/\left(\nu +t+1\right)\\ {}\Rightarrow {\left\{\left(t+1\right)/\left(t+2\right)\right\}}^{\alpha }<{\left\{\left(\nu +t\right)/\left(\nu +t+1\right)\right\}}^{\alpha}\le {\left\{\left(\nu +t\right)/\left(\nu +t+1\right)\right\}}^{\beta}\kern0.75em \mathrm{since}\kern1em \alpha \ge \beta \\ {}\Rightarrow {\left\{\left(t+1\right)/\left(t+2\right)\right\}}^{\alpha }{\left\{\left(\nu +t+1\right)/\left(\nu +t\right)\right\}}^{\beta }<1\\ {}\Rightarrow 1-{\left\{\left(t+1\right)/\left(t+2\right)\right\}}^{\alpha }{\left\{\left(\nu +t+1\right)/\left(\nu +t\right)\right\}}^{\beta }>0\\ {}\Rightarrow {\left(\nu +t\right)}^{\beta }{\left(t+2\right)}^{\alpha }-{\left(\nu +t+1\right)}^{\beta }{\left(t+1\right)}^{\alpha }>0\end{array} $$
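The sign condition used in this proof is easy to probe numerically. The sketch below evaluates Δη(t) through the probability ratio P(t + 1)/P(t) = p(ν + t)^β/(t + 1)^α for one arbitrary parameter choice with ν > 1 and α ≥ β > 0; the function names and parameter values are illustrative assumptions, and a numerical check of this kind does not replace the proof.

```python
def prob_ratio(t, nu, p, alpha, beta):
    """P(t + 1) / P(t) for ECOMP(nu, p, alpha, beta)."""
    return p * (nu + t) ** beta / (t + 1) ** alpha


def delta_eta(t, nu, p, alpha, beta):
    """Delta eta(t) = P(t+1)/P(t) - P(t+2)/P(t+1); positive under log-concavity."""
    return prob_ratio(t, nu, p, alpha, beta) - prob_ratio(t + 1, nu, p, alpha, beta)


# Arbitrary example with nu > 1 and alpha >= beta > 0.
signs = [delta_eta(t, 2.5, 0.8, 1.5, 1.0) > 0 for t in range(50)]
```

Every entry of `signs` should be `True` for this choice, in agreement with the proposition.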

B.4 Proof of proposition 4

ECOMP (v, p, α, β) has a log-convex probability mass function if Δ η(t) ≤ 0. That is

$$ \begin{array}{l}{\left(\nu +t\right)}^{\beta }{\left(t+2\right)}^{\alpha }<{\left(\nu +t+1\right)}^{\beta }{\left(t+1\right)}^{\alpha}\\ {}\Rightarrow {\left\{\left(t+2\right)/\left(t+1\right)\right\}}^{\alpha }<{\left\{\left(\nu +t+1\right)/\left(\nu +t\right)\right\}}^{\beta}\\ {}\Rightarrow {\left\{1+1/\left(t+1\right)\right\}}^{\alpha }<{\left\{1+1/\left(\nu +t\right)\right\}}^{\beta}\end{array} $$

(16)

Since α ≥ β the inequality in (16) cannot hold for ν > 1.

Now for 0 < ν ≤ 1 the inequality in (16) implies

$$ \alpha /\beta \le \log \left\{1+1/\left(t+\nu \right)\right\}/ \log \left\{1+1/\left(t+1\right)\right\}\ge 1 $$

⇒ α/β ≤ 1, which implies α = β since α ≥ β.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.


About this article

Cite this article

Chakraborty, S., Imoto, T. Extended Conway-Maxwell-Poisson distribution and its properties and applications. J Stat Distrib App 3, 5 (2016). https://doi.org/10.1186/s40488-016-0044-1


Received: 10 October 2015
Accepted: 09 February 2016
Published: 25 February 2016
DOI: https://doi.org/10.1186/s40488-016-0044-1
