A new trivariate model for stochastic episodes

We study the joint distribution of stochastic events described by (X , Y ,N), where N has a 1-inflated (or deflated) geometric distribution and X, Y are the sum and the maximum of N exponential random variables. Models with similar structure have been used in several areas of applications, including actuarial science, finance, and weather and climate, where such events naturally arise. We provide basic properties of this class of multivariate distributions of mixed type, and discuss their applications. Our results include marginal and conditional distributions, joint integral transforms, moments and related parameters, stochastic representations, estimation and testing. An example from finance illustrates the modeling potential of this new model.


Introduction
This paper introduces a new model for stochastic events such as growth/decline periods of a financial return, flood, drought, or a heat wave, among others. The model describes the duration N, the magnitude X and the peak value Y of such events through the random structure (X, Y , N) where the {X i } i≥1 are independent and identically distributed (IID) exponential random variables given by the probability density function (PDF) N is an integer-valued random variable, independent of the {X i }, and N i=1 X i denotes the maximum of {X i } i=1,...,N . Events like this arise in many applications. A common process generating such observations is Peaks-over-Threshold process where we are interested in observations exceeding (or below) a threshold. For example, in finance, time Zuniga et al. Journal of Statistical Distributions and Applications (2021) 8:2 Page 2 of 21 periods with positive/negative log-returns give growth/decline periods for an asset. In climate/hydrology, a flood may be described as stage of a stream exceeding a levy, heat wave may be described as consecutive days when the maximum daily temperature exceeds a high threshold, deluge can be thought of as consecutive observations (daily, hourly, etc.) of precipitation exceeding a high threshold. In energy research, heating degree days are those days when the maximum daily temperature is below 64 degrees Fahrenheit (the threshold temperature varies by country and region). Drought may be considered to be a time period when the maximum precipitation (daily, annual, etc.) is below a seasonal low for a given region. Various bivariate and trivariate models for stochastic episodes/events with the structure of (1) were developed and applied in different fields in Arendarczyk et al. (2018a, b), Barreto-Souza (2012), Barreto-Souza and Silva (2019), Biondi et al. (2002Biondi et al. ( , 2005Biondi et al. ( , 2008, Panorska (2005, 2008), and Kozubowski et al. (2008aKozubowski et al. ( , b, 2010Kozubowski et al. ( , 2011. Most of the existing models assume that the underlying observations {X i } are IID exponential or (dependent) Pareto variables, thus cover the cases of light and heavy tailed processes. In the aforementioned work the duration N was modeled with geometric distribution.
Our interest in extending the models developed earlier was motivated by the observation that the duration of (exceedingly) many events in processes such as financial asset returns is one time period. While geometric distribution allows for values of one, in many processes the events of duration 1 (above/below threshold) are either more or less common than in the geometric model. For example, heat waves are often a "hot day" (so 1-inflated), financial returns often switch daily from positive to negative (1 inflated), large precipitation is often lasting one day (1 -inflated). Summarizing, while the geometric durations work well for some applications, they do not work well for every applications. Thus, we introduced the generalization allowing 1-inflation or 1-deflation.
Thus, we developed an extension of the models with geometric duration, to allow more flexibility when accounting for the frequency of data with the duration of one. Models for count process, such as duration, are typically discrete, positive or non-negative, integer valued random variables such as geometric, negative binomial or Poisson. There are many works in the literature (medical, ecology, social science, actuarial science, etc.) describing counts data with a very large number of zeros. The models used to account for the "excess" zeros fall to two general types: zero inflated (ZI) or hurdle (H) models. We discuss and provide examples of applications for these models in Section 2. We also provide a representation for the ZI and H models via waiting times for the first success in independent Bernoulli trials with different probabilities of success. This paper is organized as follows. Section 2 is devoted to the discussion of ZI nad H models and introduces our model for 1-inflated geometric distribution. Section 3 introduces our trivariate model and presents its basic properties. Section 4 provides information about marginal and conditional distributions for the trivariate vector. Section 5 is devoted to estimation and testing connected with the new model. An illustrative data example is given in Section 6. Selective proofs and auxiliary results are collected in the Supplementary Material.

Mixture models for duration
In this section we briefly discuss zero inflated and hurdle models and introduce our (shifted hurdle) model for duration. As noted in the introduction, the two common ways Zuniga et al. Journal of Statistical Distributions and Applications (2021) 8:2 Page 3 of 21 of dealing with extensive zeros in the literature are zero -inflated (ZI) (or zero-adjusted, zero-altered) and hurdle (H) models (see, e.g., Trivedi 1998, 2005;Lambert 1992;Mullahy 1986;1997;Panicha 2018;Zuur et al. 2009;Alshkaki 2016; and references therein). These two approaches to account for large number of zeros involve mixture distributions with two components, but they differ in the way that zeros can occur. The models are mixtures of a point mass at zero and a counting distribution. In the ZI models, zero can occur as an outcome of the point mass variable or the counting variable. On the other hand, in H models zero can only occur as an outcome of the point mass while the counting variable is truncated at zero. Examples of zero-inflated or hurdle models used in the literature include applications in econometrics (see, e.g., Trivedi 1998, 2005;Zeileis et al. 2008), ecology (Panicha 2018;Zuur et al. 2009), public health, epidemiology and bioinformatics (Hu et al. 2011;Zelterman 2004;Chipeta et al. 2014). There are also interesting applications in social science, criminology and actuarial science (Aryal 2011;Constantinescu et al. 2019;Iwunor 1995;Pandey and Tiwari 2011;Sharma and Landge 2013;Tüzen and Erbaş 2018). In ecology, the use of ZI nad H models is connected with estimation population sizes using various capture-recapture type methods. In public health and epidemiology, these models are used to estimate the number of sick with a given disease, in bioinformatics ZI and H models serve for estimation of the size of the population of drug users, and in criminology and social science to estimate the size of rural-urban migration, the size of homeless populations or violators of a certain law, or the number of highway crashes (see, e.g., Famoye and Singh 2006;Iwunor 1995;Pandey and Tiwari 2011;Sharma and Landge 2013). In actuarial science, zero-inflated discrete and dependent by mixture Pareto distribution was used in modeling probability of ruin in the compound binomial risk model in Constantinescu et al. (2019).
We now turn to the definitions of the ZI and H models. We start with the notation used in the rest of this paper: the set of non-negative integers (including zero) shall be denoted by N 0 , while N shall stand for the set of natural numbers (excluding zero).

The zero-inflated model
The ZI model is a mixture of point mass at zero and a counting random variable N. In practice, the latter is often chosen to follow a standard discrete distribution such as Poisson, geometric or negative binomial (see, e.g., Mullahy 1986;Lambert 1992). It is important to note that in the ZI model, the zeros may come from two different sources: the point mass or the count variable. The probability mass function (PMF) f ZI of a zeroinflated random variable N ZI , derived from a "base" discrete random variable N with the PMF f , is of the form where I A is the indicator function of the set A.

Remark 1
The corresponding mixture representation, connecting the relevant random variables, is as follows: where J is a Bernoulli random variable with parameter 1 − q, independent of N. Zuniga et al. Journal of Statistical Distributions and Applications (2021)

The hurdle model
The hurdle model is also a mixture, where the components are a point mass at zero and a counting "base" random variable N with the PMF f. However, the base random variable is truncated below at zero before mixing. Due to the truncation, the PMF f of N is converted to f T , where the latter is the PMF of the conditional distribution of N given that N ≥ 1, , n ∈ N.
Mixing this distribution with a point mass at zero leads to the hurdle distribution based on N, with the PMF of the form ( 3 ) Remark 2  Both the IZ and H models have appeared in the literature. The practical convenience of the hurdle model comes from the ease of estimation procedures compared with those for the IZ model. Since in this work the count random variable N represents the duration of an event, our N is always greater than or equal to one. Further, our data may show an unusual frequency of ones (not zeros). Thus, we use a hurdle-type model (shifted up by one) for the duration N. We discuss it in more details below.

A hurdle-type geometric distribution
We start with the definition of geometric random variable we use in this work: a random variable with the PMF will be referred to as a geometric random variable with parameter (probability of success) p, and denoted by N p ∼ GEO(p). Note that this variable "starts" at one, and accounts for the number of trials until the first success in a series of IID Bernoulli trials with parameter p. Using this variable, we can define a hurdle-type model with the PMF of the form Next we define our counting variable N that would represent the duration in the trivariate model (1). Namely, this will be the distribution given by (5) shifted up by one, with the PMF of the form We shall denote this distribution by HGEO(p, q), which stands for hurdle -geometric distribution, and write N ∼ HGEO(p, q) when the random variable N follows this distribution. Note, that depending on the relation between p and q, the HGEO(p, q) model may over-inflate the number of ones (p < q) or under-inflate the number of ones (p > q) compared to geometric distribution with probability of success p. The following result provides a useful stochastic representation of this distribution.

Proposition 1 If N ∼ HGEO(p, q) then
where N p is geometric with the PMF (4) and I is Bernoulli with parameter 1 − q, independent of N p .
It is easy to see that when p = q then the HGEO model (7) and its PMF (6) reduce to geometric distribution with parameter p, and the PMF given by (4). It is interesting to compare the hurdle-type HGEO model above with one analogous to zero-inflation, and also built upon the geometric distribution. The PMF of the latter will be of the form Remark 3 Both of the models introduced above have interpretations as waiting times for the first success in a sequence of independent Bernoulli trials {I j }. Namely, if the probabilities of success are given by P(I 1 = 1) = q and P(I j = 1) = p for j ≥ 2 then the number of trials till the first success will have the HGEO distribution given by the PMF (6). On the other hand, if the probabilities of success are the same as above for n ≥ 2 while for n = 1 we have P(I 1 = 1) = q + (1 − q)p, then the corresponding waiting time will have a distribution with the PMF (8). Because of this, it is clear that the first model is more flexible than the second: for the hurdle type model, we have P(I 1 = 0) = 1 − q, which can fall anywhere in the unit interval, while an analogous probability for the second model is equal to (1 − p)(1 − q), which does not cover the entire unit interval as q changes in (0, 1).

Definition and basic properties of the new trivariate model
In this section we formally define the new distribution of (1) and derive its basic properties. Here, and elsewhere in the paper, the notation EX P(β) stands for the exponential distribution with the PDF (2). (1), where the {X i } are IID exponential random variables with the PDF (2) and N ∼ HGEO(p, q) with the PMF (6), independent of the {X i }, is said to have a generalized TETLG (GT) distribution, denoted by GT (p, q, β).

Definition 1 The random vector (X, Y , N) with the stochastic representation given in
We note that when p = q, then the variable N has a geometric distribution with parameter p, and the random vector (X, Y , N) above has the TETLG distribution studied in Kozubowski et al. (2011), where the name stands for Trivariate distribution with Exponential, Truncated Logistic and Geometric marginal distributions. Our construction provides a flexible generalization of the TETLG model that accounts for the excess of ones in the data.
We now derive basic characteristics of the GT model, starting with its PDF. For this, we use the bivariate distribution of which is the conditional distribution of (X, Y ) given N = n in the GT model. This distribution, referred to as the BGGE model in Qeadan et al. (2012), has the PDF of the form where, for all n ∈ N, The values of the joint PDF in (9) depend on which of the sectors S k the input (x, y) belongs to, where The support of this distribution is the set The joint distribution of (X, Y , N) ∼ GT (p, q, β) can now be derived via a standard hierarchical approach, where N ∼ HGEO(p, q) with the PMF f given by (6) and (X, Y )|N = n has the PDF f (x, y|n) given by (9), so that This leads to the following result.
where the function H is defined in (10).

Remark 4
We note that the support of the GT distribution is the same as that of its special case of TETLG distribution, which consists of the set y, n). In other words, with probability q, the distribution is concentrated on the set A 1 × {1}, and represents the random vector (E 0 , E 0 , 1), where E 0 ∼ EX P(β). However, when n ≥ 2, the conditional distribution of (X, Y ) given N = n is absolutely continuous with the PDF f (x, y|n) given in (9), in which case the joint PDF in (11) becomes showing that the GT distribution is a mixture of a degenerate distribution of (E 0 , E 0 , 1) and a proper, trivariate distribution (with the PDF f 2 ), where the mixing probabilities correspond to the events N = 1 and N ≥ 2, respectively. This absolutely continuous component is the same as its analogue in the TETLG model.
The mixture representation (12) of the GT distribution can also be seen from the stochastic representation of this model, stated below.
where N ∼ HGEO(p, q) and the {E i } are independent EX P(β) variables, independent of N.
Next, we provide an alternative stochastic representation, involving a geometric variable N p rather than the mixed geometric variable N, which is the part of the random vector (X, Y , N). Both of these representations are useful for deriving further properties of the GT distribution.
where all the variables on the right-hand-side of (14) are mutually independent, N p ∼ GEO(p), I is a Bernoulli random variable with parameter (1 − q), and the {E i } are independent EX P(β) random variables.

Remark 5
We note that in the special case p = q, both of the above stochastic representations result in the TETLG distribution studied in Kozubowski et. al. (2011), describing the random vector where N p and the {E i } are as above. Further, it can be shown that in general the random vector (X, Y , N) ∼ GT (p, q, β) can be directly related to a TETLG random vector (X,Ỹ ,Ñ) viz. another stochastic representation, which involves the operation ⊕ j defined below. This operation acts component-wise on two vectors, x = (x 1 , . . . , x n ) and y = (y 1 , . . . , y n ), and returns another vector, denoted by x⊕ j y. The latter is obtained by adding the corresponding coordinates of x and y, with the exception of the j-th coordinate, where we take x j ∨ y j (the maximum of x j and y j ). Thus, we have x⊕ j y = (x 1 +y 1 , . . . , x j−1 +y j−1 , x j ∨y j , x j+1 +y j+1 , . . . , x n +y n ), j ∈ {1, 2, . . . , n}, x, y ∈ R n .
(15) where E 0 and I are as above and the operation ⊕ 2 acts as the maximum of the second coordinates and the sum of the first and the third coordinates, so that (x 1 , x 2 , x 3 ) ⊕ 2 (y 1 , y 2 , y 3 ) = (x 1 + y 1 , x 2 ∨ y 2 , x 3 + y 3 ).

With this notation we have
Using the above representations, we obtain the characteristic function (ChF) of the GT model, presented below.

Proposition 5
The characteristic function of (X, Y , N) ∼ GT (p, q, β) is given by When p = q, Eq. (17) yields the ChF of the TETLG model of Kozubowski et al. (2011). The joint moments of the GT model are presented below.
..,k n l 1 ,...,l n k k 1 · · · k n l l 1 · · · l n n j=1 where the double summation is taken over all sets of non-negative integers k 1 , . . . , k n and l 1 , . . . , l n that add up to k and l, respectively.
One can use the above result to obtain the mean vector and the covariance matrix of the GT distribution, which are presented below.
and the elements of the covariance matrix =[ σ i,j ] are as follows: where the constant c p in the expression for σ 2,2 is given by

Marginal and conditional distributions
In this section we present the marginal and (selected) conditional distributions of the new trivariate GT distribution.

Bivariate margins
Here we discuss the three bivariate marginal distributions of (X, N), (Y , N), and (X, Y ), starting with the joint distribution of (X, N).

The marginal distribution of X, N
In view of the stochastic representation in (1), the joint PDF of (X, N) can be derived through a standard conditioning argument using the fact that N ∼ HGEO(p, q) and, given N = n, the variable X is the sum of n IID exponential variables with parameter β > 0. Thus, X|N = n has a gamma distribution GAM(n, β), with the PDF given by: We obtain the joint PDF of (X, N) by multiplying the conditional PDF (20) by the marginal PMF of N given by (6), leading to the following result.
Remark 6 Clearly, in the special case p = q we obtain the bivariate BEG distribution (see Kozubowski and Panorska 2005), describing the random vector

where N p ∼ GEO(p) and the {E i } are IID EX P(β). Further, it can be seen from Proposition 4 or the relation (16) that in general the random vector (X, N) with the PDF (21) can be related to a BEG random vector (X,Ñ) viz.
where E 0 ∼ EX P(β), I is Bernoulli with parameter 1 − q, and all the variables on the right-hand-side of (22) are mutually independent.

The marginal distribution of Y, N
We now turn to the joint distribution of (Y , N). Here, given N = n, the variable Y is the maximum of n IID exponential random variables. Thus, it has a generalized exponential distribution with the PDF f Y |N=n (y) = nβe −βy (1 − e −βy ) n−1 , y ∈ R + .
By proceeding as above, we obtain the joint PDF of (Y , N) as follows. Proposition 9 If (X, Y , N) ∼ GT (p, q, β) then the joint PDF of (Y , N) is given by f Y ,N (y, n) = qβe −βy for n = 1, nβe −βy (1 − e −βy ) n−1 (1 − q)p(1 − p) n−2 for n ≥ 2. (24) Remark 7 We note that in the special case p = q we obtain the bivariate BTLG distribution (see Kozubowski and Panorska 2008), describing the random vector where N p and the {E i } are as above. Further, it can be seen from the relation (16) that in general the random vector (Y , N) with the PDF (24) can be related to a BTLG random vector (Ỹ ,Ñ) viz.
where E 0 and I are as above and the operation ⊕ j is given by (15), so the ⊕ 1 above acts as the maximum of the first coordinates and the sum of the second coordinates,

The marginal distribution of X, Y
Finally, we turn to the last of the three bivariate distributions, the distribution of (X, Y ). Its PDF can be obtained in a standard way by adding up the trivariate PDF of the GT model (given in Theorem 2) across all the values of n ∈ N. The support of this new bivariate distribution is the set Lengthy algebra produces the result below, which can be proven in the same way as Theorem 3.1 in Kozubowski et al. (2011).
where g 1 (x, y) = βe −βx I A 1 (x, y) and Remark 8 The result shows that the distribution of (X, Y ) is a mixture of a degenerate distribution of the vector (E 0 , E 0 ) with E 0 ∼ EX P(β), which is a singular part of the distribution (corresponding to the event N = 1, which occurs with probability q) and an absolutely continuous component with the PDF g 2 given by (27), supported on the set A (corresponding to the event N ≥ 2, which occurs with probability 1 − q). Similar interpretation applies to the special case of the BETL distribution discussed in Kozubowski et al. (), obtained here when p = q. In that case the above proposition yields the PDF of Zuniga et al. Journal of Statistical Distributions and Applications (2021) where N p and the {E i } are as above. Further, it can be seen from the relation (16) that in general the random vector (X, Y ) with the PDF (26) can be related to a BETL random vector (X,Ỹ ) viz.
where E 0 and I are as above and the operation ⊕ j is given by (15), so that

Univariate margins
We now discuss the univariate margins. Since N has a hurdle-type generalized geometric distribution given by (6), we shall focus on the marginal distributions of X and Y, starting with X.

The marginal distribution of X
The PDF of X can be calculated in a straightforward way by summing up the joint PDF of X and N given by (21) across all the values of n ∈ N, leading to the result below.
with the corresponding CDF of the form

Remark 9
The results shows that X is a generalized mixture of two exponential distributions, one with parameter β and another with parameter pβ. The term "generalized" signifies the fact that the two weights in (30), although add up to one, are not necessarily restricted to the unit interval. It is worth noting that the distribution of X is also a proper mixture of two distributions, of which one is again exponential with parameter β while the other has a hypoexponential distribution, also known as generalized Erlang distribution (see, e.g., Johnson et al. 1994), given by the PDF The above is the PDF of X 1 + X 2 , where X 1 is exponential with parameter pβ and X 2 is exponential with parameter β, independent of X 1 (the term "hypoexponential" describes convolutions of exponential variables with different parameters). While the above hypoexponential variable is also a generalized mixture of exponential distributions, the distribution of X is a proper mixture as its PDF can be expressed as with g(·) given by (31). This representation can be obtained directly from Proposition 4, which shows that X is either equal to X 1 (representing the exponential part E 0 , with probability q) or X 1 + X 2 , where X 2 is exponential with parameter pβ. This X 2 corresponds to Zuniga et al. Journal of Statistical Distributions and Applications (2021)  the first sum on the right-hand-side of the representation (14), and the hypoexponential component X 1 + X 2 occurs with probability 1 − q.

Remark 10
The above mixture representations of X provide a convenient tool in deriving basic characteristics of the distribution of X. For example, the moment generating function (MGF) of X is of the form while the moments of X are given by

The marginal distribution of Y
The PDF of Y shown in the result below can be calculated as that of X, by summing up the joint PDF of Y and N given by (24) across all the values of n ∈ N.

Remark 11
The structure of the above CDF reveals that the distribution of Y is also a two component mixture. Indeed, Y can be thought of as either an exponential variable E 0 with parameter β (with probability q) or the maximum of two independent variables, E 0 ∨Ỹ , whereỸ with N p and {E i } as above. The variableỸ has a truncated logistic distribution on R + , with the CDF , y ∈ R + , studied by Marshall and Olkin (1997). This can also be seen from the representation (29),

Conditional distributions
Here we summarize basic facts concerning bivariate and univariate conditional distributions connected with the GT distribution. Since the results below are established by routine derivations involving ratios of the relevant PDFs, their elementary proofs are omited.

Bivariate conditional distributions
Here we consider the three bivariate conditional distributions of (X, Y )|N = n, (X, N)|Y = y, and (Y , N)|X = x.

The distribution of X and Y given N = n
The conditional distribution of (X, Y ) given N = n ∈ N was studied in Qeadan et al. (2012), and is known as the BGGE(β, n) model. Its PDF, given by (9) -(10), provided the basis for our derivation of the GT PDF. In particular, the conditional distribution of X given N = n is Gamma with the PDF given in (20) while the conditional distribution of Y given N = n is generalized exponential (see, e.g., Gupta and Kundu 2007) with the PDF given in (23).

The distribution of X and N given Y = y
Next, we consider the conditional PDF of (X, N) given Y = y > 0, which turns out to be of the form where the function H(x, y, n) is given by (10) and We note that when n = 1, this conditional PDF is non-zero only if x = y, in which case it takes on the value of q/v(y). We also note that when n ≥ 2, the function H(x, y, n) will be non-zero only if x satisfies ky < x ≤ (k + 1)y, k = 1, . . . , n − 1, in which case

The distribution of Y and N given X = x
Finally, we have the following expression for the PDF of (Y , N) given X = x > 0: where the function H(x, y, n) is given by (10) and Again, when n = 1 the PDF above is non-zero only if y = x, in which case it takes on the value of q(1 − p)/u(x), and when n ≥ 2 the function H(x, y, n) will be non-zero only if x satisfies x/(k + 1) ≤ y < x/k, k = 1, . . . , n − 1, in which case we have (32).

Univariate conditional distributions
It turns out that all three univariate conditional distributions of X, Y, and N given the other two variables are the same as their counterparts in the special TETLG case (p = q) studied by Kozubowski et al. (2011). We present their formulas below for the convenience of the reader.

The distribution of X given Y = y, n = n
The PDF of the conditional distribution of X given Y = y > 0, N = n ∈ N is given by H(x, y, n). Similarly to the cases discussed above, when n = 1 the PDF is non-zero only if y = x, in which case it takes on the value of 1. In turn, when n ≥ 2 the function H(x, y, n) will be non-zero only if x satisfies ky < x ≤ (1 + k)y, k = 1, . . . , n − 1, in which case we have (32).

The distribution of Y given X = x, n = n
The conditional PDF of Y given X = x > 0, N = n ∈ N is of the form H(x, y, n). Again, for n = 1 we have f (y|x, n) = 1 if y = x (and zero otherwise), while for n ≥ 2 the function H(x, y, n) will be non-zero only if y satisfies x/(k +1) ≤ y < x/k, k = 1, . . . , n−1, in which case we have (32). We also note that this particular distribution is parameter-free.

The distribution of N given X = x, y = y
As in the TETLG case, the conditional distribution of N given X = x > 0 and Y = y > 0 reduces to a point mass at 1 when x = y. On the other hand, for (x, y) ∈ S k , k ∈ N, the PMF of this distribution is of the form where W s is defined in (28).

Estimation and testing
In this section we consider the problems of estimating the parameters of the GT model and testing the hypothesis that p = q (so that the GT model reduces to the TETLG model) based on a random sample (X 1 , Y 1 , N 1 ), . . . , (X k , Y k , N k ) from the GT (p, q, β) distribution.

Maximum likelihood estimation
We start with the Fisher information matrix I(p, q, β) corresponding to the distribution of (X, Y , N) ∼ GT (p, q, β). Routine calculations lead to Next, we turn to the parameter estimation viz. maximum likelihood. While the PDF of the GT model is rather complicated, fortunately the function H(x, y, n) is parameterfree and the derivation of the maximum likelihood estimators (MLEs) is straightforward. Indeed, the likelihood function can be written as where C is parameter-free, k 1 is the number of data points with N i = 1, and X k , N k are the sample means of the {X i } and the {N i }, respectively. Thus, the statistics k 1 /k, X k and N k are jointly sufficient. We note that these statistics do not involve the values of {Y i } i=1,...,k . This is due to the fact that the conditional distribution of Y given X and N is parameter free (see Section 4.3.7). Thus the values of N and X carry all the information necessary to estimate all the parameters. We also note that the likelihood function can be maximized with respect to each parameter separately from the other parameters, leading to explicit MLEs provided below.  k  510  160  60  25  15  12  10  20  35  60  85  240 1,000 8,000 27,000 The top row lists a selection of true values of the parameter p. The bottom row lists the corresponding minimal samples sizes for which the χ 2 1 to be a reasonable approximation to the distribution of 2 log study. Noting that the speed of convergence may depend on the true value of p, we simulated 10,000 samples of (varying) size k from GEO(p) distribution for selected values of p. We then found the smallest k for which the (empirical) distribution of 2 log can be assumed to be χ 2 1 . We used Kolmogorov-Smirnov goodness-of-fit test with significance level of 0.05 to assess whether the distribution of 2 log can be reasonably considered to be χ 2 1 . We summarized the results of this simulation study in Table 1. The simulation shows that when the true value of p is between 0.1 and 0.9, the sample size needed for the limiting distribution to be a good approximation for the distribution of 2 log is below 100. However, once the value of p becomes closer to 0 or 1, the sample sizes required for a reasonable approximation are growing. In particular, note that the sample size required for large values of p (close to 1) are much larger than those for the small values of p (close to 0).

An illustrative data example
In this section, we illustrate potential applications of the new GT model using S&P500 index return data.
The data was downloaded from 'Yahoo! Finance' historical data archive. The initial data were the daily closing prices for the S&P500 index, covering the period from Dec 30, 1927 to April 17, 2020. These were converted to (n = 23,183) daily log-returns, i.e. natural logarithms of the ratios of the closing prices for two consecutive days. Finally, we converted these data to the growth periods, where the daily closing prices increase from one day to the next one, so that the log-returns stay positive (5,540 growth periods). In this case, the N i are the durations (in days) of the growth periods, while the X i and the Y i are the magnitude and the maximum daily return for the i-th growth period. We call this data set S&P500.
We estimated the parameters p, q, and β of the underlying GT (p, q, β) model viz. maximum likelihood, using the results given in Proposition 13. The resulting estimates are shown in Table 2, along with estimated margins of error (ME) of the 95% (asymptotic) confidence intervals described in the remark following Proposition 14.
Note, that in this case, p < q and so we have an example of under-inflated ones data. This may be reflection of the fact that S&P500 is a composite index, which is more stable, that is less prone to changes from growth to decline, than an individual stock return. Further, we tested the hypotheses H 0 : p = q versus H 1 : p = q on significance level 0.05 using the likelihood ratio test described in Section 5, and obtained the test statistic Table 2 Maximum likelihood estimates of the parameters for the S&P500 data with the corresponding (approximate) 95% margins of error in parentheseŝ p (ME)q (ME)β (ME)