Chi-p distribution: characterization of the goodness of the fitting using L p norms

Livadiotis, George

doi:10.1186/2195-5832-1-4

Research
Open access
Published: 11 June 2014

Chi-p distribution: characterization of the goodness of the fitting using L^p norms

George Livadiotis¹

Journal of Statistical Distributions and Applications volume 1, Article number: 4 (2014) Cite this article

3266 Accesses
19 Citations
Metrics details

Abstract

This paper derives (1) the Chi-p distribution, i.e., the analog of Chi-square distribution but for datasets that follow the General Gaussian distribution of shape p, and (2) develops the statistical test for characterizing the goodness of the fitting with L^p norms. It is shown that the statistical test has double role when the fitting method is induced by the L^p norms: For given the shape parameter p, the test is rated based on the estimated p-value. Then, a convenient characterization of the fitting rate is developed. In addition, for an unknown shape parameter and if the fitting is expected to be good, then those L^p norms that correspond to unlikely p-values are rejected with a preference to the norms that maximized the p-value. The statistical test methodology is followed by an illuminating application.

1. Introduction

The fitting of a given dataset ${\{f_{i} \pm σ_{f_{i}}\}}_{i = 1}^{N}$ to the values ${\{V_{i}\}}_{i = 1}^{N}$ of a statistical model V(X; α) in the domain $X \in D_{x} \subseteq ℜ$ (McCullagh2002; Adèr2008), involves finding the optimal parameter value α = α* in $α \in D_{α} \subseteq ℜ$ that minimizes the total square deviations (TSD) between model and data,

TSD {(α)}^{2} = \sum_{i = 1}^{N} σ_{f_{i}}^{- 2} {[f_{i} - V (x_{i}; α)]}^{2},

(1)

where the inverse of the variance of the data measurements ${\{w_{i} = σ_{f_{i}}^{- 2}\}}_{i = 1}^{N}$ is weighting the summation. The deviations may be also defined using the total absolute deviations (TAD),

TAD (α) = \sum_{i = 1}^{N} σ_{f_{i}}^{- 2} |f_{i} - V (x_{i}; α)| .

(2)

A class of generalized fitting methods has been considered by Livadiotis (2007), using the metric induced by the p-norms L^p, p ≥ 1, that denotes a complete normalized vector space with finite Lebesgue integral. The total deviations (TD) are now defined by

TD {(α)}^{p} = \sum_{i = 1}^{N} σ_{f_{i}}^{- p} {|f_{i} - V (x_{i}; α)|}^{p} .

(3)

The least square method based on the Euclidean norm, p = 2, and the least absolute deviations method based on the “Taxicab” norm, p = 1, are some cases of the general fitting methods based on the L^p-norms (see Burden and Faires1993; for more applications of the fitting methods based on L^p, see: Sengupta1984; Livadiotis and Moussas2007; Livadiotis2008;2012; for fitting methods based on other effect sizes e.g., correlation, see: Livadiotis and McComas2013a).

The goodness of the least square fitting is typically measured using the estimated Chi-square value, that is the least squared value, $χ_{est}^{2} = TSD {(α^{*})}^{2}$ . Then, this $χ_{est}^{2}$ is compared with the Chi-square distribution, to examine whether such a value is frequent or not (see next sections). However, this test can apply only to datasets ${\{f_{i} \pm σ_{f_{i}}\}}_{i = 1}^{N}$ that follow the normal distribution $f_{i} ~ N (μ_{f_{i}}, σ_{f_{i}})$ . There is no similar test for cases where the dataset follows the General Gaussian distribution of shape p, $f_{i} ~ GG (μ_{f_{i}}, σ_{f_{i}}, p)$ (see Section 2 and Appendix A). Livadiotis (2012) showed the connection between the fitting with L^p norms, as in Eq. (3), and datasets that follow the General Gaussian distributions, $f_{i} ~ GG (μ_{f_{i}}, σ_{f_{i}}, p)$ .

The purpose of this paper is to (1) construct the formulation of the Chi-p distribution, the analog of Chi-square distribution but for datasets that follow the General Gaussian distribution of shape p, and (2) develop the statistical test for characterizing the goodness of the fitting with L^p norms, which corresponds to datasets that follow the General Gaussian distribution of shape p. Therefore, in Section 2, we revisit the Chi-square derivation, and following similar steps, we construct the Chi-p distribution. In Section 3, we develop the statistical test for characterizing the goodness of the fitting with L^p norms, using the Chi-p distribution and the p-value. In Section 4, we provide an application of the statistical test. Finally, in Section 5, we summarize the conclusions. Appendix A briefly describes the General Gaussian distribution, while Appendix B shows the mathematical derivation of the surface of the sphere of higher dimensions in L^p space.

2. Chi-p distribution

We first revisit the derivation of Chi-square distribution. This distribution is necessary to test the goodness of fitting of measurements that follow the Gaussian distribution. This test applies to datasets ${\{x_{i} \pm σ_{x_{i}}\}}_{i = 1}^{N}$ that follow the normal distribution $x_{i} ~ N (μ_{x_{i}}, {σ_{x}}_{i})$ . The Chi-square is given by

χ^{2} = \sum_{i = 1}^{N} {(\frac{x_{i} - μ_{x_{i}}}{σ_{x_{i}}})}^{2},

(4)

that is the sum of squares of N independent random variables. The distribution of this sum is given by

P (X; N) dX = \frac{2^{- \frac{N}{2}}}{Γ (\frac{N}{2})} e^{- \frac{1}{2} X} X^{\frac{N}{2} - 1} dX, with X \equiv χ^{2} .

(5)

The estimated value of the Chi-square for a fitting is given by the minimum at α = α* of the function χ²(α) = TSD(α)², as shown in Eq. (1) (least squares). Considering that the Chi-square minimum, χ²(α*), is equivalently referred to all the M = N-1 degrees of freedom (for N number of data), then each of them contributes to this minimum by a factor of $\frac{1}{M} χ^{2} (α^{*})$ . This is the estimated value of the reduced Chi-square. For multi-parametrical fitting (Livadiotis2007) of n free parameters, the degrees of freedom are M = N-n. In general, the Chi-square distribution in Eq. (5) is referred to M degrees of freedom.

For testing the goodness of fitting of measurements ${\{x_{i} \pm σ_{xi}\}}_{i = 1}^{N}$ that follow the General Gaussian distribution of shape p, x_i ~ GG(μ_xi, σ_xi, p), we need to construct the Chi-p distribution connected with L^p fitting methods, where the minimization of χ^p(α) is given by Eq. (3). The General Gaussian distribution of shape p, $f_{i} ~ GG (μ_{f_{i}}, σ_{f_{i}}, p)$ (Appendix A). This distribution is parameterized by the mean μ, the variance σ, and the shape parameter p,

P (x; μ, σ, p) dx = C_{p} \cdot e^{- η_{p} \cdot {|\frac{x - μ}{σ}|}^{p}} d (\frac{x - μ}{σ}),

(6)

where the involved coefficients are

C_{p} = \sqrt{\frac{p sin (\frac{π}{p})}{4 π (p - 1)}}, η_{p} = {[\frac{sin (\frac{π}{p}) Γ {(\frac{1}{p})}^{2}}{π p (p - 1)}]}^{\frac{p}{2}} .

(7)

Figure 1 depicts the distribution $P (z = \frac{x - μ}{σ}; p) \equiv P (x; μ, σ, p)$ for various shape parameters p. Note that the normalized coefficient C_p is derived by setting $\int_{- \infty}^{\infty} P (x; μ, σ, p) dx = 1$ , while the exponential coefficient η_p is derived so that the L^p-normed variance to equal σ². The theory of L^p-normed mean and variance was developed by Livadiotis (2012), which for the case of the General Gaussian distribution (6) leads to the following Propositions:

Proposition 1: The L ^p-normed mean of the distribution (6) is < x > _p = μ, ∀ p ≥ 1.
Proposition 2: The L ^p-normed variance of the distribution (6) is $σ_{p}^{2} = σ^{2}$ , ∀ p ≥ 1.

The proofs of the two Propositions are shown in Appendix A.

We continue with the development of the Chi-p distribution. We start with the following Lemma:

Lemma 1: The surface of the N-dimensional sphere of unit radius in L ^p space is given by
$Β_{p, N} = p {[(\frac{2}{p}) Γ (\frac{1}{p})]}^{N} / Γ (\frac{N}{p}) .$
(8)

The proof is shown in Appendix B.

Theorem 1:

The Chi-p is given by the sum of absolute values to the exponent p of N independent random variables,

χ^{p} = \sum_{i = 1}^{N} {|\frac{x_{i} - μ_{x_{i}}}{σ_{x_{i}}}|}^{p} .

(9)

For M degrees of freedom (M = N-n, N number of data, n number of independent variables), the Chi-p distribution is given by

P (X; M; p) = \frac{{η_{p}}^{\frac{M}{p}}}{Γ (\frac{M}{p})} e^{- η_{p} X} X^{\frac{M}{p} - 1},

(10)

where the estimated Chi-p value X is given by the minimum at α = α* of the function χ^p(α) = TD(α)^p, as shown in Eq. (3) (least L^p deviations). Figure 2 plots the Chi-p distribution for various values of the shape parameter p (that correspond to various L^p norms).

Proof of Theorem 1. The distribution of Chi-p can be derived as follows. The normalization of the joint distribution function of all the data is
$1 = \int_{- \infty}^{+ \infty} \prod_{i = 1}^{N} \frac{C_{p}}{σ_{x_{i}}} e^{- η_{p} {|\frac{x_{i} - μ_{x_{i}}}{σ_{x_{i}}}|}^{p}} d x_{1} \dots d x_{N},$
(11)

where the coefficients (Livadiotis2012) are given by Eq. (7).

By setting $z_{i} \equiv \frac{x_{i} - μ_{x}}{σ_{x_{i}}}$ , we derive

1 = \int_{- \infty}^{+ \infty} \prod_{i = 1}^{N} C_{p} e^{- η_{p} {|z_{i}|}^{p}} d z_{1} \dots d z_{N} = \int_{- \infty}^{+ \infty} C_{p}^{N} e^{- η_{p} \sum_{i = 1}^{N} {|z_{i}|}^{p}} d z_{1} \dots d z_{N},

(12)

that is

1 = \int_{\vec{z} \in Β_{p, N}} d^{N - 1} Ω_{N} \cdot \int_{0}^{+ \infty} C_{p}^{N} e^{- η_{p} Z^{p}} Z^{N - 1} dZ,

(13)

where we denote $Z^{p} \equiv \sum_{i = 1}^{N} {|z_{i}|}^{p}$ , and $Β_{p, N} \equiv \int_{\vec{z} \in Β_{p, N}} d^{N - 1} Ω_{N}$ is the surface of the N-dimensional sphere of unit radius in L^p space (Lemma 1), so that

1 = \int_{0}^{+ \infty} C_{p}^{N} Β_{p, N} e^{- η_{p} Z^{p}} Z^{N - 1} dZ = \int_{0}^{+ \infty} C_{p}^{N} \frac{1}{p} Β_{p, N} e^{- η_{p} X} X^{\frac{N}{p} - 1} dX \equiv \int_{0}^{+ \infty} P (X; N; p) dX,

where we have used the identity $C_{p}^{N} \frac{1}{p} Β_{p, N} = η_{p}^{\frac{N}{p}} / Γ (\frac{N}{p})$ . Hence, we find

P (X; N; p) dX = \frac{{η_{p}}^{\frac{N}{p}}}{Γ (\frac{N}{p})} e^{- η_{p} X} X^{\frac{N}{p} - 1} dX, with X \equiv χ^{p} .

(14)

In general, for M degrees of freedom, the Chi-p distribution is given by Eq. (10).

3. Statistical test of a fitting

In order to estimate the goodness of the fitting, we minimize the Chi-p, χ^p,

χ^{p} = \sum_{i = 1}^{N} {σ_{f}}_{i}^{- p} {[f_{i} - V (x_{i}; α)]}^{p},

(15)

similar to the minimization of the Chi-square, χ², for the case of the Euclidean norm,

χ^{2} = \sum_{i = 1}^{N} σ_{f_{i}}^{- 2} {[f_{i} - V (x_{i}; α)]}^{2} .

(16)

We begin with the established method of Chi-square, and then we will proceed to the generalized method of Chi-p.

The goodness of a fitting can be estimated by the reduced Chi-square value, $χ_{red}^{2} = \frac{1}{M} χ_{test}^{2}$ , where M = N-1 indicates again the degrees of freedom. The meaning of $χ_{red}^{2}$ is the portion of χ² that corresponds to each of the degrees of freedom, and this has to be ~1 for a good fitting. We can easily understand this, for example, when the given data have equal error σ_f, with ${\{f_{i} \pm σ_{f}\}}_{i = 1}^{N}$ , i.e., $σ_{f_{i}} = σ_{f}$ for all i = 1,...., N. Then, the optimized model value, V(x_i; α*), gives the expected value of the data point f_i, so that the variance can be approached by $σ_{f}^{2} = \frac{1}{M} \sum_{i = 1}^{N} [f_{i} - V (x_{i}; α^{*}] 2^{}$ (sample variance). Hence, the derived Chi-square becomes $χ_{est}^{2} = σ_{f}^{- 2} \sum_{i = 1}^{N} {[f_{i} - V (x_{i}; α^{*})]}^{2} = M$ , and its reduced value $χ_{red}^{2} = \frac{1}{M} χ_{est}^{2} = 1$ . Therefore, a fitting can be characterized as "good" when $χ_{red}^{2} ~ 1$ , otherwise there is an overestimation, $χ_{red}^{2} < 1$ , or underestimation, $χ_{red}^{2} > 1$ , of the errors. When the deviations of the data ${\{f_{i}\}}_{i = 1}^{N}$ from the model values ${\{V (x_{i}; α)\}}_{i = 1}^{N}$ are small, the fitting is expected to be good. However, this characterization is meaningless if the errors of the data ${\{{σ_{f}}_{i}\}}_{i = 1}^{N}$ are either (i) quite larger than their deviations from the model values, i.e., if σ_fi > > |f_i - V(x_i; α)|, or (ii) quite smaller, i.e., if σ_fi < < |f_i - V(x_i; α)| (e.g., see Figure 3). Then, a perfect matching between data and model is useless when the errors of the data are comparably large or small.

Furthermore, a better estimation of the goodness is derived from comparing the calculated χ² value and the Chi-square distribution, that is the distribution of all the possible χ² values for data with normally distributed errors (parameterized by the degrees of freedom M),

P (χ^{2}; M) d χ^{2} = \frac{2^{- \frac{M}{2}}}{Γ (\frac{M}{2})} e^{- \frac{1}{2} χ^{2}} {(χ^{2})}^{\frac{M}{2} - 1} d χ^{2},

(17)

(e.g., see Melissinos1966). The likelihood of having an χ² value equal to or smaller than the estimated value $χ_{est}^{2}$ , is given by the cumulative distribution

P (0 \leq χ^{2} \leq χ_{est}^{2}) = \int_{0}^{χ_{est}^{2}} P (χ^{2}; M) d χ^{2} = 1 - \frac{Γ (\frac{1}{2} M; \frac{1}{2} χ_{est}^{2})}{Γ (\frac{1}{2} M)},

(18)

where $Γ (x; b) = \int_{x}^{\infty} e^{- X} X^{b - 1} dX$ is the incomplete Gamma function. In addition, the likelihood of having an χ² value equal to or larger than the estimated value $χ_{est}^{2}$ , is given by the complementary cumulative distribution

P (χ_{est}^{2} \leq χ^{2} < \infty) = \int_{χ_{est}^{2}}^{\infty} P (χ^{2}; M) d χ^{2} = \frac{Γ (\frac{1}{2} M; \frac{1}{2} χ_{est}^{2})}{Γ (\frac{1}{2} M)} .

(19)

The probability of having a result χ² larger than the estimated value $χ_{est}^{2}$ , defines the p-value that equals $P (χ_{est}^{2} \leq χ^{2} < \infty)$ . The larger the p-value, the better the fitting is (e.g., Melissinos1966). However, the p-value test fails when p > 0.5. Indeed, p-values larger than 0.5 correspond to $χ_{est}^{2} < M$ or $χ_{red}^{2} < 1$ . Even larger p-values, up to p = 1, correspond to even smaller Chi-squares, down to $χ_{red}^{2} ~ 0$ . Thus, an increasing p-value above the threshold of 0.5 cannot lead to a better fitting but to a worse, similar to the indication $χ_{red}^{2} < 1$ . For this reason, we use the "p-value of the extremes". According to this, the probability of taking a result χ², more extreme than the observed value is given by the p-value that equals the minimum between $P (0 \leq χ^{2} \leq χ_{est}^{2})$ and $P (χ_{est}^{2} \leq χ^{2} < \infty)$ , i.e.,

p - value = min [\frac{Γ (\frac{1}{2} M; \frac{1}{2} χ_{est}^{2})}{Γ (\frac{1}{2} M)}, 1 - \frac{Γ (\frac{1}{2} M; \frac{1}{2} χ_{est}^{2})}{Γ (\frac{1}{2} M)}],

(20)

(see some applications in Livadiotis and McComas2013b; Frisch et al.2013; Funsten et al.2013). Note that the maximum p-value is 0.5, and this corresponds to the estimated Chi-square ${χ_{est,}^{2}}_{1 / 2} ≅ M - \frac{2}{3}$ . This is larger than the Chi-square that maximizes the distribution, ${χ_{est,}^{2}}_{max} = M - 2$ . Hence, $χ_{est, max}^{2} < χ_{est, 1 / 2}^{2}$ , i.e., the Chi-square that corresponds to p-value = 0.5, is located always at the right of the maximum.

The statistical test of the fitting for the evaluation of its goodness comes from the null hypothesis that the given data are described by the fitted statistical model. If the derived p-value is smaller than the significance level of ~0.05, then the hypothesis is typically rejected, and the hypothesis that the data are described by the examined statistical model is characterized as unlikely.

A convenient rate for a statistical test is to give more detailed characterization than “likely” when p-value > 0.05, or “unlikely” when p-value < 0.05. For this reason, it is necessary to ascribe an 1–1 relation between the domain of p-values $\{p \in [0, 0.5]\}$ and the range of a rating values $\{T \in [- 1, 1]\}$ , with the correspondence: 1) Impossible $p = 0 \leftrightarrow T = - 1$ ; 2) indefinite $p = 0.05 \leftrightarrow T = 0$ ; 3) certain $p = 0.5 \leftrightarrow T = 1$ . Choosing a power-law function, $(T + 1) / 2 = {(p / p_{0})}^{γ}$ , we find $p_{0} = 0.5$ and γ = log 2, i.e.,

(T + 1) / 2 = {(2 p)}^{log 2} .

(21)

We can easily now characterize the testing rates by a linear separation of the values of T, as shown in Table 1.

Table 1 Testing rates and characterizations

Full size table

In the case of data that follow the General Gaussian distribution of shape p, the derived p-value is dependent on the shape p. Indeed, we have

P (χ^{p}; M; p) d χ^{p} = \frac{{η_{p}}^{\frac{M}{p}}}{Γ (\frac{M}{p})} \cdot {(χ^{p})}^{\frac{M}{p} - 1} \cdot e^{- η_{p} χ^{p}} d χ^{p},

(22)

and

P (0 \leq χ^{p} \leq χ_{est}^{p}) = \int_{0}^{χ_{est}^{p}} P (χ^{p}; M; p) d χ^{p} = 1 - \frac{Γ (\frac{1}{p} M; η_{p} χ_{est}^{p})}{Γ (\frac{1}{p} M)},

(23)

P (χ_{est}^{p} \leq χ^{p} < \infty) = \int_{χ_{est}^{p}}^{\infty} P (χ^{p}; M; p) d χ^{p} = \frac{Γ (\frac{1}{p} M; η_{p} χ_{est}^{p})}{Γ (\frac{1}{p} M)},

(24)

and the p-value that equals the minimum between $P (0 \leq χ^{p} \leq χ_{est}^{p})$ and $P (χ_{est}^{p} \leq χ^{p} < \infty)$ , i.e.,

p - value = min [\frac{Γ (\frac{1}{p} M; η_{p} χ_{est}^{p})}{Γ (\frac{1}{p} M)}, 1 - \frac{Γ (\frac{1}{p} M; η_{p} χ_{est}^{p})}{Γ (\frac{1}{p} M)}] .

(25)

Note that the maximum p-value = 0.5 corresponds to the estimated Chi-square $χ_{est, 1 / 2}^{p} ≅ \frac{1}{p η_{p}} M - \frac{1}{3 η_{p}}$ . This is larger than the Chi-square that maximizes the distribution, $χ_{est, 1 / 2}^{p} ≅ \frac{1}{p η_{p}} M - \frac{1}{η_{p}}$ . Hence, again we find $χ_{est, max}^{p} < χ_{est, 1 / 2}^{p} .$

The statistical test has double role in the case of L^p norms. If the shape parameter p is known, then the test can be rated by deriving the p-value and according to Table 1. If the shape parameter is unknown and the fitting is expected to be good, then all the shape values p that correspond to unlikely p-values can be rejected. In fact, the largest p-value corresponds to the most-likely shape parameter p of the examined data. These are shown in the following applications.

4. Applications

Table 2 contains a dataset of observations of the ratio of the umbral area to the whole sunspot area, ${\{f_{i}\}}_{i = 1}^{N}$ , N = 6 (Edwards1957). Assuming that each of them follows a General Gaussian distribution about their mean, f_i ~ GG(μ_i, σ_i, p), what is the likelihood of these measurements to represent a constant physical quantity? Let this constant be indicated by μ_p, which can be derived from the fitting of ${\{f_{i} \pm σ_{f_{i}}\}}_{i = 1}^{N}$ , and thus, it is typically depended on the p-norm. However, different values of the p-norm lead to different estimated values of the Chi-p, $χ_{est}^{p}$ . Thus, the p-value of the null hypothesis (H_o) depends also on the p-norm.

Table 2 Testing rates and characterizations

Full size table

We apply a statistical test to examine whether the data of the sunspot area ratios are dependent with heliolatitude on not. Therefore, the null hypothesis is that the dataset is described by the statistical model of constant value, i.e., ${\{V (x_{i}; α) = α\}}_{i = 1}^{N}$ . We construct and minimize the Chi-p, given by

χ^{p} (α) = \sum_{i = 1}^{N} {|\frac{f_{i} - α}{σ_{f_{i}}}|}^{p},

(26)

so that the L^p-mean value α_p = α_p(p) is implicitly given by

\sum_{i = 1}^{N} {|\frac{f_{i} - α_{p}}{σ_{f_{i}}}|}^{p} sign (f_{i} - α_{p}) = 0,

(27)

and the estimated Chi-p is

χ^{p} (p) = \sum_{i = 1}^{N} {|\frac{f_{i} - α_{p}}{σ_{f_{i}}}|}^{p} .

(28)

Figure 4(a) shows the six data points co-plotted with four values of α_p, that correspond to p → 1, p → ∞, and the two shape parameter values p₁, p₂ for which the p-value is equal to 0.05. The whole diagram of α_p = α_p(p) is shown in Figure 4(b) and the p-value as a function of p is shown in Figure 4(c).

We observe that the function α_p is monotonically increasing converging to some constant value for p → ∞. The corresponding mean value, α_∞, is given by

α_{\infty} = \frac{\frac{x_{min}}{σ_{x_{min}}} + \frac{x_{max}}{σ_{x_{max}}}}{\frac{1}{σ_{x_{min}}} + \frac{1}{σ_{x_{max}}}} ≅ 0.166 .

(29)

The p-value has a minimum value at p ~ 2.08 and increases for larger shape values p until it reaches p ~ 5.77 where becomes p-value ~ 0.5 (not shown in the figure). If the shape p of the dataset is known, e.g., p = 2, then the null hypothesis is rejected, i.e., the sunspot area ratio data are dependent on the heliolatitude. On the other hand, if the data are expected to be invariant with the heliolatitude, and thus the null hypothesis to be accepted, then all the norms between p₁ ~ 1.7 and p₂ ~ 2.5 are rejected, and the norm L^p with p ~ 5.77 characterizes better these data points; the respective mean value is given by α_p(5.77)~0.164. Therefore, if we know the shape/norm p that characterizes the data, we can proceed and rate the goodness of the fitting. However, if p is unknown, at least we could detect those values of p for which the null hypothesis is accepted or rejected.

One of the most intriguing questions regarding the L^p-normed fitting is how can we determine the characteristic p-norm of the data. This is the suitable norm that should be used for the fitting of those data (Livadiotis2007). The maximization of the p-value is one promising method. We demonstrate this as follows. We construct N = 10⁴ data, ${\{f_{i}\}}_{i = 1}^{N}$ , of a random variable that follows the General Gaussian distribution of shape p, f_i ~ GG(μ = 0, σ = 1, p = 3). Figure 5(a) shows that the normalized histogram of these values matches this General Gaussian distribution. The p-value is approximated using the asymptotic behavior of (complete and incomplete) Gamma functions for large degrees of freedom, M = 9999. Hence, in order to derive the maximum p-value, it is sufficient to maximize

p - value ~ {(\frac{e}{M} p η_{p} χ_{est}^{p})}^{\frac{M}{p}} e^{- η_{p} χ_{est}^{p}} .

(30)

This is shown in Figure 5(b), where the peak is at p ≅ 2.95 ± 0.08. Therefore, the p-value is maximized at the same value of p-norm as the shape of the General Gaussian distribution.

5. Conclusions

This paper (1) presented the derivation of the Chi-p distribution, the analog of Chi-square distribution but for datasets that follow the General Gaussian distribution of shape p, and (2) developed the statistical test for characterizing the goodness of the fitting with L^p norms, which corresponds to datasets that follow the General Gaussian distribution of shape p.

It was shown that the statistical test has double role in the case of L^p norms: (1) If the shape parameter p is fixed and known, then the test can be rated by deriving the p-value. A convenient characterization of the fitting rate was developed. (2) If the shape parameter is unknown and the fitting is expected to be good for some shape parameter value p, a method for estimating p was given by fitting a General Gaussian distribution of shape p to the data, and then use this estimated shape parameter p to the Chi-p distribution to characterize the goodness of fitting. In particular, all the shape values p that correspond to unlikely p-values can be rejected, while the largest p-value corresponds to the most-likely shape parameter p of the examined data. This was verified by an illuminating example where the method of the fitting based on L^p norms was applied.

Appendix A: General Gaussian distribution

According to the theory of L^p-normed mean and variance, developed by Livadiotis (2012), the L^p-normed mean < x > _p of the random variable X with probability distribution P(x), is implicitly defined by

\int_{- \infty}^{\infty} P (x) {|x - < x >_{p}|}^{p - 1} sign (x - < x >_{p}) dx = 0,

(A1)

where sign(u) returns the sign of u. The L^p-normed variance $σ_{p}^{2}$ is given by

σ_{p}^{2} = \frac{\int_{- \infty}^{\infty} P (x) {|x - < x >_{p}|}^{p} dx}{(p - 1) \int_{- \infty}^{\infty} P (x) {|x - < x >_{p}|}^{p - 2} dx} .

(A2)

Next, we derive the L^p-normed mean and variance of the General Gaussian distribution (6), which are Propositions 1 and 2, stated in Section 2.

Proposition 1: Given the distribution (6), we have that the L ^p-normed mean is < x > _p = μ, ∀ p ≥ 1.
Proof. We have
$\int_{- \infty}^{\infty} e^{- η_{p} \cdot {|z|}^{p}} {|z - < z >_{p}|}^{p - 1} sign (z - < z >_{p}) dz = 0,$
(A3)

for z ≡ (x - μ)/σ, < z > _p ≡ (< x > _p - μ)/σ. Let’s assume that < z > _p = 0. Then, the left-hand side of Eq.(A3) is

\int_{- \infty}^{\infty} e^{- η_{p} \cdot {|z|}^{p}} {|z|}^{p - 1} sign (z) dz = 0,

(A4)

because the integrant is a product of symmetric and antisymmetric function. Then, (A3) is true for < z > _p = 0, and given the uniqueness of the L^p-normed mean for each p, we end up with proposition 1. (Note that it is not surprising that the mean, < x > _p = μ, is independent of p. Livadiotis (2012) showed that symmetric probability distributions lead to L^p-normed means that are independent of p.)

Proposition 2: Given the distribution (6), we have that the L ^p-normed variance is $σ_{p}^{2} = σ^{2}$ , ∀ p ≥ 1.
Proof. We have $< {|z|}^{q} > = \int_{- \infty}^{\infty} P (z) {|z|}^{q} dz = 0$ , i.e.,
$\begin{array}{l} \int_{- \infty}^{\infty} e^{- η_{p} \cdot {|z|}^{p}} {|z|}^{q} dz & = 2 \int_{0}^{\infty} e^{- η_{p} \cdot z^{p}} z^{q} dz = 2 {η_{p}}^{- \frac{q + 1}{p}} \int_{0}^{\infty} e^{- w} w^{\frac{q + 1}{p} - 1} dw \\ = 2 {η_{p}}^{- \frac{q + 1}{p}} Γ (\frac{q + 1}{p}), \end{array}$
(A5a)

or,

< z^{q} > = C_{p} \frac{2}{p} {η_{p}}^{- \frac{q + 1}{p}} Γ (\frac{q + 1}{p}) .

(A5b)

Hence, from (A2) we obtain

σ_{p}^{2} = \frac{\int_{- \infty}^{\infty} P (z) {|z|}^{p} dz}{(p - 1) \int_{- \infty}^{\infty} P (z) {|z|}^{p - 2} dz} \cdot σ^{2} = \frac{{η_{p}}^{- \frac{2}{p}} Γ (1 + \frac{1}{p})}{(p - 1) Γ (1 - \frac{1}{p})} \cdot σ^{2} = σ^{2} .

(A6)

Appendix B: Surface of the N-dimensional sphere in L^pspace, Β_p,N

This appendix shows the proof of Lemma 1, stated in Section 2.

Lemma 1: The surface of the N-dimensional sphere of unit radius in L ^p space, Β _p,N, is given by Eq.(8). This is involved in the proof of Chi-p distribution (10), as shown below.
Proof of Lemma 1.

Let the integral

1 = \int_{- \infty}^{+ \infty} \dots \int_{- \infty}^{+ \infty} F (\vec{z}) d z_{1} \dots d z_{N},

(B1)

where $\vec{z} = (z_{1}, \dots, z_{N})$ , $Z^{p} \equiv \sum_{i = 1}^{N} {|z_{i}|}^{p}$ . The magnitude Z is the only quantity with dimensions the same as each of the components z_i. Indeed, if we define c_i ≡ z_i/ζ, where $ζ \equiv \sqrt{\sum_{i = 1}^{N} {z_{i}}^{2}}$ is the Euclidean magnitude of $\vec{z}$ , then, $Z = {(\sum_{i = 1}^{N} {|z_{i}|}^{p})}^{\frac{1}{p}} = ζ \cdot {(\sum_{i = 1}^{N} {|c_{i}|}^{p})}^{\frac{1}{p}}$ , i.e., Z and ζ have the same dimensions. (In the previous sections the components z_i were dimensionless by definition, i.e., $z_{i} \equiv \frac{x_{i} - μ_{x}}{σ_{x_{i}}}$ . However, we can still use this dimension analysis, since the components z_i may have dimensions in the generic case). Hence, we write Eq.(B1) as dz₁ … dz_N = Z^N - 1dZ d^N - 1Ω_N, i.e.,

1 = \int_{0}^{+ \infty} \int_{\vec{z} \in Β_{p, N}} F (Z; Ω_{N}) Z^{N - 1} dZ d^{N - 1} Ω_{N},

(B2)

where $F (\vec{z}) = F (Z; Ω_{N})$ ; Ω_N symbolizes all the angular dependence, and d^N - 1Ω_N denotes the angular infinitesimal. Since F(Z; Ω_N) = F(Z), we have $Β_{p, N} \equiv \int_{\vec{z} \in Β_{p, N}} d^{N - 1} Ω_{N}$ , or

\begin{array}{l} 1 & = \int_{\vec{z} \in Β_{p, N}} d^{N - 1} Ω_{N} \cdot \int_{0}^{+ \infty} F (Z) Z^{N - 1} dZ = Β_{p, N} \cdot \int_{0}^{+ \infty} F (Z) Z^{N - 1} dZ \\ = C_{p}^{N} \frac{1}{p} Β_{p, N} \cdot \int_{0}^{+ \infty} F (X^{\frac{1}{p}}) X^{\frac{N}{p} - 1} dX, \end{array}

where $F (Z) = C_{p}^{N} e^{- η_{p} Z^{p}}$ , $F (X^{\frac{1}{p}}) = C_{p}^{N} e^{- η_{p} X}$ . Therefore,

1 = \int_{0}^{+ \infty} C_{p}^{N} \frac{1}{p} Β_{p, N} e^{- η_{p} X} X^{\frac{N}{p} - 1} dX \equiv \int_{0}^{+ \infty} P (X; N; p) dX,

or,

P (X; N; p) = C_{p}^{N} \frac{1}{p} Β_{p, N} e^{- η_{p} X} X^{\frac{N}{p} - 1} .

(B3)

The normalization $\int_{0}^{+ \infty} P (X; N; p) dX = 1$ gives $C_{p}^{N} \frac{1}{p} Β_{p, N} = {η_{p}}^{\frac{N}{p}} / Γ (\frac{N}{p})$ , or

Β_{p, N} = p {η_{p}}^{\frac{N}{p}} / [C_{p}^{N} Γ (\frac{N}{p})] = p {[(\frac{2}{p}) Γ (\frac{1}{p})]}^{N} / Γ (\frac{N}{p}) .

(B4)

Another way to show Eq.(B4) is through the integration of all the components,

\begin{array}{l} \int_{- \infty}^{+ \infty} \dots \int_{- \infty}^{+ \infty} F (\vec{z}) d z_{1} \dots d z_{N} = 2^{N} \cdot \int_{0}^{+ \infty} \dots \int_{0}^{+ \infty} F (\vec{z}) d z_{1} \dots d z_{N} \\ = 2^{N} \cdot \int_{0}^{+ \infty} F (Z) \int_{\begin{array}{l} \vec{z} \in Β_{p, N} \\ z_{i} \geq 0 \end{array}} {(Z^{p} - z_{2}^{p} - z_{3}^{p} \dots - z_{N}^{p})}^{\frac{1}{p} - 1} Z^{p - 1} dZd z_{2} \dots d z_{N}, \end{array}

by substituting $F (\vec{z}) = F (Z)$ and $z_{1} = {(Z^{p} - z_{2}^{p} - z_{3}^{p} \dots - z_{N}^{p})}^{\frac{1}{p}}$ (for z_i ≥ 0). The integration range $\vec{z} \in Β_{p, N}$ , z_i ≥ 0, means $0 \leq z_{i} \leq {(Z^{p} - \sum_{i + 1}^{N} z_{j}^{p} \dots - z_{N}^{p})}^{\frac{1}{p}}$ for i = 1,…, N-1, and 0 ≤ z_N ≤ Z. Similar, we have

\begin{array}{l} \int_{\begin{array}{l} \vec{z} \in Β_{p, N} \\ z_{i} \geq 0 \end{array}} {(Z^{p} - z_{2}^{p} - z_{3}^{p} \dots - z_{N}^{p})}^{\frac{1}{p} - 1} d z_{2} \dots d z_{N} = a_{1, p} \int_{\begin{array}{l} \vec{z} \in Β_{p, N} \\ z_{i} \geq 0 \end{array}} {(Z^{p} - z_{3}^{p} \dots - z_{N}^{p})}^{\frac{2}{p} - 1} d z_{3} \dots d z_{N} \\ = a_{1, p} a_{2, p} \int_{\begin{array}{l} \vec{z} \in Β_{p, N} \\ z_{i} \geq 0 \end{array}} {(Z^{p} - z_{4}^{p} \dots - z_{N}^{p})}^{\frac{3}{p} - 1} d z_{4} \dots d z_{N} = \prod_{i = 1}^{N - 2} a_{i, p} \cdot \int_{\begin{array}{l} \vec{z} \in Β_{p, N} \\ z_{i} \geq 0 \end{array}} {(Z^{p} - z_{N}^{p})}^{\frac{N - 1}{p} - 1} d z_{N} \\ = \prod_{i = 1}^{N - 1} a_{i, p} \cdot Z^{N - p}, \end{array}

where

a_{i, p} \equiv \int_{0}^{1} {(1 - t^{p})}^{\frac{i}{p} - 1} dt .

(B5)

Hence, we derive

\int_{- \infty}^{+ \infty} \dots \int_{- \infty}^{+ \infty} F (\vec{z}) d z_{1} \dots d z_{N} = 2^{N} \cdot \prod_{i = 1}^{N} a_{i, p} \cdot \int_{0}^{+ \infty} F (Z) Z^{N - 1} dZ,

(B6)

while, on the other hand, we have

\begin{array}{l} \int_{- \infty}^{+ \infty} \dots \int_{- \infty}^{+ \infty} F (\vec{z}) d z_{1} \dots d z_{N} & = \int_{\vec{z} \in Β_{p, N}} d^{N - 1} Ω_{N} \cdot \int_{0}^{+ \infty} F (Z) Z^{N - 1} dZ \\ = Β_{p, N} \cdot \int_{0}^{+ \infty} F (Z) Z^{N - 1} dZ, \end{array}

(B7)

thus,

Β_{p, N} = 2^{N} \cdot \prod_{i = 1}^{N - 1} a_{i, p} .

(B8)

We easily find that

a_{i, p} = \frac{1}{p} \int_{0}^{1} y^{\frac{1}{p} - 1} {(1 - y)}^{\frac{i}{p} - 1} dy = \frac{1}{p} B (\frac{1}{p}, \frac{i}{p}),

(B9)

where B(x, y) ≡ Γ(x)Γ(y)/Γ(x + y) is the Beta function. Hence, we have

Β_{p, N} = p {(\frac{2}{p})}^{N} Γ {(\frac{1}{p})}^{N - 1} \cdot \prod_{i = 1}^{N - 1} Γ (\frac{i}{p}) / Γ (\frac{i + 1}{p}) .

(B10)

Since, $\prod_{i = 1}^{N - 1} Γ (\frac{i}{p}) / Γ (\frac{i + 1}{p}) = Γ (\frac{1}{p}) / Γ (\frac{N}{p})$ , finally, we end up with Eq.(B4).

References

Adèr HJ: Modelling (Chapter 12). In Advising on Research Methods: A consultant’s companion. Edited by: with contributions by D.J. Hand, Adèr HJ, Mellenbergh GJ. Huizen, The Netherlands: Johannes van Kessel Publishing; 2008:271–304.
Google Scholar
Burden RL, Faires JD: Numerical Analysis. Boston, MA: PWS Publishing Company; 1993:437–438.
Google Scholar
Edwards AWF: The proportion of umbra in large sunspots, 1878–1954. The Observatory 1957, 77: 69–70.
Google Scholar
Frisch PC, Bzowski M, Livadiotis G, McComas DJ, Mӧbius E, Mueller HR, Pryor WR, Schwadron NA, Sokól JM, Vallerga JV, Ajello JM: Decades-long changes of the interstellar wind through our solar system. Science 2013, 341: 1080. 10.1126/science.1239925
Article Google Scholar
Funsten HO, Frisch PC, Heerikhuisen J, Higdon DM, Janzen P, Larsen BA, Livadiotis G, McComas DJ, Mӧbius E, Reese CS, Reisenfeld DB, Schwadron NA, Zirnstein E: The circularity of the IBEX Ribbon of enhanced energetic neutral atom flux. Astrophys. J. 2013, 776: 30. 10.1088/0004-637X/776/1/30
Article Google Scholar
Livadiotis G: Approach to general methods for fitting and their sensitivity. Physica A 2007, 375: 518–536. 10.1016/j.physa.2006.09.027
Article Google Scholar
Livadiotis G: Approach to the block entropy modeling and optimization. Physica A 2008, 387: 2471–2494. 10.1016/j.physa.2008.01.002
Article Google Scholar
Livadiotis G: Expectation values and Variance based on L^p norms. Entropy 2012, 14: 2375–2396. 10.3390/e14122375
Article MathSciNet Google Scholar
Livadiotis G, McComas DJ: Fitting method based on correlation maximization: Applications in Astrophysics. J. Geophys. Res. 2013, 118: 2863–2875.
Article Google Scholar
Livadiotis G, McComas DJ: Evidence of large scale phase space quantization in plasmas”. Entropy 2013, 15: 1116–1132.
Article Google Scholar
Livadiotis G, Moussas X: The sunspot as an autonomous dynamical system: A model for the growth and decay phases of sunspots. Physica A 2007, 379: 436–458. 10.1016/j.physa.2007.02.003
Article Google Scholar
McCullagh P: What is statistical model? Ann. Stat. 2002, 30: 1225–1310.
Article MathSciNet Google Scholar
Melissinos AC: Experiments in Modern Physics. London, UK: Academic Press Inc; 1966:464–467.
Google Scholar
Sengupta A: A rational function approximation of the singular eigenfunction of the monoenergetic neutron transport equation. J. Phys. A 1984, 17: 2743–2758. 10.1088/0305-4470/17/14/018
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Southwest Research Institute, San Antonio, TX, USA
George Livadiotis

Authors

George Livadiotis
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to George Livadiotis.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Livadiotis, G. Chi-p distribution: characterization of the goodness of the fitting using L^p norms. J Stat Distrib App 1, 4 (2014). https://doi.org/10.1186/2195-5832-1-4

Download citation

Received: 06 June 2013
Accepted: 24 December 2013
Published: 11 June 2014
DOI: https://doi.org/10.1186/2195-5832-1-4

Chi-p distribution: characterization of the goodness of the fitting using Lp norms