Modelling changes in Arctic Sea Ice Cover: an application of generalized and inflated beta and gamma densities

Congdon, Peter

doi:10.1186/2195-5832-1-3

Research
Open access
Published: 11 June 2014

Modelling changes in Arctic Sea Ice Cover: an application of generalized and inflated beta and gamma densities

Peter Congdon¹

Journal of Statistical Distributions and Applications volume 1, Article number: 3 (2014) Cite this article

2166 Accesses
Metrics details

Abstract

A modelling framework for changing Arctic sea ice extent is developed reflecting different trends and seasonal extremes in nine Arctic sub-regions. Core sub-regions retain partial ice cover throughout the year, and in winter show complete ice cover, while in peripheral sub-regions, winter coverage is not complete, and there is no ice cover at all in the summer. A generalized beta representation is developed for monthly ice extents in core sub-regions, with inflation to model maximum winter extents. For peripheral sub-regions, a gamma time series with excess zeroes (representing summer sea ice absence) is developed. Different trend representations (deterministic vs. stochastic) are compared for non-extreme observations. Other potential applications of the generalized beta density allowing zero or maximum inflation are discussed.

1 Introduction

In recent years, Arctic sea ice has been declining with wider climatic implications. The latter are multi-faceted and subject to discussion and uncertainty; see, for example, (Screen et al. 2013). However, one implication follows from the role of sea ice in regulating the global temperature via its ability (compared to the ocean surface) to reflect the sun’s radiation. The albedo of snow-covered sea ice is 0.90, meaning it reflects 90% of the sun’s radiation, whereas the ocean surface reflects only 10%. Less sea ice and more ocean surface will lead to a warmer Arctic, and contribute to global warming. There is also evidence that decline in Arctic sea ice is influencing atmospheric circulation (including the jet stream) within and beyond the Arctic, with impacts including winter cold surges in Europe and North America (Liu et al. 2012).

Satellite records of Arctic sea ice extent (available with full annual coverage since 1979) show a downward trend, albeit with fluctuations such as unusually low points in 1990 (Serreze et al. 1995) and in 2007 (Comiso et al. 2008). For the Arctic as a whole, the rate of decline in summer ice extent has been greater (Stroeve et al. 2012), with an 8.2% linear rate of decline (per decade) in September ice extent between 1979 and 2011, as against a 3.9% linear rate of decline for March ice extent (see Table 1).

Table 1 Arctic sea ice extent

Full size table

However, the Arctic extent totals in Table 1 aggregate over sub-regions with dissimilar trends and seasonal extremes; for example, some sub-regions show substantial falls in extent in both winter and summer months. The present study accordingly reviews issues in modelling sea ice trends for Arctic sub-regions, and includes short term projections three years beyond the range of the observed data. While a number of forecasts of sea ice cover have been developed (e.g. Stroeve et al. 2007), these generally consider the entire Arctic, and do not consider sub-regional differences. The observed data are monthly NASA satellite records from 1979 to 2011 (396 observations), disaggregated to nine Arctic sub-regions. These are the central Arctic Ocean, the Canadian Archipelago, Hudson Bay, Sea of Okhotsk, Bering Sea, Baffin Bay, Greenland Sea, Kara-Barents Sea, and Gulf of St. Lawrence (see Figure 1). The observed data were extracted in June 2013 at http://neptune.gsfc.nasa.gov/csb/index.php?section=59.

There are important regional differences in trends and seasonal patterns according to location (e.g. Arctic core vs. periphery) that are relevant to statistical analysis and forecasts. These include modelling seasonal extremes. For central regions there is still some summer cover but a reversion to complete cover (maximum possible extent) in winter months. To represent this pattern, a generalized beta representation is applied, including "maximum inflation" to account for winter total coverage.

By contrast, in some non-core regions such as the Sea of Okhotsk and Bering Sea, winter extents do not cover the entire sea region, and there is no ice cover in the summer. For these regions, a gamma density with excess zeroes (representing summer sea ice absence) is developed. In both situations, regression models including trend and seasonal components are applied to represent the change in mean sea ice extent, while a separate logit regression is applied to the probability of inflation. Contrasting stochastic and deterministic trend representations are evaluated. Estimation is based on monthly observations in the first 31 years of satellite observations (1979-2009) with cross-validation using the remaining two years of observed data as a test sample. Bayesian estimation and inference are implemented using the Winbugs package via Markov chain Monte Carlo (MCMC) methods (Lunn et al. 2009).

2 Generalized beta time series regression

2.1 Application of the generalized beta to sea ice extents

As mentioned above, while there is decline in Arctic sea ice overall, there are considerable regional differences in trends and seasonal ice extent. What are here denoted "core" Arctic sea regions (namely the central Arctic Ocean, the Canadian Archipelago, and Hudson Bay) retain partial ice cover throughout the year, and in winter months show complete ice cover. For example, in 2011, the central Arctic had readings of 7.158mn k m² throughout January to March while the Canadian Archipelago had readings of 0.751 mn k m² for January through to April. Figure 2 shows monthly extent totals for the central Arctic ocean during 2007-11 and illustrates maximum inflation in winter. The trend in these three regions is similar to that in the Arctic ocean considered as an aggregate, namely stronger declines in summer extent.

A time series representation needs to express the annual reversion to complete cover (maximum recurrence) in winter months, together with the irregular trend for declining extent in non-winter months. The application of a generalized form of the beta density is motivated by the fact that the observed ice extents y can be seen as ratios r = y/d to a maximum possible extent d, though substantive interest is in trends in extents y. It is important that the bounded nature of the response is included in any model. Another possibility might be some form of truncated sampling mechanism (e.g. a log-normal for extent readings y with ceiling d) but this precludes any analysis of the factors producing seasonal extremes.

Consider the beta distribution on (0,1), with density function given by

f (z | a, b) = \frac{1}{B (a, b)} z^{a - 1} {(1 - z)}^{b - 1},

with a > 0,b > 0. An alternative representation (Ospina and Ferrari 2010) involves mean and precision parameters (μ,ϕ), where a = μ ϕ,b = (1 - μ)ϕ, namely

f (z | μ, ϕ) = \frac{1}{B (μ ϕ, ϕ - ϕ μ)} z^{μ ϕ - 1} {(1 - z)}^{(1 - μ) ϕ - 1},

with ϕ > 0, and 0 < μ < 1. This form facilitates separate modelling of mean and variance trends (Huang and Oosterlee 2008). For values (a,b) apart from a = b = 1, the Beta(a,b) density has zero mass at the extreme values 0 and 1, and zero-inflated or one-inflated versions of the beta need to be applied (Ospina and Ferrari 2010). Let g = 0 or 1, then inflation at either boundary is achieved by the mechanism

f (z | α_{g}, μ, ϕ) = \{\begin{array}{l} α_{g}, & z = g \\ (1 - α_{g}) f (z | μ, ϕ), & z \in (0, 1), \end{array}

where α_g is an inflation probability.

The generalized beta is obtained by extending the support interval to an arbitrary bounded interval (c,d) (with d > 0) via a linear transformation y = c + z(d - c) (Pham-Gia and Duong 1989), so that

f (y | a, b, c, d) = \frac{1}{B (a, b) {(d - c)}^{a + b - 1}} {(y - c)}^{a - 1} {(d - y)}^{b - 1},

with equivalent representation

f (y | μ, ϕ, c, d) = \frac{1}{B (μ ϕ, ϕ - ϕ μ) {(d - c)}^{ϕ - 1}} {(y - c)}^{μ ϕ - 1} {(d - y)}^{(1 - μ) ϕ - 1},

(1)

with mean c + (d - c)μ, and variance (d - c)²μ(1 - μ)/(ϕ + 1).

For the generalized beta in (1), inflation will need to be applied for values occurring at the boundary points, when y = c or y = d (this may be termed minimum and maximum inflation). The maximum inflated version of the generalized beta is particularly relevant to the sea ice application and has the form

f (y | α_{d}, μ, ϕ, c, d) = \{\begin{array}{l} α_{d}, & y = d \\ (1 - α_{d}) f (y | μ, ϕ, c, d), & y \in (c, d) . \end{array}

In the generalized beta applied to core Arctic regions, c = 0 while d is the maximum winter extent (namely d₁ = 7.158 mn k m² for the central Arctic ocean, d₂ = 0.751mn k m² for the Canadian Archipelago, and d₃ = 1.233 mn k m² for Hudson Bay).

While summer minimum extents in the central Arctic and Canadian Archipelago remain well in excess of zero, those in Hudson Bay are becoming relatively small, e.g. y = 0.025 mn k m² in September 2010. This raises the possibility of needing to represent both maximum and minimum inflation in the generalized beta. This can be handled by the mechanism

f (y | α_{c}, α_{d}, μ, ϕ, c, d) = \{\begin{array}{l} α_{c}, & y = c \\ α_{d}, & y = d \\ (1 - α_{c} - α_{d}) f (y | μ, ϕ, c, d), & y \in (c, d), \end{array}

where the vector of probabilities (α_c,α_c,1 - α_c - α_d) should be modelled using a multinomial logistic.

2.2 Generalized beta time series regression with maximum inflation for sea ice extents

Let {μ_t,ϕ_t,α_{d
t}} denote the series of parameters underlying the y_t series, m = m_t represent the month that observation t corresponds to, and s = s_t represent the year corresponding to observation t (e.g. s = 2 in 1980 for observations t = 13,..,24 and s = 31 for observations t = 361,..,372). A parsimonious time series model is sought (Ledolter and Abraham 1981), combining close fit with low predictive variability, especially for cross-validatory and out-of-sample predictions. As discussed in Section 4, these aspects of fit are assessed using a posterior predictive fit criterion (Laud and Ibrahim 1995). A parsimonious model for the level of the series is expressed by a logit regression in μ_t,

logit(μ_t) = Δ_ms + ϑ_m, where Δ_ms represents trend for each combination of month m and year s, and ϑ_m represents seasonal effects.

Two options for the trend are considered. One option is a stochastic trend, with random variation around a central linear trend. To allow for steeper declines in some months, a discrete mixture is implemented via

\begin{array}{l} Δ_{ms} = ρ_{m} φ_{1 s} + (1 - ρ_{m}) φ_{2 s}, \\ ρ_{m} \sim Bern (π_{m}), \\ φ_{0 s} = δ_{00} + δ_{01} s + u_{s}, \\ φ_{1 s} = δ_{10} + δ_{11} s + u_{s}, \\ u_{s} \sim N (u_{s - 1}, σ_{u}^{2}), \end{array}

(2)

with ρ_m ∼ Bern(π_m) being binary indicators. With the constraint δ₁₁ < δ₀₁,φ_1s represents the stronger downward trend.

The other form of trend assumption (deterministic) involves a linear trend in years s_t combined with a short term AR1 lag effect in extents y_t. The latter represents carry over between successive months; for example, if September extent is relatively low in a particular year, then October extent may also be relatively low. The linear trend may vary between months and by broad sub-period. For example, (Comiso et al. 2008) report a stronger decline during 1996-2007 than 1979-96. Here we consider three sub-periods p = 1,...,3 of 12 years, including out-of-sample years (2012-14), namely 1979-1990, 1991-2002, and 2003-2014. The trend parameter for a particular time point is chosen by a monthly specific discrete mixture between guide linear trend parameters, specific to broad period, Γ_0p and Γ_1p, with Γ_1p < Γ_0p. The AR1 lag effect is also taken to vary by month, with normal priors for each monthly lag parameter. Thus for month m = m_t, and year s_t,

\begin{align} logit (μ_{t}) = Δ_{t} + ϑ_{m}, \\ Δ_{t} = γ_{0} + γ_{1 m} s_{t} + γ_{2 m} y_{t - 1}, \end{align}

where the linear trend γ_1m for month m = m_t in period p is chosen using a discrete mixture

γ_{1 m} = ρ_{m} Γ_{0 p} + (1 - ρ_{m}) Γ_{1 p}; ρ_{m} \sim Bern (π_{m}) .

Remaining aspects of the model are applicable across different representations of trend. Seasonal (monthly) effects are represented by a Fourier series (Höhle and Paul 2008),

ϑ_{m} = \underset{j = 1}{\sum^{J_{1}}} (β_{2 j - 1} cos (ω jm) + β_{2 j} sin (ω jm)),

(3)

where ω = 2π/M, with M = 12, and J₁ is the number of harmonics. To allow for changing precision it is assumed that

log (ϕ_{t}) = η_{1} + η_{2 m} s,

(4)

namely a linear trend (varying by month) in year units. For example, (Stroeve et al. 2012) find evidence of increased variability in overall Arctic sea ice extent, especially in summer months.

To represent extreme data (complete winter coverage), a logit regression is used to model the probabilities α_dt of maximum inflation, with form

logit (α_{dt}) = ξ_{0} + \underset{j = 1}{\sum^{J_{2}}} (ξ_{2 j - 1} cos (ω {jm}_{t}) + ξ_{2 j} sin (ω {jm}_{t})) .

A trend element in the inflation probability is not included as it would be confounded with the trend model in the mean.

2.3 Other generalized beta applications

While the application here focuses on sea ice extent and a time series application, the generalized inflated beta with mechanisms or regressions for both extreme and non-extreme observations has potential applications in other settings where the observations can be regarded as ratios r_i = y_i/d_i of actual extents to a maximum extent d_i, but substantive interest is in the extents y_i. The extents may be, inter alia, expressed in spatial units (e.g. areas in millions of square kilometres) or time units (e.g. durations in hours). As an example with time extents, one might analyse hours with cloud cover y_i in relation to daylight hours d_i, while a spatial application might consider desertified extents y_i in relation to total area extents d_i.

Data of this form can be considered as a form of compositional data, and widely used methods (Aitchison and Egozcue 2005, Butler and Glasbey 2008) focus on the ratios r_i, or more specifically the log ratios. Spatial applications involving compositional data and zero inflation have been described in (Leininger et al. 2013), but also focus on the ratios.

However, for policy purposes, the interest may be in trends or patterns in the extents themselves (i.e. the y-data), rather than in the ratios, as is the case with the sea ice extents. Alternatively stated, "compositions provide information only about the relative magnitudes of the compositional components and so interpretations involving absolute values... cannot be justified" (Aitchison and Egozcue 2005) p. 839. Thus in the case of desertification (e.g. Zhao et al. 2010), the substantive focus may be on spreading desertification, implying analysis of desertified extents y_i. Some areas may be totally desertified with y_i = d_i (maximum inflation). Regression modelling of desertification extents would then need to include a mechanism or regression describing maximum inflation, as would spatial forecasting (or interpolation) of desertified extents in situations where comprehensive assessment of desertification status is only available for some area units.

The generalized beta density with maximum or zero inflation might be potentially extended to Dirichlet density applications, and to generalized inflated Dirichlet densities parallel to equation (1), where there are more than two categories and where extreme observations can occur. For example, with three categories the observations would be (y_1i,y_2i,y_3i), with $\sum_{k} y_{ki} = d_{i},$ and maximum inflation occurring when any y_ki = d_i. The inflation probabilities can be modelled using a multiple logistic. For example, in the sea ice application, one may distinguish by sea-ice type (e.g. Fissel et al. 2011) between perennial multi-year sea ice and first-year ice, so that sub-region observations become (y₁,y₂,y₃) for area covered by multi-year ice, area covered by first-year ice, and area without ice cover respectively.

3 Inflated gamma time series regression

In what are here denoted "peripheral" Arctic regions (Sea of Okhotsk, Bering Sea, Gulf of St. Lawrence), the winter ice extents do not provide complete ice cover of the sea region, while in at least one peak summer month (July, August, September) there is no ice cover at all. For example, Figure 3 shows the extent series for the Bering Sea for 2007-11. Let y_t denote the observed extents (mn k m²) for these regions following a gamma density,

f (y | μ, ϕ) = \frac{{(ϕ / μ)}^{ϕ}}{Γ (ϕ)} y^{ϕ - 1} e (- ϕ y / μ),

with mean μ and variance μ²/ϕ. A zero inflated version of the gamma is motivated by the need to represent absence of ice cover in summer months, as in the mechanism

f (y | α_{0}, μ, ϕ) = \{\begin{array}{l} α_{0}, & y = 0 \\ (1 - α_{0}) f (y | μ, ϕ), & y \in (0, \infty) . \end{array}

Let {μ_t,ϕ_t,α_0t} denote the series of parameters underlying the observations y_t. The link for μ_t depends on which of two options for trend is applied.

One option is a stochastic trend, with log-link

log (μ_{t}) = Δ_{ms} + ϑ_{m},

(5)

where Δ_ms is as in (2) above. The other option, a deterministic trend, combines a linear trend in years s with a short term lag effect in the response. The linear trend may vary between months, and also between sub-period p. As above, the trend parameter for a particular time point is chosen according to a monthly specific discrete mixture between guide linear trends, specific to broad period, Γ_0p and Γ_1p, with Γ_1p < Γ_0p. Lag effects are also taken to vary by month. To avoid an explosive impact of the lag effect (Blundell et al. 2002, Jung et al. 2006), the following link scheme provides the deterministic trend option,

μ_{t} = exp (ζ_{0} + ζ_{1 m} s_{t} + ϑ_{m}) + ζ_{2 m} y_{t - 1},

(6)

with the constraint ζ_2m ≥ 0. For example, one may take ζ_2m to have exponential or gamma priors.

Remaining aspects of the model are applicable across different representations of trend. Seasonal effects ϑ_m are represented by a Fourier series, as in (3), and precision parameters modelled using the regression form (4). To represent summer extremes (no ice cover at all), a logit regression for the probability of zero inflation is used, with

logit (α_{0 t}) = ξ_{0} + \underset{j = 1}{\sum^{J_{2}}} (ξ_{2 j - 1} cos (ω {jm}_{t}) + ξ_{2 j} sin (ω {jm}_{t})) .

For the remaining three Arctic sub-regions (Baffin Sea, Greenland Sea, Kara-Barents), the data series demonstrate neither maximum recurrence in winter, nor complete summer ice disapperance as yet. The maximum values observed for these three series (in millions k m²) are 1.766 (Baffin Sea, March 1993), 1.115 (Greenland Sea, January 1982) and 2.168 (Kara-Barents, April 1979), whereas recent winter maxima are below these levels. Recent summer extents might be taken to indicate incipient ice disappearance in summer, such as a reading of 0.04 mn km² in the Baffin Sea in August 2011. However, for these regions a gamma regression may be adopted, without any inflation mechanism. The time series regression therefore involves {μ_t,ϕ_t}, and stochastic and deterministic trend options, as in (5) and (6) respectively, may be compared.

4 Model fit and comparison

4.1 Analysis framework

To illustrate a full comparative analysis, three sub-regions are considered as representative of the three sets of regions described above: the central Arctic, the Sea of Okhotsk and the Greenland Sea. Different models are applied to monthly observations from January 1979 to December 2009 (namely for t = 1,..,T with T = 372), with in-sample cross-validation for the two year period 2010-2011, namely for t = T + 1,...,T₁ with T₁ = 396. Out of sample predictions are made for a further 36 months, January 2012 to December 2014. Posterior inferences are based on the second halves of two chain runs of 50,000 iterations, with convergence assessed using Brooks-Gelman-Rubin statistics (Brooks and Gelman 1998).

4.2 Generalized beta regression with maximum inflation

The central Arctic is one of three sea regions with recurrent winter maxima and partial summer ice cover, and Figure 4 shows the trend in yearly averages for the central Arctic, with the anomalous low cover years (1990, 2007) apparent. In applying the generalized beta regression, normal priors with mean zero and variance 1000 are adopted on fixed effect parameters {β,ξ,η,Γ,δ,u₁,γ₀,γ₁}, while the lag parameters γ_2m are assigned N(0,1) priors. A gamma prior with shape 1 and scale 0.001 is adopted for the precision $1 / σ_{u}^{2},$ while Beta(1,1) priors are adopted for the π_m. The Fourier seasonal representations for the means μ_t and inflation probabilities α_dt have J₁ = J₂ = 3 harmonics, as insignificant regression effects {β,ξ} were obtained at higher numbers of harmonics.

Predictions are obtained by generating replicate inflation indicators D_new,t ∼ Bern(α_dt), and replicate scaled beta values d z_new,t, where z_new,t ∼ Beta(μ_tϕ_t,ϕ_t-ϕ_tμ_t). Predictions are then

y_{new, t} = D_{new, t} d + (1 - D_{new, t}) {dz}_{new, t} .

Let M_new,t and V_new,t be the posterior means and variances of y_new,t. In-sample predictive fit (within the training dataset from January 1979 through to the end of 2009) is based on the L-criterion (Laud and Ibrahim 1995), namely the square root of

L^{2} = \overset{T}{\sum_{t = 1}} V_{new, t} + \overset{T}{\sum_{t = 1}} {(M_{new, t} - y_{t})}^{2} .

Cross-validatory predictions beyond T = 372 (for the 24 months in 2010 and 2011) are made by treating ${y_{T + 1}, ..., y_{T_{1}}}$ as missing data, and generating predictions ${y_{new, T + 1}, ..., y_{new, T_{1}}} .$ These can be compared with observed test data y_obs,t for 2010 and 2011. Cross validatory fit using the L criterion, denoted CV-L, is based on

{CV-L}^{2} = \overset{T + 24}{\sum_{t = T + 1}} V_{new, t} + \overset{T + 24}{\sum_{t = T + 1}} {(M_{new, t} - y_{obs, t})}^{2} = P_{1} + P_{2},

which will reflect the precision of future predictions (P₁) as well as the fit (P₂). To further assess predictive performance, a check is made whether observed extents are within 95% intervals of y_new. In a model that effectively reproduces the data, predictive coverage is at or above 95% (Gelfand 1996, p. 158).

Table 2 shows posterior means and variances during the cross-validation period, while Table 3 summarises the L-criteria for both cross-validation periods (test data) and the in-sample estimation (training data). It is apparent that the deterministic trend has a better cross-validation performance but a worse in-sample performance. In particular, the deterministic trend is less adaptable to anomalous observations such as occurred during the summers of 1990 and 2007. Both models are effective in reproducing the original data satisfactorily in terms of coverage obtained by y_new,t.

Table 2 In-sample cross validation (for 2010 and 2011)

Full size table

Table 3 Predictive fit (L-criterion) and predictive checks, sea ice extent models, central Arctic ocean

Full size table

The better out-of-sample cross validation under the deterministic trend is due to lower predictive variability (P₁ = 2.06 under the deterministic trend, whereas P₁ = 6.13 under the stochastic trend); pure fit is broadly similar between the two options (P₂ = 0.41 under deterministic trend, as against P₂ = 0.34 under stochastic trend). Figure 5 shows observations and predictions for 2010-11, as well as out-of-sample predictions through to December 2014, under the deterministic trend option. Posterior mean predictions for September extent in 2012, 2013 and 2014 are respectively 3.88, 3.78 and 3.68mn k m². Table 4 summarizes selected parameters from the deterministic trend model. The strongest downward trends in mean extent, as represented by the guide parameters Γ_0p and Γ_1p, are for 1979-90 and for the partially observed third period (2003-14), with estimates based on observations for 2003-09. Parameters η_2m for trends in precision are nonsignificant in summer months (i.e the 95% credible intervals straddle zero), though the 95% credible intervals for η_2m in July and August are biased to negative values, namely (-0.052,0.009) and (-0.058,0.009).

Table 4 Deterministic trend parameters

Full size table

4.3 Gamma regression time series

The Sea of Okhotsk is one of three regions with recurrent summer disappearance of ice cover. Table 5 shows that unlike the trend for the whole Arctic as an aggregate (e.g. see Table 1), the most pronounced declines in average extent in the Sea of Okhotsk have been in the first quarter months (January, February, March). In applying gamma time series regression with zero inflation, priors are as in Section 4.2, except that exponential E(1) priors are adopted for the lag parameters ζ_2m, in order to ensure that μ_t is positive - see equation (6). The Fourier seasonal effects series included in representations for μ_t and the inflation probabilities α_0t have J₁ = J₂ = 3 harmonics. Predictions are obtained by generating replicate inflation indicators D_new,t ∼ Bern(α_0t), and replicate gamma values w_new,t ∼ Gamma(ϕ_t,ϕ_t/μ_t). Predictions are then obtained as

y_{new, t} = 0 \times D_{new, t} + (1 - D_{new, t}) w_{new, t} .

Table 5 Trends in extent by month, sea of Okhotsk

Full size table

Table 6 summarises the L-criteria for the cross-validation period (when predictions are compared to test data) and for the in-sample estimation period (when predictions compared to training data). The deterministic trend has better performance (see Figure 6 for actual and fitted series). March extents (in mn k m²) are predicted to fall to 0.895 in 2012, 0.889 in 2013 and 0.879 in 2014 as compared to 0.964 in 2011. The strongest downward trend in mean extent as shown by the deterministic trend guide parameters {Γ_0p,Γ_1p} is actually in the first sub-period (1979-1990). It may be noted that the actual data for the Sea of Okhotsk show a fall in annual average extent (mn k m²) from 0.58 in 1979 to 0.37 in 1990. The posterior means (sd’s) for Γ_1p are respectively -0.016 (0.010), -0.0085 (0.0044), and -0.014 (0.005).

Table 6 Posterior predictive loss, sea ice extent model, sea of Okhotsk

Full size table

For the three remaining sea regions (e.g. the Greenland Sea) with partial summer cover and no winter maximum recurrence, a gamma regression may be applied. In fact, like the Sea of Okhotsk, the Greenland Sea has declines in extent across both winter and summer months. For example, the average March extent (in mn k m²) fell from 0.98 in 1979-89 to 0.82 in 1990-2000 and 0.78 in 2001-11, while the average September extent fell from 0.37 (1979-89) to 0.34 (1990-2000) and 0.25 (2001-11). For this sub-region, a deterministic trend model has better in-sample fit and cross-validatory fit (see Figure 7). March extents are predicted as 0.767 in 2012, 0.766 in 2013 and 0.762 in 2014 as compared to 0.752 in 2011, and 0.764 in 2010. September extents are predicted as 0.294 in 2012, 0.290 in 2013 and 0.290 in 2014 as compared to 0.332 in 2011, and 0.264 in 2010.

4.4 Implications for prediction over entire Arctic

It is relevant to assess how far aggregated predictions (totalling over models for each of the nine regions) compare with predictions from a single model applied to the extent time series for the entire Arctic sea (i.e. to the data summarised in Table 1). The aggregated predictions are based on training data for 1979-2009 and combine

a)
generalized beta regressions (with maximum inflation) applied to extent data for the Central Arctic, Canadian Archipelago, and Hudson Bay;
b)
zero inflated gamma regressions applied to extent data for the Sea of Okhotsk, Bering Sea, and Gulf of St.Lawrence;
c)
gamma regressions without inflation applied to extent data for the Baffin Sea, Greenland Sea, and Kara-Barents.

The deterministic trend model is adopted, and MCMC iterations 25,000-26,000 are cumulated over the nine models, providing posterior summary statistics (mean, variance, 2.5% percentile, 97.5% percentile) for cross-validatory predicted extents in 2010-11 across the entire Arctic region. By contrast, the single region approach (applied to the extent series for 1979-2009 encompassing the entire Arctic region) involves a gamma regression with deterministic trend - since the stochastic trend option provided much less precise predictions to 2010-11.

The comparative L-criteria for cross-validatory fit (over the 24 months in 2010-11) are 2.43 (aggregated predictions over sub-regions) and 3.70 (single region approach). Figure 8 shows the cross-validatory predictions (2010-11) and out-of-sample predictions (2012-14) for the combined forecast aggregating over sub-region models. March extents (in mn km²) are predicted as 14.42 in 2012, 14.35 in 2013 and 14.31 in 2014, as compared to actual extents of 14.34 in 2011, and 14.88 in 2010. September extents (in mn km²) are predicted as 4.97 in 2012, 4.84 in 2013 and 4.71 in 2014, as compared to actual totals of 4.60 in 2011, and 4.90 in 2010.

Although sub-region data are not (at the time of writing) available beyond 2011, totals for the entire Arctic are available from ftp://sidads.colorado.edu/DATASETS/NOAA/G02135/. (Note that the latter differ slightly before 2012 from entire Arctic extents based on totalling the sub-regional series at http://neptune.gsfc.nasa.gov/csb/index.php?section=59). In fact, the March 2012 figure of 15.21 mn k m² (NSIDC, 2012) was the highest March average ice extent since 2008. By contrast, the September 2012 figure was anomalously low at 3.61 mn k m².

5 Summary and conclusions

This paper has considered sub-regional aspects of the observed downward trend in Arctic ice sea cover. Arctic sub-regions differ in observed trends and also in seasonal extremes. Thus some regions still have total winter cover but partial summer cover, while other regions have partial winter cover and no ice cover in summer. While downward trends in sea ice extent across the Arctic as a whole show a stronger summer decline, this is not necessarily the case when sub-regions are considered.

These differences are important for choosing the appropriate distribution, and inflation mechanism. Sub-regional differences in ice loss may also be important for assessing relatively localized impacts on climate or economic activities (e.g. Fissel et al. 2011). There may also be benefits in prediction (see Section 4.4) through considering such sub-regional differences and in aggregating over region-specific models.

Here generalized beta densities (with maximum inflation) and gamma regression (with zero inflation) have been used to represent recurrent winter maxima and summer ice disappearance respectively. Other applications of this methodology may be envisaged, outside time series applications. Possible applications of the generalized inflated beta density are discussed in Section 2.3. It may be noted that transformations of ice extent such as w_t = log(1 + y_t) might be envisaged as ways of avoiding beta or gamma regressions, and instead leading possibly to modelling using normal or skewnormal likelihoods. However, the problem of seasonal extreme inflation at particular values remains, and the methodology proposed in the paper provides a suitable representation for such extremes or for explaining them.

Another approach, as in compositional data analysis, is to focus on the ratios r_t = y_t/d_t to the maximum, in the case when the data has two categories (e.g. area covered by sea ice y_t, remaining area d_t - y_t). Generally compositional data analysis involves Gaussian likelihoods applied to log-transformed ratios. This method can adjust for zero inflation (e.g. Butler and Glasbey, 2008), but does not generate direct inferences or out-of-sample predictions for the extent data themselves.

References

Aitchison J, Egozcue J: Compositional data analysis: Where are we and where should we be heading? Math. Geol 2005, 37: 829–850. 10.1007/s11004-005-7383-7
Article MathSciNet Google Scholar
Blundell R, Griffith R, Windmeijer F: Individual effects and dynamics in count data models. Journal of Econometrics 2002, 108: 113–131. 10.1016/S0304-4076(01)00108-7
Article MathSciNet Google Scholar
Brooks S, Gelman A: General methods for monitoring convergence of iterative simulations. J. Comput. Graph.l Stat 1998, 7: 434–455.
MathSciNet Google Scholar
Butler A, Glasbey C: A latent Gaussian model for compositional data with zeros. J. R. Stat. Soc. Series C 2008, 57: 505–520. 10.1111/j.1467-9876.2008.00627.x
Article MathSciNet Google Scholar
Comiso J, Parkinson C, Gersten R, Stock L: Accelerated decline in the Arctic sea ice cover. Geophys. Res. Lett 2008, 35: L01703. doi:10.1029/2007GL031972 doi:10.1029/2007GL031972
Google Scholar
Fissel D, de Saavedra Álvarez M, Kulan N, Mudge T, Marko J: Long-term trends for Sea Ice in the Western Arctic Ocean: implications for shipping and offshore oil and gas activities. In Proceedings of the Twenty-first 2011 International Offshore and, Polar Engineering Conference. Hawaii, USA: Maui; 2011. International Society of Offshore and Polar Engineers (ISOPE). http://www.isope.org/publications/ISOPEproceedinglist.htm International Society of Offshore and Polar Engineers (ISOPE).
Google Scholar
Gelfand A: Model determination using sampling based methods. Edited by: W. Gilks, S. Richardson, D. Spiegelhalter. Boca Raton, pp. 145–157: Markov Chain Monte Carlo in Practice, Chapman and Hall/CRC; 1996.
Google Scholar
Höhle M, Paul M: Count data regression charts for the monitoring of surveillance time series. Comput. Stat. & Data Anal 2008, 52: 4357–4368. 10.1016/j.csda.2008.02.015
Article MathSciNet Google Scholar
Huang X, Oosterlee C: Generalized beta regression models for random loss-given-default. Department of Applied Mathematical Analysis, Delft University of Technology, Report 08–10; 2008.
Google Scholar
Jung R, Kukuk M, Liesenfeld R: Time series of count data: modeling, estimation and diagnostics. Comput. Stat. Data Anal 2006, 51: 2350–2364. 10.1016/j.csda.2006.08.001
Article MathSciNet Google Scholar
Laud P, Ibrahim J: Predictive model selection. J R Stat Soc 1995, 57B: 247–262.
MathSciNet Google Scholar
Ledolter J, Abraham B: Parsimony and its importance in time series forecasting. Technometrics 1981, 23: 411–414. 10.1080/00401706.1981.10487687
Article Google Scholar
Leininger T, Gelfand A, Allen J, Silander J: Spatial regression modeling for compositional data with many zeros. J. Agricultural Biol. Environ. Stat 2013, 18(3):314–334. 10.1007/s13253-013-0145-y
Article MathSciNet Google Scholar
Liu J, Curry J, Wang H, Song M, Horton R: Impact of declining Arctic sea ice on winter snowfall. Proc. Nat. Acad. Sci 2012, 109: 4074–4079. 10.1073/pnas.1114910109
Article Google Scholar
Lunn D, Spiegelhalter D, Thomas A, Best N: The BUGS project: Evolution, critique and future directions. Stat. Med 2009, 28: 3049–3067. 10.1002/sim.3680
Article MathSciNet Google Scholar
National Snow and Ice Data Center 2012.http://nsidc.org/arcticseaicenews/2012/04/arctic-sea-ice-enters-the-spring-melt-season/
Ospina R, Ferrari S: Inflated beta distributions. Stat. Papers 2010, 51: 111–126. 10.1007/s00362-008-0125-4
Article MathSciNet Google Scholar
Pham-Gia T, Duong Q: The generalized beta- and F-distributions in statistical modelling. Math. Comput. Modell 1989, 12: 1613–1625. 10.1016/0895-7177(89)90337-3
Article MathSciNet Google Scholar
Screen J, Deser C, Simmonds I, Tomas R: Atmospheric impacts of Arctic sea-ice loss 1979–2009: separating forced change from atmospheric internal variability. Climate Dynamics 2013. doi:10.1007/s00382–013–1830–9 doi:10.1007/s00382-013-1830-9
Google Scholar
Serreze M, Maslanik J, Key J, Kokaly R, Robinson D: Diagnosis of the record minimum in arctic sea ice area during 1990 and associated snow cover extremes. Geophys. Res. Lett 1995, 22: 2183–2186. 10.1029/95GL02068
Article Google Scholar
Stroeve J, Holland M, Meier W, Scambos T, Serreze M: Arctic sea ice decline: faster than forecast. Geophys. Res. Lett 2007, 34: L09501.
Google Scholar
Stroeve J, Serreze M, Holland M, Kay J, Malanik J, Barrett A: The Arctic’s rapidly shrinking sea ice cover: a research synthesis. Climatic Change 2012, 110: 1005–1027. 10.1007/s10584-011-0101-1
Article Google Scholar
Zhao X, Luo Y, Wang S, Huang W, Lian J: Is desertification reversion sustainable in northern China: a case study in Naiman county, part of a typical agro-pastoral transitional zone in Inner-Mongolia, China. Global Environ. Res 2010, 14: 63–70.
Google Scholar

Download references

Author information

Authors and Affiliations

School of Geography, Queen Mary University of London, London, E1 4NS, UK
Peter Congdon

Authors

Peter Congdon
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Peter Congdon.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Congdon, P. Modelling changes in Arctic Sea Ice Cover: an application of generalized and inflated beta and gamma densities. J Stat Distrib App 1, 3 (2014). https://doi.org/10.1186/2195-5832-1-3

Download citation

Received: 20 June 2013
Accepted: 25 October 2013
Published: 11 June 2014
DOI: https://doi.org/10.1186/2195-5832-1-3

Modelling changes in Arctic Sea Ice Cover: an application of generalized and inflated beta and gamma densities

Abstract

1 Introduction

2 Generalized beta time series regression

2.1 Application of the generalized beta to sea ice extents

2.2 Generalized beta time series regression with maximum inflation for sea ice extents

2.3 Other generalized beta applications

3 Inflated gamma time series regression

4 Model fit and comparison

4.1 Analysis framework

4.2 Generalized beta regression with maximum inflation

4.3 Gamma regression time series

4.4 Implications for prediction over entire Arctic

5 Summary and conclusions

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords