This article aims at providing a comprehensive survey of recent developments in the field of integer-valued time series modelling, paying particular attention to models obtained as discrete counterparts of conventional autoregressive moving average and bilinear models, and based on the concept of thinning. Such models have proven to be useful in the analysis of many real-world applications ranging from economy and finance to medicine. We review the literature of the most relevant thinning operators proposed in the analysis of univariate and multivariate integer-valued time series with either finite or infinite support. Finally, we also outline and discuss possible directions of future research.
Modelling and predicting the temporal dependence and evolution of time series of low counts have attracted a lot of attention over the last years. This is partially due to the increasing availability of relevant high-quality data sets in various fields of applications such as social science, industry, finance and economy, medicine and ecology just to mention a few. This great variety of application areas is illustrated by the data examples being plotted in Figures 1 and 2. Figure 1a shows a time series of monthly counts of sex offences (21st police car beat in Pittsburgh, 1990–2001) as discussed in Ristić et al. (2009). Figure 1b shows the monthly number of countries in the EA17 Euro area having stable prices (i.e., with an inflation rate below 2 %, period 2000–11); see Weiß and Kim (2014) for details.
Source: Authors' own.
Figure 2 shows examples of bivariate count data time series. The time series in Figure 2a-b (having the infinite range ) are about the daily number of daytime and nighttime road accidents in Schiphol area (Netherlands) in the year 2001; see Pedeli and Karlis (2011) for a detailed description of the data. Figure 2c-d show the number of rainy days per week in the German towns of Bremen and Cuxhaven (period 2000–10), which constitutes a time series with the finite range ; further details are provided by Scotto et al. (2014).
Source: Authors' own.
It is important to stress that there is no unifying approach applicable to modelling all the above time series of counts. Consequently, the analysis of such time series has to be restricted to special classes of integer-valued models. An useful conceptual division of these models can be made as being either observation-driven or parameter-driven models, though this distinction may be blurred Jung and Tremayne(2011a). A suitable class of observation-driven models is the one including models based on thinning operators. Models belonging to this class are obtained by replacing the multiplication in the conventional time series models (e.g., ARMA) by an appropriate thinning operator, along with considering a discrete distribution for the sequence of innovations in order to preserve the discreteness of the counts.
In modelling time series having an infinite range of counts, the class of INteger-valued AutoRegressive Moving Average (INARMA) models based on the binomial thinning operator of Steutel and van Harn (1979) plays a prominent role. Steutel and van Harn's binomial thinning operator (sometimes also referred to as ‘binomial subsampling’, see Puig and Valero 2007 takes the form
and otherwise, where is a collection of independent and identically distributed (i.i.d.) Bernoulli counting random variables (r.v's) with (fixed) probability of success . Furthermore, it is assumed that is a non-negative integer-valued random variable independent of . Note that, conditioned on , is binomially distributed, i.e., . The binomial thinning operator 1.1 shares several properties with the multiplication operator, namely,
;
;
, with the r.v's having support in .
The last property above holds true provided that the counting sequences involved in and , are independent. Another important similarity between both operators, as quoted in Hall (2001), which is particularly useful for the analysis of the extremal properties of models involving the binomial thinning operator is the following: if has a regularly varying right tail of the form
for some slowly varying function at , then the distribution tail of behaves asymptotically as the tail of . Furthermore, and are tail equivalent in the sense that , as . Unfortunately, such property does not hold for other important distributions namely those having exponential-type tails such as the geometric and the negative binomial distribution.
Turning now to the differences between the binomial thinning operator and the simple multiplication, the most important is that the distributive property does not hold for the binomial thinning operator since . Another important difference between both operators is due to the fact that
The INARMA model of order and based on the binomial thinning operator 1.1 is defined through the recursion
where is an i.i.d. sequence of integer-valued r.v's with finite mean and variance. Furthermore, it is assumed that all thinning operators are performed independently of each other and of . The time index below the binomial thinning operator emphasizes the fact that the operators are performed at each time . When in 1.2, is called an INAR of order ; when , is referred to as INMA of order . In the subsequent sections, several subclasses of INARMA models will be explained in detail.
While INARMA models directly imitate the classical ARMA recursion, another approach useful to handle time series of counts is to consider generalized linear models, where the serial dependence structure is incorporated through a link function. This leads to a very flexible class of models, where, e.g., covariate information are easily incorporated. On the other hand, model identification and interpretation may become more problematic as well as closed-form formulae concerning the stationary marginal distribution. A commonly used regression model, with a linear link function, is the integer-valued generalized autoregressive conditional heteroscedastic (INGARCH) process of orders Heinen (2003); Ferland et al. (2006), defined as
This article is intended to be an update of the survey published by Weiß(2008a) on the early developments in the analysis of univariate integer-valued time series based on thinning operators. It aims to provide a concise introduction to the subject with plentiful references. We review the thinning operators recently proposed in the literature (i.e., from 2008 onwards) as well as the thinning-based models commonly used in the analysis of univariate, and also multivariate integer-valued time series defined over bounded and unbounded sets. Efforts have been made to include all relevant contributions on the field to make this literature survey as complete and self-contained as possible, which has resulted in a rather voluminous list of references at the end of the paper.
The rest of the article is organized as follows: Section 2 surveys several classes of univariate binomial thinning-based models useful for representing integer-valued time series with finite and infinite support. Section 3 introduces several generalizations of the binomial thinning operator which are commonly used in the analysis of integer-valued time series exhibiting features, such as, non-stationarity, positive and negative autocorrelations, and overdispersion. Extensions of univariate thinning operators for the bivariate and multivariate case are discussed in Section 4, together with models based on such thinning operators to fitting bivariate/multivariate integer-valued time series defined over bounded and unbounded sets. Section 5 is devoted to conclusions and to discuss likely directions of future research.
Univariate binomial thinning-based models
A simple procedure to obtain models for integer-valued data is to replace the multiplication in conventional time series models, e.g., ARMA, bilinear or threshold models, by thinning operators in order to ensure the integer discreteness of the process. Among the most successful integer-valued time series models based on thinning operators are the INARMA models defined in 1.2. McKenzie (1985) and Al-Osh and Alzaid (1988) introduced the so-called stable INAR model,
with in 1.2, which constitutes an instance of a subcritical branching process with immigration. If the innovations have a finite mean, then the INAR process in 2.1 is stationary Weiß (2013). The INAR model shares many properties with the conventional AR model, namely, the fact that for both models the autocorrelation function (ACF) takes the form Note, however, that the range of variation of the ACF in the INAR case is restricted to . Another important property of the INAR model in 2.1 is that the discrete self-decomposable (DSD) distributions are possible marginal distributions, since the probability generating function (p.g.f.) of the INAR model satisfies
Many well-known distributions belong to the class of DSD distributions, namely, the negative binomial, Poisson, generalized Poisson distribution, generalized discrete Mittag-Leffler distribution and discrete stable distributions as a sub-class. By appropriately varying the innovations’ distribution, also zero-modified or underdispersed marginal distributions can be obtained; see Weiß (2013), Jazi et al. (2012) and others. In view of the closeness property with regard to addition and binomial thinning as shown in Puig and Valero (2007), a very natural approach is to assume the innovations to be compound-Poisson-distributed Schweer and Weiß (2014). These options, however, do not include distributions defined on bounded sets, ruling out the possibility to consider, e.g. the binomial distribution as a marginal distribution for . A modified AR(1)’like model for binomial counts making use of binomial thinning is discussed further in the text.
Empirically relevant generalizations of the stable INAR model have been suggested by several authors. For example, Silva et al. (2005) introduced the replicated INAR process being the th time series , defined as
with and . In order to make the INAR models more flexible with respect to real data applications, it may be of interest to include also covariates in the model to account for the dependence of the thinning probabilities on several factors. For this purpose, Brännäs (1995) and Monteiro et al. (2008) considered the INAR model in 2.1 and introduced the effect of explanatory variables through the thinning parameter . Usually, the conventional specification with and , is adopted. The explanatory variables vectors are treated as fixed and represents the corresponding vectors of unknown parameters. Further important contributions are from Brännäs et al. (2002) (panel data) and Brännäs and Hellström (2001) (extended dependence structure). INAR models contaminated with additive and innovational outliers were introduced and analyzed by Barczy et al. (2010); Barczy et al. (2012). Extensions of the INAR model into the spatial context were considered by Ghodsi et al. (2012).
The counterpart to the conventional AR model with , in the context of INARMA models, is the so-called INAR model which follows the recursion
We can distinguish three types of INAR processes, namely, the stable case , the unstable case and the explosive case . We concentrate first on the stable case. Note that since the thinning operators are probabilistic, the joint distribution of has to be considered leading to different types of INAR() models. Alzaid and Al-Osh (1990) assume a conditional multinomial distribution whereas Du and Li (1991) require conditional independence. It is important to note that the statistical properties of the Alzaid and Al-Osh (1990) model are very different from the properties of the model suggested by Du and Li (1991). In terms of second-order structure, for instance, the ACF resembles the one obtained for the usual ARMA model, whereas Du and Li's formulation implies that the ACF is the same as that of an AR() model. The stationarity condition for the INAR() process by Du and Li (1991) is that the roots of the characteristic polynomial are outside the unit circle, which is equivalent to .
A major drawback of both representations for the INAR model is that its stationary marginal distribution, for , differs from that for . To tackle this problem, Zhu and Joe (2006) and Weiß(2008c) introduced the so-called Combined INAR() (CINAR()) model, which is constructed by using a probabilistic mixing mechanism. If is an i.i.d. process with independent of , then the CINAR() process is defined by the recursion
Assumptions concerning the joint distributions of the r.v's being involved in recursion 2.4 are discussed by Weiß(2008c). The main advantage of the CINAR() model when compared with the INAR() model is that if is a stationary CINAR() process then its p.g.f. satisfies 2.2. Hence, the possible marginal distributions of a stationary CINAR() process include the DSD family.
For the unstable case (i.e., when the characteristic polynomial has a unit root), Barczy et al. (2011) proved that under some natural assumptions, namely, the INAR process being primitive (i.e., and the greatest common divisor of the set being equal to one) and that , the INAR process converges weakly towards a squared Bessel process (also known as Cox-Ingersoll-Ross process). The asymptotic behaviour of the unstable INAR process differs remarkably from that of the conventional unstable AR counterpart: the unit root of the characteristic polynomial has multiplicity one (contrary to the unstable AR model which may has real or complex unit roots with different multiplicities), and while the unstable INAR model converges to a well-defined limit, in contrast, however, for the unstable AR model such a limit does not exist in general. Further results for the unstable case can be found in Barczy et al. (2011); Barczy et al. (2014). Contrary to the other two cases, the explosive INAR process has been overlooked in the literature and remains as a topic for future investigation.
Discrete counterparts to the conventional th-order moving average processes (referred to as INMA()) based on the binomial thinning operator have also been proposed in the literature. The INMA() model is defined by
where is an i.i.d. sequence of non-negative integer-valued r.v's with finite mean and variance, and where with and, usually, . In analogy to the INAR case, each is involved in thinning operators, at different times . Hence, for the same set of thinning parameters , a wide variety of models exhibiting very different autocovariance functions and joint distributions can be constructed by only changing the dependence structure of the thinning operators. Such types of INMA models were proposed by Al-Osh and Alzaid (1988), McKenzie (1988) and Brännäs and Hall (2001). Weiß(2008b) showed that all these models can be embedded, indeed, into a single family of INMA() models.
Using the concept of binomial thinning, also conventional bilinear models can be adapted to the integer case leading to the class of integer-valued bilinear (INBL) models. This class of models is particularly suitable for modelling processes which assume low values with high probability, but exhibiting, at the same time, sudden bursts of large values. Drost et al. (2008) introduced the super diagonal INBL process, which is a particular class of the more general INBL class with given by the form
where if and is an i.i.d. sequence of non-negative, integer-valued r.v's having finite mean and variance. Doukhan et al. (2006) analyzed the special INBL model
The stationary INBL process is capable of producing, despite its simplicity, sudden bursts of large values and hence is particularly suitable for modelling integer-valued time series showing heavy-tails of power law types (e.g., Zipf's law). Doukhan et al. (2006) and Drost et al. (2008) assume that the distinct binomial thinning operators involved in 2.6 and 2.7 are independent and also independent when applied to the different random variables. Figure 3 displays three simulated sample paths consisting in 1000 observations of the bilinear model 2.7 with and and Po in all cases. As expected, when the bilinear parameter tends to 1, bursts of large values become more obvious.
Source: Authors' own.
Drost et al. (2008) derived a condition guaranteeing the existence of a unique strictly stationary process satisfying 2.6 with finite first moment, and Doukhan et al. (2006) obtained conditions to ensure that the INBL process is second-order stationary. For the bilinear model in 2.7, Drost et al. (2008) also provided sufficient conditions for the existence of higher order moments of . For these INBL processes, the ACF (when it exists) often provides little insight into the dependence structure of the process, and its empirical counterpart can behave in a very unpredictable way. Thus, second-order methods depending on the sample ACF for identification and estimation can misguide the practitioner and result in an inappropriate model being selected. It is worth also to mention here that conditions for invertibility of the general INBL model (which are crucial for estimation and prediction) are not known. Therefore, the use of this class of models in practice is by far quite limited. Furthermore, the analysis of the probabilistic structure of INBL models (in particular, e.g., weak limits of extreme values) also remains an open question. This is partially due to the fact that it is difficult to obtain an explicit analytic expression for the stationary distribution of (2.6).
When the focus is on the analysis of integer-valued time series with a finite range of counts, say , the models given above are useless. Addressing this issue, McKenzie (1985) gave a noticeable contribution by suggesting to replace the innovation term in the INAR recursion 2.1 by the term , leading to the following representation for ,
with , for and where all thinning operators are performed independently of each other, and being the thinning operators at time independent of . Note that the representation for in 2.8 guarantees that the range of is given by . The process in 2.8 used to be referred to as binomial AR process and is a stationary Markov chain with states and binomial marginal distribution . Alternative correlated processes with binomial marginals based on the so-called hypergeometric thinning operator was introduced by Al-Osh and Alzaid (1991); see Weiß(2008a) for details. The binomial AR process shares some properties with the conventional AR models, namely, the fact that the ACF decays to zero at an exponential rate. Other important features of the binomial AR process are that both the conditional mean and variance of given are linear in , and the fact of being time-reversible. Closed-form expressions for the joint moments and cumulants were obtained by Weiß and Kim(2013a), whereas issues related with parameter estimation have been addressed by Cui and Lund (2010), Weiß and Pollett (2014) and Weiß and Kim(2013a); Weiß and Kim(2013b). Extensions for higher- order binomial models were first proposed by Weiß (2009). In order to preserve the range of to be limited by , the author introduced the class of combined binomial AR processes in analogy with the class of CINAR models in 2.4, by defining through the recursion
where for , with and defined as in 2.8, and . All thinning operators at time are performed independently of each other and of the i.i.d. process being independent of all and , for . A particularly relevant class of combined binomial AR models is the one containing the so-called independent thinning models, because it leads to a conventional AR autocorrelation structure while preserving the stationary marginal of the underlying first-order model.
Extensions of the binomial thinning operator
The binomial thinning operator 1.1 and the related time series models presented in Section 2 were generalized in a number of different ways. Such generalizations were motivated by the need for, e.g., capturing overdispersion, a modified probability for observing a zero, a particular type of dependence structure or including non-negative integers into the range of the process. In the present section, we concentrate on those generalizations leading to a univariate time series, while later in Section 4, we consider multivariate extensions.
Generalized thinning operators
Various authors have proposed modifications of the binomial thinning operator in 1.1 in order to make the integer-valued models based on thinning more flexible for practical purposes. Latour (1998) introduced the generalized thinning operator by allowing the components in to be i.i.d. integer-valued r.v's with finite mean and variance although not necessarily Bernoulli-distributed. Modifying the INAR(1) recursion 2.1 accordingly leads to the generalized INAR(1) (GINAR(1)) process, and Latour's GINAR model is the generalized counterpart of the INAR model by Du and Li (1991). The case in which ’s are i.i.d., r.v's with geometric distribution, say , was analyzed by Ristić et al. (2009) and referred to as the negative binomial thinning operator, say ‘’. The operator ‘’ is then used to modify the INAR(1) recursion 2.1. Ristić et al. (2009) showed that if the marginal distribution of is also geometric (referred to as ‘new’ geometric INAR(1) process, NGINAR(1)), say , then is a mixture of and . Nastić et al. (2012) proposed a th order autoregressive extension by adapting the CINAR() approach in equation 2.4. Ristić et al.(2012a) considered the case of negative binomial marginals, the NBINAR(1) process. A slightly modified type of negative binomial thinning (with an additional mixture) and related INARMA models are discussed by Aly and Bouzar(1994a); Aly and Bouzar(1994b).
Also the extended Poisson INAR model by Weiß (2015) can be understood as a special instance of Latour's model, but which uses binomial-Poisson thinning. In fact, also the (Poisson) INARCH model (i.e., the 1.3 with and ) may be subsumed under the class of AR-like thinning models (see, e.g., Grunwald et al. (1997); Weiß (2015) by considering the Poisson thinning operator. This argument also extends to the whole INARCH family, which can be seen as the corresponding instance of the GINAR model by Latour (1998). Another important special case of Latour's operator is the extended thinning operator proposed by Zhu and Joe (2003) being the ’s i.i.d. r.v's, independent of , having the same distribution as a random variable with p.g.f.
with mean and variance . Clearly, the binomial thinning operator corresponds to in 3.1. Also the expectation thinning operator as proposed by Zhu and Joe (2010) is an instance of the generalized thinning operator, in which constitutes a family of self-generalized r.v's with support on and , for all .
Joe (1996) proposes a very general way of obtaining stationary INAR models of the form , with being a random operator. The two terms on the right-hand side of the equation are assumed to be independent. Here, the idea is to properly select a marginal distribution and also an appropriate thinning operator, , to ensure that the marginal distribution of and are from the same family. For this purpose, the author assumes that the marginal distribution of is an infinitely divisible exponential dispersion model with parameters and (in short ED) which includes as special cases the Poisson, negative binomial and the generalized Poisson distribution. Furthermore, the random operator is such that where is referred to as the contraction corresponding to ED. Joe (1996) proved that under this setting the marginal distribution of is also ED and independent of . Examples of contractions include the binomial, the quasi-binomial and the beta-binomial distributions. The quasi-binomial thinning of Alzaid and Al-Osh (1993) is a special case of the random operator of Joe (1996) in which the distributions of , and are generalized Poisson with parameters , and , respectively. Extensions of Joe's results for higher order INARMA models have been considered by Jørgensen and Song (1998) and Jung and Tremayne(2011b).
Random coefficient thinning
Joe (1996) and Zheng et al. (2006); Zheng et al. (2007) suggested to consider stochastic thinning operators by allowing in 1.1 to be random itself. The resulting thinning operator is called random coefficient thinning. The concept of random coefficient thinning has been extended by Gomes and Canto e Castro (2009) by allowing a different discrete distribution associated to the thinning operator, in analogy to Latour's generalized thinning operator. Specifically, the authors define the generalized random coefficient thinning operator , based on a discrete random variable , a random variable with support on , and a family of discrete-type distribution functions parametrized by the mean and the standard deviation , as , with mean and variance , being a function depending on and . Possible choices for are: (i) Binomial; (ii) Negative Binomial; (iii) Poisson; and (iv) Geometric. Leonenko et al. (2007) generalized the concept of random coefficient thinning by introducing a class of mixed thinning operators, where the probability of success in 1.1 takes the form , with and being a random variable with distribution function concentrated in the interval . The case , leads to the usual random coefficient thinning.
INAR models based on random coefficient thinning operators of the form
where is a sequence of i.i.d. r.v's each one taking values in the interval , and is a sequence of i.i.d. integer-valued non-negative r.v's, independent of , were considered by Zhang and Wang (2015), Roitershtein and Zhong (2013), Bakouch and Ristić (2010), Gomes and Canto e Castro (2009) and Zheng et al. (2007). Gomes and Canto e Castro (2009) and Zheng et al. (2007) proved that if and are finite, then the condition , ensures that there exists a unique non-negative integer-valued weakly stationary process satisfying 3.2. Leonenko et al. (2007) analyzed the INAR model based on the mixed thinning operator, i.e., by replacing the operator by the mixed operator in 2.1, where is a sequence of i.i.d. r.v's concentrated on the interval with common distribution . The authors proved that if process has finite first and second moments then Moreover, extensions for the INMA model in 2.5 based on stochastic thinning sequences were considered by Hall et al. (2010), who assume that the ’s form a sequence of independent r.v's independent of . Hall et al. (2010) analyzed the extremal properties (in particular the limiting distribution of the normalized maxima) of this class of INMA models.
For modelling of count data time series with a finite range, Weiß and Kim (2014) introduced a beta-binomial autoregressive model using the concept of random coefficient thinning. This particular model generalizes the binomial AR(1) model 2.8 to the case of finite counts showing extra-binomial variation (overdispersion with respect to the binomial distribution). The definition of the beta-binomial AR(1) model resembles the one in 2.8 by replacing the thinning operators and by and which are assumed to be r.v.’s following a Beta distribution with parameters and being , respectively.
Thinning operators with dependence structure
All thinning operators above rely upon the assumption of independence across the counting variables. Extensions of binomial thinning based on Bernoulli-distributed dependent r.v's were proposed by Brännäs and Hellström (2001), and more recently by Ristić et al. (2013) who assume that the variables in 1.1 take the form
where and are sequences of i.i.d. Bernoulli r.v's with parameters and , respectively, and is also a Bernoulli random variable with parameter . Furthermore, it is assumed that , and are independent for all . The representation in 3.3 implies that forms a sequence of dependent Bernoulli r.v's with parameter , since for and . The case corresponds to the binomial thinning operator. Notice that equation 3.3 also relates to the Pegram's operator Puig and Valero (2007) upon which Jacobs and Lewis (1983) and Biswas and Song (2009) described a unified framework to yield discrete-valued ARMA models. Pegram's operator is defined as follows: given two independent discrete r.v's and , and a mixing weight , Pegram's operator mix and with and , to produce a new random variable , with probability mass function
Another type of additional dependence structure is obtained through the assumption of time-dependence of the thinning parameters. Monteiro et al. (2010) considered an INAR model based on the binomial thinning operator with periodically varying parameter of the form with for . Such models are useful, e.g. in the analysis of fire activity. On the other hand, Brännäs and Nordström (2006) proposed the following modification of the binomial AR process in 2.8 to modelling daily accommodation time series for hotels and cottages,
Note that the parameter is allowed to vary in time. The authors assume that in 3.4 represents the number of occupied rooms in some hotel at day , and the parameter represents the hotel capacity at day . Brännäs and Nordström (2006) also consider the situation in which the thinning parameters and in 3.4 are time-dependent variables, aiming to capture seasonal patterns in guest night time series as well as the income level of the country of origin and exchange rates. Some authors suggest to generalize the binomial thinning mechanism by allowing the thinning probabilities to dependent on previous observations (i.e., state-dependent thinnings). One such example is the functional coefficient INAR(1) (FINAR(1)) model as proposed by Triebsch (2008). She assumed the thinning parameter at time in recursion 2.1 to be a measurable (and typically decreasing) function of , i.e., , and established conditions for the weak dependence of the resulting FINAR process. In the discussed data applications, a logistic form is assumed for . A related approach for the case of finite counts was considered by Weiß and Pollett (2014) who introduced the density-dependent binomial AR process
Unlike the binomial AR model in 2.8, the thinning parameters and are now presumed to be time-dependent, varying according to the ‘density’ . The authors derived a number of results concerning large approximations, law of large numbers and central limit theorems. Furthermore, the authors illustrate the usefulness of their approach by considering two particular subfamilies of density-dependent binomial AR models which permit a wide range of overdispersion and underdispersion scenarios as well as positive and negative autocorrelations.
Another approach of state-dependence is given through the concept of self-exciting threshold models being motivated by the so-called piecewise phenomenon. The fundamental reason for introducing such classes of models is the need to model random cyclic behaviour that exists in many time series. In the continuous-valued case, threshold models are typically characterized by having a linear (ARMA) structure in each regime. The basic idea is as follows: we start with a linear model for and allow the parameters to vary according to the values of a finite number of past values of (self-exciting), or a finite number of past values of an associate process. Basic results on the probabilistic structure of this class of models can be found in Turkman et al. (2014) and Tong (1990). For dealing with time series of counts exhibiting piecewise-type patterns, Monteiro et al. (2012) introduced a class of self-exciting threshold INAR (SETINAR) models of order one and two regimes based on the binomial thinning operator. It falls within the class of the two-regimes SETINAR model of orders (SETINAR defined as
where it is assumed that , for , and that the i.i.d. innovation sequences and have distributions and on , respectively. The constant represents the threshold level of the process and the regime switch is triggered by the lag- value of the series. Furthermore, the binomial thinning operators take the form where , for , are sequences of i.i.d. Bernoulli r.v's, independent of and , with success probabilities and , respectively. Monteiro et al. (2012) proved the existence of a strictly stationary process satisfying 3.5 for the case .
Thinning operators for -valued time series
All approaches discussed so far consider the case of count data time series, i.e., for time series having the non-negative integers as their range. Arguably, the first paper considering times series with possible values in the full set of integers, from now on abbreviated as -valued time series, is the one by Kim and Park (2008). These authors introduced the signed binomial thinning operator being defined as follows:
where if and otherwise, and is defined as in 1.1 but having probability of success . Based on their thinning operator, Kim and Park (2008) extended the INAR model in 2.3 to fit integer-valued time series taking values on . Bakouch and Ristić (2010) used this operator to define a first-order model with Skellam-distributed innovations. The signed binomial thinning operator of Kim and Park (2008) has been generalized in many ways. Kachour and Truquet (2011), e.g., define the generalized signed thinning operator as
if and 0 otherwise. The ’s have a common distribution on a subset of and are independent of , in analogy to the generalized thinnings being discussed in Section 3.1. Kachour and Truquet (2011) used the signed-type thinning operator 3.7 to extend the INAR model in 2.3 to fit -valued time series. In particular, they introduced the signed INAR(1) (SINAR) process by the recursion with taking values on and charging zero with positive probability and . Chesneau and Kachour (2012) analyzed the particular SINAR model in which the distribution is defined as follows: , and , where . Note that with Bi and
where is a -valued random variable and is Bessel-distributed with parameters and . Moreover, is defined as in 1.1 being independent of , and . Furthermore, is a sequence of i.i.d. r.v's independent of and , and has the probability mass distribution and . Alzaid and Omair (2014) proved that within this framework, follows an extended binomial distribution Alzaid and Omair (2012) with parameters and . Note that the extended binomial thinning operator includes the binomial operator in 1.1 provided that is non-negative and in 3.8. Operator 3.8 is then used by Alzaid and Omair (2014) to define the Poisson difference INAR(1) process by
where is an i.i.d. sequence of r.v's with Skellam distribution and is a parameter taking values and describing the sign of the correlation. Alzaid and Omair (2014) proved that under the above conditions, the marginal distribution of is also Skellam.
Kim and Park's operator was also extended by Zhang et al. (2010) who introduced the signed generalized power series thinning operator. Zhang and co-workers’ operator is defined as in 3.6 but with the ’s being i.i.d. with a generalized power series distribution. Such family of distributions includes, among others, the Poisson, the logarithmic series, the binomial and the negative binomial one.
An important advantage of the aforementioned models is that their ACF preserves the structure of the ACF of conventional ARMA models which, in turn, implies that their ACF will alternate between positive and negative values provided that some of models’ parameters are negative (in particular, in 3.6, the mean of the distribution in 3.7, and parameter in Alzaid and Omair's model). Hence, such models are good candidates for fitting -valued time series with some negative autocorrelations.
Another approach for obtaining an operator acting on a truly integer-valued random variable was proposed by Freeland (2010). Freeland's operator is defined as follows: let be the difference of two latent and independent Poisson random variables and with the same mean , then follows the Skellam distribution with parameter . The corresponding thinning operator ‘’ is defined as for , which is then used to define the so-called True INAR(1) model, abbreviated as TINAR. The TINAR model can be expressed as the difference of two independent Poisson INAR models. This concept was also adapted to the negative binomial thinning operator Ristić et al. (2009) by Barreto-Souza and Bourguignon (2015) and Nastić et al. (2014). The corresponding first-order model, the skew TINAR model, has observations following a skew discrete Laplace distribution.
Before concluding this section a reference should be made to the two random operators (referred to as first- and second-order random rounding operators) introduced by Liu and Yuan (2013). The usefulness of such operators rely mainly on the fact that the conditional mean and the conditional variance of the process are modelled separately, which implies that no assumptions on the relationship between the conditional mean and variance are needed. This is in contrast, e.g., to the class of binomial thinning-based INARMA models and the class of INGARCH models in which strong assumptions between the conditional mean and variance are imposed. Further advantages of models based on Liu and Yuan's operators include their flexibility in handling -valued time series with range of variation of the ACF in .
Parameter estimation, testing and forecasting
A large number of articles cover aspects of parameter estimation for INAR-type models. For first- order models, also results from the literature about branching processes with immigration can often be adopted (see Weiß (2015) for some references). But also for higher- order models, types of, e.g., moment estimators, conditional least-squares (CLS) estimators or maximum likelihood (ML) estimators have been developed, and their asymptotic properties have been discussed, see, e.g., Latour (1998) (CLS estimation) or Bu et al. (2008) (ML estimation) for the case of GINAR models. It is well known, however, that the CLS method is statistically inefficient and that the ML method is difficult to implement, mainly due to the complication in computing the transition probabilities involved in the log-likelihood function, as the order of the model increases. To overcome such difficulties, Pedeli et al. (2014) proposed a saddlepoint-based technique to obtain accurate approximation of the transition probabilities. A semiparametric estimation approach for INAR models was proposed and investigated by Drost et al. (2009), while Weiß and Kim(2013b) consider diverse estimation approaches for binomial AR(1) models. For the INBL models, in contrast, only the method of moments has been investigated so far as an estimation technique.
Also model identification and diagnostic techniques have been proposed in the literature for INARMA models, while corresponding approaches for INBL models are currently not available. Jung and Tremayne (2003), for instance, compared several tests for uncovering significant serial dependence as being caused by INARMA processes. Sun and McCabe (2013) developed further such tests based on the likelihood score for diverse types of innovations’ distribution, while Schweer and Weiß (2014) proposed a dispersion test for INAR(1) models. Hudecová et al. (2015) discuss tests based on the empirical p.g.f. Concerning time series of counts with a finite range, we find, e.g., a test for uncovering extra-binomial variation in binomial AR(1) processes in the work by Weiß and Kim (2014), while Kim and Weiß (2015) investigate a number of goodness-of-fit tests for binomial AR(1) processes.
Procedures for constructing forecasts for INAR-type models have been also proposed in the literature. The traditional way of producing forecasts is via the -step-ahead conditional distribution of given , . Forecasting can be pursued within the setting of conditional expectations in which case the -step-ahead predictor yielding minimum mean squared error forecasts is . The major drawback of forecasting based on the conditional expectation is that it hardly produces coherent (i.e., integer-valued) predictions. In order to generate data coherent predictions, the median of the -step-ahead conditional distribution (which minimizes the expected absolute error) or its corresponding mode can be employed as a point forecast. However, as pointed out by Freeland and McCabe (2004), using the median or the mode may be potentially misleading specially when the range of variation of is small. In such cases, a more enlightened approach is to provide point estimates and confidence intervals for , i.e., the probability of occurrence of , instead of obtaining forecasts based on the conditional mean or median. More recently, McCabe et al. (2011) proposed a novel method for producing efficient probabilistic forecasts in the INAR class in 2.3. The method involves estimating the forecast distribution of the INAR model for a given horizon and to quantify the sampling variation that is associated with future counts, via subsampling techniques. McCabe and co-authors’ technique for assessing the effect of sampling variation mimics the conventional prediction intervals, but ensures at the same time, that the non-negativity and the unit property of the forecast distribution still stands true. Furthermore, the authors also proved that, under certain conditions, the estimated forecast distribution based on the non-parametric ML estimates for the model's parameters and , is an asymptotic efficient estimator of the forecast distribution in the sense of the Hajek convolution theorem.
Multivariate models based on thinning operators
Multinomial thinning
Extensions of univariate thinning operators for the multivariate case is nowadays a topic of active research and of great applicative interest. One of the first approaches towards a multivariate thinning mechanism is the one by McKenzie (1988). In a first step, binomial thinning is generalized to where is a vector of probabilities with . The are i.i.d. random vectors being independent of , which are multinomially distributed according to . Obviously, corresponds to the binomial thinning operator in 1.1. In the second step, a matrix with column vectors as above is defined. If the random vector has range contained in , then
is also a random vector having a range contained in . It is assumed that all thinning operators are mutually independent and independent of . Under these assumptions, the thinning operator 4.1 is referred to as multinomial thinning. Like for binomial thinning, for the multinomial thinning it also holds that and Multinomial thinning operators were used by McKenzie (1988) to construct types of multivariate Poisson ARMA models. McKenzie's multinomial thinning approach was extended by Aly and Bouzar(1994b) by using a type of negative binomial thinning, also see Section 3.1.
Matrix-binomial thinning
Franke and Subba Rao (1993) introduced a multivariate INAR(1) model based upon independent binomial thinning operators. An extension for multivariate integer-valued autoregressive models based on generalized thinning operators (see Section 3.1) was proposed by Latour (1997). The thinning concept of Franke and Subba Rao (1993) is as follows: let be a random vector with values in . For a given -matrix with , the matrix thinning gives a -dimensional random vector with th component
where is a sequence of i.i.d. Bernoulli r.v's with probability of success . Furthermore, it is assumed that the thinning operators are performed independently of each other. Karlis and Pedeli (2013) and Pedeli and Karlis (2011); Pedeli and Karlis(2013a); Pedeli and Karlis(2013b) restrict to the diagonal case such that for , i.e., the marginals behave like the univariate binomial thinning operator in 1.1. However, this nice feature is obtained at the cost of no additional cross-correlation between and . On the contrary, if is allowed as in Franke and Subba Rao (1993) and Boudreault and Charpentier (2011), then the marginals of no longer behave like the univariate binomial thinning operators. Pedeli and Karlis (2011) introduced the bivariate INAR(1) (BINAR) model with bivariate Poisson and bivariate negative binomial innovations. Their model is defined as
where are assumed to be independent -valued random pairs. In order to also account for negative correlation between the time series, Karlis and Pedeli (2013) suggest to use an appropriate bivariate copula for the innovations. Note that, one of the drawbacks of the model 4.2 is that the autoregression matrix is diagonal, which means that the thinning operation causes no cross-correlation in the counts. The more general -variate INAR process based upon the assumption that is a non-diagonal matrix was introduced by Franke and Subba Rao (1993) and later by Boudreault and Charpentier (2011) and Pedeli and Karlis(2013c), as
for some -matrix with entries in , and some i.i.d. random vector taking values in . Franke and Subba Rao (1993) derived criteria ensuring the existence of a strictly stationary -variate INAR(1) process satisfying 4.3. Quoreshi (2006) applied the non-diagonal matrix-binomial thinning operators to define a multivariate version of the INMA model according to 2.5; stochastic properties of this model are derived for the bivariate case.
While matrix-binomial thinning was successfully applied to the case of count data time series with an infinite range, one observes difficulties if trying to extend the binomial AR model in 2.8 for the bivariate case. It is quite tempting to consider the following representation for ,
where . If the matrices and are diagonal ones (as in Pedeli and Karlis’ models), then no cross-correlation will be observed because no innovations are included in model recursion 4.4. On the other hand, if the matrices and are non-diagonal, then and , for , do not necessarily hold anymore, so it may happen that . Such kind of problems are avoided if the concept of bivariate binomial thinning is used.
Bivariate binomial thinning
A different generalization of the binomial thinning operator 1.1 for the bivariate case, based on the bivariate binomial distribution of type II (BVB) Kocherlakota and Kocherlakota (1992), has been recently introduced by Scotto et al. (2014). This new operator has a number of advantages compared to the approach of the previous Section 4.2, see the details below. Scotto et al. (2014) define the bivariate binomial thinning operator as
where with and defined on a proper set (see Scotto et al. (2014) p. 235). Note that for this operator, it follows that and , i.e., marginally, it behaves like the univariate binomial thinning operator. Furthermore, the bivariate binomial thinning operator induces cross-correlation as long as , and the additional cross-correlation may be both positive or negative provided that or , respectively. Note that the diagonal matrix thinning used by Karlis and Pedeli (2013) and Pedeli and Karlis (2011); Pedeli and Karlis(2013a); Pedeli and Karlis(2013b) is a special case of 4.5 with .
Bivariate binomial thinning can also be utilized for the case of finite counts. To tackle this task, Scotto et al. (2014) introduced the bivariate binomial AR(1) model (BVB-AR(1)), where the bivariate process with satisfies the following recursion where the thinning operators are performed independently of each other. Here, , and are the thinning parameters defined as and , where and for . It is important to stress that the marginals of the BVB-AR process are distributed like the usual binomial AR process according to 2.8.
A bivariate extension of negative binomial thinning
A bivariate extension of the random coefficient thinning approach (see Section 3.2) was first considered by Ristić et al.(2012b). Their approach is based on the negative binomial thinning operator ‘’ discussed in Section 3.1, defined as
where and are mutually i.i.d. random vectors with and , with . Hence, the operator 4.6 can also be expressed as a mixture of negative binomial thinnings. Ristić et al.(2012b) introduced a bivariate INAR(1) model for with positively correlated geometric marginals based on the thinning operator 4.6. This model can also be understood as some kind of mixture of two NGINAR(1) processes Ristić et al. (2009).
Signed matrix thinning
Bulla et al. (2012) introduced the so-called signed matrix (SM) thinning operator as an extension of the signed thinning operator in 3.7 for the bivariate case. The SM thinning operator is defined as , where represents the common distribution of , for . Furthermore, it is assumed that all counting sequences associated with ‘’, , are mutually independent.
Bulla et al. (2012) introduced the class of bivariate signed INAR(1) (B-SINAR) processes, which is an extension of the SINAR(1) process of Kachour and Truquet (2011) (see Section 3.4) to the bivariate case. A bivariate process is said to be a B-SINAR process if admits the representation
The case can be seen as an extension of Pedeli and Karlis’ model in 4.2 on . For this case, the authors assume that is modeled through a bivariate Skellam distribution Bulla et al. (2014).
Discussion
This article has presented a comprehensive review of the literature on thinning-based models for the analysis of univariate and multivariate integer-valued time series with finite and infinite support. We surveyed a wide range of thinning operators which, by construction, are useful for fitting integer-valued time series exhibiting features such as lack of stationarity, positive and negative autocorrelations, and also overdispersion. There are a number of further possibilities for future research in this area. We briefly highlight some of them.
Univariate models: Count models with long memory (in the covariance sense) need to be developed and studied in detail; examples of time series of counts exhibiting long memory can be found in stock transactions Brännäs and Quoreshi (2010); Quoreshi (2014). To develop schemes for detecting possible structural breaks in non-stationary integer-valued time series models is also an impending problem; here Davis et al. (2008) and Kashikar et al. (2013) provide a good starting point. Extensions of INARMA models to space-time INARMA models will be also highly desirable. This topic has been long overlooked in the literature although it has enumerable applications (e.g., number of arrivals to the emergency services at several hospital in the same area). Generally, the areas of -valued times series and finite-valued time series are still in their infancy.
Multivariate models: Though still facing many challenges, multivariate integer-valued time series modelling has attracted more attention in the recent years. Scotto et al. (2014) suggested to extend the INAR model 2.1 for the analysis of bivariate time series with an infinite range of counts by using the bivariate binomial thinning operator 4.5, i.e., This model includes the bivariate models of Karlis and Pedeli (2013) and Pedeli and Karlis (2011); Pedeli and Karlis(2013a); Pedeli and Karlis(2013b) as a special case but allows also for negative cross-covariance. Extensions of multivariate models to higher orders need to be considered, possibly by adapting the approach of Weiß(2008c). Moreover, periodic correlated bivariate and multivariate count models need to be developed. Potential applications of such models can be found in the analysis of fire activity.
Extreme value theory: An important topic for future work is to characterize the tail behaviour and the extremal properties of thinning-based models, and to look at large deviations of partial sums obtained from such models. Anderson (1970) gave a remarkable contribution by defining a particular class of discrete distributions for which the maximum term (under an i.i.d. setting) possesses an almost stable behaviour; extensions for certain stationary sequences were proposed by McCormick and Park (1992) and Hall (1996). The analysis of the extremal behaviour INMA-type models driven by innovations with distribution belonging either to Anderson's class and to the class of heavy-tailed distribution became a topic of lively research; details can be found in Turkman et al. (2014). However, much of the available literature on this topic has been restricted to models based on the binomial thinning operator. Besides analyzing the effect of other thinning operators, also extensions to characterize the extremes of INBL and SETINAR models will be very welcome, as well as the study of the tail behaviour and the large deviation of partial sums based on univariate and bivariate INMA and INBL models.
Footnotes
Acknowledgments
The authors thank the associate editor and the three referees for carefully reading the manuscript and for their valuable comments, which greatly improved the article. This work was supported by the European Regional Development Fund (FEDER) through the COMPETE programme and by the Portuguese government through the FCT, Fundação para a Ciência e a Tecnologia, in the scope of the project UID/MAT/ 04106/2013 (Centro de I&D em Matemática e Aplicações, cidma.mat.ua.pt/) and projects PEst-OE/EEI/UI0127/2014 and UID/CEC/00127/2013 (Instituto de Engenharia Electrónica e Informática de Aveiro, IEETA/UA, www.ieeta.pt). S. Gouveia acknowledges the postdoctoral grant by FCT (ref. SFRH/BPD/87037/2012). The authors are grateful to Dr Xanthi Pedeli, Athens University of Economics and Business (Greece) for contributing the accidents counts data for a,b.
References
1.
Al-OshMAAlzaidAA (1987) First order integer-valued autoregressive INAR process. Journal of Time Series Analysis, 8, 261–75.
Al-OshMAAlzaidAA (1991) Binomial autoregressive moving average models. Communications in Statistics –Stochastic Models, 7, 261–82.
4.
AlyEEAABouzarN (1994a) Explicit stationary distributions for some Galton-Watson processes with immigration. Communications in Statistics –Stochastic Models, 10, 499–517.
5.
AlyEEAABouzar,N (1994b) On some integer-valued autoregressive moving average models. Journal of Multivariate Analysis, 50, 132–51.
6.
AlzaidAAAl-OshMA (1990) An integer-valued th-order autoregressive structure (INAR()) process. Journal of Applied Probability, 27, 314–24.
7.
AlzaidAAAl-OshMA (1993) Generalized Poisson ARMA processes. Annals of the Institute of Statistical Mathematics, 45, 223–32.
8.
AlzaidAAOmairMA (2012) An extended binomial distribution with applications. Communications in Statistics –Theory and Methods, 41, 3511–27.
9.
AlzaidAAOmairMA (2014) Poisson difference integer valued autoregressive model of order one. Bulletin of the Malaysian Mathematical Sciences Society, 37, 465–85.
10.
AndersonCW (1970) Extreme value theory for a class of discrete distributions with applications to some stochastic processes. Journal of Applied Probability, 7, 99–113.
11.
AnderssonJKarlis,D (2014). A parametric time series model with covariates for integers in . Statistical Modelling, 14, 135–56.
BarczyMIspányMPapG (2014) Asymptotic behaviour of conditional least squares estimators for unstable integer-valued autoregressive models of order 2. Scandinavian Journal of Statistics, 41, 866–92.
17.
Barreto-SouzaWBourguignonM (2015) A skew INAR process on . AStA –Advances in Statistical Analysis, 99, 189–208.
18.
BiswasASongPXK (2009) Discrete-valued ARMA processes. Statistics and Probability Letters, 79, 1884–89.
BrännäsKQuoreshiAMMS (2010) Integer-valued moving average modelling of the number of transactions in stocks. Applied Financial Economics, 20, 1429–40.
25.
BrännäsKHellströmJNordströmJ (2002) A new approach to modelling and forecasting monthly guest nights in hotels. International Journal of Forecasting, 18, 19–30.
26.
BuRMcCabeBPMHadriK (2008). Maximum likelihood estimation of higher-order integer-valued autoregressive processes. Journal of Time Series Analysis, 29, 973–94.
27.
BullaJChesneauCKachourM (2012) A bivariate first-order signed integer-valued autoregressive process. Submitted.
28.
BullaJChesneauCKachourM (2014) On the bivariate Skellam distribution. Communications in Statistics –Theory and Methods. To app ear.
29.
ChesneauCKachourM (2012) A parametric study for the first-order signed integer-valued autoregressive process. Journal of Statistical Theory and Practice, 6, 760–82.
30.
CuiYLundR (2010) Inference in binomial AR models. Statistics and Probability Letters, 80, 1985–90.
31.
DavisRALeeTRodriguez-YamG (2008) Break detection for a class of nonlinear time series models. Journal of Time Series Analysis, 29, 834–67.
32.
DoukhanPLatourAOraichiD (2006) A simple integer-valued bilinear time series model. Advances in Applied Probability, 38, 559–78.
33.
DrostFC van denAkkerRWerkerBJM (2008) Note on integer-valued bilinear time series models. Statistics and Probability Letters, 78, 992–96.
34.
DrostFC van denAkkerRWerkerBJM (2009) Efficient estimation of auto-regression parameters and innovation distributions for semiparametric integer-valued AR() models. Journal of the Royal Statistical Society: Series B, 71, 467–85.
35.
DuJGLiY (1991) The integer valued autoregressive (INAR()) model. Journal of Time Series Analysis, 12, 129–42.
36.
FerlandRLatourAOraichiD (2006) Integer-valued GARCH processes. Journal of Time Series Analysis, 27, 923–42.
37.
FokianosK (2012) Count time series models. In T Subba Rao S Subba Rao and CR Rao(eds). Handbook of statistics 30: Time series –methods and applications, pp. 315–47. Amsterdam: Elsevier B.V.
38.
FokianosKTjøstheimD (2012) Nonlinear Poisson autoregression. Annals of the Institute of Statistical Mathematics, 64, 1205–25.
39.
FokianosKRahbekATjøstheimD (2009) Poisson autoregression. Journal of the American Statistical Association, 104, 1430–39.
40.
FrankeJSubbaRao T (1993) Multivariate first-order integer-valued autoregression. Technical report, Universität Kaiserslautern.
41.
FreelandRK (2010) True integer value time series. AStA –Advances in Statistical Analysis, 94, 217–29.
42.
FreelandRKMcCabeBPM (2004) Forecasting discrete valued low count time series. International Journal of Forecasting, 20, 427–34.
43.
GhodsiAShitanMBakouchHS (2012) A first-order spatial integer-valued autoregressive SINAR model. Communications in Statistics –Theory and Methods, 41, 2773–87.
44.
GomesDCantoeCastro L (2009) Generalized integer-valued random coefficient for a first order structure autoregressive (RCINAR) process. Journal of Statistical Planning and Inference, 139, 4088–97.
45.
GrunwaldGKHyndmanRJTedescoL (1997) A unified view of linear AR models. Research report, Department of Statistics, University of Melbourne.
46.
HallA (1996) Maximum term of a particular autoregressive sequence with discrete margins. Communications in Statistics –Theory and Methods, 25, 721–36.
47.
HallA (2001) Extremes of integer-valued moving averages models with regularly varying tails. Extremes, 4, 219–39.
48.
HallAScottoMGCruzJP (2010) Extremes of integer-valued moving average sequences. Test, 19, 359–74.
49.
HeinenA (2003) Modelling time series count data: an autoregressive conditional Poisson model. CORE Discussion Paper 2003/62.
50.
HudecováSHuškováMMeintanisSG (2015) Tests for time series of counts based on the probability-generating function. Statistics, 49, 316–37.
51.
JacobsPALewisPAW (1983) Stationary discrete autoregressive-moving average time series generated by mixtures. Journal of Time Series Analysis, 4, 19–36.
52.
JaziMAJonesGLaiCD (2012) First-order integer valued AR processes with zero inflated Poisson innovations. Journal of Time Series Analysis, 33, 954–63.
53.
JoeH (1996) Time series models with univariate margins in the convolution-closed infinitely divisible class. Journal of Applied Probability, 33, 664–77.
54.
JørgensenBSongPXK (1998) Stationary time-series models with exponential dispersion model margins. Journal of Applied Probability, 35, 78–92.
55.
JungRCTremayneAR (2003) Testing for serial dependence in time series models of counts. Journal of Time Series Analysis, 24, 65–84.
56.
JungRCTremayneAR (2011a) Useful models for time series of counts or simply wrong ones?AStA –Advances in Statistical Analysis, 95, 59–91.
57.
JungRCTremayneAR (2011b) Convolution-closed models for count time series with applications. Journal of Time Series Analysis, 32, 268–80.
58.
KachourMTruquetL (2011) A -order signed integer-valued autoregressive (SINAR()) model. Journal of Time Series Analysis, 32, 223–36.
59.
KarlisDPedeliX (2013) Flexible bivariate INAR processes using copulas. Communications in Statistics –Theory and Methods, 42, 723–40.
60.
KashikarASRohanNRamanathanTV (2013) Integer autoregressive models with structural breaks. Journal of Applied Statistics, 40, 2653–69.
KocherlakotaSKocherlakotaK (1992) Bivariate discrete distributions. Marcel Dekker, Inc.
64.
LatourA (1997) The multivariate GINAR process. Advances in Applied Probability, 29, 228–48.
65.
LatourA (1998) Existence and stochastic structure of a non-negative integer-valued autoregressive processes. Journal of Time Series Analysis, 4, 439–55.
McCabeBPMMartinGMHarrisD (2011) Efficient probabilistic forecasts for counts. Journal of the Royal Statistical Society: Series B, 73, 253–72.
69.
McCormickWPParkYS (1992) Asymptotic analysis of extremes from autoregressive negative binomial processes. Journal of Applied Probability, 29, 904–20.
70.
McKenzieE (1985) Some simple models for discrete variate time series. Water Resources Bulletin, 21, 645–50.
71.
McKenzieE (1988) Some ARMA models for dependent sequences of Poisson counts. Advances in Applied Probability, 20, 822–35.
72.
MonteiroMPereiraIScottoMG (2008) Optimal alarm systems for count processes. Communications in Statistics –Theory and Methods, 37, 3054–76.
73.
MonteiroMScottoMGPereiraI (2010) Integer-valued autoregressive processes with periodic structure. Journal of Statistical Planning and Inference, 140, 1529–41.
74.
MonteiroMScottoMGPereiraI (2012) Integer-valued self-exciting threshold autoregressive processes. Communications in Statistics –Theory and Methods, 41, 2717–37.
75.
NastićASRistićMMBakouchHS (2012) A combined geometric INAR() model based on negative binomial thinning. Mathematical and Computer Modelling, 55, 1665–72.
76.
NastićASRistićMMDjordjevićMS (2014) An INAR model with discrete Laplace marginal distributions. Brazilian Journal of Probability and Statistics. Forthcoming.
77.
PedeliXKarlisD (2011) A bivariate INAR process with application. Statistical Modelling, 11, 325–49.
78.
PedeliXKarlisD (2013a) On composite likelihood estimation of a multivariate INAR model. Journal of Time Series Analysis, 34, 206–20.
79.
PedeliXKarlisD (2013b) On estimation of the bivariate Poisson INAR process. Communications in Statistics –Simulation and Computation, 42, 514–33.
80.
PedeliXKarlisD (2013c) Some properties of multivariate INAR processes. Computational Statistics and Data Analysis, 67, 213–25.
81.
PedeliXDavisonACFokianosK (2014) Likelihood estimation for the INAR(1) model by saddlepoint approximation. Journal of the American Statistical Association. Forthcoming.
82.
PegramGGS (1980) An autoregressive model for multilag Markov chains. Journal of Applied Probability, 17, 350–62.
83.
PuigPValeroJ (2007) Characterization of count data distributions involving additivity and binomial subsampling. Bernoulli, 13, 544–55.
84.
QuoreshiAMMS (2006) Bivariate time series modelling of financial count data. Communications in Statistics –Theory and Methods, 35, 1343–58.
85.
QuoreshiAMMS (2014) A long-memory integer-valued time series model INARFIMA, for financial application. Quantitative Finance, 14, 2225–35.
86.
RistićMMBakouchHSNastićAS (2009) A new geometric first-order integer-valued autoregressive (NGINAR) process. Journal of Statistical Planning and Inference139, 2218–26.
87.
RistićMMNastićASBakouchHS (2012a) Estimation in an integer-valued autoregressive process with negative binomial marginals (NBINAR(1)). Communications in Statistics –Theory and Methods, 41, 606–18.
88.
RistićMMNastićASJayakumarKBakouchHS (2012b) A bivariate INAR time series model with geometric marginals. Applied Mathematics Letters, 25, 481–85.
89.
RistićMMNastićASMiletićIlić AV (2013) A geometric time series model with dependent Bernoulli counting series. Journal of Time Series Analysis, 34, 466–76.
90.
RoitershteinAZhongZ (2013) On random coefficient INAR processes. Science China Mathematics, 56, 177–200.
SchweerSWeißCH (2014) Compound Poisson INAR(1) processes: stochastic properties and testing for overdispersion. Computational Statistics and Data Analysis, 77, 267–84.
93.
SilvaISilvaMEPereiraISilvaN (2005) Replicated INAR processes. Methodology and Computing in Applied Probability, 7, 517–42.
94.
SilvaMEOliveiraVL (2004) Difference equations for the higher order moments and cumulants of the INAR model. Journal of Time Series Analysis, 25, 317–33.
95.
SteutelFWvanHarn K (1979) Discrete analogues of self-decomposability and stability. Annals of Probability, 7, 893–99.
96.
SunJMcCabeBPM (2013) Score statistics for testing serial dependence in count data. Journal of Time Series Analysis, 34, 315–29.
97.
TjøstheimD (2012). Some recent theory for autoregressive count time series. Test, 21, 413–38.
98.
TongH (1990) Non-linear time series. Oxford: Science Publications, Oxford.
99.
TriebschLK (2008) New integer-valued autoregressive and regression models with state-dependent parameters. Doctoral dissertation, TU Kaiserslautern, Verlag Dr. Hut, Munich.
100.
TurkmanKFScottoMGdeZeaBermudezP (2014) Non-linear time series: extreme events and integer value problems. Switzerland: Springer-Verlag.
101.
WeißCH (2008a) Thinning operations for modelling time series of counts –a survey. AStA –Advances in Statistical Analysis, 92, 319–41.
102.
WeißCH (2008b) Serial dependence and regression of Poisson INARMA models. Journal of Statistical Planning and Inference, 138, 2975–90.
103.
WeißCH (2008c) The combined INAR() models for time series of counts. Statistics and Probability Letters, 78, 1817–22.
104.
WeißCH (2009) A new class of autoregressive models for time series of binomial counts. Communications in Statistics –Theory and Methods, 38, 447–60.
105.
WeißCH (2013) Integer-valued autoregressive models for counts showing underdispersion. Journal of Applied Statistics, 40, 1931–48.
106.
WeißCH (2015). A Poisson INAR(1) model with serially dependent innovations. Metrika. Forthcoming.
107.
WeißCHKimHY (2013a) Binomial AR processes: moments, cumulants, and estimation. Statistics, 47, 494–10.
108.
WeißCHKimHY (2013b) Parameter estimation for binomial AR models with applications in finance and industry. Statistical Papers, 54, 563–90.
109.
WeißCHKimHY (2014) Diagnosing and modelling extra-binomial variation for time-dependent counts. Applied Stochastic Models in Business and Industry, 30, 588–608.
WeißCHPollettPK (2014) Binomial autoregressive processes with density dependent thinning. Journal of Time Series Analysis, 35, 115–32.
112.
ZhangHWangDZhuF (2010) Inference for INAR processes with signed generalized power series thinning operator. Journal of Statistical Planning and Inference, 140, 667–83.
113.
ZhangHWangD (2015) Inference for random coefficient INAR process based on frequency domain analysis. Communications in Statistics –Simulation and Computation, 44, 1078–100.
114.
ZhengHBasawaIVDattaS (2006) Inference for th-order random coefficient integer-valued autoregressive processes. Journal of Time Series Analysis, 27, 411–40.
115.
ZhengHBasawaIVDattaS (2007) First-order random coefficient integer-valued autoregressive processes. Journal of Statistical Planning and Inference, 173, 212–29.
116.
ZhuRJoeH (2003) A new type of discrete self-decomposability and its applications to continuous-time Markov processes for modelling count data time series. Stochastic Models, 19, 235–54.
117.
ZhuRJoeH (2006) Modelling count data time series with Markov processes based on binomial thinning. Journal of Time Series Analysis, 27, 725–38.
118.
ZhuRJoeH (2010) Negative binomial time series models based on expectation thinning operators. Journal of Statistical Planning and Inference, 140, 1874–88.