Abstract
Tests of homogeneity in zero-inflated models for count data have been well discussed in the literature, but the existing methodologies have relied primarily on score statistics. An often cited justification for the use of these statistics is that they do not require the model to be fitted under the alternative. But the advent of computer software with robust functions and procedures has made it easy for these alternative models to be fitted routinely in practice. In this article, we exploit this opportunity by using results generated from these analyses to develop a Wald test statistic to evaluate homogeneity in the class of zero-inflated models. We show how the proposed test can be performed with minimal programming effort for the practicing statistician. Technically, the test is based on a reparameterization of the mixing weights that naturally incorporates covariates under heterogeneity, a characteristic often ignored by existing testing procedures. A quasi-likelihood function derived from a working independence model coupled with the so-called sandwich estimator of the covariance matrix of the parameter estimates is used to accommodate correlation in the data. A simulation study is conducted to evaluate the empirical performance of the proposed Wald test and its real life applications are illustrated using correlated dental caries counts in young children.
Keywords
Introduction
Count data with excess zeros relative to generalized linear models are common in population-based studies. A well accepted and established statistical technique to analyze such data is the use of zero-inflated models. These models view the data as being generated from a model resulting from the mixture of a distribution that places all its mass at zero and a non-degenerate distribution. Under this mixture model, the zero counts are generated from two distribution sources, resulting in a heterogeneous population. In this article, the term heterogeneity refers to this mixture in distribution, for which the extent is measured by the associated mixing weight. Likewise, the term homogeneity will be used to characterize a population generated from a non-degenerate distribution. In other words, under the homogeneous model, the mixing weight is equal to zero.
The choice of the non-degenerate distribution for the most part depends on the nature of counts under consideration. For bounded counts, a binomial distribution is typically used (Hall, 2000). In the case of unbounded counts, Poisson and negative binomial distributions are the standard (Böhning et al., 1999). Ridout et al. (1998) and references therein provide an extensive review of this literature.
In practice, it is often the case for count data with potentially extra zeros to be correlated due to the longitudinal or hierarchical structure of the study design. Extensions of zero-inflated models to accommodate correlated count data are well established. These correlated data models can be broadly classified into two categories: (1) models that explicitly model the association in the data using random effects terms, and as such allow a full likelihood approach to parameter estimation (Hall, 2000; Lee et al., 2006); and (2) models that treat association in the data as a nuisance but use the so-called sandwich estimator of the covariance matrix of the parameter estimates to correct for any misspecification of the working correlation model (Hall and Zhang, 2004; Hall and Shen, 2010).
In many applications of zero-inflated models for count data, it is often of interest to evaluate whether the inherent heterogeneity is consistent with observed data. Existing testing methodologies have relied primarily on constant alternatives under heterogeneity (Broek, 1995; Xiang et al., 2006). Although valuable, such a restrictive approach may lead to inconsistent estimates of the parameters of interest and may suffer from reduced power in detecting heterogeneity that varies with covariate profiles. Most importantly, existing testing procedures often rely on score statistics (van den Broek, 1995; Jansakul and Hinde, 2009; Todem et al., 2012). An often cited motivation for the use of score test statistics is that they only require the model to be fitted under the null hypothesis. But the advent of computer software with robust functions and procedures has made it easy for the alternative models to be fitted routinely in practice. As a result, conducting a Wald test may require significantly less programming effort than conducting a score test. Another competing testing procedure is the likelihood ratio test, which like the Wald test enjoys the ease of implementation in real life applications. But in settings where a likelihood is not assumed and a general estimating function is used for model estimation, this testing procedure is not an option. In this article we exploit estimation results generated from the computer software to develop a Wald test statistic to evaluate homogeneity in this class of models. Technically, the test is based on a reparameterization of the mixing weights that naturally incorporates covariates into the mixing weights under the alternative, a characteristic often ignored by the existing testing procedures. We extend the methodology to account for the correlation in the data using a quasi-likelihood function derived from a working independence model coupled with the so-called sandwich estimator of the covariance matrix of the parameter estimates. From a practical perspective, we show how the proposed test can be performed with minimal programming effort for the applied statistician.
The work proposed in this article is motivated by dental caries data (discussed in Section 4.2) generated from a unique three-wave longitudinal study designed to collect oral health information on low-income African–American children under the age of 6 and their main caregivers residing in the city of Detroit. Todem et al. (2012) used these data to show how a test that neglects parameter heterogeneity under the alternative could fail to reject homogeneity in the population. They conducted their analysis using the first wave data and found that these data exhibit, relatively to a negative binomial model, an inflation of zeros for younger children and a deflation of zeros for older children that are roughly of the same magnitude. Thus, by averaging over the covariate space, a testing procedure based on constant mixing weights had virtually no power in detecting heterogeneity in this population. In this article, we develop a testing procedure to detect such forms of heterogeneity across the three waves of these dental caries data.
In Section 2, we describe the marginal representation of zero-inflated models that naturally accommodates both inflation and deflation of zeros in count data, and extend the model to correlated data. A Wald test statistic for homogeneity against varying alternatives is constructed and described in Section 3. In Section 4, we conduct a numerical study to evaluate the finite sample properties of the test statistic, and apply the methodology to evaluate homogeneity of dental caries scores in children from the Detroit longitudinal study. Some concluding remarks are given in Section 5. Finally, a sample SAS programme that can be used to fit the zero-inflated model under its marginal representation and conduct the test is given in the Appendix.
Zero-inflated models for correlated count data
The general data structure of interest is similar to that of many correlated data. There are n independent clusters indexed by i (i = 1, …, n), and within each cluster i there are λi correlated observations indexed by j (j = 1, …, λi). Let Yij be the random variable representing the count response for subunit j in cluster i drawn from M, a mixture of an atom distribution at 0 and an unknown non-degenerate discrete distribution H governed by a finite dimensional parameter vector
Here hij(yij; θ) represents the probability mass associated with the distribution H at observed response yij. Parameter ϕij measures the extent of heterogeneity in the population and its support depends on the representation of the mixture model. Under the hierarchical representation of the model, ϕij is a probability mass requiring the distributional constraints 0 ≤ ϕij ≤ 1, j = 1, …, λi, i = 1, …, n. But the marginal representation of the model only requires the condition 0 ≤ Pr(Yij = yij) ≤ 1 which results in constraints on mixing weights,
The constraint in (2.2) simply allows the model to accommodate not only inflation (positive ϕij) but also deflation (negative ϕij), a property that the hierarchical representation of the model, implied by the classical constraints 0 ≤ ϕij ≤ 1, does not have. In this article, we will focus on the marginal representation of the model which has the ability to accommodate both inflation and deflation at zero (Dietz and Böhning, 2000). We follow Todem et al. (2012) and adopt the transformation,
where πij ∈ [0, 1] represents the probability at 0 under the model in (2.1). One appealing feature of this transformation is that it naturally embeds the mixing weight constraints into the model, therefore avoiding the need for a constrained optimization.
Regarding the dependency in the data, one popular approach to accommodate the inherent within-cluster association consists of assuming random effects coupled with the so-called conditional independence assumption to tie together all observations from the same cluster (Lee et al., 2006; Xiang et al., 2006; Xiang et al., 2007). This approach is full likelihood and explicitly models the association in the data, which can be cumbersome in some practical settings. Alternatively, a quasi-likelihood that treats the within-cluster association as a nuisance but generates consistent estimates of parameters of interest is often advocated when the full likelihood is intractable. In this article, we adopt such an approach and consider a working independence model (WIM) for which the contribution of cluster i to the quasi-likelihood function is:
where
where
In general, we have
We are interested in evaluating the model H against the model M in (2.1) coupled with the constraints in (2.2). We are specifically interested in the two-sided hypotheses,
where
Evaluating these hypotheses is not trivial, owing to their high dimensional nature under the alternative. To address this curse of dimensionality, existing inferential procedures typically neglect parameter heterogeneity by assuming
In this article, we consider a general formulation where the parameters ϕij, j = 1, …, λi, i = 1, …, n are allowed to depend on covariates. Such an approach has been proposed by Jansakul and Hinde (2009) who related the mixing weight to covariates via a regression technique. A limitation however is that the identity link function was used and estimation under the alternative model required a constrained optimization to satisfy conditions in (2.2). To avoid this numerical complexity, we adopt the transformation in (2.3) which naturally incorporates covariates into the testing scheme. Under this transformation, the hypotheses under evaluation become,
where
To allow ϕij to depend on covariates, we relate πij and hij (0; i) to covariate vectors xij and zij, respectively, of dimensions r and s, observed alongside the count outcomes yij, j = 1, …, λi, i = 1, …, n. Covariate xij is related to the probability hij (0; i) through the regression model
where γ* and β* represent the true values of γ and β. For the Poisson model, g(.) is known, but is in general unknown. When H is a negative binomial process, g(.) is a parametric function indexed by the overdispersion parameter of the distribution H. A general form of these transformations can be found in Todem et al. (2012). In the analysis of dental caries data, we give an explicit form of this transformation when H is a negative binomial process.
Based on (2.6), the new hypothesis formulation can be further rewritten as a linear combination of elements of the parameter vector
A Wald test statistic of homogeneity that accommodates correlated count data is,
where
Computations
Statistical software with robust optimization functions and subroutines can be used to fit the zero-inflated model under the marginal representation described in Section 2 and to perform the proposed Wald test. For users familiar with SAS® software (SAS Institute, Cary, NC, USA), the procedure PROC NLMIXED provides a convenient module to compute the sandwich estimator of the variance-covariance matrix of the model estimates using the ‘EMPIRICAL’ option. This option, however, requires the practicing statistician to specify random effects terms in the model formulation. To circumvent this requirement, the user can elect to use one random effect term for each cluster and request that only one non-adaptive Gaussian quadrature point can be used for the numerical integration by using options ‘QPOINTS=1’ and ‘NOAD’. With these specifications and restrictions, only the point zero is used for the numerical integration of the full likelihood, resulting in the WIM likelihood. Once the model is fitted and the sandwich variance-covariance matrix estimate obtained, the Wald statistic can be readily obtained by multiplying the observed F statistic under the ‘CONTRAST’ statement by the rank of the contrast matrix. An illustration of this implementation in SAS® is provided in the Appendix. An alternative approach is to output model estimates (parameter estimates and corresponding variance-covariance matrix estimate) to a dataset and use any matrix oriented software such as PROC IML to compute the Wald statistic and perform the test.
Application to the Detroit Dental Caries data
Study description
We apply the proposed methodology to data generated from a longitudinal survey conducted by the Detroit Center for Research on Oral Health Disparities, hosted at the University of Michigan, Ann Arbor. This is a unique three-wave study designed to collect oral health information on low-income African–American children (0–5 years) and their main caregivers (14+ years), living in the city of Detroit. The overall goal of the study is to promote oral health and reduce its disparities within the community of low-income African Americans through the understanding of the social, familial, biological, and neighbourhood determinants of dental caries and periodontal disease. Data on oral examinations involving assessment of dental caries, periodontal disease, oral cancer and oral hygiene were collected during each of the three waves of examinations and interviews completed between 2002–2007. For our illustration, we focus on dental caries scores which represent the cumulative severity of tooth decay for each surveyed child. Specifically, we focus on the number of decayed surfaces (DS) showing signs of clinically detectable enamel lesions comprising both noncavitated and cavitated lesions. These dental caries scores have well documented shortcomings but they continue to be instrumental in evaluating and comparing the risks of dental caries across population groups (Lewsey and Thomson, 2004). The DS score represents the generic response variable Yij introduced in earlier sections. Considered covariates include AgeB (child’s age at baseline in years), SC (child’s daily sugar consumption, a wave dependent variable), Wave2 (second wave indicator), and Wave3 (third wave indicator). A more detailed description of the study can be found in the papers of Sohn et al. (2007) and Ismail et al. (2011).
Analysis of the Detroit Caries Indices in early childhood
Our numerical computations are based on data collected on 1,021 surveyed children. Dental caries scores collected across children are assumed independent but those collected on the same child across the three waves are correlated. It is a common practice to use zero-inflated negative binomial models to describe dental caries scores in children (Böhning et al., 1999). In this article, we are interested in evaluating whether or not the inclusion of mixing weights is consistent with observed data. We first fit a zero-inflated negative binomial model with constant mixing weight

The analyses above then draw into question the efficiency of models that ignore this varying heterogeneity under alternatives. We fit the proposed zero-inflated negative binomial model with a covariate-dependent mixing weight,
where γ = (γ0, γ1, …, γ7)
t
. Note here that ϕij = xij and the regression model
The fitted‡ homogeneous, constant mixing weight and covariate-dependent mixing weight models to longitudinal dental caries data
*: p-value < 0.05, ϕ-test
T: constant mixing weight
QIC
We conducted a numerical study to evaluate the finite sample performances of the proposed Wald test statistic W
n
for homogeneity. These performances are compared to those of the Wald test statistic
To keep the simulation simple, unlike in the real data analysis, the homogeneous model is a Poisson process. To simplify the data generating mechanism for the correlation model, the systematic components of the data such as the true mean of the homogeneous Poisson model and the true mixing weights may vary with each cluster i but not with the subunit j. In order words, the covariates are allowed to be cluster level but not individual level. For example, the true mean of the Poisson process takes the form
To induce correlation among the data Yi1, Yi2, …, Yim from the same unit i, we use independent Bernoulli variables coupled with independent zero-inflated Poisson processes. Let Ui be an independent random variable with the same distribution as YijIND and aij be an independent Bernoulli variable with success probability t ∈ (0, 1). Define
A popular alternative for introducing correlation in the data is the use of random effects coupled with the so-called conditional independence assumption (see, for example, Hedeker et al., 1994; Lee et al., 2006; Xiang et al., 2006). This approach is not chosen here because of its potential effects on the mean structure when random effects are integrated out. The mechanism adopted here introduces correlation in the data without affecting the marginal mean structure. This is only achieved in our scheme when the mean structure is constant across the index j.
In our simulations, we perform the proposed Wald test assuming the working mixing weight
Finally, all simulations are replicated 1,000 times for sample sizes n = 100, 200 and 400 with m = 3 replicates within each cluster.
To study the empirical size of the proposed tests, we generate data under the null hypothesis by setting
Empirical size and power of Wald test statistics to detect constant mixing weight
at 5% significance level, and integrated mean squared errors (IMSEs) for estimating
using the constant mixing weight model and the proposed covariate-dependent mixing weight model
Empirical size and power of Wald test statistics to detect constant mixing weight
Empirical size and power of Wald test statistics to detect various specifications of
It is worth noting that the power of the tests also improves when the two components are well separated (as b* increases) and with increasing sample sizes.
Finally, to evaluate the impact of the degree of correlation on the Wald tests, we perform the tests using data generated under different within-cluster correlations (t2), taking value in the set {0.25, 0.49, 0.81}. The results are given in Table 4. The empirical sizes appear not to be affected by the degree of correlation, however as we expected, the empirical powers of all considered tests are affected by t2. The empirical power increases as the true correlation decreases, regardless of the specification of the true mixing weight. Similar findings have been reported in the literature (see, for example, Fitzmaurice, 1995).
Empirical size and power of Wald test statistics to detect various specifications of
The issue of evaluating the homogeneity hypothesis in zero-inflated models has attracted considerable attention in the literature, but most of this debate has focused primarily on restrictive assumptions on the form of the alternative. One common limitation is that constant mixing weights are often assumed under heterogeneity. This article extends the literature by developing a Wald test statistic to detect heterogeneity that varies with the covariate profile in correlated count data. Most importantly, it shows how the constraints on the mixing weight under the proposed zero-inflated model that accommodates both inflation and deflation of zeros in the population can be naturally embedded into the testing procedure. We note however that there may be a slight deterioration of power when the true mixing weight does not depend on covariates. Given that the true data generating mechanism is usually unknown to the analyst, a general approach that captures various forms of heterogeneity in the population is the most conservative data analytic strategy.
It is worth noting that the zero-inflated model coupled with the transformation in (2.3) simply reduces to a two-part model where one part is a binary outcome model and another part is a truncated count model (see Cameron and Trivedi, 1998, pp. 123–25). However, unlike in the classical hurdle model, which is also a two-part model, the two components of the proposed model potentially share some parameters. For example, in the analysis of the Detroit data, the ‘hurdle’ probability πij depends on the overdispersion parameter of the negative binomial model, which violates the assumption of typical hurdle models. But as discussed in the article, such a restriction has the elegant property that both inflation and deflation can be evaluated against the homogeneous model.
Finally, one fundamental assumption of our testing methodology is that the mixing parameters are linearly related to covariates through the proposed transformation. Such an assumption may not be tenable in settings where the mixing weights are related to covariates through an unspecified function. For such a complication, smoothing techniques such as generalized additive models and spline models can be used to judiciously estimate the underlying relationship between the mixing weight and covariates (see, for example, Engle et al., 1986; Barry and Welsh, 2002; Xue et al., 2004; Lam et al., 2006). This extension and other generalizations of the methodology are outside of the scope of this article and are subject of further research.
Footnotes
Acknowledgements
The authors thank the two referees for their constructive comments and suggestions. We are also grateful to Amid Ismail for his permission to use the dental caries data. This work was supported by Dr Todem’s NCI/NIH K-award, 1K01 CA131259 and its supplement from the 2009 ARRA funding mechanism.
