Abstract
The heterogeneous choice model (HCM) has been proposed as an extension of the standard logit and probit models, which allows taking into account different error variances of explanatory variables. In this note, I show that in an important special case, this model is just another way to specify an interaction effect.
The heterogeneous choice model (HCM) has been proposed as an extension of the standard logit and probit models (Williams 2009). In this note, I show that in an important special case, this model is just another way to specify an interaction effect.
For developing the argument, I refer to a logit model. Let Y denote a binary variable. The logit model makes the distribution of Y dependent on values of explanatory variables:
where X is a vector of explanatory variables with corresponding parameter vector β
x
, Z is a further explanatory variable with parameter β
z
, α is a constant, and
In equation (1), L is not used as a distribution function for a corresponding random variable but as a function for linking covariates to the distribution of Y, that is, a binomial distribution characterized by a single parameter. The HCM starts from a different model that relates to a latent variable, say Y*, which is connected with Y through
Assuming that
where it is now assumed that the residual variance depends on the value of Z (
I now consider the special case where Z is a binary variable. Instead of equation (4), one can consider a logit model with an interaction term xz (being a vector of interaction terms if x is a vector):
Both models are equivalent, meaning that they entail the same conditional probabilities. This can be achieved by setting
In fact, these relationships are implicitly satisfied when estimating the models with maximum likelihood. They also entail that one cannot add the
How to interpret the equivalence of equations (4) and (5) depends on the intended analysis. I suppose that one is interested in investigating how a binary variable, Y, depends on another binary variable (representing, e.g., two groups). The interest concerns the dependence of the probabilities
Given a significant interaction parameter, one might ask whether the HCM suggests a different interpretation. The answer depends on the understanding of the HCM. I first assume that the HCM is taken as a model for the binary variable Y as formulated in equation (4). Then, if X and Z interact in model (5), the same is true for model (4). Of course, both models provide different parameter values, but this is just a consequence of a different parameterization of the same model. For the research interest mentioned previously, it is only important that both parameterizations entail identical conditional probabilities.
Now I assume that the HCM is taken as a model for the latent variable Y* as formulated in equation (3). Referring to the conditional expectation of Y*, that is
suggests the conclusion that the effect of X on Y* is independent of Z. However, the interest eventually concerns the effect of X on Y, not on Y*, and the effect of X on Y depends not only on the expectation of Y* but also on its variance. Consequently, if the variance of Y* depends on Z, as assumed by the HCM, one immediately gets an interaction between X and Z (as already shown by the equivalence of equations (4) and (5)).
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
