Abstract
Assessments with a large amount of small, similar, or often repetitive tasks are being used in educational, neurocognitive, and psychological contexts. For example, respondents are asked to recognize numbers or letters from a large pool of those and the number of correct answers is a count variable. In 1960, George Rasch developed the Rasch Poisson counts model (RPCM) to handle that type of assessment. This article extends the RPCM into the world of diagnostic classification models (DCMs) where a Poisson distribution is applied to traditional DCMs. A framework of Poisson DCMs is proposed and demonstrated through an operational dataset. This study aims to be exploratory with recommendations for future research given in the end.
Diagnostic classification models (DCMs) classify examinees into groups of possessing or not possessing a set of attributes based on their item responses. New DCMs are burgeoning every day to handle complex and/or unique needs of assessments in educational and behavioral sciences. In this article, we consider situations where test scores are count variables (e.g., Doebler et al., 2014; Jansen, 1997; Spray, 1990). For example, examinees are asked to circle correctly spelled words in a 200-word test, and their scores are the number of words they correctly pick out. Another example is that examinees read aloud a passage and the number of errors is counted (Rasch, 1960). It could also be that examinees are asked to check yes/no from 50 simple conditions regarding stress and anxiety such as blush, nightmare, diarrhea, and sweat, and their scores are the number of conditions that they select (Taylor, 1953). A shared feature of the aforementioned situations is that the number of items is large, and items are either the same or very similar.
Current DCMs are item-based models where the unit of analysis is each examinee’s response on each item, just like most item response theory (IRT) models. As a result, each item is associated with at least two parameters under a DCM. This happens when the item only measures one attribute, is modeled under the simplest DCM, and is scored in a binary fashion. When an item measures more attributes, is modeled under a more complex DCM, or scored in a polytomous fashion, the item could be associated with 10, 20, or even more parameters. Using existing DCMs to model the example situations above brings in an enormously large amount of and unnecessary parameters. Under those situations, it may be more appropriate to model the counts of total successes rather than the responses on each individual item. Counts are often modeled as coming from a Poisson distribution. The purpose of this article is to develop a family of Poisson DCMs (PDCMs) where counts of successes or errors on a set of tasks is modeled. The family of PDCMs closely mirrors the Rasch Poisson counts model (RPCM; Rasch, 1960) in IRT and extends its basic structure into the DCMs. Before introducing the PDCMs, we first review the basics of the Poisson distribution and the RPCM in the next section.
Model Development
Poisson Distribution
In 1830, Siméon-Denis Poisson, a French mathematician, developed a function to describe the number of times an event could happen in a fixed time period when you know how often the event has happened. That function is known as the Poisson distribution. The Poisson distribution defines the probability that the event happens s times as
where
Rasch Poisson Counts Model
In 1960, George Rasch applied the Poisson distribution to model the number of errors students make during oral reading tasks and thus developed the RPCM. Although it is one of the earliest models that Rasch has developed, the RPCM has received less attention than other binary or polytomous Rasch models. Suppose we ask students to read 200 words in 10 blocks where each block has 20 words, and we record the number of successes in each block. In the RPCM, the unit of analysis is no longer students’ correct/incorrect responses on each word; instead, an item i is defined as a block, and a student’s score s on item i is the total number of successes in that block. In other words, we could think of an item in the RPCM as a subtest or a collection of tasks. The RPCM defines the probability of obtaining a score of s on item i given a unidimensional latent trait
where
We can obtain the original
The Poisson Diagnostic Classification Model Framework
In essence, DCMs are confirmatory latent class models with different parameterizations of the measurement component. The basic structure a DCM can be written as
where
Let us explain each equation. Equation 5 is the item response function, which takes on a form similar to the RPCM in Equation 2. Equation 6 describes the parameterization of the
This additive specification in Equation 8 allows us to connect the PDCM to other DCMs that are based on generalized linear mixed models such as the log-linear cognitive diagnosis model (LCDM; Henson et al., 2009). The LCDM is the most general DCM for binary items that subsumes most earlier DCMs such as the “deterministic inputs, noisy, and gate” (DINA) model (Haertel, 1989; Junker & Sijtsma, 2001) and the linear logistic model (LiLM; Maris, 1999). The LCDM defines the probability of examinees in attribute profile c correctly answering item i as
Comparing Equations 9 to 8, we should be able to see the same set up of the
To estimate parameters in the PDCM, we impose two types of constraint, one for identifiability where some estimation approaches need and the other one for combating label switching. For identifiability, we could adapt the approach developed in Jansen and van Duijn (1992) and impose
and for all higher order interactions. Alternatively, one could put constraints on the parameter distributions, which will be demonstrated in the operational study. To combat label switching, we force
Models Under the PDCM Framework
The PDCM provides a modeling framework that could accommodate item response functions that reflect specific theories. The family of PDCMs share the same structure in Equation 5 but with different parameterizations of the
We can see that the
We can see that the
Operational Study
To demonstrate how to use the PDCM in operational settings, we obtained data from 808 kindergarten students in China on an English Recognition Assessment. The assessment measures their abilities to recognize basic English in three domains: numbers, colors, and objects. Each domain has eight blocks, each containing 10 English words. Students need to drag the numbers/colors/objects into the blanks that match the English words shown on the screen. An example block is given in Figure 1. In this example, we can see that it may not be adequate to model each word as an item because they are so simple and similar. Instead, we could count the total number of correct or incorrect answers in each block (which is treated as an item) and fit the proposed PDCM. Figure 2 shows the frequencies of examinees’ incorrect counts in each block under each domain. Because most examinees have got most English words correct in each block, we chose to count the number of incorrect answers in each block as the input for the s parameter in Equation 5. When interpreting the results, we flipped the binary definitions of each attribute where

An example block in the operational study.

Distributions of the counts of examinees’ incorrect answers in each block.
We estimated the PDCM parameters using Hamiltonian Monte Carlo (HMC) algorithms in Stan (Carpenter et al., 2017). The Stan syntax can be found in the Supplemental Material. Specifically, we estimated a vector of class membership probabilities:
Different priors may be selected based on theories and properties of the dataset, which may influence parameter estimates. Four Markov chains were specified with each a length of 20,000, where the first 10,000 draws were discarded and the rest 10,000 were used for inference. Gelman-Rubin’s
Table 1 lists the mean and standard deviation of the posterior distribution for each item parameter. We also computed the
Mean and Standard Deviation of the Posterior Distribution for Each Item Parameter in the Operational Study.
and
Similarly, we computed those values for each block and displayed the results in Figure 3. Figure 3 only showed the probabilities of up to six incorrect answers because the probabilities for more than six incorrect answers became too small. However, one could compute the probabilities of getting any numbers of incorrect answers up to the total number of blocks. The overall trend in Figure 3 is that for masters, the probability of getting more incorrect answers was constantly decreasing for items that measure any of the three attributes. For non-masters, the probability of getting more incorrect answers was decreasing for items that measure

Model predicted probabilities of the counts of examinees’ incorrect answers in each block.
Discussion
This article provided a framework for modeling count data under the DCM framework and demonstrated its uses and interpretations with an operational dataset. The PDCM framework can be especially useful for assessments with many small and repetitive units. For example, in education, teachers give out words or numbers to test whether students have mastered them; in psychology, simple recognition tasks are used to test respondents’ attention, memory, or thinking skills. Integrating the Poisson distribution into the DCMs offers tools for teachers to decide student mastery on specific knowledge and for psychologists to classify respondents with mental disorders.
The current study aims to be an exploratory piece of work that taps into DCM with count data. Future work is needed on at least the following aspects. First, although it is expected that the PDCM holds similar characteristics as other psychometric models where larger sample sizes and/or longer test length and Q-matrices with fewer cross loadings produce more accurate results, simulation studies could be conducted to verify that. Second, some operational assessments of this type may have time limits on each task. Extending the current PDCM to include time factor could be studied. Specifically, one could consider
Supplemental Material
sj-pdf-1-epm-10.1177_00131644211017961 – Supplemental material for Poisson Diagnostic Classification Models: A Framework and an Exploratory Example
Supplemental material, sj-pdf-1-epm-10.1177_00131644211017961 for Poisson Diagnostic Classification Models: A Framework and an Exploratory Example by Ren Liu, Haiyan Liu, Dexin Shi and Zhehan Jiang in Educational and Psychological Measurement
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
