Abstract
Traditional psychometric models focus on studying observed categorical item responses, but these models often oversimplify the respondent cognitive response process, assuming responses are driven by a single substantive trait. A further weakness is that analysis of ordinal responses has been primarily limited to a single substantive trait at one time point. This study applies a significant expansion of this modeling framework to account for complex response processes across multiple waves of data collection using the item response tree (IRTree) framework. This study applies a novel model, the longitudinal IRTree, for response processes in longitudinal studies, and investigates whether the response style changes are proportional to changes in the substantive trait of interest. To do so, we present an empirical example using a six-item sexual knowledge scale from the National Longitudinal Study of Adolescent to Adult Health across two waves of data collection. Results show an increase in sexual knowledge from the first wave to the second wave and a decrease in midpoint and extreme response styles. Model validation revealed failure to account for response style can bias estimation of substantive trait growth. The longitudinal IRTree model captures midpoint and extreme response style, as well as the trait of interest, at both waves.
Researchers and measurement practitioners often assume the substantive trait of interest (TOI) drives responses to ordinal items, but response styles may also influence respondent option selection. Response styles describe a respondent’s propensity to select item response options in a systematic way, regardless of both the respondent’s TOI and the content of the item. One commonly studied response style, extreme response style (ERS; Cronbach, 1946), is the tendency to endorse the lowest and highest categories of a response scale. In contrast, midpoint response style (MRS) describes a respondent’s propensity to use the midpoint response category (Messick, 1968; Plieninger & Heck, 2018; Schuman & Presser, 1981). Other response styles include acquiescence response style (ARS), the propensity of a respondent to agree with or affirm conceptually distinct ordinal items (Martin, 1964; Ray, 1983) and disacquiescence response style (DRS), which is the tendency to disagree with conceptually distinct ordinal items.
The presence of response styles in self-reported measures is a concern, as the validity of inferences made from instruments using these measures is threatened by response styles that have not been accounted for. Respondents’ preference for extreme responses can bias measurements of traits of interest and can confound interpretations of scores, thus interfering with the psychometric study of an instrument. For example, items may appear to function differently in their measurement of the TOI when respondents vary only in response style (Bolt & Johnson, 2009; Park & Wu, 2019). The presence of response styles presents a host of other issues. For example, ERS can lead to spurious correlations between constructs as respondents may have inflated or deflated scores in unrelated constructs (Jeon & De Boeck, 2016; Moors, 2012; Park & Wu, 2019; Paulhus, 1991), which may lead to overestimation or underestimation of the construct of interest and score inflation (Park & Wu, 2019) or manifest as an issue in diagnostic surveys where a respondent’s score plays a role in a clinical diagnosis or access to services. Furthermore, response styles threaten dimensionality, construct validity, predictive validity, and the reliability of self-reported measures (Baumgartner & Steenkamp, 2001; Cheung & Rensvold, 2000; de Jong et al., 2008; van Herk et al., 2004; van Rosmalen et al., 2010). This can result in differential item functioning, a lack of measurement invariance, and present negative effects on parameter recovery (Bolt & Johnson, 2009; Liu et al., 2017). Going further, response styles threaten comparisons of diverse groups and international or cross-cultural assessment. There are many documented correlates (i.e., age, individual traits, or cultural background) to different types of response styles that often vary by culture or region, thus making accurate comparisons difficult (Harzing, 2006; Khorramdel & von Davier, 2014).
Stability in individuals’ response styles has led researchers to suggest that these tendencies should be conceptualized as trait-like individual difference variables of substantive interest (e.g., Böckenholt, 2017; Wetzel et al., 2016). Stability of response styles implies that individuals systematically use the response scale within the same questionnaire (e.g., ARS: Danner et al., 2015; ARS and ERS: Weijters et al., 2010b) and across scales (e.g., ERS: Aichholzer, 2013; Bolt & Newton, 2011; Wetzel et al., 2013; ARS & ERS: Weijters et al., 2010b).
Previous research has also indicated that response styles are stable over both short and long periods of time. Evidence of response style stability has been found over a 1-year period (ARS, ERS, DRS, and MRS: Weijters et al., 2010a), a 2-year period (ARS and ERS: Bachman & O’Malley, 1984), a 4-year period (ARS: Billiet & Davidov, 2008), and an 8-year period (ARS and ERS: Wetzel et al., 2016). The stability of personality and attitude constructs, such as self-esteem or perceived ethnic threat, may be ascribed to response style stability (e.g., Bachman & O’Malley, 1984; Billiet & Davidov, 2008), although this is not the only option for observed stability in constructs. However, due to the possibility and evidence of response style stability over constructs and time, Böckenholt (2017) stresses the use of models that account for individual differences due to response style tendencies, especially for longitudinal studies, as emphasized by Weijters et al. (2010b). The methods used to answer this call are still developing, particularly those to account for response styles over multiple time points using an item response framework. As such, this study describes and applies one such model.
Measuring Response Style
Because of potential response style contamination in surveys and assessments, researchers propose and use multiple approaches to measure the presence and impact of response styles. van Vaerenbergh and Thomas (2013) provide a useful review of approaches. Some of the earliest methods, often termed the classical methods, record the frequency of occurrence for which a participant selects a particular response category, such as the extreme categories, and use this frequency, or proportion, as a measure of the response style (e.g., Bachman & O’Malley, 1984; Greenleaf, 1992). The primary limitation of this approach is that it confounds measurement of response styles and the trait(s) of interest. That is, a person providing many responses in the lowest category, implying they could be an extreme responder, or the respondent could also be very low on the substantive TOI. Although this method may aid in quantifying a respondent’s response style, this classical approach cannot disentangle response style and the substantive trait; as such, there is no direct way of correcting the TOI measure for response style bias (Bolt et al., 2014).
In a promising approach that addresses issues with the classical approach, Wetzel et al (2016) use a trait-state-occasion model for ERS and ARS, building on Weijters et al. (2010a). They measure the stability of response styles as the amount of variance in the state response style factors that is explained by the trait response style factors. Wetzel et al. (2016) also use the latent-state model, which measures stability of response styles as the correlations between time-specific response style factors across waves. However, indicators of response style were based on sum scores for a random sample of items and do not utilize item-level responses.
Multidimensional item response models address the shortcoming of the classical approaches and those using sum scores by simultaneously modeling the TOI and response styles utilizing item-level information. One multidimensional approach, the item response tree (IRTree), models systemic response tendencies when an individual is assumed to respond to ordinal items in a multiple-stage decision-making process (Böckenholt, 2012). As an example, consider a five-category ordinal item. One three-process model hypothesizes that individuals respond to ordinal items using three distinct decisions, as illustrated in Figure 1, with each decision driven by a separate latent trait (

Item response tree (IRTree) model accounting for midpoint response style (MRS), the trait of interest (TOI), and extreme response style (ERS) for a 5-point ordinal item.
The probability of each Likert-type item response category can be specified by tracing the path in Figure 1, assuming the three latent traits are considered independent. For example, the probability of person i responding to item j in the highest category, strongly agree, is given by:
where
Each
where
Multiple response styles, such as ERS and MRS, may be present in survey data, but previous multidimensional models have been limited by accounting for only one response style per survey (e.g., Bolt & Johnson, 2009). The IRTree provides flexibility to simultaneously account for, and parse out the effects of, multiple response styles, such as MRS and ERS, from the TOI. For example, Spratto et al. (2021) analyzed a 7-option response scale using two-competing IRTrees to model MRS and ERS. The first IRTree modeled four dichotomous hypothesized stages
Despite the beneficial perspective of IRTrees, there has been limited research to date on the stability of response style over multiple time points within this valuable framework. The purpose of this study is to apply a novel model, the longitudinal IRTRee (LIRTree), to a real data set and evaluate the validity of inferences made from the model. The specific research questions are as follows: Does the LIRTree validly account for multiple response styles across multiple waves of data collection (i.e., multiple time points)? Are extreme and MRS traits stable across multiple waves of data collection in an empirical data set measuring sexual knowledge? How do extreme and MRS traits change in relation to the substantive TOI across multiple waves of data collection in an empirical data set measuring sexual knowledge? How do the substantive interpretations when accounting for response style differ from the interpretations made when the explicit modeling of response style is ignored?
Method
The longitudinal model for response processes builds on the longitudinal IRT model for dichotomous responses (Kim & Camilli, 2014), which leads into an extension for response styles, the longitudinal IRTree (LIRTree; Ames & Leventhal, 2021).
Longitudinal IRT
A traditional two-parameter logistic IRT model is defined via:
with longitudinal extension, per specification in Kim and Camilli (2014):
where
Longitudinal IRTree
Following the longitudinal IRT framework, Ames and Leventhal (2021) extend the IRTree in Equation 2 to now include multiple time points, resulting in
In the LIRTree approach, the linear latent score for person i, at time t, in decision-stage k, is a function of the item effects
In the LIRTree, µkt and
The covariance structure of θ allows for covariance among different times and traits using an unstructured covariance matrix, with the exception that the mean and variance of the latent traits at the first time point are fixed to 0 and 1, respectively. That is,
Empirical Example: Measurement Instrument
We present an empirical example to demonstrate the usefulness of the model and parameter interpretation. The empirical example uses a 6-item sexual knowledge scale from the National Longitudinal Study of Adolescent to Adult Health (Add Heath; Harris, 2009), measured across two time points (Wave I and Wave II of Add Health data collection). Wave I took place between 1994 and 1995. For Wave II, nearly 15,000 of the Wave I respondents were interviewed from April to August 1996, approximately 1 year after the Wave I interview. The six items are presented in Table 1, along with the original variable name from the Add Health data set. All items respond to the prompt: “Do you agree or disagree with the following statement?” Response options include strongly disagree, disagree, neither agree nor disagree, agree, and strongly agree, numerically represented by scores 1 to 5, respectively. A random sample of n = 3,000 respondents was selected from the full sample using the SURVEYSELECT procedure in SAS (SAS Institute Inc., 2008). All respondents in the random sample had complete data, such that they provided item responses to both waves of data collection at
Sexual Knowledge Scale of the Add Health Survey.
Note. Wave I and Wave II contain the variable names in the Add Health public use data file.
Description of the Sample
Response percentages and item mean scores of the six items are provided in Table 2. Average item scores increased from Wave I to Wave II for all items, indicating an increase in average sexual knowledge scale scores for the sample. The largest increase in mean item score was for Item 2 (
Response Percentages.
Note. Values in table represent the percentage of respondents in each category.
Estimation
All data analysis and model fitting was performed in Mplus, which relies on the Gibbs sampling algorithm. Details of the sampling algorithm can be found in Asparouhov and Muthén (2010) for item response models. Convergence was assessed using the potential scale reduction factor (PSRF; Gelman & Rubin, 1992), the default in Mplus. PSRF values for all model parameters were close to 1. As a rule of thumb, a PSRF lower than 1.2 would indicate convergence (Muthén & Asparouhov, 2012). Prior distributions for the item parameters were weakly informative (i.e.,
Parameter Recovery
Previous simulation studies (Ames & Leventhal, 2021) demonstrated adequate parameter recovery of LIRTree parameters, manipulating conditions of survey length (10, 20 items); sample size (1,000, 2,000 people); time points (t = 2, 3); latent trait variances (
Outcomes in Ames and Leventhal (2021) included mean absolute bias (MAB), defined as the average absolute deviance between the posterior point estimate (i.e., expected a posteriori [EAP]) and the generating value, and coverage rate, defined as the percentage of replications in which the highest posterior density encompassed the generating value. Each condition was replicated 250 times, with estimation in Mplus version 8.3 (Muthén & Muthén, 1998-2017), which relies on the Gibbs sampler for MCMC estimation (Asparouhov & Muthén, 2010; Gelman et al., 2004). Convergence times ranged from 3 minutes, 44 seconds (i.e., for the condition with 1,000 people, 10 ordinal items, two time points) to 2 hours, 36 minutes and 27 seconds (i.e., for the condition with 2,000 people, 20 ordinal items, three time points) when run in parallel on two processors. The computer processor is an Intel® Xeon® W-2155 CPU with 10 cores and 20 threads. Times to run indicate that computational time for Bayesian estimation of tree-like models is not prohibitive.
Item parameter recovery results improved as sample size increased, as measured by MAB, a finding that agrees with previous simulation that investigated noncompensatory MIRT models. Adequate item parameter recovery was found for a sample size of 2,000 at both survey lengths. Larger correlations between latent traits, such as
Matching priors improved MAB for item loading and variance parameter recovery, and had minor improvement for shift parameter recovery. Specifically, Ames and Leventhal (2021) demonstrated recovery was better in the “matching” prior conditions than for the Mplus baseline conditions for item loadings, particularly with two time points. The same gains were not seen across conditions for the three time point model, indicating that multiple measurements of the same item improves parameter recovery for loadings. However, when the
Results
Drift
To evaluate the longitudinal assumption that item parameters were invariant across the two waves of data collection, we undertook a systematic procedure using the same approach as Kim and Camilli (2014). To evaluate longitudinal invariance from an IRT Millsap (2010) suggests fitting a model that constrains the item parameters to be invariant across all measurement occasions. Then, compare, using a likelihood ratio test, the invariant model to one that allows for varying item parameters across measurement occasions. This is essentially the approach we have taken, as have Kim and Camilli (2014), but adapted to a Bayesian framework.
First, we allowed only
where
DIC Comparison.
Note. Values in table represent change in DIC over baseline, with values in parentheses the percentage change from baseline. The column “TOI Only Varies” is the model for which only the item difficulty parameters of the TOI node were allowed to vary and the column “ERS, MRS, TOI Vary” is the model for which item difficulty parameters of all three nodes were allowed to vary. “LGRM” represents the longitudinal drift analysis for the LGRM. TOI = trait of interest; ERS = extreme response style; MRS = midpoint response style; LGRM = longitudinal graded response model; DIC = deviance information criterion.
Parameter Interpretation
Because the drift analysis indicated the base model is most appropriate, we present the results from the analysis with fixed item parameters across all k decision-stages and both waves.
Item Parameters
The results in Table 4 present the item parameter posterior means and standard deviations. Item easiness in the MRS decision stage indicate high levels of MRS are required to endorse the midpoint option. Item 6 was the most difficult for midpoint endorsement (
Item Parameter Estimates.
Note. Mean is the posterior mean (EAP estimate) and SD is posterior standard deviation. ERS = extreme response style; MRS = midpoint response style; TOI = trait of interest; EAP = expected a posteriori.
Person Parameters
Table 5 shows a decrease in levels of extreme (
Latent Trait Parameter Estimates.
Note. Mean is the posterior mean (EAP estimate) and SD is posterior standard deviation. µ is the shift, representing the latent trait mean at Wave II and

Wave I to Wave II trajectories for midpoint response style (MRS).

Wave I to Wave II trajectories for the sexual knowledge trait of interest (TOI).

Wave I to Wave II trajectories for extreme response style (ERS).
Correlation Among Latent Traits.
Note. Values represent Pearson’s correlations among latent trait point estimated. GRM is the latent trait from the LGRM; and Coll is the latent trait from the GRM with categories collapsed. T1 and T2 are Time 1 and Time 2, respectively. ERS = extreme response style; MRS = midpoint response style; TOI = trait of interest from the LIRTree; LGRM = longitudinal graded response model.
Benefits of LIRTree
To demonstrate the benefit of the LIRTree framework, we conducted a series of studies. The first was a comparison of the conclusions drawn from the LIRTree, as they relate to the substantive TOI, and those drawn from a model ignoring response style using the longitudinal graded response model (LGRM; GRM from Samejima, 1969). To determine category threshold drift, we used a parallel process and similar DIC cut criterion as with the LIRTree drift analysis described previously. We also used the same Bayesian MCMC framework for this analysis in Mplus. Similarly, no category threshold parameters indicated evidence of drift (see Table 3). Table 5 also contains the shift, variance, and covariance parameters of the LGRM for comparison with the LIRTree. Using the LGRM, which fails to account for response style, growth in the sexual knowledge substantive trait was smaller for the LGRM (
To understand shifts in traits based on response patterns, we present responses and latent trait estimates for four individuals in Table 7. Each person has the same response set at Wave I (i.e., responding with “5, Strongly Agree” to all items), but differing response sets at Wave II. Although they have the same response sets at Wave I,
Illustration of Response Shifts.
Note.
ERS
First, we consider ERS tendencies. Each respondent selected 6 extreme responses at Wave I (all: “5, strongly agree”), but only Respondents 1 and 4 selected extreme responses at Wave II (six extreme responses vs. three, respectively). Therefore, when the entire set is considered, Respondents 1 and 4 have the greatest
Respondents 2 and 3 failed to endorse any extreme options at Wave II, resulting in
MRS
For MRS, Respondents 1 and 2 failed to endorse the midpoint on items at both Wave I and Wave II. For these respondents, the difference between
TOI
Due to these patterns in response style tendencies,
Respondent 3 is estimated to have high ERS tendencies at Wave I and high MRS tendencies at Wave II. Therefore, as compared with TOI estimates using the LGRM,
Model Assumption Efficacy
We conducted model assumption efficacy studies of the response process assumed by the LIRTree. To do so, we collapsed extreme and nonextreme categories and fit modified responses using a three-category LGRM. Namely, we collapsed Category 1 with Category 2 and collapsed Category 4 with Category 5. Figure 5 presents scatterplots among LIRTree TOI, ERS, and MRS latent trait estimates with three-category LGRM TOI and 5-category LGRM TOI at both waves. In Figure 5, the top two rows represent Waves I and II of the five-category LGRM and the bottom two rows represent Waves I and II of the three-category LGRM. The columns, from left to right, represent scatter plots between the MRS, TOI, and ERS LIRTree latent trait point estimates with the LGRM TOI latent trait point estimates. The triangular relationship between LIRTree MRS and LGRM TOI at both waves indicates that individuals with high LIRTree MRS estimates tend to have average GRM TOI for both the three- and five-category LGRM models. Those with low MRS tendencies tend to have considerable variance in GRM TOI estimates. However, respondents with high and low GRM TOI had low LIRTree MRS at both time points. This pattern provides evidence the LIRTree MRS is capturing a midpoint response tendency. That is, collapsing the extreme categories has little effect on the relationship of LGRM TOI trait with the LIRTree MRS trait.

Comparison of TOI, ERS, MRS estimated using IRTree, TOI estimated using GRM 5 categories, and TOI estimated using GRM 3 categories from Wave I to Wave II.
There is a relationship between LIRTree TOI and LGRM TOI for three- and five-category models. There are trivial differences in the strength of the relationship between the three-category LGRM TOI and LIRTree TOI and the strength of the relationship between the five-category LGRM TOI and LIRTree TOI at both waves. This indicates that the hypothesized LIRTree TOI information is captured adequately with the three-category LGRM. Comparing three- and five-category LGRM TOI correlates indicates the LIRTree method captures ERS. At both waves, there is no relationship between three-category LGRM TOI and LIRTree ERS. There is, however, a relationship between LIRTree ERS and five-category LGRM TOI. This indicates that extreme response behavior is captured in having Categories 1 and 5; as proposed in the LIRTree. Together, these results indicate that LIRTree TOI and five-category LGRM TOI are capturing similar information, however, the LIRTree captures ERS and MRS tendencies that are confounding TOI trait estimates using the five-category LGRM.
Conclusions
This study has presented and evaluated a longitudinal response process model. Applied researchers will need to convert the ordinal survey items to dichotomous pseudo-items in software such as SAS or R. The appendix provides an example of converting 10 ordinal survey at two time points items to 60 dichotomous pseudo-items (i.e., 10 survey items represents 30 pseudo-items, and this must be replicated for two time points). The appendix provides generic Mplus code for the simple example with 10 items and two time points.
The study’s research questions explored LIRTree model specification, and model validation. We saw a decrease in response styles from Wave I to Wave II and an increase in the substantive sexual knowledge TOI. The correlation between Wave I and Wave II for ERS was similar in magnitude to those in Wetzel et al. (2016), who found a positive correlation of 0.678. This study found a correlation of 0.794 Despite the time frame differences examined (i.e., 1 year for this study compared with 2 years for Wetzel et al. [2016]), agreement of this model with another modeling approach validates use of the LIRTree for measuring changes in response styles over time. Similar to conclusions found in Wetzel et al. (2016) and Weijters et al. (2010a), we found stability of response styles across time points, echoing the call for response style effects to be considered in the analysis of questionnaire data obtained with ordinal rating scales. The approach in this study is novel in that it allows for modeling a mean change (
We demonstrated through four individuals in the sample that the latent traits of the LIRTree model are informed by response sets across all waves of data collection. To further evaluate the validity of the models, we undertook a series of studies. We found that failure to account for response style underestimated sexual knowledge trait growth from one wave to the next by comparing the growth from a unidimensional LGRM to the growth realized by the LIRTree. An alternative explanation is that the LIRTree overestimated growth. While validation of LIRTree inferences is still in its infancy, a series of studies in this article show that the LIRTree method captures ERS and MRS. The latent trait from the LIRTree TOI and the LGRM TOI showed a strong relationship, illustrating that the TOI from the LIRTree can be used as a refined measure of the substantive trait, mitigating the effects of response style contamination.
To strengthen the argument for use of the LIRTree, we present some validity evidence from other studies. Ames and Myers (2021) examined the consequences of fitting an IRTree to unidimensional data (i.e., data generated from a GRM without response styles). In brief, they reported that magnitude of the discrimination parameter is a useful tool for detecting the presence of response styles. With unidimensional data, estimated IRTree item discrimination parameters tend to be very small, near zero, for the response style traits, but large for the substantive TOI. Examining the discrimination parameters in Table 4, we see the discrimination parameters are nonzero, providing evidence of the presence of response styles in the data. Ames and Leventhal (2021) demonstrated accurate estimation of the shift parameter. Plieninger and Meiser (2014) report on convergent evidence of construct and criterion validity of the IRTree model. Taken together, the validity evidence for use of the IRTree is growing, but we recognize the need for more studies investigating the LIRTree.
While we only estimated changes across two waves, the model easily generalizes to incorporate multiple time points.
Future Research
Model comparison indices, such as the DIC, were of limited use for this study because the effective number of parameters was over three times as large as the sample size. Plummer (2008) states, [An] important assumption is that the effective number of parameters pD must be small in relation to the sample size n. When this assumption does not hold, 2pD is a poor approximation to the optimism of the plug-in deviance, and DIC under-penalizes complex models.
Effective use of the DIC in the context of drift analysis will help determine whether measurement invariance holds over multiple time points. In addition, model-data fit approaches should be developed to evaluate the appropriateness of the model. In the Bayesian framework, the primary approach is posterior predictive model-checking, but such approaches have not been investigated for IRTrees or their longitudinal extensions. Choosing adequate priors and assessing prior sensitivity is also an important consideration.
The IRTree model at time point t contains three unidimensional, dichotomous pseudo-items rather than one multidimensional polytomous item. The pseudo-items are modeled explicitly as independent items, akin to a simple structure factor structure. The result is explicitly modelling the pseudo-items as noncompensatory. That may be reasonable for some traits, such as MRS and TOI, but perhaps not for ERS and TOI. For example, a respondent selection strongly agree, under the current model, is considered an extreme response and cannot be obtained by high levels of the TOI alone. One alternative option is the proposed model from Myers and Ames (2020), which allows for compensatory response process between TOI and ERS trait (i.e., high levels of TOI can compensate for low levels of ERS). Thus, strongly agree responses may be obtained without high levels of ERS. However, more research is needed on the compensatory IRTree before it is extended to the LIRTree.
Despite the limitations and need for more research into model evaluation, the longitudinal IRTree is an important addition to the survey research methodology. Kim and Camilli (2014) describe a possible extension for nonlinear growth in a longitudinal IRT framework, which could easily be adapted to the LIRTree approach, further illustrating the flexibility of the model. Stability of personality and attitude constructs may be ascribed to response style stability (e.g., Bachman & O’Malley, 1984; Billiet & Davidov, 2008), as demonstrated by the longitudinal model comparison. Böckenholt (2017) stresses the use of models that account for individual differences due to response style tendencies, especially for longitudinal studies (emphasized by Weijters et al., 2010a; Wetzel et al., 2016). The LIRTree is one important tool to account for response style in survey responses in longitudinal contexts.
Footnotes
Appendix
/*SAS code for converting survey items to dichotomous pseudo-items:*/
data recode;
set original;
array t1item{10} t1item1-t1item10;
array t2item{10} t2item1-t2item10;
array MRS{20} MRS1-MRS20;
array TOI{20} TOI1-TOI20;
array ERS{20} ERS1-ERS20;
do j=1 to 10;
if t1item[j]=3 then do; MRS[j]=1; TOI[j]=-99; ERS[j]=-99; end;
if t1item[j]=1 then do; MRS[j]=0; TOI[j]=0; ERS[j]=1; end;
if t1item[j]=2 then do; MRS[j]=0; TOI[j]=0; ERS[j]=0; end;
if t1item[j]=4 then do; MRS[j]=0; TOI[j]=1; ERS[j]=0; end;
if t1item[j]=5 then do; MRS[j]=0; TOI[j]=1; ERS[j]=1; end;
end;
do j=1 to 10;
if t2item[j]=3 then do; MRS[j+10]=1; TOI[j+10]=-99; ERS[j+10]=-99; end;
if t2item[j]=1 then do; MRS[j+10]=0; TOI[j+10]=0; ERS[j+10]=1; end;
if t2item[j]=2 then do; MRS[j+10]=0; TOI[j+10]=0; ERS[j+10]=0; end;
if t2item[j]=4 then do; MRS[j+10]=0; TOI[j+10]=1; ERS[j+10]=0; end;
if t2item[j]=5 then do; MRS[j+10]=0; TOI[j+10]=1; ERS[j+10]=1; end;
end;
run;
##Mplus code for 10 items:
TITLE:
LIRTree: 10 ordinal items
DATA:
File=C:\Users\data.csv;
VARIABLE:
IDVARIABLE=ID;
NAMES=ID item1-item60;
USEVARIABLES=all;
CATEGORICAL= item1-item60;
MISSING ARE all (-99);
ANALYSIS:
Estimator=Bayes;
Chains=2;
Bseed=11;
Point=Mean;
Processors=1;
MODEL PRIORS:
amrs1-amrs10~N(0,2);
aers1-aers10~N(0,2);
atoi1-atoi10~N(0,2);
bmrs1-bmrs10~N(0,2);
bers1-bers10~N(0,2);
btoi1-btoi10~N(0,2);
shiftMRS~N(0,2); shiftERS~N(0,2); shiftTOI~N(0,2);
varMRSt2~N(1,2); varERSt2~N(1,2); varTOIt2~N(1,2);
MODEL:
MRS1 BY i1-i10* (amrs1-amrs10);
[i1$1-i10$1] (bmrs1-bmrs10);
MRS2 BY i31-i40* (amrs1-amrs10);
[i31$1-i40$1] (bmrs1-bmrs10);
TOI1 BY i11-i20* (aTOI1-aTOI10);
[i11$1-i20$1] (bTOI1-bTOI10);
TOI2 BY i41-i50* (aTOI1-aTOI10);
[i41$1-i50$1] (bTOI1-bTOI10);
ERS1 BY i21-i30* (aERS1-aERS10);
[i21$1-i30$1] (bERS1-bERS10);
ERS2 BY i51-i60* (aERS1-aERS10);
[i51$1-i60$1] (bERS1-bERS10);
MRS1@1 ERS1@1 TOI1@1;
[MRS1@0 ERS1@0 TOI1@0];
MRS2* ERS2* TOI2* (varMRSt2 varERSt2 varTOIt2);
[MRS2* ERS2* TOI2*] (shiftMRS shiftERS shiftTOI);
OUTPUT:
tech1 tech8 cinterval(hpd);
SAVEDATA:
results are C:\Users\OUT.dat;
bparameters=C:\Users\POST.dat;
save=fscores(100);
factors=mrs1 toi1 ers1 mrs2 toi2 ers2;
file=C:\Users\PERSONOUT.dat;
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
