Abstract
Background: Little is known about the factors affecting participation in clinical assessments after HEmochromatosis and IRon Overload Screening. Methods: Initial screening of 101,168 primary care patients in the HEmochromatosis and IRon Overload Screening study was performed using serum iron measures and hemochromatosis gene (HFE) genotyping. Using iron phenotypes and HFE genotypes, we identified 2256 cases and 1232 controls eligible to participate in a clinical examination. To assess the potential for nonresponse bias, we compared the sociodemographic, health status, and attitudinal characteristics of participants and nonparticipants using adjusted odds ratios (ORs) and 95% confidence interval (CI). Results: Overall participation was 74% in cases and 52% in controls; in both groups, participation was highest at a health maintenance organization and lowest among those under 45 years of age (cases: OR = 0.68; 95% CI 0.53, 0.87; controls: OR = 0.59; 95% CI 0.44, 0.78). In controls only, participation was also lower among those over 65 years of age than the reference group aged 46-64 (OR = 0.64; 95% CI 0.47, 0.88). Among cases, participation was higher in HFE C282Y homozygotes (OR = 3.98; 95% CI 2.60, 6.09), H63D homozygotes (OR = 2.79; 95% CI 1.23, 6.32), and C282Y/H63D compound heterozygotes (OR = 1.82; 95% CI 1.03, 3.22) than in other genotypes, and lower among non-Caucasians and those who preferred a non-English language than in Caucasians and those who preferred English (p < 0.0001). Conclusions: Subjects with greatest risk to have iron overload (C282Y homozygotes; cases ≥45 years; Caucasians) were more likely to participate in a postscreening clinical examination than other subjects. We detected no evidence of strong selection bias.
Introduction
H
Selection bias is a form of systematic error caused by differences in characteristics between those who do and those who do not participate in a study. Selection bias is important scientifically because it can invalidate conclusions and generalizations from studies in which it occurs (Last, 2001). For example, samples of patients from tertiary care settings may be biased because the participants have more severe or intractable disease than all those affected in the population. Selection bias due to self-selection occurs due to differences between people who choose to participate and those who do not. Self-selection bias can occur in cross-sectional and case-control studies, and at each point in a longitudinal study. In cross-sectional studies and case-control studies, and at the baseline of longitudinal investigations, it is a component of “nonresponse” bias. In longitudinal studies, it is one source of biased follow-up (Greenland, 1977).
The HEmochromatosis and IRon Overload Screening (HEIRS) study is a multiracial, multiethnic, primary care-based screening study of hemochromatosis and iron overload conducted at five field centers in the United States and Canada (McLaren et al., 2003; Adams et al., 2005). Some general results from the initial screening phase have been reported (Adams et al., 2005; Barton et al., 2005, 2006; Acton et al., 2006a, 2006b). The purpose of the present analyses was to address the potential for nonresponse bias at the second (clinical examination) phase by comparing characteristics between those who did and did not participate, separately for a group of cases designated as at higher risk for hemochromatosis and iron overload, and a matched control group.
Materials and Methods
Study design and approval
The design and methodology of the HEIRS study has been reported in detail elsewhere (McLaren et al., 2003). The local Institutional Review Board of each field center approved the study protocol (McLaren et al., 2003; Adams et al., 2005). Briefly, five HEIRS study field centers recruited participants ≥25 years of age who were able to give informed consent from primary care practices and blood-drawing laboratories. There were no other inclusion or exclusion criteria. The HEIRS study had two principal phases: initial screening and subsequent clinical examination (McLaren et al., 2003). In the present analyses, we sought to evaluate the observed differences between participants and nonparticipants at the clinical examination phase.
Selection of study subjects
The HEIRS study screened 101,168 primary care patients over a 2-year period (February 2001-March 2003). As part of the informed consent process, the study provided background information on hemochromatosis and iron overload to all potential participants. All sites had information available in English; in addition, specific sites had written material and recruiters fluent in Mandarin, Spanish, and Vietnamese. Some initial screening participants were subsequently designated as cases or controls. Cases (n = 2256) were defined as (1) homozygous for the HFE C282Y mutation (n = 333), with (n = 174) or without (n = 159) elevated levels of serum ferritin (SF >300 μg/L for men and >200 μg/L for women) or transferrin saturation (TS >50% for men and >45% for women) or (2) not homozygous for HFE C282Y but with the same elevated levels of SF and TS (n = 1923). Potential controls were eligible if they lacked HFE C282Y or H63D, and had nonelevated TS and SF values as defined above. Selected controls (n = 1247) were frequency matched to the cases at a 1:1 ratio on field center, age group (25-44, 45-64, and >65), and date of initial screening.
Case participants were informed of their TS and SF levels and HFE genotype by letter and invited by telephone to participate in a clinical examination. They were able to ask questions about their test results during the telephone call in their preferred language (English, Mandarin, Spanish, or Vietnamese). The content of the letters sent to cases was tailored to their specific genotype and phenotype. For example, cases not homozygous for HFE C282Y but with elevated iron values were “recommended” to share their results with their physician and “talk to a genetics counselor about risks to your family members” (Anderson et al., 2006). Cases also received an information pamphlet. Control participants received a letter informing them that no phenotype or genotype abnormalities were detected, that they were “welcome” to share the information with their physician, and were invited by telephone to participate in the clinical examination as control subjects. Cases and controls were reimbursed for their out-of-pocket expenses such as parking and child care.
Screening laboratory methods
Phenotype measurements
Methodology and quality control used for measurements of TS and SF are described elsewhere (Barton et al., 2005). The HEIRS study defined these initial screening phenotypes to be elevated: TS >50% for men and TS >45% for women; SF >300 μg/L for men and SF >200 μg/L for women (McLaren et al., 2003).
Genotype analyses
HFE C282Y and H63D were detected using buffy coat samples from whole-blood ethylenediaminetetraacetic acid samples and a modification of the Invader assay (Third Wave Technologies, Madison, WI) as described in detail elsewhere (Barton et al., 2005). Participants without C282Y or H63D were designated as having HFE wild-type genotype (wt/wt).
Statistical considerations
The primary outcome variable was participation in the clinical examination (yes = 1 and no = 0). Potential explanatory factors were drawn from initial screening data and included age, sex, race/ethnicity, recruiting field center, self-report of chronic conditions associated with iron overload (arthritis, diabetes, liver disease or liver cancer, heart failure, fertility problems, or impotence) and “too much iron in your body, iron overload, or hemochromatosis,” self-rated health status, and attitudes about genetic testing and the perceived role of lifestyle in health. For cases only, participation was compared across six specific HFE genotypes: C282Y homozygosity; H63D homozygosity; C282Y/H63D compound heterozygosity; C282Y heterozygosity; H63D heterozygosity; and HFE wt/wt in participants who had elevated screening iron measures. By selection, all controls had the genotype HFE wt/wt.
We compared characteristics of participants and nonparticipants and analyzed data from cases and controls separately. We used Pearson's chi-squared statistic to assess independence between strata of binary variables and across all strata for variables with multiple categories. Crude (unadjusted) odds ratios (ORs) and 95% confidence intervals (CIs) were calculated to estimate the strength of association between participation and each variable. For variables with more than two levels, an OR and 95% CI were calculated for each stratum, compared to a reference category (OR = 1.0). These multiple-level variables included several questions about health beliefs and attitudes about genetic testing that were asked using Likert-type response options. Health beliefs (e.g., “My health is excellent”) ranged from “Definitely true” to “Definitely false”; psychological effect (e.g., “During the past 4 weeks, have you felt downhearted and blue”?) ranged from “All of the time” to “None of the time”; attitudes toward genetic testing (e.g., “In general, I think genetic testing to find out about disease is a good idea”) ranged from “Strongly Agree” to “Strongly Disagree”; and self-rated health response options were Excellent, Good, Average, Fair, and Poor.
Because these variables were measured on an ordinal scale, the stratum-specific OR permitted examination of linear trends in the strength of association across levels of the variable. Specifically, the hypotheses being tested were that if self-rated health, or the strength of agreement with an attitude, were associated with participation in the clinical examination, then there should a linear trend of increasing (or decreasing) OR across the ordinal response categories for these variables.
Multiple logistic regression was used to identify covariates that independently predicted participation, controlling for other factors in the model. Because there were different eligibility criteria and potential motivations for participation, logistic modeling was done separately for case and control strata. A univariate model for each covariate available in the dataset was fitted, and the p-values were examined. Covariates that yielded p-values <0.10 in these univariate models were considered further in the model building, and those with p-values ≥0.10 were not considered further. The models were built sequentially by adding covariates, resulting in final models in which all remaining covariates were deemed statistically significant at p < 0.05. For brevity, only variables with Pearson's chi-squared p-values <0.05 were included in the univariate results.
Results
Univariate results
Table 1 shows the distributions of characteristics between participants and nonparticipants separately for cases and controls for variables associated with participation at p <0.05. Participation was 74% for cases and 52% for controls (p < 0.001).
Cases met either or both genotypic or phenotypic criteria for the clinical examination.
Controls met neither genotypic nor phenotypic criteria and were matched to the controls on age group, field center, and date of initial screening visit.
Chi-squared p-value for the 2 × n table for that characteristic. Only variables with p-values <0.05 in cases or controls are presented.
Referent group.
CI, confidence interval; OR, odds ratio; TS, transferrin saturation; SF, serum ferritin; UAB, University of Alabama at Birmingham; UCI, University of California, Irvine.
Table 1 also shows stratum-specific OR and 95% CIs for strata of the above factors. For genotype variables (cases only), compared to those with HFE wt/wt, participation was significantly greater in C282Y homozygotes (OR = 4.17), H63D homozygotes (OR = 2.62), and C282Y/H63D compound heterozygotes (OR = 2.19). With respect to phenotype (elevated iron measures, cases only), participants with elevated iron parameters were 2.4 times more likely to participate than those with normal iron values (OR = 2.44).
In cases, the relative odds of participation were 27% higher for women (OR = 1.27); in controls, there was no significant difference in the proportions of men and women who participated. Age group was associated with participation in cases (p = 0.002) and in controls (p < 0.001). In cases, there was a linear trend in OR reflecting greater participation in older age groups than in younger age groups. No linear trend in participation by age group was observed in control subjects, but controls <45 years of age were 41% less likely to participate than in older participants (OR = 0.59).
Non-Caucasian cases were 35-39% less likely than Caucasian cases to participate in the clinical examination (p < 0.001); non-Caucasian controls were 22-43% less likely to participate than Caucasian controls (p = 0.008). Cases recruited from the four field centers that were not based at a health maintenance organization (HMO) were 64-73% less likely to participate than those recruited at the Kaiser Permanente Field Center (p < 0.001); non-HMO controls were 31-60% less likely to participate (p < 0.001). Cases with a language preference other than English were 36-52% less likely to participate than those who preferred English (p < 0.001). Among controls, language preference overall was associated with participation (p = 0.002); controls who preferred Spanish were 56% less likely to participate than those who preferred another language (OR = 0.44; 95% CI 0.28, 0.71).
Health conditions associated with participation in cases were self-report of arthritis (p = 0.045), heart failure (p =0.030), and infertility/impotence (p = 0.003). Those who reported “no” or “not sure” about infertility/impotence were 66% less likely to participate than those who reported these conditions.
Several psychosocial variables, and attitudes about genetic testing, were significantly associated with participation. In participants who qualified as cases, these included the beliefs that genetic information should be shared with family members (p = 0.016); genetic testing is generally a good idea (p = 0.029); genetic testing is good because the results allow one to change to a healthier lifestyle (p < 0.002); genetic testing is bad because it may affect one's insurance (p < 0.001); lifestyle is an important factor in health (p = 0.047); and responses to the statement “I am a happy person” (p = 0.047). In controls, participation was associated with self-rated health (p < 0.001); the belief that lifestyle is an important factor in health (p = 0.018); and responses to the statements “My health is excellent” (p = 0.024) and “I am a happy person” (p = 0.015).
When ORs were examined for trends among ordinal Likert-type response options, cases who rated their health as other than excellent were consistently less likely to participate, but there was not a linear trend in OR for cases or controls. A linear trend in OR was seen for only one variable, in cases only: compared to those who strongly agreed with the view that “genetic testing is a good idea because it enables a change to a healthier lifestyle,” those who agreed were 16% less likely to participate. Those who disagreed were 62% less likely, and those who strongly disagreed were 85% less likely to participate. A partial trend in ORs was seen in controls, in whom the odds of participation increased with the frequency of being happy. Compared to those who were happy “none of the time,” participation increased by 80% in those happy “a little of the time,” by 126% for “some of the time,” by 147% for “a good bit of the time,” and by over 300% for those happy “most of the time.” Variables not associated (p > 0.05; not shown) with participation in either cases or controls were self-reported diabetes, reporting that blood relatives have iron overload, the expectation that health will get worse, feeling downhearted and blue, and agreement with the belief that heredity is an important reason why people get sick.
Because hypothesized linear trends in OR across ordinal strata in most of the psychosocial and attitudinal questions were absent, statistically significant OR in specific strata should be interpreted with caution, particularly given the number of hypotheses tested herein.
Adjusted analyses
After multiple logistic modeling (Table 2), variables remaining as independent predictors of participation in cases were C282Y homozygosity (OR = 3.98), H63D homozygosity (OR = 2.79), or C282Y/H63D compound homozygosity (OR = 1.82), whereas age <45 years was associated with lower participation than age ≥45 years (OR = 0.68). Controls <45 years (OR = 0.59) or >65 years (OR = 0.64) were less likely to participate. Participation was lower among those recruited at non-HMO field centers for both groups, with statistically significant adjusted OR of 0.30-0.37 in cases and 0.38-0.62 in controls.
Adjusted for HFE genotype, field center, and age group in cases and for field center and age group in controls.
Referent group.
Discussion
In the present study, subjects selected as cases with greatest risk to have iron overload (HFE C282Y homozygotes, and phenotypically or genotypically defined cases ≥45 years) were more likely to participate in a postinitial screening clinical examination than other subjects. A major factor associated with study participation is the salience, or personal interest, that an eligible participant has in the specific topic under investigation (Heberlein and Baumgartner, 1978). In case-control studies, participation rates of cases are typically higher than those of control subjects. Salience was detected in the present longitudinal study, wherein participation among cases, eligible for clinical examination on the basis of phenotype or genotype abnormalities, was 74%; participation was significantly lower (52%) among controls who were informed by letter that they had no identified abnormality. This is consistent with a review that reported median case-control participation rates of 81% and 74%, respectively, in 107 such studies published in 2003 (Morton et al., 2006). Still, it is noteworthy that at least one half of the eligible controls participated in a clinical examination in spite of no elevated risk for the disease under investigation. Salience might also explain different participation rates within the present cases. Participation was highest in C282Y homozygotes, H63D homozygotes, and C282Y/H63D compound heterozygotes (84-91) and lowest in those with HFE genotypes C282Y/wt, H63D/wt, and wt/wt (69-71%). Similarly, hemochromatosis probands with C282Y homozygosity diagnosed during routine medical care were significantly more likely to comply with maintenance phlebotomy recommendations than probands with other HFE genotypes (Hicken et al., 2003). It is difficult to directly ascertain the effect on participation played by participants' understanding of their results. However, in an analysis of a sample of non-C282Y homozygous HEIRS study participants who did not meet the criteria for the clinical exam and who received SF, TS, and HFE gene test results by mail only (n = 1281), Harrison et al. (2008) found that those with normal HFE test results, normal iron test results, and English-speaking participants understood test results and study recommendations better than those with one or two HFE mutations, out-of-range TS or SF results, and non-English-speaking participants, respectively. We also cannot test the influence—positive or negative—of participants' physicians and other care providers on the decision to attend the clinical examination. The content of the letters, specifically whether people were “welcome,” “suggested,” “encouraged,” or “recommended” to share their results with their MD, varied by phenotype and genotype (Anderson et al., 2006). It is possible but unknown if some eligible participants followed up with their own physicians independent of further involvement in the HEIRS study.
We observed that factors primarily related to study design and sociodemographic characteristics have similar associations with clinical examination participation in both case and control groups. Specifically, the participation proportions of cases and controls recruited at the single HMO were uniformly greater than those of corresponding groups from other field centers. Participation was lower in both cases and controls among non-Caucasian groups than in Caucasians, and lower in those with language preference other than English than in those who preferred English. The direction and magnitude of potential bias in incidence estimates derived from the clinical examination data are more difficult to predict with factors such as field center and ethnicity than with age, because these factors are not as uniformly associated with disease incidence as with age.
Most other factors we analyzed, many of which were measured at an ordinal level, had no consistent relationship with participation by either cases or controls. Initial screening in the HEIRS study involved genotyping, but participation in the second, more involved, postscreening phase of the study was not associated with the strength of stated attitudes toward genetic testing. No psychosocial or attitudinal variables remained in the final logistic models.
Investigations of factors affecting participation in two-stage studies, the designs of which resemble that of the HEIRS study, have shown greater participation among those with specific health concerns (Cobb et al., 1957). For example, those who participated in the examination phase of National Health and Nutrition Examination Survey II were more likely than nonparticipants to have a health problem they wished to discuss with a physician (Forthofer, 1983). In the present study, the ORs for controls who reported that it was “mostly true” that their health was going to get worse and those who felt this was “definitely true” were not statistically significant. Altogether, these observations suggest that health-related motives for participation in a phenotype and genotype screening study involving a postscreening clinical examination are complex and may not be predictable. Only knowledge of true exposure and outcome status in both participants and nonparticipants allows accurate estimate of the magnitude and direction of selection bias. This knowledge is usually only available in countries that have population-based registries of both risk factor and health outcome data and that permit investigators to sample from them. This situation does not exist in either the United States or Canada, the countries from which HEIRS study participants were recruited. Nonetheless, the unpredictability of most factors studied suggests that the clinical examination participation appears to be random with respect to these factors. The use of the OR as the measure of association, coupled with their cautious interpretation when they are close to the null, should yield valid inferences from the data collected during the clinical examination phase of the HEIRS study or similar screening programs.
Footnotes
Acknowledgments
Following are the participating HEIRS study investigators and institutions:
Disclosure Statement
No competing financial interests exist.
