Abstract
Objectives:
Many guidelines and regulations allow children and adolescents to be enrolled in research without the prospect of clinical benefit when it poses minimal risk. However, few systematic methods exist to determine when research risks are minimal. This situation has led to significant variation in minimal risk judgments, raising concern that some children are not being adequately protected. To address this concern, we describe a new method for implementing the widely endorsed “risks of daily life” standard for minimal risk. This standard defines research risks as minimal when they do not exceed the risks posed by daily life activities or routine examinations.
Methods:
This study employed conceptual and normative analysis, as well as use of an illustrative example.
Results:
Different risks are composed of the same basic elements: Type, likelihood, and magnitude of harm. Hence, one can compare the risks of research and the risks of daily life by comparing the respective basic elements with each other. We use this insight to develop a systematic method, direct comparative analysis, for implementing the “risks of daily life” standard for minimal risk. The method offers a way of evaluating research procedures that pose the same types of risk as daily life activities, such as the risk of experiencing anxiety, stress, or other psychological harm. We thus illustrate how direct comparative analysis can be applied in practice by using it to evaluate whether the anxiety induced by a respiratory CO2 challenge poses minimal or greater than minimal risks in children and adolescents.
Conclusions:
Direct comparative analysis is a systematic method for applying the “risks of daily life” standard for minimal risk to research procedures that pose the same types of risk as daily life activities. It thereby offers a method to protect children and adolescents in research, while ensuring that important studies are not blocked because of unwarranted concerns about research risks.
Introduction
C
Prominent guidelines and regulations, including the the World Medical Association's Declaration of Helsinki, attempt to avoid this concern by allowing “nonbeneficial” research with individuals who cannot consent, when the risks are minimal (Emanuel et al. 2000; Council for International Organizations of Medical Sciences 2002; World Medical Association 2013). For example, regulations in the United States allow research with children that offers no prospect of direct benefit when an institutional review board (IRB) finds that the research involves no more than minimal risks. If the IRB judges the risks to be minimal, parents then decide whether to enroll their children based on their own judgment of the associated risks (unless the IRB waives the requirement for informed consent) (United States Department of Health and Human Services 1991).
Many guidelines and regulations define minimal risks based on the risks posed by specified comparator activities. The Council of International Organizations of Medical Sciences (CIOMS) defines risks as minimal when they do not exceed the risks posed by routine examinations (Council for International Organizations of Medical Sciences 2002). Regulatory guidance in many countries – including the United States, Canada, India, Nepal, South Africa, and Uganda – define risks as minimal when they do not exceed the risks ordinarily encountered in daily life or during routine examinations (United States Department of Health and Human Services 2009; Canadian Institutes for Health Research 2010; Indian Council of Medical Research 2006; Nepal Health Research Council 2001; South African Medical Research Council 2002; Uganda National Council for Science and Technology 2007). Furthermore, some commentators propose that risks in pediatric research should be considered minimal when they do not exceed the risks posed by acceptable charitable activities for children or adolescents, such as participating in a soccer game for a charity fundraiser (Wendler 2005).
Comparing the risks of research interventions with the risks posed by other activities provides a context in which to evaluate the risks of “nonbeneficial” research procedures and studies. However, absent a systematic method for making these comparisons, investigators, sponsors, and IRBs must rely largely on intuition alone. This approach is problematic for several reasons. First, intuitive judgments about risk do not systematically take account of the relevant data. These judgments are, therefore, less likely to reflect the actual risks faced by study participants. Second, intuitive risk judgments are subject to cognitive biases, which in turn are influenced by numerous contextual factors (Tversky et al. 1974; Slovic 1987; Weinstein 1989; Kahnemann 2012). These factors increase the chances that IRB assessments of research risks are unduly affected by subjective and, possibly, irrelevant factors. Third, as a result, intuitive judgments about which research risks are acceptable vary significantly, leading to inconsistent judgment across individuals, research sites, and studies (Shah et al. 2004). Such variation can be a particular challenge for pediatric studies that frequently need to recruit from several sites.
In some cases, IRBs may have no or extremely limited data on the risks posed by a research intervention. For example, it can be difficult to predict whether a first-in-humans trial of a novel compound poses no, some, or a high risk of harm to children or adolescents. In cases of significant uncertainty about the risks of research interventions, IRBs should err on the side of caution and categorize interventions as having more than minimal risk. More frequently, IRBs have some data on the risks posed by the research intervention, but are unsure whether these risks qualify as minimal or greater than minimal risks. For example, it may be known that a “punch” skin biopsy typically poses a 1 per 100 risk of bleeding that will require another clinical visit, as well as a 1–10 per 1000 risk of infection that can require treatment with oral antibiotics (Rid and Wendler 2011). In these cases, protecting children without inadvertently blocking valuable research requires IRBs to determine whether the risks are minimal or greater than minimal.
Relying on intuitive judgment alone to make these determinations can leave children and adolescents insufficiently protected in some studies, whereas other studies may be rejected out of unwarranted concerns about their risk level. To address this concern, we describe a systematic method, direct comparative analysis (DCA), that IRBs and others can use to categorize the risk level of research interventions that pose the same types of risks as one or more specified comparator activities. To illustrate how DCA works in practice, we consider how it can be used to determine under United States regulations whether a respiratory CO2 challenge poses minimal or greater than minimal risk in children and adolescents.
Clarifying the Minimal Risk Standard
United States regulations define minimal risk as follows: “The probability and magnitude of harm or discomfort anticipated in the research are not greater than those ordinarily encountered in daily life or during the performance of routine physical or psychological examinations or tests” (United States Department of Health and Human Services 1991). As written, one could interpret this definition as specifying that research procedures qualify as having greater than minimal risk whenever children do not ordinarily encounter the procedure in daily life. However, this interpretation suggests that a study on the impact of seated meditation in attention-deficit/hyperactivity disorder (ADHD) necessarily poses greater than minimal risk in children who do not ordinarily meditate in their daily lives. Therefore, whether an intervention poses minimal or greater than minimal risk depends upon whether the intervention's risk level exceeds the level of risk children ordinarily encounter in daily life or during routine examinations, not on whether children undergo the intervention in daily life. To make this determination, reviewers need a way to systematically compare the level of risk posed by research interventions with the level of risk posed by activities in daily life or during routine examinations and tests.
Others might interpret the risks “ordinarily encountered in daily life” as referring to the risks ordinarily encountered by the individuals enrolled in the study. However, this interpretation would allow investigators to justify risky research in children and adolescents who live in war-torn areas on the grounds that the risks are no greater than those they face in their daily lives. To avoid exploiting children's circumstances in this way, we will use for the present analysis the widely endorsed “objective” interpretation of the “risks of daily life” standard. This interpretation defines risks as minimal when they do not exceed the risks of activities ordinarily encountered in daily life by average, healthy, normal children and adolescents (Field and Behrman 2004; Kopelman 2004). In addition, we assume that risks are minimal only when they do not exceed the risks involved in appropriate daily life activities.
Direct Comparative Analysis
To systematically compare the risks of research procedures to the risks in daily life for average, healthy children, it is helpful to recall that all risks are composed of the same basic elements: Type of potential harm, magnitude of potential harm (including duration), and likelihood that the potential harm will be realized. This fundamental fact suggests the possibility of systematizing the comparison of different risks by resolving them into their basic elements and comparing the respective elements to each other. For example, under the minimal risk standard endorsed in the United States, IRBs and others would determine the type, magnitude, and likelihood of the potential harms posed by the research intervention and compare these elements to the type, magnitude, and likelihood of harms that result from one or more daily life activities or routine examinations for average, healthy, normal children.
To date, we are aware of only one systematic method that has been developed to help IRBs make these comparisons (Rid et al 2010). This method allows IRBs to compare harms of very different types. For example, it allows IRBs to compare the risks of infection from a research biopsy to the risks of bone fracture from riding a bicycle. As a result, it can be used to assess the risk level of almost any research intervention. However, this flexibility also makes the method complex to implement (Rid et al 2010). This complexity raises the need for methods that are both systematic and relatively straightforward to use.
DCA offers such a method when the risks of the research intervention in question are of the same type as the risks of one or more activities of daily life, or any other appropriate comparator activity specified by relevant guidelines or regulations (e.g., routine examinations). Specifically, DCA includes five steps that enable IRBs to compare the likelihood and magnitude of the harm occurring as a result of the research intervention to the likelihood and magnitude of the same type of harm occurring as a result of an appropriate comparator activity (Table 1). The five steps of DCA are: 1. Identifying the potential harms of the research intervention 2. Estimating the magnitude and likelihood of each potential harm based on available data 3. Identifying an appropriate comparator activity, or appropriate comparator activities, that pose the same type(s) of harm 4. Estimating the magnitude and likelihood of the harms occurring as a result of the comparator activity, drawing on the available data 5. Comparing the magnitudes and likelihoods of the harms from the two activities. If the likelihoods and magnitudes of the potential research harms are all lower than the likelihoods and magnitudes of the comparator harms, then the risks of the research intervention do not exceed the risks of the comparator activity, and the intervention qualifies as being of minimal risk.
The example uses the “routine examinations” standard for minimal risk that is stipulated by many research regulations. The given data represent rough estimates. More details on the available evidence can be found in the text.
Given that subjective experiences of negative emotions (e.g., anxiety, distress) are often comparable across different activities, DCA is especially useful for evaluating procedures that pose risks of psychological harm. Therefore, to illustrate how DCA can be used in practice, we use it to assess whether CO2 challenge studies in children and adolescents pose minimal or greater than minimal risks based on the widely endorsed “risks of daily life” standard.
Example: CO2 Challenge in Children and Adolescents
Panic disorder (PD) is a debilitating condition that is characterized by sudden attacks of terror, and affects>6,000,000 people in the United States alone (Kessler et al. 2005; Roy-Byrne et al. 2006). PD presents as various subtypes, with one of the most notable subtypes involving perturbed respiration as manifested by heightened sensitivity to inhalation of CO2-enriched air in the clinical or research setting. Distinct PD subtypes often respond differently to treatment, and novel treatments may be identified through research on subtype-specific perturbations in physiology (Briggs et al. 1993; Klein 1993; Pine et al. 2005a).
Because the first signs of PD often manifest during adolescence, research on physiological risk traditionally focuses on children and adolescents. Studies in this area typically expose participants to CO2-enriched air using the same procedure as is used to identify a respiratory subtype of PD in adults. This involves breathing through a face mask, initially with room air, followed by 5% CO2. The procedure can be stopped at any time by lowering the face mask. In addition, participants are asked to rate their subjective levels of anxiety or fear during the procedure, typically using ordinal Likert Scales.
Because CO2 inhalation offers no prospect of clinical benefit for healthy children, IRBs in the United States can approve its use in healthy children only if it poses minimal risk (United States Department of Health and Human Services 1991). It is, therefore, essential to determine whether CO2 inhalation poses minimal or greater than minimal risk.
Applying Direct Comparative Analysis
Many activities in children's daily life (e.g., being home alone) and routine examinations (e.g., physical examination) pose a risk of anxiety. The anxiety experienced during these activities is often measured in the same way it is measured in the research context, namely by asking children to evaluate their subjective level of anxiety on ordinal Likert scales. DCA, therefore, offers a way to evaluate whether undergoing a CO2 challenge poses minimal or greater than minimal risk, by directly comparing the magnitude and likelihood of the anxiety with the magnitude and likelihood of the anxiety resulting from activities in daily life.
Step 1: Identifying the potential harms posed by the research intervention
Respiratory challenge with 5% CO2 can cause transient anxiety in some children that is associated with a range of symptoms, including shortness of breath, mild chest pain or discomfort, numbness or tingling, and mild headache. The anxiety and most associated symptoms are fully reversible within several minutes of stopping the challenge; mild headaches may last up to several hours. Previous studies report no other adverse events and no long-term adverse effects following a 15 minute 5% CO2 challenge in healthy and at-risk children (Pine et al. 2005b). In particular, there are no reports of exacerbation of ongoing anxiety, emergence of new anxiety symptoms related to participation, or reports of new-onset panic disorder up to 3 years after undergoing a 15 minute 5% CO2 challenge (Pine et al. 2005b). These findings are consistent with data from ∼1000 adults, which show no long-term adverse effects up to 4 years after undergoing 35% CO2 challenge (Gorman et al. 1984, 1988, 1994; Fredholm et al. 2000).
Step 2: Estimating the magnitude and likelihood of the potential harms
The only data of which we are aware regarding the likelihood and magnitude of CO2-induced anxiety comes from previously unpublished data that were collected as part of a study involving 197 children and adolescents, ages 9–20 years (Table 2; other results from the study are published elsewhere) (Pine et al. 2000; Roberson-Nay et al. 2010). Of the 197 participants, 161 were at risk for PD because of a positive family history, and 36 were healthy (i.e., not at risk to the best of the investigators' knowledge). Participants were exposed to a maximum of 15 minutes of CO2-enriched air using the procedure described. They were also asked to rate their state anxiety before, 5 minutes into, and immediately following the CO2 exposure. A 0–10 Likert scale was used for this purpose, with “0” indicating no anxiety and “10” indicating the most severe anxiety the child or adolescent could imagine.
Subjects at risk: Parents diagnosed with the respiratory or cognitive type of panic disorder.
Healthy controls in this study self-reported a mean anxiety score of 1.00 (SD 1.93) after 5 minutes of CO2 inhalation, the point at which the CO2 challenge is fully effective and the anxiety peaks in most individuals. Children or adolescents at risk for PD reported a mean of 1.04 (SD 1.41). These scores did not differ significantly from each other and from the measures at 0 and 15 minutes of the CO2 challenge.
The likelihood of experiencing no anxiety (anxiety score=0) was ∼50–60% in both groups. The likelihood of experiencing moderate to significant anxiety (anxiety score≥4) was 5–10%, with the highest reported scores of 8 out of 10 in one healthy participant, and 6 in three participants at risk for PD. Only two of the participating children and adolescents, one a healthy control and one at risk for PD, ended the procedure before 5 minutes of CO2 inhalation.
Step 3: Identifying an appropriate comparator activity that poses the same type of harm as the research intervention
Average, healthy, normal children ordinarily engage in a number of appropriate activities that pose a risk of anxiety, including taking tests in school, watching the TV news, and receiving recommended vaccinations. Although anxiety is a complex phenomenon, it seems plausible to conceptualize the anxiety associated with these activities to be of the same type of harm as the anxiety associated with undergoing a CO2 challenge. In particular, although anxiety can be caused by a wide range of triggers, the negative, subjective experience is relatively similar across different triggers. This assumption is supported by the literature on the clinical effects of anxiety, which establishes that reports of the level of anxiety experienced during a range of activities predict long-term impairment (Pine et al. 1998), and treatments to mitigate these impairments target the experienced anxiety (Walkup et al. 2008).
Step 4: Estimating the magnitude and likelihood of the harm occurring as a result of the comparator activity
We conducted a literature search using PubMed, Cumulative Index to Nursing and Allied Health (CINAHL), PsycINFO, Scopus, and Google Scholar to identify studies that provide self-report data on 1) the anxiety, worry, fear, or distress that 2) healthy children or adolescents (3–18 years of age) experience in the context of 3) appropriate activities of daily life or recommended routine examinations or procedures using 4) ordinal Likert-type scales. In consultation with the coauthors, one author (E.A.) reviewed the search results, selected the studies that met the inclusion criteria, and extracted the data. The search yielded 22 publications, and we selected data points on 67 daily life activities that we deemed appropriate and 41 routine examinations (Table 3 and Supplementary Tables, Appendices I–III) (Supplementary Tables, are available in the online version of this article at
See Supplementary Tables, Appendices I–III for the full version and additional explanation. Supplementary Tables, are available in the online version of this article at
Data converted from original recordings. See Supplementary Tables, Appendices II and III.
Because the existing studies used different scales, we normalized all ratings to a common scale. We chose a 0–10 Likert scale to allow for comparisons with the data on CO2-induced anxiety. For each scale, we determined the number of segments between no anxiety – set at “0” corresponding to the 0–10 Likert scale used in the CO2 challenge study – and the highest possible anxiety. We then used a simple fraction calculation to normalize the reported mean to a 0–10 Likert scale with 10 segments, and “10” as the number for the highest possible anxiety. For example, in one study children reported a mean anxiety of 1.51 on a five point smiley face, where “1” signified not worried and “5” signified very worried (Howard and Freeman 2007). This scale has four segments. Adjusting the reported mean to a scale from 0 to 4, where “0” signifies no anxiety, yielded a rating of 0.51. To normalize this finding, we calculated (0.51/4 )=(x/10), yielding a score of 1.28. In another study, children were asked how anxious they were when watching TV news. They reported a mean anxiety of 1.5 on a five point Likert scale that ranged from “not at all scared” (0) to “very very scared” (4) (Smith and Wilson 2000). We normalized the finding as above, calculating (1.5/4)=(x/10) to yield a score of 3.75.
The median of the normalized mean anxiety ratings was 3.37 on a 0–10 scale for all identified data. This suggests that the average child experiences overall mild to low-moderate anxiety (3.37 out of 10) during a broad range of appropriate daily life activities and routine examinations and procedures.
Step 5: Comparing the likelihoods and magnitudes of the harm
The mean anxiety resulting from CO2 inhalation in our data set was lower than the mean anxiety reported in all of the 67 appropriate daily life activities. This applied to both the healthy controls and children or adolescents at risk for PD. In addition, the mean anxiety from CO2 challenge was lower than the mean anxiety for 33 of the 41 activities associated with routine examinations. Children's fear of certain aspects of routine dental procedures was lower in two publications (Alvesalo et al. 1993; Peretz and Efrat 2000). However, the same publications and several others reported higher levels of anxiety associated with other aspects of dental visits, suggesting that obtaining dental care overall is likely to exceed the anxiety associated with a respiratory CO2 challenge (Ollendick 1983; Alvesalo et al. 1993; Wong et al. 1998; Peretz and Efrat 2000; Rantavuori et al. 2005; Howard and Freeman 2007; Laing et al. 2009; Nicolas et al. 2010).
Even though the mean anxiety is lower for CO2 challenge, a finding that CO2 challenge exposes a significant number of children and adolescents to significantly greater anxiety than they experience during appropriate activities of daily life activities would suggest that CO2 challenge poses greater than minimal risk. Therefore, in addition to comparing the mean anxiety ratings, we also compared the percentages for children and adolescents who experience more significant anxiety. For CO2 challenge, 8.7% (17/195) children or adolescents in our data set reported anxiety scores of 4/10 or higher, all of which were fully reversible within several minutes of stopping the challenge. In comparison, in the limited number of studies that recorded these data, the percentage of children and adolescents reporting higher anxiety scores from activities of daily life was 7–35%. These activities included watching the news and being vaccinated (Mindell and Barrett 2002; Lacey et al. 2008; van der Molen and Bushman 2008; Nicolas et al. 2010).
In addition, two studies reported a higher mean anxiety rating for two comparator activities – getting immunized and losing a pet – than the highest score recorded in two out of the 195 children undergoing a CO2 challenge in our data set (Neff and Dale 1996; Cohen et al. 2002). Hence, this comparison equally suggests that the risk of experiencing anxiety from a 15 minute challenge with 5% CO2 is lower than the risk of anxiety from activities of daily life and routine examination. Overall, this analysis suggests that respiratory CO2 challenge qualifies as a minimal risk procedure in children and adolescents, under United States regulations (United States Department of Health and Human Services 1991).
The Need for Judgment
Although DCA provides a systematic method to evaluate the risks of research interventions that pose the same types of risk as appropriate comparator activities, it does not eliminate the need for reviewers to use their judgment. First, the risks of some comparator activities may differ in morally significant ways from the risks of research participation. For example, the risks of many daily life activities (e.g., mountain biking) are accepted because the activities offer children and adolescents the potential for direct benefit. As a result, these risks do not necessarily provide an appropriate standard for evaluating the risks of research interventions that do not offer subjects a prospect of direct benefit (Field and Behrman 2004; Wendler 2005).
One way of addressing this concern is to compare the risks of research interventions with the risks of activities that are designed to benefit others. In the present case, we compared the risks of CO2 challenge with the risks of a broad range of daily life activities. Some of these activities seem appropriate for children and adolescents as part of charitable fundraisers that primarily aim to benefit others (e.g,. taking a test, playing sports) (Wendler 2005).
Second, investigators and IRBs must determine that the comparator activity poses a sufficiently similar type of harm. In the present case, we assumed that the anxiety experienced during CO2 challenge was essentially the same type of harm as the anxiety experienced as the result of activities of daily life. Although subjective experiences of psychological harm will typically be comparable across activities, other research harms may not be similar enough for using DCA. In this case, IRBs should use an alternative systematic method for evaluating research risks (Rid et al. 2010).
Third, investigators and IRBs need to use their judgment to decide whether, given the strength and relevance of the available data, they can be confident of the risk assessments. When there is significant uncertainty about the risks of an intervention because the available data are insufficient, IRBs should err on the side of caution and categorize the procedure as entailing more than minimal risk. This is because uncertainty implies large confidence intervals or standard deviations for likelihood estimates of potential research harms, as well as an increased likelihood of previously unknown harms. Under conditions of significant uncertainty, the risks of an intervention are, therefore, likely to be greater than minimal.
Fourth, reviewers need to exercise their judgment to estimate and compare the magnitude and the likelihood of the harms resulting from the research intervention and the comparator activity. For example, for the purposes of ethical evaluation, a mean anxiety rating of 1.4 on a 0–10 Likert scale seems normatively equivalent to a rating of 1.5. Reviewers may also need to balance likelihoods against harms, provided the available data are of sufficient quality. For example, the likelihood of experiencing moderate anxiety may be somewhat greater in research than in the comparator activity, whereas the likelihood of experiencing serious anxiety may be significantly lower. In that case, reviewers need to judge whether the decrease in the one risk compensates for the increase in the other.
Finally, although important to protecting pediatric research subjects, a finding that a research intervention poses minimal risk does not necessarily imply that it is acceptable. IRBs and others also need to ensure that additional ethical requirements are satisfied, including obtaining assent and fair subject selection (Emanuel et al. 2000). With regard to the present case, it is important to assess whether it is ethically appropriate for researchers to intentionally provoke negative symptoms in participants, even when the risks are minimal. To address this question, investigators and IRBs should consult the literature on the ethics of intentionally provoking some negative symptoms in participants (Miller and Grady 2001; Hope and McMillan 2004).
Practical Implementation
Given the relative complexity of making systematic risk evaluations, it may make most sense for dedicated committees with the expertise and access to the relevant data to use DCA to develop default risk determinations for common research interventions (Rid et al. 2010). Interventions that primarily pose risks of psychological harm (e.g., completing anxiety-provoking questionnaires, undergoing an MRI) would be ideal candidates. IRBs could then consider whether there is anything about the study or population in question that implies that the risks might be greater than the default judgment suggests.
Conclusions
Determining whether research interventions pose minimal or greater than minimal risk is essential for protecting pediatric research participants while allowing acceptable research to proceed. Many regulations and guidelines define minimal risks in comparison to the risks posed by the activities of daily life and routine examinations. However, there are few systematic methods available to systematically compare the risks of research procedures with the risks posed by these activities.
DCA provides a method to make these comparisons when the research risks are of the same type as the risks of activities of daily life. As a result, DCA is particularly valuable for assessing procedures that primarily pose risks of psychological harm, such as anxiety. This suggests that DCA is especially valuable for ensuring that pediatric research subjects are not exposed to excessive risks, while allowing important and appropriate research in pediatric psychiatry and psychopharmacology to proceed and benefit future patients.
Footnotes
Acknowledgments
The authors thank Frank Miller (Department of Bioethics, National Institutes of Health [NIH] Clinical Center) for connecting them to each other, and Karen Smith (NIH Library) and Lorelei Woody (The Taubman Health Sciences Library, University of Michigan) for support with the literature search. These individuals received no compensation for their contributions.
Disclosures
No competing financial interests exist. Annette Rid received financial support from the University of Zurich (Nachwuchsförderungskredit) for this project. The sponsor had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; and preparation, review, or approval of the manuscript.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
