Open Science and Single-Case Design Research

Abstract

Research indicating many study results do not replicate has raised questions about the credibility of science and prompted concerns about a potential reproducibility crisis. Moreover, most published research is not freely accessible, which limits the potential impact of science. Open science, which aims to make the research process more open and reproducible, has been proposed as one approach to increase the credibility and impact of scientific research. Although relatively little attention has been paid to open science in relation to single-case design, we propose that open-science practices can be applied to enhance the credibility and impact of single-case design research. In this article, we discuss how open-science practices align with other recent developments in single-case design research, describe four prominent open-science practices (i.e., preregistration, registered reports, data and materials sharing, and open access), and discuss potential benefits and limitations of each practice for single-case design.

Keywords

single subject research methodology evidence-based practice

Scientific research should yield credible (i.e., reliable and valid) findings that can be used to inform practice and policy. However, large-scale replication projects indicate research findings may not reliably reproduce when studies are independently replicated (see Fidler & Wilcox, 2018), precipitating a crisis of confidence in contemporary science (Baker, 2016). Many factors underlie the perceived crisis, including replication failures; publication bias, which results in fewer studies with null effects being published; undisclosed flexibility in conducting and reporting research (e.g., researchers can analyze data in multiple ways but only report the analyses yielding desired results); and the inaccessibility of a large portion of the research base that lies behind publisher paywalls (see Piwowar et al., 2018). Vazire (2018) framed the reproducibility crisis as an opportunity for a credibility revolution and advocated the adoption of open-science reforms to enhance the credibility and impact of scientific research.

Open science aims to promote transparency, openness, and reproducibility in research to address issues related to irreproducible study results, undisclosed researcher flexibility, publication bias, and inaccessible research for the ultimate purpose of improving the credibility and impact of research (Stracke, 2020). Although issues related to the reproducibility crisis and open-science reforms have largely focused on research using group-contrast designs, single-case designs (SCDs) may face related challenges to which open-science practices can be fruitfully applied. In this article, we examine critical characteristics of and recent developments in SCD, review prominent open-science practices, examine whether and how open-science practices might be applied in SCD, and consider the potential benefits and challenges of doing so.

Defining Characteristics of SCD Research

A primary purpose of SCD research is to establish functional or causal relationships between independent and dependent variables for individual cases, with the ultimate aim of positively affecting real-life outcomes (Kazdin, 2011). Studies employing SCD methods share critical features to fulfill this purpose, including research questions and experimental design features. Research questions in SCD examine whether changes in conditions are causally related to changes in behavior (e.g., Does self-monitoring improve on-task behavior of elementary students?). Researchers may use group-comparison studies to address similar research questions. However, circumstances and foci sometimes require alternative experimental approaches to group-contrast methods. For instance, there may be too few individuals with a low-incidence disability to make a group-contrast study possible. Perhaps more fundamentally, changes in outcomes for specific individuals may be more important to examine than differences between groups of individuals. If individual outcomes are more important than average outcomes, then SCD research methods are warranted.

SCD methods rely on several critical features, including repeated and reliable measurement of dependent variables (i.e., behavior) over time, repeated manipulation of the experimental conditions (e.g., systematically introducing or withdrawing an intervention), and examining data using a within-participants (not between-participants) analysis of participants’ behavior (i.e., each case is its own control; Johnston & Pennypacker, 1992; Kazdin, 2011; Tawney & Gast, 1984). Additional features include a focus on observable behavior, documentation of characteristics of independent variables (i.e., fidelity), focus on a small number of cases, and visual analysis of graphed data. Adhering to these core principles has allowed SCD researchers to address a range of research questions and examine whether functional relations exist between independent variables and socially valid changes in dependent variables. Although the fundamental purpose and features of SCD have remained consistent, researchers have recently advanced SCD methods with the intent of increasing the credibility and impact of the SCD research base and using it to generalize the effects of interventions to populations, not just individual participants.

Recent Developments in SCD Research

Practices and interventions are considered evidence based when they are supported by bodies of credible experimental research showing they reliably cause improved outcomes for a population (Cook et al., 2020). SCDs have played a prominent role in identifying evidence-based practices and interventions in many fields (Shadish et al., 2015). In this section, we describe innovations for using SCD within an evidence-based practice framework, which are consistent with and may serve as antecedents for open-science reforms in the field.

Inductive and Deductive Approaches to SCD

As proposed by Johnson and Cook (2019), SCD researchers have varied perspectives regarding the purpose and conduct of SCD research, ranging from deductive-static to inductive-dynamic approaches. The inductive-dynamic perspective is closely aligned with the core principles of applied behavior analysis, including the primacy of achieving socially significant outcomes for the participant (Wolf, 1978). Researchers who are more on the inductive-dynamic side of the continuum primarily seek to identify contingencies controlling a participant’s behavior rather than examine the effects of a specific intervention on a specified outcome. As such, these researchers may modify independent and dependent variables during the study if the (a) intervention does not result in therapeutic and observable change of the dependent variable or (b) the participant begins exhibiting another behavior requiring immediate attention, respectively.

In contrast, SCD researchers who utilize more deductive-static perspectives maintain the independent and dependent variables throughout their study to test an a priori hypothesis (Johnson & Cook, 2019). For example, a researcher may aim to understand whether an intervention affects participant responding in ways consistent with prior research. Such research may be considered deductive because it poses an a priori hypothesis regarding the effects of an intervention, a study is designed to test the hypothesis, and then data are collected and evaluated. The defensibility of findings of such SCD studies requires the possibility of the intervention not causing improved outcomes (i.e., non-effects). SCD studies utilizing a deductive-static approach seem appropriate when examining the effectiveness of interventions and potentially establishing those interventions as evidence-based practices.

Masked Visual Analysis and Randomization

SCD researchers have traditionally used a response-guided approach to inform decisions made during the course of a study, such as when to initiate intervention (Johnston & Pennypacker, 2009). Although the intent of researchers making decisions in response to data is to reduce ambiguity associated with variable response patterns, they may not always be fully reported (perhaps due to page limits), can be influenced by knowledge of environmental contexts (e.g., waiting to intervene until the day after a class party), and can influence study findings (e.g., increase the likelihood of data supporting a functional relationship; Allison et al., 1992). Similarly, procedures for conducting visual analysis, the primary approach for examining the presence of a functional relationship, are often not clearly reported and inherently involve some level of subjectivity (Ninci et al., 2015). Some researchers have proposed randomization and masked visual analysis to increase the transparency and reproducibility of SCD decision-making and data analysis.

Randomization can be used in SCD research to determine the order of participants to receive the intervention, the session number at which to intervene, and whether the participant will receive the intervention or baseline condition first—thereby making these decisions fully transparent and objective (Edgington & Onghena, 2007). Masked visual analysis uses randomization in combination with an independent researcher who is not aware of conditions and other study details to make decisions about phase changes (Ferron & Jones, 2006). For instance, in a multiple baseline design, a masked visual analyst inspects graphed data in an ongoing, response-guided manner. Once stability is reached for participants in baseline, the visual analyst instructs the research team to randomly select a participant to receive the intervention. Following implementation of the independent variable to the selected participant, the masked visual analyst—unaware of which participant has received the intervention—continues to review outcome data until there is evidence of a meaningful change in outcomes for one participant, at which point they instruct the research team to randomly select a second participant to receive the intervention, and so on. Masked visual analysis guards against researchers capitalizing on their intimate knowledge of the study by (a) reducing the likelihood of researchers introducing the intervention based on a brief period of stable responding not representative of the participant’s true performance, and (b) providing an objective and transparent means for determining when and with whom to intervene.

Randomization (with or without masked visual analysis) also allows for using a widely accepted standard for estimating the probability of rejecting the null hypothesis (statistical significance testing), rather than relying solely on visual analysis. Applicable to a wide range of SCDs, randomization testing establishes the context for statistical permutation tests to evaluate intervention effects (Ferron & Levin, 2014). When using randomization testing, researchers construct a distribution of all possible outcomes for the test statistic to compare with the observed performance of the participants (Todman & Dugard, 2001). The resulting p value represents the probability of obtaining the observed data given the distribution of possible outcomes if there was no actual improvement during the intervention compared with the baseline phase (i.e., the null hypothesis). Although masked visual analysis and randomization testing provide opportunities for researchers to investigate interventions with increased transparency in decision-making and analyses, response-guided approaches and visual analysis remain important features in SCD.

Meta-Analysis and Effect Sizes

Methodological advances in meta-analyzing SCDs have focused on supporting credible causal relations in the SCD literature used to establish generally effective, evidence-based practices (Shadish et al., 2015). The inductive tradition of single-case research, as well as core design features such as individuals as the unit of analysis, repeated measures, and reliance on visual analysis, present barriers to meta-analyses. In response, researchers have developed meta-analytic methods to measure the magnitude of treatment effects in SCD research. Although multiple SCD effect sizes have been proposed (Maggin et al., 2019), some consensus is developing in support of effect sizes for generalized inferences (Shadish et al., 2015) that represent the average magnitude of change across study cases in units comparable with those used in experimental group research (Barton et al., 2017; Odom et al., 2018).

Non-Effects

SCD researchers have traditionally provided evidence for internal validity via study results (Johnston & Pennypacker, 2009). Internal validity is evident when changes in participant outcomes reliably coincide with the manipulation of an independent variable. This inductive approach makes it difficult to interpret SCD study results when changes in the independent variable do not correspond reliably with changes in the dependent variable (i.e., non-effects; Tincani & Travers, 2018). Thus, few SCDs with non-effects have been published (Kittelman et al., 2018; Therrien & Cook, 2018), and SCDs have not traditionally been used to identify ineffective interventions or the boundaries of effective interventions (Kratochwill et al., 2018; Tincani & Travers, 2018). The scarcity of published SCDs with non-effects may undermine the credibility of syntheses and meta-analyses of the published SCD research base as well as efforts to establish evidence-based practices using SCD research.

Tincani and Travers (2018) suggested SCD researchers design their experiments to allow for the possibility of valid findings of non-effects. For example, SCD researchers may use specific designs (e.g., alternating-treatments, A-B-C-B-C designs) to both (a) establish a functional relationship between an independent and dependent variable and also (b) permit claims about whether another intervention was partially or entirely ineffective (Ledford & Gast, 2018). Another approach entails using a well-established design and addressing critical indicators of internal validity for SCDs (e.g., document intervention intensity; establish adequate treatment integrity, control condition integrity, and procedural fidelity; conduct a moderator analysis). To meaningfully test the effectiveness of interventions and contribute to establishing generally effective interventions, deductive-static approaches to SCD research should allow for the possibility of valid findings of non-effects (Kratochwill et al., 2018; Shadish et al., 2016).

Collectively, these developments in SCD apply a deductive-static perspective to enhance the ability of SCD researchers to establish the generalizable effectiveness of interventions. In the following section, we describe open-science reforms and the issues in the research enterprise they aim to address, and whether and how prominent open-science practices might build on the developments described in this section to advance the credibility and impact of the SCD research base, especially in relation to establishing generally effective practices using a deductive-static orientation.

Open Science

Issues in Contemporary Research

Over the past decade, there have been multiple reports about the limited reproducibility of scientific research (e.g., Open Science Collaboration, 2015), raising concerns about the credibility of scientific claims. Evidence suggests relatively few replication studies are conducted, and when independent replications are conducted, they often fail to reproduce the results reported in the original studies (Makel et al., 2016; Makel & Plucker, 2014). One factor underlying low levels of reproducibility may be pressures on researchers to obtain positive results to publish their studies in influential journals (Nosek et al., 2012). Such pressures may foster publication bias and other questionable research practices, resulting in irreproducible study findings and research bases biased toward positive findings (Giner-Sorolla, 2012).

Indeed, evidence suggests publication bias—or the over-representation of studies with positive findings in the published literature—exists in many research bases (Dwan et al., 2013; Ferguson & Heene, 2012; Vera-Badillo & Riechelmann, 2018), including in education and special education (Chow & Ekholm, 2018; Gage et al., 2017; Polanin et al., 2016). Accordingly, null findings in the published research base are exceedingly rare (Fanelli, 2012). At the study level, non-transparent researcher flexibility may promote unreliable, irreproducible results (Simmons et al., 2011). To obtain desired findings, researchers have the flexibility to engage in, but not fully report, a variety of questionable practices. For example, researchers have reported engaging in data peeking (i.e., checking findings throughout the study and stopping data collection when desired results are attained), p-hacking (e.g., running different analyses until desired results are attained), selective outcome reporting (i.e., selectively reporting statistically significant results), and hypothesizing after results are known (i.e., HARKing; Fraser et al., 2018; John et al., 2012; Makel et al., in press). Engaging in these types of practices markedly increases the likelihood of false positive results (Simmons et al., 2011). To be clear, there is nothing intrinsically wrong with exploring multiple approaches to analyzing data or generating a hypothesis from one’s data—what Nosek et al. (2018) termed postdictive research. Such practices become questionable when they are not made transparent.

A related issue is the relative inaccessibility of most published research, which typically lies behind paywalls and is accessible only to those willing and able to pay (Piwowar et al., 2018). The inaccessibility of research to practitioners is especially problematic in applied fields, like special education, and likely contributes to the long-standing gap between research and practice. Simply put, practitioners cannot be expected to inform their practice with research they cannot access.

Open-Science Practices

Open-science reforms have been proposed to help address irreproducible research findings, publication bias, undisclosed questionable research practices, and inaccessibility of research findings by making research more open, transparent, and reproducible—with the ultimate goal of improving the credibility and impact of scientific research. In this section, we briefly discuss four prominent open-science practices: preregistration, registered reports, open data and materials, and open access.

Preregistration (Gehlbach & Robinson, 2018; Nosek et al., 2018) involves researchers publicly registering key study elements—such as research questions and hypotheses, variables, design, and analyses—before conducting the study. Preregistration allows for a clear distinction between (a) predictive or confirmatory research (i.e., using predetermined procedures to test a priori hypotheses) and (b) postdictive or exploratory research (i.e., analyzing a data set in multiple ways to explain latent variability). Preregistration does not preclude exploratory analyses, but rather provides a transparent basis for determining when researchers are conducting them. Preregistration enables reviewers and other research consumers to examine whether researchers may have engaged in questionable research practices such as HARKing, selective outcome reporting, and p-hacking by identifying unexplained discrepancies between posted study plans and research reports. Perhaps by making questionable research practice discoverable, preregistration is associated with smaller—and potentially more credible—study effects (Kaplan & Irvin, 2015; Schäfer & Schwarz, 2019).

Registered reports (Chambers, 2019; Cook et al., in press) involve researchers submitting study plans for peer review before conducting the research. In Stage 1 review, researchers submit prospective introduction and method sections, and reviewers provide constructive feedback to improve the prospective study. Based on the importance and methodological rigor of the study, the Stage 1 manuscript may be granted in-principle acceptance for publication. Then, after conducting the study, researchers write their results and discussion sections and resubmit the final report for Stage 2 review. Reviewers then evaluate the paper for adherence to study plans (deviations should be identified and explained) and completeness of reporting. Importantly, the study cannot be rejected due to direction, significance, or magnitude of effects. Registered reports guard against undisclosed questionable research practices such as HARKing, p-hacking, and selective outcome reporting by removing the incentive of generating positive findings to increase the likelihood of publication. Indeed, the only way for a Stage 1 proposal with in-principle acceptance not to be accepted at Stage 2 is if the researchers do not report and justify modifications to the approved plans. Moreover, registered reports help address publication bias because publishability is based on methodological rigor and determined before results are known. Like preregistration, registered reports are associated with smaller, and perhaps more credible, study effects (Allen & Mehler, 2019; Scheel et al., 2020).

Data and materials sharing are at the heart of research transparency and open science. Researchers who make study data openly accessible (e.g., by posting on a repository) provide the opportunity for others to independently verify, replicate, and extend study findings (Adelson et al., 2019; Levenstein & Lyle, 2018). Thus, data sharing can enhance the credibility and reproducibility of findings by allowing others to check for errors in data entry, data analyses, and reporting of findings. In addition, open data are associated with greater citation rates and may therefore improve scholarly impact (McKiernan et al., 2016; Piwowar & Vision, 2013). Materials sharing involves publicly posting information and resources (e.g., survey items, details on intervention implementation) not included in published manuscripts due to length restrictions. Materials sharing can be critical for improving the ability of other researchers to replicate studies.

Open access subsumes multiple approaches for making research freely available to research consumers. For example, researchers can pay an article-processing charge to make articles freely accessible in gold- (journals in which all articles are open access) and hybrid- (traditional journals in which most articles are behind a paywall but specific articles are open to all) access journals. Researchers can also provide open access to their research by posting an author-formatted version of a paper before it is submitted for publication (i.e., preprint) to a preprint archive (e.g., www.edarxiv.org; Fleming & Cook, in press). Open-access and preprinted studies appear to increase the effects of research, as these papers receive more citations and social media attention than those behind paywalls (Fu & Hughey, 2019; Piwowar et al., 2018). Preprints may also help address publication bias because researchers can post and make discoverable any study, regardless of direction of findings.

Concerns about the reproducibility and transparency of research findings, as well as the promotion of open-science reforms to address these concerns, have focused primarily on group-design research, with relatively little attention paid to whether and how such issues may apply in SCD (Kratochwill et al., 2018). However, as discussed in the following section, these issues may have important implications for and applications in SCD.

Open Science and SCD Research

Issues in Contemporary SCD Research

Although the reproducibility crisis has received relatively little attention in the SCD research community, SCD researchers may also experience contingencies reinforcing positive effects. As Kratochwill et al. (2018) noted,

Some part of the SCD research community may have an explicit ethos that an intervention must produce a functional relationship . . . for a study to proceed through the publication process. If so, publication bias in SCD research is typically driven by finding/producing effects rather than by the common “statistical significance” criterion that is characteristic of conventional “group” research. (p. 69)

In other words, although SCD researchers may not have a bright-line test of statistical significance (p < .05) to evaluate the effects of their research, they may nonetheless perceive studies demonstrating functional relations as more publishable.

Indeed, Shadish et al. (2016) reported that SCD researchers are significantly less likely to recommend a study for publication if it has small as opposed to large effects. Although Shadish and colleagues’ survey described a “very well done” A-B-A-B study involving three cases (p. 660), when the effect was small, data overlap high, and data variability low, the average rating for recommending the paper for publication was between somewhat and very unlikely. In contrast, when the effect was large, data overlap low, and data variability low, the average rating was between somewhat and very likely to recommend publication (see also Mahoney, 1977). Strong intervention effects enhancing the likelihood of publication among SCD researchers raise the possibility that publication bias and undisclosed questionable research practices may exist in and undermine the credibility of claims based on SCD research.

When Sham and Smith (2014) compared the effects of pivotal response therapy for children with autism reported in unpublished dissertations and published studies, average percentage of nonoverlapping data (PND) for the dissertations (k = 10) was 47%, compared with 69% for published studies (k = 21)—a large (d = 3.93) and statistically significant difference. Both published and unpublished studies were generally high quality and involved similar participants and interventionists. The authors concluded disparity in effects may, then, “be indicative of publication bias such that studies with favorable outcomes are more likely to be published than studies that are less propitious” (p. 674). See Kratochwill et al. (2018) for further discussion of publication bias in SCD research. Consistent with findings suggestive of publication bias, non-effects are rarely found in the published SCD literature. For example, Therrien and Cook (2018) reviewed intervention studies published between 2012 and 2017 in three research journals in the field of learning disabilities; none of the 23 SCD studies reported non-effects across all cases (see also Kittelman et al., 2018).

SCD researchers traditionally make important decisions after examining the data, such as how many data points to collect in each phase (i.e., response-guided methods). These decisions may be influenced, perhaps unintentionally, by aspirations to demonstrate positive effects and may not be reported transparently. Selective reporting of these post hoc decisions might be considered a questionable research practice, especially when SCD researchers are testing a priori hypotheses using a deductive-static approach. Some SCD researchers may change the hypotheses, independent variable, number of participants, dependent variables, tools for visually analyzing data, number and type of effect sizes, and other study elements during the course of a study in ways affecting study outcomes. If such changes are not fully reported, then readers may be left with the impression the intervention was more effective than it actually was. Indeed, in their survey of SCD researchers, Shadish et al. (2016) found “a non-trivial minority” (p. 656; 5%–14% depending on the condition) of respondents reported they would drop one or two cases with small effects before submitting a study for publication. Some participants noted they perceived results needing to be positive to warrant publication, and although they felt dropping a case was unethical, one participant noted “it is done quite frequently” (p. 666).

Published SCD studies, like other types of research, are stored behind paywalls and are largely inaccessible to many potential research consumers. Thus, it appears issues related to publication bias, undisclosed questionable research practices, and lack of accessibility may also exist in the SCD literature (Johnson & Cook, 2019; Kratochwill et al., 2018). To the degree these issues are common, they may pose threats to the credibility and impact of SCD research. Open science—and specifically preregistration, registered reports, data and materials sharing, and open access—may help address these issues in SCD research. As summarized in Table 1, we briefly describe how and why SCD researchers can engage in these key open practices.

Table 1.

Action Steps Needed to Engage in Open-Science Practices in Single-Case Design Research.

Practice	Researcher actions	External resources
Preregistration	• Prior to conducting study, publish research questions, hypotheses, target participant criteria and recruitment strategies, independent and dependent variable details, plan for executing phase changes, and plans for examining a functional relation and estimating effect sizes (as relevant) • Conduct study, updating preregistration with any modifications (and rationales for those changes) made prior to study completion • Write manuscript, citing preregistration and detailing any changes made from what was described in the preregistration and reasons for those changes	Registry of Efficacy and Effectiveness Studies https://sreereg.icpsr.umich.edu/sreereg/
Registered Reports	• Submit study introduction and prospective method sections to journal for peer review prior to conducting the research • Once Registered Report receives in-principle acceptance from journal, conduct study as proposed • Add results and discussion to accepted Stage 1 manuscript and resubmit the complete paper to journal • If authors closely adhere to approved methods, paper guaranteed journal acceptance	Center for Open Science Registered Report Primer https://www.cos.io/initiatives/registered-reports Journals that accept Registered Reports https://www.cos.io/initiatives/registered-reports
Data and Materials Sharing	• Write study IRB protocol to allow raw data to be shared • Once study is complete, upload data, code book, and study materials (e.g., intervention protocols) to a data repository	Data Repositories: Open Science Framework https://osf.io/dashboard Inter-University Consortium for Political and Social Research (ICPSR) https://www.icpsr.umich.edu/web/pages/
Open Access and Preprints	• Publish your paper in a gold or hybrid open-access journal, which involves paying a processing fee, and/or publish a preprint of your manuscript by posting manuscript to a preprint server prior to submission to journal	Directory of Open Access Journals https://doaj.org/ Education preprint archive https://edarxiv.org/

Note. IRB = Internal Review Board.

Preregistration

Preregistration in SCD is predicated on the assumption that transparently delineating research questions, design, and procedures before conducting a study increases credibility in the resulting findings by making questionable research practices more readily discoverable (Nosek et al., 2018). Johnson and Cook (2019) provided initial recommendations for specific components of SCD studies to preregister, with differential guidance for some points depending on the theoretical approach of the research team. The Registry of Efficacy and Effectiveness Studies (Society for Research on Educational Effectiveness, 2020) also provides a protocol for preregistering SCD studies. In the absence of consensus regarding how to preregister SCD studies, we provide a brief overview of some critical considerations.

Research questions should be preregistered, regardless of one’s theoretical viewpoint, although the presence of accompanying hypotheses is subject to the researcher’s view of their necessity and utility. The form of these hypotheses may be informed by the suggestions of Shadish and colleagues (2015), who argued that SCD is a process of complex hypothesis testing. Relevant participant characteristics should also be described to the extent possible. Because preregistration typically occurs before participant recruitment, specific characteristics may not yet be known. Thus, researchers might initially preregister the intended number of participants and their age ranges, and then update the preregistration with specific information as soon as participants are recruited. This provides protection against manipulation of study participants and corresponding results by a researcher. Unfortunately, removing cases with non-effects appears to be a practice engaged in by some SCD researchers and recommended by some reviewers (Shadish et al., 2016).

SCD researchers will likely be able to preregister information about research questions and participants regardless of their theoretical orientation. However, other components of a study’s preregistration will vary depending on researchers’ positions along the deductive-static to inductive-dynamic continuum. In general, researchers who are closer to the deductive-static end of the continuum should be able to preregister a majority of their design’s components, whereas researchers on the inductive-dynamic end will likely preregister their intended procedures for determining these components (Johnson & Cook, 2019). Preregistrations can then be updated once these decisions are made.

If a researcher designs a deductive-static study to examine the effect of a specific intervention on a specific outcome, then they will likely be able to preregister the core components, or active ingredients, of the intervention they plan to implement, as well as their measurement methods, operational definitions for the outcome variable, and expected effects of the intervention (e.g., will effects be immediate and large, or gradual and moderate?). On the contrary, a researcher may partner with a school to work with a student with aggressive behaviors, operationally define a target behavior after observations, identify the behavior function using a functional analysis, and then implement a function-based intervention based on the results of the functional analysis. In such an instance, rather than preregister specific independent and dependent variables, the researcher could preregister the decision-making procedures used for determining them, and then update the preregistration as soon as those decisions are made.

Finally, the timing of phase changes is one of the most fundamental decisions made by researchers during an SCD study. Response-guided decision making for phase changes is likely the norm for most SCD researchers, wherein a steady state should be observed before implementing an intervention. However, other more transparent methods are available for determining phase changes, including randomization, pre-determining a number of data points prior to implementation, and masked visual analysis (e.g., Kratochwill & Levin, 2014). Therefore, preregistration of phase-change rules should reflect the information available to the researcher prior to implementation. If, for example, a specific number of data points will be used in each phase, then that number should be preregistered. If a process for identifying a steady state is used, then this process should be carefully described in the preregistration.

Registered Reports

Although registered reports are becoming an increasingly common publication format (Chambers, 2019; Cook et al., in press), we are not aware of any published examples of SCD registered reports. As in group research, registered reports can help guard against undisclosed questionable research practices and publication bias in SCD by initially evaluating and conditionally accepting manuscripts prior to data collection. Thus, likelihood of publication is not enhanced by demonstrating positive results (e.g., a strong functional relation) but rather depends on conducting a well-designed, internally valid study as proposed. As such, SCD researchers taking a deductive-static perspective in their research should specify in Stage 1 manuscripts what design features and procedures are used to enhance confidence in the internal validity of a study in the absence of a functional relation (see Tincani & Travers, 2018).

Norms and practices for Stage 1 SCD manuscripts will likely vary along the deductive-static to inductive-dynamic continuum. Manuscripts for SCD studies using a more deductive-static approach can be written and reviewed similarly to group-research registered reports because researchers aim to adhere to a specific protocol as closely as possible in both. Thus, Stage 1 manuscripts should be as specific as possible, and Stage 2 manuscripts should fully describe and justify any changes that occurred. However, for registered reports of more inductive-dynamic SCD studies, Stage 1 manuscripts will need to be written and reviewed based on transparent decision-making protocols for aspects of the study not fully determined a priori. For example, some researchers may wish to manipulate the independent variable actively, depending on the responsiveness of individual participants. This will require researchers to develop and describe a clear decision-making protocol of when and how to modify the independent variable. Adherence to this decision-making framework, in turn, will be a determining factor for final acceptance and publication of the Stage 2 manuscript. Whereas studies from a more deductive-static perspective are likely always suitable for registered reports, researchers may be unable to describe the decision-making procedures thoroughly for some inductive-dynamic SCD studies a priori, making meaningful Stage 1 evaluation by reviewers difficult, if not impossible.

Data and Materials Sharing

Data from published SCD research can be electronically extracted from graphs with high reliability (Drevon et al., 2017), assuming one has access to the publication, which can then be used by other researchers to compute effect sizes, synthesize, and meta-analyze (Maggin et al., 2011). However, sharing raw data saves time associated with data extraction, eliminates occasional errors in extraction, and provides study data to those who cannot access the publication. Given the evolving and sometimes complex estimation of SCD effect sizes, we also recommend sharing analytic code used to calculate effect sizes in the published article. In addition, investigators can share other types of non-graphed data summarized in SCD studies (e.g., interobserver agreement data, social validity ratings, fidelity data). Sharing materials, including detailed intervention protocols and methods of data collection, is critical for enabling meaningful replication of SCD studies by independent researchers, and for applying the intervention with fidelity in practice. SCD researchers using more inductive-dynamic perspectives might share detailed information about why and how response-guided decisions were made, making this aspect of SCD research more transparent and replicable.

Open Access and Preprints

Removing paywalls that block access to most published SCD research—whether by publishing in a gold-access journal, paying an article-processing charge in a hybrid journal, or posting a preprint—enables practitioners and other stakeholders who do not otherwise have access to a publication to read and apply its findings. Providing open access to SCD research to broaden its impact and application is consistent with the strong emphasis on social validity in SCD research. Similarly, posting SCD research as a preprint allows research consumers access to research findings immediately, without having to wait for potentially multiple rounds of peer review and formatting by a journal. Although the lack of peer review for preprints is an important concern, open access may help address publication bias. Given potential bias toward publication of positive results in SCD (Shadish et al., 2016), researchers may not consistently write up internally valid SCD studies without a functional relationship because such studies may be perceived as unlikely to be published. Posting internally valid SCD research with small and non-effects as preprints, rather than relegating it to the file drawer, enables SCD researchers to openly disseminate their work so it can (a) inform research, policy, and practice; (b) help combat publication bias (by making these studies discoverable to research consumers and meta-analysts); and (c) garner citations.

Challenges to Applying Open Science in SCD Research

Some challenges to open-science practices apply across research designs. For example, open-science reforms generally require additional time and effort, and different workflows than researchers are accustomed to (Cook et al., 2018). For instance, thinking through and writing up preregistrations and Stage 1 manuscripts for registered reports before studies are undertaken will require additional time and a change in routines for many researchers—especially for SCD researchers, who often are accustomed to making and modifying plans during research. Similarly, posting data sets, study materials, and preprints takes time and effort, especially the first time a researcher engages in the practice. Finally, it is important the confidentiality of study participants is strictly protected when sharing data, regardless of study design. This may involve not fully reporting the location of the study and some participant characteristics, especially for individuals with low-incidence disabilities who often participate in SCD studies.

Other challenges to open-science practices exist across research designs, but they may be particularly acute for SCD researchers. For example, one important concern about the use of preprints, regardless of research design, is they are typically not peer reviewed. This means deeply flawed and misleading research can be posted as preprints. Research consumers must critically evaluate the methodological rigor of preprints. This may be especially problematic for SCD research because it typically involves an intuitive approach for evaluating intervention effects (i.e., visual analysis). Thus, it seems possible that SCD preprints are more likely than group experiments to be accessed and applied by individuals without the advanced research training needed for critical evaluation and interpretation of study results.

Finally, some challenges may apply in unique ways to SCD. Preregistration may be especially difficult for SCD researchers who take a more inductive-dynamic approach to their research. For instance, if researchers intend to determine the intervention to be applied based on the results of a functional analysis, they would need to preregister how the functional analysis will be conducted and how its findings will be applied in developing the intervention. The preregistration would then need to be updated once the functional analysis is completed and the intervention selected (Johnson & Cook, 2019). Similarly, norms and guidelines do not yet exist for writing and reviewing Stage 1 SCD manuscripts for registered reports. SCD researchers will have to grapple with how to provide reasonable evidence of internal validity without graphed data (see Maggin et al., 2020, for ideas on doing this, including providing graphs of hypothetical data). In addition, registered reports may not be a viable option for SCD researchers intervening on severe behavior (e.g., aggression to others), as the intervention would be delayed by the Stage 1 review process. Most broadly, SCD researchers who take an inductive-dynamic perspective may have strong concerns about the wisdom of specifying important study decisions in the absence of outcome data, as is done in both preregistration and registered reports.

Conclusion

The fundamentals of SCD research have remained relatively consistent over time: repeatedly measuring outcomes across researcher-manipulated conditions to examine whether manipulating an independent variable causes socially valid changes in dependent variables. Yet SCD has also evolved, with recent developments aiming to increase transparency and reproducibility for the purpose of using SCDs to identify generally effective, or evidence-based, practices. We believe open-science practices can be applied in SCD to support and extend recent methodological developments intended to enhance the credibility of claims derived from SCD research.

We are optimistic about the potential for open-science reforms to enhance the credibility and impact of knowledge claims based on SCD research, both within and beyond special education. However, we recognize the relevance and value of open-science practices may be viewed skeptically by some SCD researchers. For example, we expect many researchers who take a primarily inductive-dynamic approach to SCD may view specifying key design elements in the absence of participant outcomes (i.e., preregistration, registered reports) as counterproductive. Moreover, given the important differences between SCD and group-design research, we expect that open science in the SCD field will evolve and be applied in unique ways. For SCD researchers who perceive potential benefits of applying open-science practices, we recommend a gradual approach—starting small by gathering information and resources, trying one practice, and eventually adopting multiple practices (Adelson et al., 2019; Cook et al., 2018). Like other reforms, it is unlikely open science will be adopted and sustained without both bottom-up and top-down supports. Therefore, we recommend that the SCD research community consider how to develop resources and infrastructure to support open science, provide experiences to make it easy, develop community norms to make it expected, provide incentives to make it rewarding, and—perhaps eventually—institute policies to make it required (Nosek, 2019).

Footnotes

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iDs

Bryan G. Cook

Erin E. Barton

Jason C. Travers

References

Adelson

J. L.

Barton

E. E.

Bradshaw

C. P.

Bryant

B. R.

Bryant

D. P.

Cook

B. G.

Coyne

M. D.

deBettencourt

DeHaven

A. C.

Dymond

S. K.

Esposito

Farmer

T. W.

Flake

J. K.

Gage

N. A.

Kennedy

M. J.

Kern

Lane

K. L.

Lee

D. L.

Lembke

Levine

F. J.

(2019, February 18). A roadmap for transparent research in special education and related disciplines. https://doi.org/10.31219/osf.io/sqfy3

Allen

Mehler

D. M.

(2019). Open science challenges, benefits and tips in early career and beyond. PLOS Biology, 17(5), Article e3000246. https://doi.org/10.1371/journal.pbio.3000246

Allison

D. B.

Franklin

R. D.

Heshka

(1992). Reflections on visual inspection, response guided experimentation, and Type I error rate in single-case designs. Journal of Experimental Education, 61(1), 45–51. https://doi.org/10.1080/00220973.1992.9943848

Baker

(2016). Reproducibility crisis? Nature, 533(26), 353–366. https://doi.org/10.1038/533452a

Barton

E. E.

Pustejovsky

J. E.

Maggin

D. M.

Reichow

(2017). Technology-aided instruction and intervention for students with ASD: A meta-analysis using novel methods of estimating effect sizes for single-case research. Remedial and Special Education, 38(6), 371–386. https://doi.org/10.1177/0741932517729508

Chambers

(2019). What’s next for registered reports? Nature, 573, 187–189. https://doi.org/10.1038/d41586-019-02674-6

Chow

J. C.

Ekholm

(2018). Do published studies yield larger effect sizes than unpublished studies in education and special education? A meta-review. Educational Psychology Review, 30, 727–744. https://doi.org/10.1007/s10648-018-9437-7

Cook

B. G.

Collins

L. W.

Cook

S. C.

Cook

(2020). Evidence-based reviews: How evidence-based practices are systematically identified. Learning Disabilities Research & Practice, 35(1), 6–13. https://doi.org/10.1111/ldrp.12213

Cook

B. G.

Lloyd

J. W.

Mellor

Nosek

B. A.

Therrien

W. J.

(2018). Promoting open science to increase the trustworthiness of evidence in special education. Exceptional Children, 85(1), 104–118. https://doi.org/10.1177/0014402918793138

10.

Cook

B. G.

Maggin

D. E.

Robertson

R. E.

(in press). Registered reports in special education: Introduction to the special series. Remedial and Special Education.

11.

Drevon

Fursa

S. R.

Malcolm

A. L.

(2017). Intercoder reliability and validity of WebPlotDigitizer in extracting graphed data. Behavior Modification, 41(2), 323–339. https://doi.org/10.1177/0145445516673998

12.

Dwan

Gamble

Williamson

P. R.

Kirkham

J. J.

(2013). Systematic review of the empirical evidence of study publication bias and outcome reporting bias—An updated review. PLOS ONE, 8(7), Article e66844. https://doi.org/10.1371/journal.pone.0066844

13.

Edgington

Onghena

(2007). Randomization tests. CRC Press. https://doi.org/10.1111/j.1751-5823.2007.00015_21.x

14.

Fanelli

(2012). Negative results are disappearing from most disciplines and countries. Scientometrics, 90(3), 891–904. https://doi.org/10.1007/s11192-011-0494-7

15.

Ferguson

C. J.

Heene

(2012). A vast graveyard of undead theories: Publication bias and psychological science’s aversion to the null. Perspectives on Psychological Science, 7(6), 555–561. https://doi.org/10.1177/1745691612459059

16.

Ferron

J. M.

Jones

P. K.

(2006). Tests for the visual analysis of response-guided multiple-baseline data. The Journal of Experimental Education, 75(1), 66–81. https://doi.org/10.3200/JEXE.75.1.66-81

17.

Ferron

J. M.

Levin

J. R.

(Eds.). (2014). Single-case permutation and randomization statistical tests: Present status, promising new developments. In Kratochwill

T. R.

Levin

J. R.

(Eds.), Single-case intervention research: Methodological and statistical advances (pp. 153–183). American Psychological Association.

18.

Fidler

Wilcox

(2018). Reproducibility of scientific results. In Zalta

E. N.

(Ed.), The Stanford encyclopedia of philosophy. https://plato.stanford.edu/archives/win2018/entries/scientific-reproducibility/

19.

Fleming

J. I.

Cook

B. G.

(in press). Open access in special education: A review of journal and publisher policies. Remedial and Special Education. https://doi.org.10.1177/0741932521996461

20.

Fraser

Parker

Nakagawa

Barnett

Fidler

(2018). Questionable research practices in ecology and evolution. PLOS ONE, 13(7), Article e0200303. https://doi.org/10.1371/journal.pone.0200303

21.

D. Y.

Hughey

J. J.

(2019). Releasing a preprint is associated with more attention and citations for the peer-reviewed article. eLife, 8, Article e52646. https://dx.doi.org/10.7554/eLife.52646

22.

Gage

N. A.

Cook

B. G.

Reichow

(2017). Publication bias in special education meta-analyses. Exceptional Children, 83(4), 428–445. https://doi.org/10.1177/0014402917691016

23.

Gehlbach

Robinson

C. D.

(2018). Mitigating illusory results through preregistration in education. Journal of Research on Educational Effectiveness, 11(2), 296–315. https://doi.org/10.1080/19345747.2017.1387950

24.

Giner-Sorolla

(2012). Science or art? How aesthetic standards grease the way through the publication bottleneck but undermine science. Perspectives on Psychological Science, 7(6), 562–571. https://doi.org/10.1177/1745691612457576

25.

John

L. K.

Loewenstein

Prelec

(2012). Measuring the prevalence of questionable research practices with incentives for truth telling. Psychological Science, 23(5), 524–532. https://doi.org/10.1177%2F0956797611430953

26.

Johnson

A. H.

Cook

B. G.

(2019). Preregistration in single-case design research. Exceptional Children, 86(1), 95–112. https://doi.org/10.1177/0014402919868529

27.

Johnston

J. M.

Pennypacker

H. S.

(1992). Strategies and tactics of human behavioral research (2nd ed.). Lawrence Erlbaum.

28.

Johnston

J. M.

Pennypacker

H. S.

(2009). Strategies and tactics of behavioral research (3rd ed.). Routledge.

29.

Kaplan

R. M.

Irvin

V. L.

(2015). Likelihood of null effects of large NHLBI clinical trials has increased over time. PLOS ONE, 10(8), Article e132382. https://dx.doi.org/10.1371/journal.pone.0132382

30.

Kazdin

A. E.

(2011). Single-case research designs: Methods for clinical and applied settings (2nd ed.). Oxford University Press.

31.

Kittelman

Gion

Horner

R. H.

Levin

J. R.

Kratochwill

T. R.

(2018). Establishing journalistic standards for the publication of negative results. Remedial and Special Education, 39(3), 171–176. https://doi.org/10.1177/0741932517745491

32.

Kratochwill

T. R.

Levin

J. R.

(Eds.). (2014). Enhancing the scientific credibility of single-case intervention research: Randomization to the rescue. In Kratochwill

T. R.

Levin

J. R.

(Eds.), Single-case intervention research: Methodological and statistical advances (pp. 53–89). American Psychological Association. https://doi.org/10.1037/14376-003.

33.

Kratochwill

T. R.

Levin

J. R.

Horner

R. H.

(2018). Negative results: Conceptual and methodological dimensions in single-case intervention research. Remedial and Special Education, 39(2), 67–76. https://doi.org/10.1177/0741932517741721

34.

Ledford

J. R.

Gast

D. L.

(2018). Single case research methodology: Applications in special education and behavioral sciences. Routledge.

35.

Levenstein

M. C.

Lyle

J. A.

(2018). Data: Sharing is caring. Advances in Methods and Practices in Psychological Science, 1(1), 95–103. https://doi.org/10.1177/2515245918758319

36.

Maggin

D. M.

Cook

B. G.

Cook

(2019). Making sense of single-case design effect sizes. Learning Disabilities Research & Practice, 34(3), 124–132. https://doi.org/10.1111/ldrp.12204

37.

Maggin

D. M.

O’Keeffe

B. V.

Johnson

A. H.

(2011). A quantitative synthesis of methodology in the meta-analysis of single-subject research for students with disabilities: 1985–2009. Exceptionality, 19(2), 109–135. https://doi.org/10.1080/09362835.2011.565725

38.

Maggin

D. M.

Robertson

R. E.

Cook

B. G.

(2020). Introduction to the special series on results-blind peer review. An experimental analysis on editorial recommendations and manuscript evaluation. Behavioral Disorders, 45(4), 195–206. https://doi.org/10.1177/0198742920936619

39.

Mahoney

M. J.

(1977). Publication prejudices: An experimental study of confirmatory bias in the peer review system. Cognitive Therapy and Research, 1(2), 161–175.

40.

Makel

M. C.

Hodges

Cook

B. G.

Plucker

(in press). Both questionable and open research practices are prevalent in education research. Educational Researcher.

41.

Makel

M. C.

Plucker

J. A.

(2014). Facts are more important than novelty: Replication in the education sciences. Educational Researcher, 43, 304–316. https://doi.org/10.3102/0013189X14545513

42.

Makel

M. C.

Plucker

J. A.

Freeman

Lombardi

Simonsen

Coyne

(2016). Replication of special education research: Necessary but far too rare. Remedial and Special Education, 37, 205–212. https://doi.org/10.1177%2F0741932516646083

43.

McKiernan

E. C.

Bourne

P. E.

Brown

C. T.

Buck

Kenall

Lin

. . . Spies

J. R.

(2016). Point of view: How open science helps researchers succeed. eLife, 5, Article e16800. https://doi.org/10.7554/eLife.16800.001

44.

Ninci

Vannest

K. J.

Willson

Zhang

(2015). Interrater agreement between visual analysts of single-case data: A meta-analysis. Behavior Modification, 39(4), 510–541. https://doi.org/10.1177/0145445515581327

45.

Nosek

B. A.

(2019, June 11). Strategy for culture change. https://www.cos.io/blog/strategy-for-culture-change

46.

Nosek

B. A.

Ebersole

C. R.

DeHaven

A. C.

Mellor

D. T.

(2018). The preregistration revolution. Proceedings of the National Academy of Sciences, 115(11), 2600–2606. https://doi.org/10.1073/pnas.1708274114

47.

Nosek

B. A.

Spies

J. R.

Motyl

(2012). Scientific utopia: II. Restructuring incentives and practices to promote truth over publishability. Perspectives on Psychological Science, 7(6), 615–631. https://doi.org/10.1177/1745691612459058

48.

Odom

S. L.

Barton

E. E.

Reichow

Swaminathan

Pustejovsky

J. E.

(2018). Between-case standardized effect size analysis of single case designs: Examination of the two methods. Research in Developmental Disabilities, 79, 88–96. https://doi.org/10.1016/j.ridd.2018.05.009

49.

Open Science Collaboration. (2015). Estimating the reproducibility of psychological science. Science, 349(6251), aac4716. https://doi.org/10.1126/science.aac4716

50.

Piwowar

H. A.

Priem

Larivière

Alperin

J. P.

Matthias

Norlander

. . . Haustein

(2018). The state of OA: A large-scale analysis of the prevalence and impact of Open Access articles. PeerJ, 6, Article e4375. https://doi.org/10.7717/peerj.4375

51.

Piwowar

H. A.

Vision

T. J.

(2013). Data reuse and the open data citation advantage. PeerJ, 1, Article e175. https://doi.org/10.7717/peerj.175

52.

Polanin

J. R.

Tanner-Smith

E. E.

Hennessy

E. A.

(2016). Estimating the difference between published and unpublished effect sizes: A meta-review. Review of Educational Research, 86(1), 207–236. https://doi.org/10.3102/0034654315582067

53.

Schäfer

Schwarz

(2019). The meaningfulness of effect sizes in psychological research: Differences between sub-disciplines and the impact of potential biases. Frontiers in Psychology, 10, Article 813. https://doi.org/10.3389/fpsyg.2019.00813

54.

Scheel

A. M.

Schijen

Lakens

(2020, February 5). An excess of positive results: Comparing the standard psychology literature with registered reports. https://doi.org/10.31234/osf.io/p6e9c

55.

Shadish

W. R.

Hedges

L. V.

Horner

R. H.

Odom

S. L.

(2015). The role of between-case effect size in conducting, interpreting, and summarizing single-case research (NCER 2015-002). National Center for Education Research. https://ies.ed.gov/ncser/pubs/2015002/

56.

Shadish

W. R.

Zelinsky

N. A. M.

Vevea

J. L.

Kratochwill

T. R.

(2016). A survey of publication practices of single-case design researchers when treatments have small or large effects. Journal of Applied Behavior Analysis, 49, 656–673. https://doi.org/10.1002/jaba.308

57.

Sham

Smith

(2014). Publication bias in studies of an applied behavior-analytic intervention: An initial analysis. Journal of Applied Behavior Analysis, 47(3), 663–678. https://doi.org/10.1002/jaba.146

58.

Simmons

J. P.

Nelson

L. D.

Simonsohn

(2011). False-positive psychology: Undisclosed flexibility in data collection and analysis allows presenting anything as significant. Psychological Science, 22, 1359–1366. https://doi.org/10.1177%2F0956797611417632

59.

Society for Research on Educational Effectiveness. (2020). Registry of efficacy and effectiveness studies. https://sreereg.icpsr.umich.edu/sreereg/

60.

Stracke

C. M.

(2020). Open science and radical solutions for diversity, equity and quality in research: A literature review of different research schools, philosophies and frameworks and their potential impact on science and education. In Burgos

(Ed.), Radical solutions and open science: An open approach to boost higher education (pp. 17–37). Springer.

61.

Tawney

J. W.

Gast

D. L.

(1984). Single-subject research in special education. Charles E. Merrill.

62.

Therrien

W. J.

Cook

B. G.

(2018). Null effects and publication bias in learning disabilities research. Learning Disabilities Research & Practice, 33, 5–10. https://doi/org/10.1111/ldrp.12163

63.

Tincani

Travers

(2018). Publishing single-case research design studies that do not demonstrate experimental control. Remedial and Special Education, 39, 118–128. https://doi.org/10.1177/0741932517697447

64.

Todman

J. B.

Dugard

(2001). Single-case and small-n experimental designs: A practical guide to randomization tests. Psychology Press.

65.

Vazire

(2018). Implications of the credibility revolution for productivity, creativity, and progress. Perspectives on Psychological Science, 13(4), 411–417. https://doi.org/10.1177/1745691617751884

66.

Vera-Badillo

F. E.

Riechelmann

R. P.

(2018). Identifying bias in clinical cancer research. In Araújo

Riechelmann

R. P.

(Eds.), Methods and biostatistics in oncology (pp. 227–240). Springer.

67.

Wolf

M. M.

(1978). Social validity: The case for subjective measurement or how applied behavior analysis is finding its heart. Journal of Applied Behavior Analysis, 11(2), 203–214. https://doi.org/10.1901/jaba.1978.11-203