Abstract
Background:
When conducting a randomized controlled trial, it is common to specify in advance the statistical analyses that will be used to analyze the data. Typically, these analyses will involve adjusting for small imbalances in baseline covariates. However, this poses a dilemma, as adjusting for too many covariates can hurt precision more than it helps, and it is often unclear which covariates are predictive of outcome prior to conducting the experiment.
Objectives:
This article aims to produce a covariate adjustment method that allows for automatic variable selection, so that practitioners need not commit to any specific set of covariates prior to seeing the data.
Results:
In this article, we propose the “leave-one-out potential outcomes” estimator. We leave out each observation and then impute that observation’s treatment and control potential outcomes using a prediction algorithm such as a random forest. In addition to allowing for automatic variable selection, this estimator is unbiased under the Neyman–Rubin model, generally performs at least as well as the unadjusted estimator, and the experimental randomization largely justifies the statistical assumptions made.
It is common when analyzing randomized controlled trials to adjust for small imbalances in baseline covariates in order to improve the precision of the treatment effect estimate. 1 To avoid the possibility of data snooping, and to ensure the validity of statistical inference, several authors have advocated that the statistical methods be fully specified in advance and reported in the trial protocol (e.g., Begg et al., 1996; Schulz, Altman, & Moher, 2010). 2 However, in cases where the analysis methods must be prespecified, it can be unclear which covariates should be used and if covariate adjustment will even be helpful. An overly aggressive adjustment that adjusts for too many covariates can hurt precision more than it helps (e.g., Freedman 2008; Miratrix, Sekhon, & Yu 2013).
A second concern when adjusting for baseline covariates is bias. Statisticians often allow for biased estimates in order to reduce the overall mean squared error, and many common methods for covariate adjustment do introduce a small amount of bias. However, in some cases, practitioners may find exact unbiasedness inherently desirable for various reasons. We discuss one such example in the section “Motivation.” Spiess (2018) presents another argument for unbiasedness when analyzing randomized experiments.
In this article, we propose a covariate adjustment method, the “leave-one-out potential outcomes” (LOOP) estimator, to simultaneously address both the concerns discussed above. The method is unbiased and model selection occurs in a “black box,” so any postselection inference remains valid. In particular, the method allows for automatic variable selection, so one need not know which covariates to use ahead of time. This method is also design based, meaning that the experimental randomization largely justifies the statistical assumptions, and it generally performs no worse than the simple difference-in-means estimator but can often substantially improve performance.
This article is organized as follows. The second section reviews the covariate adjustment literature and relates our method to other estimators. The third section discusses the randomized trial that motivates our work; in this example, both model selection and bias were concerns. The fourth section introduces notation and assumptions and discusses the simple difference-in-means and LOOP estimators. The fifth section relates the LOOP estimator to poststratification and the simple difference-in-means estimator. In the sixth section, we provide an estimate of the variance. The seventh section discusses how to modify the procedures to account for different experimental designs such as block designs. In the eighth section, we apply the LOOP estimator to examples using simulated data and real experimental data. The ninth section concludes.
Relation to Prior Literature
One of the virtues of randomized experiments is that the physical act of randomization largely justifies the statistical assumptions of the Neyman–Rubin model, a nonparametric model which was first introduced by Neyman (Splawa-Neyman, Dabrowska, & Speed, 1990; translation of the original 1923 paper) and further developed by Rubin (1974). Covariate adjustment is often done through linear regression; however, the standard ordinary least squares (OLS) model is quite different from the Neyman–Rubin model and randomization fails to justify the standard assumptions of OLS. In fact, the OLS estimate is biased under the Neyman–Rubin model; see Freedman (2008) and Lin (2013) for further discussion on OLS adjustments. Other types of regression adjustments can be used: Berk et al. (2013a) build on the work of Freedman (2008) and Lin (2013), while Bloniarz, Liu, Zhang, Sekhon, and Yu (2016) propose the use of lasso adjustments when the number of covariates is large, especially when the number of covariates exceeds the number of experimental units. In addition, regression adjustments can be used to analyze randomized experiments besides treatment-control studies (e.g., Lu 2016).
Various other covariate adjustment methods have been proposed, including several that are explicitly design based. For example, poststratification (Holt & Smith, 1979) is an adjustment made by stratifying on a pretreatment variable, estimating the treatment effect within each stratum, and taking the weighted average over all strata. Miratrix, Sekhon, and Yu (2013) explore the properties of the poststratified estimator under the Neyman–Rubin model. Koch, Amara, Davis, and Gillings (1982) and Koch, Tangen, Jung, and Amara (1998) propose a method that tests Fisher’s sharp null hypothesis (i.e., that all individual treatment effects are zero). They compute the covariance matrix of the treatment and covariates under the sharp null and note that a quadratic form involving this covariance matrix has an approximate χ2 distribution, which they use to obtain a p value. Rosenbaum (2002) introduces a similar covariate adjustment method that involves inverting hypothesis tests of the sharp null to obtain an estimate of the treatment effect. Rosenbaum’s method is quite flexible and allows for automatic variable selection; however, it assumes a constant treatment effect across units. In this article, we propose the LOOP estimator, which is also design based and allows for automatic variable selection. Unlike Rosenbaum, we do not assume a constant treatment effect.
Aronow and Middleton (2013) introduce another design-based estimator, which is related to the Horvitz–Thompson (1952) estimator. This estimator involves the estimation of a function of the covariates such that the function is predictive of the outcome, resulting in a reduction in variance. In addition, so long as this function is independent of the treatment assignment, the resulting estimate of the average treatment effect will be unbiased. Following a result from Williams (1961), Aronow and Middleton (2013) suggest sample splitting to ensure independence when estimating the function of the covariates. However, many of their calculations assume that the function is a constant fixed in advance and not estimated using a sample splitting procedure. In this article, we propose a special case of Aronow and Middleton’s estimator with a sample splitting approach. We successively leave out each observation and then impute that observation’s treatment and control potential outcomes using a prediction algorithm such as a random forest (Breiman, 2001).
Our work is similar to that of Wager, Du, Taylor, and Tibshirani (2016), who also propose a set of estimators that build on the work of Aronow and Middleton. Wager et al. propose the use of sample splitting and machine learning methods to impute potential outcomes. They also provide a variance estimate but work under a model in which they assume that the experimental units are drawn from a superpopulation and focus primarily on the population average treatment effect. In this article, we assume that the potential outcomes and the covariates are fixed and that the only source of randomness is in the treatment assignment. While the point estimate for the average treatment effect need not change under this model, variance estimation is different, and we derive an estimate for the variance of the LOOP estimator under this framework. Note that we focus specifically on the case where the sample splitting is a leave-one-out procedure. As we will show later, this allows for direct comparison to traditional estimators such as a simple difference-in-means and poststratification.
Our method is also related to the augmented inverse probability weighted (“AIPW”) estimator, which was proposed and developed by Robins, Rotnitzky, and Zhao (1994), Robins (2000), and Scharfstein, Rotnitzky, and Robins (1999) to estimate treatment effects in observational studies with missing data. Like the estimator proposed by Aronow and Middleton (2013), AIPW can be considered an extension of the Horvitz–Thompson estimator. It involves a difference in means (inversely weighted by the propensity score) and a regression adjustment based on the expectation of the outcome conditional on the covariates and treatment assignment. See also Chernozhukov et al. (2018) for a related estimator, which employs both sample splitting and machine learning methods to estimate the treatment effect in a high-dimensional setting.
Several other methods use an AIPW-like estimator specifically in randomized experiments (e.g., Spiess, 2018; Rothe, 2018; Tsiatis, Davidian, Zhang, & Lu, 2008). Tsiatis, Davidian, Zhang, and Lu (2008) separate the modeling of covariate-outcome relationships and the evaluation of the treatment effect in order to ensure valid inference after variable selection. Other methods have been proposed to ensure valid postselection inferences. For example, Moore and van der Laan (2009) use targeted maximum likelihood estimation to make covariate adjustments when the outcome is binary. This method involves modeling the probability that the outcome will be 0 or 1 conditional upon the covariates and the treatment assignment. One can use any procedure to model these conditional probabilities, including methods with automatic variable selection. Steingrimsson, Hanley, and Rosenblum (2017) give recommendations for the use of targeted maximum likelihood estimation in practice.
Motivation
Our work is motivated by a so-called pay for success program in the state of Illinois. In brief, a pay for success program is one in which a government contracts an outside organization to provide needed services but only pays the organization if the services are shown to be effective, typically in a randomized controlled experiment. In our example, the contracted organization is to provide special social services to at-risk youth, and one metric for success (among others) is a reduction in the number of days spent in juvenile detention. Success of the program will be evaluated according to the results of a 6-year experiment in which eligible youth are randomly selected to receive either the special services or ordinary care. The evaluation will be conducted by researchers in the School of Social Work at the University of Michigan, and we assisted the evaluators in planning the design and analysis of the experiment.
Several hundred youth are expected to take part in the program. Eligible participants are independently randomized to treatment or control, each with probability 1/2. More elaborate designs were considered but were too logistically challenging. A key difficulty is the fact that the participants enter into the experiment continually over time, making designs such as blocking infeasible.
Several baseline covariates will be available, at least some of which (e.g., age) are known to be highly predictive of outcome. The interested parties (the state, the outside organization providing the services, and the evaluators) agreed that some form of adjustment for these covariates would be desirable. However, there was initially no clear consensus on which adjustment procedure to use.
One concern was bias. Unbiasedness was felt to be desirable, perhaps more so in this example than in many others, because the state’s payment rate will be directly proportional to the estimated size of the treatment effect. Any bias in the estimator therefore effectively results in a bias in the payment. Indeed, one high ranking state official was opposed to any amount of bias, even if it might reduce the mean squared error. To paraphrase, the magnitude of the error was not so much a concern, as long as it was a fair bet. Other officials were open to using a biased estimator, so long as the bias was negligible. Critically, however, it was felt that the bias should still be quantified, and in the case of biased estimators, it was unclear how to produce a concrete number for the bias. For this reason as well, an unbiased estimator was preferred. Ultimately, it was decided to use poststratification.
A second concern was which covariates to adjust for. It was required to fully specify the analysis protocol in advance. Many potential covariates were available; however, adjusting for too many covariates could result in overadjustment, leading to inflated variance. Poststratification is especially sensitive to overadjustment, and considerable discussion was required to come to a consensus on both the number of covariates and which specific covariates to be used.
The challenges outlined above motivate our work. We wish to produce a method that provides automatic variable selection in order to eliminate the guesswork in deciding which covariates to use, while remaining exactly unbiased under the Neyman–Rubin model.
The LOOP Estimator
In this section, we introduce the LOOP estimator, which we can use to obtain an unbiased estimate of the average treatment effect while adjusting for covariates.
Model and Notation
Consider a randomized controlled experiment in which there are N participants, indexed by
for
and assume
Associated with each of the N participants are two fixed (nonrandom) potential outcomes, ti and ci. We assume that we observe ti if participant i is assigned to treatment and ci if participant i is assigned to control. That is, the observed outcome Yi for participant i is
We define the individual treatment effect
and the average treatment effect
which is our primary parameter of interest.
Lastly, some additional notation. Let
Note that when
and note that Ui has expectation 0.
Average and Individual Treatment Effects
It is not possible to observe any single participant’s treatment effect
This provides an unbiased estimate of the average treatment effect (conditional on
It is also possible to provide an unbiased estimate of an individual participant’s treatment effect
and thus
Although this is an unbiased estimator of
As an alternative estimator of
If
where in the last line, we use the fact that
but both
To summarize,
Finally, note that
Leave-One-Out Imputation
We now define the LOOP estimator of the average treatment effect
where
As an example, suppose we wish to estimate
Because we leave out the i th observation when we compute
It is worth noting that although we use the individual treatment effect estimates
The covariance term
Imputing the Potential Outcomes
In the subsequent sections, we propose several methods for imputing the potential outcomes in order to estimate mi. First, we impute the potential outcomes without making use of covariates, simply taking the mean of the observed outcomes in each treatment group. When we do this, we see that the LOOP estimator is exactly equal to the simple difference estimator. We also impute the potential outcomes using decision trees and discuss the connection between poststratification and the LOOP estimator. Finally, we propose the use of random forests, which may provide an improvement over poststratification and allow us to take advantage of automatic variable selection.
Imputing Potential Outcomes Ignoring Covariates: LOOP Equals the Simple Difference Estimator
In this section, we impute the potential outcomes without making use of covariates. We simply take the mean of the observed outcomes in the treatment group (excluding observation i) to estimate ti and the mean of the observed outcomes in the control group (excluding observation i) to estimate ci. That is, we estimate ti and ci as:
If the assignment probabilities are all equal, that is, if
Imputing Potential Outcomes Using Decision Trees: LOOP Equals Poststratification
In this section, we discuss the connection between the LOOP estimator and poststratification. Poststratification is a covariate adjustment method made by stratifying on pretreatment variables, estimating the treatment effect within each stratum by taking a simple difference in means, and then taking the weighted average over all strata (Holt & Smith, 1979). We argue that when we impute potential outcomes using a decision tree (see James, Witten, Hastie, & Tibshirani, 2013, for a summary of decision trees), the LOOP estimator is equivalent to poststratification.
Given a single decision tree (fixed in advance), we impute the potential outcomes as follows. First, we assign each observation i to a group; this is done by applying the decision tree to observation i‘s covariates. (This group may be viewed as a “leaf” or a “stratum.”) For each i, we then impute ti using the average observed outcome of the treated units within the same group (excluding observation i itself). We impute ci similarly. Thus, using the same argument given above in the section “Imputing Potential Outcomes Ignoring Covariates: LOOP Equals the Simple Difference Estimator,” it is simple to show that the average of the
Imputing Potential Outcomes Using Random Forests
In their analysis of poststratification, Miratrix et al. (2013) show that poststratification is nearly as efficient as blocking. However, one disadvantage of poststratification is that we must be parsimonious in the number of variables selected. If we include too many covariates, we end up partitioning our data too finely. We can overcome this limitation and also improve on the poststratified estimate using the LOOP estimator. One advantage of the LOOP estimator is that estimation of mi is very flexible. One can impute the potential outcomes using any method, so long as
One such method is the random forest algorithm, and random forests will be our method of choice for imputing the potential outcomes for the remainder of this article. For a description of tree-based methods, including random forests, see James, Witten, Hastie, and Tibshirani (2013). In order to impute the potential outcomes using random forests, we could first omit observation i and then create a random forest using the remaining
Because random forests are typically an improvement over individual decision trees, they allow us to obtain a more precise estimate of the average treatment effect
Variance Estimation
Aronow and Middleton (2013) give a conservative estimate of the variance of the Horvitz–Thompson estimator. They also provide an estimate for the variance of their own estimator; however, this estimate is derived under the assumption that the function of the covariates (i.e., our
Variance of
In Online Appendix B.1, we show that:
and that
where
Combining Equations 9 and 10 yields:
Limiting our attention to the special case that
where
and
Estimating the Variance
In Online Appendix B.2, we show that when
where
and
We estimate Mt and Mc by leave-one-out cross validation:
In Online Appendix B.3, we show that these estimates are unbiased. We plug Equations 14 and 15 into the bound (Equation 13) to obtain an estimate for the first term in Equation 12:
Next, we provide an unbiased estimator of
where
Estimating the Variance in Practice
In practice, we recommend making two modifications when estimating the variance. First, we recommend estimating Mt and Mc as
and
particularly when N is small. Note that these approximations require that
Second, we recommend omitting the second term in Equation 18 for computational efficiency. In many cases,
To see why we might expect
In this case, it can be shown (see Online Appendix C.1) that
The two modifications discussed in this section yield the following estimate for the variance of
If there is concern that in a particular application
Relationship Between
and the Sample Variance
We show in Online Appendix D that when we impute potential outcomes ignoring covariates (i.e., we calculate
and
where
with equality in Equation 22 when
Dependent Treatment Assignments
In the preceding sections, we assumed that the treatment assignments are independent of each other. In this section, we consider study designs in which the treatment assignments are not independent. For example, it is common for researchers to randomly assign a fixed number n of participants to treatment and leave the remaining
Because this procedure ensures that
Note that a similar procedure could be used in a block-randomized experiment, in which a fixed number of participants within each block are assigned to treatment, and the rest to control. In this case, when computing
Finally, we note that the independence of
Results
Below, we apply the LOOP estimator (with random forests) to both simulated and actual data. In our first simulation, we compare methods when the treatment effects are either homogeneous or heterogeneous and also demonstrate the bias of the point estimate and standard error for the OLS estimator. Next, we consider a simulation in which we examine the performance of the LOOP estimator when many of the covariates are not predictive. In our third simulation, we empirically demonstrate that the covariance terms discussed in the section “Variance Estimation” are negligible. Finally, we apply the LOOP estimator to the experiment conducted by Barrera-Osorio, Bertrand, Linden, and Perez-Calle (2011) on the effects of various cash transfer programs on educational outcomes in Colombia.
Simulation 1: Heterogeneous and Homogeneous Treatment Effects
Consider a randomized experiment in which there are N subjects and there is a single covariate, Z, with three possible values: 0, 1, and 2. For each value of Z, there are
Simulation 1: Potential Outcome Values.
For each of the four cases, we do the following. We generate a single set of treatment and control potential outcomes for the N subjects. We then create 100,000 random assignment vectors (T), where the treatment assignments are independent Bernoulli random variables with probability 1/2. For each of these 100,000 treatment assignment vectors, we compute the observed outcomes (Y) and estimate the average treatment effect and nominal standard error.
We compare the results using OLS, the LOOP estimator with random forests, and cross estimation with random forests (Wager, Du, Taylor, & Tibshirani, 2016). Note that for cross estimation, we use the code provided on GitHub; however, we remove the specified node size parameter. This modification improves performance in the context of this simulation. The bias is estimated as the mean point estimate minus the true average treatment effect. We also show the mean nominal standard error and estimate the true standard error using the standard deviation of the 100,000 point estimates. The nominal standard errors for the LOOP estimator are calculated using the method of the section “Estimating the Variance in Practice,” while the nominal standard errors for cross estimation are calculated using the estimator provided by Wager et al. (2016). For OLS, the point estimate is obtained by regressing Y on T and Z (without any interaction terms), while the nominal standard errors are calculated using the usual formulas (not robust standard errors). Z is treated as a continuous variable in the regression (not as a factor). We also compute the coverage probabilities at a confidence level of 95%. We show the results in Table 2. Finally, note that in Table 2, the nominal standard error refers to the mean nominal standard error over the 100,000 trials, while the true standard error refers to the estimate for the true standard error described above. We continue this practice throughout the remainder of this article.
Simulation 1 Results.
Note. LOOP = leave-one-out potential outcomes estimator; OLS = ordinary least squares.
We can see that LOOP and cross estimation are both unbiased, while the OLS estimate is biased. This bias is smaller for homogeneous treatment effects and when N is larger. We can also see that the true standard errors are similar for LOOP and cross estimation. However, in the case of heterogeneous treatment effects, the nominal standard error of cross estimation is quite conservative, even when N increases. The nominal standard error for LOOP is also conservative, but less so. In the case of cross estimation, this conservative bias is partially because Wager et al. assume that the experimental units are drawn from a superpopulation and must account for this additional uncertainty. For LOOP, the conservative bias is related to the inequality (Equation 13). For a discussion on a related inequality for the simple difference estimator, see Aronow et al. (2014).
Technical note
Cross estimation is slightly biased as implemented. This is due to the difference between the out-of-bag and the leave-one-out estimates of the potential outcomes. This issue can easily be fixed by reducing (by one) the size of the bootstrap sample used in the random forest when making out-of-bag predictions of the potential outcomes.
Simulation 2: Estimating the Treatment Effect for a Binary Response
In our second simulation, we consider a randomized experiment in which the response is either zero or one. Each of the N subjects has one of three sets of potential outcomes: (a) zero regardless of treatment assignment, (b) zero if control and one if treatment, and (c) one regardless of treatment assignment. Like in the previous simulation, the treatment assignments are independent Bernoulli random variables with probability 1/2. We also have one covariate (
We generate
Under this framework, we consider three sets of simulations. First, we assume that both the number of subjects (

Comparison of standard errors for Simulation 2. All standard errors are relative. That is, each value has been divided by the standard error for the simple difference estimator. We use solid lines to denote the the true standard error and dotted lines to denote the nominal standard error. Method used is shown by the color and width of the lines: (1) simple difference estimator, black lines, (2) ordinary least squares, thin gray lines, and (3) leave-one-out potential outcomes estimator, bold light gray lines.
We observe that while the performance of OLS declines as the number of noise covariates increases, the performance of LOOP remains constant relative to the simple difference estimator. Similarly, OLS performs worse than the simple difference estimator when the number of subjects is small, while the LOOP estimator outperforms the simple difference estimator for all sample sizes. Finally, it is important to note that covariate adjustment does not help when the covariates are not useful for predicting the outcomes. When
Technical note
We slightly modify the procedure described in the section “Simulation 1: Heterogeneous and Homogeneous Treatment Effects.” This is because we compare different simulations in each chart with varying parameter values, and we wish to avoid the variability associated with using a single set of potential outcomes for each simulation. For each of the 10,000 trials, we generate new covariates and potential outcomes and obtain a point estimate and a nominal standard error. We then calculate the nominal standard error as the average of the 10,000 nominal standard errors and the true standard error by taking the standard deviation of the 10,000 differences between each point estimate and the true
Simulation 3: Negligibility of
In the section “Estimating the Variance in Practice,” we argue that
for each of the first

Estimate of
Cash Transfer Programs and Enrollment
In their experiment in 2005, Barrera-Osorio et al. studied the effects of several conditional cash transfer programs on educational outcomes for students in Bogota, Colombia. They conducted experiments in two localities of Bogota, San Cristobal and Suba. For our analysis, we focus on the San Cristobal experiment. The San Cristobal experiment involved 10,907 students from Grades 6 to 11. These students were selected by lottery to be assigned to one of the two treatments or to control: 3,427 students were assigned to the “basic” treatment, 3,424 to the “savings” treatment, and the remaining 4,056 were assigned to control. In the basic treatment, each student received a bimonthly payment of roughly US$15 so long as the student attended school at least 80% of days that month. In the savings treatment, each student received a bimonthly payment of roughly US$10 so long as they met the attendance threshold. The remaining third was held in a bank account and paid to the students’ families when it was time to reenroll for the subsequent year. Barrera-Osorio et al. use the following covariates. For each student, they use age, age squared, gender, grade, years behind (or ahead) relative to their grade, and indicator for whether the student is over age for their grade. They also record the marital status, age, and years of education for the head of household, as well as several household characteristics: Whether or not the residence is rented or owned, income, total number of people, number of children, and an indicator for single parent household. Finally, they include household values for indices that relate to access to utilities, possession of durable goods, the physical infrastructure of the house, and poverty.
In their experiment, Barrera-Osorio et al. collected reenrollment status from administrative records. However, they were unable to obtain reenrollment status for approximately 10% of the observations. In our analysis, we consider both reenrollment status itself and whether the reenrollment status is missing as outcome variables. For each outcome variable, we estimate the average treatment effect for the basic treatment compared to the savings treatment, the basic treatment compared to control, and the savings treatment compared to control. We use the same covariates and restrict our analysis to students in Grades 6–10 as in Barrera-Osorio et al. (2011). We compare the standard errors using LOOP (with random forests), the simple difference estimator, OLS, and cross estimation (with random forests) in Table 3. See Online Appendix F.2 for the full results, including additional methods (LOOP with OLS and OLS with interaction terms) and the point estimates for the treatment effect.
Comparison of Standard Errors With Missing and Reenrollment Status as Outcomes.
Note. LOOP = leave-one-out potential outcomes estimator; OLS = ordinary least squares.
As we can see, OLS, cross estimation, and LOOP provide improvement over the simple difference estimator when missing status is the outcome variable of interest. We can also see that even in this traditional setting (i.e., a large sample size with relatively few covariates), LOOP performs at least as well as OLS. Finally, covariate adjustment does not help when reenrollment status is the outcome variable, as the covariates are less predictive of outcome.
Discussion
While methods of covariate adjustment can improve the precision of the estimate of the average treatment effect, they often require the researchers to perform variable selection. For example, when using poststratification, we must be careful not to use too many covariates, otherwise we partition the data set too finely. Overadjustment can result in poorer performance with linear regression as well: OLS performs poorly when the sample size is small relative to the number of covariates or as the number of noise covariates increases.
The LOOP estimator is an unbiased estimate of the average treatment effect and randomization justifies the assumptions made. One advantage of the LOOP estimator is that estimation of mi is very flexible. One can impute the potential outcomes using any method, so long as
In this article, we suggest the use of random forests to impute the potential outcomes, as they are computationally efficient relative to other methods, likely improve performance over a poststratified estimate, and allow for automatic variable selection. Because of the automatic variable selection, we can adjust for covariates without knowing ahead of time which covariates we wish to use, and any postselection inference is still valid. Finally, as with any covariate adjustment method, the LOOP estimator only improves precision over the unadjusted estimator if the covariates are predictive of outcome. However, we see that even when the covariates are not predictive of outcome, the LOOP estimator generally performs as well as the simple difference estimator.
Implementation in R
The LOOP estimator is implemented in R as the “loop.estimator” package (version 1.0.0.0) and is available on GitHub at https://github.com/wuje/LOOP.
Supplemental Material
Online_Appendixes - The LOOP Estimator: Adjusting for Covariates in Randomized Experiments
Online_Appendixes for The LOOP Estimator: Adjusting for Covariates in Randomized Experiments by Edward Wu, and Johann A. Gagnon-Bartsch in Evaluation Review
Footnotes
Acknowledgments
We would like to thank Yotam Shem-Tov, Luke Miratrix, and Ben Hansen for helpful comments and suggestions.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This material is based upon work supported by the National Science Foundation under Grant No. 1646108.
Supplemental Material
Supplemental material for this article is available online.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
