Using Stochastic Approximation Techniques to Efficiently Construct Confidence Intervals for Heritability

Abstract

Estimation of heritability is an important task in genetics. The use of linear mixed models (LMMs) to determine narrow-sense single-nucleotide polymorphism (SNP)-heritability and related quantities has received much recent attention, owing to its ability to account for variants with small effect sizes. Typically, heritability estimation under LMMs uses the restricted maximum likelihood (REML) approach. The common way to report the uncertainty in REML estimation uses standard errors (SEs), which rely on asymptotic properties. However, these asymptotic assumptions are often violated because of the bounded parameter space, statistical dependencies, and limited sample size, leading to biased estimates and inflated or deflated confidence intervals (CIs). In addition, for larger data sets (e.g., tens of thousands of individuals), the construction of SEs itself may require considerable time, as it requires expensive matrix inversions and multiplications. Here, we present FIESTA (Fast confidence IntErvals using STochastic Approximation), a method for constructing accurate CIs. FIESTA is based on parametric bootstrap sampling and therefore avoids unjustified assumptions on the distribution of the heritability estimator. FIESTA uses stochastic approximation techniques, which accelerate the construction of CIs by several orders of magnitude compared with previous approaches as well as with the analytical approximation used by SEs. FIESTA builds accurate CIs rapidly, for example, requiring only several seconds for data sets of tens of thousands of individuals, making it a very fast solution to the problem of building accurate CIs for heritability for all data set sizes.

1. Introduction

Heritability, or the proportion of phenotypic variation that is explained by genetic variation, is an important population parameter in human genetics, in evolution, in plant and animal breeding, and more. Estimating the heritability has been traditionally performed using related individuals such as in twin studies or pedigree designs (Fisher, 1918; Silventoinen et al., 2003; Macgregor et al., 2006). More recently, genetic variation has been estimated using genetic marker information and, in particular, in genome-wide association studies (GWASs) (Manolio et al., 2008; Welter et al., 2014), which have identified thousands of genetic variants that are associated with dozens of common diseases. However, genome-wide significant associations were generally found to explain only a small proportion of the heritability of complex diseases.

To cope with this challenge, linear mixed model (LMM) approaches (Kang et al., 2008, 2010; Visscher et al., 2008; Lippert et al., 2011; Vattikuti et al., 2012; Zhou and Stephens, 2012; Wright et al., 2014; Kruijer et al., 2015) have been applied to estimate the heritability explained by common SNPs (the narrow-sense SNP-heritability, to which we refer as heritability, and denote by h²) from cohorts of unrelated individuals, such as those found in GWASs (Yang et al., 2010). Estimation under the LMM is usually performed using restricted maximum likelihood (REML) estimation, and is implemented in some widely used tools, like the Genome-wide Complex Trait Analysis (GCTA) software package (Yang et al., 2011). LMMs utilize all variants from a GWAS and not just the variants that are statistically significant and, therefore, are able to account for variants with small effect sizes.

As in any statistical analysis, the process of estimating the heritability suffers from statistical uncertainty. Typically, confidence intervals (CIs) are reported alongside point estimates to quantify this uncertainty. Usually, such CIs are constructed from standard errors (SEs), which assume that the estimators asymptotically follow a normal distribution. However, it has been shown (Lohr and Divan, 1997; Burch, 2007; Burch, 2011; Kruijer et al., 2015; Schweiger et al., 2016) that such CIs can be highly inaccurate. This is because estimators do not necessarily obey the conditions required for them to asymptotically follow the normal distribution. In addition, these CIs may spread beyond the natural boundaries of their parameters, for example, including negative values for heritability. As a result, these CIs are often inaccurate, difficult to interpret, or lead to erroneous conclusions.

To handle these issues, previous approaches have taken several directions. Nonstandard asymptotic theory for boundary and near-boundary maximum likelihood estimates has been developed (e.g., Chernoff, 1954; Moran, 1971; Self and Liang, 1987), and it has been suggested to replace the asymptotic normality assumption with the asymptotics developed for the nonstandard boundary case (Stern and Welsh, 2000). Visscher and Goddard (2015) derived an analytical expression for the asymptotic variance of the heritability estimator in a range of pedigree- and marker-based experimental designs. Unfortunately, the conditions required by these asymptotic results typically do not hold for genomic data sets, mainly due to the limited sample size, making either of these approximations ineffective (Schweiger et al., 2016). Other approaches include hierarchical bootstrapping schemes (e.g., Thai et al., 2013), extending the REML estimation method with Bayesian priors (e.g., Wolfinger and Kass, 2000; Chung et al., 2011), using alternative statistics as a basis for building CIs (Harville and Fenech, 1985; Burch and Iyer, 1997; Burch, 2007), or using the Bayesian posterior distribution of the heritability value (Furlotte et al., 2014).

An alternative approach is the parametric bootstrap test inversion technique, which constructs CIs through sampling phenotypes, performing heritability estimation on the sampled phenotypes, estimating the distribution of the heritability estimator, and using these estimates as a basis for CI construction (Carpenter and Bithell, 2000). The main advantage of using a parametric bootstrap approach is that it does not require any assumptions on the distribution of the heritability estimator or of Bayesian priors. As a naive implementation of this approach would be computationally prohibitive, the Accurate LMM-based heritability Bootstrap confidence Intervals (ALBI) method (Schweiger et al., 2016) utilizes a highly accurate approximation that allows an efficient construction of accurate CIs. However, ALBI still requires a preprocessing step. Newer data sets [e.g., the UK Biobank (Sudlow et al., 2015)] may contain tens or hundreds of thousands of individuals, for which this step may require hours of computation time. In addition, the need for a preprocessing step can be an obstacle in the adoption of a better CI construction method.

In this article, we introduce FIESTA (Fast confidence IntErvals using STochastic Approximation), which reduces the running time of CI construction by several orders of magnitude, for example, to mere seconds for data sets with tens of thousands of individuals, compared with hours or days. The key ingredient of our approach is a CI construction algorithm from the field of stochastic approximation (for a review, see Kushner and Yin, 2003). Originating in the work of Robbins and Monro (1951), stochastic approximation algorithms are recursive update rules that can be used, among other things, to solve optimization problems or function inversion problems when the collected data are subject to noise. It has been shown (Garthwaite and Buckland, 1992) that stochastic approximation can be used to construct CIs for general families of parametric distributions, given the ability to randomly sample from them, and this is the approach we employ here. We validate FIESTA on two real data sets, the Northern Finland Birth Cohort (NFBC) data set (Sabatti et al., 2009) and the Wellcome Trust Case Control Consortium 2 (WTCCC2) data set (Sawcer et al., 2011).

In addition to the significant speedup in time, FIESTA requires no preprocessing step beyond calculating the eigendecomposition of the kinship matrix, which is usually already performed as a part of heritability estimation. Finally, we show that FIESTA is significantly faster even than the analytical SE formulation. In summary, FIESTA offers an easy-to-use way to rapidly generate accurate CIs for REML heritability estimates. FIESTA is available as part of the ALBI toolkit at https://github.com/cozygene/albi

2. Results

2.1. A faster method for calculating CIs for heritability

CIs constructed from SEs, which are based on the assumption of a normal distribution for the heritability estimators, were previously shown to be inaccurate (Lohr and Divan, 1997; Burch, 2007; Burch, 2011; Kraemer, 2012; Kruijer et al., 2015; Schweiger et al., 2016). In this article, we introduce FIESTA, a method that generates accurate CIs for h², the true heritability value, given $\hat{h}^2$, the REML estimator for h² (see section 3). FIESTA uses the principle of test inversion to construct accurate CIs, using a stochastic approximation method that directly estimates the CI boundaries. We review FIESTA hereunder; for a full description, see section 3.

The methodology of test inversion can be described as follows. The estimator $\hat{h}^2$ is a function of the phenotype, which is a random variable whose distribution depends on h², assuming a fixed kinship matrix. Therefore, $\hat{h}^2$ is distributed differently for every value of h². For each true value of h², we select a subset of possible $\hat{h}^2$ values that has a sampling probability of $1 - \alpha$, where $\hat{h}^2$ is distributed under the assumption of a true heritability value h². We define this subset to be the acceptance region for that value of h². The CI accompanying an estimate $\hat{h}^2$ is the interval containing all values of h² whose acceptance region includes $\hat{h}^2$, namely, for which $\hat{h}^2$ does not imply the rejection of the null hypothesis that the true heritability value is h², with a significance level of $\alpha$.
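As a concrete illustration of test inversion, the following minimal Python sketch uses a toy one-parameter family rather than the LMM. The function `sample_estimate` is a hypothetical stand-in for drawing a phenotype and computing its REML estimate; here it simply returns a Gaussian draw clipped to [0, 1]. The equal-tailed acceptance regions, the noise level, and the grid are assumptions made for illustration only; the acceptance regions actually used by FIESTA are defined in section 3.

```python
import random


def sample_estimate(theta, rng, sd=0.1):
    """Toy stand-in for the estimator: a noisy value clipped to [0, 1] (assumption)."""
    return min(1.0, max(0.0, rng.gauss(theta, sd)))


def acceptance_region(theta, rng, alpha=0.05, n_samples=2000):
    """Equal-tailed region holding about (1 - alpha) of the estimates drawn at theta."""
    draws = sorted(sample_estimate(theta, rng) for _ in range(n_samples))
    low = draws[int((alpha / 2) * n_samples)]
    high = draws[int((1 - alpha / 2) * n_samples) - 1]
    return low, high


def invert_tests(estimate, grid, rng, alpha=0.05):
    """CI = range of grid values whose acceptance region contains the estimate."""
    accepted = []
    for theta in grid:
        low, high = acceptance_region(theta, rng, alpha)
        if low <= estimate <= high:
            accepted.append(theta)
    return min(accepted), max(accepted)


rng = random.Random(0)
grid = [i / 100 for i in range(101)]  # candidate true values 0, 0.01, ..., 1
ci_low, ci_high = invert_tests(0.5, grid, rng)
print(ci_low, ci_high)
```

For an estimate of 0.5, the interval comes out at roughly [0.30, 0.70] in this toy setting, at the cost of 2000 simulated estimates for every grid value. This brute-force cost is what a stochastic approximation approach aims to avoid.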

It remains to define suitable acceptance regions. In section 3, we review our scheme for defining acceptance regions. A basic ingredient of our construction of acceptance regions is inverting certain quantile functions of the distribution of $\hat{h}^2$, as a function of h². For example, finding the inverse of a value H² of the 95%-quantile function is finding a heritability value h² for which $\Pr_{h^2}(\hat{h}^2 \le H^2) = 0.95$, that is, the probability of getting a heritability estimate of H² or below is precisely 95%, when $\hat{h}^2$ is distributed with the heritability value h².

Instead of carrying out this task by a full parametric bootstrap estimate of the distribution of the estimator, we employ a technique from the field of stochastic approximation to achieve the same results at a fraction of the computational cost. The modified Robbins–Monro procedure (Joseph, 2004), described in section 3, is an iterative method that finds the inverse of the quantile function of a one-parameter distribution. It operates by iteratively (1) drawing a sample with a true heritability value equal to our current guess for the required inverse value, (2) comparing its estimated heritability to H², and (3) updating our current guess accordingly, by moving in the right direction, with a step size that decreases with the number of iterations. An additional speedup is obtained by using a fast method to calculate the derivative of the likelihood of the sample, and using the derivative to compare its estimated heritability to H², instead of performing the full likelihood maximization.
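The three-step recursion can be sketched in a few lines of Python. The sketch uses the classic Robbins–Monro update with step size c/(t + offset), not the modified procedure of Joseph (2004), and it replaces the LMM with a toy estimator (`sample_estimate`, a Gaussian draw clipped to [0, 1]). The constants `c`, `offset`, and `n_iter`, as well as the equal-tailed choice of quantiles, are illustrative assumptions. The toy estimate is also recomputed in full at every step, rather than using the derivative-based comparison.

```python
import random


def sample_estimate(theta, rng, sd=0.1):
    """Toy stand-in for the estimator: a noisy value clipped to [0, 1] (assumption)."""
    return min(1.0, max(0.0, rng.gauss(theta, sd)))


def robbins_monro_inverse(target, q, rng, n_iter=5000, c=2.0, offset=20):
    """Search for theta such that P_theta(estimate <= target) is close to q."""
    theta = target  # start the search at the observed value
    for t in range(1, n_iter + 1):
        # (1) draw one estimate at the current guess, (2) compare it to the target
        hit = 1.0 if sample_estimate(theta, rng) <= target else 0.0
        # (3) P_theta(estimate <= target) falls as theta rises, so an excess of
        # hits (hit - q > 0) means theta is too small and must move up.
        theta += c / (t + offset) * (hit - q)
        theta = min(1.0, max(0.0, theta))  # stay inside the parameter space
    return theta


rng = random.Random(0)
observed = 0.5
# Endpoints of an equal-tailed 95% interval for the observed value.
lower = robbins_monro_inverse(observed, 0.975, rng)
upper = robbins_monro_inverse(observed, 0.025, rng)
print(round(lower, 3), round(upper, 3))
```

In this toy family the exact endpoints are 0.5 ± 1.96 × 0.1, so the two runs should settle near 0.30 and 0.70. Each endpoint costs one simulated estimate per iteration, instead of a full bootstrap distribution per candidate value.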

We applied FIESTA to construct 95% CIs for the NFBC data set (Sabatti et al., 2009) and the WTCCC2 data set (Sawcer et al., 2011), as seen in Figure 1. We then verified the accuracy of these CIs, which can be measured as follows. Draw multiple phenotype vectors from the distribution assumed by the LMM with parameters that correspond to a true heritability value h². From each such phenotype, construct a CI for its estimated heritability with a confidence level of, for example, 95%. Then, check the percentage of times in which the CI covered h², as a function of h². If the constructed CIs are accurate, they should cover the true underlying h² 95% of the time. We measured the accuracy of FIESTA in this way, with CIs designed to have a coverage of 95%. The results are shown in Figure 2, demonstrating that FIESTA accurately achieves the desired confidence levels.
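The coverage check described above can be written as a short loop. The Python sketch below is a toy illustration and not the benchmark code: `sample_estimate` is a hypothetical Gaussian estimator clipped to [0, 1], and `normal_ci` is a simple SE-style interval used only as a placeholder for whichever CI method is being evaluated.

```python
import random


def sample_estimate(theta, rng, sd=0.1):
    """Toy stand-in for the estimator: a noisy value clipped to [0, 1] (assumption)."""
    return min(1.0, max(0.0, rng.gauss(theta, sd)))


def normal_ci(estimate, sd=0.1, z=1.96):
    """Placeholder CI method: estimate +/- z * SE, as in a normal approximation."""
    return estimate - z * sd, estimate + z * sd


def coverage(theta, build_ci, rng, n_rep=20000):
    """Fraction of simulated estimates whose CI contains the true value theta."""
    hits = 0
    for _ in range(n_rep):
        low, high = build_ci(sample_estimate(theta, rng))
        hits += low <= theta <= high
    return hits / n_rep


rng = random.Random(1)
coverage_by_theta = {theta: coverage(theta, normal_ci, rng) for theta in (0.0, 0.5, 1.0)}
for theta, value in coverage_by_theta.items():
    print(theta, round(value, 3))
```

In this toy setting the normal-theory interval attains about 95% coverage at a true value of 0.5 but about 97.5% at the boundary value 0, where the clipped estimator is no longer Gaussian. Plotting such values against the true parameter gives a coverage curve of the kind reported in Figure 2.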

FIG. 1.

The 95% CIs for the NFBC and WTCCC2 data sets. Accurate 95% CIs constructed for the NFBC data set (Sabatti et al., 2009) (left) and the WTCCC2 (Sawcer et al., 2011) data set (right) by FIESTA. For each $\hat{h}^2$ on a fine grid of 1000 values (x axis), we constructed a CI, whose boundaries are shown (y axis). For example, for $\hat{h}^2 = 0.5$ (denoted by a dashed line), the CI for NFBC is $[0.282, 0.705]$ (denoted by a full line). CI, confidence interval; FIESTA, Fast confidence IntErvals using STochastic Approximation; NFBC, Northern Finland Birth Cohort; WTCCC2, Wellcome Trust Case Control Consortium 2.

FIG. 2.

Accuracy of CIs for the NFBC and WTCCC2 data sets. The coverage probabilities of the FIESTA CIs. The coverage probabilities are shown for CIs designed to have coverage probabilities of 95%. The CIs achieve accurate coverage.

2.2. Benchmarks

We compared the speed of the stochastic approximation approach, implemented in FIESTA, with that of using the parametric bootstrap for estimating the distribution of the heritability estimator. The latter was tested both as implemented naively, using GCTA (Yang et al., 2011) and pylmm (Furlotte and Eskin, 2015), and as implemented in ALBI (Schweiger et al., 2016). Both the stochastic approximation and parametric bootstrap approaches require the calculation of the eigendecomposition of the kinship matrix. As this is already often a part of the heritability estimation algorithm, its calculation time is reported separately in the benchmarks. In section 4, we discuss how this step could be avoided altogether.

One difference between the two approaches is that the bootstrap approach performs a lengthy preprocessing step that estimates many distributions. Once these distributions are estimated, constructing a CI is very rapid. In contrast, the stochastic approximation approach does not perform a preprocessing step, but performs a nontrivial calculation per CI.

The construction of a single CI with FIESTA consists of calculating six to eight values using the modified Robbins–Monro procedure (see section 3). The first four values depend only on the kinship matrix, but not on the heritability estimate for which we construct a CI, so they need to be calculated only once per kinship matrix, and can then be shared between several CIs. Each modified Robbins–Monro run has a complexity of $O(nT)$, where n is the number of individuals in the sample and T is the number of iterations (on the order of 1000; see section 3). Therefore, in total, the time complexity to construct K CIs with FIESTA grows linearly with K, T, and n.

We also compared FIESTA with the performance of the analytical SE approach. Although often inaccurate, analytical SEs are the go-to method for many practitioners: first, their calculation is conceptually easy to understand, since a closed-form formula exists for the SEs (see section 5); second, using a closed-form expression is often perceived as faster than more involved algorithmic procedures. However, this is not the case for heritability estimation, as SEs are calculated using variants of the Fisher information matrix [e.g., the average information (AI) matrix, as in GCTA (Yang et al., 2011)], whose calculation requires matrix-by-vector multiplications, which are $O(n^2)$. In contrast, FIESTA is linear in n, giving it an advantage for larger data sets in particular.

We performed a benchmark to evaluate FIESTA, using the NFBC and WTCCC2 data sets. We estimated the distributions of $\hat{h}^2$ for $h^2 = 0, 0.01, \ldots, 1$, with GCTA (Yang et al., 2011) and pylmm (Furlotte and Eskin, 2015), both of which perform full estimation, using 1000 random bootstrap samples. For the same task, we also used ALBI (Schweiger et al., 2016), at a grid resolution of 0.001. The accuracy of CIs constructed according to the full estimation approach, as implemented in ALBI, is shown in section 5. As already explained, the time of construction of CIs given these distributions is negligible relative to the time required for the estimation of the distributions. We also constructed analytical SEs for both data sets using the AI method (see section 5). These times are reported in Table 1.

Table 1.

Benchmarks

Algorithm	Software	Time for NFBC	Time for WTCCC2
Eigendecomposition	GCTA	50 seconds	2 hours
Full bootstrap	GCTA	>30 days	>30 days
Full bootstrap	pylmm	3.8 hours	>8 days
Full bootstrap	ALBI	5.35 minutes	2.5 hours
Analytical SEs	n/a	∼3.1 seconds × no. of CIs, e.g.:	∼6.2 minutes × no. of CIs, e.g.:
		1 CI, ∼3 seconds	1 CI, ∼6 minutes
		5 CIs, ∼15 seconds	5 CIs, ∼31 minutes
		10 CIs, ∼31 seconds	10 CIs, ∼1 hour
		50 CIs, ∼2.6 minutes	50 CIs, ∼5 hours
Stochastic approximation	FIESTA	∼1.8 seconds + 0.6 seconds × no. of CIs, e.g.:	∼6 seconds + 2.8 seconds × no. of CIs, e.g.:
		1 CI, ∼3 seconds	1 CI, ∼9 seconds
		5 CIs, ∼6 seconds	5 CIs, ∼20 seconds
		10 CIs, ∼8 seconds	10 CIs, ∼34 seconds
		50 CIs, ∼33 seconds	50 CIs, ∼2.4 minutes

Running times of FIESTA, compared with previous methods (see section 2 for more details). Running times are reported for the NFBC (2520 individuals) and WTCCC2 (13,950 individuals) data sets.

ALBI, Accurate LMM-based heritability Bootstrap confidence Intervals; CI, confidence interval; FIESTA, Fast confidence IntErvals using STochastic Approximation; GCTA, Genome-wide Complex Trait Analysis; n/a, not applicable; NFBC, Northern Finland Birth Cohort; SEs, standard errors; WTCCC2, Wellcome Trust Case Control Consortium 2.

As a comparison, we used FIESTA to construct varying numbers of CIs, using 1000 iterations in the modified Robbins–Monro procedure (see section 3). In Table 1, it can be seen that FIESTA is significantly faster, particularly when few CIs are needed. We also note that FIESTA is currently implemented in the Python language, using the numpy package; a significant additional speedup can be obtained by migrating to a compiled language, for example, C++.

We then investigated the stability of CI construction and its dependency on the number of iterations. We ran FIESTA 100 times to construct CIs for the NFBC and WTCCC2 data sets using 200, 500, 1000, or 2000 iterations. We measured the variance in the constructed CI endpoints (Table 2). As expected, the variance decreases with the number of iterations. In addition, we measured the mean and variance of the coverage of CIs under a grid of true heritability values. Here, too, we observed that the variance of the coverage decreases with the number of iterations. We note that 500 iterations are sufficient for reasonably accurate CIs for these data sets, and that the coverage with even 200 iterations is only slightly biased downward.

Table 2.

Stability of Confidence Interval Construction

Data set	NFBC				WTCCC2
No. of iterations	200	500	1000	2000	200	500	1000	2000
CI lower point SE	0.0201	0.0132	0.0094	0.0067	0.0050	0.0032	0.0023	0.0016
CI upper point SE	0.0206	0.0133	0.0096	0.0070	0.0050	0.0031	0.0023	0.0016
Mean coverage (%)	94.20	94.71	94.87	94.95	94.720	95.217	95.323	95.373
SE of coverage (%)	0.45	0.34	0.30	0.28	0.781	0.575	0.486	0.442

Ninety-five percent CIs for the NFBC and WTCCC2 data sets were constructed 100 times, with either 200, 500, 1000, or 2000 iterations. CIs were constructed for $\hat{h}^2 = 0, 0.001, \ldots, 1$. To assess the variance of the construction process, the mean empirical SE of the lower and upper endpoints is reported, where the mean was calculated over all nonconstant endpoints, across all $\hat{h}^2$ values. In addition, the CI coverage for $h^2 = 0, 0.01, \ldots, 1$ was calculated as shown in Figure 2. The average mean and SE across all 100 runs, calculated across all h², is reported.

3. Methods

For clarity of presentation, we begin by defining the heritability under the LMM, and briefly reviewing stochastic approximation and its relevance to finding CIs. Finally, we introduce FIESTA, our improved method for faster construction of CIs for heritability.

3.1. The LMM and REML

We consider the following standard LMM (see Searle et al., 2009 for a detailed review). Let n be the number of individuals and m be the number of SNPs. Let y be an $n \times 1$ vector of phenotype measurements for each individual. Let X be an $n \times p$ matrix of p covariates (possibly including an intercept vector $\mathbf{1}_n$ as a first column, as well as other covariates such as gender and age). Let Z be the $n \times m$ standardized genotype matrix, that is, columns have zero mean and unit variance. Let $\beta$ be a $p \times 1$ vector of fixed effects, s an $m \times 1$ vector of random effects, and e an $n \times 1$ vector of errors. Then, $\mathbf{y} = \mathbf{X}\beta + \mathbf{Zs} + \mathbf{e}$. We assume s and e are statistically independent and are distributed normally as $\mathbf{s} \sim \mathcal{N}\left(\mathbf{0}_m, \frac{1}{m}\sigma_g^2 \mathbf{I}_m\right)$, $\mathbf{e} \sim \mathcal{N}\left(\mathbf{0}_n, \sigma_e^2 \mathbf{I}_n\right)$. The fixed effects $\beta$ and the coefficients $\sigma_g^2$ and $\sigma_e^2$ are the parameters of the model.

Define $\mathbf{K} = \frac{1}{m}\mathbf{Z}\mathbf{Z}^{\mathbf{T}}$. K is commonly called the kinship matrix, or the genetic relationship matrix. Under these conditions, it follows (Yang et al., 2010) that
$$\mathbf{y} \sim \mathcal{N}\left(\mathbf{X}\beta, \sigma_g^2\mathbf{K} + \sigma_e^2\mathbf{I}_n\right). \tag{1}$$

The narrow-sense heritability due to genotyped common SNPs is defined as the proportion of total variance explained by genetic factors (Visscher et al., 2008):
$$h^2 = \frac{\sigma_g^2}{\sigma_g^2 + \sigma_e^2}. \tag{2}$$

Defining $\sigma_p^2 = \sigma_g^2 + \sigma_e^2$, Equation 1 becomes $\mathbf{y} \sim \mathcal{N}\left(\mathbf{X}\beta, \sigma_p^2\mathbf{V}_{h^2}\right)$, where $\mathbf{V}_{h^2} = h^2\mathbf{K} + (1 - h^2)\mathbf{I}_n$.
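The quantities defined so far can be made concrete with a small simulation. The following is a minimal sketch, not the authors' implementation: the genotypes are simulated binomial counts with hypothetical allele frequencies, and the function names (`standardized_genotypes`, `kinship`, `V_h2`, `sample_y`) are illustrative assumptions.

```python
import numpy as np

def standardized_genotypes(n, m, rng):
    """Simulate an n x m genotype matrix and standardize its columns."""
    maf = rng.uniform(0.05, 0.5, size=m)             # hypothetical allele frequencies
    G = rng.binomial(2, maf, size=(n, m)).astype(float)
    G = G[:, G.std(axis=0) > 0]                      # drop monomorphic SNPs, if any
    return (G - G.mean(axis=0)) / G.std(axis=0)      # zero mean, unit variance per column

def kinship(Z):
    """K = (1/m) Z Z^T, with m the number of SNP columns of Z."""
    return Z @ Z.T / Z.shape[1]

def V_h2(K, h2):
    """V_{h^2} = h^2 K + (1 - h^2) I_n."""
    return h2 * K + (1.0 - h2) * np.eye(K.shape[0])

def sample_y(V, rng):
    """Draw y ~ N(0, V) through a Cholesky factor (V must be positive definite)."""
    L = np.linalg.cholesky(V)
    return L @ rng.standard_normal(V.shape[0])
```

Because each standardized column has unit variance, the trace of K equals n, so both K and $\mathbf{V}_{h^2}$ have an average diagonal entry of 1; this is a convenient sanity check on the standardization.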

The most common way of estimating h² is REML estimation. REML consists of maximizing the likelihood function associated with the projection of the phenotype onto the subspace orthogonal to that of the fixed effects of the model (Patterson and Thompson, 1971). In Schweiger et al. (2016), it is shown that the distribution of $\hat h^2$ depends only on h², and is invariant under changes to $\sigma_p^2$ and $\beta$.
We may, therefore, limit our study to the $\hat h^2$ estimator alone, in the special case of fixed $\sigma_p^2 = 1$ and $\beta = \mathbf{0}_p$, which substantially simplifies the problem; namely, we may focus on properties of the distribution $\mathcal{N}\left(\mathbf{0}_n, \mathbf{V}_{h^2}\right)$ instead of the more general $\mathcal{N}\left(\mathbf{X}\beta, \sigma_p^2\mathbf{V}_{h^2}\right)$.
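To illustrate what estimating h² looks like in this simplified setting, the sketch below assumes there are no covariates, so that the restricted likelihood coincides with the ordinary likelihood of $\mathcal{N}(\mathbf{0}_n, \sigma_p^2\mathbf{V}_{h^2})$. It works in the eigenbasis of K and maximizes the profile likelihood over a grid; the grid search, the rotation, and the names `estimate_h2` and `sample_rotated` are assumptions of this example rather than the procedure used in the paper.

```python
import numpy as np

def estimate_h2(y_rot, d, grid=None):
    """Grid-search likelihood estimate of h^2.

    y_rot is the phenotype rotated into the eigenbasis of K (y_rot = U^T y,
    with K = U diag(d) U^T) and d holds the nonnegative eigenvalues of K.
    """
    if grid is None:
        # Stop short of 1: a centered K always has a zero eigenvalue.
        grid = np.linspace(0.0, 0.999, 1000)
    n = d.shape[0]
    xi = grid[:, None] * d[None, :] + (1.0 - grid[:, None])   # eigenvalues of V_{h^2}
    sigma2 = (y_rot[None, :] ** 2 / xi).sum(axis=1) / n       # profiled-out sigma_p^2
    loglik = -0.5 * n * np.log(sigma2) - 0.5 * np.log(xi).sum(axis=1)
    return grid[np.argmax(loglik)]

def sample_rotated(d, h2, rng):
    """Draw U^T y for y ~ N(0, V_{h^2}); its coordinates are independent."""
    xi = h2 * d + (1.0 - h2)
    return np.sqrt(xi) * rng.standard_normal(d.shape[0])
```

Since the estimate is restricted to the grid, it can land exactly on the lower boundary 0, which is why the distribution of $\hat h^2$ can carry a point mass there.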

3.2. CIs for h²

We wish to build CIs with a coverage probability of $1 - \alpha$ (e.g., 95%). The full derivation is developed in Schweiger et al. (2016), and is reviewed in section 5; we cite the results here.

Let $c_\beta(h^2)$ be the $\beta$-th quantile function of $\hat h^2$, when the true heritability is h² (here $\beta$ denotes a probability level, not the fixed effects); that is, $\mathrm{Pr}_{h^2}(\hat h^2 \le c_\beta(h^2)) = \beta$.
Define s and t to be the values for which $\mathrm{Pr}_{h^2 = s}(\hat h^2 = 0) = \alpha/2$ and $\mathrm{Pr}_{h^2 = t}(\hat h^2 = 1) = \alpha/2$. In addition, let $s^* = c_{1 - \alpha}(0)$, $t^* = c_\alpha(1)$.
Then the lower and upper CI boundaries for an estimate H² are given, respectively, by
$$l_{H^2} = \begin{cases} 0 & \text{if } H^2 \le s^* \\ c_{1 - \alpha}^{-1}(H^2) & \text{if } c_{1 - \alpha}^{-1}(H^2) < s \\ s & \text{if } s \in [c_{1 - \alpha/2}^{-1}(H^2), c_{1 - \alpha}^{-1}(H^2)] \\ c_{1 - \alpha/2}^{-1}(H^2) & \text{if } s < c_{1 - \alpha/2}^{-1}(H^2) \end{cases} \tag{3}$$

and
$$u_{H^2} = \begin{cases} c_{\alpha/2}^{-1}(H^2) & \text{if } c_{\alpha/2}^{-1}(H^2) < t \\ t & \text{if } t \in [c_\alpha^{-1}(H^2), c_{\alpha/2}^{-1}(H^2)] \\ c_\alpha^{-1}(H^2) & \text{if } t < c_\alpha^{-1}(H^2) \\ 1 & \text{if } t^* \le H^2. \end{cases} \tag{4}$$
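The case analysis for the two boundaries can be read as a short decision procedure. The sketch below is illustrative only: `inv_c(beta, H2)` is a hypothetical callable standing in for $c_\beta^{-1}(H^2)$, and the values s, t, s*, and t* are assumed to be available. Two choices are assumptions of this example: the boundary cases (H² ≤ s* and t* ≤ H²) are tested first, and the upper boundary for small H² is taken to be $c_{\alpha/2}^{-1}(H^2)$, the usual test-inversion endpoint.

```python
def ci_bounds(H2, inv_c, s, t, s_star, t_star, alpha=0.05):
    """Lower and upper CI boundaries for an estimate H2 (piecewise rule)."""
    # Lower boundary.
    if H2 <= s_star:
        lower = 0.0
    else:
        inner = inv_c(1 - alpha, H2)        # c_{1-alpha}^{-1}(H2)
        outer = inv_c(1 - alpha / 2, H2)    # c_{1-alpha/2}^{-1}(H2), no larger than inner
        if inner < s:
            lower = inner
        elif outer <= s:
            lower = s                       # s lies between the two inverses
        else:
            lower = outer

    # Upper boundary (mirror image of the lower one).
    if H2 >= t_star:
        upper = 1.0
    else:
        outer = inv_c(alpha / 2, H2)        # c_{alpha/2}^{-1}(H2)
        inner = inv_c(alpha, H2)            # c_{alpha}^{-1}(H2), no larger than outer
        if outer < t:
            upper = outer
        elif inner <= t:
            upper = t                       # t lies between the two inverses
        else:
            upper = inner
    return lower, upper
```

With a toy inverse such as `inv_c = lambda beta, H2: H2 - 0.4 * (beta - 0.5)`, an estimate in the middle of the range falls in the last lower case and the first upper case, giving the familiar two-sided interval at level α/2 on each side.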

3.3. Using stochastic approximation to calculate CIs

Robbins–Monro

Stochastic approximation methods are a family of iterative stochastic optimization algorithms that attempt to find zeroes, inverses, or extrema of functions that cannot be computed directly, but only estimated through noisy observations. The classical Robbins–Monro algorithm presents a methodology for solving a function inversion problem, where the function is the expected value of a parametrized family of distributions. Namely, a function $g(\theta)$ is given, for which we want to find an inverse, that is, a value $\bar\theta$ for which $g(\bar\theta) = C$, for some constant C. However, the function g is not directly available to us, but rather we are only able to obtain noisy observations from it. The Robbins–Monro procedure is a modification of Newton's method, wherein the step sizes are instead an appropriately decreasing sequence.
Starting with an initial guess, $\theta_0$, at iteration n we obtain a noisy sample y_n from a distribution whose mean is $g(\theta_n)$, and update our estimate with
$$\theta_{n + 1} = \theta_n - \gamma_n \cdot (y_n - C), \tag{5}$$

where $\gamma_n = 1/n$. The Robbins–Monro procedure is shown to converge to the correct solution when (1) the random variables defining our sampling process at each $g(\theta)$ are uniformly bounded, (2) $g(\theta)$ is nondecreasing, and (3) $g^\prime(\bar\theta)$ exists and is positive (Robbins and Monro, 1951).
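As an illustration of the basic recursion with step size 1/n, the following minimal sketch inverts a toy function that is observed only through noise. The function `robbins_monro`, the callable `sample`, and the toy target are hypothetical names and choices made for this example; the iteration counter starts at 1 so that the step size is well defined.

```python
import numpy as np

def robbins_monro(sample, theta0, C, n_iter, rng):
    """Find theta with g(theta) = C using only noisy draws with mean g(theta).

    sample(theta, rng) must return one noisy observation whose mean is g(theta).
    """
    theta = theta0
    for n in range(1, n_iter + 1):
        y_n = sample(theta, rng)
        theta = theta - (1.0 / n) * (y_n - C)   # step size gamma_n = 1/n
    return theta

# Toy problem: g(theta) = 2 * theta + 1 observed with unit Gaussian noise.
# The solution of g(theta) = 3 is theta = 1.
def noisy_g(theta, rng):
    return 2.0 * theta + 1.0 + rng.standard_normal()
```

Calling `robbins_monro(noisy_g, 0.0, 3.0, 20000, np.random.default_rng(0))` should return a value close to 1, even though no single evaluation of g is ever observed without noise.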

Using Robbins–Monro to calculate CIs

Garthwaite and Buckland (1992) have used the Robbins–Monro process for finding the endpoints of CIs, as we now describe. We discuss the case of one-sided CIs, but the application to two-sided CIs is immediate.

Suppose that $[0, u_{\hat\theta})$ is the one-sided $1 - \alpha$ CI for $\theta$, when data y have been observed, with an estimate $\hat\theta = \hat\theta(\mathbf{y})$.
Then, the correct endpoint satisfies
$$\mathrm{Pr}_{\theta = u_{\hat\theta}}\left(\hat\theta \le \hat\theta(\mathbf{y})\right) = \alpha. \tag{6}$$

If we define $g(\theta) = \mathrm{Pr}_\theta\left(\hat\theta \ge \hat\theta(\mathbf{y})\right)$ (to make it nondecreasing), then finding $u_{\hat\theta}$ is equivalent to finding the inverse of g at $1 - \alpha$. However, under these settings, we do not have direct access to g.
Rather, we sample a binary random variable $Y_\theta$, indicating that a sample $\mathbf{y}_\theta$ randomly drawn from the distribution with parameter $\theta$ has an estimate $\hat\theta(\mathbf{y}_\theta)$ no smaller than $\hat\theta(\mathbf{y})$.
By definition, $\mathrm{Pr}_\theta(Y_\theta = 1) = \mathrm{Pr}_\theta\left(\hat\theta(\mathbf{y}_\theta) \ge \hat\theta(\mathbf{y})\right) = g(\theta)$, so the random sample $Y_\theta$ has a mean of $g(\theta)$. Effectively, this formulation allows us to use the Robbins–Monro procedure to invert the quantile function as a function of $\theta$.
Full asymptotic efficiency can be achieved by multiplying the step size $\gamma_n$ by some constant c.

In detail, denote by y_n a random sample from the random variable $Y_{\theta_n}$. The update rule is $\theta_{n + 1} = \theta_n - c\gamma_n \cdot (y_n - (1 - \alpha))$, or explicitly:
$$\theta_{n + 1} = \begin{cases} \theta_n - \frac{c\alpha}{n} & \text{if } y_n = 1 \\ \theta_n + \frac{c(1 - \alpha)}{n} & \text{if } y_n = 0 \end{cases}. \tag{7}$$
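The search for a CI endpoint can be sketched on a toy problem where the answer is known: the mean of k unit-variance normal observations, for which the exact upper endpoint is $\hat\theta(\mathbf{y}) + z_{1 - \alpha}/\sqrt{k}$. The code below is a sketch under stated assumptions, not the implementation of Garthwaite and Buckland (1992): the constant c is fixed by hand, the names `upper_endpoint`, `simulate`, and `estimator` are hypothetical, and the iteration counter starts at an assumed offset `n_start` (rather than 1) to avoid very large initial steps.

```python
import numpy as np

def upper_endpoint(estimator, simulate, theta_hat_obs, alpha, c, n_iter, rng,
                   n_start=50):
    """Stochastic search for u with Pr_u(theta_hat >= theta_hat_obs) = 1 - alpha."""
    theta = theta_hat_obs                        # start the search at the estimate
    for n in range(n_start, n_start + n_iter):
        y_sim = simulate(theta, rng)             # parametric bootstrap sample at theta
        y_n = 1 if estimator(y_sim) >= theta_hat_obs else 0
        if y_n == 1:
            theta -= c * alpha / n               # simulated estimate too large: step down
        else:
            theta += c * (1 - alpha) / n         # simulated estimate too small: step up
    return theta

# Toy model: k = 25 observations from N(theta, 1); the estimator is the sample mean.
def simulate_normal(theta, rng, k=25):
    return rng.normal(theta, 1.0, size=k)

def sample_mean(y):
    return float(np.mean(y))
```

For an observed mean of 0 and α = 0.05, the exact endpoint is 1.645/5 ≈ 0.329, and a call such as `upper_endpoint(sample_mean, simulate_normal, 0.0, 0.05, 2.0, 20000, np.random.default_rng(1))` should land close to that value. Each iteration needs only one simulated data set and one re-estimation, which is what makes the approach attractive when the estimator itself is costly.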

The procedure is shown to be fully asymptotically efficient if $c = 1/g^\prime(u_{\hat\theta})$. However, as neither g nor $u_{\hat\theta}$ is known in advance, c is estimated adaptively, using the current estimate $\theta_n$ in place of $u_{\hat\theta}$, and assuming a parametric form for g (Garthwaite and Buckland, 1992).

The modified Robbins–Monro procedure

As already mentioned, if the optimal step size constant is known, this procedure is fully asymptotically efficient. However, it was empirically shown to work poorly for extreme quantiles. Joseph (2004) suggested a modification of this procedure, which is tuned to obtain optimal convergence speed. It uses the following update form:
$$\theta_{n + 1} = \theta_n - a_n(y_n - C_n). \tag{8}$$

Joseph allows the use of a different target value, C_n, in each iteration, instead of the required constant, C. The step sizes a_n and target values C_n are derived explicitly in Joseph (2004) to be optimal under a Bayesian analysis framework. As in Garthwaite and Buckland (1992), the optimal step size also uses $g^\prime(u_{\hat\theta})$, which is unknown, and a suitable approximation scheme is used. The modified Robbins–Monro procedure achieves significantly faster convergence rates in the case of the estimation of extreme quantiles.

3.4. Using the modified Robbins–Monro procedure to obtain CIs for heritability

We now describe how to rapidly construct CIs for heritability. As already described, the first step is to find $s$, $t$, $s^*$, and $t^*$. To find s, we employ the modified Robbins–Monro procedure (Joseph, 2004), where the parameter of interest is $\theta := h^2$, the function is $g(\theta) := \Pr_{h^2 = \theta}(\hat h^2 = 0)$, and the inverse value we wish to find corresponds to $C = \alpha / 2$. We note that we chose g here to be nonincreasing for the sake of clarity of presentation; to conform with the Robbins–Monro formulation, we would need to redefine $g \to 1 - g$ and $C \to 1 - C$. At a single iteration of the modified Robbins–Monro procedure, we have an estimate $h_n^2$ for s, and we need to sample from a distribution whose mean is $\Pr_{h_n^2}(\hat h^2 = 0)$. To achieve that, we draw a sample from the distribution corresponding to $h_n^2$, $\mathcal{N}(\mathbf{0}_n, \mathbf{V}_{h_n^2})$, and check whether the maximum likelihood estimate for it is 0 (or higher). This procedure can be done quickly in $O(n)$, as we now describe, circumventing the need to perform a full likelihood maximization for the sample.
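For reference, the direct way of drawing such a sample can be sketched as follows. This is only an illustration of the naive draw that the faster procedure replaces, and it rests on an assumption not restated in this section: that the covariance takes the form V = h²K + (1 − h²)I with total variance 1. The function name and the toy kinship matrix are hypothetical.

```python
import numpy as np


def sample_phenotypes(eigvals, eigvecs, h2, n_samples, rng):
    """Draw rows y ~ N(0, h2*K + (1 - h2)*I), given K = U diag(d) U^T."""
    var = h2 * eigvals + (1.0 - h2)            # eigenvalues of the assumed covariance
    z = rng.standard_normal((n_samples, eigvals.size))
    return (z * np.sqrt(var)) @ eigvecs.T      # each row is U diag(sqrt(var)) z


K = np.array([[1.0, 0.5], [0.5, 1.0]])         # toy kinship matrix (illustrative only)
d, U = np.linalg.eigh(K)
rng = np.random.default_rng(0)
Y = sample_phenotypes(d, U, h2=0.5, n_samples=200_000, rng=rng)
print(np.cov(Y, rowvar=False))                 # should be close to 0.5*K + 0.5*I
```

Rotating each sample back by U costs a matrix-by-vector product, which is one reason to prefer a test that works directly in the eigenbasis.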

As already detailed, we make repeated use of the following procedure: (1) draw a random sample y from the distribution corresponding to a given heritability value h², $\mathcal{N}(\mathbf{0}_n, \mathbf{V}_{h^2})$, and (2) decide whether its heritability estimate, $\hat h^2(\mathbf{y})$, is larger than a given value, H². In Schweiger et al. (2016), it is shown that when $\mathbf{X} = \mathbf{1}_n$, these two steps may equivalently be performed by drawing a vector u of i.i.d. (independent and identically distributed) standard normal variables, $\mathbf{u} \sim \mathcal{N}(\mathbf{0}_n, \mathbf{I}_n)$, and checking whether
\begin{align*} \sum_{i = 1}^{n} \xi_i^{h^2, H^2} u_i^2 > 0, \tag{9} \end{align*}

for $i = 1, \ldots, n - 1$, and $\xi_n^{h^2, H^2} = 0$, with d_i being the eigenvalues of K. The sign of the expression in Equation (9) is equal to the sign of $\frac{\partial \ell_{REML}}{\partial h^2}(H^2)$, the derivative of $\ell_{REML}$ at the point H². Therefore, assuming the restricted likelihood function is well behaved, a positive derivative indicates that the REML heritability estimate is larger than H². Similar expressions are defined for a general X in Schweiger et al. (2016). Once the eigendecomposition of K is obtained, this procedure may be performed in time linear in n.
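The linear-time check in Equation (9) might be sketched as below. The weights are passed in as a precomputed array `xi`, which is an assumption of this sketch: they must be computed from h², H², and the eigenvalues d_i as defined in Schweiger et al. (2016). The function names and the placeholder weights in the usage lines are hypothetical and are not real weights.

```python
import numpy as np


def estimate_exceeds(xi, rng):
    """One O(n) draw of the indicator in Equation (9)."""
    u = rng.standard_normal(xi.size)           # u ~ N(0_n, I_n)
    return float(np.dot(xi, u ** 2)) > 0.0     # sign of the weighted sum of squares


def prob_estimate_exceeds(xi, n_draws, rng):
    """Monte Carlo estimate of the probability that the indicator is true."""
    u2 = rng.standard_normal((n_draws, xi.size)) ** 2
    return float(np.mean(u2 @ xi > 0.0))


rng = np.random.default_rng(0)
xi_placeholder = np.array([1.0, -1.0, 0.0])    # placeholder weights, not derived from data
print(prob_estimate_exceeds(xi_placeholder, 100_000, rng))
```

Within a stochastic approximation loop only `estimate_exceeds` is needed, because each iteration consumes a single binary response.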

Similarly, for finding $s^*$, we define the function $g(\theta) := \Pr_{h^2 = 0}(\hat h^2 \le \theta)$, for which we want to find the inverse of $C = 1 - \alpha$. The procedures for finding t and $t^*$ are similar.

The second step involves calculating the quantities $c_{\alpha/2}^{-1}(H^2)$, $c_{\alpha}^{-1}(H^2)$, $c_{1-\alpha}^{-1}(H^2)$, and $c_{1-\alpha/2}^{-1}(H^2)$ as required. This can again be done by the modified Robbins–Monro procedure, by setting $\theta := h^2$, $g(\theta) := \Pr_{h^2}(\hat h^2 \le H^2)$, and $C = \alpha/2$, $\alpha$, $1 - \alpha/2$, or $1 - \alpha$. To sample from a distribution whose mean is $\Pr_{h_n^2}(\hat h^2 \le H^2)$, we draw a sample from the distribution corresponding to $h_n^2$, and check whether the maximum likelihood estimate for it is more than H². Again, this procedure can be done quickly in $O(n)$. Once these quantities have been calculated, the CI can be calculated as detailed in Equations (3) and (4).

In practice, we used the following choices in the modified Robbins–Monro procedure: (1) we used $T = 1000$ iterations; (2) we set the prior standard deviation to $\tau = 0.4$, used to derive a_n and C_n through the Bayesian analysis (see Joseph, 2004); (3) we used the midpoint between the estimate and relevant boundary (0 or 1, depending on the quantile required) as a starting point; and (4) we adaptively changed the step size constant, following the suggestion of Garthwaite and Buckland (1992), by approximating the derivative with an expression proportional to the distance from $\hat\theta$:
\begin{align*} g'(u_\theta) \approx k (h_n^2 - H^2), \quad k = \frac{2}{z_\beta \cdot (2\pi)^{-1/2} \cdot e^{-z_\beta^2 / 2}}, \tag{11} \end{align*}

where z is the quantile function of the standard normal distribution and $\beta$ is the required quantile.
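As a small worked illustration, the constant k in Equation (11) can be evaluated numerically. The sketch below uses only the Python standard library; the function name is hypothetical, and it assumes that z refers to the standard normal quantile function, consistent with the density term in the formula.

```python
from math import exp, pi, sqrt
from statistics import NormalDist


def step_constant_k(beta):
    """Constant k of Equation (11) for the required quantile beta (beta != 0.5)."""
    z = NormalDist().inv_cdf(beta)                  # z_beta, standard normal quantile
    density = exp(-z * z / 2.0) / sqrt(2.0 * pi)    # (2*pi)^(-1/2) * exp(-z_beta^2 / 2)
    return 2.0 / (z * density)


print(step_constant_k(0.975))   # about 17.46
```

For quantiles below 0.5 the value of z is negative, so k changes sign, and at exactly 0.5 the expression is undefined because z is zero.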

3.5. The NFBC data set

We analyzed 5236 individuals from the NFBC data set, which consists of genotypes at 331,476 genotyped SNPs and 10 phenotypes (Sabatti et al., 2009). From each pair of individuals with relatedness of >0.025, one was reserved, resulting in 2520 individuals.

3.6. The WTCCC2 data set

We analyzed the WTCCC2 data set (Sawcer et al., 2011). In the multiple sclerosis and ulcerative colitis data sets, we used the same data processing described in Yang et al. (2014) to ensure consistency. In brief, U.K. controls and both U.K. and non-U.K. cases were used. SNPs were removed if they had >0.5% missing data, p < 0.01 for allele frequency difference between two control groups, p < 0.05 for deviation from Hardy–Weinberg equilibrium, p < 0.05 for differential missingness between cases and controls, or minor allele frequency <1%. In all analyses, SNPs within 5 M base pairs of the human leukocyte antigen region were excluded, because they have large effect sizes and highly unusual linkage disequilibrium patterns, which can bias or exaggerate the results. Finally, from each pair of individuals with relatedness of >0.025, one was reserved, resulting in 13,950 individuals.

4. Discussion

We have presented FIESTA, an efficient method for constructing accurate CIs using stochastic approximation. We have shown that FIESTA is very fast while achieving exact coverage, because it does not rely on any assumptions about the distribution of the estimator. FIESTA is also faster than the analytical approximation used by SEs. Owing to its speed, FIESTA can be easily used for data sets with tens or hundreds of thousands of individuals.

FIESTA requires the eigendecomposition of the kinship matrix, whose computational complexity is cubic in the number of individuals. Although this is often a preliminary step in heritability estimation, it may be computationally prohibitive for larger data sets. Recent methods for heritability estimation (see Loh et al., 2015) utilize conjugate gradient methods to avoid cubic steps altogether. One direction of extension for FIESTA is devising a procedure to calculate the derivative of the restricted likelihood function using conjugate gradient methods, which are quadratic, but do not require the eigendecomposition.

We note that the CIs constructed by FIESTA are estimated under a set of assumptions, particularly that the data are generated from the LMM as described in Section 3. Deviations from these assumptions could result in inaccurate CIs. Specifically, we observed that when the genotype matrix is of low rank (e.g., in the case where duplicates are introduced), the CIs calculated by FIESTA may be inaccurate. We, therefore, recommend removing duplicates and closely related individuals from the data before the application of FIESTA.

A common extension of the LMM is that of multiple variance components, where the genome is divided into distinct partitions (e.g., according to functional annotations or by chromosomes), and the relative genetic contribution of each partition is estimated instead. Another extension is that of multiple traits, where several phenotypes are estimated concurrently, allowing dependencies between them. In principle, the methodology behind FIESTA can be applied to the multiparametric case as well. However, there are several computational and conceptual hurdles that make this application highly nontrivial. First, a major difficulty arises from the fact that it is no longer necessarily possible to jointly diagonalize several kinship matrices. Thus, the computation of the derivatives of the logarithm of the restricted likelihood functions can no longer utilize the eigendecomposition. Second, the inversion of acceptance regions of multiple parameters results in confidence regions of more than one dimension. Although these have the required coverage probability, their shape may be difficult to report or to interpret easily (e.g., an ellipsoid). For example, hyper-rectangular confidence regions are often desirable (Sidak, 1967), as the marginal CI of each parameter has the same coverage probability as the confidence region. Therefore, multiparametric extensions remain a future direction of research.

5. Appendix

5.1. Variance of estimators

The main method of calculating the variance of the estimator, applied by all widely used linear mixed model methods, employs the Fisher information matrix, or a variant thereof, possibly applying the delta method in addition (Wasserman, L., 2013. All of Statistics: A Concise Course in Statistical Inference. Springer Science & Business Media). The observed information matrix $\mathcal{J}(\theta)$ of parameters $\theta$ is the negative of the Hessian of the log-likelihood of the data y. Namely, $\mathcal{J}(\theta)_{i,j} = -\frac{\partial^2}{\partial\theta_i \partial\theta_j} \ell(\theta; \mathbf{y})$. The Fisher information matrix $\mathcal{I}(\theta)$ is the expectation of the observed information matrix. Namely, $\mathcal{I}(\theta)_{i,j} = \mathrm{E}\left[ -\frac{\partial^2}{\partial\theta_i \partial\theta_j} \ell(\theta; \mathbf{y}) \right]$. Asymptotically, under certain regularity conditions, $\sqrt{n}(\widehat\theta - \theta) \xrightarrow{d} \mathcal{N}(\mathbf{0}, \mathcal{I}(\theta)^{-1})$. According to the delta method, the asymptotic distribution of a function $f(\theta)$ satisfies $\sqrt{n}(f(\widehat\theta) - f(\theta)) \xrightarrow{d} \mathcal{N}(\mathbf{0}, \nabla f(\theta)^T \mathcal{I}(\theta)^{-1} \nabla f(\theta))$.

GCTA uses the average information (Gilmour et al., 1995) matrix $\mathcal{A}$ to calculate the variance of $\sigma_g^2$ and $\sigma_e^2$, where $\mathcal{A} = \frac{1}{2}(\mathcal{I} + \mathcal{J})$. For the REML method, this is the matrix:
\begin{align*} \mathcal{A} = \frac{1}{2} \cdot \begin{pmatrix} \mathbf{y}^T \mathbf{QKQKQy} & \mathbf{y}^T \mathbf{QKQQy} \\ \mathbf{y}^T \mathbf{QQKQy} & \mathbf{y}^T \mathbf{QQQy} \end{pmatrix}, \tag{12} \end{align*}

where $\mathbf{Q} = \Sigma^{-1} - \Sigma^{-1} \mathbf{X} \left( \mathbf{X}^T \Sigma^{-1} \mathbf{X} \right)^{-1} \mathbf{X}^T \Sigma^{-1}$, with $\Sigma = \sigma_g^2 \mathbf{K} + \sigma_e^2 \mathbf{I}$. Then, the delta method is used to calculate the variance of $\hat h^2$:
\begin{align*} \mathrm{Var}(\hat h^2) = (\hat\sigma_g^2 + \hat\sigma_e^2)^{-4} \begin{pmatrix} \hat\sigma_e^2 & -\hat\sigma_g^2 \end{pmatrix} \mathcal{A}^{-1} \big\vert_{\sigma_g^2 = \hat\sigma_g^2, \sigma_e^2 = \hat\sigma_e^2} \begin{pmatrix} \hat\sigma_e^2 \\ -\hat\sigma_g^2 \end{pmatrix}. \tag{13} \end{align*}

Given the eigendecomposition of K, $\Sigma^{-1}$ (and thus Q) can be calculated in O(n) time (where n is the number of individuals), avoiding an expensive matrix inversion. Several other computational improvements may be carried out, depending on software implementation. However, we note that $O(n^2)$ matrix-by-vector multiplications cannot be avoided.
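As a concrete reading of Equation (13), the following Python sketch evaluates the delta-method variance once the 2×2 matrix $\mathcal{A}$ of Equation (12) has been computed at the REML estimates. The function name `heritability_variance` and the numeric inputs are illustrative assumptions, not part of any published implementation.

```python
import math


def heritability_variance(A, sigma_g2, sigma_e2):
    """Delta-method variance of h2 = sigma_g2 / (sigma_g2 + sigma_e2), Eq. (13).

    A is the 2x2 matrix of Eq. (12), given as nested lists and evaluated
    at the REML estimates (sigma_g2, sigma_e2).
    """
    (a, b), (c, d) = A
    det = a * d - b * c
    if det == 0:
        raise ValueError("information matrix is singular")
    # Closed-form inverse of a 2x2 matrix.
    inv = [[d / det, -b / det], [-c / det, a / det]]
    # Gradient of h2 with respect to (sigma_g2, sigma_e2), up to the
    # common factor (sigma_g2 + sigma_e2)^-2.
    g = (sigma_e2, -sigma_g2)
    quad = sum(g[i] * inv[i][j] * g[j] for i in range(2) for j in range(2))
    return quad / (sigma_g2 + sigma_e2) ** 4


# Made-up inputs, for illustration only.
A = [[4.0, 1.0], [1.0, 3.0]]
var_h2 = heritability_variance(A, sigma_g2=1.0, sigma_e2=3.0)
se_h2 = math.sqrt(var_h2)  # standard error of the heritability estimate
```

The sketch covers only the final delta-method step; forming $\mathcal{A}$ itself is where the matrix-by-vector products discussed above are spent.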

5.2. CIs for heritability

Our approach is based on the duality between hypothesis testing and CIs. As the distribution of $\hat h^2$ depends solely on h², we may assume without loss of generality that $\sigma_p^2 = 1$ and $\beta = \mathbf{0}_p$.
For a fixed value h², an acceptance region $A_{h^2}$ is defined as the subset of values $\hat h^2$ for which a test does not reject the null hypothesis that the phenotype vector is drawn from $\mathcal{N}\left(\mathbf{0}_n, \mathbf{V}_{h^2}\right)$.
The probability of the event $A_{h^2}$ under $\mathcal{N}\left(\mathbf{0}_n, \mathbf{V}_{h^2}\right)$ should be $\ge 1 - \alpha$. This region may be indirectly derived from an actual test (e.g., a generalized likelihood ratio test) or constructed explicitly.
The corresponding CI for an estimate $\hat h^2 = H^2$, denoted $C_{H^2}$, comprises the set of parameter values h² for which observing $\hat h^2 = H^2$ does not lead to rejection of the null hypothesis that the true heritability value is h²:

$$C_{H^2} = \left\{ h^2 \,\middle\vert\, H^2 \in A_{h^2} \right\}. \tag{14}$$
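Equation (14) can be read as a direct search: scan candidate values of h² and keep those whose acceptance region contains the observed estimate. The following Python sketch does this on a uniform grid. The acceptance regions in `toy_region` are hypothetical placeholders chosen only to make the example runnable, and the sketch assumes the accepted values form a single interval.

```python
def invert_acceptance_regions(H, acceptance_region, grid_size=101):
    """Approximate C_H = {h2 : H in A_h2} on a uniform grid over [0, 1].

    acceptance_region(h2) must return the closed interval (lo, hi) of A_h2.
    Returns (lower, upper) of the accepted grid points, or None if no grid
    point accepts H.
    """
    accepted = []
    for i in range(grid_size):
        h2 = i / (grid_size - 1)
        lo, hi = acceptance_region(h2)
        if lo <= H <= hi:
            accepted.append(h2)
    if not accepted:
        return None
    return accepted[0], accepted[-1]


def toy_region(h2):
    """Hypothetical acceptance region: a band of half-width 0.205 around h2."""
    return max(0.0, h2 - 0.205), min(1.0, h2 + 0.205)


ci = invert_acceptance_regions(0.5, toy_region)
```

A grid search of this kind is only as fine as the grid; it is meant to make the duality concrete, not to be an efficient way of finding the endpoints.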

Since the distribution of $\hat h^2$ is bounded and generally asymmetric, the choice of $A_{h^2}$ is not unique. It remains to determine $A_{h^2}$ for every h². We give here a general description of the construction; in Schweiger et al. (2016), we give a full description of the method, along with proofs.

Let $c_\beta(h^2)$ be the $\beta$-th quantile of $\hat h^2$ when the true heritability is h², viewed as a function of h² (here $\beta$ denotes a probability level, not the fixed-effects vector), that is, $\mathrm{Pr}_{h^2}(\hat h^2 \le c_\beta(h^2)) = \beta$.
A natural choice for $A_{h^2}$ would be taking the interval obtained by removing a $\alpha/2$ tail from both sides of the distribution of $\hat h^2$ given h², that is, choosing the two-sided $A_{h^2} = [c_{\alpha/2}(h^2), c_{1-\alpha/2}(h^2)]$.
If this were always possible, a succinct way of describing the $1 - \alpha$ CI, $C_{H^2} = [l_{H^2}, u_{H^2}]$, would be to use the fact that its endpoints are exactly those satisfying

$$c_{1-\alpha/2}(l_{H^2}) = H^2 \rightarrow l_{H^2} = c_{1-\alpha/2}^{-1}(H^2) \tag{15}$$

$$c_{\alpha/2}(u_{H^2}) = H^2 \rightarrow u_{H^2} = c_{\alpha/2}^{-1}(H^2), \tag{16}$$

as described in Figure 3.
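Equations (15) and (16) reduce the two-sided CI to inverting a monotone quantile function. The following Python sketch illustrates this with plain bisection, which is a simple substitute here and not the stochastic approximation scheme that FIESTA uses. The estimator distribution is a toy stand-in (a normal variable with assumed spread `SIGMA`, clipped to [0, 1]), not the REML distribution; all names are illustrative.

```python
from statistics import NormalDist

SIGMA = 0.1  # spread of the toy estimator (assumed, for illustration only)


def toy_quantile(beta, h2):
    """beta-quantile of the toy estimator clip(h2 + SIGMA * Z, 0, 1), Z ~ N(0, 1)."""
    q = h2 + SIGMA * NormalDist().inv_cdf(beta)
    return min(1.0, max(0.0, q))


def invert_quantile(beta, H, tol=1e-9):
    """Find h2 in [0, 1] with c_beta(h2) = H by bisection.

    Relies on c_beta being nondecreasing in h2.
    """
    lo, hi = 0.0, 1.0
    if not toy_quantile(beta, lo) <= H <= toy_quantile(beta, hi):
        raise ValueError("no h2 in [0, 1] attains this quantile value")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if toy_quantile(beta, mid) < H:
            lo = mid  # the solution lies to the right of mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


alpha = 0.05
H = 0.5
lower = invert_quantile(1 - alpha / 2, H)  # Eq. (15)
upper = invert_quantile(alpha / 2, H)      # Eq. (16)
```

In the toy model the quantile function is known in closed form; in practice each evaluation of $c_\beta$ would itself require sampling, which is what makes the inversion expensive.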

FIG. 3.

An illustration of acceptance regions and CIs. The diagonal lines are the $\alpha/2$ and $1 - \alpha/2$ quantile functions, shown for values in the midrange of heritability values. Several example acceptance regions are denoted as horizontal lines, in parameter regions where simple two-sided acceptance regions can be defined. The CI for $\hat h^2 = 0.5$ is shown as a vertical line.

However, since the distribution is of a mixed type with discontinuity points, the probability of the interval $[c_{\alpha/2}(h^2), c_{1-\alpha/2}(h^2)]$ may be greater than $(1 - \alpha/2) - \alpha/2 = 1 - \alpha$. For example, if $\mathrm{Pr}_{h^2}(\hat h^2 = 0) > \alpha/2$, then $c_{\alpha/2} = 0$, and $\mathrm{Pr}_{h^2}(\hat h^2 \in [0, c_{\alpha/2})) > \alpha/2$. In this case, we instead choose to take the one-sided interval $A_{h^2} = [0, c_{1-\alpha}(h^2)]$. Similarly, if $\mathrm{Pr}_{h^2}(\hat h^2 = 1) > \alpha/2$, then $c_{1-\alpha/2} = 1$ and $\mathrm{Pr}_{h^2}(\hat h^2 \in (c_{1-\alpha/2}, 1]) > \alpha/2$. In this case, we similarly choose the one-sided interval $[c_\alpha(h^2), 1]$ instead. We are, therefore, interested in the maximal value s for which $\mathrm{Pr}_s(\hat h^2 = 0) \ge \alpha/2$, and the minimal value t for which $\mathrm{Pr}_t(\hat h^2 = 1) \ge \alpha/2$, because in the range of values $h^2 \in [s, t]$, it holds that $\mathrm{Pr}_{h^2}(\hat h^2 \in [c_{\alpha/2}(h^2), c_{1-\alpha/2}(h^2)]) = 1 - \alpha$. Equivalently, assuming $\mathrm{Pr}_{h^2}(\hat h^2 = 0)$ (respectively, $\mathrm{Pr}_{h^2}(\hat h^2 = 1)$) is decreasing (respectively, increasing) in h², we may simply define s and t to be the values for which $\mathrm{Pr}_{h^2 = s}(\hat h^2 = 0) = \alpha/2$ and $\mathrm{Pr}_{h^2 = t}(\hat h^2 = 1) = \alpha/2$.
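Under the monotonicity assumption, s and t are roots of one-dimensional equations in h². The following Python sketch finds them by bisection for a toy stand-in model (a normal variable with assumed spread `SIGMA`, clipped to [0, 1]), in which the boundary probabilities are available in closed form. The model and all function names are illustrative assumptions; for the REML estimator these probabilities would have to be computed or estimated separately.

```python
from statistics import NormalDist

SIGMA = 0.1  # spread of the toy estimator (assumed, for illustration only)


def prob_at_zero(h2):
    """Pr(estimate == 0) in the toy model, i.e. Pr(h2 + SIGMA * Z <= 0)."""
    return NormalDist().cdf(-h2 / SIGMA)


def prob_at_one(h2):
    """Pr(estimate == 1) in the toy model, i.e. Pr(h2 + SIGMA * Z >= 1)."""
    return 1.0 - NormalDist().cdf((1.0 - h2) / SIGMA)


def solve_monotone(f, target, increasing, tol=1e-9):
    """Solve f(h2) = target on [0, 1] by bisection, for monotone f."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        below = f(mid) < target
        if below == increasing:
            lo = mid  # the root lies to the right of mid
        else:
            hi = mid  # the root lies to the left of mid
    return 0.5 * (lo + hi)


alpha = 0.05
s = solve_monotone(prob_at_zero, alpha / 2, increasing=False)
t = solve_monotone(prob_at_one, alpha / 2, increasing=True)
```

With these toy settings, s is close to 0.196 and t is close to 0.804, so the two-sided region applies over most of the parameter range.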

The following assumes s and t exist, and that $s < t$; for the general case, see Schweiger et al. (2016). We divide our construction into distinct cases, by setting

$$A_{h^2} = \begin{cases} [0, c_{1-\alpha}(h^2)] & \text{if } h^2 \in [0, s) \\ [c_{\alpha/2}(h^2), c_{1-\alpha/2}(h^2)] & \text{if } h^2 \in [s, t] \\ [c_\alpha(h^2), 1] & \text{if } h^2 \in (t, 1]. \end{cases}$$
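The three cases can be checked numerically in a setting where everything is known in closed form. The following Python sketch builds the acceptance regions for a toy stand-in model (a normal variable with assumed spread `SIGMA`, clipped to [0, 1]) and computes the probability of each region under its own h². The model, the closed forms for `S` and `T`, and all names are assumptions made for illustration; they do not describe the REML estimator.

```python
from statistics import NormalDist

SIGMA = 0.1   # spread of the toy estimator (assumed)
ALPHA = 0.05  # 1 - ALPHA is the target coverage
Z = NormalDist()


def quantile(beta, h2):
    """beta-quantile of the toy estimator clip(h2 + SIGMA * Z, 0, 1)."""
    return min(1.0, max(0.0, h2 + SIGMA * Z.inv_cdf(beta)))


# In the toy model, s and t have closed forms: the point masses at 0 and 1
# equal ALPHA / 2 exactly at these values of h2.
S = SIGMA * Z.inv_cdf(1 - ALPHA / 2)
T = 1.0 - S


def acceptance_region(h2):
    """Piecewise acceptance region A_h2, following the three cases above."""
    if h2 < S:
        return 0.0, quantile(1 - ALPHA, h2)
    if h2 <= T:
        return quantile(ALPHA / 2, h2), quantile(1 - ALPHA / 2, h2)
    return quantile(ALPHA, h2), 1.0


def coverage(h2):
    """Pr(estimate in A_h2) when the true value is h2, under the toy model."""
    lo, hi = acceptance_region(h2)
    below_lo = 0.0 if lo <= 0.0 else Z.cdf((lo - h2) / SIGMA)  # Pr(estimate < lo)
    up_to_hi = 1.0 if hi >= 1.0 else Z.cdf((hi - h2) / SIGMA)  # Pr(estimate <= hi)
    return up_to_hi - below_lo


coverages = [coverage(i / 200) for i in range(201)]
```

Every entry of `coverages` should be at least `1 - ALPHA` up to rounding, which is the defining property required of an acceptance region.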

The three region types are illustrated in Figure 4. Inverting the acceptance regions, we get the following definition for $C_{H^2} = [l_{H^2}, u_{H^2}]$. For the lower endpoint, we have

$$l_{H^2} = \begin{cases} 0 & \text{if } H^2 \in [0, c_{1-\alpha}(0)) \\ c_{1-\alpha}^{-1}(H^2) & \text{if } H^2 \in [c_{1-\alpha}(0), c_{1-\alpha}(s)) \\ s & \text{if } H^2 \in [c_{1-\alpha}(s), c_{1-\alpha/2}(s)) \\ c_{1-\alpha/2}^{-1}(H^2) & \text{if } H^2 \in [c_{1-\alpha/2}(s), 1] \end{cases}.$$

For the upper endpoint, we have

$$u_{H^2} = \begin{cases} c_{\alpha/2}^{-1}(H^2) & \text{if } H^2 \in [0, c_{\alpha/2}(t)) \\ t & \text{if } H^2 \in [c_{\alpha/2}(t), c_\alpha(t)) \\ c_\alpha^{-1}(H^2) & \text{if } H^2 \in [c_\alpha(t), c_\alpha(1)) \\ 1 & \text{if } H^2 \in [c_\alpha(1), 1] \end{cases}.$$

FIG. 4.

An illustration of the three acceptance region types. The diagonal lines, from left to right, indicate the quantile functions for $\alpha/2, \alpha, 1 - \alpha$, and $1 - \alpha/2$. The three region types are indicated as horizontal lines. The points s and t, where the region type changes, are indicated as horizontal dashed lines. See section 3 for a full description.

These conditions, phrased in terms of the quantile functions $c_\beta$, for example, $H^2 \le c_\alpha(t)$, can be equivalently written in terms of the value of inverse quantile functions of the estimate H², for example, $c_\alpha^{-1}(H^2) \le t$. In addition, let $s^* = c_{1-\alpha}(0)$, $t^* = c_\alpha(1)$.
Explicitly,

$$l_{H^2} = \begin{cases} 0 & \text{if } H^2 \le s^* \\ c_{1-\alpha}^{-1}(H^2) & \text{if } c_{1-\alpha}^{-1}(H^2) < s \\ s & \text{if } s \in [c_{1-\alpha/2}^{-1}(H^2), c_{1-\alpha}^{-1}(H^2)] \\ c_{1-\alpha/2}^{-1}(H^2) & \text{if } s < c_{1-\alpha/2}^{-1}(H^2) \end{cases}, \tag{17}$$

and

$$u_{H^2} = \begin{cases} c_{\alpha/2}^{-1}(H^2) & \text{if } c_{\alpha/2}^{-1}(H^2) < t \\ t & \text{if } t \in [c_\alpha^{-1}(H^2), c_{\alpha/2}^{-1}(H^2)] \\ c_\alpha^{-1}(H^2) & \text{if } t < c_\alpha^{-1}(H^2) \\ 1 & \text{if } t^* \le H^2 \end{cases}. \tag{18}$$

It follows from the preceding discussion that, to construct a CI for a heritability estimate $H^2$, we first need to find $s^*$ and $t^*$ as before, namely $s^* = c_{1-\alpha}(0)$ and $t^* = c_{\alpha}(1)$, and then we need only calculate $c_{\beta}^{-1}(H^2)$ for $\beta = \alpha/2$, $\alpha$, $1-\alpha$, and $1-\alpha/2$. Therefore, the entire construction relies on inverting certain quantile functions.
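
To make the inversion step concrete, the following is a minimal sketch of a Robbins–Monro search for $c_{\beta}^{-1}(H^2)$, that is, for the true heritability value at which the estimator falls at or below $H^2$ with probability $\beta$. It is an illustration under stated assumptions, not the FIESTA implementation: the function `simulate_estimate` is a hypothetical toy stand-in (a normal draw clipped to $[0, 1]$) for simulating a phenotype and re-estimating heritability by REML, and the step-size constants `c` and `n0`, the spread `SIGMA`, and all function names are illustrative choices. The handling of the boundary values $s^*$ and $t^*$ is omitted.

```python
import numpy as np
from statistics import NormalDist

SIGMA = 0.1  # assumed spread of the toy estimator (hypothetical value)


def simulate_estimate(h2, rng):
    """Toy stand-in for 'simulate a phenotype with true heritability h2 and
    re-estimate it': a normal draw clipped to the parameter space [0, 1]."""
    return min(max(rng.normal(h2, SIGMA), 0.0), 1.0)


def invert_quantile(H2, beta, n_iter=20000, c=2.0, n0=50, seed=0):
    """Robbins-Monro search for the h2 with P_h2(estimate <= H2) = beta,
    i.e. an approximation of c_beta^{-1}(H2)."""
    rng = np.random.default_rng(seed)
    h2 = H2  # start the search at the observed estimate
    for n in range(1, n_iter + 1):
        hit = simulate_estimate(h2, rng) <= H2
        # Hits occurring more often than beta mean h2 is too small: move up.
        h2 += c / (n0 + n) * (float(hit) - beta)
        h2 = min(max(h2, 0.0), 1.0)  # stay inside the parameter space
    return h2


def exact_inverse(H2, beta):
    """Closed-form answer for the toy model only (valid for 0 <= H2 < 1 and
    a result inside [0, 1]); used to check the stochastic search."""
    return H2 - SIGMA * NormalDist().inv_cdf(beta)


if __name__ == "__main__":
    alpha = 0.05
    for beta in (1 - alpha / 2, alpha / 2):
        approx = invert_quantile(0.5, beta)
        print(f"beta={beta:.3f}  search={approx:.3f}  exact={exact_inverse(0.5, beta):.3f}")
```

Each iteration needs only a single simulated estimate, which is what makes this kind of search cheap compared with estimating an entire quantile function on a grid. In the toy model the result can be compared with the closed form; in the real setting no such closed form is available, and the simulated estimate would come from the LMM.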

5.3. Accuracy of ALBI CIs

FIG. 5. Accuracy of CIs for the NFBC and WTCCC2 data sets. The coverage probabilities of the ALBI CIs are shown for CIs designed to have a coverage probability of 95%.

Acknowledgments

The authors thank David Steinberg. R.S. is supported by the Colton Family Foundation. This study was supported, in part, by a fellowship from the Edmond J. Safra Center for Bioinformatics at Tel Aviv University to R.S. The Northern Finland Birth Cohort data were obtained from dbGaP: phs000276.v2.p1. This study makes use of data generated by the Wellcome Trust Case Control Consortium. A full list of the investigators who contributed to the generation of the data is available in (). Funding for the project was provided by the Wellcome Trust under award 076113.

Author Disclosure Statement

Regev Schweiger is an employee of MyHeritage Ltd.
