Abstract
In estimating a quantitative assay's lower limit of detection (LOD), standard deviation (SD) is the most common measure used to quantify the dispersion of the data, yet this LOD calculation method assumes that the low concentration samples follow a Gaussian distribution, which is not always true in reality. Here, a few LOD estimating methods that are based on different dispersion measures were investigated; each method's performance was evaluated across various distribution scenarios. Nine methods for LOD estimation that use different measures of data dispersion—SD, mean absolute deviation (MD), median absolute deviation, Gini's mean difference (GMD), percentiles (PCT), Algorithm A, S n , Q n , and inter-quartile range—were evaluated using both simulations and real-life datasets. LOD estimates calculated using different variability measures were compared to the true LOD value under different scenarios. A method was judged to be good if the method had a relatively stable formula, low bias, confidence interval that had shorter width, and achieved the desired level of frequency in covering the true value of LOD (coverage probability [CP]). First, the nine methods were screened for formula consistency across different distribution scenarios. Methods showing the greatest formula variation were removed from further analysis; the remaining methods were then examined and compared. The GMD-based method had a relatively stable formula and demonstrated the best overall performance with low bias, confidence interval of shorter width, and good CP across all situations. The PCT-based method only performed well if sample size was large. The MD-based method in general had larger bias than the GMD-based estimator. LOD estimates based on SD that assumes Gaussian distribution in all scenarios will often generate poor results. Instead, the GMD-based estimator, a method with a simple formula so is easy to use in practice, demonstrated robust performance across varying situations.
Introduction
Generally, different measures of detection capability are used to specify the increasing quantitative certainty within the low-end region of the measuring interval. These include the upper boundary on blank sample measurements (the limit of blank or LOB), yes/no detection of the measurand's presence (the limit of detection or LOD), and the minimal measurand amount that can be quantitated reliably with respect to defined accuracy goals (the limit of quantitation or LOQ). 2 –9 One or more of these estimates may be necessary to adequately characterize performance in the low-end region of the measuring interval. Based on the Clinical and Laboratory Standards Institute (CLSI) guideline EP17A, the definitions of LOB, LOD, and LOQ used here are as follows: LOB is the highest measurement result that is likely to be observed with a stated probability (1−type I error α) for blank samples; LOD is the lowest concentration of analyte that can be consistently detected in greater than or equal to a proportion (1−type II error β) of samples tested under routine clinical laboratory conditions; LOQ is the lowest amount of measurand in a material that can be quantitatively determined with stated accuracy under stated experimental conditions. 4 LOB and LOD are objective statistical constructs that are calculated solely on the inherent precision of the measurement procedure. In contrast, the LOQ reflects performance of the measurement procedure versus a pre-established accuracy goal. The focus of this research is LOD. See Figure 1 for graphical illustration of LOB and LOD.

An illustration of limit of blank (LOB) and limit of detection (LOD). For the blank sample (left curve), 95% of its measurement results (α=0.05) fall at or below the LOB. For a sample whose measurand content equals the LOD (right curve), 95% of its measurement results (β=0.05) exceed the LOB. The truncated blank sample distribution reflects that some instrument systems suppress measurement results below zero.
LOD is generally determined in one of two ways: (i) statistically estimating based on the observed low sample data; (ii) empirically testing serial dilutions of samples with a known concentration of the target substance in the analytical range of the expected detection limit and find the concentration that meets the pre-specified criteria. Using statistical approaches, the parametric formula for the true, not estimated, value of LOD is calculated as follows:
where LOB is the true LOB, c β is the multiplicative factor that is associated with the target acceptable error risk of false negatives (β denotes the tolerated false-negative rate, usually β=0.05), and DLOD is the true dispersion for data generated from samples with concentration of LOD.
Using experimental data from blank samples and low samples, there are different methods to estimate DLOD as well as the c β associated with it. For LOD estimation, current methods rely heavily on the assumption that the data of the low samples follow a Gaussian distribution, which is not true in some cases. 10,11 Limpert et al. argued, “Many measurements show a more or less skewed distribution. Skewed distributions are particularly common when mean values are low, variances large, and values cannot be negative, as is the case, for example, with species abundance, lengths of latent periods of infectious diseases, and distribution of mineral resources in the Earth's crust. Such skewed distributions often closely fit the log-normal distribution.” 12 In biology, it is quite common that the data from immunoassays are positive/right skewed, and usually they can be approximated by log-normal distributions. 13–14 Moreover, even though the data are not skewed, they may still not follow a Gaussian distribution. If the data do not follow a Gaussian distribution, the typical method that uses standard deviation (SD), along with c β determined based on a Gaussian model, may provide poor estimations of the true LOD. 10 Therefore, the interest of this research was to evaluate statistical methods for estimating LOD, using both computer simulations and real-life data, when the distributions of the low samples did not necessarily follow a Gaussian distribution.
Materials and Methods
Assumptions for the Statistical Methods
LOD calculation is a two-step process; it involves using blank samples to calculate the LOB and then using low samples to calculate DLOD . This investigation focused on the second step, and the LOB was assumed to be fixed (LOB is known or has been verified). Further, this research was interested in the situations where the variation of the low samples was approximately constant over a limited range of low concentrations, which is a reasonable assumption for many assays such as serial dilution and ligand-binding assays. 15,16 If the variation of the low samples is not constant, more complicated methods may be adopted to estimate LOD, 4 but that is beyond the scope of this research.
Under these assumptions, the LOD estimate can be expressed in a general formula as
since the variation for the low samples is assumed to be constant: DLOD =Dlow . Therefore, when the LOB is assumed to be fixed, LOD estimation is a matter of determining the dispersion measure Dlow and the multiplicative factor c β. A few statistical methods were investigated through theoretical reasoning, simulation, and real-life data applications. A method was deemed to be superior if (i) it had a simple mathematical formula for Dlow , so it would be easy to implement in practice; (ii) its multiplicative factor c β was easy to determine, preferably a constant; (iii) it demonstrated low bias in estimating the true LOD; (iv) its confidence interval was narrow (low uncertainty) and had high probability to cover the true value; and (v) it showed robust performance across different scenarios.
Statistical Methods for LOD Calculation
When α=0.05, the LOB is estimated by the 95th percentile of the distribution of the blank samples. If the data of the blank samples follow a Gaussian distribution,
where
The statistical methods considered in this research for LOD estimates are summarized in Table 1. More details of the methods are provided in the Supplementary Data (available online at
Measures of Dispersion (
Provided are the formulae for the measurement of the data dispersion
Computer Simulations to Evaluate LOD Calculation Methods
To evaluate and compare the performance of the LOD estimation methods, parametric simulation studies using R 2.10.1 were conducted using a variety of distribution types for the low samples. 21,22 Specifically, scenarios of Gaussian distributions, symmetric-but-not-Gaussian distributions (T-distributions with different degrees of freedom, e.g., T10 stands for T-distribution with 10 degrees of freedom), and asymmetrical distributions (log-normal distributions with different levels of skewness) were considered.
All of the methods have the form
When
Therefore, the first undertaking of this study was to investigate the behavior of the multiplicative factor c
β in the various methods. Based on simulation results across Gaussian, T20, T10, T5, and T3 distributions, those methods with variable c
β values were subsequently removed from further investigation. The simulation was conducted as follows: the type I and type II errors are assumed to be α=β=0.05. For a specific distribution, the LOB is assumed to be fixed, that is,
where LOD is the value of the true LOD and
The remaining methods are thus those whose multiplicative factor c
β is relatively constant; a best c
β value is determined as a fixed value for each method based on further investigation. With the formula fixed (i.e., a fixed value was determined for c
β), each LOD estimation method was then evaluated for performance based on bias, the width of its confidence interval, and how often this confidence interval covers the true LOD (coverage probability [CP]) through simulations on broader sampling scenarios. Specifically, Gaussian, T-distribution with different degrees of freedom, and log-normal distributions with different levels of skewness were considered. In addition, for each distribution, the performance was also investigated when the number of low samples varied. Ten thousand runs were carried out for each distribution scenario, and the LOD estimate was calculated for each method as described below. For a given method and a given distribution, let
Real-Life Data to Evaluate LOD Calculation Methods
The first data chosen for the real-life applications were supplied by CLSI EP17A and were measures of total mercury in blood (μg/L). 4 There were 117 measurements on a variety of blank samples, and the LOB was calculated by CLSI and claimed to be 0.239 μg/L based on the nonparametric percentile (PCT) method. The low sample data were comprised of 15 subjects with low level of mercury in the blood. For each subject, the laboratory had obtained a series of 20 samples over a period of at least 4 days. A subset of 4 subjects from the original 15 had the raw data provided in EP17A, and this was used in this analysis. The raw data of the four subjects are provided in Supplementary Table S1 and Supplementary Figure S1.
The second real-life data were from a fluorescence resonance energy transfer (FRET) assay to detect protein marker X activity in cellular supernatant of cells derived from a fresh patient tumor. Marker X is believed to be minimally expressed by normal cells and over expressed by tumor cells. Equal number of live cells was separately derived from a portion of the normal tissue and a portion of the tumor tissue dissected from a live tumor of a breast cancer patient. The two sources of cells were then put into equal volume of media and grown for 96 h. Replicates of equal volume of the cellular supernatant from the tumor cells were drawn and measured for the activity of marker X using the FRET activity assay; the same was done for the cellular supernatant from the cells from the normal tissue. Twenty of the tumor replicates (low sample) and 18 of the normal replicates (blank sample) provided valid assay results.
Results
Screening of Statistical Methods for the Behavior of
c
β
The medians of the 10,000 estimates of
c β Values for Each Limit of Detection Estimation Method Across Various Symmetrical Distributions
This table reports the c β values (β=0.05) as produced by each method in simulations for symmetric-but-not-Gaussian and Gaussian. Here the method for dispersion measurement is used to refer to the LOD estimation method that is based on it. The more variation in c β values the method has across different distributions, the less stable the method's formula and the more difficult it would be to implement.
Determining the Best
c
β
Values for the Remaining Methods
Besides the shape of the distribution, the c
β values also depended upon the number of low samples and the number of replicates for each sample. A grid-search approach was conducted to investigate the behavior of and to find the best value for c
β for MD-based and GMD-based methods based on different number of low samples (one and four), varying number of replicates for each low sample (total
For the GMD-based method, when there was only one low sample with N=20, the c β value was 1.49; when N=40, 60, or 80, the c β value was 1.47; when N=100 or 200, the c β value was 1.46. The c β values remained unchanged for the GMD-based method when there were four low samples. For the MD-based method, when there was only one low sample with N=20, the c β value was 2.24; when N=40 or 60, the c β value was 2.17; when N=80 or 100, the c β value was 2.15; when N=200, the c β value was 2.14. However, when there were four low samples (with the same total N), the c β values were quite different from the c β values under one low sample, which was in contrast to the same comparison with the GMD-based method. (This is the reason why the biases were different between the one-sample scenario and the four-sample scenarios for the MD-based method in the simulations.)
Table 3 reveals the c
β values for GMD- and MD-based methods in situations with one low sample or four low samples, each evaluated across the different distributions and various number of replicates (total N=20 to 200, in increments of 20). The c
β values for GMD-based method were all close to 1.46; therefore, 1.46 was chosen as the best c
β value for this method (fixed for all situations); that is, the LOD estimate based on the GMD method has the fixed formula:
c β Values for Mean Absolute Deviation and Gini's Mean Difference over Three Distributions, with Different Sample Numbers and Varying Numbers of Sample Replicates
The c β values (β=0.05) detailed here illustrate c β behavior for MD and GMD when the number of samples changes from 1 to 4 and the total number of replicates ranges from 20 to 200.
N, total number of replicates; G, Gaussian distribution.
The c
β values produced by the MD-based method demonstrated relatively larger variations than the variations in c
β values seen with the GMD-based method. As a compromise, 2.19 was chosen as the best c
β for MD-based method since it resulted in relatively low biases across different situations; that is, the MD-based method has the fixed formula,
Method Performance Comparison Based on Simulations
Simulations from different scenarios were then conducted to compare the four-screened LOD estimation methods that are based on SD, MD, PCT, and GMD, respectively. The symmetric distributions considered included Gaussian distribution and T-distributions with degrees of freedom (3, 5, 10, and 20). The asymmetric distributions considered included log-normal (0, 0.05) with a skewness value of 0.15, and log-normal (0, 0.25) with a highly skewness value of 0.8. Table 4 summarizes the simulation results of 10,000 runs for one low sample with sample replicates N=60 (more comprehensive simulation results are provided in the Supplementary Tables S4 and S5).
Method Comparison Across Different Scenarios
Comparison of method's bias, the width of its 95% CI (β=0.05), and the frequency the CIs covering the true LOD value (CP) by computer simulation across Gaussian, T-distributions, and log-normal distributions. Representative evaluation criteria for one low sample with 60 replicates are listed for each method.
CI, confidence interval; CP, coverage probability.
Symmetric-but-not-Gaussian distributions
The SD-based method performed poorly when the low sample distribution was not Gaussian; even though it was symmetric, it demonstrated relatively large bias and wider confidence interval (high uncertainty). MD- and GMD-based methods performed well with symmetric distributions. The only exception was the extremely heavy-tailed T3 distribution, in which case, the confidence interval for the GMD-based method was slightly wider. Overall, the confidence intervals for GMD- and MD-based methods had almost the same width. The GMD-based method demonstrated very low bias, unlike the MD-based method, which demonstrated high bias in some cases. When the low sample data followed a Gaussian distribution, the GMD-based method was comparable to the SD-based estimator (the optimal method for Gaussian distributions), whereas the MD-based estimator was very biased.
Asymmetric distributions
For the asymmetric distributions exemplified by a log-normal model, when the distribution was moderately skewed, GMD- and SD-based methods performed well. The two methods showed similar performance in terms of bias and uncertainty, while the MD-based method produced very large bias; the PCT-based method also displayed low bias, but it required a large sample size to reduce its uncertainty. When the distribution was much skewed, all four methods performed poorly; however, the GMD-based method was relatively better, its confidence interval was narrower, and it had a smaller bias than the MD-based method and the SD-based method. Overall, if the low sample data were approximately log-normal, GMD- and SD-based methods performed better than the other two methods when sample size was not large.
Sample size
The percentile method generated large uncertainty and large bias if the size was small. When the sample size was large, the percentile method performed well. The uncertainty of the GMD-based and the MD-based methods was almost the same. The MD-based method was heavily affected by the number of low samples (in these simulations, one low sample and four low samples were hypothesized), yet the GMD-based method showed little variation with respect to the number of low samples. The GMD-based method yielded good coverage probability (94%–95%), as opposed to the MD-based method, which had relatively lower coverage probability. This difference existed even when the sample size was large; this was because the GMD-based method had less bias than the MD-based method.
Real-Life Data Applications
For the analysis of the mercury level in blood, the LOB was reported as 0.239 in EP17A, which was considered fixed in this analysis. The raw data of the four subjects were used as the four low samples (Supplementary Table S1 and Supplementary Fig. S1). The Cochran test showed no significant differences in variation among the four subjects (P value=0.1459),
23
so the variation was assumed to be constant in the low sample data. The Shapiro–Wilk's test revealed that the four subjects' data were of approximately Gaussian distribution (P value=0.06329, 0.06277, 0.1608, and 0.2391 for the data of subjects 1, 2, 3, and 4, respectively)
24
; therefore, the SD-based method was justified. Table 5 reports the estimated LOD values based on the four methods. For example, based on the 80 observations in the four subjects, the pool SD was
Comparison of the Estimated Limit of Detection Values in the Real-Life Datasets
Assuming the LOB values are fixed for the both datasets, the table lists the LOD estimates (β=0.05) based on the four methods. For data 1, assuming the low sample data follow Gaussian distributions, the true LOD value is 0.4165; for data 2, assuming the low sample data follow a log-normal distribution, the true LOD value is 7,630.53.
LOB, limit of blank.
For the GMD-based method, the average GMD value from the four subjects was 0.12222, so the estimate of LOD was
The LOD estimates based on MD and PCT are 0.4258 and 0.4384, respectively. Since the data were approximately Gaussian, the SD-based method was expected to provide a good estimate of LOD. The estimate produced by the GMD-based method was very similar to that produced by the SD-based method. The estimates based on the other two methods were slightly higher.
The fitted log-normal distribution for the normal sample data had mean and SD of μ=7.1788, σ=0.9657; the fitted log-normal distribution for the tumor sample data had mean and SD of μ=7.886, σ=0.4473. Shapiro–Wilk's normality test was separately performed on the logarithmic-transformed data from the blank sample (normal tissue) and the tumor tissue, and the P values were 0.8558 and 0.0918, respectively. The LOB calculated from the normal sample was LOB=6,244.915, which was then assumed to be fixed. Assuming low sample data indeed follow a log-normal distribution, the true LOD should be calculated as LOB plus the distance between 5th percentile and 50th percentile of the true distribution LogNormal(7.886, 0.4473). This resulted in true LOD=7,630.53. Table 5 shows that the SD-, MD-, and GMD-based methods are all positively biased; this is because the large value of σ=0.4473 in the log-normal distribution makes the log-normal curve highly tailed and asymmetric (this agrees with the simulation results for highly skewed log-normal distribution). Relatively speaking, the GMD-based method is better than the SD-based method. The PCT-based method generated an LOD estimate that is close to the true value. Note that the true LOD is unknown. The P value of 0.0918 for the Shapiro–Wilk's normality test on the tumor data suggested that the assumed true LOD might be somewhat different from 7,630.53.
Discussion
This study assessed a few statistical methods for LOD estimation across various scenarios. After initial screening based on the stability of the c β value, four methods (based on SD, MD, GMD, and PCT) remained the focus of this investigation. Among the four methods, the GMD-based method demonstrated good performance across a range of scenarios (distribution shapes, numbers of low samples, and total sample sizes). Despite each method being dependent on a variety of factors, the GMD-based method showed consistently good performance with a relatively stable formula (invariant c β values) across all scenarios, which is an important consideration for a method's strength due to its easy implementation. The investigation in this research assumed that β=0.05, and the c β values are different for different choices of β levels. For the GMD-based method, the c β values for β=(0.01, 0.025, 0.05, 0.1) are c β=(2.15, 1.76, 1.46, 1.12), respectively.
The major advantage of GMD as a measure of data dispersion is that it does not depend on some measure of location, and is therefore superior to those location-dependent methods when the distributions are asymmetric. GMD attaches the largest weight to the section closest to the median, and then, the weights decline symmetrically the farther the section is from the median. 19 Therefore, it is still affected by outliers, though not as much as other methods. The MD-based method was generally superior to the SD-based method for non-Gaussian distributions, but both MD and SD as measures of data dispersion depend on the location of the data and are adversely affected by outliers. Still, when the low-level sample was assumed to be symmetric but not necessarily Gaussian, the MD-based LOD estimator performed well. When the distribution changed from Gaussian distribution to extremely heavy-tailed and the sample size changed, the change in c β value was moderate for this method. The MD-based estimator demonstrated relatively lower uncertainty (narrower confidence interval) than the PCT-based method when sample size was small, yet MD-based estimator always exhibited larger bias than the GMD-based estimator. In comparison, the PCT-based method was simple, easy to calculate, and was not impacted by the shape of the distribution; however, the PCT-based estimate had higher uncertainty (wider confidence interval), so it requires large sample size to achieve a reliable estimate.
In this study, LOB was considered to be a fixed value so that the work could focus on LOD. LOB may be considered as a random variable in future research studies. Linnet and Kondratovich's 2004 publication considered the random blank samples when LOD was discussed. 10 In that case, the heteroscedasticity between the blank sample and the low concentration sample was an important factor for consideration. 10 Another factor that may need to be revisited in the future is the constant variation in the lower range of concentration. In this study, a constant variation was assumed to be true, and in most real-life assays, this is an appropriate assumption. However, when variation in low sample concentrations cannot be assumed to be constant, some other approaches to estimate LOD need to be proposed.
In general, the GMD-based method produced good estimations of LOD and maintained a relatively invariant multiplicative factor throughout the scenarios examined here, making it an effective and easy method to use in practice. The MD-based method yielded better estimations of LOD than the SD-based method did if the distributions were symmetric but not necessarily Gaussian. It is not surprising that the PCT method has good performance. By definition, LOD is such that 5% of the data fall to the left of LOB. The PCT-based method identifies the 5% quantile and the 50% quantile, and then essentially the 50% is claimed as the LOD. Therefore, in theory, it is guaranteed that the PCT-based method is the best. However, to have reliable estimates of the 5% and the 50% quantiles, a large sample size is required—in our simulations, large sample size means >200, which is usually very unrealistic to assay developers.
The major intention of this research is to find an alternative method to the current standard method—the SD-based method. It is clear that the GMD method is superior to the SD-based method in non-Gaussian distributions (symmetric or asymmetric), and is comparable to the SD-based method when the distribution is Gaussian. In conclusion, we recommend that GMD should be used as the dispersion measure if the data distribution is relatively symmetric. If the data are highly skewed, get a large sample size if possible and use the PCT-based method to estimate LOD; if sample size has to be small, GMD may be preferable but may yield a biased LOD estimate.
Footnotes
Acknowledgment
Rebecca J. Palmer, PhD, assisted in the preparation of this article.
Disclosure Statement
None.
