Abstract
The network scale-up method enables researchers to estimate the sizes of hidden populations, such as drug injectors and sex workers, using sampled social network data. The basic scale-up estimator offers advantages over other size estimation techniques, but it depends on problematic modeling assumptions. The authors propose a new generalized scale-up estimator that can be used in settings with nonrandom social mixing and imperfect awareness about membership in the hidden population. In addition, the new estimator can be used when data are collected via complex sample designs and from incomplete sampling frames. However, the generalized scale-up estimator also requires data from two samples: one from the frame population and one from the hidden population. In some situations these data from the hidden population can be collected by adding a small number of questions to already planned studies. For other situations, the authors develop interpretable adjustment factors that can be applied to the basic scale-up estimator. The authors conclude with practical recommendations for the design and analysis of future studies.
1. Introduction
Many important problems in social science, public health, and public policy require estimates of the sizes of hidden populations. For example, in HIV/AIDS research, estimates of the size of the most at-risk populations—drug injectors, female sex workers, and men who have sex with men—are critical for understanding and controlling the spread of the epidemic. However, researchers and policymakers are unsatisfied with the ability of current statistical methods to provide these estimates (Joint United Nations Programme on HIV/AIDS 2010). We address this problem by improving the network scale-up method, a promising approach to size estimation. Our results are immediately applicable in many substantive domains in which size estimation is challenging, and the framework we develop advances the understanding of sampling in networks more generally.
The core insight behind the network scale-up method is that ordinary people have embedded within their personal networks information that can be used to estimate the sizes of hidden populations, if that information can be properly collected, aggregated, and adjusted (Bernard et al. 1989, 2010). In a typical scale-up survey, randomly sampled adults are asked about the number of connections they have to people in a hidden population (e.g., “How many people do you know who inject drugs?”) and a series of similar questions about groups of known size (e.g., “How many widowers do you know?”“How many doctors do you know?”). Responses to these questions are called aggregate relational data (McCormick et al. 2012).
To produce size estimates from aggregate relational data, previous researchers have begun with the basic scale-up model, which makes three important assumptions: (1) Social ties are formed completely at random (i.e., random mixing), (2) respondents are perfectly aware of the characteristics of their alters, and (3) respondents are able to provide accurate answers to survey questions about their personal networks. From the basic scale-up model, Killworth, McCarty, et al. (1998) derived the basic scale-up estimator. This estimator, which is widely used in practice, has two main components. For the first component, the aggregate relational data about the hidden population are used to estimate the number of connections that respondents have to the hidden population. For the second component, the aggregate relational data about the groups of known size are used to estimate the number of connections that respondents have in total. For example, a researcher might estimate that members of her sample have 5,000 connections to people who inject drugs and 100,000 connections in total. The basic scale-up estimator combines these pieces of information to estimate that 5 percent (
Unfortunately, the three assumptions underlying the basic scale-up model have all been shown to be problematic. Scale-up researchers call violations of the random mixing assumption barrier effects (Killworth et al. 2006; Maltiel et al. 2015; Zheng, Salganik, and Gelman 2006), they call violations of the perfect awareness assumption transmission error (Killworth et al. 2006; Maltiel et al. 2015; Salganik, Mello, et al. 2011; Shelley et al. 1995, 2006), and they call violations of the respondent accuracy assumption recall error (Killworth et al. 2003, 2006; Maltiel et al. 2015; McCormick and Zheng 2007).
In this paper, we develop a new approach to producing size estimates from aggregate relational data. Rather than depending on the basic scale-up model or its variants (e.g., Maltiel et al. 2015), we use a simple identity to derive a series of new estimators. Our new approach reveals that one of the two main components of the basic scale-up estimator is problematic. Therefore, we propose a new estimator—the generalized scale-up estimator—that combines the aggregate relational data traditionally used in scale-up studies with similar data collected from the hidden population. Collecting data from the hidden population is a major departure from current scale-up practice, but we believe that it enables a more principled approach to estimation. For researchers who are not able to collect data from the hidden population, we propose a series of adjustment factors that highlight the possible biases of the basic scale-up estimator. Ultimately, researchers must balance the trade-offs between the basic scale-up estimator, generalized scale-up estimator, and other size estimation techniques on the basis of the specific features of their research setting.
In Section 2, we derive the generalized scale-up estimator, and we describe the data collection procedures needed to use it. In Section 3, we compare the generalized and basic scale-up approaches analytically and with simulations; our comparison leads us to propose a decomposition that separates the difference between the two approaches into three measurable and substantively meaningful factors (equation 15). In Section 4, we make practical recommendations for the design and analysis of future scale-up studies, and in Section 5, we conclude with a discussion of the steps that follow. Appendices A to G in the online journal provide technical details and supporting arguments.
2. The Generalized Scale-Up Estimator
The generalized scale-up estimator can be derived from a simple accounting identity that requires no assumptions about the underlying social network structure in the population. Figure 1 helps illustrate the derivation, which was inspired by earlier research on multiplicity estimation (Sirken 1970) and indirect sampling (Lavallée 2007). Consider a population of seven people, two of whom are drug injectors (Figure 1a). In this population, two people are connected by a directed edge

Illustration of the derivation of the generalized scale-up estimator. (a) A population of seven people, two of whom are drug injectors (shown in gray). A directed edge
Each person can be viewed as both a source of out-reports and a recipient of in-reports, and in order to emphasize this point, Figure 1b shows the population with each person represented twice: on the left as a sender of out-reports and on the right as a receiver of in-reports. This visual representation highlights the following identity:
Despite its simplicity, the identity in equation (1) turns out to be very useful because it leads directly to the new estimator that we propose.
In order to derive an estimator from equation (1), we must define some notation. Let
where
Equation (3) is an expression for the size of the hidden population that does not depend on any assumptions about network structure or reporting accuracy; it is just a different way of expressing the identity that the total number of out-reports must equal the total number of in-reports. If we could estimate the two terms on the right side of equation (3)—one term related to out-reports (
However, in order to make the identity in equation (3) useful in practice we need to modify it to account for an important logistical requirement of survey research. In real scale-up studies, researchers do not sample from the entire population
where
Unfortunately, despite repeated attempts, we were unable to develop a practical method for estimating the term related to in-reports (
where
To understand the reporting assumption substantively, consider the two possible types of reporting errors: false positives and false negatives. Previous scale-up research on transmission error focused on the problem of false negatives, where a respondent is connected to a member of the hidden population but does not report this, possibly because she is not aware that the person she is connected to is in the hidden population (Bernard et al. 2010). Because hidden populations such as drug injectors are often stigmatized, it is reasonable to suspect that false negatives will be a serious problem for the scale-up method. Fortunately, equation (5) holds even if there are false negative reporting errors. However, false positives—which do not seem to have been considered previously in the scale-up literature—are also possible. For example, a respondent who is not connected to any drug injectors might report that one of her acquaintances is a drug injector. These false positive reports are not accounted for in the identity in equation (5) and the estimators that we derive subsequently. If false positive reports exist, they will introduce a positive bias into estimates from the generalized scale-up estimator. Therefore, in Appendix A in the online journal we (1) formally define an interpretable measure of false positive reports, the precision of out-reports; (2) analytically show the bias in size estimates as a function of the precisions of out-reports; and (3) discuss two research designs that could enable researchers to estimate the precision of out-reports.
2.1. Estimating
from Sampled Data
Equation (5) relates our quantity of interest, the size of the hidden population (
The total number of out-reports (
where
Estimating the average number of in-reports for the hidden population (
A second problem arises because we do not expect respondents to be able to easily and accurately answer direct questions about their visibility (
Given a relative probability sampling design and enriched aggregate relational data, we can now formalize our proposed estimator for
The estimator for
where
The first condition underlying the estimator in equation (7) is related to the design of the survey, and we call it the probe alter condition. This condition describes the required relationship between the visibility of the hidden population to the probe alters and the visibility of the hidden population to the frame population:
where
The second condition underlying the estimator
where
Finally, the third condition underlying the estimator
To recap, using two different data collection procedures—one with the frame population and one with the hidden population—we can estimate the two components of the expression for
We can combine these component estimators to form the generalized scale-up estimator:
Result C.8 proves that the generalized scale-up estimator will be consistent and essentially unbiased if (1) the estimator for the numerator (
One attractive feature of the generalized scale-up estimator (equation 10) is that it is a combination of standard survey estimators. This structure enabled us to derive very general sensitivity results about the impact of violations of assumptions, either individually or jointly. We return to the issue of assumptions and sensitivity analysis when discussing recommendations for practice (Section 4).
3. Comparison Between the Generalized and Basic Scale-Up Approaches
In Section 2, we derived the generalized network scale-up estimator by using an identity relating in-reports and out-reports as the basis for a design-based estimator. The approach we followed differs from previous scale-up studies, which have posited the basic scale-up model and derived estimators conditional on that model. In this section, we compare these two different approaches from a design-based perspective.
We begin our comparison by reviewing the basic scale-up model, which was used in most of the studies listed in Table 1. To review this model, we need to define another quantity: we call
Network Scale-up Studies That Have Been Completed
The basic scale-up model assumes that each person’s connections are formed independently, that reporting is perfect, and that visibility is perfect (Killworth, McCarty, et al. 1998). Together, these three assumptions lead to the probabilistic model:
for all
The basic scale-up model leads to what we call the basic scale-up estimator:
where
Given this background, we can now compare the basic and generalized scale-up approaches by comparing their estimands; that is, we compare the quantities that they produce in the case of a census with perfectly observed degrees. The basic scale-up estimand can be written
where
Comparing equations (13) and (14) reveals that both estimands have the same numerator, but they have different denominators. The network reporting identity from Section 2 (total out-reports = total in-reports) shows that the appropriate way to adjust the out-reports is based on in-reports, as in the generalized scale-up approach. However, the basic scale-up approach instead adjusts out-reports with the degree of respondents. Although using the degree of respondents cleverly avoids any data collection from the hidden population, our results reveal that it will be correct only under a very specific special case (
To further clarify the relationship between the basic and generalized scale-up approaches, we propose a decomposition that separates the difference between the two estimands into three measurable and substantively meaningful adjustment factors:
The decomposition shows that when the product of the adjustment factors is 1, the two estimands are both correct. However, when the product of the adjustment factors is not 1, then the generalized scale-up estimand is correct but the basic scale-up estimand is incorrect. We now describe each of the three adjustment factors in turn.
First, we define the frame ratio,
Next, we define the degree ratio
Finally, we define the true positive rate,
Furthermore, the decomposition in equation (15) can be used to derive an expression for the bias in the basic scale-up estimator when we have a census and degrees are known:
The comparison between the basic and generalized scale-up approaches leads to two main conclusions. First, the estimand of the basic scale-up approach is correct only in one particular situation: when the product of the three adjustment factors is 1. The estimand of generalized scale-up approach, in contrast, is correct more generally. Second, as equation 15 shows, if the adjustment factors are known (or have been estimated), then they can be used to improve basic scale-up estimates.
3.1. Illustrative Simulation
To illustrate our comparison between the basic and generalized scale-up approaches, we conducted a series of simulation studies. The simulations were not meant to be a realistic model of a scale-up study, but rather, they were designed to clearly illustrate our analytic results. More specifically, the simulation investigated the performance of the estimators as three important quantities vary: (1) the size of the frame population
As described in detail in Appendix G in the online journal, we created populations of
Figure 2 shows that the simulations support our analytic results. First, the simulations show that the generalized scale-up estimator is unbiased even in the presence of incomplete sampling frames, nonrandom mixing, and imperfect reporting. Second, they show that the basic scale-up estimator is unbiased in a much smaller set of situations. More concretely, the basic scale-up estimator is unbiased in situations in which the basic scale-up model holds—when everyone is in the frame population (

Estimated size of the hidden population for the generalized and basic scale-up estimators. Each panel shows how the two estimators change as the amount of random mixing is varied from low (

Bias (open circles and diamonds) and predicted bias (solid lines) in the basic scale-up estimates and generalized scale-up estimates for the same parameter configurations depicted in Figure 2. Our analytical results (equation 20) accurately predict the bias observed in our simulation study.
4. Recommendations for Practice
The results in Sections 2 and 3 lead us to recommend a major departure from current scale-up practice. In addition to collecting a sample from the frame population, we recommend that researchers consider collecting a sample from the hidden population so that they can use the generalized scale-up estimator. As our results clarify, researchers using the scale-up method face a decision: they can collect data from the hidden population or they can make assumptions about the adjustment factors described in Section 3. The appropriate decision depends on a number of factors, but we think that two are most important: (1) the difficulty of sampling from the hidden population and (2) the availability of high-quality estimates of the adjustment factors in Section 3. For example, if it is particularly difficult to sample from a specific hidden population and high-quality estimates of the adjustment factors are already available, then a basic scale-up estimator may be appropriate. If, however, it is possible to sample from the hidden population and there are no high-quality estimates of adjustment factors, then the generalized scale-up estimator may be appropriate. Many realistic situations will be somewhere between these two extremes, and the trade-offs must be weighed on a case-by-case basis.
To aid researchers deciding between basic and generalized scale-up approaches, we collected the conditions needed for consistent and essentially unbiased estimates into Table 2; formal proofs of these results are presented in Online Appendices B and C. We find it helpful to group these conditions into four broad categories: sampling, survey construction, network structure, and reporting behavior.
Summary of the Conditions Needed for the Generalized and Modified Basic Network Scale-up Estimators, and Their Components, to Produce Estimates That Are Consistent and Essentially Unbiased
Note: This table uses the version of the basic scale-up estimator we recommend in Section 4.2.
A review of the conditions in Table 2 necessarily raises practical concerns. In situations in which researchers are trying to make estimates about real hidden populations, they probably will not know how close they are to meeting these conditions. Therefore, researchers may wonder how their estimates will be affected by violations of these assumptions, both individually (e.g., “How would my estimates be affected if there was a problem with the survey construction?”) and jointly (e.g., “How would my estimate be affected if there was a problem with my survey construction and reporting behavior?”). To address this concern, in Appendix D in the online journal, we develop a framework for sensitivity analysis that shows researchers exactly how estimates will be affected by violations of all assumptions, either individually or jointly. Table 3 summarizes the results of our sensitivity framework.
Analytical Expressions Researchers Can Use to Perform Sensitivity Analysis for Estimates Made Using Scale-up Estimators
Note: See Appendix D in the online journal for more details.
Another problem researchers face in practice is putting appropriate confidence intervals around estimates. The procedure currently used in scale-up studies was proposed by Killworth, McCarty, et al. (1998), but it has a number of conceptual problems, and in practice, it produces intervals that are anticonservative (i.e., the actual coverage rate is lower than the desired coverage rate). Both of these problems—theoretical and empirical—do not seem to be widely appreciated in the scale-up literature. Therefore, instead of the current procedure, we recommend that researchers use the rescaled bootstrap procedure (Rao, Wu, and Yue 1992; Rao and Wu 1988; Rust and Rao 1996), which has strong theoretical foundations, does not depend on the basic scale-up model, can handle both simple and complex sample designs, and can be used for both the basic scale-up estimator and the generalized scale-up estimator. In Appendix F in the online journal, we review the current scale-up confidence interval procedure and the rescaled bootstrap, highlighting the conceptual advantages of the rescaled bootstrap. Furthermore, we show that the rescaled bootstrap produces slightly better confidence intervals in three real scale-up data sets: one collected via simple random sampling (McCarty et al. 2001) and two collected via complex sample designs (Salganik, Fazito, et al. 2011; Rwanda Biomedical Center 2012). Finally, and somewhat disappointingly, our results show that none of the confidence interval procedures work very well in an absolute sense, a finding that highlights an important problem for future research.
We now provide more specific guidance for researchers based on the data they decide to collect. In Section 4.1 we present recommendations for researchers who collect a sample from both the frame population,
4.1. Estimation with Samples from
and
We recommend that researchers who have samples from
For researchers using the generalized scale-up estimator, we have three specific recommendations. Of all the conditions needed for consistent and essentially unbiased estimation, the ones most under the control of the researcher are those related to survey construction, so we recommend that researchers focus on these during the study design phase. In particular, we recommend that the probe alters be designed so that the rate at which the hidden population is visible to the probe alters is the same as the rate at which the hidden population is visible to the frame population (see Result C.2 for a more formal statement, and see Section C5 for more advice about choosing probe alters). Second, when presenting estimates, we recommend that researchers use the results in Table 3 to also present sensitivity analyses highlighting how the estimates may be affefcted by assumptions that are particularly problematic in their setting. Finally, we recommend that researchers produce confidence intervals around their estimate using the rescaled bootstrap procedure, keeping in mind that this will likely produce intervals that are anticonservative.
We also have three additional recommendations that will facilitate the cumulation of knowledge about the scale-up method. First, although the generalized scale-up estimator does not require aggregate relational data from the frame population about groups of known size, we recommend that researchers collect these data so that the basic and generalized estimators can be compared. Second, we recommend that researchers publish estimates of

Recommended schematic of inputs and outputs for a study using the generalized scale-up estimator. We recommend that researchers produce size estimates using the generalized scale-up estimator and that researchers produce estimates of the adjustment factors
4.2. Estimation with Only a Sample from
If researchers cannot collect a sample from the hidden population, we have three recommendations. First, we recommend two simple changes to the basic scale-up estimator that remove the need to adjust for the frame ratio,
Instead of equation 22, we suggest a new estimator, called the modified basic scale-up estimator, that more directly deals with the fact that researchers sample from the frame population
There are two differences between the modified basic scale-up estimator (equation 23) and the basic scale-up estimator (equation 22). First, we recommend that researchers estimate
Our second recommendation is that researchers using the modified basic scale-up estimator (equation 23) perform a sensitivity analysis using the results in Table 3. In particular, we think that researchers should be explicit about the values that they assume for the adjustment factors
5. Conclusion and Next Steps
In this paper, we developed the generalized network scale-up estimator. This new estimator improves upon earlier scale-up estimators in several ways: it enables researchers to use the scale-up method in populations with nonrandom social mixing and imperfect awareness about membership in the hidden population, and it accommodates data collection with complex sample designs and incomplete sampling frames. We also compared the generalized and basic scale-up estimators, leading us to introduce a framework that makes the design-based assumptions of the basic scale-up estimator precise. Finally, researchers who use either the basic or generalized scale-up estimator can use our results to assess the sensitivity of their size estimates to assumptions.
The approach we followed to derive the generalized scale-up estimator has three elements, and these elements may prove useful in other problems related to sampling in networks. First, we distinguished between the network of reports and the network of relationships. Second, using the network of reports, we derived a simple identity that permitted us to develop a design-based estimator free of any assumptions about the structure of the network of relationships. Third, we combined data from different types of samples. Together, these three elements may help other researchers in other situations derive relatively simple, design-based estimators that are an important complement to complex, model-based techniques.
Although the generalized scale-up estimator has many attractive features, it also requires that researchers obtain two different samples, one from the frame population and one from the hidden population. In cases in which studies of the hidden population are already planned (e.g., the behavioral surveillance studies of the groups most at risk for HIV/AIDS), the necessary information for the generalized scale-up estimator could be collected at little additional cost by appending a modest number of questions to existing questionnaires. In cases in which these studies are not already planned, researchers can either collect their own data from the hidden population, or they can use the modified basic scale-up estimator and borrow estimated adjustment factors from other published studies.
The generalized scale-up estimator, like all estimators, depends on a number of assumptions, and we think three of them will be most problematic in practice. First, the estimator depends on the assumption that there are no false positive reports, which is unlikely to be true in all situations. Although we have derived an estimator that works even in the presence of false positive reports (Appendix A in the online journal), we were not able to design a practical data collection procedure that would allow us to estimate one of the terms it requires. Second, the generalized scale-up estimator depends on the assumption that hidden population members have accurate aggregate awareness about visibility (equation 9). That is, researchers have to assume that hidden population respondents can accurately report whether or not their alters would report them, and we expect this assumption will be difficult to check in most situations. Third, the generalized scale-up estimator depends on having a relative probability sample from the hidden population. Unfortunately, we cannot eliminate any of these assumptions, but we have stated them clearly and we have derived the sensitivity of the estimates to violations of these assumptions, individually and jointly.
Our results and their limitations highlight several directions for further work, in terms of both of improved modeling and improved data collection. We think the most important direction for future modeling is developing estimators in a Bayesian framework, and a recent paper by Maltiel et al. (2015) offers some promising steps in this direction. We see two main advantages of the Bayesian approach in this setting. First, a Bayesian approach would allow researchers to propagate the uncertainty they have about the many assumptions involved in scale-up estimates, whereas our current approach captures only uncertainty introduced by sampling. Furthermore, as more empirical studies produce estimates of the adjustment factors (
Footnotes
Acknowledgements
We thank Alexandre Abdo, Francisco Bastos, Russ Bernard, Neilane Bertoni, Dimitri Fazito, Sharad Goel, Wolfgang Hladik, Jake Hofman, Mike Hout, Karen Levy, Rob Lyerla, Mary Mahy, Chris McCarty, Maeve Mello, Tyler McCormick, Damon Phillips, Justin Rao, Adam Slez, and Tian Zheng for helpful discussions. Many of the methods described in this paper can be implemented using our accompanying open-source R package, called networkreporting, which is available on the Comprehensive R Archive Network. Replication materials for our analysis can be freely downloaded from the Harvard Dataverse (
).
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the Joint United Nations Programme on HIV/AIDS, the National Science Foundation (CNS-0905086), and the Eunice Kennedy Shriver National Institute of Child Health and Human Development (R01-HD062366, R01-HD075666, and R24-HD047879). Some of this research was conducted while Dr. Salganik was an employee at Microsoft Research. The opinions expressed here represent the views of the authors and not the funding agencies.
Notes
Author Biographies
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
