How Spatially Concentrated Are Industrial Clusters?: A Meta-analysis

Abstract

This study conducts a meta-analysis of empirical studies that have measured the spatial scale of industrial clustering. Two types of scales are examined: the peak scale (at which cluster effects are maximized) and the maximum reach (beyond which cluster effects are undetectable). We find that the scale varies significantly by the unit of analysis, industry sector, country of study, and the sources of cluster effects examined (e.g., knowledge spillovers, localization, and urbanization). Planners and policy makers should tailor the geographies embodied in cluster strategies to match the specific local needs and circumstances.

Keywords

industrial cluster, spatial scale, meta-analysis

Introduction

Urban areas are centers of production and innovation due to the benefits brought by clusters of related industries (Beaudry and Schiffauerova 2009; Duranton and Puga 2001, 2004). Since Marshall (1890) and Jacobs (1969), researchers have intensively studied the causes and benefits of industrial clusters (Feldman 1994; Rosenthal and Strange 2001, 2004). These scholars have sought to understand the clustering processes and develop cluster strategies to enhance regional economic advantages. Numerous studies have quantified how much clusters contribute to productivity, wages, innovation, and entrepreneurship (Andersson, Klaesson, and Larsson 2016; Melo, Graham, and Noland 2009; Renski 2011), and the extent to which Marshallian (localization), Jacobs (diversity or urbanization), and Porter (competition) mechanisms are the drivers of cluster effects (Beaudry and Schiffauerova 2009).

A significant gap hinders our understanding of industrial clusters in their geographical domain. Whereas the benefits of clusters are expected to decay over distance (Rosenthal and Strange 2003, 2008), the specific geographical scales at which clusters exhibit impacts are not well understood (Andersson, Klaesson, and Larsson 2016; Overman 2003). The theoretical foundation and empirical evidence both are too limited to answer with any certainty whether a cluster effect extends throughout the whole labor market or a metropolitan area or is confined within a neighborhood. Earlier empirical studies have identified cluster effects at various levels of aggregation, such as a metropolitan area or labor market (Burger, Van Oort, and Van der Knaap 2007; Neffke et al. 2011), state (Bryan and Morten 2015; Rosenthal and Strange 2001), county (Fallah, Partridge and Rickman 2014; Hanink 2006), or city (Dekle and Eaton 1999; Han and Ke 2016). Nearly all of these studies selected the level of aggregation to investigate clusters as part of their research design rather than treating the geographic scale of cluster impacts as an empirical question. The same approach is taken by the US Cluster Mapping Project of the Harvard Business School (2018). While meticulously investigating industrial relatedness in order to define the sectoral membership of clusters, geographically, the US Cluster Mapping Project simply adopts the usual spatial aggregations—states, metropolitan areas, and counties—to map clusters. Some more recent studies have applied firm-level data to more precisely measure the distance at which cluster effects peak or the pace at which they subside (Aharonson, Baum, and Feldman 2007; Barlet, Briant, and Crusson 2013; Behrens and Bougna 2015). 
However, these empirical findings are mixed, with estimated spatial scales ranging from less than one mile in radius to more than 300 miles (Kolympiris, Kalaitzandonakes, and Miller 2015; Partridge and Rickman 2008; Rosenthal and Strange 2005).

It is, nevertheless, critical to understand the spatial scale of industrial clusters. Theoretically, estimating the spatial scale of clusters helps us to better capture the underlying mechanism(s) that drives the cluster effect. Most urban areas are blessed with the presence of not one but multiple clustering mechanisms; in these cases, estimating their varied spatial scales is a necessary step in determining how various clustering benefits may stack together geographically. For researchers, a deeper grasp of clusters’ spatial scale aids in selecting appropriate geographical units for conducting future empirical analyses (Cainelli and Ganau 2018). In practical applications, correct apprehension of the spatial scale of clusters makes it possible for planners to encourage firm and worker concentration at appropriate geographical units via policies and strategies such as zoning, industrial or innovation districts, and location-based incentives (Arauzo-Carod and Manjón-Antolín 2012).

This study fills the gap by conducting a meta-analysis of prior empirical studies that have explicitly measured the spatial scale of industrial clusters. We distinguish between two types of scales: the peak scale (where a cluster effect is maximized) and the maximum reach (the distance beyond which cluster effects are no longer detectable). This analysis makes two contributions. First, we provide a set of reference estimates of the spatial scale of clusters by summarizing and amalgamating the findings of existing studies. Second, we identify factors that account for differences in the estimated scales. For example, different clustering mechanisms (e.g., knowledge spillovers, localization, and urbanization) operate at different scales. Once these determinants are taken into account, the mixed empirical findings of prior studies can be reconciled. For local planners, the takeaway is that the geographical unit at which to promote clusters and propel urban economic prosperity should be tailored to specific policy aims and local characteristics.

The paper proceeds as follows. The next section qualitatively summarizes prior studies, especially empirical studies estimating the spatial scale of industrial clusters. The third section introduces the collection of empirical studies, the inclusion criteria, and the methodology for the meta-analysis. The fourth and fifth sections present and discuss the descriptive and quantitative findings, and the final section concludes with a summary of the results and their scholarly and practical implications.

Literature

Researchers have examined the benefits of industrial clusters, or urban agglomeration as most economists call it, since at least the nineteenth century. Marshall (1890) summarized the benefits of clusters of similar industrial activity (later termed localization economies) as labor pooling, cost savings with regard to specialized inputs, and knowledge spillovers. Ohlin (1933) and Hoover (1937) distinguished localization economies from urbanization economies—the benefits that arise from and are gained by a variety of economic activities locating together.¹ The seminal works of Jacobs (1961, 1969) described cross-fertilization across a diversity of urban industries and social interactions as a distinct source of clustering effects. More recently, Porter (1998) emphasized the essentiality of competition in the urban environment and argued that a “keeping up with the Joneses” motivation helps regions to maintain competitiveness in the global market. Following these lines of theoretical discussion, empirical studies have flourished in recent decades to estimate the magnitude of cluster effects and to differentiate various mechanisms of cluster benefits. Examples include Ellison, Glaeser, and Kerr (2010); Jaffe, Trajtenberg, and Henderson (1993); and Rosenthal and Strange (2001, 2003).

Industrial clusters have been found to exhibit a variety of benefits. For example, Strøjer Madsen, Smith, and Dilling-Hansen (2003) have found significantly higher productivity of Danish firms in clusters than their counterparts outside of the clusters. They argued that many mechanisms might account for this boost in productivity, such as networks, knowledge spillovers, and human capital mobility (i.e., labor pooling). Similar effects are found in the Japanese silk-reeling industry (Arimoto, Nakajima, and Okazaki 2014), in US manufacturing industries (Rigby and Essletzbichler 2002), and in China’s textile industry (Lin, Li, and Yang 2011). Melo, Graham, and Noland (2009) carried out a meta-analysis to summarize the results of studies that examined the relationship between urban agglomeration and productivity.

Innovation is another cluster benefit that has been examined frequently. For example, Wallsten (2001) found that firms close to an innovative firm are more likely to innovate themselves. A similar effect is confirmed by Beaudry and Breschi (2003) and Kolympiris and Kalaitzandonakes (2013). Other studies have identified the positive impact of own-industry or other-industry employment on firm innovation (Baptista and Swann 1998; Fang 2020; Feldman and Audretsch 1999; Moreno, Paci, and Usai 2006). This benefit of clusters on innovation comes from the mechanisms of knowledge spillovers, sharing of common (knowledge) inputs, and better matching between the skills needed by firms and those possessed by workers (Carlino and Kerr 2015). Carlino and Kerr (2015) have written a review paper, and Fang (2015) a meta-analysis to summarize the theoretical and empirical studies examining the relationship between clusters and innovation. Relatedly, entrepreneurship is another cluster benefit. Clusters are found to encourage the birth of new firms (Arzaghi and Henderson 2008), their growth (Delgado, Porter, and Stern 2010; Rosenthal and Strange 2005), and in some cases, their survival (Renski 2011). All of the mechanisms responsible for uplifting productivity and innovation may be conducive to entrepreneurship.

Clusters are found to boost wages (Di Addario and Patacchini 2008; Fu 2007; Rosenthal and Strange 2008). These authors argue that the effect arises from a combination of clusters’ attraction to high-skilled labor, their capacity to train and retain these workers, and skill complementarities that enhance labor force productivity. Other cluster benefits, such as more extended and in-depth networks, the attraction of capital investments, and higher returns to commercial real estate, have also been studied, though less frequently (Kolympiris, Kalaitzandonakes, and Miller 2011; Koster 2013; Ma et al. 2014).

There is, however, a lack of theoretical guidance and no empirical consensus on the spatial scale of industrial clusters, even though empirical studies of clusters generally must pick one or more geographical units of analysis. Industrial clusters generate assorted benefits by varied mechanisms, which may operate or accumulate at different spatial extents. Moreover, whereas there are solid foundations, both theoretical and empirical, to signify that cluster effects attenuate over distance (Andersson, Klaesson, and Larsson 2016; Koster 2013; Rosenthal and Strange 2008), how rapidly such attenuation occurs is a subject that has not been addressed by theories of clusters. Thus, the spatial scale of clusters remains an empirical question and a choice for researchers to make in conducting empirical studies. The options include administrative units such as states (Bryan and Morten 2015; Rosenthal and Strange 2001), counties (Fallah, Partridge and Rickman 2014; Hanink 2006), or cities (Dekle and Eaton 1999; Han and Ke 2016) and economically determined geographical units such as metropolitan areas or labor markets (Burger, Van Oort, and Van der Knaap 2007; Neffke et al. 2011). More recently, accessible microlevel data have enabled distance to be examined more precisely, using individual firm and establishment locations (Albert, Casanova, and Orts 2012; Barlet, Briant, and Crusson 2013; Brown, Dar-Brodeur, and Tweedle 2020). Case studies of specific industrial clusters also are common but typically do not explicitly define the geographical domain of the clusters studied (Austrian 2000).

Two types of spatial scales have been estimated in prior studies as Figure 1 illustrates. The peak scale is the distance at which the cluster achieves its maximum effect. Note that some studies assumed that the peak occurs at a distance of zero (de Groot, Poot, and Smit 2016; Drucker and Feser 2012; Han and Ke 2016), whereas others allowed the peak distance to be nonzero. The latter approach encompasses the notion that at some scales increases in distance may enlarge cluster effects by incorporating more employment activities and potential knowledge sources. These studies either found the cluster effect to attenuate continuously over distance (Wallsten 2001) or not necessarily to do so (Fang 2019). Thus, whether the peak scale is zero or not is an empirical question. The second type of spatial scale, maximum reach, is the distance beyond which the cluster effect diminishes past recognition; it defines the territory of the cluster effect. Table 1 shows various estimations of these two scales in prior studies, classified by different outcome variables. The range of these estimates is considerable, showing substantial heterogeneity across studies.

Figure 1.

Two types of spatial scales of industrial clusters.
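The two scales can be made concrete with a minimal Python sketch. It assumes a study reports one effect estimate per distance band together with a significance flag; the `BandEstimate` type and the numbers are hypothetical illustrations, and individual studies may operationalize both scales differently.

```python
from typing import NamedTuple, Optional, Sequence

class BandEstimate(NamedTuple):
    distance_miles: float  # outer edge of the distance band (hypothetical data)
    effect: float          # estimated cluster effect within that band
    significant: bool      # whether the estimate passed a significance test

def peak_scale(profile: Sequence[BandEstimate]) -> float:
    """Distance at which the estimated cluster effect is largest."""
    return max(profile, key=lambda band: band.effect).distance_miles

def maximum_reach(profile: Sequence[BandEstimate]) -> Optional[float]:
    """Largest distance at which the effect is still statistically detectable."""
    detectable = [band.distance_miles for band in profile if band.significant]
    return max(detectable) if detectable else None

# Hypothetical distance-decay profile: the effect rises, peaks, then fades.
profile = [
    BandEstimate(1, 0.08, True),
    BandEstimate(5, 0.12, True),
    BandEstimate(10, 0.05, True),
    BandEstimate(25, 0.01, False),
]
print(peak_scale(profile), maximum_reach(profile))
```

In this toy profile the peak scale is nonzero because the effect at five miles exceeds the effect at one mile, which is the case that studies assuming a peak at zero distance rule out by construction.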

Table 1.

The Average and Median Spatial Scales of Clusters (in Miles), Weighted by Sample Size.

Outcome	Productivity	Firm/Employee Density	Innovation	Entrepreneurship	Wage	Other
Peak scale
Average	28.87	28.48 (31.99)	33.86 (27.23)	4.88	7.59 (7.95)	95.17 (14.66)
Median	7.5	10 (8.16)	4.71 (3.5)	0.5	2 (2.25)	5 (0.15)
Maximum reach
Average	52.20	39.52 (40.97)	50.46	13.23	18.12	38.78
Median	1.18	8.82 (8.53)	1.76	1	6	1.76

Note: Number inside parentheses is the result excluding studies/records without tests of statistical significance.
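The gap between the averages and medians in Table 1 is what one would expect when a few records report very large distances. The Python sketch below shows one way to compute a sample-size-weighted mean and median; the four records are invented for illustration, and the weighted-median rule used here (the smallest value at which cumulative weight reaches half the total) is an assumption, since several definitions exist.

```python
def weighted_mean(values, weights):
    """Mean of values, each counted in proportion to its weight."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)

def weighted_median(values, weights):
    """Smallest value at which cumulative weight reaches half of the total."""
    half = sum(weights) / 2
    running = 0.0
    for value, weight in sorted(zip(values, weights)):
        running += weight
        if running >= half:
            return value
    raise ValueError("values and weights must be non-empty")

# Hypothetical records: estimated scale in miles and the sample size behind it.
scales_miles = [0.5, 2.0, 10.0, 300.0]
sample_sizes = [4000, 3000, 2500, 500]

print(weighted_mean(scales_miles, sample_sizes))
print(weighted_median(scales_miles, sample_sizes))
```

A single 300-mile record with a small sample pulls the weighted mean far above the weighted median, which is why the two statistics are reported side by side.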

Several factors may have contributed to this heterogeneity. For example, Andersson, Klaesson, and Larsson (2016); Arzaghi and Henderson (2008); and Duranton and Puga (2004) suggested that different underlying mechanisms for clusters likely operate at various geographical scales. The attenuation rate of clusters may also vary by location (Cainelli and Ganau 2018; Fu 2007; Rosenthal and Strange 2008). Moreover, different economic outcomes affected by clusters, such as productivity, wages, innovation, and entrepreneurship, are likely to concentrate at different geographical scales (Cainelli and Ganau 2018; Halpern and Muraközy 2007; McHale, Agrawal, and Kapur 2008; Rosenthal and Strange 2005, 2008), a point corroborated by the evidence in Table 1.

Given these mixed findings, it is crucial to establish a general reference for the spatial scale of industrial clusters to guide future studies and policy choices. Determining which factors change the estimated spatial scales, and by how much, is equally critical to advancing our understanding of clusters. This study takes on these two tasks.

Data and Method

Data

The literature used in the meta-analysis was collected through the following procedures. First, we searched keyword combinations of “spatial (or geographical) scale”/“spatial (or geographical) extent”/“spatial (or geographical) decay” and “cluster”/“agglomeration” through the first twenty pages of Google Scholar (i.e., the most-cited articles), the first 1,000 relevant studies in the Web of Science database, and the entire collections of Social Science Research Network (SSRN) and National Bureau of Economic Research (NBER) working papers to account for unpublished works. Second, we manually examined these collections and removed articles of little relevance or that do not contain an empirical examination of the spatial scale of clusters. After this screening, we were left with 106 articles collected from Google Scholar (of which eighty-seven overlap with the collection from the Web of Science, and nineteen are unique to Google Scholar), ninety-one from the Web of Science (of which four are unique), two from SSRN, and one from NBER. Third, we added nineteen additional studies that appear repeatedly in the reference lists of the collected papers and three more independently known to the authors. This process provided us with a total of 135 articles.
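The article counts reported above can be reconciled with a short tally. The Python sketch below uses only the figures stated in the text; the variable names are illustrative, and it assumes the SSRN and NBER papers do not duplicate any database result, which the stated total implies.

```python
# Counts after screening, as reported in the text.
google_scholar_total = 106
google_scholar_unique = 19   # found in Google Scholar only
web_of_science_total = 91
ssrn, nber = 2, 1
from_reference_lists, known_to_authors = 19, 3

# Articles found in both Google Scholar and the Web of Science.
overlap = google_scholar_total - google_scholar_unique
web_of_science_unique = web_of_science_total - overlap

# Count each overlapping article once.
database_articles = (
    overlap + google_scholar_unique + web_of_science_unique + ssrn + nber
)
total_articles = database_articles + from_reference_lists + known_to_authors
print(overlap, web_of_science_unique, database_articles, total_articles)
```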

We then cleaned the collected literature and categorized it into two lists.² The first list contains the research used in both the descriptive analysis (Descriptive Analysis section) and the meta-analysis (Average and Median Estimates section and Fundamental Differences and Methodological Choices Both Influence the Estimated Spatial Scale section). These studies (1) explicitly measured at least one of the two spatial scales of clusters and (2) reported critical methodological and statistical information such as sample size, studied industry, and statistical significance. Applying these two criteria to the 135 collected articles, we are left with this first list of eighty-five articles. Many of these studies contain multiple records resulting from distinct estimations being carried out for different subsamples, industries, time frames, or specifications. All estimation results within the main texts are collected but analyses in appendices are discarded.³ A total of 968 records are obtained from these eighty-five articles. The process of paper collection and selection is illustrated in Figure 2.

Figure 2.

The process of paper collection and selection.

The second list contains studies that we use only for the descriptive analysis because they did not explicitly measure the spatial scales of clusters in a fashion compatible with the quantitative meta-analysis. This list includes another fifty studies. The methodological variations across both lists of studies are explored in detail in the Methodological Variations in Studied Empirical Research section.

Different sets of information are collected for the two lists of studies. For the second list, we collected information on the spatial data level, studied location, vintage, industry/sector, type of model, dependent variable(s), key independent variable(s), method of distance estimation, and the clustering mechanism(s) examined. For the first list, we also collected the unit of distance measurement (e.g., km, m, mile), estimated peak scale and/or maximum reach, sample size, model estimation technique (e.g., density, regression, simulation), statistical significance, number of citations, and journal impact factor. These attributes are used as explanatory variables in the mixed-effect meta-analysis regressions; some of them explain much of the heterogeneity across studies.
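Because the collected studies report distances in different units while the tables report miles, each estimate presumably has to be converted to a common unit before pooling. The Python sketch below shows one way to do so; the `to_miles` helper and its unit labels are hypothetical and not taken from the studies.

```python
# Conversion factors to miles; one international mile is exactly 1.609344 km.
MILES_PER_UNIT = {
    "mile": 1.0,
    "km": 1 / 1.609344,
    "m": 1 / 1609.344,
}

def to_miles(value: float, unit: str) -> float:
    """Convert a distance estimate to miles, rejecting unknown units."""
    try:
        return value * MILES_PER_UNIT[unit]
    except KeyError:
        raise ValueError(f"unknown distance unit: {unit!r}") from None

# Hypothetical estimates reported in three different units.
for value, unit in [(10, "km"), (500, "m"), (3, "mile")]:
    print(f"{value} {unit} = {to_miles(value, unit):.3f} miles")
```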

Method

We adopt the method of meta-analysis to examine the empirical literature. Meta-analysis is a method that estimates the average effect across multiple empirical studies (DerSimonian and Laird 1986). The method was first adopted in clinical trials but has spread widely into other fields including economics (Stanley et al. 2013), business (Orlitzky, Schmidt, and Rynes 2003), urban planning (Fang 2015; Stevens 2017), and regional science (Melo, Graham, and Noland 2009). A meta-analysis overcomes several common shortcomings of a single empirical study, such as a small sample size, a single location, and a limited time frame. By combining a large number of samples from multiple studies, potentially from different countries and periods, a meta-analysis can efficiently measure the average effect and reveal the homogeneity or heterogeneity of the estimated effect across space and time.

We focus on the two different spatial scales of industrial clusters described above. The peak scale is where planners and firm managers should target their efforts, all else equal, to reap maximum benefits from clusters. The maximum reach is the greatest distance at which the benefits of a cluster can identifiably spill over, signifying the largest appropriate scale of regional collaboration in fostering a cluster. All localities within this distance can be mobilized in forming a strong regional cluster, but beyond this distance such collaboration likely is futile. Among the eighty-five studies evaluated in the meta-analysis, eighty-three estimated the peak scale and forty-three estimated the maximum reach.

We separated papers using different outcome variables. This is a choice made based on both theoretical and methodological considerations. Theoretically, industrial clusters are places in which multiple economic mechanisms intertwine and various economic outcomes emerge, impacted by the industrial cluster’s features to different extents. These outcomes may not all manifest at the same spatial scale. Certain outcomes such as entrepreneurship might be quite concentrated in space, as entrepreneurs benefit from a small and tightly connected ecosystem in which they can interact frequently with each other (Arzaghi and Henderson 2008; Rosenthal and Strange 2005). This requires a relatively short physical distance. In contrast, productivity and wages may be elevated throughout the entire labor market, which can be as large as an entire urban area or metropolitan area (Di Addario and Patacchini 2008; Fu and Ross 2013; Lin, Li, and Yang 2011). For example, if a region is attractive to high-skilled labor, people can live and work throughout the metropolitan area and thus boost wages across the entire region. Thus, separating outcome variables helps us produce more meaningful and insightful results.

Methodologically, a major criticism of the application of meta-analysis in social science is that studies are too heterogeneous in terms of their measurement of independent and dependent variables as well as their specific contexts (Nelson and Kennedy 2009). Pooling the results of such studies is akin to averaging apples and oranges. We dealt with this problem in three ways. At the highest level, we separated empirical studies into two lists, the first containing the more or less comparable studies included in the quantitative meta-analysis. The noncomparable studies we summarized only qualitatively and descriptively. At a medium level, we separately analyzed papers that examine different outcome variables. Various outcome variables are measured in disparate ways and with distinct units and may involve different clustering mechanisms. Thus, summarizing them together can be misleading. We classified studies into six groups with different outcome variables: (1) productivity, typically measured by total factor productivity, total output, and total value added; (2) firm and employee density, measured by the number of firms or employees per square mile, or a concentration or agglomeration index of firms or workers; (3) innovation, measured by patent counts or citations, or metrics of research and development (R&D) activity, funding, or awards; (4) entrepreneurship, measured by the number of firm births and new firm employment; (5) wages, measured by wage levels or growth; and (6) other outcome variables, which include housing prices, office rents, and collaboration networks, among others. Each of these outcome variables typically measures a type of cluster benefit as discussed in the Literature section, except for firm and employment density, which is the spatial manifestation of a cluster itself. Lastly, we captured the remaining heterogeneity of prior studies using independent variables in the mixed-effect model to explain why their estimates differ.

For the descriptive analysis, we constructed a table summarizing the estimates and explored methodological variations across studies. For the quantitative meta-analysis, we calculated the average and median estimates for the two spatial scales and conducted a mixed-effect meta-analysis regression (at the record level; Sutton et al. 2000), specified as follows, to explore which factor(s) explain the heterogeneity in the estimation:

$$y_{i r} = \alpha X_{i r} + \beta Z_{i} + \varepsilon_{i r}. \tag{1}$$

In equation (1), i indexes studies, r indexes individual records, and $y_{i r}$ denotes the estimated spatial scales of clusters. $X_{i r}$ and $Z_{i}$ capture the record-level and article-level characteristics, respectively, that may have affected the estimated scales. Conceptually, the difference in the estimates comes from three sources: (1) fundamental differences in what the studies/records estimated, such as different clustering mechanisms, industries, settings (locations and time periods), and other dependent and independent variables; (2) different methodological choices such as the unit of analysis (e.g., establishment, zip code, or county), model specification, and whether the study has tested for statistical significance; and (3) the quality of the analysis, measured by the impact factor of the journal and the subsequent number of citations of the paper. These three sets of variables are included in the regression. Prior to pooling all of the independent variables together in a single regression, we conduct separate estimations for each set of independent variables (Tables 2–5), both to escape the potential multicollinearity problems caused by the large number of dummy variables in the full model and to facilitate interpretation of the estimated coefficients. The pooled model (Table 6) controls for all relevant variables, minimizes possible omitted variable bias, and provides guidance for policy makers and practitioners facing the range of variables in making decisions. $\varepsilon_{i r}$ denotes the error term; standard errors are clustered at the article level to account for the relatedness among records of a single study. Note that these clustered standard errors are also robust to heteroscedasticity across studies. All records are weighted by sample size to count each observation equally.
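The Python sketch below shows one simple reading of this specification: weighted least squares on record-level data, with sample sizes as weights and sandwich standard errors clustered by article. The estimator and software actually used are not stated in the text, so this is an assumption. The data are invented, the function names are illustrative, and no small-sample correction is applied.

```python
from collections import defaultdict

def solve(a, b):
    """Solve the small linear system a @ x = b by Gauss-Jordan elimination."""
    n = len(a)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * p for x, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def wls_clustered(y, X, w, article):
    """Weighted least squares with article-clustered sandwich standard errors."""
    k = len(X[0])
    xtwx = [[sum(wi * xi[a] * xi[b] for xi, wi in zip(X, w)) for b in range(k)]
            for a in range(k)]
    xtwy = [sum(wi * xi[a] * yi for xi, wi, yi in zip(X, w, y)) for a in range(k)]
    coef = solve(xtwx, xtwy)

    # Sum the weighted score x * residual within each article (the cluster).
    scores = defaultdict(lambda: [0.0] * k)
    for xi, wi, yi, g in zip(X, w, y, article):
        resid = yi - sum(c * x for c, x in zip(coef, xi))
        for a in range(k):
            scores[g][a] += wi * xi[a] * resid
    meat = [[sum(s[a] * s[b] for s in scores.values()) for b in range(k)]
            for a in range(k)]

    # X'WX is symmetric, so solving against unit vectors yields its inverse.
    inv = [solve(xtwx, [float(i == j) for i in range(k)]) for j in range(k)]
    half = [[sum(inv[a][c] * meat[c][b] for c in range(k)) for b in range(k)]
            for a in range(k)]
    var = [[sum(half[a][c] * inv[c][b] for c in range(k)) for b in range(k)]
           for a in range(k)]
    return coef, [max(var[a][a], 0.0) ** 0.5 for a in range(k)]

# Hypothetical records: article id, estimated scale (miles), a dummy for
# whether the record examines knowledge spillovers, and the sample size.
article = ["A", "A", "B", "B", "C"]
y = [2.0, 4.0, 20.0, 30.0, 3.0]
X = [[1.0, d] for d in (0.0, 0.0, 1.0, 1.0, 0.0)]  # constant, spillover dummy
w = [100, 300, 200, 200, 100]

coef, se = wls_clustered(y, X, w, article)
print(coef, se)
```

With a constant and a single dummy, the constant equals the weighted mean scale of the baseline records and the dummy coefficient equals the difference between the two weighted group means. This helps when reading the constants and coefficients in the tables that follow, although those models include several dummies that are not mutually exclusive.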

Table 2.

Mechanism Investigated and the Estimated Spatial Scale.

Variables	Peak Scale (miles)	(SE)	Maximum Reach (miles)	(SE)
Constant	−14.846	(9.249)	−9.976	(5.776)
Knowledge spillover	53.747***	(18.023)	105.891***	(23.927)
Competition	24.915*	(9.532)	19.309*	(7.552)
Network	10.005	(7.446)
Spatial	27.745***	(8.618)	73.211***	(5.776)
Localization	36.425***	(6.724)	19.389	(9.891)
Urbanization/agglomeration	36.425*	(16.018)	48.610*	(24.211)
Other mechanisms	5.912	(20.294)	−32.752	(23.807)
F statistic	6.11***		18.27***
R²	.0443		.2291
Number of observations	932		376

Note: Standard errors in parentheses are clustered by article. This group of dummy variables is not perfectly collinear because one record can examine more than one mechanism, with multiple explanatory variables.

*p < .05.

**p < .01.

***p < .005.

Table 3.

Sector of Study and the Estimated Spatial Scale.

Variables	Peak Scale (miles)	(SE)	Maximum Reach (miles)	(SE)
Constant	−194.106***	(9.235)	1.176***	(0.00003)
Manufacturing	224.837***	(6.160)	53.586***	(16.441)
Service	204.562***	(6.613)	2.051	(1.526)
High tech	198.655***	(9.438)	9.644*	(4.942)
All sectors	235.478***	(16.667)	50.374*	(23.989)
Other sectors	206.900***	(9.976)
F statistic	12.02***		5.00***
R²	.061		.112
Number of observations	932		376

Note: Standard errors in parentheses are clustered by article. For the maximum reach model, other sectors are used as the baseline. For the peak scale model, this group of dummy variables is not perfectly collinear because some records examined samples from both the manufacturing and service sectors.

*p < .05.

**p < .01.

***p < .005.

Table 4.

Unit of Analysis and the Estimated Spatial Scale.

Variables	Peak Scale (miles)	(SE)	Maximum Reach (miles)	(SE)
Constant	112.283	(73.399)	100.441**	(38.793)
Establishment/firm/patent	−93.819	(73.547)	−73.268	(40.177)
Zip code/census tract/grid or block	−84.211	(74.353)	−86.982	(39.614)
County	−30.752	(93.859)	164.265***	(47.943)
Metropolitan statistical area/state/labor market	−38.607	(79.981)
F statistic	1.43		25.63***
R²	.098		.480
Number of observations	932		376

Note: Standard errors in parentheses are clustered by article. The baseline group includes all other units.

*p < .05.

**p < .01.

***p < .005.

Table 5.

Choice of Analysis and the Estimated Spatial Scale.

Variables	Peak Scale (miles)	(SE)	Maximum Reach (miles)	(SE)
Constant	41.415	(21.877)	43.198	(27.836)
Regression	−15.940	(16.495)	−46.323***	(13.836)
Histogram/kernel/density methods	−2.041	(17.448)	−33.590	(27.836)
Spatial model	−18.539	(22.175)	−28.070***	(5.324)
Statistical test	−6.164	(16.917)	34.872	(27.322)
F statistic	0.81		0.075*
R²	.020		.023
Number of observations	932		376

Note: Standard errors in parentheses are clustered by article. The baseline group includes all other methods.

*p < .05.

**p < .01.

***p < .005.

Table 6.

Factors Influencing the Estimated Spatial Scale.

Variables	Peak Scale (miles)	(SE)	Maximum Reach (miles)	(SE)
Constant	−60.519	(73.766)	433.255	(254.703)
Article quality
Journal impact factor	−18.960***	(5.975)	12.264	(7.074)
Article citations	0.031	(0.026)	0.029	(0.017)
Geography of study: Other regions as the baseline
United States	−33.631	(17.376)	−10.587	(41.033)
Canada	−19.805	(12.318)	16.570	(46.775)
UK	2.575	(21.478)	−53.838	(38.233)
France	−20.211	(24.176)	195.938***	(25.080)
Germany	−43.035	(33.130)	104.585*	(50.417)
Italy	−0.947	(32.396)	−86.851***	(20.634)
Spain	−2.147	(14.479)
China	−18.899	(19.190)	54.702	(30.142)
Japan	−99.231***	(27.883)	−157.87*	(67.517)
Vintage of study
1960s and before	−12.791	(33.247)	−133.259	(77.692)
1970s	70.682	(39.694)	82.340	(48.785)
1980s	66.331*	(28.800)	−19.929	(25.423)
1990s	25.515	(13.525)	−24.826	(18.264)
2000s	−24.599	(18.714)	−38.866*	(15.924)
2010s	−53.300	(27.794)	−13.997	(32.798)
Unit of analysis: City level as the baseline
Establishment/firm/patent	−92.242***	(27.794)	−184.853***	(41.533)
Zip code/census tract/grid or block	−26.493	(38.212)	−209.888***	(40.688)
County	68.688	(60.182)
MSA/state/labor market	−64.885	(41.065)	−151.965	(88.827)
Outcome variable: Other variables as the baseline
Productivity	33.330	(32.987)	2.708	(13.443)
Innovation	24.140	(30.415)	15.574	(31.923)
Entrepreneurship	−69.147	(41.455)	−4.809	(15.101)
Density	−15.318	(25.844)	−6.112	(7.469)
Wage	5.469	(28.890)	−11.133	(30.561)
Explanatory variables: Other variables as the baseline
Distance	−24.557	(42.318)	59.496***	(13.866)
R&D	−85.695	(56.984)	−47.480	(42.297)
Employment	−111.096	(60.097)	27.482	(31.336)
Number of firms	−89.848	(69.195)	35.899	(37.259)
Agglomeration index	−46.243	(52.179)	76.107**	(27.118)
Mechanism investigated
Knowledge spillover	72.943***	(16.132)	−137.496	(109.458)
Competition	62.807*	(27.084)	−232.549*	(98.023)
Network	15.668	(21.648)
Spatial	42.009*	(19.614)	−10.825	(118.667)
Localization	41.240	(21.963)	−229.316*	(98.055)
Urbanization/agglomeration	31.209	(23.997)	−225.899*	(97.863)
Other mechanisms	73.754***	(25.653)	−32.047	(27.979)
Sector
Manufacturing	226.971***	(11.180)	−11.791	(10.490)
Service	225.288***	(11.184)	−9.617	(8.044)
High tech	199.082***	(15.624)	−60.762*	(28.851)
All sectors	247.965***	(20.225)
Other sectors	188.859***	(33.795)
Analytical method: Other methods as the baseline
Regression	−18.826	(27.449)	−16.220	(85.848)
Histogram/kernel/density methods	−3.538	(9.390)	−92.217	(118.203)
Spatial model	−48.078	(43.142)	−8.333***	(1.93e-06)
Statistical test	−19.689	(13.9)	−24.553	(38.681)
F statistic	10.29***		73.37***
R²	.378		.917
Number of observations	845		321

Note: Standard errors in parentheses are clustered by article.

*p < .05.

**p < .01.

***p < .005.

Methodological Variations in Studied Empirical Research

The eighty-five studies included in the quantitative meta-analysis incorporate many variations but have in common that each estimated the peak scale, the maximum reach, or both. They did so most often with regression analyses that incorporated distance-based measures of clusters. However, some adopted other methods such as constructing indices of concentration, calculating statistical measures of spatial relationships, estimating spatial density patterns, or conducting simulation exercises. These studies are analyzed in the Result section.

The second list of fifty papers encompasses studies that were not used in the meta-analysis because they did not directly estimate the peak scale or maximum reach of industrial clusters or did so in a way that is not comparable with the majority of studies. A few of these works consisted of descriptive summaries of previous studies and did not fully describe the underlying data. Two are themselves meta-analyses.⁴ A couple more studies did not calculate or report a peak scale or maximum reach, despite employing methodologies capable of doing so, aiming instead to apply distance metrics to identify or distinguish industrial clusters. The majority of papers in the second list investigated the spatial scale of clusters with atypical methodologies or measurement strategies.

Several papers incorporated geography through classifying locations by administrative units rather than by adopting distance measures. Rosenthal and Strange (2001), for example, calculated proxies for cluster effects in US manufacturing industries for zip codes, counties, metropolitan statistical areas, and states. Their study did shed light on the spatial scale of clusters, showing that different clustering mechanisms prevail at different geographical units. However, there is no way for us to quantify the peak scale or maximum reach estimated from this study, since administrative units are of quite different sizes.

A few analyses, such as Chauvin (2019), measured distance according to travel time rather than spatial separation. This approach may improve accuracy, particularly for the retail sector that is sensitive to the divergence between distance and travel time. Yet it is challenging to operationalize and requires highly reliable information on travel behavior or transportation networks.

Another set of studies estimated elasticities of the outcome variable with respect to clusters, specifying the functional form of distance decay in advance. Because elasticity need not be constant along the range of distances examined, these analyses usually reported estimates at the sample means. Although the findings are not directly comparable with those of the majority of studies, this method does offer two notable advantages. First, the estimates are not limited to linear relationships between clusters and the outcome analyzed. Second, specifying the functional form of distance decay tightens the focus of the estimation procedure and thus increases its statistical power at the potential cost of inaccuracy or bias due to a flawed specification. Some studies, such as Feser (2002) and Drucker (2012), empirically tested multiple distance specifications.

The largest share of studies in the second list considered distance in a limited fashion, by testing only one particular distance, combining multiple distance-based influences into an index, defining location discretely through concentric distance bands, or most commonly by assuming a linear influence of distance on cluster effects. In these studies, data constraints may have hindered a more thorough evaluation of the spatial differentiation of cluster impacts. In some cases, distance seems to have comprised a relatively minor aspect of a study focused on other factors influencing clusters.

With such substantial divergence in methodologies, it is neither desirable nor possible to summarize the substantive results of the excluded studies as a single group. We provide only the broadest generalization that these studies tend to provide positive evidence of substantial cluster influences across a wide variety of environments.

Result

Descriptive Analysis

Table 7 summarizes the characteristics of the two lists of empirical studies. The number of studies estimating the spatial scale of clusters has been increasing over time. Most studies have focused on North America—the United States and Canada. Several European countries have been repeatedly studied, including Spain, France, Germany, the United Kingdom, and Italy. A smaller set of studies has examined Asian nations, with China and Japan receiving more attention than other Asian countries. The majority of these studies examined vintages of the 1990s and 2000s, the period in which more microlevel data became available. Relatedly, the vast majority of these studies adopted microlevel data, such as establishment, firm, or patent/inventor data. These data are best suited to examine the spatial scale of clusters as they allow precise measurement of the distance between economic actors. In terms of industries, about one-third examined all industries/sectors, one-third focused on manufacturing industries, and the final one-third studied either the service sector or the high-tech industries. The studies/records adopted six groups of different outcome variables: productivity, firm/employee density, innovation, entrepreneurship, wage, and others. About 40 percent examined the density of firms or employees, followed by those examining innovation and productivity as Figure 3 shows.

Table 7.

Number of Studies/Records with Different Characteristics.

Studied location	United States	Canada	Spain	France	Germany	UK	Italy	China	Japan	Other
Studied location	49/480	13/46	7/122	4/62	8/38	6/38	7/26	6/30	3/22	27/154
Publication year	1990–1999		2000–2009		2010–		Working Paper
Publication year	6/24		44/296		57/475		28/227
Vintage	1960s and before		1970s		1980s		1990s		2000s	2010s
Vintage	3/24		15/140		28/156		68/413		83/671	23/134
Unit of analysis	Establishment/firm/patent		Zip code/census tract/grid or block		City/county		MSA/region/labor market/and so on
Unit of analysis	88/654		14/244		15/67		16/48
Studied sector	All		Manufacturing		Service		High tech		Other
Studied sector	59/290		56/412		26/185		23/122		2/4

Note: Slashes separate numbers of studies and numbers of records.

Figure 3.

Studies/records with different outcome variables.

Studies/records also investigated various clustering mechanisms. Most studies explicitly stated which mechanisms they investigated, such as localization effects, urbanization effects, knowledge spillovers, competition, or networking. A few tested the spatial correlation or spatial concentration of firms and workers without specifying a mechanism; we classified these studies together as “spatial” mechanism. We classified localization and specialization together, as both capture the cluster effect stemming from a single industry or a group of very closely related upstream and downstream sectors. Similarly, we grouped urbanization together with diversity and agglomeration effects, as all three capture cluster effects from the mingling of various industries. Different mechanisms often are related to different outcome variables, but these features do not map one to one. For example, the knowledge spillover mechanism typically involves an outcome variable of innovation, but some papers adopted the birth of high-technology firms (i.e., entrepreneurship) and the spatial relationship between a firm and its collaborators as dependent variables. Therefore, we used mechanisms and outcome variables separately to explain the various spatial scales from previous studies as displayed in Tables 2 and 6. Figure 4 shows that most studies and records examined the localization/specialization effect, followed by those studying urbanization/diversity and knowledge spillovers.

Figure 4.

Studies/records estimating different clustering mechanisms.

Table 8 summarizes the distribution of the estimated spatial scales. The spatial scales appear mostly to be small, but the estimates differ by outcome variables. The majority found the peak scale to be within ten miles for studies/records using productivity and wage as outcome variables. For entrepreneurship, the estimated results are overwhelmingly concentrated within one mile. The results are mixed for innovation: 30 percent concentrate within one mile, but many estimated ten to twenty miles or even more than fifty miles. In contrast, for firm and employee density, the results are more evenly distributed across distance categories. The patterns are similar for the maximum reach, except that for innovation, a greater share of studies/records estimated the scale to be within the range of one to ten miles.

Table 8.

Number of Studies/Records with Different Estimated Spatial Scales of Clusters.

Outcome Variables	Productivity	Firm/Employee Density	Innovation	Entrepreneurship	Wage	Other
Peak scale
≤ 1 mile	5/52	22(10)/80(24)	11(2)/76(5)	7/92	4(1)/26(2)	1/8
1–10 miles	9/15	17(7)/121(50)	6/29	3/10	6(1)/33(2)	2/3
10–20 miles	5/29	15(6)/50(25)	4(2)/40(2)	1/4	1/3	0/0
20–50 miles	3/22	13(4)/83(42)	7/15	1/1	1/8	0/0
>50 miles	4/12	12(3)/63(10)	6(1)/40(8)	1/5	1/2	2(1)/8(6)
Total number of studies/records	13/130	25/397	17/200	8/112	8/72	5/19
Maximum reach
≤ 1 mile	4/35	3/23	5/26	6/50	2/18	1/4
1–10 miles	6/15	4(1)/11(2)	6/21	3/32	4/19	3/7
10–20 miles	0/0	4(1)/5(1)	0/0	1/3	0/0	0/0
20–50 miles	2/4	2/8	3/9	1/5	2/12	0/0
50–100 miles	1/14	2/8	2/3	1/1	0/0	0/0
>100 miles	1/8	3/10	2/18	1/4	1/2	1/2
Total number of studies/records	10/76	9/65	11/77	8/95	6/51	4/13

Note: Figures inside parentheses are the numbers of studies/records without tests of statistical significance. The sum of the number of studies by row does not equal the total in the final row because one study may deliver multiple estimates of spatial scales that do not fall into the same distance band.
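The tabulation behind Table 8 can be sketched in a few lines of Python. This is only an illustration: the function names, the treatment of band boundaries (upper bounds inclusive), and the sample records are assumptions and are not taken from the underlying data set. The sketch also shows why study counts summed across bands can exceed the number of distinct studies, as the table note explains.

```python
from collections import defaultdict

# Upper bounds in miles, mirroring the peak-scale rows of Table 8.
# Treating each upper bound as inclusive is an assumption of this sketch.
BANDS = [
    (1.0, "<=1 mile"),
    (10.0, "1-10 miles"),
    (20.0, "10-20 miles"),
    (50.0, "20-50 miles"),
    (float("inf"), ">50 miles"),
]


def band_of(miles):
    """Return the label of the first band whose upper bound covers `miles`."""
    for upper, label in BANDS:
        if miles <= upper:
            return label


def tabulate(records):
    """Count distinct studies and records per band.

    `records` is an iterable of (study_id, estimated_scale_in_miles) pairs.
    Returns {band_label: (number_of_studies, number_of_records)}.
    """
    studies = defaultdict(set)
    counts = defaultdict(int)
    for study_id, miles in records:
        label = band_of(miles)
        studies[label].add(study_id)
        counts[label] += 1
    return {label: (len(studies[label]), counts[label]) for _, label in BANDS}


# Hypothetical records: study "A" contributes estimates to two bands.
example = [("A", 0.5), ("A", 5.0), ("B", 0.25), ("B", 0.75), ("C", 60.0)]
table = tabulate(example)
```

In this made-up example, three distinct studies produce four study counts across the bands, because study "A" appears in both the first and the second band.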

Average and Median Estimates

To obtain a point estimate of the spatial scales of industrial clusters, we calculated the average and median scales, weighted by sample size. The estimated scales, again, vary significantly across outcome variables as Table 1 shows. Entrepreneurship as the outcome variable is associated with the shortest peak scale at 4.88 miles, wage follows at 7.59 miles, while productivity and firm/employee density show similar peak scales at about twenty-eight miles. Innovation is associated with the farthest distance at 33.86 miles, but only 27.23 miles if excluding studies without statistical tests, not much different from productivity and firm/employee density. Note that while the average estimates for wage and entrepreneurship are roughly in line with the distributional patterns (in Table 8), those for productivity, firm/employee density, and innovation are likely affected by outliers.

In these cases, the median estimate may be more suitable. The median peak scale has a much smaller variation across outcome variables, and the estimates are much more in line with the distributional patterns. The median estimate for entrepreneurship is 0.5 miles; for wage, it is 2 miles; for innovation, it is 4.71 miles; and for productivity and firm/employee density, it is 7.5 and 10 miles, respectively.

For maximum reach, the mean estimate for entrepreneurship is also the smallest, 13.23 miles, again followed by wage at 18.77 miles. The largest estimates are for innovation and productivity at 50.46 and 52.20 miles, respectively. The estimate for firm/employee density is in between at 18.77 miles. The median estimates, again, are much smaller and more in line with the distributional patterns: one to two miles for productivity, innovation, and entrepreneurship, six miles for wage, and 8.82 miles for firm/employee density.
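The contrast between the weighted mean and the weighted median can be illustrated with a short Python sketch. The definition of the weighted median used here (the smallest value at which the cumulative weight reaches half of the total weight) is an assumption, since the exact convention is not spelled out above, and the sample values are invented purely for illustration.

```python
def weighted_mean(values, weights):
    """Average of `values`, each weighted by the matching entry in `weights`."""
    total = sum(weights)
    return sum(v * w for v, w in zip(values, weights)) / total


def weighted_median(values, weights):
    """Smallest value whose cumulative weight reaches half of the total weight."""
    pairs = sorted(zip(values, weights))
    half = sum(weights) / 2.0
    cumulative = 0.0
    for value, weight in pairs:
        cumulative += weight
        if cumulative >= half:
            return value


# Hypothetical estimates in miles with hypothetical sample sizes as weights.
scales = [0.5, 2.0, 5.0, 120.0]
sample_sizes = [100, 300, 200, 50]

mean_scale = weighted_mean(scales, sample_sizes)
median_scale = weighted_median(scales, sample_sizes)
```

With these invented numbers, the single 120-mile estimate pulls the weighted mean well above most of the individual estimates, while the weighted median stays at 2 miles. This is the same outlier sensitivity that motivates reporting medians alongside means.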

These results imply that (1) mechanisms underlying each outcome variable are different, and thus clusters exhibiting these impacts prevail at different geographical scales; (2) researchers need to pick the most appropriate geographical unit to study the impact of clusters based on which outcome variables they are examining (e.g., those studying entrepreneurship need to adopt micro-geographical data); and (3) for practical reasons, policy makers aiming to achieve different outcomes in their jurisdictions can use these findings to guide their development of industrial clusters at different geographical scales. For example, a cluster intended to encourage entrepreneurship needs to concentrate geographically within roughly 0.5–5 miles (the median and mean peak scales, respectively) to exhibit the maximum impact.

Fundamental Differences and Methodological Choices Both Influence the Estimated Spatial Scale

The studies’ fundamental differences exhibit a significant impact on the estimated spatial scales of clusters; among them, different cluster mechanisms and sector of study appear to be the most prominent factors. Different cluster mechanisms, for example, explain 4 percent of the variation in estimated differences in peak scale, and 23 percent in maximum reach as Table 2 shows. Those examining “network” found a smaller peak scale than most of the other mechanisms, though the peak distance is not statistically significant. This is sensible as networks function at the interpersonal level, requiring face-to-face interactions that likely operate at a walkable scale (Arzaghi and Henderson 2008; Rosenthal and Strange 2005).

“Knowledge spillover” exhibits the largest peak scale and maximum reach among all mechanisms. This may be surprising, as tacit knowledge spillovers are thought to be most intensively exchanged at a localized scale. For example, when Wallsten (2001) measured knowledge spillovers by whether a small firm is more likely to obtain a Small Business Innovation Research Award after another nearby firm received one, he found that the peak impact occurred at a quarter mile of distance between the two firms. But then, as part or all of the knowledge is later codified, for example, turned into patents and new products, it can transmit with limited spatial decay across greater distances. For example, Gittelman (2007) found that patent citations, again measuring knowledge spillovers, can demonstrate effects extending to fifty miles. The finding is also consistent with studies of the impacts of universities and research institutes, which have suggested that knowledge spillovers that can be influential with infrequent face-to-face interactions and temporary proximity operate at large spatial scales (Drucker 2016; Goldstein and Drucker 2006; Hausman 2012; Torre 2008; Woodward, Figueiredo, and Guimarães 2006). This heterogeneity in knowledge and thus in the spatial scale of knowledge spillovers has gone largely unnoticed in prior studies. One exception is Duranton and Puga (2001), who argued that radical innovation is more likely to happen in bigger and more diversified cities (spatially unbounded) while routine innovation is likely to happen in smaller and more specialized cities (spatially constrained). Rather than treating “knowledge spillover” as a unitary phenomenon, researchers and practitioners should think carefully of the specific type of knowledge involved as well as the context of knowledge exchange processes.

“Localization” and “urbanization” both exhibit relatively large peak scales among all mechanisms, but only “urbanization” shows a relatively large maximum reach. In other words, “localization” does not appear to be more local than “urbanization” for its greatest impact but is more geographically constrained in terms of its extent. Similar to “urbanization,” “spatial proximity” appears to be less constrained in its peak scale and maximum reach. Finally, “competition” exhibits a medium peak scale, whereas its maximum reach is spatially constrained. These findings signal the importance of targeting different geographical scales in accord with divergent policy aims. To spur networking, a local geographical scale that facilitates frequent face-to-face interaction is critical (Gertler 2003; Storper and Venables 2004). For the exchange of codified knowledge and to embrace a general urbanization effect, a larger spatial scale is not an impediment.

These results also help us make sense of the literature that examines theories of localization, urbanization, and competition as contenders for explaining the most important source of cluster benefits. Beaudry and Schiffauerova (2009), for instance, summarized sixty-seven empirical analyses and found that studies revealing positive and significant urbanization effects outnumbered those that identified positive localization and competition effects. Their result is reasonable as most of the sixty-seven studies were based on regional level data, analyzing a relatively large spatial scale across which the effects of localization and competition may have decayed beyond detection (because they exhibit shorter maximum reach). The three sources of cluster benefits, rather than competing directly, may complement each other across the range of distances. The next breakthrough in cluster theories may come from figuring out how localization, urbanization, and competition interact at different spatial scales.

The sector of study also exhibits significant impacts on the estimated spatial scale as shown in Table 3. The manufacturing sector has the largest peak scale, followed by the service sector. Studies focusing on the high-technology sector estimated the smallest scales. This makes sense as high-tech industries rely more on face-to-face interaction to innovate and develop new products (Deltas, De Silva, and McComb 2015; Kolympiris and Kalaitzandonakes 2013), and manufacturing industries generally benefit more from input–output relationships than from interpersonal network-building; the service sector falls somewhere in between. The spatial scale, and likely the clustering mechanisms, differ by industry, so systematic comparison across sectors will be a constructive approach for future studies.

Aside from cluster mechanisms and sector of study, we tested how countries and vintages of study, and also different outcome and explanatory variables, affect the estimated spatial scales. (The detailed results are available upon request.) We found that clusters in France are associated with a significantly larger maximum reach than those in other countries, and the peak scale in France is also larger, though not significantly so. These findings show that clusters in different locations may exhibit distinctive spatial scales. We also found that earlier vintages generally are associated with larger scales, while studies of more recent decades tend to reveal smaller spatial scales; this differentiation is statistically significant for the estimated peak scale in the period after 2010. We think this is partly because recent data are more likely to be at the microlevel, and microlevel analysis units help reveal more local cluster scales (Table 4). Another possibility is that clusters have started to operate more at local levels in recent decades; with the rapid development and wide adoption of transportation and internet technology, the importance of distance has faded in some respects but not for the critical face-to-face interactions that must take place at the local level (Glaeser 1998; Packalen and Bhattacharya 2015; Storper and Venables 2004). We did not find statistically significant variation in estimated spatial scales across different outcomes and explanatory variables.

Methodological choices play an important role in shaping the estimated spatial scales; these findings can provide essential guidance to researchers. For instance, the unit of analysis explains 48 percent of the variation in the estimated maximum reach and 9.9 percent in the peak scale (Table 4). In general, the larger unit, county, exhibits a significantly greater maximum reach than the microlevel geographic units (establishment/firm/patent and zip code/census tract/grid or block). The same pattern exists for the peak scale, though it is not statistically significant. These results highlight the modifiable areal unit problem (Wong 2009). Introducing microlevel data helps overcome this problem and refine the actual scale of clusters. A second potential explanation is that researchers selected larger units when examining phenomena anticipated to have a broader reach. However, due to the lack of theoretical understanding and empirical consensus on which mechanisms prevail at which geographical units, even researchers who correctly selected relatively larger or smaller areal units were unlikely to have been able to select the optimal unit a priori. Thus, adopting multiple units to test for robustness remains important, both for scholarly rigor and for supporting practical policy recommendations.
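The share of variation attributed to a single categorical factor, such as the unit of analysis, corresponds to the R² of a regression of the estimated scale on that factor's dummy variables. For one factor with an intercept, the fitted value for each record is simply its group mean, so R² can be computed directly. The sketch below assumes an unweighted regression and uses invented data; the function name and the numbers are hypothetical and do not reproduce the estimates in Table 4.

```python
from collections import defaultdict


def r_squared_one_factor(groups, values):
    """R-squared from regressing `values` on dummies for one categorical factor.

    With a single factor plus an intercept, ordinary least squares predicts
    each observation by the mean of its group, so no matrix algebra is needed.
    """
    overall_mean = sum(values) / len(values)

    by_group = defaultdict(list)
    for group, value in zip(groups, values):
        by_group[group].append(value)
    group_means = {g: sum(vs) / len(vs) for g, vs in by_group.items()}

    ss_total = sum((v - overall_mean) ** 2 for v in values)
    ss_residual = sum((v - group_means[g]) ** 2 for g, v in zip(groups, values))
    return 1.0 - ss_residual / ss_total


# Hypothetical maximum-reach estimates (miles) by unit of analysis.
units = ["firm", "firm", "county", "county"]
reach = [1.0, 3.0, 20.0, 40.0]

share_explained = r_squared_one_factor(units, reach)
```

In this toy example, the unit of analysis accounts for about 80 percent of the variation in the invented estimates. A full analysis would additionally need standard errors clustered by article, which this sketch does not attempt.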

Studies using regressions and spatial models are associated with a smaller maximum reach compared to those adopting other analytical methods as shown in Table 5. Regression and spatial models can control for other variables and consider overlapping spatial configurations. This added information may help refine the detection of the maximum reach of clusters. For instance, a kernel method may detect significant employment at a specific maximum reach. However, suppose employment were to diminish more rapidly than population with increasing distance. In such a case, employment at the detected maximum reach may not indicate a cluster after controlling for population density. Thus, we recommend considering more types of relevant data.

The quality of studies demonstrates a limited role. We did not find the journal impact factor or article citations to significantly affect the estimated peak scale or maximum reach. (The results are available by request.) Of course, it may be that these variables are not very good proxies for study quality.

Table 6 puts all of the explanatory factors together to estimate a complete model. The explanatory factors combine to explain 37.8 percent of the variation in the estimated peak scale and 91.7 percent of the variation in the estimated maximum reach. The maximum reach is much better explained due to its relative homogeneity. Many fewer studies estimated the maximum reach. Additionally, the quality of the information used in these studies often is superior (calculating maximum reach is more demanding of the underlying data), and the estimation procedures are more precise (requiring multiple distance bands or other methods of comparison across varying distances).

Several groups of explanatory variables suffer from multicollinearity problems. For example, more recent data are also more likely to pertain to finer geographic scales. As a result, some of the coefficients become statistically insignificant, which is the primary reason why we also analyzed groups of variables separately in Tables 2–5. We tested the normality of the error terms; the estimated residuals do appear to be nearly normally distributed. The clustered standard errors, as mentioned above, account for heterogeneity across studies.

The results in Table 6 are, in general, consistent with those in Tables 2–5, with a couple of exceptions. First, the impact of the country of study becomes more prominent. France and Germany are associated with clusters with greater maximum reach, while Italy, in contrast, has clusters with smaller maximum reach. Moreover, clusters in Japan have both smaller peak scale and maximum reach than those in many other countries. These differences have not been well noticed and understood in prior studies. One potential reason lies in the different city structures across countries. While in France and Germany, labor and knowledge move relatively freely and thus can produce clusters that extend afar, in Italy, local labor markets are more restricted in their spatial scopes, constraining the impacts of clusters within these local markets (Di Addario and Patacchini 2008). In comparison to Western countries, urban areas in Japan are denser and more transit-dependent. The dense urban centers permit cluster effects to accrue more quickly within shorter distances, and the well-developed public transit systems make mobility easy within the central city areas. But beyond city boundaries, the absence of a well-developed transit system hinders the spillover of cluster benefits. In contrast, car-dependent Western cities can more easily expand cluster effects to much greater distances via automobiles. In addition, each of these countries has a long, unique history of urban development, which may represent path dependency in both urban form and cluster patterns today.

These comparisons suggest the importance of locality heterogeneity. Different geographical ranges should be considered for various countries to initiate cluster policies. For example, firms in Japan need to be (and perhaps ought to be encouraged to be) more geographically concentrated than those in Western countries in order to reap the maximum benefits of clusters.

The second exception is that independent variables become more statistically significant in explaining the differences in maximum reach due to the control of other related variables. Agglomeration index and distance as explanatory variables are associated with larger maximum reach, while R&D is associated with the smallest maximum reach. This indicates that the impacts of agglomerations (high concentrations of firms and workers in related industries) and of the distance from the urban center itself can encompass substantial distances, but the impact of R&D is more localized. This finding is reconcilable because the cluster effect, in essence, is the externalities between firms. Both distance from the urban center and agglomeration index effectively capture these externalities that are not easily confined in space, while the effect of R&D can be spatially restricted within a single firm or a few neighboring firms. This is also consistent with the previous discussion of two types of innovation. R&D investments are more associated with an early-stage radical innovation that is not easily transferrable over long distances. Thus, R&D’s external effects are more likely to be exhibited through face-to-face communications, which are more constrained by physical space.

There is a general trend for later vintages to be associated with smaller spatial scales, which is consistent with the previous findings. For the unit of analysis, smaller units are associated with a shorter peak scale and maximum reach, which fits with the results in Table 4. Different outcome variables are not statistically significantly associated with different spatial scales. Findings for various cluster mechanisms match the results in Table 2. For the peak scale, knowledge spillover is associated with the largest scale, while network is associated with the smallest scale; competition, spatial relationship, localization, and urbanization fall in between. Competition in Table 6 shows a greater peak scale than it does in Table 2, though, and knowledge spillover and spatial relationships are associated with a broader maximum reach. In comparison, competition, localization, and urbanization are associated with smaller maximum reach.

Turning to industrial sectors, the manufacturing sector is associated with a larger peak scale and maximum reach than the high-tech industry as in Table 3. One slight difference in the full specification is that the service sector is associated with about the same peak scale and maximum reach as the manufacturing sector. Finally, studies published in journals with higher impact factors are associated with a smaller peak scale. These studies likely adopt more precise estimation methods and make use of more detailed and higher quality data, thus yielding more refined estimated spatial scales.

Conclusion

We conducted a meta-analysis to estimate the spatial scale of industrial clusters, combining and integrating the findings from prior empirical studies. Quite a large number of empirical studies have estimated the spatial scales of industrial clusters, producing mixed results. Moreover, these analyses are difficult to compare, encompassing a variety of methods, outcomes, industrial sectors, and contexts. A meta-analysis is useful in reconciling inconsistent empirical findings and revealing patterns hidden in the existing pool of evidence. We estimated the average and median spatial scales associated with different cluster outcomes (e.g., productivity, innovation, entrepreneurship, and wage) and probed the factors associated with the variation in spatial scales. In doing so, we revealed the following facts that contribute to our knowledge of clusters.

Innovation as an outcome of clusters and knowledge spillovers as the underlying mechanism can happen at either highly localized or wider geographical scales, depending on the type of innovative activities in question. Early-stage and radical innovation happen at local scales, as such creative activities require face-to-face interaction and the exchange of tacit knowledge (Gertler 2003). This is consistent with the finding that entrepreneurship as an outcome of clusters happens at more local scales, since entrepreneurial activities essentially are early-stage and radical innovative activities with new businesses as the products. Another corroborating piece of evidence is that networking is also found to occur at local levels, consistent with prior studies (Arzaghi and Henderson 2008; Storper and Venables 2004). Entrepreneurial activity and early-stage innovation usually require not only the exchange of knowledge but also relationship-building, networking, and collaboration. In contrast, other innovative activities are more routine and require mostly codified knowledge, such as refining and building on an existing patent. Citing a patent does not require physical proximity to the inventor, though a shorter distance may boost the chance of becoming aware of a particular patent. Thus, for those types of innovation, the effect of clusters can extend far afield. Few studies to date have differentiated among types of innovation, which this analysis suggests is much needed. Also, different measurements for innovation and knowledge spillover capture various types of innovative activities; for example, the form of new businesses, collaborative projects, and innovation awards capture more of the early-stage and radical innovation, whereas patent citations capture more of the transmission of already codified knowledge. 
Researchers should be cognizant of such subtle distinctions in what they are measuring and use different measures to distinguish among kinds or stages of innovation.

This study has made it clear that cluster benefits accrue at different geographical scales for distinct sectors and sources of cluster or urban agglomeration benefits. High-tech industries that require more radical innovative activities colocate at smaller geographical scales. In contrast, manufacturing industries involve more routine innovative activities and can benefit from expansive clusters in terms of labor pooling and supply chain relationships. Thus, studies that combine all sectors together are less effective in revealing the precise spatial territory of cluster effects. The mechanisms by which members of industrial clusters gain benefits operate differently at various spatial scales as well. Rather than adopting a horse-race approach to rank the relative importance of localization, urbanization, and competition mechanisms, it would be more valuable to investigate how these processes may interact and even complement each other at differing spatial scales. We look forward to future studies probing into these detailed differences across industries and cluster effects.

Our analysis highlights several methodological issues, such as the choice of unit of analysis, model approaches, and whether or not to conduct statistical tests, that can significantly affect the estimated spatial scales. In general, testing data at various geographical levels can help mitigate concerns regarding the modifiable areal unit problem (Wong 2009). Alternatively, researchers can use microlevel data where they are available, avoiding aggregating the measured clusters into broader spatial units. Microlevel data also support more detailed and finer-grained analyses and methods. An appropriate model choice, or sensitivity tests across various models, is critical to producing robust and trustworthy results. The choice to conduct statistical tests also has a significant relationship with the estimated geographical scale. A large difference in the magnitude of cluster benefits or spatial patterns may not be statistically significant; in contrast, a statistically significant difference may be small in magnitude. We suggest performing statistical tests where they are feasible, while also paying attention to the difference in magnitude to judge whether findings are substantively meaningful.

We find significant differences across countries. Clusters show distinctive patterns in various European countries and are spatially more constrained in Japan. These discoveries beg for additional investigation to explain their origins. We suspect that the connections (or lack thereof) among inner cities and suburbs, transit accessibility, and path dependency in urban forms may be important.

Practically, the most important lesson to be learned from this study is that cluster strategies ought to be tailored to specific regional and industrial needs. Different geographical scopes should be targeted for the design and implementation of policies in different regions. In other words, examining the specific geographical pattern of cluster effects in different regions is a prerequisite for designing effective policies.

Policies aiming at different outcomes, such as promoting entrepreneurship and networking, encouraging specific types of innovation, or enhancing productivity, operate best at different geographical scales. To better encourage entrepreneurship and early-stage innovation, cities must not only encourage (related) firm concentration at a walkable scale but also provide a built environment that sustains frequent face-to-face interactions, such as walkable streets (Ewing and Handy 2009; Hamidi and Zandiatashbar 2019) or a variety of social spaces (e.g., coffee shops, bars, and restaurants; Mehta and Bosson 2010).

Finally, policies ought to be tailored to the specific needs of industries. High-tech companies benefit from much the same spatial scale and local infrastructure as entrepreneurs, early-stage innovators, and networkers. In comparison, manufacturing companies require high-quality labor and well-integrated downstream and upstream supply chains, and their clusters are less constrained spatially. If future studies can delve more deeply into the detailed variations across industries, we anticipate that cluster strategies can be tailored and targeted more closely than they are at present and that companies will be better able to realize the potential benefits of clusters.

The estimates derived in this research can provide initial directions toward targeting specific geographical scales in cluster policies. For example, suppose the state of Illinois is interested in encouraging firm concentrations in advanced manufacturing in order to spur entrepreneurship. According to the estimates in Table 6, the peak scale for this cluster occurs at a distance of approximately ten miles, and the maximum reach is around 392 miles. Thus, policies to promote advanced manufacturing entrepreneurship in Illinois can aim at relatively sizable areas to achieve maximum cluster benefits, and the impacts of these industrial districts can extend quite far. Because of the wide range of estimates in the literature, we emphasize that the scale estimates calculated from this meta-analysis should be considered preliminary. They offer immediate but tentative guidance and provide an impetus for focused study to derive more precise estimates of the scale of industrial clusters in specific localities and contexts.

Footnotes

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Li Fang

Joshua Drucker

References

Aharonson, B. S., J. A. C. Baum, and M. P. Feldman. 2007. “Desperately Seeking Spillovers? Increasing Returns, Industrial Organization and the Location of New Entrants in Geographic and Technological Space.” Industrial and Corporate Change 16 (1): 89–130. doi: 10.1093/icc/dtl034.

Albert, J. M., M. R. Casanova, and Orts. 2012. “Spatial Location Patterns of Spanish Manufacturing Firms.” Papers in Regional Science 91 (1): 107–36. doi: 10.1111/j.1435-5957.2011.00375.x.

Andersson, Klaesson, and J. P. Larsson. 2016. “How Local Are Spatial Density Externalities? Neighbourhood Effects in Agglomeration Economies.” Regional Studies 50 (6): 1082–95. doi: 10.1080/00343404.2014.968119.

Arauzo-Carod, J. M., and Manjón-Antolín. 2012. “(Optimal) Spatial Aggregation in the Determinants of Industrial Location.” Small Business Economics 39 (3): 645–58. doi: 10.1007/s11187-011-9335-6.

Arimoto, Nakajima, and Okazaki. 2014. “Sources of Productivity Improvement in Industrial Clusters: The Case of the Prewar Japanese Silk-reeling Industry.” Regional Science and Urban Economics 46 (1): 27–41. doi: 10.1016/j.regsciurbeco.2014.02.004.

Arzaghi and J. V. Henderson. 2008. “Networking Off Madison Avenue.” The Review of Economic Studies 75 (4): 1011–38. Accessed November 10, 2019. http://users.nber.org/~arzaghim/MadisonAve07-12-19.pdf.

Austrian. 2000. “Cluster Case Studies: The Marriage of Quantitative and Qualitative Information for Action.” Economic Development Quarterly 14 (1): 97–110. doi: 10.1177/089124240001400110.

Baptista and Swann. 1998. “Do Firms in Clusters Innovate More?” Research Policy 27 (5): 525–40. doi: 10.1016/S0048-7333(98)00065-1.

Barlet, Briant, and Crusson. 2013. “Location Patterns of Service Industries in France: A Distance-based Approach.” Regional Science and Urban Economics 43 (2): 338–51. doi: 10.1016/j.regsciurbeco.2012.08.004.

Beaudry and Breschi. 2003. “Are Firms in Clusters Really More Innovative?” Economics of Innovation and New Technology 12 (4): 325–42. doi: 10.1080/10438590290020197.

Beaudry and Schiffauerova. 2009. “Who’s Right, Marshall or Jacobs? The Localization versus Urbanization Debate.” Research Policy 38 (2): 318–37. doi: 10.1016/j.respol.2008.11.010.

Behrens and Bougna. 2015. “An Anatomy of the Geographical Concentration of Canadian Manufacturing Industries.” Regional Science and Urban Economics 51 (September): 47–69. doi: 10.1016/j.regsciurbeco.2015.01.002.

Brown, W. M., Dar-Brodeur, and Tweedle. 2020. “Firm Networks, Borders and Regional Economic Integration.” Journal of Regional Science 60 (2): 374–95.

Bryan and Morten. 2015. “Economic Development and the Spatial Allocation of Labor: Evidence from Indonesia.” Working Paper. Accessed November 10, 2019. http://economics.yale.edu/sites/default/files/paper_skeleton.pdf.

Burger, M. J., Van Oort, and Van der Knaap. 2007. “A Treatise on the Geographical Scale of Agglomeration Externalities and the MAUP.” Scienze Regionali 9:19–40.

Cainelli and Ganau. 2018. “Distance-based Agglomeration Externalities and Neighbouring Firms’ Characteristics.” Regional Studies 52 (7): 922–33. doi: 10.1080/00343404.2017.1360482.

Carlino and W. R. Kerr. 2015. “Agglomeration and Innovation.” In Handbook of Regional and Urban Economics, edited by Gilles Duranton, J. Vernon Henderson, and William Strange, Vol. 5, 349–404. Amsterdam, the Netherlands: Elsevier. doi: 10.1016/B978-0-444-59517-1.00006-4.

Chauvin. 2019. “When Distance Shrinks: The Effects of Competitor Proximity on Firm Survival.” Accessed April 28, 2021. http://marriottschool.byu.edu/upload/event/event_755/_doc/Jasmina%20Chauvin%20distance_shrinks2.pdf.

de Groot, H. L. F., Poot, and M. J. Smit. 2016. “Which Agglomeration Externalities Matter Most and Why?” Journal of Economic Surveys 30 (4): 756–82. doi: 10.1111/joes.12112.

Dekle and Eaton. 1999. “Agglomeration and Land Rents: Evidence from the Prefectures.” Journal of Urban Economics 46 (2): 200–14. doi: 10.1006/juec.1998.2118.

Delgado, M. E. Porter, and Stern. 2010. “Clusters and Entrepreneurship.” Journal of Economic Geography 10 (4): 495–518.

Deltas, D. G. De Silva, and R. P. McComb. 2015. “Industrial Agglomeration and Spatial Persistence of Employment in Software Publishing.” Working Papers No. 85393182, Lancaster University Management School, Economics Department, Lancaster, UK. doi: 10.3109/15368378209040332.

DerSimonian and Laird. 1986. “Meta-analysis in Clinical Trials.” Controlled Clinical Trials 7 (3): 177–88. doi: 10.1016/0197-2456(86)90046-2.

Di Addario and Patacchini. 2008. “Wages and the City: Evidence from Italy.” Labour Economics 15 (5): 1040–61. doi: 10.1016/j.labeco.2007.09.003.

Drucker. 2016. “Reconsidering the Regional Economic Development Impacts of Higher Education Institutions in the United States.” Regional Studies 50:1185–202. doi: 10.1080/00343404.2014.986083.

Drucker, J. M. 2012. “The Spatial Extent of Agglomeration Economies: Evidence from Three US Manufacturing Industries.” US Census Bureau Center for Economic Studies Paper. Accessed April 28, 2021. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=1995507.

Drucker and Feser. 2012. “Regional Industrial Structure and Agglomeration Economies: An Analysis of Productivity in Three Manufacturing Industries.” Regional Science and Urban Economics 42 (1–2): 1–14. doi: 10.1016/j.regsciurbeco.2011.04.006.

Duranton, B. G., and Puga. 2001. “Nursery Cities: Urban Diversity, Process Innovation, and the Life Cycle of Products.” American Economic Review 91 (5): 1454–77.

Duranton, B. G., and Puga. 2004. “Micro-foundations of Urban Agglomeration Economies.” In Handbook of Regional and Urban Economics, edited by J. V. Henderson and J.-F. Thisse, 2063–771. Amsterdam, the Netherlands: North Holland.

Ellison, E. L. Glaeser, and W. R. Kerr. 2010. “What Causes Industry Agglomeration? Evidence from Coagglomeration Patterns.” American Economic Review 100 (3): 1195–213.

Ewing and Handy. 2009. “Measuring the Unmeasurable: Urban Design Qualities Related to Walkability.” Journal of Urban Design 14 (1): 65–84. doi: 10.1080/13574800802451155.

Fallah, Partridge, and D. S. Rickman. 2014. “Geography and High-tech Employment Growth in U.S. Counties.” Journal of Economic Geography 14 (4): 683–720.

Fang. 2015. “Do Clusters Encourage Innovation? A Meta-analysis.” Journal of Planning Literature 30 (3): 239–60. doi: 10.1177/0885412215589848.

Fang. 2019. “Manufacturing Clusters and Firm Innovation.” Economic Development Quarterly 33 (1): 6–18. doi: 10.1177/0891242418800770.

Fang. 2020. “Agglomeration and Innovation: Selection or True Effect?” Environment and Planning A 52 (2): 423–48. doi: 10.1177/0308518X19868467.

Feldman, M. P. 1994. “Knowledge Complementarity and Innovation.” Small Business Economics 6 (5): 363–72. doi: 10.1007/BF01065139.

Feldman, M. P., and D. B. Audretsch. 1999. “Innovation in Cities: Science-based Diversity, Specialization and Localized Competition.” European Economic Review 43:409–38. doi: 10.1016/S0014-2921(98)00047-6.

Feser, E. J. 2002. “Tracing the Sources of Local External Economies.” Urban Studies 39 (13): 2485–2506. doi: 10.1080/0042098022000027077.

2007. “Smart Café Cities: Testing Human Capital, Externalities.” Journal of Urban Economics 61 (1): 86–111. Accessed November 10, 2019. http://citeseerx.ist.psu.edu/viewdoc/summary;jsessionid=1956B58656ECEF1BF0E7A1916E7337FC?doi=10.1.1.214.9049.

Ross, S. L. 2013. “Wage Premia in Employment Clusters: Agglomeration or Worker Heterogeneity?” Journal of Labor Economics 31 (2): 271–304.

Gertler, M. S. 2003. “Tacit Knowledge and the Economic Geography of Context, or the Undefinable Tacitness of Being (There).” Journal of Economic Geography 3:75–99.

Gittelman. 2007. “Does Geography Matter for Science-based Firms? Epistemic Communities and the Geography of Research and Patenting in Biotechnology.” Organization Science 18 (4): 724–41.

Glaeser, E. L. 1998. “Are Cities Dying?” Journal of Economic Perspectives 12 (2): 139–160. doi: 10.1257/jep.12.2.139.

Goldstein and Drucker. 2006. “The Economic Development Impacts of Universities on Regions: Do Size and Distance Matter?” Economic Development Quarterly 20 (1): 22–43. doi: 10.1177/0891242405283387.

Halpern and Muraközy. 2007. “Does Distance Matter in Spillover?” Economics of Transition 15 (4): 781–805. doi: 10.1111/j.1468-0351.2007.00308.x.

Hamidi and Zandiatashbar. 2019. “Does Urban Form Matter for Innovation Productivity? A National Multi-level Study of the Association between Neighbourhood Innovation Capacity and Urban Sprawl.” Urban Studies 56 (8): 1576–94.

Han and Ke. 2016. “The Effects of Factor Proximity and Market Potential on Urban Manufacturing Output.” China Economic Review 39:31–45. doi: 10.1016/j.chieco.2016.04.002.

Hanink, D. M. 2006. “A Spatial Analysis of Sectoral Variations in Returns to External Scale.” Journal of Regional Science 46 (5): 953–68.

Hausman. 2012. “University Innovation, Local Economic Growth, and Entrepreneurship.” SSRN Electronic Journal. doi: 10.2139/ssrn.2097842.

Hoover, E. M. 1937. Location Theory and the Shoe and Leather Industries. Cambridge, MA: Harvard University Press.

Jacobs. 1961. The Death and Life of Great American Cities. New York: Vintage.

Jacobs. 1969. The Economy of Cities. New York: Vintage.

Jaffe, A. B., Trajtenberg, and Henderson. 1993. “Geographic Localization of Knowledge Spillovers as Evidenced by Patent Citations.” The Quarterly Journal of Economics 108 (3): 577–98.

Kolympiris and Kalaitzandonakes. 2013. “Geographic Scope of Proximity Effects among Small Life Sciences Firms.” Small Business Economics 40 (4): 1059–86. doi: 10.1007/s11187-012-9441-0.

Kolympiris, Kalaitzandonakes, and Miller. 2011. “Spatial Collocation and Venture Capital in the US Biotechnology Industry.” Research Policy 40 (9): 1188–99. doi: 10.1016/j.respol.2011.05.022.

Kolympiris, Kalaitzandonakes, and Miller. 2015. “Location Choice of Academic Entrepreneurs: Evidence from the US Biotechnology Industry.” Journal of Business Venturing 30 (2): 227–54. doi: 10.1016/j.jbusvent.2014.02.002.

Koster. 2013. “Rocketing Rents: The Magnitude and Attenuation of Agglomeration Economies in the Commercial Property Market.” Working Paper. Accessed November 10, 2019. http://eprints.lse.ac.uk/58531/.

Lin, H. L., H. Y., and C. H. Yang. 2011. “Agglomeration and Productivity: Firm-level Evidence from China’s Textile Industry.” China Economic Review 22 (3): 313–29. doi: 10.1016/j.chieco.2011.03.003.

Fang and Pang. 2014. “The Effect of Geographical Proximity on Scientific Cooperation among Chinese Cities from 1990 to 2010.” PLoS One 9 (11): e111705. doi: 10.1371/journal.pone.0111705.

Marshall. 1890. Principles of Economics. London, UK: Macmillan.

McHale, Agrawal, and Kapur. 2008. “How Do Spatial and Social Proximity Influence Knowledge Flows? Evidence from Patent Data.” Journal of Urban Economics 64 (2): 258–69. doi: 10.1016/j.jue.2008.01.003.

Mehta and J. K. Bosson. 2010. “Third Places and the Social Life of Streets.” Environment and Behavior 42 (6): 779–805. doi: 10.1177/0013916509344677.

Melo, P. C., D. J. Graham, and R. B. Noland. 2009. “A Meta-analysis of Estimates of Urban Agglomeration Economies.” Regional Science and Urban Economics 39 (3): 332–42. doi: 10.1016/j.regsciurbeco.2008.12.002.

Moreno, Paci, and Usai. 2006. “Innovation Clusters in the European Regions.” European Planning Studies 14 (9): 1235–63. doi: 10.1080/09654310600933330.

Neffke, F. M. H., Henning, R. A. Boschma, K. J. Lundquist, and L. O. Olander. 2011. “The Dynamics of Agglomeration Externalities along the Life Cycle of Industries.” Regional Studies 45 (1): 49–65.

Nelson, J. P., and P. E. Kennedy. 2009. “The Use (and Abuse) of Meta-analysis in Environmental and Natural Resource Economics: An Assessment.” Environmental and Resource Economics 42 (3): 345–77. doi: 10.1007/s10640-008-9253-5.

Ohlin. 1933. Interregional and International Trade. Cambridge, MA: Harvard University Press.

Orlitzky, F. L. Schmidt, and S. L. Rynes. 2003. “Corporate Social and Financial Performance: A Meta-analysis.” Organization Studies 24 (3): 403–41. doi: 10.1177/0170840603024003910.

Overman, H. G. 2003. “Can We Learn Anything from Economic Geography Proper?” Journal of Economic Geography 4 (5): 501–16.

Packalen and Bhattacharya. 2015. “Cities and Ideas.” National Bureau of Economic Research Working Paper. Accessed November 10, 2019. https://www.nber.org/papers/w20921.

Partridge, M. D., and D. S. Rickman. 2008. “Distance from Urban Agglomeration Economies and Rural Poverty.” Journal of Regional Science 48 (2): 285–310. doi: 10.1111/j.1467-9787.2008.00552.x.

Porter, M. E. 1998. “Clusters and the New Economics of Competition.” Harvard Business Review 76 (6): 77–90.

Renski. 2011. “External Economies of Localization, Urbanization and Industrial Diversity and New Firm Survival.” Papers in Regional Science 90 (3): 473–502. doi: 10.1111/j.1435-5957.2010.00325.x.

Rigby, D. L., and Essletzbichler. 2002. “Agglomeration Economies and Productivity Differences in US Cities.” Journal of Economic Geography 2 (4): 407–32. doi: 10.1093/jeg/2.4.407.

Rosenthal, S. S., and W. C. Strange. 2001. “The Determinants of Agglomeration.” Journal of Urban Economics 50 (2): 191–229. doi: 10.1006/juec.2001.2230.

Rosenthal, S. S., and W. C. Strange. 2003. “Geography, Industrial Organization, and Agglomeration.” Review of Economics and Statistics 85 (2): 377–93. doi: 10.1162/003465303765299882.

Rosenthal, S. S., and W. C. Strange. 2004. “Evidence on the Nature and Sources of Agglomeration Economies.” In Handbook of Regional and Urban Economics, edited by V. J. Henderson and J.-F. Thisse, Vol. 4, 2119–71. Amsterdam, the Netherlands: Elsevier.

Rosenthal, S. S., and W. C. Strange. 2005. “The Geography of Entrepreneurship in the New York Metropolitan Area.” Reserve Bank of New York Economic Policy Review 11 (2): 29–54.

Rosenthal, S. S., and W. C. Strange. 2008. “The Attenuation of Human Capital Spillovers.” Journal of Urban Economics 64 (2): 373–89. doi: 10.1016/j.jue.2008.02.006.

Stanley, T. D., Doucouliagos, Giles, J. H. Heckemeyer, R. J. Johnston, Laroche, J. P. Nelson, Paldam, Poot, Pugh, and Rosenberger. 2013. “Meta-analysis of Economics Research Reporting Guidelines.” Journal of Economic Surveys 27 (2): 390–94.

Stevens, M. R. 2017. “Does Compact Development Make People Drive Less?” Journal of the American Planning Association 83 (1): 7–18. doi: 10.1080/01944363.2016.1240044.

Storper and A. J. Venables. 2004. “Buzz: Face-to-face Contact and the Urban Economy.” Journal of Economic Geography 4 (4): 351–70.

Strøjer Madsen, Smith, and Dilling-Hansen. 2003. “Industrial Clusters, Firm Location and Productivity—Some Empirical Evidence for Danish Firms.” Working Papers 03-26, Department of Economics, Aarhus School of Business, University of Aarhus, Aarhus, Denmark.

Sutton, A. J., K. R. Abrams, D. R. Jones, T. A. Sheldon, and Song. 2000. Methods for Meta-analysis in Medical Research. Chichester, UK: Wiley.

Torre. 2008. “On the Role Played by Temporary Geographical Proximity in Knowledge Transmission.” Regional Studies 42 (6): 869–89.

US Cluster Mapping Project of the Harvard Business School. 2018. Accessed November 10, 2019. https://clustermapping.us/about/about-project.

Wallsten, S. J. 2001. “An Empirical Test of Geographic Knowledge Spillovers Using Geographic Information Systems and Firm-level Data.” Regional Science and Urban Economics 31 (5): 571–99. doi: 10.1016/S0166-0462(00)00074-0.

Wong. 2009. “The Modifiable Areal Unit Problem (MAUP).” In The SAGE Handbook of Spatial Analysis, edited by A. Stewart Fotheringham and Peter A. Rogerson, 105–23. London: SAGE.

Woodward, Figueiredo, and Guimarães. 2006. “Beyond the Silicon Valley: University R&D and High-technology Location.” Journal of Urban Economics 60 (1): 15–32. doi: 10.1016/j.jue.2006.01.002.