Abstract
This article studies how the changing geographic distribution of skilled workers in the US affects theoretical models that use Gibrat’s law to explain the size distribution of cities. In the empirical literature, a divergence hypothesis holds that college share increases faster in cities where college share is larger, and a growth hypothesis maintains that the rate of city population growth is also directly related to initial college share. Examining the divergence hypothesis, the classic test for Gibrat’s law is shown to be a test for
Introduction
Applying the standard that human capital is either based on, or proxied by, years of schooling, much research has been devoted to changes in the geographic distribution of workers with different levels of educational attainment. In the United States, as attainment has increased nationally, the distribution of human capital across cities has become more unequal. Skill intensity ratio (SIR), defined as the ratio of adults with at least a bachelor’s degree to those with less education, is a common measure of an area’s human capital level. Table 1 shows that the mean and variance of SIR have risen steadily for metro areas in the US, with both increasing threefold from 1970 to 2010.
Testing for
Notes: Cross-section observations = 316 metro areas. Skill Intensity Ratio (SIR) is the ratio of adults (aged 25 years or older) with at least a bachelor’s degree to those with less education. Levene’s test is used to determine if decennial samples have statistically constant variance.
The trend of human capital divergence across US cities has been noted in a number of papers, including Moretti (2004, 2012, 2013), Berry and Glaeser (2005), Wheeler (2006), Hammond and Thompson (2010), Glaeser (2012) and Diamond (2016). In addition to increasing dispersion, these authors show that the level of human capital has been rising more, in absolute terms, in areas where it was initially more abundant. A second trend documented in the literature is that the population growth rates of US cities over the same period have also been an increasing function of their initial human capital levels (Glaeser and Saiz, 2003; Glaeser and Shapiro, 2003; Glaeser et al., 1995; Shapiro, 2006; Simon, 1998, 2004; Simon and Nardinelli, 2002).
The interaction of these two empirical trends from the literature implies that US cities have been diverging in skill and size, with low-skill workers concentrating in slow-growing cities where economic opportunities may be limited. In addition to concerns over inequality, there is another reason why the trends merit special attention. This article makes a new theoretical observation that the potential for diverging human capital levels to generate divergence in urban growth rates has important long-run implications for the distribution of city sizes.
An established literature on the size distribution of cities suggests a pattern that is remarkable for its simplicity and stability over time and across countries. Zipf’s law is an empirical regularity in which log of city population rank regressed on log of city population yields an estimated slope coefficient that is approximately equal to minus one. Indeed, the city size distribution is so regular that conformity with Zipf’s law may well represent a necessary condition of acceptance for any model that seeks to explain urban growth and urbanisation.
Beginning with Gabaix (1999), research has identified one possible explanation for the emergence of Zipf’s law in a system of cities. If the growth rate of cities is random with constant variance, then city sizes should follow a lognormal distribution. The type of local population growth that generates the regular city size distribution is referred to as a Gibrat process. Consequently, Gibrat’s law for cities has been advanced under the hypothesis that the growth rate of a city is independent of its size. Gibrat’s law is not necessary for Zipf’s law to hold, but it provides a dynamic rationale for the static distribution of city sizes associated with the law.
The size distribution of cities must be maintained by a pattern of productivity growth such that relative wages adjust to compensate for differences in cost of living associated with city size. As noted by Combes et al. (2008), these differences cannot be generated by skill differences associated with city size, but must represent a true compensating differential. The measurement of wage gains associated with movement is complicated by the finding in De la Roca and Puga (2017) that workers may experience unequal and permanent productivity gains from working in big cities. Thus, it seems that measurement of dynamic changes in the urban wage premium is a difficult matter and not a promising venue for testing Gibrat’s law.
Instead, empirical tests of Gibrat’s law have been based on either the size distribution of cities conforming to the lognormal or city growth being independent of city population. Black and Henderson (2003), González-Val et al. (2014) and Michaels et al. (2012) find instances in which one or the other condition fails. Similarly, research by Glaeser et al. (2014) and Desmet and Rappaport (2017) shows that over sufficiently long periods the relation between city size and growth rate varies from slightly above to slightly below zero. However, a review of these tests by González-Val et al. (2014) concludes that the predominant result has tended to support Gibrat’s law. As well, a measure of theoretical support has been provided in studies by Eeckhout (2004), Duranton (2006, 2007), Rossi-Hansberg and Wright (2007), Córdoba (2008) and Lee and Li (2013).
Recent testing of Zipf’s law has revived debate over appropriate geographic units subject to analysis. González-Val (2015) uses incorporated places in the US and finds that population growth rates are independent of size over the 1990–2000 period. Arribas-Bel et al. (2015) analyse 844 employment centres located within 357 metro statistical areas (MSAs) and conclude that Zipf’s law holds. Dobis et al. (2020) consider the effect of proximity to larger cities on growth rates and find that it is consequential for smaller US cities. Moreno-Monroy et al. (2020) estimate the rank-size rule exponent and find values of −0.93 for the US, −1.07 for China, −1.02 in Brazil and −1.11 in India. For these countries, sample sizes are large and results are roughly consistent with Zipf’s law. 1
This article examines the effects of changes in the distribution of skilled workers in the US on the relation between population level and population growth rate, that is, on a fundamental rationale for stochastic growth models that can generate the empirical distribution of city sizes. The classic test for Gibrat’s law is first shown to be a test for
Our theoretical discussion demonstrates that, even under the assumption of no relative divergence, absolute divergence in human capital levels along with the association between human capital and city growth documented in literature are sufficient to violate the conditions for Gibrat’s law to hold. This theoretical challenge to stochastic growth models has not been noted in the literature.
The next contribution of the research is empirical. The finding of a positive relation between initial human capital and the rate of subsequent population growth is reexamined in light of consequences for Gibrat’s law for cities. The empirical literature has assumed a linear relation between lagged SIR and city growth. In contrast, testing performed below indicates that the relation may be concave. If the effect of human capital on urban growth weakens as educational attainment increases over time, then the effect is less likely to produce a long-run relation between population level and growth rate that would violate Gibrat’s law. Hence, our conclusion is that stochastic growth models can survive the challenge posed by divergence in the location of skilled workers.
Measurement
There are several measures of variation that could be used to analyse changes over time in the distribution of city characteristics. In the literature on human capital divergence, insufficient attention has been paid to the conceptual problems that measuring changes in inequality entails. This section draws on two bodies of literature with highly-developed efforts for grappling with these measurement issues – works on regional economic growth and works on individual economic inequality.
-divergence versus
-divergence
Beginning with Barro and Sala-i-Martin (1990), the literature on economic growth has distinguished two types of convergence:
In contrast, papers in the literature on human capital divergence have not formally distinguished between
In the literature on regional growth, the connection between Gibrat’s law and the concept of
Previous testing for convergence in US regional income has generally found a significant pattern of
Absolute versus relative divergence
The theoretical literature on measuring individual income inequality has emphasised that when distributions of a variable are compared over time in a growing economy, it is important to distinguish between relative and absolute measures of change. Foster (1998) provides a seminal discussion of absolute versus relative index numbers. For example, absolute measures of income dispersion, such as variance and standard deviation, naturally increase over time when average income is growing. Therefore, it is common to measure
In addition to failing to distinguish between
Alternative measures in the data
Following convention in the literature on human capital divergence, metropolitan areas rather than incorporated cities are the unit of analysis. Demographic data on a panel of 316 MSAs in the contiguous United States are obtained for the decadal years 1970–2010 by aggregating county-level tabulations to metro definitions promulgated by the US Office of Management and Budget for the 1990 census. The general concept of a metropolitan statistical area is that it represents a local labour market. Using the official delineations, the sample excludes small urbanised areas, that is, the smallest MSA in the panel in 1970 has 27,559 inhabitants.
The underlying data come from the 1970, 1980, 1990 and 2000 US censuses and the 2008–2012 multi-year American Community Survey (ACS), referred to as ‘2010’ in reference to the year on which the five-year period is centred. For the census data, MSA-level values are accessed from the US Department of Housing and Urban Development’s State of the Cities Data Systems. The authors perform their own aggregations from the ‘summary tape file datasets’ for the ACS.
Testing for
Tests for
where
The first test is performed for a relation between initial SIR and absolute change in SIR. This specification is not scale invariant and therefore not a true test for
Testing for
Notes: OLS estimates of equation (1).
The next step requires testing for possible relations between skill intensity level and either its rate of growth or the difference in growth rates between skilled and unskilled workers. Estimates of equation (1) with
The literature on income inequality has shown that it is reasonable to find absolute divergence when there is relative stability, if income per capita is increasing. Similarly, the results in Tables 1 and 2 indicate that, as expected in an economy with rising educational attainment, measures of the absolute difference in differences are positive and statistically significant, even though there is no relative divergence as indicated by either
Theoretical implications of human capital level divergence for Gibrat’s law
The simplest model of the evolution of cities that comports with stylised fact is stochastic growth based on Gibrat’s law. Gabaix (1999) first demonstrated that if city growth is characterised by Brownian motion with drift, and the mean and variance of the growth process are similar across cities, then a distribution of cities consistent with Zipf’s law with an exponent of unity will be generated. Thus, Gibrat’s law is sufficient, but not necessary, for Zipf’s law to hold.
Suppose the evolution of population (
where
The standard stochastic growth rationale for Gibrat’s law holding in the data requires that
Turning to the specific case of the relation between divergence in SIR and stochastic growth models, population growth is easily expressed as a function of the growth rates of skilled and unskilled workers and the concentration of human capital in a city,
Consider the case in which there is absolute divergence in SIR but no relative divergence. Assume that two cities differ in initial skill intensity so that
Using (4), the difference in population growth rates between the two locations can be written as a function of initial skill level,
Equality of growth rates requires that either there is no change in national skill intensity ratio (
There is an abundance of empirical evidence, discussed at length in Glaeser et al. (2014), that since 1970 lagged SIR has been positively related to population growth rates in cities. As discussed above, if this association between SIR and the rate of city growth were found to be temporary, then it would not represent a violation of Gibrat’s law. Instead it would be considered part of the pattern of idiosyncratic growth documented in the literature. Desmet and Rappaport (2017), for example, examine two centuries of change and report extended periods in which population growth was either directly or inversely related to city size.
Indeed, Glaeser et al. (2014) identify a number of variables which have been associated for temporary periods with faster rates of city growth. For example, variation in the industrial mix of cities has been shown to produce periods of unequal city growth. But again, these effects should be temporary and periods of unusual industrial expansion should generally be followed by periods of sluggish growth, thus reversing the sign of the relation between industrial composition and growth.
In their treatment of models of city size distributions, Gabaix and Ioannides (2004) devote an entire section to discussing possible deviations from Gibrat’s law that do not change the city size distribution. Writing (2) in discrete time yields
where
Consider climate as an example. Climatic variables may be mean reverting, although periods of drought or unusual temperature may show significant persistence in some areas. Alternatively, the effects of climate on city growth may be temporary. The effect of air conditioning on the relation between hot summer temperatures and economic growth is one example of an effect that is likely temporary over a sufficiently long time horizon. If either of these propositions holds, then the finding of a temporary association between climate and population growth rate does not imply a permanent departure from Gibrat’s law. However, Rappaport (2007) argues that there is a permanent component to differences in climate and that the income elasticity of demand for mild winters and summers may be greater than unity, giving areas with superior climate a permanent advantage over rival cities. Such permanent effects could be a challenge for Gibrat’s law.
The specific
The strong evidence for a relation between absolute changes in SIR and its initial level found above along with the documented empirical relation between absolute differences in SIR and subsequent city growth, when combined with the theory in this section, have implications Gibrat’s law. In particular, this suggests the need for a further test regarding the second derivative of population growth with respect to SIR. Gabaix (1999) shows that, if
Testing for concavity in the relation between human capital level and urban growth
The most common approach to testing Gibrat’s law is the parametric city growth equation. In addition to checking for the classic Gibrat relation, the primary focus of the testing here is determining whether and how rates of urban growth are related to human capital levels in the data. Testing begins by estimating the following panel equation:
Equation (7) follows directly from Glaeser et al. (2014), with identification based on using lagged SIR to explain subsequent population growth. The form of the relation between human capital and urban growth is reflected in estimates of
Estimates of equation (7) are reported in Table 3. Skill intensity ratio enters the equation sequentially in first-, second- and third-degree polynomial functions. Odd-numbered models are estimated using ordinary least squares (OLS) and even-numbered models with iterated re-weighted least squares (IRLS) for robust regression. Across the six models in Panel A, estimates of
Urban growth and human capital.
Notes: Estimates of equation (7). Models include year fixed effects and year-by-population interactions, not reported.
To illustrate the economic significance of these results, the effect on population growth of increasing SIR by one standard deviation (0.14) can be compared for two cities at different points in the distribution. The Sheboygan, WI metro area has SIR at the first quartile value of 0.22 for the year 2000, and Wilmington-Newark, DE has the third quartile value of 0.38. As a result of the hypothetical SIR shock, quadratic OLS Model (3) and cubic OLS Model (5) predict that Sheboygan’s rate of population growth would have been 4.7% or 4.3% points higher, respectively, over the 2000–2010 decade. Because of the estimated concavity, the growth rate for Wilmington-Newark would have changed much less, rising by just 1.6 or 0.1 points. Suppose the mayors of these two cities had policy goals of increasing their growth rates. The results in Table 3 suggest that the efficacy of attracting more college-educated workers as a policy lever, were that even achievable, would be subject to a great deal of randomness and would also depend critically on how well-educated on average a city was already.
As a robustness exercise, natural amenity variables and measures reflecting the concentration of local employment in industries which have experienced divergent growth trends at the national level are added to the specification. The industry mix measures are aggregated from the same source data as the demographic variables (i.e. decennial US censuses and the ACS) and the amenity variables are taken directly from the County and City Data Book: 2000 published by the US Census Bureau. These variables have been reported to be significant in growth equations published elsewhere. 4 Thus, their omission could be biasing estimates in Panel A if they are conditionally correlated with included regressors.
As expected based on previous research, coefficients on amenity and industry mix variables reported in Panel B of Table 3 are significant and have expected signs. For example, cold and wet areas with large manufacturing sectors are found to grow more slowly during the study period. The city size effects remain negative and estimates are now statistically significant. The new estimates of
To visualise general forms of the relations estimated thus far, Figure 1 fits by decade quadratic and cubic trend curves to population growth rate and initial skill intensity ratio in standardised (z-score) form. It is clear that the quadratic and cubic models fit quite similar concave relations to these data. Further inspection of the plots raises several issues. First, holding the distribution of SIR constant, it appears that the curves are becoming less steep and curved over time. Second, assuming the polynomial functional forms are reasonable approximations of the true underlying relationship, the plots show that a number of cities are likely located in a region where the derivative of population growth with respect to SIR is negative. In other words, there are metros on the downward-sloping portion of the estimated relation. Third, although the non-linear nature of the relation is apparent, a particular functional form that remains constant throughout the range of SIR values is not clear. The remainder of the testing examines these issues.

Non-linear fits by decade.
In regard to the first two points above, population growth rate is next regressed by decade on initial population and a quadratic in skill intensity ratio, 5
In estimates of equation (8) reported in Panel A of Table 4,
Critical points in the relation between urban growth and human capital.
Notes: The rate of change in total population of city
The critical points in Table 4 indicate the level of SIR above which population growth rate begins to fall with increasing human capital. The critical point values increase over time, which is not surprising given the overall rightward shift in the distribution of SIR. The table also reports mean and maximum SIR values by decade. All of the estimated critical points are greater than mean SIR, but lower than maximum, for their respective time periods. In other words, there are high-skill MSAs with population growth rates that are expected to fall with further increases in average human capital values in all four time periods. The number of such MSAs is referred to in the table as ‘frequency in excess’. Relative to the 1970s, these counts fall during the 1980s and 1990s, but rebound in the first decade of the 2000s. According to Models (7) and (8), for example, the population growth rate from 2000 to 2010 is inversely related to human capital level for 16 MSAs with SIR levels in 2000 greater than 0.60.
An important concern with parametric growth regressions is that the number of cities included in the sample can be consequential for parameter estimates (Eeckhout, 2004; González-Val et al., 2013). In particular, inclusion of small cities tends to produce refutations of Gibrat’s law. Because the sample of MSAs used here is truncated to exclude small cities, based on the findings in Levy (2009), the distribution of sizes is expected to be Pareto consistent with the law. Nonetheless, as an additional robustness exercise, equation (8) is re-estimated using a sample of 929 core-based statistical areas (CBSAs) delineated for the 2000 census. In addition to MSAs, CBSAs include ‘micropolitan’ statistical areas based on counties with urbanised areas of at least 10,000 inhabitants in 1990. Applying the definitions retroactively, the smallest CBSA in 1970 has a population of just 2665.
In Panel B of Table 4, the model fits are generally worse with the non-truncated sample. However, the estimated critical points and number of locations on the downward sloping portion of the urban growth–human capital relation using CBSAs are quite similar to the previous results using the sample of MSAs. While inclusion of small ‘cities’ may result in refutation of the Gibrat relation, the robustness exercise shows that it does not have the same impact on the finding of a concave human capital relation.
A well-known drawback of the use of polynomial terms to estimate a potential non-linear relation is that the functional form chosen may not accurately represent the underlying data-generating process. Researchers in the related inequality literature have used semi-parametric methods to flexibly estimate the Kuznets (bell-shaped) curve relation between GDP per capita and income inequality, e.g. Barrios and Strobl (2009) and Lessmann (2014). The final tests here draw inspiration from this literature by similarly estimating a semi-parametric functional form that places fewer restrictions on the underlying relationship than did the estimates above which use higher-order SIR terms. The approach involves estimating a piecewise functional form for the relation between population growth rates and SIR. This, of course, allows the effect of initial population level to remain linear.
The piecewise linear function relates growth rates to initial SIR and population as follows:
where the
Table 5 shows results from estimating equation (9) by decade using OLS and IRLS. For the 1980 and 1990 cross-sections, six
Piecewise linear functional form.
Notes: Standard errors in parentheses. Standard errors for odd-numbered models are heteroskedasticity consistent. Cross-section observations = 316 metro areas. The dependent variable is the percent change in total population of city
Given that SIR is diverging in absolute value naturally due to the rise in average SIR in the economy, the potential for SIR effects to influence Gibrat’s law is evident. However, a positive relation between SIR and population growth may not have implications for Gibrat’s law if it is concave. The results in this section, subject to substantial stressing, show that the positive relation reported in previous studies is not linear: it appears concave and not monotonic increasing. These empirical findings provide assurance that the divergence in absolute SIR, which is likely to continue as average SIR rises, may not have permanent implications for the validity of stochastic growth models that generate the empirical size distribution of cities.
Conclusions
This article makes a new observation that findings in the literature regarding divergence of human capital and a positive relation between initial human capital level and subsequent population growth could challenge stochastic models of city growth based on Gibrat’s law. However, analysis of the evidence supports a conclusion that Gibrat’s law and its implications for the size distribution of cities can survive the apparent challenge from these stylised facts.
First, measures of divergence in the skill intensity of cities reported in the literature lack the property of scale invariance. Given the national rise over time in average educational attainment, absolute divergence across locations could be a statistical artefact. Examination of scale-invariant measures of divergence in the measurement section of the article indicates that the dispersion of human capital has not increased relatively and there is evidence that human capital is growing at a rate independent of its initial value. Indeed, testing for both
However, there is rising absolute divergence as a natural consequence of the general rise in education within the labour force. The theoretical section demonstrates that, even when relative divergence is lacking, absolute divergence is sufficient to cause a positive relation between city size and city growth rates that could violate Gibrat’s law. Further analysis of the empirical relation between initial human capital and city growth in the testing section demonstrates that it is diminishing over time and concave. This limits the potential for absolute divergence to create a systematic relation between population levels and growth rates that would violate Gibrat’s law.
This article provides important clarification of previous empirical findings from the literature on urban growth and the distribution of human capital. Overall, the news for stochastic growth models in the results presented here is favourable. Indeed, it appears that the growth rate of human capital in cities is actually unrelated to past levels. This does not mean that absolute divergence will not be found as national education levels rise, but it does mean that there is no relative divergence using either
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
