Abstract
The Hoover Index is the most widely used measure for assessing the concentration or deconcentration tendencies of a country’s evolving population distribution. In this article, the authors propose alternative variants of the index designed to explore the influences of the various demographic components of change (birth, deaths, net domestic, and net international migration) on settlement dynamics. These decompositions can also be applied to examine the differential contributions of various race and ethnicity groups to overall trends. Using U.S. Census Bureau county population estimates, the authors deploy their approaches to examine the nature of deconcentration trends evidenced for the decadal periods 1990–1999 and 2000–2009.
Keywords
Dedication
This article is dedicated to the memory of our colleague, mentor, and friend, Andrew M. Isserman. In his mentoring and collegial roles, he did much to spark our own interests in novel methods for regional demographic analysis.
We, along with Paul Beaumont, were privileged to have worked with Andy building the ECESIS economic/demographic population projection model at the U.S. Census Bureau during the period 1979–81. That venture—an ambitious investigation of the issues connected with building a combined economic/demographic model—was Andy’s project as an American Statistical Association/National Science Foundation/Census Fellow. Among the publications reporting on the results of that project was a book Andy edited, Population Change and the Economy: Social Science Theories and Models (Isserman 1986) and, on the migration component on which we primarily worked, an article in the Journal of the American Statistical Association titled: “Forecasting Interstate Migration with Limited Data: A Demographic Economic Approach” (Isserman et al. 1985). A 1982 article reviewing the then extant sources of federal migration data (Isserman, Plane, and McMillen 1982) would rank among Andy’s most well-cited works.
A major focus of Andy’s far-reaching and multifaceted research interests was the shedding of light on the dynamics of U.S. population trends. Late on one of many long nights we spent at the Census Bureau kicking around ideas about the ECESIS project, Andy told us that when he had been a kid he was always the boy out on the field or basketball court who tried to persuade the others that it really was not too dark to keep playing. Andy had an infectious enthusiasm, a passion for big ideas, a persuasive vision of how academic research could motivate public policies to better the human condition, and a caring commitment to the people with whom he came in contact. His light continues to shine, reminding us that it is never too dark to keep working, questioning, and questing.
Introduction
Demographers, population geographers, and regional scientists have long been interested in the spatial concentration of population. Population densities vary widely from place to place at any given time. In addition, the tendency for people to cluster in some counties, states, or regions more than in others may vary over time, and it is of interest to characterize and understand this variability.
The components of population change can exert differing levels of influence on population concentration. Although we may tend to think of population concentration changing only through the redistribution of population brought about by internal migration, in fact the spatial distributions of other components of change—births, death, and international migration—may be quite important as well.
In addition, various subsets of the population may exhibit differing degrees of population concentration. To understand changes in population concentration over time, it is necessary to understand the changing importance of these subsets. Thus if, for example, Hispanics are more spatially concentrated than the rest of the population, the fact that this group is a growing share of the total population will—even absent a change in the distribution of the group itself—lead to an increase in measures of overall population concentration.
In this article, we use the Hoover Index (Hoover 1941) to examine the changes in population concentration in the United States from 1990 to 2010. We carry this out at several spatial scales, and we investigate the contributions to changes in concentration made by components of population change and by changing racial/ethnic distributions.
The Hoover Index and its Decomposition by the Components of Population Change
The Hoover Index as a Measure of Population Concentration
The degree of population concentration in a region of the earth, such as a country, that is disaggregated into a set of subregions (for instance, states, counties, or census tracts) is most commonly measured using the Hoover Index (Hoover 1941). The index can range from a low of 0 to a high that approaches 100, with larger values indicating greater degrees of concentration. The value of the index can be interpreted as the percentage of the total population that would need to be redistributed across subregions to achieve equal population densities in all subregions. The index is calculated as follows:
Alternatively, the Hoover Index may be found from a Lorenz Curve (Figure 1). The curve is constructed by first calculating each region’s percentage of total area and total population. The regions are then ordered in terms of increasing population density, and cumulative percentage areas and populations are found and plotted—first for the region with lowest density (point A in the figure), then for the two regions with lowest density (point B), followed by the three regions with lowest density (point C), and so on. The Hoover Index is simply the maximum vertical distance between the forty-five degree line and the Lorenz Curve (i.e., the length of segment EB in the figure). Alonso-Villar (2011) focuses on alternative decompositions of the Lorenz curve.

The Lorenz curve and the Hoover Index.
In his seminal 1941 article, Hoover noted that his procedure for gauging population concentration and deconcentration trends had already been used in the context of measuring the localization of manufacturing industries in Great Britain by Florence and Wensley (1939), and he recognized antecedents of the ideas in the work of Wright (1937).
The index is identical in concept to the Index of Dissimilarity used in studies of segregation (where the value of the index indicates the percentage of the population of one of the two groups that would have to be redistributed to achieve perfect integration, i.e., to obtain identical percentage allocations of majority and minority population in each subregion’s population). Although Duncan and Duncan (1955) are properly given credit for the term Index of Dissimilarity (as well as its conventional notation, D), the suggestion of this as a measure of segregation was made earlier by Jahn, Schmid, and Schrag (1947), and in fact Duncan and Duncan cite this earlier work.
Vining (1975) and Johnson and Vining (1976) have examined the Hoover Index within the context of long-term population trends. Specifically, they calculate the index associated with the long-term equilibrium spatial distribution that is implied by current population trends. Monitoring changes in this index is equivalent to tracking the long-term level of concentration toward which the population is currently headed.
Viewed over a long-time period, the US population has increasingly concentrated in urban areas while deconcentrating at the broadest spatial scales: away from the densely populated northeast and toward the less populated south and west. Beyond these well-known long-term trends, more subtle and less obvious counter trends have emerged in certain recent time periods. The so-called metropolitan/nonmetropolitan “turnaround” during the 1970s, termed by some as clean break with past trends, was convincingly demonstrated by Vining and Strauss (1977) using the Hoover Index. They showed that the Hoover Index decreased during that decade regardless of what spatial scale was used to disaggregate the country’s land area. During the 1980s, however, counties in metro areas once again began to grow more rapidly than nonmetro counties, before a second “turnaround,” with a resurgence of faster nonmetropolitan population growth, was noted to have occurred in the early 1990s (Long and Nucci 1997).
Recent Changes in the Hoover Index
Table 1 displays the Hoover Index as calculated at various geographic scales from US Census Bureau annual population estimates, for each year during the period 1990–2009. Recall the interpretation of the index: these numbers indicate that roughly 66 percent of the population would have to be redistributed between counties of residence in order for all counties to have equal population densities.
Hoover Index for U.S. Counties, States, Divisions, and Regions: 1990–2009
Examining the annual trend, we find, similar to Long and Nucci (1997), evidence for a slight deconcentration at the county scale during the early 1990s. Although their period of study ends in 1995, our calculations show the index continuing to decline through 1996. The index dropped from 65.49 in 1990 to 65.43 in 1995 and 1996. 1 Beginning in 1997, county-scale population concentration increased, and the index increased slowly and steadily throughout the remainder of the 1990s and into the 2000s. Thus, at the county scale, the US population was again concentrating during the first decade following the millennium. Although these changes might appear small at first glance, even fractional percentage point changes represent substantial changes in the location of people across counties since the index is a summary for a national population distribution of hundreds of millions.
Table 1 also shows that at scales larger than the county (i.e., states, census divisions, and census regions), deconcentration occurred throughout the twenty-year period. Note too that the value of the index declines with increasing scale: only a relatively small percentage of the population would have to change their census region of residence to ensure equal population densities across the four census regions. A general property of the Hoover Index for nested geographies is that the index value for a higher scale set of units cannot exceed that for a set with finer spatial disaggregation.
Effects of the Components of Population Change on the Hoover Index
In his original application to the measurement of population concentration, Hoover (1941) was keenly aware of the fact that all of the components of population change, and not just internal migration, contributed to the changes in the index. A lack of migration data prevented him from analyzing the effects of the individual components of change on trends in the index of concentration: … the method lends itself to a comparison of the same distribution at two different dates, thus showing the net shift or redistribution occurring between two dates. Such a “redistribution coefficient” does not attempt, of course, to measure either migration or mobility. No adequate statistics exist for the accurate estimation for either of these, save in quite recent years; nor is it possible to trace accurately the effects of differential fertility and mortality upon redistribution. For these reasons, and because for many practical purposes the net total result of birth–death differentials and migration is the important thing, the measurement of redistribution seems worthy of serious attention. (Hoover, p. 201)
Table 2 categorizes rates of birth, death, and migration, according to whether the county was one that had a share of the national population that was greater than, or less than, its share of land area in 2000. The denominators of these rates use an interpolated midperiod population for each decade. Since the length of the period runs from the time of the census (April 1, 1990 and 2000) to the time of the estimates for July 1, 1999, and July 1, 2009, the middle of these periods occur in mid-November of 1994 and 2004.
Components of Change in High and Low Population Density Areas
Note from the table that the domestic net migration rate is positive for the subset of counties that have relatively low population shares and that there is net outmigration from the subset of counties with population shares that are large relative to their area. This is in keeping with the idea that the population is deconcentrating due to the effects of domestic migration; people are, on balance, moving from high population density areas to areas of lower population density.
Also, note that the crude birthrate is relatively higher for those counties that have relatively high population shares; this implies that births act to counteract the effect of domestic migration on population deconcentration.
The crude death rate is higher in those counties with population shares that are low relative to their area. This is at least in part due to the relatively older age structures that characterize these areas. During the decade, the death rate differential acted in the same way as birthrates: to concentrate population share even further, and thus widen the disparities in population density. Furthermore, the difference in crude rates between the two areas is substantially higher for deaths than it is for births. Finally, international migration also acts in this same direction, and the effect of international migration alone would be to increase concentration, since the rate into high-density areas is greater than the rate into low-density areas.
Figure 2 depicts both the spatial distribution of birthrates and population density. The darkest gray and cross-hatched areas constitute places of higher than average population density, and white and light gray are used for places with lower than average population density. The darkest gray areas have higher than average crude birthrates, and since these are in high population density areas, they contribute to concentration. Similarly, crude birthrates are lower than average in the white areas—they also contribute to increasing concentration since they represent counties with low population density. The large number of rural counties in the Midwest with low crude birthrates (white), combined with a smaller set of highly urbanized counties with high crude birthrates (the dark gray areas, including, e.g., Chicago and southern California), give rise to the finding that the distribution of births is acting to increase population concentration.

Population density (2005) and crude birthrate, 2000–2009.
Figure 3 is similar, but now the spatial distributions of death rates and population density are being shown. The darkest gray and cross-hatched areas again constitute places of higher than average population density, and white and light gray are again used for places with lower than average density. Dark gray areas have lower than average crude death rates, and since these are in high population density areas, they contribute to concentration. Crude death rates are higher than average in the white areas—they contribute to increasing concentration since they represent counties with low population density. The combination of many rural counties in the Midwest with relatively high crude death rates (reflecting in part the relatively old age structure of this region) and a sizable number of urban areas with low crude death rates (the dark red areas, including New York City, Southern California, and parts of Florida) combine to cause the distribution of death to contribute to increasing population concentration. The low crude death rates in these regions are at least partially attributable to relatively young age structures, and the age structure is in turn affected by the migration patterns of both the young and the elderly.

Population density (2005) and crude death rate (2000–2009).
Table 3 illustrates the second of the three perspectives we propose on demographic decomposition of changes in the Hoover Index. It was constructed by using county shares of national births and deaths, and by comparing those shares with shares of land area. It shows that (1) the spatial distribution of births is more concentrated than the spatial distribution of deaths, (2) the spatial distribution of both births and deaths have levels of concentration that are not too different from that for total population, and (3) there was little change in the Hoover Index for both births and deaths throughout the decade.
Hoover Index for Births and Deaths
This index is of course “global”—it is a single number that captures the extent of spatial concentration. Thus, although the index is similar for deaths and population, the detailed geographic distributions of population and death across counties differ (e.g., we have seen from the previous table that the death rate is higher in counties with lower population densities).
One may also ask how the Hoover Index would change if only a single component acted upon population. Table 4 depicts the results of this third approach to examining the influence of individual components of change on concentration. We term the quantities shown here “Effect-Alone” Hoover Indexes. For example, the values shown in the “Births” column for 1991 to 1999 were calculated by adding to the 1990 populations of each county the number of births recorded for the county between July 1 of the year listed and the 1990 Census April 1 base date to obtain a hypothetical “birth effect-alone” population. Similarly, the 2001–2009 numbers were calculated for the sums of births and the counties' Census 2000 base populations.
Effect-Alone Hoover Index Values for Components of Population Change
Examining the time-series index values in Table 4, we can see that the separate effects of births and deaths, if left to act alone on population change, would have been to cause small, yet steady increases in population concentration throughout the decade. Similarly, the effects of international migration were also in the direction of increased population concentration; there was more net migration into the counties that already had higher than average population density.
The “combined” column shows the joint effect of birth, death, and international migration on the Hoover Index, and reveals the increase in population concentration that would have occurred in the absence of domestic migration. The final column shows the effects of domestic migration alone. Here we see that, in the absence of the other components of change, there would have been a small amount of population deconcentration. The other components served effectively to mask the effect of internal migration trends on deconcentration.
The Hoover Index Decomposed by Race and Hispanic Origin
The overall concentration or deconcentration trends exhibited by the US population can also be thought to be composed of the trends in population change for various race and ethnic groups. As shown in Table 5 , the values of the Hoover Index and the changes in the index from 2000 to 2010 are rather different when computed separately for settlement patterns of different race/ethnicity groups. Because annual subnational population estimates by race and Hispanic origin are less reliable than those for the components of change, and because a different race categorization was adopted in 2000 than had been in use previously, here we restrict our analysis to the subpopulations of selected groups as enumerated on the 2000 and 2010 censuses.
Hoover Indexes of Population Concentration, 2000–2010, by Hispanic Origin and Race
There are also interesting differences in concentration or deconcentration depending on the geographic units used to calculate the index. Whereas our analysis of the effects of the components of change used annual estimates data, which permit analysis only down to the county level, race and Hispanic origin are “complete count” items on the decennial censuses, and thus we can examine the index at the scale of census tracts as well for counties and states. 2 Because census tract boundaries do not cross county lines, it is possible to compare across three sets of units that exhibit nesting relationships. As was noted in discussing Table 1, one of the properties of the Hoover Index for nested geographies is that the values cannot be smaller for the more spatially disaggregated units.
Note that the Hoover Index for total population increased from 2000 to 2010 when calculated for either census tracts or counties, revealing a trend toward more concentrated settlement. When calculated for states, however, the index value went down, indicative of a broader scale trend toward overall population deconcentration.
When calculated separately for each of the race/Hispanic origin groups, the Hoover Index discloses relative levels of concentration. Across all three scales of geography, and for both time periods, the highest values, indicative of the highest levels of spatial concentration, are those calculated for Asians. The lowest values, meaning the most dispersed pattern of settlement, are those for whites and American Indians and Alaskan Natives.
At the state scale, for every race and Hispanic origin group examined there was a trend toward deconcentration of population: lower Hoover Index values were found for 2010 than for 2000. At the county and census tracts levels, however, overall population became more concentrated. The overall increase in concentration is reflective of the differential population growth dynamics across the race/Hispanic origin groups. The numbers illustrate an important point about comparing the changes in the Hoover Index for subpopulations having differential rates of growth to the change in the index for total population. Let us consider, for instance, the trends for two groups: non-Hispanic whites and Hispanics.
At the tract level, a concentrating pattern of non-Hispanic whites—when compared to the non-Hispanic white distribution in 2000—can be noted. A further factor, however, is shown in Table 6 : non-Hispanic whites accounted for a lower proportion of the total population in 2010 than in 2000. As a group that continued to be more deconcentrated than the population overall, their declining share vis-à-vis other more spatially concentrated groups was an additional factor contributing to more concentrated overall population.
Changing Percentage Shares of National Population for Hispanic Origin and Race Groups, 2000–2010
At the county scale, Hispanics in 2010 were less concentrated than were Hispanics in 2000. Hispanics, however, continued to exhibit substantially above-average levels of concentration and had well above the national rate of population growth over the decade (see Table 6). Thus, Hispanics contributed more to the Hoover Index value for 2010 than for 2000. Their increased proportion of the population was a factor in the greater overall concentration of the US population at the county level—even though the group itself was becoming more dispersed!
The Effect-Alone method discussed in the earlier section of the article can be applied to help provide a more explicit perspective on the influence of the settlement trends of Hispanics, non-Hispanic whites, and the various race groups on the overall concentrating or deconcentrating tendencies of the American population. Table 7
shows calculated values of the Hoover Index for hypothetical distributions of 2010 population if it were to be assumed that the only change occurring in the system was the changed distribution of the specified group. So, for example, the tract-level index value for Hispanics, 77.44 percent, was obtained based on adding the change in the number of Hispanics residing in each tract to the tract’s base (Census 2000) total population to obtain a 2010 “Hispanic Effect-Alone” population:
Effect-Alone Hoover Indexes of Population Concentration for 2010, by Hispanic Origin and Race, with Differences from 2000 Index Value for Total Population
The main difference between what took place at the county scale and at the tract level was that the changed distribution of non-Hispanic whites represented less of a deconcentrating influence at the county scale. Hispanics, blacks, and Asians all exerted significant concentrating effects. Note, too, that many persons of Hispanic origin choose to self-enumerate in the Census Bureau category of “Some Other Race,” and across all our findings the “Some Other Race” and “Hispanic origin” index values tend to be quite similar.
At the state scale, at which the overall Hoover Index discloses population deconcentration, it is non-Hispanic whites who account for most of the lowered value, as an already below-average-concentration group further deconcentrates. Hispanics at the state scale have been redistributing away from the traditional Mexican American border regions into other portions of the country, but given their above-average concentration their more dispersed settlement pattern across the states represents a relatively insignificant influence on changing the overall index. Similarly, blacks, Asians, and a number of the other groups show more muted effects at the state scale than at the tract and county scales.
We have thus now examined the influences on the Hoover Index of concentration of population change broken down both by the demographic components of change and by broad race/ethnicity groups. In principle, it would be possible to take the analysis one step further to examine an even more disaggregated picture of distributional change by breaking out each of the components by race/ethnicity. Thus, one could examine, say, the effects of Hispanic fertility and non-Hispanic white internal migration. While births and deaths by race and Hispanic origin are obtainable down to the county scale (if not the tract level), internal migration and immigration/emigration data are much more problematic. While the Census Bureau in preparing its annual county estimates by race and Hispanic origin disaggregates domestic and foreign in- and out-migration, 3 the numbers are generated for internal use only and, at present, are not publically released.
Summary and Discussion
In this article, we have described the changes that have occurred in population concentration in the United States during the past two decades. Population concentration at the county scale is attributable to the spatial pattern of births, deaths, and international migration; the effects of internal migration alone were in the opposite direction and would, on their own, have led to population deconcentration. At larger spatial scales (i.e., states and census divisions and regions), the population continued its long-term trend of deconcentration. Although not studied here, these national trends exhibit spatial heterogeneity that would be interesting to examine further. For example, during the most recent decade, regions in the central and Rocky Mountain portions of the country exhibited population concentration at the county scale, while counties in New England and near the Pacific coast exhibited deconcentration at this scale.
Changes in the population concentration are also affected by both the changing distribution of particular racial/ethnic groups and the changing share of these groups, relative to the total population. Thus, at the county scale, although the Hispanic population became less concentrated during the most recent decade, this subpopulation actually contributed to the increase in concentration observed for the total population, by virtue of the facts that (1) the Hispanic population is more concentrated than the rest of the population and (2) the Hispanic population constituted an increasing share of the overall population as the decade progressed.
The Hoover Index, as a measure of population concentration, is straightforward to calculate and easy to interpret. It provides a single summary measure that allows for the study of change over time. Changes in population concentration, however, are the result of changes in the concentration of various components or subgroups of that population. Study of the contributions that each of these components and subgroups make to changing population concentration can shed light on the characteristics of population processes.
Footnotes
Acknowledgments
The authors acknowledge the assistance of Drew Trgovac and Peter Kedron with the production of the figures.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
