Abstract
This article examines the ability of proportional hazard models to evaluate changes in land use through time. There are three specific objectives: (a) to review previous research on the complexity of urbanization and explain how the spatial hazard framework accommodates that complexity; (b); to estimate a series of spatial hazard models characterizing land use in the twenty-five highest growth core-based statistical areas (CBSAs) of the United States in 1990, 2000, and 2006; and (c) to use the estimation results to track land use change region-by-region over the 16-year time frame. Overall, the analysis reveals that the spatial hazard framework offers a highly effective means of describing land use change. Along the way, it also illustrates that the classic model of urbanization continues to hold in an evermore-complex world—albeit, in an explicitly uncertain and inherently probabilistic manner.
Introduction
In their classic article The Urban Field, Friedmann and Miller (1965, 314) suggested that the city should no longer be viewed as a “physical entity,” but, instead, as “a pattern of point locations and connecting flows of people, information, money, and commodities.” The work was prescient because it had identified a fundamental break in the American way of land use—a break brought on by the outright disintegration of clear demographic, socioeconomic, and spatial boundaries between urban, suburban, exurban, and rural settings. 1 Over the nearly fifty years since, land use patterns have continued to evolve along this trajectory and essentially all urbanization, no matter how far-flung, is now anchored, one way or another, to one or more of the country’s 967 core-based statistical areas (CBSAs). As shown in Figure 1 , the contemporary urban field—defined, following Friedmann and Miller (1965), as the area located within about a one-hour drive, or a 100-kilometer radius, of a CBSA—covers most of the continental United States. Not only is the nation personally urbanized, with around 83 percent and 10 percent of its population living in metropolitan and micropolitan areas, respectively, it is spatially urbanized, with most of its territory located within the sphere of one class of CBSA or the other.

The contemporary urban field.
Because of its geographic scope, this still-emerging reality poses daunting problems for the study of land use and, even more, land use change. In particular, urbanization is exceptionally diverse and so too are the people and activities it accommodates, plus the various landscapes that it is situated on. Consider, for instance, the vast differences between the Northeast Corridor and Southern California conurbations, or between the environments of the Atlantic Southeast and the Pacific Northwest—they confound both the simplifying assumptions of theoretical models of land use and the practical limits of empirical methods of describing it: to wit, flat, featureless plains and perfectly smooth, negative exponential density gradients can be hard to justify theoretically (Brueckner 1982, 1987) and even harder to locate empirically (Johnson and Kau 1980; Kau and Lee 1976a, 1976b, 1977; Kau, Lee, and Chen 1983). As a consequence, researchers have struggled through the years to characterize urbanization in a way that enables scientific analysis of similarities and dissimilarities from place-to-place and time period-to-time period. But, in spite of this effort, a definitive approach has yet to be discovered. As soon as one group (Burchfield et al. 2006, most recently) seems to have come up with one, another (Irwin and Bockstael 2007, in that case) delivers evidence to the contrary. In short, generalizing about the way of land use across a nation as large and variegated as the United States remains problematic. The challenge must be overcome, though, because social scientists and policymakers alike require the ability to compare and contrast outcomes around the country in order to address them on evidentiary—and not strictly interpretive—grounds (Batty 2007).
Toward that end, this article examines the ability of proportional hazard models—a class of duration, or failure time, models originally developed for analyzing life cycles (Cleves, Gould, and Guitierrez 2005; Heckman and Singer 1984; Kiefer 1988; Lawless 2002; Odland and Ellis 1992; Selvin 2008; Waldorf 2003)—to evaluate changes in land use through time. It builds directly off a previous analysis (Carruthers et al. 2010) that establishes hazard models as a viable tool for studying spatial point patterns generated by urbanization. The present objectives are three: (a) to review previous research on the complexity of urbanization and explain how the spatial hazard framework accommodates that complexity; (b); to estimate a series of spatial hazard models characterizing land use in the twenty-five highest growth CBSAs of the United States areas in 1990, 2000, and 2006; and (c) to use the estimation results to track land use change region-by-region over the 16-year time frame. Overall, the analysis reveals that the spatial hazard framework offers an effective means of describing land use change and comparing diverse outcomes through time. Along the way, it also illustrates that the classic (Alonso 1964; Mills 1971; Muth 1969) model of urbanization continues to hold in an evermore-complex world—albeit, in an explicitly uncertain and inherently chaotic manner.
Background Discussion
Complexity, Land Use, and the Urban Field
Land use patterns are inherently complex: urbanization is, after all, composed of physical development—buildings, infrastructure, and other engineering—that has been shaped, large and small, by a literally countless number of individual actions taken by its builders, inhabitants, and planners (Batty 2007). Plus, different regions have different cultures, functions, geographic constraints, and 5natural resources and have likewise (and consequently) experienced different cycles of growth and decline through time (Perloff et al. 1960). Even in a nation as young as the United States, land use has evolved over the course of hundreds of years and it has done so under continually shifting economic, environmental, demographic, social, and technological circumstances. As an outcome, urbanization is a veritable mash-up of different modes of land use with an internal structure that varies significantly from spot-to-spot and era-to-era—no two regions are the same, and individual regions exhibit a diverse patchwork of development.
Adding to this complexity, the sphere of most regions has expanded so greatly over the past half-century that long-standing distinctions between urban, suburban, exurban, and rural settings have lost much of their meaning (Clark et al. 2009; Frey 2004). Friedmann and Miller (1965) recognized this early on and responded by suggesting that urbanization should be reframed as a field—rather than material—concept, wherein the regional center exerts both centripetal and centrifugal forces. Specifically, they: (a) acknowledged that the net flow of migration from rural to urban parts of the country was unlikely to change; but (b) at the same time, suggested that development patterns were taking on a new, far more expansive and elaborately structured, character. 2 Only a few years later, this view was vindicated by the Census Bureau’s Current Population Reports, which revealed that, beginning in the late 1960s, internal migration had favored nonmetropolitan areas over metropolitan areas—in a dramatic turnaround, the former grew at the expense of the latter as households and, eventually, firms began relocating to outlying centers (Beal 1975; Gordon, Richardson, and Yu 1998). Though this trend, along with various explanations for it, has waxed and waned through the intervening years (see Frey 1993; Fuguitt and Beale 1996), it now seems clear that Friedmann and Miller’s (1965) field concept is of enduring value. The nonmetropolitan turnaround may not have been the “clean break” that some analysts (Vining and Strauss 1977) initially interpreted it to be, but its decisive transformation of land use patterns is indisputable (Gordon 1979). Most regions still retain a dominant center of gravity, but their development is more complex than ever before—partly because of the nature of urbanization itself, and partly because of how the urban field holds its far-flung, polycentric anatomy together.
Yet, in spite of all this, the classic (Alonso 1964; Mills 1971; Muth 1969) economic model of urbanization continues to explain the general tendencies of land use, even within very large regions having an extended spatial hierarchy (Bogart 2006; Glaeser and Kahn 2004). In its simplest form, the model describes a perfectly smooth, monotonic rent gradient that declines with distance from its peak at the central business district of a circular region situated on a flat, featureless plane. At equilibrium, all households, which are assumed to be identical, attain the same level of utility—and, so, the rent gradient reflects the trade-off between location and the cost of travel to and from downtown. A corresponding and equally smooth density gradient emerges as a result of households consuming progressively greater amounts of land, a normal good, toward the urban fringe where land is less expensive. The density gradient and, with it, urbanization come to an end once the rent gradient (minus the cost of construction) reaches zero and the highest and best use of land is no longer for development but, instead, for some natural resource-oriented activity. 3 In practice, the pattern is rarely, if ever, monocentric, but the same story readily generalizes to polycentric settings. The reason for this is that, under the conditions just described, firms, which are also assumed to be identical—similar to households, all firms attain the same level of profits (zero)—have an incentive to decentralize: since a household’s net income is its wage less the cost of commuting, a decentralizing firm can offer lower wages and still attract the labor that it requires (DiPasquale and Wheaton 1996). As shown in Figure 2 , the result is a polycentric bid rent gradient, ρ(d), that first falls with distance, d, from the central business district, then climbs as it approaches the outlying subcenter and, finally, falls again until it reaches the baseline rent, ρ(n), which reflects the value of land as a natural resource (for empirical examples, see Heikkila et al. 1989; Richardson et al. 1990). This kind of rent gradient emerges organically when the marginal costs of production and/or transportation are large relative to the population and physical size of the region in question (Odland 1978; Scott 1988).

A polycentric rent gradient.
A more formal description of household behavior within this framework is as follows (see Fujita 1987 for a complete exposition). Households have a common utility function, U(z,s), which contains a composite good, z, and urban space, or land, s. A household’s budgetary constraint is determined by its income, y, less the cost of travel, k, between its place of work and its location at radial distance d from its place of work:
Figure 3 illustrates the connection between bid rent and optimal lot size. It displays the marginal rate of substitution, described by an indifference curve (the arc) for a fixed level of utility, u, between the composite good, z, and land, s, plus the budget constraints (the dashed lines) and corresponding consumption bundles (the dotted lines) for two households located at distances d 1 and d 2 from a common place of work, where d 1 < d 2. Because the cost of travel to and from work, k(d), is lower at d 1 than it is at d 2, the net income of the household located at d 1 is greater than the net income of the household at d 2, or y – k(d 1) > y – k(d 2). The two budget constraints, which must be tangent to the indifference curve in order for each of their respective households to achieve utility level u, show that: (a) the bid rent, which is equivalent to the slope of the budget constraint, for the household located at d 1 is greater than the bid rent for the household located at d 2, or ρ(d 1, u) > ρ(d 2, u); and (b) the optimal lot size for the household located at d 1 is less than the optimal lot size for the household located at d 2, or ς(d 1, u) > ς(d 2, u). In short, all else being equal, households located closer to their workplace pay a higher price per unit of land and, so, consume less of it—but still manage to attain the same level of utility, by substituting more of the composite good.

Bid rent process and residential land use.
The strength of this framework lies in its ability to distill the complexity of urbanization into a few simple relationships that explain the general tendencies of land use. In doing so, it also illuminates the explosion of the urban field that occurred in the wake of the nonmetropolitan turnaround: household income and commuting costs have, respectively, grown and declined dramatically in the years since World War II, and their combined impact first began materializing in the late 1960s (see Mieszkowski and Mills 1993). All else being equal, an increase in income, or, equivalently, a decrease in the cost of commuting, shifts the budget constraint shown in Figure 3 outward from the origin, enabling households to reach a higher level of utility through more land and/or other forms of consumption. 4 Households continue to face the same trade-offs as always but they increasingly have more income to allot and less aversion to commuting and, so, adjust their land consumption accordingly. But the weakness of this framework—for all its explanatory power—is that it is baldly deterministic, when actual land use patterns are not. Urbanization rarely unfolds monotonically, much less smoothly, but, instead, for all the reasons given above and more, appears to be a discontinuous patchwork that becomes progressively more complex as the scale of perspective expands. Even though land use does normally grow less dense with distance from various centers of gravity, it typically does so in a disjointed and seemingly chaotic manner. The problem with modeling land use patterns anymore, then, rests not so much with theoretically explaining why they are as they are, but with empirically characterizing how they are—while certain potentials prevail throughout the urban field, actual material conditions do not necessarily (Stewart 1947; Stewart and Warntz 1958) meaning that it is one thing to predict general tendencies and another to model specific outcomes.
Modeling Land Use—and its Complexity
Efforts to scientifically evaluate changes in land use date at least to Clark’s (1951) discovery of the negative exponential density gradient:
Ever since, the density gradient has been the workhorse of land use analysis: it is straightforward to implement and very flexible—it can be estimated in virtually any functional form, and expanded to include any number of explanatory variables besides distance (McDonald 1988). Plus, it engages naturally with economic models of land use, which, as shown in Figure 2, normally portray development in a one-dimensional setting. Just like the theory outlined above, the strength of the density gradient lies in both its simplicity and its ability to representatively describe the general tendencies of land use worldwide (Anas, Arnott, and Small 1998). But, likewise, the weakness of the density gradient lies in the fact that it, too, is restrictively deterministic and glosses over the inherent complexity of urbanization. Indeed, studies have shown that the negative exponential density gradient in particular rests upon unrealistically strong assumptions (Brueckner 1982, 1987) and may grossly mischaracterize underlying development (Johnson and Kau 1980; Kau and Lee 1976a, 1976b, 1977; Kau, Lee, and Chen 1983). As always, generality comes at a loss of specificity, so it’s only fair to ask: what is the alternative? Although faulting the density gradient is easy, modeling land use in a way that better reflects its complexity is not. Nevertheless, the fact is that contemporary urban centers project a far-reaching field that encompasses and influences—even organizes—various permutations of clustered, nonclustered, contiguous, noncontiguous, and linear development patterns (Clark et al. 2009). A single transect may look more like Tobler’s (1969) spectrum of Interstate forty than a well-behaved distance gradient, monotonic or not. And, even in the most general of terms, it is a rare case that exhibits anything like a uniform pattern all 360° around the regional center of gravity. What’s required, are empirical models of land use that somehow accommodate the gnawing uncertainty that attends complexity—and, more, that make that uncertainty a main aspect of the analytical framework (Batty 2007).
One such approach is the “fractal geometry” method pioneered by Batty and Longley (1987, 1994) and Frankhauser (1994). Fractals are chaotic shapes having, in the context of geographic phenomena, a dimension of between one and two—somewhere between a one-dimensional line and a two-dimensional polygon (Miller 2009)—that is a measure of space filling: the greater the fractal dimension, the greater the space filling, and the more compact the development pattern (see Peitgen, Jürgen, and Saupe 2004). For example, Batty (2007) reports the following fractal dimensions for six regions: (a) 1.539 for Albany, New York; (b) 1.793 for Buffalo, New York; (c) 1.760 for Cleveland, Ohio; (d) 1.670 for Columbus, Ohio; (e) 1.673 for Pittsburgh, Pennsylvania; and (f) 1.370 for Syracuse, New York. By these measures, Buffalo is the most compact of the six and Syracuse is the least. The fractal dimension of urbanization (or any other object) is measured by estimating the power law:
Another approach to modeling land use that places uncertainty at the center of the analytical framework is the “spatial hazard” method (Carruthers et al. 2010). This turn on traditional (Anselin and Rey 2010; Boots and Getis 1988; Diggle 2003; Fotheringham, Brunson, and Charlton 2000) point pattern analysis
7
—developed by Odland and Ellis (1992) and formalized by Waldorf (2003)—involves adapting proportional hazard models, also called accelerated failure time models, to spatial settings. Hazard models are longitudinal models designed to estimate the conditional probability of a time frame ending (Cleves, Gould, and Guitierrez 2004; Heckman and Singer 1984; Kiefer 1988; Lawless 2002; Selvin 2008). They come out of engineering, but have been applied to a variety of issues in regional science and other fields—for example, Irwin and Bockstael (2002) and An and Brown (2008) use them to study the timing of land use change. Like time, distance, D, is a nonnegative random variable that terminates at a particular point, d, conditional on the probability of having made it to that point in the first place. This characteristic results in there being a hazard function that describes the baseline rate at which distances separating spatial points terminate:
In sum, the fractal geometry and spatial hazard approaches are complementary alternatives to analyzing land use via density gradients: both address the inherent complexity of development but, whereas fractals characterize its material condition, hazard functions characterize its field of potentials. The fractal method is an excellent means of evaluating land use change, but the hazard method—which holds great potential because it, like the density gradient, may be used to operationalize the very powerful behavioral theory outlined in the first half of this discussion—remains unproven. Is the approach viable? The following section tends to this question by estimating a series of spatial hazard models of urbanization and evaluating their ability to describe how the American way of land use has changed over the past two decades.
Empirical Analysis
Data and Econometric Specification
The empirical analysis is focused on the twenty-five highest growth—between 1990 and 2000—CBSAs of the United States in 1990, 2000, and 2006. The regions are listed from largest to smallest in Table 1 . In the eight cases that are composed of two or more divisions, the divisions themselves are used, so, counting all of these, the actual number of settings is thirty-six. 9 The units of analysis are census tracts, defined by their 2000 boundaries, and the data comes from four sources: (a) a nationwide count of housing units at the census block level in 2006; 10 (b) a Geolytics product that allocates select Census Summary File 1 (SF-1) variables from 1990 census block group boundaries to 2000 boundaries; (c) a second Geolytics product that allocates Census Summary File 3 (SF-3) from 1990 tract boundaries to 2000 boundaries; and (d) SF-3, from the 2000 census. Comparing localized census data through time is hard because block group and tract boundaries are regularly redrawn to accommodate changes in the geography of the population—but the two Geolytics products were used to overcome this problem by reconciling population estimates from 1990 into 2000 block group boundaries and, then, by reconciling other (SF-3) data from 1990 into 2000 tract boundaries. Finally, block group-level housing unit counts from 2006 were multiplied by 2000 estimates of average household size to develop 2006 population estimates that could be compared to the 1990 and 2000 estimates. 11 Though intensive, these machinations were necessary in order to unify the geometry of the data across all three years.
Twenty-Five Highest Growth CBSAs, 1990–2000
After laying this groundwork, a database of spatial point patterns and relevant attributes was assembled in a geographic information system (GIS) via a process detailed in Renner et al. (2009). In a nutshell, the process involved five steps. In the first step, a base-map consisting of all census block groups in the continental United States—there are 208,643—was created and their population estimates used to generate a population weighted center for each of the 66,157 tracts that make up the country in 1990, 2000, and 2006. As opposed to the geometric center, this so-called “mean center” (see, for example, Barber 1988) is a point that marks where people were concentrated within the tracts, which can be quite expansive, at the three points in time. In the second step, similar routines were run to generate population weighted centers, the 939 CBSAs, and for each county subdivision in 2006. Here again, the points produced by this process mark the mean center of the regions and their various subcenters; they were held constant (arbitrarily, at their 2006 position) in order to facilitate consistent analysis through time. 12 In the third step, each tract-level point was assigned to a CBSA-level point, whether it “officially” belongs there or not, and to a subcenter-level point via a nearest neighbor routine. In the fourth step, the GIS was used to generate three sets of rays measuring the distances separating tract-level points from: (a) their regional center; (b) their nearest subcenter; and (c) their nearest neighbor. Finally, in the fifth step, relevant data (identified below) from SF-3 was assigned to the tract-level points; since 2006 is between census years, those points had to be matched with data from 2000. This attribute data was then stacked, forming an n × t panel for each CBSA involved in the analysis, where n refers to the number of tracts and t refers to the three years of observation. The results of this data assembly process are illustrated in Figure 4 , which contains maps of spatial point patterns in the four regions—Las Vegas, Nevada, Austin, Texas, Raleigh, North Carolina, and Phoenix, Arizona—that experienced the highest rates of growth between 1990 and 2000. The rays visible in the maps connect nearest neighbor tracts to one another and measure the distances that are the object of this analysis.

Spatial point patterns in (clockwise from upper left) Austin, TX, Las Vegas, NV, Phoenix, AZ, and Raleigh, NC.
Returning to the modeling framework that was outlined above, economic theory yields the following two core premises: (a) the baseline hazard function for distance separating the spatial points that make up an overall pattern of urbanization is bound to exhibit positive spatial dependence; and (b) the baseline hazard decelerates with distance from regional centers of gravity. In other words, the probability of the distance between tract-level points terminating increases with the distance that separates them and decreases with the distance that separates them from their regional center and their nearest subcenter (see Carruthers et al. 2010). A Weibull distributed spatial hazard model of urbanization based on these expectations is as follows:
The other explanatory variables (besides the two distance measures) contained in the vector Xik also flow directly from theory. Specifically, the economic model of urbanization points to three main variables: (a) land is a normal good, so household income, including wages and all other sources, positively affects the optimal lot size—meaning that income is expected to decelerate the hazard of the distance between points terminating; (b) commuting costs are what determine the budgetary constraint, so time spent traveling to work is expected to either accelerate or decelerate the hazard of the distance between points terminating, depending on region-specific conditions; and (c) as footnoted above, due to vintage effects, aged development, which is often of a different density than contemporary market conditions call for, is expected to influence the hazard of the distance between points terminating. In addition to these three factors, population is included in order to control for the fact that, other things being equal, larger tracts will encompass a larger area. This variable is expected to decelerate the hazard of the distance between points terminating. Table 1 gives the specific definition and source of each variable; descriptive statistics are available upon request.
Estimation Results
The maximum likelihood estimates of the thirty-six individual spatial hazard models, which were generated CBSA-by-CBSA using the streg command in Stata, are listed in alphabetical order in Table 2 . Note that none of the parameter estimates carry a negative sign, because they are “hazard ratios” that scale the baseline hazard—values less than one decelerate the baseline hazard and values greater than one accelerate it. The estimates are for the most part consistent with the estimates of previous research (Carruthers et al. 2010), which: (a) focused on a somewhat different set of regions; (b) dealt only with the 2006 time period; (c) did not address subcenters; and (d) used block groups, not tracts, as the unit of analysis. As a precursor to evaluating the models' ability to describe changes through time, the following paragraphs summarize the estimates.
Data Definitions and Sources
Note: All data is at the level of census tracts.
First, every region’s shape parameter, λ, is positive and statistically significant at well over a 99 percent confidence level, confirming the expectation that the probability of the distance between points terminating increases with the distance that separates them. Once again, a general finding is that, as a set, the shape parameters indicate that urbanization—however, chaotically evolved and uncertain it may be—exhibits genuine, probabilistic order. Second, the parameter estimates on the two temporal fixed effects are almost all statistically significant and all positive, signaling that, if all else remained equal through time (which it did not) every region would have grown more compact. This aspect of the analysis is dealt with in detail below. Third, moving under the Φ heading, the parameter on distance from the regional center is negative and highly significant in every case, indicating that, as also expected, the probability of the distance between points terminating decreases with the distance that separates them from their regional center. Fourth, the parameter on distance from the nearest subcenter is nearly always negative and statistically significant, meaning that the probability of the distance between points terminating decreases with the distance that separates them from their nearest subcenter. The exceptions, where the opposite effect is registered, are very large, dense regions like New York City and Chicago. Fifth, the parameter on household income is almost uniformly negative and statistically significant—land is a normal good so, other things being equal, income decelerates the spatial hazard function. Sixth, as in previous research, the parameter on travel cost has a somewhat mixed effect: in those regions registering a positive sign, as most all do, it is associated with a more compact pattern of urbanization whereas, in those regions having a negative sign—New York City and Newark—it is associated with more sprawl. Seventh, the parameter on the age of housing units varies across regions: in about two thirds of the cases where the variable is statistically significant, the influence is positive, suggesting that older development is generally denser than newer development. This finding is different from before, but it seems plausible that adding distance to the nearest subcenter to the mix alters the effect of the variable. Finally, the parameter on population, a control for the shear size of census tracts, is nearly always statistically significant and negative.
Moving on to aggregate patterns of land use, spatial hazard models, as explained above, portray urbanization not as a material condition but, instead, as a field of potentials. To illustrate this, the estimation results just summarized are evaluated by tracing out survival functions—which are simply the opposite, and more intuitive, way of expressing hazard functions
13
—at relevant values of explanatory variables. Following Carruthers et al. (2010), this is done by varying xi→
center, distance from the regional center, and, on top of that, the two temporal fixed effects while holding the remainder of Xik
constant at the mean
The resulting survival curves, which were generated using the stcurve command in Stata, are shown region-by-region in alphabetical order in the left-hand panes of the panels contained in Figure 5 . These survival curves, which are cumulative probability functions, describe the conditional probability of the distance between nearest neighbor tracts extending past a particular distance at relevant locations within the regions. In the graphs, the x-axis, which registers distance between nearest neighbors, ranges from zero to 5,000 meters, and the y-axis, which registers the probability that dij extends, ranges from zero to one. Going from left to right, the ten separate curves shown in each of the graphs correspond to the distance from the CBSA center, ξ i→center, that captures ~5, …, ~95 percent of the region’s population; the graphs are all consistent and, so, are directly comparable to one another. As a set, they show that (subjectively, at least) each of the thirty-six regions falls into one of four basic typologies (Carruthers et al. 2010): (a) high-density, compact—Chicago, Illinois, Los Angeles, California, Nassau, New York, New York, and San Francisco, California; (b) low-density sprawl—Atlanta, Georgia, Austin, Texas, Bethesda-Frederick, Maryland, Charlotte, North Carolina, Ft. Worth, Texas, Gary, Indiana, Nashville, Tennessee, Orlando, Florida, Phoenix, Arizona, Raleigh, North Carolina, Riverside, California, and San Antonio, Texas; (c) high-density core, with sprawling outer areas—Dallas, Texas, Denver, Colorado, Houston, Texas, Las Vegas, Nevada, Miami, Florida, Minneapolis-St. Paul, Minneapolis, Newark, New Jersey, Portland, Oregon, Sacramento, California, and San Diego, Californai; (d) nearly spatially invariant, at various densities—Ft. Lauderdale, Florida, Lake-Kenosha, Illinios-Wisconsin, and Oakland, California. Whatever the particular case, the graphs displayed in Figure 5 reveal how land use unfolds outward from the regional center of gravity and, because they express only probabilities, they portray urbanization not as a material condition but, rather, as a spectral field of potentials.

Estimated and differenced survival functions.
Changes
When the three sets of survival functions shown in the left-hand panes of the panels in Figure 5 were generated in Stata, the outfile option was used to capture the numeric data that describes them. This operation produced a total of 108 (36 × 3) new “.dta” files, containing ten columns apiece, or one column for every curve shown in the graphs. Additional graphs registering changes from year-to-year were then generated by using the numeric data to difference the various survival functions for each region, and the results are shown in the right-hand panes of the panels in Figure 5: (a) 1990–2000; (b) 2000–2006; and (c) 1990–2006. For example, the 1990 numeric data was subtracted from the 2000 numeric data to obtain the 1990–2000 graphs. This procedure is an effective means of evaluating land use change within individual regions because the proportional hazard models were estimated as panels with temporal fixed effects and, so, the functions for individual regions have a single underlying shape parameter—what’s being compared, is how the estimated baseline hazard,
To see just how and why this works, consider a simpler model than the one in Equation (10) wherein the baseline hazard is influenced by a single generic fixed effect, θ:
Back to the matter at hand, the graphs in the right-hand panes of Figure 5 illustrate how land use has changed in the thirty-six regions engaged in the analysis over the past two decades. As before, the x-axis, which ranges from zero to 5,000 meters, registers distance between nearest neighbors—but the y-axis, which ranges from –0.4 to 0.2, now registers the change in the probability that distance extends. Note that the changes need not be homogeneous across the ten survival functions, and, indeed, as the background discussion suggests, it is reasonable to expect up-front that, in many cases, they are quite heterogeneous. Most urbanization is a mash-up of different eras and modes of development, so the patterns of change registered by the functions necessarily depend on the within-region location (i.e., core vs. periphery) and nature (i.e., compact vs. sprawl) of growth. Plus, as footnoted above, some locations may exhibit little or no change at all, if they have been built out according to space filling norms (see Fotheringham, Batty, and Longley 1989). When the change curves are positive, they imply a sprawling effect and, when they are negative, they imply a compacting effect—positive (negative) changes correspond to an increased (decreased) survival rate, or, stated the other way around, positive (negative) changes correspond to a decreased (increased) hazard rate. So, using the four regions displayed in Figure 4 as examples: (a) Austin, Texas grew uniformly more dense between 1990 and 2000 and experienced little or no change between 2000 and 2006, for a net effect consistent with what took place in the 1990s; (b) parts of Las Vegas, Nevada grew more dense between 1990 and 2000 and other parts grew less dense between 2000 and 2006, for a net effect of some increased density and some increased sprawl—but in different parts of the region; (c) Phoenix, Arizona grew consistently more dense between 1990 and 2000 and consistently less dense, but not by quite as much, between 2000 and 2006, for a net effect of a moderate increase in density that may be eroded with the passage of additional time if the more recent trend persists; and (d) and Raleigh, North Carolina grew a lot more dense between 1990 and 2000 and a bit less dense between 2000 and 2006, for a net effect of increased density. Similar stories can be told about each of the thirty-two other regions in the figure. 14
Table 3 provides a more detailed taxonomy of the net (1990–2006) changes just described by listing some of the numeric data that went into generating them. Specifically, the table gives the changes in the probability of distance between nearest neighbor census tracts extending that were obtained by differencing each of the ten survival curves. In order to conserve space and facilitate readability, the rows correspond to just a few of the distances separating tract mean centers—500 meters, 1,000 meters, 2,000 meters, 3,000 meters, 4,000 meters, and 5,000 meters between nearest neighbors—but the functions themselves are continuous, so they are based on much greater detail: the spreadsheets the data was taken from have about 100 rows corresponding to distances of 0 to 5,000 meters in 50 meter increments. The table shows that the differenced survival functions yield two lines of insight into how patterns of urbanization have changed through time: (a) by reading across through the columns, the table reveals where within the regions land use has changed; and (b) by reading down through the rows, the table reveals at what spatial scales.
Estimated Spatial Hazard Functions—Distance From Nearest Neighbor
Note: LL is the log-likelihood; n × t is the number of observations in the panel; in the event that an observation/s was dropped in the estimation process, n × t is not symmetric; values in () are z-statistics; all hypothesis tests are two-tailed.
*denotes significant at 90 percent.
**denotes significant at 95 percent.
***denotes significant at 99 percent.
n/sdenotes not significant.
Taxonomy of Land Use Change
Note: Values are the change in the conditional probability of distance extending; - denotes zero or negligible.
Specific insights related to the four example regions are as follows. First, Austin experienced a sharp compacting effect staggered by distance from the regional center of gravity, where the probability of distance between nearest neighbors extending beyond certain lengths declined by about a third. The probability of extending beyond 1,000 meters fell at distances from the regional center of gravity capturing between ~5 and ~45 percent of the population; beyond 2,000 meters, at distances capturing between ~25 and ~75 percent; beyond 3,000 meters, at distances capturing between ~55 and ~85 percent; beyond 4,000 meters, at distances capturing between ~65 and ~85 percent; and beyond 5,000 meters, at a distance capturing ~95 percent. (This pattern of infill is compelling because it seems consistent with some of the density changes reported by Torrens [2008, Figure 12] but it is worth pointing out that that analysis also found that Austin’s fractal dimension dropped slightly—from 1.230 to 1.213—between 1990 and 2000, which is an indication of greater sprawl. It may therefore be a matter of where, in terms of center versus fringe, the development contributing to the change actually occurs—especially in growing regions, like Austin that are experiencing both space filling at the interior and expansion at the fringe.) Second, Las Vegas experienced an interesting mix of two different effects. The probability of distance between nearest neighbors extending beyond 1,000 meters fell by a small amount at the very center of the region (~5 percent of the population) but the probability of distance between nearest neighbors extending beyond 1,000 and 2,000 meters grew by roughly 25 percent midway (~55 percent of the population) to its periphery. Third, Phoenix grew marginally denser from the center to middle (~5–~65 percent of the population) of the region; marginally less dense close to periphery (~75–~85 percent of the population); and less dense at the periphery, where the probability of distance between nearest neighbor tracts extending beyond 3,000, 4,000, and 5,000 meters increased by about 10 percent. And, as pointed out, the 2000–2006 trend, which covers the duration of the recent housing boom in the United States, points decisively in the direction of more sprawl in Phoenix—whether or not the trend will continue now that the market and construction activity have wound down is an open question that is worth pursuing. Finally, Raleigh experienced a spatially staggered compacting effect very similar to what took place in Austin. The probability of the distance between nearest neighbor tracts extending beyond 1,000 and 2,000 meters fell by about a third at the region’s interior (~5 and ~45 percent of the population) and the same happened for probability of extending beyond 3,000 and 4,000 meters at its exterior (~55 and ~95 percent of the population). The table yields other insights too—but these are the main trends in land use change in the four regions.
Summary and Conclusion
The three central objectives of this article, now met, were: (a) to review previous research on the complexity of urbanization and explain how the spatial hazard framework accommodates that complexity; (b) to estimate a series of spatial hazard models characterizing land use in the twenty-five highest growth CBSAs of the United States areas in 1990, 2000, and 2006; and (c) to use the estimation results to track land use change region-by-region over the 16-year time frame. All that remains are a few closing comments and directions for future research.
To begin, the evidence presented in the empirical analysis of this article squares nicely with the both the classic (Alonso 1964; Mills 1971; Muth 1969) theoretical model of urbanization and newer empirical approaches that place uncertainty at the center of the analytical framework (Batty 2007; Carruthers et al. 2010; Torrens 2006, 2008). As Friedmann and Miller (1965) noticed some time ago, the American way of land use changed dramatically over the course the twentieth century and it continues to change no less dramatically in the twenty-first century. And, as people continue to grow wealthier and transport costs continue to fall, especially in the postindustrial economy, the evolutionary process that took hold with the nonmetropolitan turnaround (Beal 1975) is only going to accelerate. Contemporary urbanization is composed of layer-upon-layer of development, varies greatly by regional culture and circumstance, has a far-flung, polycentric anatomy, and is the outcome of a chaotic system of innumerable actions taken by its denizens. Yet, in spite of all of this, all of the regions addressed by the analysis, seen through the lens of spatial hazard models, exhibit striking order and a consistent overall pattern of development, no matter their own peculiarities. Thinking of urbanization as a field, rather than material, concept and treating it that way empirically is helpful because it allows for the fact that, while certain potentials prevail throughout the field, actual material conditions do not necessarily. This view also enables traditional theory to hold in an evermore-complex world, but in an explicitly uncertain and chaotic—though definitely not random—manner.
The spatial hazard approach addresses all of this and is a means of scientifically analyzing the similarities and dissimilarities of development from place-to-place and time period-to-time period. In particular, the models are a highly flexible means of: (a) operationalizing a traditional method of spatial analysis with a long and distinguished history—namely, point pattern analysis (Boots and Getis 1988; Diggle 2003)—via very powerful behavioral models of urbanization; (b) generalizing about the way of land use across a diversity of settings; and (c) standardizing development patterns in the face of their inherent complexity. As such, the approach is viable for comparing and contrasting dynamic outcomes across very elaborate urban systems.
Future research should focus on several key areas. First, both this and previous research (Carruthers et al. 2010) have applied spatial hazard models to very large metropolitan settings—so it would be interesting to apply the approach to smaller, micropolitan and rural settings. In principle, these places should exhibit the same general tendencies of land use but they merit investigation, particularly given the extreme growth (and decline) pressures that many face. Second, while spatial hazard models clearly line up well with traditional theories of land use, other, less tested, frameworks addressing the spatial distribution of activity may also be worth evaluating via the approach. For example, the “new economic geography” (see Fujita, Krugman, and Venables 1999) has gained great currency in economics, geography, regional science, and elsewhere—but has so far been subjected to only a limited amount of empirical evaluation (Head and Mayer 2004). Whether or not spatial hazard models have anything to contribute on this front is unclear at the present, but they very well may. Third, the approach has so far been applied region-by-region and not to any greater system of urbanization, like the Northeast Corridor and/or Southern California conurbations, but there is, in principle, no reason that it could not. In fact, the success realized here in comparing changes through time suggests that, if estimated as part of an urban system, land use patterns of the system’s various components could be compared in a very direct way. Last, most progress in applying spatial hazard models to urbanization thus far has been made by using census block groups or census tracts as the units of analysis. While these are typically small, neighborhood-sized units it would be even better to get down to the level of individual structures, as Kueth, Hubbs, and Waldorf 2009 do in their analysis of housing sales, including both residential and commercial buildings. Just as attributes from the census are used to explain the process generating neighborhood level points, microattribute data, if available, could be used to explore the very fabric of development. Each of these directions and more would be an excellent extension of research involving spatial hazard models.
Footnotes
Notes
Earlier versions of this article were presented at the 2008 meetings of the North American Regional Science Council in New York; the 2009 meetings of the Associated Collegiate Schools of Planning, in Alexandria, Virginia; the 2010 meetings of the Western Regional Science Association in Sedona, Arizona; and in seminars at the U.S. Department of Housing and Urban Development and the University of California, Irvine. The opinions expressed in this article are those of the authors and do not necessarily reflect the opinions of the Department of Housing and Urban Development or the U.S. government at large.
The author(s) declared no conflicts of interest with respect to the authorship and/or publication of this article.
The author(s) received no financial support for the research and/or authorship of this article.
