Abstract
This paper aims to stimulate interest in the early micro-economic conceptualisation of housing submarkets proposed by Rapkin and Grigsby, which defined market areas in terms of dwelling substitutability. Three key questions need to be addressed if a return to the Rapkin–Grigsby approach is to be achievable and worthwhile. First, what are the practical benefits? (The paper highlights a range of potential research applications that would benefit from the substitutability approach.) Secondly, in what way are existing approaches likely to be inadequate for demarcating substitutability-based submarkets? (Four criteria are proposed for assessing submarket estimation methods which existing approaches fail to satisfy.) Thirdly, what are the prospects for developing a substitutability metric that will make the Rapkin–Grigsby definition empirically feasible? (A new measure is proposed, based on the cross-price elasticity of price (CPEP), which is illustrated using data for Glasgow, Scotland.)
1. Introduction
In the seminal work of Rapkin et al. (1953) and Grigsby (1963), submarkets were defined in terms of dwelling substitutability: “A housing market area is the physical area within which all dwelling units are linked together in a chain of substitution” (Rapkin et al., 1953, pp. 9–10; quoted in Grisby, 1963, pp. 33–34). While this definition is intuitively appealing, it has been difficult to implement empirically, largely due to problems in finding a workable measure of dwelling substitutability. Nevertheless, were such a measure to be developed, it could potentially enrich a range of research topics in housing economics.
It might, for example, help us to explore the links between market segmentation and social segmentation. Using regression analysis, with substitutability as the dependent variable, we could estimate the extent to which substitutability is driven by the physical characteristics of dwellings and the extent to which it is driven by the social/ethnic/religious composition of the neighbourhood. Clearly, we cannot do this sort of analysis by way of a priori clustering of dwellings (according to administrative area, house type or neighbourhood mix, for example) 1 as we would be none the wiser in terms of understanding which drivers are most important in determining dwelling substitutability. We need a measure of substitutability that reflects the actual decisions that households make, decisions that reveal their preferences for dwelling characteristics, racial/religious/ethnic composition, access to amenities, etc.
Having a substitutability metric would also allow us to define submarkets in the spirit of Rapkin and Grigsby. Grouping dwellings into submarkets on the basis of substitutability would provide a way of summarising the preferences and choices of consumers. Submarkets, so defined, could have a range of potential applications. For example, substitutability-based submarkets may help to improve the performance of automatic valuation models (AVMs), revealing the hidden fault-lines in the local housing market, and capturing the connections between price movements of dwellings in different locations. They could also open up underdeveloped research avenues, such as the extent to which the shapes of substitutability-based submarkets tend to have compact well-defined geographies, revealing the aversion of households to living at the boundaries between residential enclaves (see Criterion 4, in section 2).
Being able to group dwellings on the basis of substitutability might also lead to important applications in house price index calculation. Dean et al. (2012), for example, find that grouping properties that do not have similar price trajectories can lead to considerable distortion in national price indices. Because the prices of close substitutes tend to move together, a reliable measure of substitutability could offer a means of grouping dwellings in a way that preserves the stability of within-group price relatives, leading to a set of index commodity weights that minimises aggregation bias. (Such a method could also find applications in the context of other complex durable goods such as cars and antiques.)
Finally, having a robust measure of the substitutability of dwellings could help us to develop more meaningful
Unfortunately, finding such a metric has proved elusive. In its absence, the micro-econometric analysis of submarkets 2 has turned either to a priori clustering (grouping dwellings according to dwelling or neighbourhood attributes); or to hedonics; or to some combination of the two. Hedonic methods have, of course, been extensively used in the wider housing economics literature, typically in one of two forms, labelled forthwith as types 1 and 2. 3 Type-1 hedonic analysis uses regression to control for variation in the mix of dwelling attributes when developing a house price index or valuation model. The method can be interpreted as a practical tool and does not require a particularly restrictive interpretation of regression coefficients (covariates are treated as controls/proxies for a mixture of observed and unobserved factors that affect house prices). Type-2 hedonic analysis, on the other hand, relies rather heavily on interpreting regression coefficients as measures of economic value and this interpretation has been subject to formidable critiques from leading economists (Epple, 1987; and Bartik, 1987, for example).
Unfortunately, it is very much the type-2 hedonic approach that has been utilised in the submarkets micro-econometrics literature, 4 which typically clusters areas according to similarity of attribute prices. As demonstrated later, this approach is not only flawed in a number of fundamental ways, it is also difficult (if not impossible) to reconcile with the Rapkin–Grigsby notion of substitutability-based submarkets.
How, then, might we find an appropriate metric that will place submarkets research on a more solid footing and also open up these fresh avenues of research? In section 2, the type-2 hedonics method of submarket demarcation, and also the a priori attribute-cluster method, are scrutinised through the lens of substitutability to establish whether either can be reconciled with a Rapkin–Grigsby approach. A new method of measuring substitutability is then proposed in section 3 based on CPEP (cross-price elasticity of price). Section 4 considers how CPEP could be used to explore the existence and spatiality of submarkets and section 5 provides an empirical application. Section 6 concludes with a summary of findings and suggestions for future research.
2. Problems with Type-2 Hedonics and a priori Clustering as Methods of Defining Substitutability-based Submarkets
Consider the following inventory of housing market entities and definitions
Household: b = 1, 2, …B
Dwellings (or blocks of dwellings):
Attribute vector for dwelling i:
Attribute price vector for dwelling i:
Homogeneous attribute price vectors for i and j:
Euclidean distance between i and j:
Cross-price elasticity of price (CPEP) between i and j:
Homogeneous attribute price vector: HAPV
Let
If HAPV is applied as a necessary condition, it is implicitly assumed that dwellings in the same submarket will have the same attribute prices
This allows us to deduce (using modus tollens) that, if i and j do not have the same attribute prices then they will not be elements of the same submarket
This, too, is problematic. As explained later, HAPV is unlikely to hold as a necessary condition because it is perfectly possible to observe different attribute prices on two dwellings even though they belong to the same submarket. In summary, homogeneous attribute prices can be a false indicator of common membership of a submarket and heterogeneous attribute prices can be a false indicator of market segmentation. Similar problems arise with regard to a priori clustering of attributes: to demarcate substitutability-based submarkets we really need some measure of how the market groups dwellings in terms of substitutability, rather than assuming a particular substitutability function a priori.
These arguments are now developed more fully, woven around a set of four robustness criteria for developing SEMs (substitutability-based submarket estimation methods).
2.1 Criterion 1: SEMs should be Robust to Complex Interaction Effects
Rationale: The existence of many to many mapping of means and ends (MMMMEs) and transformative interaction effects (TIEs) implies that properties can be in the same submarket but have different attributes and different attribute prices.
MMMMEs exist because the same human need can be met in different ways, and the same housing bundle will meet different needs for different people. Consequently, homes having few common attributes and very divergent attribute prices could nevertheless be thought of as close substitutes. An important driver of MMMMEs, and one that makes predicting the global set of MMMMEs considerably more elusive, is the existence of transformative interaction effects (TIEs). TIEs occur when the effect of an attribute (whether geographical or structural) is fundamentally transformed when placed in a particular context, such as being combined with another attribute (either geographical or structural).
To illustrate, consider the options for crossing the English Channel. 5 Suppose ferries, planes and trains are all genuinely close substitutes—a change in the price or availability of one will have a large effect on the demand for the others (they are all in the same market for transferring passengers across the English Channel). 6 While each mode of transport has contrasting physical features, these features combine to offer a similar service because of TIEs. This leads us to question the meaningfulness of a priori clustering of attributes or relying on attribute price differences as a gauge of substitutability. Whereas wings on trains are of no value, wings on aeroplanes are essential. The attribute price of wings could differ dramatically between trains and planes because the utility of wings is transformed when connected to a jet engine and aerodynamic fuselage (an example of TIEs). Similarly, while vital to the functioning of trains, wheels will be of little value on ferries.
Note that, if we were to persist with the application of HAPV as our means of transport submarket delineation, the divergent attribute prices on wings and wheels across transport modes would lead us to the erroneous conclusion that the different modes belong to different submarkets. The error arises because, although the physical attributes of an aeroplane are quite different from those of a ferry, the final service bundles (comfortable, speedy travel) may be very similar.
Decomposing dwelling prices into the prices of constituent parts is unhelpful when measuring substitutability—what matters is the final service bundle. Different dwellings in different locations can provide remarkably similar levels of utility to a particular consumer despite the structural differences. Empirical estimation is frustrated because the service bundle provided by a house and its location is so multifaceted. MMMMEs and TIEs may be subtle and unforeseen, causing different combinations of physical and locational attributes to interact in different ways for different people. Whether attribute prices are observed as homogeneous or not becomes irrelevant to the question of substitutability, in the same way that the price of wings on automobiles vs the price of wings on aeroplanes is an absurd comparison and irrelevant to the cross-price elasticity of demand between transport modes.
Note also that dwellings can be in the same attribute-cluster group but not be in the same substitutability-based submarket; and they can be in the same substitutability-based submarket but not be in the same attribute-cluster group. Statistical clustering of dwellings by physical attributes (and grouping of attribute variables into product bundles) has become popular in the submarkets literature (for example, Maclennan and Tu, 1996; Leishman, 2009), but may not reflect how consumers group them, which may be nuanced and difficult to anticipate, not least because of the interplay between MMMMEs and TIEs.
We would like to group properties according to a variable that captures market behaviour (i.e. substitutability), rather than an a priori categorisation of the dwelling stock (for example, by property type or social mix). In transactions data, the only behavioural variable typically recorded is the selling price, but this is the dependent variable in hedonic analysis and therefore precluded from the clustering process.
Note that grouping properties by type of resident is also problematic: while two consumers may have different preference maps, this does not mean they will disagree on whether two properties are close substitutes—they may consider the dwellings to be substitutable but for different reasons. And even if we were able to anticipate individual rankings of substitutability, the outcome at the level of the market is made fundamentally unpredictable by the Condorcet paradox—transitivities that hold at the level of the individual do not always hold in the aggregate. 7
Ideally, one would like to cluster by market-level substitutability, but this requires a way of measuring the degree of substitution, which has hitherto proved elusive.
2.2 Criterion 2: SEMs should be Robust to the Continuity of Substitutability Space
Rationale: Substitutability-based submarkets can happily exist along a frictionless continuum and so continuous differentiability of the substitution function linking i and j in Cartesian space will neither require nor preclude i and j from belonging to separate substitutability-based submarkets.
Goodman and Thibodeau (1998) argue that attribute price differences persist because of unobserved amenity effects. However, this suggests that heterogeneous attribute prices (and hence submarkets) in hedonic regression only exist because of omitted amenity variables. In principle, therefore, if one were to construct a model of an efficient market that captures all amenity affects (both social and economic), there would be no difference across dwellings in the marginal prices of the structural and locational attributes and their interactions; and (by their definition) no such thing as submarkets.
But why disentangle locational and structural attributes for the purposes of submarket definition? Using Rosen’s (1974) terminology, we are considering a class of commodities—i.e. homes—that are described by A attributes,
An alternative justification of persistent attribute price differences is to assume that they are caused by market inefficiencies and frictions (such as imperfect competition among households and the inelastic demand and supply of housing service; see Schnare and Struyck, 1976). While this is feasible, it leads again to a weak definition because it means that submarkets only exist if markets are inefficient or inflexible. This is problematic if one is seeking to derive substitutability-based submarkets because variation in substitutability of dwellings could persist even in a world of perfectly efficient markets. Heterogeneous dwellings, heterogeneous locations, heterogeneous preferences, MMMMEs and TIEs can all exist in a frictionless world and would cause substitutability to vary across dwellings, even if the supply of each dwelling type was perfectly elastic and all market participants were perfectly informed.
In summary, variations in substitutability within the urban system can co-exist with the presence or absence of market inefficiencies (as in Schnare and Struyck, 1976) and/or discrete breaks in the land rent surface (as in Goodman and Thibodeau, 1998). HAPV type-2 hedonic methods for identifying submarkets, however, rely heavily on such inefficiencies and discontinuities and so are not well suited to the task of delineating substitutability-based submarkets.
2.3 Criterion 3: SEMs should be Robust to Unobserved Attribute Variation
Rationale: If dwelling characteristics are spatially clustered, but not fully described in our hedonic data, then observed differences in attribute prices can occur even when actual attribute prices are homogeneous and observed spatially correlated errors in hedonic price regressions may simply reflect unobserved attribute heterogeneity.
Dwellings are heterogeneous, but so are their attributes. For attribute price differentials to evince HAPV submarkets, we must know all variations in attribute quality and quantity. That half a tank of petrol costs less than a full tank is not evidence of the existence of submarkets in the supply of petrol. Likewise, apparent differences in price per room between tenements and modern flats may reflect unmeasured differences in room size (for example, tenement rooms may have higher ceilings), rather than HAPV submarket boundaries. Unfortunately, full information on the quality and quantity of every attribute of every dwelling is rarely, if ever, available. The measurement errors that result will not be random but correlated with building types, which in turn are likely to be clustered across space. Coefficient shifts in hedonic regressions may therefore be more likely to be co-terminous with the spatial pattern of measurement errors than market segmentation. This means that the boundaries extrapolated from spatially autocorrelated errors (as in Tu et al., 2007) may have little or no correspondence with substitutability-based submarkets.
Note that attribute measurement errors are unlikely to affect a substitutability metric based on the price of the overall housing bundle; nor would they affect definitions of submarkets based on such a metric. If two dwellings genuinely belong to the same substitutability-based submarket, one would expect the price of the overall housing bundle to respond in a similar way to demand and supply shocks, irrespective of attribute prices. Focusing on the dynamics of the sale price of the entire housing bundle (which is generally measured with precision) rather than individual attribute prices (which are not) as the basis for submarket analysis, is likely to be a more fruitful way for submarket research to develop (see section 3).
2.4 Criterion 4: SEMs Should Neither Impose nor Assume Convexity, Compactness, or Contiguity
Rationale: If substitutability is granular (non-contiguous), non-convex or non-compact in Cartesian space, Euclidean distance will not adequately describe the spatiality of submarkets.
Clapp and Wang (2006) explicitly impose a convexity restriction on the shape of submarkets in Cartesian space. That is, they draw submarket boundaries in such a way as to ensure the shape does not have any holes, intrusions or protrusions, ensuring that any straight line drawn between two points in a submarket will be entirely contained within it. Most other authors ignore the shape of submarkets altogether, or inadvertently impose a degree of convexity by the type of submarket delineation method employed.
This means that, while the submarkets literature has explored the degree to which submarkets are structural or spatial (see review by Watkins, 2001), they have almost entirely overlooked the role and meaning of submarket shape, which could profoundly affect the reliability of Euclidean distance as an approximation of substitutability. For example, if substitutability-based submarkets are elongated, fragmented or contain holes (as in the concentric circles of the access-space model), dwellings can be far apart but still close substitutes.
This has profound implications for the reliability of using Euclidean distance to approximate submarket effects, which in turn affects how we might compute
Being able to reveal the true geography of submarkets would also allow us to explore what determines their shape, such as the impact (intended and unintended) of planning decisions, or the sorting effect of the housing market (Schelling, 1971; Meen and Meen, 2003). Preference for racial and social homogeneity may minimise submarket perimeter length because of aversion to living at the boundary (Rose-Ackerman, 1975), leading to compact, convex shapes. However, there may be other factors (the cumulative history of residential planning decisions, local amenities, radial and orbital transport links, heterogeneous preference for mix, etc.) that mitigate the minimum border length hypothesis. Indeed, the concentric rings of access-space theory would produce highly non-convex sets in Cartesian space. Maclennan, however, argues that In the early phase of urban development, the most affluent and influential social and economic group were not sufficiently numerous to occupy a complete residential ring of the city. Instead, they tended to gather within a well-defined area or sector on one side of the city centre (Maclennan, 1982, p. 23).
As the city develops, one might therefore expect the city to comprise a patchwork of residential enclaves, each with its own core and periphery. Submarkets of the type described by Maclennan may be equally non-convex in Cartesian space, but made up of many sets of concentric circles centred on multiple cores, rather than a single sequence centred on the CBD.
If we could measure the true configuration of the urban housing market, we would presumably be able to verify which of these theories dominates in particular urban contexts, leading to a taxonomy of submarket structures for world cities.
3. Deriving a Substitutability Approach to Submarkets
The traditional approach to measuring the substitutability of two goods is to estimate their cross-price elasticity of demand (CPED). Unfortunately, estimating demand functions for dwellings is problematic. “Observed marginal hedonic prices … reveal little about underlying supply and demand functions” (Rosen, 1974, p. 50; see also Epple, 1987; and Bartik, 1987). Rothenberg et al. (1991) made a concerted effort to measure substitutability without estimating CPED, but unfortunately their approach also relied heavily on the meaning and stability of hedonic coefficient estimates.
The alternative proposed here attempts to approximate CPED using the cross-price elasticity of price (CPEP). CPEP does not rely on type-2 hedonics but exploits, instead, the dynamics of the market to reveal substitutability.
Proposition: If demand and supply curves slope downwards and upwards respectively, the cross-price elasticity of price will have a strictly positive one-to-one relationship with the cross-price elasticity of demand.
Intuitively, the CPEP approach to substitutability can be understood as follows. Dwellings i and j are substitutes if a rise in the price of j leads to an increase in the demand for good i; hence, CPED > 0. Conversely, if i and j are complements, then CPED < 0. Now consider the following corollary. If j is a close substitute, a rise in the price of i causes a large increase in the demand for dwelling j and, if the supply of homes is less than perfectly elastic, the short-run effect of the increase in demand for j will be an increase in the price of i. That is,
The argument can be expressed more formally by considering the following equilibrium condition in the market for dwelling type i
where,
By implicit differentiation of equation (1), the cross-price elasticity of price is derived as
Provided all prices are positive (
Now compare equation (2) with the formula for CPED (derived again by implicit differentiation of equation (1))
Again, provided prices and quantity are positive (
where,
The numerator will always be positive, as will the denominator, so long as the demand and supply curves for dwelling i slope downward and upward respectively. It follows that the CPEP will be monotonically increasing in the CPED
Since the CPED is a measure of substitutability, it also follows that CPEP can be interpreted as a proxy. Crucially, however, CPEP does not require us explicitly to decompose the demand function or rely on type-2 hedonics. Instead, we can approximate
If
4. Using CPEP to Investigate Sub-market Existence and Spatiality
4.1 Existence
CPEP leads to a natural test for the existence of submarkets. If CPEP = 1 for all pairs of dwellings, then all dwellings are perfect substitutes and there is no market segmentation,
if
We can represent the non-existence of submarkets by plotting substitutability against

Non-spatial and spatial submarkets.
4.2 Spatiality: The Global Effect of Distance
It would be useful for intercity comparison to have a summary measure of the spatiality of the entire urban submarket system. Using our price-dynamic approach to measuring substitutability, a global indicator of Euclidean substitutability (GIES) for an urban area is given by gradient
For a simple generic measure of the effect of distance on substitutability, one could assume CPEP to be approximately linear in logged distance:
Why might we expect there to be a scatter, rather than a line, of points in
4.3 Lattice of Substitution and the Shape of Submarkets
Having decided on a measure that allows us to gauge the substitutability between a given dwelling
Submarkets can then be thought of as schema for summarising and simplifying the family of substitution lattices for an urban area. The simplest derivation of a set submarkets is one that categorises a single lattice of substitution into groups with similar levels of substitution with respect to a particular dwelling i = 1. We label this a first-order categorisation (FOC) and it is essentially a matter of identifying contour lines of substitutability with respect to a particular dwelling. Second-order categorisation (SOC) entails clustering according to two substitution lattices,
We might choose randomly the dwelling that constitutes the basis for
Note that these clustering processes do not impose spatiality because dwellings are clustered in substitutability space not Cartesian space. This is important because, while the outcome of this process may well lead to systematic patterns in Cartesian space, any apparent spatiality of clustering outcome will not have been caused by the method and therefore should reflect the true spatiality of housing substitution.
5. Empirical Illustration
To illustrate, consider now the application of the CPEP method to 33 680 GSPC residential property transactions in Glasgow, Scotland, all spatially coded 9 and with attribute information, for the period 1999–2007. Quarterly house price inflation time-series were computed for 10 000 individual dwellings using the following procedure.
First, estimate a third-order Taylor series approximation of the house price surface in Cartesian space for each year. Extending the parlance of Fik et al. (2003),
10
time–location value signatures (TLVS) were estimated using flexible functional forms that include interactions between attributes, x and y co-ordinates for the dwelling, area dummies
11
(based on a priori information on where likely shifts in the price surface may lie) and quarter dummies for each year’s surface estimation. Insignificant variables and dummies were then eliminated using a stepwise procedure. Note that each TLVS was estimated independently for each year, allowing coefficients complete freedom to vary over years. Coefficients were also allowed to vary over space through interactions with x and y co-ordinates and area dummies. Dwelling attributes are included to control for the mix of properties selling in a particular time-period. Quarterly time dummies are also included (and allowed to interact with dwelling type and location). The adjusted R2 results for all nine TLVS regressions were as follows: 0.73 (1999), 0.73 (2000), 0.76 (2001), 0.71 (2002), 0.63 (2003), 0.58 (2004), 0.61 (2005), 0.64 (2006) and 0.71 (2007). Having estimated a TLVS for each year with quarterly slope and intercept dummies, an estimated price, Pi, can then be computed for each i in every time-period (quarter):
Secondly, estimate inflation surfaces for each intervening time-period and extract time-series for each of the constant-quality dwellings. By calculating the vertical distance for each i between each successive TLVS, we obtain the absolute price change (
5.1 Application (a): Testing for the Spatiality of Housing Submarkets
Calculating

Scatter plot of cross-price elasticities and distance between dwellings with fractional polynomial line of best fit.
Given the downward-sloping relationship between substitutability over short distances, there is prima facie evidence in Figure 2 that submarkets have a spatial component. The value of
Nevertheless, it is clear from the very low R2 values that the substitutability between dwellings must have a large non-spatial component—at least in terms of the capacity of simple Euclidean distances to capture spatiality (even in the spline regression, 92 per cent of the variation in CPEP is due to factors other than Euclidean distance). This provides support for the non-spatial conception of submarkets (an important theme in Rothenberg et al., 1991) and an imperative to explore further the shape of submarkets—the existence of convexity, granularity and non-compactness renders distance an incomplete measure of submarket spatiality.
5.2 Application (b): Improving Automatic Valuation Models (AVMs)
One of the potentially useful benefits of developing substitutability-based submarkets is that they could potentially improve AVMs. To test whether CPEP submarkets have the potential to add anything to the goodness of fit of AVMs, we compare the adjusted R2 results from three simple hedonic AVMs 12 estimated on 2007 GSPC data. In the first AVM, no submarket dummies were included and the adjusted R2 was around 0.51. In the second AVM, realtor jurisdiction dummies were included to demarcate submarkets (as in Palm, 1978; Michaels and Smith, 1990, and others) and the adjusted R2 rose to 0.58. In the third AVM, submarket dummies based on CPEP third-order clustering were included instead of the realtor jurisdiction dummies and the adjusted R2 rose to 0.63, with all CPEP dummies having significance and <0.01 and all but one having significance and <0.001. These simple diagnostics are far from conclusive proof, but they do tentatively suggest that substitutability-based submarkets could be a useful development in the evolution of AVMs.
5.3 Application (c): Estimating the Shape of Substitutability-based Submarkets
Criterion 4 suggests that the shape of submarkets could itself be informative, revealing the degree of aversion to living at the boundary between residential enclaves. Shape also has potentially important implications for the design of spatial econometric house price models/AVMs because non-convex spatially complex submarkets are likely to undermine the ability of traditional weights matrices to capture spatial effects in house price regressions.
When we plot in Cartesian space the contours of the substitution lattice for randomly selected dwelling

Substitutability lattice for dwelling
In summary, there is evidence of non-convexity, non-compactness and non-contiguity both in the one-lattice contour plot and in the three-lattice cluster analysis. Moreover, there is no evidence of the concentric rings predicted by the standard urban economic model—if anything, the clustering of dwellings is more radial than orbital.
6. Conclusion
This paper has sought to present a case for developing an empirical measure of dwelling substitutability. The case rests partly on the potential benefits of obtaining a substitutability metric (for example, it could offer new ways of exploring old questions and potentially highlight new ones); and partly on the problems associated with using either homogeneous attribute prices, or a priori attribute-clustering, to derive substitutability-based submarkets. To derive such submarkets, one ideally needs a method that is robust to: (1) complex interaction effects between structural and locational attributes; (2) the continuity/discontinuity of substitutability space; (3) unobserved attribute variation; and (4) complexities in submarket shape (particularly non-contiguity, non-convexity and non-compactness).
The cross-price elasticity of price (CPEP) was proposed as a measure of substitutability. CPEP, as defined here, does not rely on type-2 hedonics. It does not decompose dwellings into constituent attributes, nor does it cluster by attributes. Dwellings are instead treated as inseparable bundles—so it is likely to be more robust both to transformative interaction effects and to attribute measurement errors. Moreover, CPEP does not rely on discontinuity in house price surfaces (because substitutability—and hence submarkets—can exist along a continuum); and it does not appear to impose or assume convexity, compactness or contiguity in submarket shape (dwellings are clustered in substitutability space rather than Cartesian space, hence revealing, rather than imposing, the geographical pattern of market areas).
CPEP was defined in terms of market equilibrium (see equation (1)). Yet would the measure have any meaning in a world characterised by disequilibria, where prices are in a perpetual state of flux, never quite reaching equilibrium before having to adjust to some new perturbation to demand or supply; or responding to local/submarket perceptions and bidding conventions (Pryce, 2011; Levin and Pryce, 2007, 2011)? In such a world,
This is all well and good in theory, but is CPEP worth the effort in practice? It is too early to provide a conclusive answer, but there are some encouraging signs from the illustrative analysis presented here that the method could be of genuine practical use. Preliminary evidence suggests, for example, that CPEP submarkets could improve the performance of AVMs, but more thorough testing using a comprehensive set of model types is needed to establish whether CPEP submarkets consistently outperform other ways of defining submarkets in terms of boosting valuation accuracy. It also raises the possibility of an iterative approach to CPEP estimation. For example, one could start with an a priori set of submarkets (for example, based on realtor jurisdictions) in order to estimate an initial set of CPEP submarkets. One would then use these CPEP submarkets to improve the type-1 hedonic price change estimation. This in turn would lead to a more robust set of CPEP estimates, and so on. Note that none of this relies on type-2 hedonics.
Tentative work by Brown et al. (2012) suggests that CPEP could also be used to analyse the determination of substitutability—exploring, for example, the extent to which substitutability is determined by distance between dwellings, differences in dwelling attributes and differences in location attributes. (For example, a regression of
In the current paper, graphical inspection of results suggested a degree of non-convexity, non-compactness and granularity in submarket shape. However, more work is needed to develop a set of quantitative measures of convexity, compactness and granularity, which would pave the way for establishing a taxonomy of cities, with the long-term goal of comprehensively classifying structures of residential segmentation in cities across the world. Further work also needs to be done, however, to establish how stable are the boundaries of substitutability-based submarkets and how drivers of substitutability might change in relative importance (both over time and across cities).
Finally, while this paper has attempted to utilise type-1 hedonics to overcome the problems of type-2 submarket delineation, it should be noted that type-1 hedonics are not the only way of estimating the dwelling-level or block-level house price time-series needed to compute the CPEP metric. Yet another avenue for future research, therefore, would be to develop alternative ways of computing CPEP and related measures (using repeat sales methods or trajectory clustering, for example). Moreover, continued advances in regression estimation methods mean that there will be many opportunities to improve the type-1 hedonics used here to estimate the CPEP lattices. All hedonic estimation methods are subject to error, particularly with respect to how one captures unobserved heterogeneities between dwellings. Nevertheless, these errors could, in principle, be reduced by employing highly flexible methods, such as geographically weighted regression, to give more locally nuanced estimates of home values in each period.
Footnotes
Acknowledgements
The author is grateful to Duncan Maclennan and the seminal writings of Bill Grigsby for inspiring his initial interest in housing submarkets. The author would also like to express thanks to George Galster, Nema Dean, Chris Leishman, Eric Levin and numerous anonymous referees who have helped to refine and clarify the ideas presented here. The author is grateful to GSPC and Experian for supplying the house price and neighbourhood data. An earlier version of this paper (entitled: ‘Housing market segmentation: the theory and measurement of submarkets’) was awarded the 2009 Hypoteční Banka Prize for the Best European Network of Housing Research Conference Paper.
Funding
Thanks are due to the Department of Communities and Local Government for funding the research on submarkets that led to this work. The research was also part-funded by the EPSRC Socio-economic Model and Community Impact Simulators project (EPSRC grant reference: EP/F037716/1).
