Abstract
In this paper, we propose novel local and global models of street network entropy that measure levels of navigability given only limited local directional information. These models are defined for individual locations and entire street networks. Both models are derived using a generalised model of entropy from the field of game theory, which considers a decision-maker attempting to perform a task in the presence of incomplete information. We argue that the proposed models are more interpretable and useful than existing models of street network entropy since they measure the uncertainty of navigation, which is the task street networks are intended to facilitate. We demonstrate this utility by performing an empirical analysis of the entropy properties of UK city street networks.
Introduction
A street network is a type of transportation network corresponding to the set of streets contained in a given geographical region such as a city boundary. The properties of street networks directly influence many aspects of our society including where homes and services are located plus patterns of travel. Therefore, modelling properties of street networks represents an important research problem. There are several street network models that measure many types of properties including topological, geometrical and centrality properties. Each of these models has different applications and uses. For example, models of centrality may be used to help identify elements of a street network that a large proportion of routes pass through and therefore it may be useful to limit traffic congestion at these elements. In this work, we consider the problem of modelling the entropy of street networks. In the context of information theory, entropy is a measure of the uncertainty with respect to a given property (Gudmundsson and Mohajeri 2013). The (Shannon) entropy of a given property, modelled as a random variable X on the set {x1, …, x
n
} with probability mass function p (x1), …, p (x
n
), is defined as follows
A street network is most commonly represented as a graph embedded in the plane (Barthélemy 2011). As such, there exists a large array of entropy models for general graphs that can be applied to street networks (Dehmer and Mowshowitz 2011; Marin et al., 2022). However, these models do not consider the spatial properties of street networks and are therefore rarely considered. Instead, the two most commonly used models of street network entropy are defined with respect to the properties of street length and orientation (Mohajeri and Gudmundsson 2014; Boeing 2019b). These models have several useful applications because they have been demonstrated to be correlated with other properties of interest. Firstly, it has been demonstrated that these models of entropy can be used to identify the time period during which the street network in question was constructed (Gudmundsson and Mohajeri 2013). Specifically, street networks constructed during older time periods tend to be less structured and have higher entropy than those constructed during more recent time periods. Secondly, these models of entropy can be considered an indicator of the directness of shortest routes (ratio of street network distance to great circle distance) and how easy it is to determine these routes given limited information. This is a consequence of the fact that street networks with a grid pattern, which exhibit high levels of both these properties, have relatively small entropy (Boeing 2019b).
The principal purpose of a street network is to facilitate the task of navigating between locations. In this work, we propose novel local and global models of street network entropy that measure the navigability or uncertainty with respect to performing this task. Note that the model of Shannon entropy in equation (1) is defined for a property or random variable and not a task. It cannot, therefore, be used to define a model of entropy defined for navigating. To overcome this challenge we consider a generalised model of Entropy from the field of game theory that measures the uncertainty with respect to a decision-maker attempting to perform a task in the presence of incomplete information (DeGroot 1962; Grünwald and Dawid 2004). We use this generalised model to derive the proposed models of entropy whereby we measure the uncertainty with respect to a decision-maker attempting to navigate given only local direction information. This information constraint is motivated by the fact that it is known to be an accurate model of how humans navigate street networks (Bongiorno et al., 2021). Since the proposed models of street network entropy measure the uncertainty with respect to navigation (as opposed to a more abstract feature), we argue that these models are more interpretable than existing models. This in turn increases the potential for useful analysis to be performed using these models. Furthermore, to the authors’ knowledge, the proposed local model of street network entropy is the first of its kind. The local nature of this model lends itself to many new useful applications. This includes identifying locations in a street network that exhibit poor navigability and therefore may require additional signage to prevent drivers, cyclists or pedestrians from taking incorrect turns.
The layout of this paper is as follows. We first review related works on human navigation and motivate the proposed models of street network entropy. Next, we review necessary background material on generalised entropy and, in turn, derive the proposed models. Next, we demonstrate the usefulness of these models by performing an analysis of UK city street networks. In doing so, we examine the relationships existing between the proposed models of street network entropy, existing models of street network entropy, and other properties of street networks. Finally, we conclude this work and discuss some limitations of the proposed models.
Related works on human navigation
Human navigation has been widely studied in many fields including network science, geographical information science and cognitive psychology. In this section, we review existing works that motivate the proposed models of street network entropy.
Individuals use many different criteria when selecting a route to navigate. This includes the length of, the number of turns in and the amount of congestion along the route (Golledge 1995; Golledge and Gärling 2004). In many cases, the individual will have prior spatial knowledge of the geographical region in question that will help to complete the navigation task. This spatial knowledge is most commonly referred to as a cognitive map. A cognitive map will generally contain information about the locations of landmarks, which can be defined as distinctive features in the environment that help individuals to orient themselves (Siegel and White 1975). Examples of landmarks include a church or a pub. Apart from a cognitive map, individuals will commonly use external sources of spatial knowledge to help navigation such as a GPS device and map (Corcoran et al., 2014). In many cases, these sources will contain references to landmarks.
With incomplete spatial knowledge, an individual needs to intelligently reason and make inferences to successfully navigate. It has been demonstrated that individuals have varying abilities for successfully navigating in this context, and this ability is correlated with factors including gender, age, economic wealth and gender inequality (Coutrot et al., 2018). Note that many experiments that evaluate an individual’s ability to navigate are performed in simulation, which is predictive of an individual’s ability to navigate in the real-world (Coutrot et al., 2019). Coutrot et al. (2022) found that individuals that grew up in geographical regions where the corresponding street network had higher entropy had a superior ability to navigate more complex environments. In this work, street network entropy was defined with respect to the property of street orientation.
It has been hypothesised that, with incomplete spatial knowledge, individuals will use heuristics to navigate street networks (Bongiorno et al., 2021). A heuristic is a useful method for making decisions when there are insufficient resources or knowledge for making an optimal decision. Hochmair (2005) and Lee and Holme (2012) proposed a heuristic that greedily navigates in the direction of the destination. Bongiorno et al. (2021) later empirically demonstrated, through an analysis of trajectory data, that this heuristic accurately models the navigation behaviour of humans. This finding motivates our proposed models of street network entropy, which will assume the use of this heuristic for navigation.
Navigability entropy
This section is structured as follows. First, we describe a generalised model of entropy from the field of game theory. Next, we use this model to derive local and global models of street network entropy.
Generalised entropy
The generalised model of entropy described in this section uses notation and terminology drawn from (Grünwald and Dawid 2004).
We consider the following decision problem, which involves a game between a decision-maker and an adversary (Grünwald and Dawid (2004) refer to the adversary as ‘nature’). Let
Let Γ denote the set of probability distributions defined on
An action
Navigability entropy
In this section, we describe how generalised entropy can be used to define local and global models of street network entropy. Let G = (V, E) be the edge-weighted directed multigraph representation of a given street network where the set of vertices V correspond to intersections and dead-ends, the set of edges E correspond to street segments, and the weighting function
We assume that a decision-maker wants to navigate from a source vertex s ∈ V to a destination vertex d ∈ V along a shortest route in G. Note that more than one shortest route may exist between a given pair of source and destination vertices. Let S: V → 2
V
denote a mapping from a vertex appearing in a shortest route to the set of vertices that appear immediately after that vertex in a shortest route. To illustrate this mapping, consider the toy street network displayed in Figure 1(a). If we consider the case where the source vertex s equals v1 and the destination vertex d equals v8, there are three shortest routes of length 3. These are the path v1, v4, v7, v8, the path v1, v4, v5, v8 and the path v1, v2, v5, v8. In this context, S (v1) = {v2, v4}, S (v2) = {v5} and S (v4) = {v5, v7}. For the decision-maker to successfully navigate a shortest route, when at a vertex v
i
in a shortest route, they must select an element from N(v
i
) that is also an element of S(v
i
). In layman’s terms, they must select a vertex that keeps them on a shortest route to the destination d. Two toy street network graphs are displayed in (a) and (b). The graph in (a) has a perfect grid pattern and therefore contains no cul-de-sacs. Each edge is bidirectional and has a length equal to 1. The graph in (b) does not have a perfect grid pattern and contains several cul-de-sacs. Each edge is bidirectional and has a length equal to 1 or 0.5.
We model the above decision problem as a basic game
The definition of
The bearing difference between two bearings is defined by the function Δβ: [0, 360] × [0, 360] → [0, 180]
When at a vertex v
i
in a shortest route, we assume the decision-maker can compute Δβ between the vertex v
i
and each vertex in the set N(v
i
). We also assume that they can compute Δβ between the vertex v
i
and the destination vertex d. These values could, for example, be computed using a GPS device given knowledge of the latitude and longitude coordinates for d. As discussed in the related works section, it has been demonstrated that a heuristic used by individuals when navigating is to select the vertex in N(v
i
) that minimises the angle difference between the bearing from v
i
to this vertex and the bearing from v
i
to d (Hochmair 2005; Lee and Holme 2012; Bongiorno et al., 2021). In layman’s terms, this corresponds to navigating, as far as possible, in the direction of the destination d. The heuristic in question is defined by the function f: V × V → V
To illustrate this heuristic consider again the toy street network graph displayed in Figure 1(a), and the case where the source vertex s equals v1 and the destination vertex d equals v8. When at vertex v1, the heuristic will select v2 or v4 as the next vertex. When at vertex v2, the heuristic will select v5 as the next vertex. We assume that the decision-maker uses the above heuristic when attempting to navigate a shortest route. We model this assumption by defining the set of actions
This equality results from the fact that
The above loss function means that when the decision-maker is at a vertex v
i
in a shortest route, they will incur a loss of 0 if the function
As previously discussed, in this article we define both local and global models of street network entropy. Here, the local model is defined for individual vertices while the global model is defined with respect to an entire street network. The definition of these models is presented in the following two subsections and differs in how the probability mass function P is defined.
Global model
We derive the global street network entropy model as follows. We define the probability mass function P to equal the probability mass function with which X occurs when the decision-maker is navigating the shortest routes. There are
Using this, we can now derive a plug-in estimation of global street network entropy
The toy street network displayed in Figure 1(a) has a global entropy value of 0. This is a consequence of the fact that the heuristic f will navigate shortest routes in a perfect grid pattern in all cases. In contrast, consider the toy street network displayed in Figure 1(b), which contains many cul-de-sacs. The global entropy value for this street network is greater than 0 and approximated by our model to equal 0.32. This is a consequence of the fact that the heuristic f does not navigate shortest routes in all cases. For example, consider the case where the source vertex s equals v4 and the destination vertex d equals v8. When at vertex v4, the heuristic will select vertex v10 as the next vertex. This vertex corresponds to a cul-de-sac and is not on a shortest route to the destination v8.
Local model
We now derive the local vertex entropy model as follows. For a given vertex s, we define the probability mass function P to equal the probability mass function with which X occurs when the decision-maker is located at s and is navigating to a vertex d ≠ s. We again use plug-in estimation to estimate this value. To do this, we sample with replacement m vertices from the set V − {s}. Let us denote this set of vertices {d1, …, d
m
}. For each pair (s, d
j
) we now compute a shortest route
Using this, we can now derive a plug-in estimation of local street network entropy with respect to the vertex s
This calculation involves using Dijkstra’s algorithm to compute a shortest path tree connecting s to each of the m selected vertices. Doing this for each s ∈ V leads to an overall complexity of O (|V|(|E| + |V| lg |V|)).
From the final approximate equality above, we see that estimating local vertex entropy reduces to computing the proportion of times that, when at the vertex s, the heuristic f
Note that a vertex with degree one, which corresponds to a cul-de-sac, will always have a local vertex entropy value of 0.0. This is because its single neighbouring vertex must be on a shortest route if one exists. Also observe that computing the local entropy model can be computationally expensive for large street networks because it requires computing equation (12) for each vertex in the street network.
Each vertex in the toy street network displayed in Figure 1(a) has a local vertex entropy value of 0. Again, this is a consequence of the fact that the heuristic f will navigate shortest routes in a perfect grid pattern in all cases. In contrast, each vertex in the toy street network displayed in Figure 1(b) has a local vertex entropy value greater than 0.0 apart from the vertices with degree one.
The local and global models of street network entropy presented above are related in the following way. The global model equals the expected loss L with respect to all decisions made when navigating from a random source vertex to a random destination vertex. On the other hand, the local model equals the expected loss L with respect to the first decision when navigating from a fixed source vertex to a random destination vertex. Note that, since both the local and global models approximate expected values, the entropy values in question for cities of different shapes and sizes can be directly compared without any form of normalisation.
Results and analysis
Details of the 48 UK city street networks considered in this study. For each street network, the number of vertices (|V|), the number of edges (|E|) and the geographical area (measured in km2) is provided.
The intersection density (ID) values, the street segment orientation entropy (OE) values, the route directness (RD) values and the global street network entropy (NE) values for each city street network are displayed.
An analysis of the global entropy values for each of our city street networks provides some interesting insights. The mean of these values is 0.144. This is a relatively low global entropy value and indicates that the vast majority of decisions when navigating shortest routes can be made correctly using only local direction information. On the other hand, the variance of these values is quite large. The city of London has the lowest global entropy value of 0.088 while the city of Carlisle has the highest global entropy value of 0.191. This indicates that it is more than twice as difficult to navigate shortest routes using local direction information in the latter compared to the former. This can be attributed to the fact that the city of Carlisle, which is displayed in Figure 2, contains a relatively large number of cul-de-sacs adjacent to important transportation corridors. Three of these are highlighted in the figure. As previously illustrated using the toy street network in Figure 1(b), cul-de-sacs frequently do not belong to shortest routes and, in turn, contribute to higher entropy. Furthermore, the western and eastern regions of the street network are separated by a river, and all routes between these regions involve crossing bridges in the north. This requirement to travel north when travelling east to west and vice versa is not captured by the direction-based heuristic: in such cases, the heuristic will recommend moving in an easterly or westerly direction instead of a northerly direction. Figure 3 displays the street network for the city of Dundee which is of a similar size to the street network for the city of Carlisle. However, this street network has a significantly lower global entropy value of 0.114. This can be attributed to the fact that this street network contains a relatively small number of cul-de-sacs adjacent to important transportation corridors. Furthermore, the environment in which the street network exists does not contain any barriers, such as a river, so shortest routes are more easily determined using only directional information. The street network for the city of Carlisle is displayed. This street network has the highest global entropy value of all city street networks considered. This can in part be attributed to a significant number of cul-de-sacs adjacent to important transportation corridors. A number of these are highlighted in the figure. Cul-de-sacs frequently do not belong to shortest routes and, in turn, contribute to higher entropy. The street network for the city of Dundee is displayed. This street network has one of the smallest global entropy values of all city street networks considered. This can in part be attributed to a relatively small number of cul-de-sacs adjacent to important transportation corridors.

The analysis of global entropy values could be used by urban planners to identify those cities that are more difficult to navigate and therefore in need of additional signage. It could also be used by urban planners to help suggest changes or additions to a given street network structure that improves navigability. It is likely that in some situations a trade-off exists between improving the navigability of a city and improving other aspects of navigation such as the length of shortest routes.
The Pearson and Spearman’s rank correlation between the global entropy values and the street segment orientation entropy values, the intersection density values and the route directness values. Each cell displays the correlation statistic in question, followed by the p-value and the 95% confidence interval.
Figure 4(a) displays a histogram of the local vertex entropy values corresponding to the Cardiff city street network. Recall that these values equal the proportion of times that, when at the vertex in question, the direction-based heuristic does not select a vertex that keeps the decision-maker on a shortest route. The histogram is left-skewed, with a long tail to the right. Of the 10,735 vertices in the street network, 3806 vertices (35%) have an entropy value equal to 0.0 while 765 vertices (7%) have an entropy value greater than or equal to 0.9. This demonstrates that, while a large percentage of vertices can be successfully navigated using only direction information, a significant percentage cannot. Histograms of the local vertex entropy and vertex betweenness centrality values for the Cardiff city street network are displayed in (a) and (b), respectively.
Figure 5(a) displays the 2% of vertices that have the greatest local vertex entropy values. The two regions highlighted in this figure using blue dashed boxes are displayed in Figures 6(a) and 6(c). The regions in question correspond to a suburban residential region in the northwest, and a city-centre region, respectively. As discussed previously, a vertex corresponding to a cul-de-sac will always have a local vertex entropy value of 0.0. Although some vertices identified in the above figures may appear to correspond to cul-de-sacs, they are not. Instead, this is a consequence of the red dots covering corresponding neighbouring streets. We can see that the suburban residential region has a significantly higher density of vertices with large entropy values than the city-centre region. This is a consequence of the fact that, by design, residential regions contain a relatively large number of streets leading to cul-de-sacs. The Cardiff street network is displayed in both figures (a) and (b). In the former figure, the 2% of vertices that have the greatest local entropy value are represented by red dots. The local entropy of a vertex equals the likelihood of one taking an incorrect turn at that vertex when navigating. In the latter figure, the 2% of vertices that have the greatest product of local entropy value and betweenness centrality value are represented by red dots. The betweenness centrality for a vertex equals the percentage of shortest routes that pass through that vertex. The two regions highlighted in these figures using blue dashed boxes are examined in Figure 6. The two regions highlighted in Figure 5(a), corresponding to a suburban residential region in the northwest and a city-centre region, are displayed in (a) and (c), respectively. The same two regions in Figure 5(b) are displayed in (b) and (d), respectively. The red dots in (a) and (c) indicate those vertices where the likelihood of one taking an incorrect turn when navigating is greatest. The red dots in (b) and (d) indicate those vertices where one is most likely to navigate through and take an incorrect turn when doing so.

The local entropy model has many potential applications. It could potentially be used to help recommend routes that are easier to navigate and, in turn, easier to describe. It could also be used to identify vertices where drivers are likely to frequently take an incorrect turn and therefore indicate a need for different road layouts or new signage. To help identify these vertices, we computed the vertex-wise product of local vertex entropy values and vertex betweenness centrality values. The betweenness centrality for a vertex equals the percentage of shortest routes that pass through that vertex (Brandes 2001). Vertices with a large corresponding product value are those that are both more likely to be travelled frequently and more likely for an incorrect turn to be made at. Figure 4(b) displays a histogram of the betweenness centrality values corresponding to the Cardiff city street network. The histogram is skewed to the left indicating that most vertices have small centrality values. Figure 5(b) displays the 2% of street network vertices that have the greatest product of entropy and betweenness centrality values. Figures 6(b) and 6(d) display the same two highlighted regions considered above. By examining these figures we can identify the vertices that exist along important navigation corridors and have high entropy. For example, the most northerly vertex identified in Figure 6(d) corresponds to an intersection along a major navigation corridor that leads to a cul-de-sac. If not already in place, the street network intersections in question may benefit from clear signage to prevent drivers from frequently taking incorrect turns.
Conclusions
This article has proposed new models of street network entropy that are defined with respect to both individual network vertices and entire street networks. These models measure the uncertainty of navigating given only local direction information. This is distinct from existing models of street network entropy which, instead, measure the uncertainty of more abstract properties including street segment direction. Since navigating is the very task that street networks are intended to facilitate, we argue that the proposed models are more interpretable and this, in turn, increases the potential for useful analyses to be performed.
Despite the many potential applications, the proposed models have some limitations that should be considered if attempting to implement these applications. The proposed models assume that individuals navigate using only directional information. However, in many cases, individuals use other forms of information including landmarks, street signage, prior knowledge of the geographical region and satellite navigation systems. Further research is necessary to determine and model how humans integrate and reason using these different forms of information when navigating.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
