Abstract
Accessibility to a fixed point of interest is a powerful indicator for quantifying the strengths and weaknesses of a multimodal pedestrian and transit network. However, it is difficult to predict the impact on accessibility deriving from the insertion of a new infrastructure and its contribution to the urban landscape. This paper uses graph science and multisource open data to implement a tool for studying accessibility in combination with several network performance indicators. The tool allows determination of whether a transit infrastructure project can be beneficial for its potential users. In this paper, we propose to compute gravity-based accessibility, temporally bounded by an isochrone
Keywords
The acceleration of urban sprawl since the 1970s and into the 2000s has generated an increasing number of trips by individuals in private motor vehicles within the urban area. Combined with the increase in the number of motorized households ( 1 ), users have shifted their mobility habits toward an increase in travel time and distance covered each day, in particular to their place of work or to services. These different forms of travel have generated many negative effects, such as the time and productivity costs of congestion ( 2 ). It should also be remembered that private vehicle transport remains the primary way through which individuals travel, despite its high climate impact and the development of public transport ( 3 ).
Thus, it seems natural that access to essential services, such as medical centers or shops, should be maximized by public means of mobility. The notion of accessibility was first formalized by Hansen in 1959 ( 4 ). According to Hansen, accessibility is the number of potential interactions that the individual can have with points of interest within a fixed perimeter.
Unfortunately, accessibility is too often a very unequally distributed resource, on the one hand because of strong economic attraction of urban districts, or, on the other hand, because of an inefficient service in the most disadvantaged districts ( 5 – 7 ). This inequality is often accentuated by political choices of local elected officials, but can also be the result of poor planning of the urban network, leading to an asymmetry of demand and supply. This can often be explained by the massive need for funding for exclusive right-of-way public transport, for example. Another factor is also the time required to carry out such projects. All these reasons, jointly with a reduced availability of tools to quantify this metric, make decisions about accessibility hard and often suboptimal. Mapping the accessibility of a territory has thus become a major challenge for companies with expertise in urban or rural development ( 8 , 9 ). It allows them to ensure that the offer is well adapted to the demand, or to readjust the vision of a project to reach the objectives described by the specifications.
This paper proposes a methodology for a general analysis of the efficiency of an urban public transport (PT) network and its possible improvements by public decision-makers and transport stakeholders. The methodology has been implemented in a computer tool, and evaluated in real-world scenarios related to the Lyon agglomeration, France. This efficiency is quantified through economic performance indicators based on the accessibility to the different points of interest reachable by the user. In parallel, a simple visual approach has been developed to facilitate the understanding of the different computed metrics by the multiple stakeholders involved in the urban planning process.
State of the Art
Accessibility is a way of quantifying an individual’s environment, based on a metric that describes the opportunity for people to interact with specific points of interest ( 4 ). Following Hansen’s general and partial definition, several other scientists have worked to complete the definition of accessibility. Bhat et al. ( 10 ) explain that: Accessibility is a measure of how easy it is for an individual to pursue an activity of a desired type, at a desired location, by a desired mode, and at a desired time. Bhat introduces the notion of “ease of access” to the activity that the user wishes to access. In addition, the notion of choice is also strongly present, suggesting a focus on the metric of access to the desired point of interest and not on the number of opportunities in a given area.
Finally, a third definition contrasts with Bhat’s: the extent to which the land use and transportation system enables (groups of) people or goods to reach activities or destinations by means of a (combination of) transportation mode(s), proposed by Geurs and Van Eck ( 11 ). Closer to Hansen’s definition, Geurs and Van Eck add that accessibility measures the possibility of reaching points of interest by a succession of modes of transport, motorized or not. This definition therefore does not take into account the measure of ease introduced by Bhat et al. ( 10 ). That the definition of accessibility is not a subject of consensus among scientists explains the plurality of definitions and calculations associated with it. Thus, it appears that there is a significant diversity of models, all of which are relevant depending on the use and the variable or hypothesis that one wishes to highlight. One of the most important conditions of this work will be the usability of the output data, because when the studies are presented, the decision-makers must understand the indicators presented to them very quickly.
The econometric approach ( 12 – 14 ) proposes a probabilistic and rational vision of the individual’s choices that will be at the basis of our work. Such vision is based on the maximization of a user’s utility function. In the economic domain, utility is a quality of an object by which it is possible to obtain a relative measure of well-being or current satisfaction through consumption, or the findable benefit of a good or several services ( 15 ). It is related to but distinct from the need of a consumer ( 16 ).
According to Nassir et al. (
13
), Log accessibility is an efficient technique to estimate the maximum expected utility that users of a system would perceive, given the set of choices available to them. It would thus seem that the user is rational, and that he favors the most efficient succession of mobilities, according to his choice criteria. The hypothesis of a rational individual does not seem to be erroneous, since real-time information systems, such as (transit) travel planners, provide almost complete knowledge of the urban transport network of the city of Lyon. We will therefore assume that the individual is quasi-rational and maximizes her utility by minimizing the travel time with few conditions (
17
,
18
). However, the use of a utility function did not seem relevant in the context of creating a tool to facilitate the understanding of urban planners. This study will thus only report on the theoretical or “possible” accessibility and will not further develop the aspects related to accessibility as perceived by the user. To measure the level of accessibility of a point of interest, we have chosen gravity accessibility to impose an easily identifiable isochrone
Methodology
This section describes the proposed methodology to quantify and optimize the accessibility of an urban transit network according to a fully data-driven automatic approach. The general framework is summarized in the diagram of Figure 1, while the following subsections detail the main methodological steps. To theoretically model an urban transportation network, we adopt a graph science approach. The latter is used to compute the travel time between a given origin and a given destination in a suitable way. In our case, origins correspond to the centroid of each building present in the studied area. We will call

Diagram of the generic methodology for analysis and improvement of an urban public transport network.
General Analysis and Opportunities for Network Improvements
To properly model the overall urban fabric, which includes the transit network, the urban buildings as well as the different points of interest (POIs) for accessibility computation, we use diverse sources of open data. First, it is possible to use the generic GTFS (General Transit Feed Specification) data ( 17 , 21 , 22 ) to model the transit network. GTFS data provide theoretical schedules of a PT network according to a standardized format. Geospatial data such as stop locations are also included. In addition, in the context of this paper, only GTFS data between peak hours are considered. The accessibility is thus averaged over this time interval. Pedestrian mobility is modeled by a pedestrian-only network, which does not consider the road network or any other form approaching it, using OpenStreetMap data.
After having modeled the graph representing the urban pedestrian/PT network, we implement the buildings layer by retrieving their geo-referenced polygon shape, computing their centroids, and assimilating them as origin nodes of the graph. This manipulation allows a simple yet effective computation of the travel time in the network, by formally defining the origin points of the transit network. Building nodes are connected via an edge to the nearest existing node of the pedestrian network. The final network will thus include all the buildings of the metropolis (assimilated to origin nodes), the pedestrian network taken from OpenStreetMap, and the PT network used by GTFS (
22
,
23
). In a similar way, POIs from specific categories, such as health services or leisure activities, are added in a second stage for the
Each building is associated with
ID: The unique building number.
Geometry: The geometry of the building (in POLYGON format).
Net type: The type of structure that the object takes in the network.
Accessibility: The accessibility value associated with each building.
With the support of other indicators that will be detailed in the rest of the paper, the user has the possibility of proposing and evaluating an alternative configuration of the transit network, by modifying, deleting, or adding a new infrastructure to the reference initial configuration. Thus, the algorithm modifies the network and recalculates the accessibility matrix. As depicted in Figure 1, the tool could be adopted as in an iterative approach, that is, for comparing the accessibility matrix produced at the
Extract of the Accessibility Matrix at the Output of the Algorithm
Accessibility Computation
Given the bounded spatial region of interest
where
The
Graph Construction
The modeled graph is a directed graph of the form G = (V, E, W), where
The weight of each edge depends exclusively on the nature of the nodes it connects, and will correspond to a travel time in our context. In the following, we consider all cases that one could encounter in building a PT–pedestrian graph for the definition of the edge weights.
First of all, the travel time between two PT stops corresponds to the time it takes, in nominal conditions, to travel from u to v.
Let
The values
This formula is also valid for edges connecting a building to a pedestrian node, as well as an edge from a PT stop to a pedestrian node.
The approximation of the uniform rectilinear trajectory in Equation 3 is not aberrant in view of the average value of the distances between each pedestrian node (
Finally, to adequately model the transfer from pedestrian to PT mode, we propose to consider the waiting time of an individual at a PT stop.
Waiting times at a stop can be modeled by a continuous piecewise function such as the one reported in Figure 2 originally proposed by Kujala and al. ( 18 ).

Graph of the individual’s waiting function at any given stop.
It is moreover worth remarking that the accessibility to a POI varies significantly from one individual to another. An individual who plans her trip in advance will usually minimize her waiting time, but in the worst case, the traveler has to wait for the full duration of the line frequency, that is
Subsequently, we will make the simplifying assumption that individuals wait an average of
Let
where
We could take a waiting time equal to
Shortest Route Calculation
As introduced in the “State of the Art” section, we will consider users to be rational, maximizing their utility by always preferring the shortest path. Therefore, the metric used to model accessibility will be the shortest travel time between a starting node
We can consider the following minimization problem: Let
with
In other words, the minimum of P is the shortest path from the origin to the destination among all existing alternatives ( 9 ). Shortest paths are determined by using the contraction hierarchies algorithm, which makes computation faster than using simple Dijkstra’s algorithm because of the successive steps of simplification of the graph (identification of lower-importance vertices and edges of the road network and compression of the graph) performed during a pre-processing phase. The latter is performed before the actual computation of the shortest paths, which can still rely on Dijkstra’s algorithm but on a contracted graph. Computation time is thus reduced because of the lower number of nodes/edges ( 22 , 25 ).
Network Modification: Accessibility Estimate
While calculating the accessibility of an existing network provides a comprehensive view of the ability of a network to transport people to their chosen POIs, the choice of a new infrastructure or the introduction of a new line is often difficult to quantify. In particular, we propose to modify the existing network, by adding to, deleting from, or modifying the modeled graph. Then, it is possible to predict the accessibility behavior in the vicinity of the new structure and quantify its contribution to the community by different indicators.
Absolute Accessibility
To visualize the contributions of a new transit infrastructure, we propose as an indicator the absolute change in accessibility between the original and the modified network (
26
). Let us denote
where
This accessibility gap can be thus interpreted as the increase in the number of POIs that the user can now reach in a time
Absolute Inequality Gap of Individuals to Accessibility
As we have seen, spatial inequality in accessibility is often a collateral effect of the polarization of economic and residential centers. The policy choices associated with the creation of a specific PT system were primarily motivated by prospects for economic productivity, which had the effect of putting equality of access on the back burner.
The notion of equity discussed in this paper comes primarily from the definition of horizontal equity ( 27 ). Horizontal equity assumes that each individual bears all travel costs unless subsidies are explicitly justified. Indeed, each individual bears the full temporal costs of the trip he or she is making, that is, the waiting time in addition to the different travel times. It is difficult to imagine temporal subsidies such as a reduction of the trip, since the infrastructure itself does not allow this. This definition implies that for a given population of individuals bearing the same costs, the most desirable distribution of the expected wealth, in this case accessibility, would be pure and simple equality between all. One of the best ways to account for the described equity situation seems to be the main economic inequality indicators. However, these indicators do not capture the spatial inequality of the service in a given area. Therefore, it appears appropriate to adopt a graphical representation of accessibility (i.e., maps with the help of GIS tools) ( 28 ). This is the approach that will be taken throughout this paper. By highlighting these inequalities, a transport stakeholder such as Egis Rail could strengthen its argument to prefer one route over another when building, for example, a new transport line. Indeed, a new tramway line route with the argument of an increase in accessibility to health services combined with a decrease in spatial inequalities is assumed to be appreciable by the community.
The usual indicator to model inequalities in transportation (
5
,
26
,
29
) is the Gini coefficient (
30
). The Gini coefficient, denoted as
Let
where
The Gini coefficient can also be equal to twice the area between the equality curve and the Lorenz curve. We will combine these indicators with a graphical representation such as the Lorenz curve, generally used to represent the distribution of a wealth
where
To compare several tramway lines, we will study the number of buildings whose accessibility is affected by the new line, to quantify the catchment area of each tramway option. It is also important to look at the Gini gap between the initial situation and the situation with the new infrastructure. The Gini gap reflects the variation in inequality in the accessibility of individuals. A positive Gini gap means an improvement in the inequality situation for the concerned area. In contrast, a negative Gini gap means a deterioration in the equality of access to transport of individuals in the study area. Finally, the magnitude of the value of this gap illustrates the significance of the change, which allows us to rank the network changes. For example, a line addition producing a positive Gini gap of 0.05 will be less efficient in the “horizontal equality” sense than a line addition with a Gini gap of 0.06. It should be noted that this gap may be very small, because of the small importance of the infrastructure in the overall cumulative share of accessibility of the Lyon metropolis. Finally, we will carefully study the statistics around the overall accessibility gain generated by each of the lines.
Results
Data and Software
We implemented the framework described in the “Methodology” and “Network Modification: Accessibility Estimate” sections in Python 3.9, using as main libraries Pandana and UrbanAccess (
22
,
23
). Pandana allows parallelizing of the shortest path computations because some functions are coded in
Main Input Parameters of the Model
Modification of the Network: Improvements
Let us now analyze the current accessibility of Lyon’s network with respect to health services. Figure 3 shows a heatmap of the accessibility values for each building. Hotter colors are indicative of higher accessibility. Obviously, the accessibility will be centralized around exclusive right-of-way PT in Figure 3, because it offers the shortest travel time combined with a significant frequency. We can observe that the core transport mode for accessibility to health services remains the Lyon metro, with more than 300 establishments reachable from the center of the city (Figure 4b). The centralization of accessibility in the center of the metropolis could be explained by a phenomenon of inertia caused by the choice of establishments to be located around the major PT axes to increasingly serve the major economic centers. Moreover, we can observe that out of 170,460 buildings, the average of health buildings reachable in less than 15 min is 57.64, as given in Table 3. This result shows that, on average, individuals in the city of Lyon have access to a high number of health services very quickly, thus indicating an efficient network. However, a closer inspection of the results from Table 3 highlights a different outlook: the median value of accessibility is much lower than the average, showing that 50% of the individuals have an access in less than 15 min to only 31 health buildings. Interestingly, 25 % of the buildings (Q1) have access to fewer than eight health buildings, a situation that characterizes mainly the west side of Lyon (Figure 4a). Finally, the standard deviation of 67.6034 suggests a large dispersion around the mean, which would imply very large inequalities between individuals.

Map of accessibility to health services in Lyon within 15 min.

Maps illustrating the polarization of access to health services to the detriment of the west: (a) buildings with accessibility under the 1st quartile, and (b) buildings with accessibility above the 3rd quartile.
Statistics Associated with the Accessibility of Buildings in the City of Lyon
The spatial distribution reported in Figure 4a, related to the buildings with accessibility inferior to the first quartile, confirms that the western part of the Lyon metropolis does not provide easy access to health-care services, which are mainly located in the city center around main transit axes. It also appears quite strikingly that the bus lines of the current system, although numerous, have a minor effect on the total accessibility of a building. This can be explained by the very high frequency of passage of the metro and its speed, compared with the other transport modes analyzed in this study. It should be noted that the definition of gravity-based accessibility does not allow for distinctions between POIs and their importance: Access to a social service is as important as access to a hospital. We can therefore only speak of overall accessibility to health services.
Several methods can be used to improve the transportation network in a city. Increasing frequency or speed can significantly augment the accessibility of users to a center of interest, because waiting and travel times are reduced.
Nevertheless, the spatial distribution of accessibility will hardly change, or will be only slightly affected. To decrease the inequalities between residents, the public decision-maker might be tempted to modify an existing PT mode route, or to create another route altogether.
Concerning these aspects, we have analyzed the impact as accessibility gain deriving from the introduction of a new tramway line (whose stops are reported with green symbols in Figure 5). When considering the introduction of a new line, we have analyzed the impact of the tramway passage frequency by considering multiple values: it appears that decreasing the frequency of transport by 1 min produces a decrease in the building accessibility gain of 25 % on average in Figure 5, with a decrease in the total number of buildings affected. This result can also be explained by the threshold effects involved in defining our accessibility. It is obvious that the argument of the quarter-hour city, in which all amenities would be accessible in a 15 min walk, remains very theoretical, and cannot be completely achieved. In the rest of the paper, we will take 6 min as the frequency of the new tramway line, approaching the average frequency of a tramway line in Lyon. Sytral, the operator of the city of Lyon’s transport network, recently unveiled plans for a new tramway line to serve western Lyon. We propose this line as well as two other alternative routes detailed in Figure 6 to compare them, according to the methodology explained above. In addition, we consider POIs including doctor offices, hospitals, and clinics, to refocus our study on facilities that provide health-care services.

Accessibility gain between the initial network and the network with the added tramway line: study of different frequencies: (a) tramway line with a frequency of 6 min, (b) tramway line with a frequency of 7 min, and (c) tramway line with a frequency of 8 min.

Illustration of the three tramway lines submitted to the study.
In Table 4 we have given the different indicators allowing a better understanding of the benefits of the different infrastructures studied. We have computed four important indicators that can potentially help in the decision of urban planning, namely:
Comparative Statistics of the Three Tramway Lines
In relation to the last indicator, it should, however, be noted that the Lorenz curves (in Figure 7) are plotted by only including the buildings within a 200 m zone around each building with an accessibility gain. This allows the better studying inequalities at a local level. The calculation of the Lorenz curve on the whole metropolis would in fact not have been relevant in view of the infrastructure change, which is very minimal compared with the global accessibility offer.

Lorenz curves conjugated to the different plots above.
By comparing the different values of the
We can thus observe that all three lines improve equity between individuals, to a greater or lesser extent, because all values are positive in Table 4. Subsequently, it seems that the least efficient line for equitable distribution of wealth is line 3. This can be explained by an extended service to the east, already very well served, as illustrated by Figure 4b. Line 2 seems the one that maximizes equity between individuals (higher
Conclusion and Future Work
In this paper, we proposed a methodology and a tool for the analysis and optimization of accessibility of a multimodal urban pedestrian/transit network, which can be useful for data-driven urban design planning. To quantify the accessibility of each building, we created a multimodal pedestrian/transit graph, including as additional nodes the buildings of the metropolis. In addition, we set up several performance indicators to analyze the characteristics of the network accessibility, and with the objective of optimizing it by implementing new infrastructures, or modifying existing ones. Although estimating the accessibility gain by modeling a new infrastructure can be effective with the support of decision-makers, it does not fully describe the future effects of this change. In large cities, we can propose that the Gini deviation be performed on all POIs (amenities, shops, and so on) and not on specific points like health-care buildings. We assume that the value of this gap would be more significant, because we would no longer increase the accessibility to a type of building but increase the global accessibility. In addition, several limitations must be considered in our graph science approach. Transfer times when the user does not move from a transit node (i.e., the user stops at a stop with an A bus, and picks up a B bus a few minutes later at the same stop) are not taken into account. It should also be noted that there are some threshold effects around the spatial study area: external POIs are not taken into account, which leads to edge effects and may distort the accessibility of buildings at the edge of the study area. Furthermore, the gravity accessibility described in this article does not include directly the notion of user demand. Building attractiveness is essential to define which areas are priorities for maximizing accessibility. One could refer to the notion of spatial attractiveness, which can be defined as the sum of incoming flows directed to a geographical unit. It is obvious that the calculation or retrieval of inflows information is a very complex task and would normally require availability of large-scale data on building frequentation. However, using mobile phone passive data appears a valid solution to estimate incoming flows or dynamic user presence of the studied infrastructure for each area or building ( 34 – 36 ) and will be a matter for future work.
Footnotes
Acknowledgements
The authors thank Keith Alipogpog for his upstream work on the tool for accessibility quantification, as well as Egis Rail for contributing to funding this research. We also thank Elie Cornillon, Grange Kévin and Stephen Fournout, from Egis Rail, Lyon, for their support with some of the ideas of this paper.
Author Contributions
The authors confirm contribution to the paper as follows: study conception and design: SS, AF, PP, JS; data collection: SS, PP; analysis and interpretation of results: SS, AF, PP; draft manuscript preparation: SS, AF. All authors reviewed the results and approved the final version of the manuscript.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The authors acknowledge the support of the French National Research Agency (ANR) grant number ANR-18-CE22-0008 (PROMENADE project).
