Abstract
Transportation networks, from bicycle paths to buses and railways, are the backbone of urban mobility. In large metropolitan areas, the integration of different transport modes has become crucial to guarantee the fast and sustainable flow of people. Using a network science approach, multimodal transport systems can be described as multilayer networks, where the networks associated to different transport modes are not considered in isolation, but as a set of interconnected layers. Despite the importance of multimodality in modern cities, a unified view of the topic is currently missing. Here, we provide a comprehensive overview of the emerging research areas of multilayer transport networks and multimodal urban mobility, focusing on contributions from the interdisciplinary fields of complex systems, urban data science, and science of cities. First, we present an introduction to the mathematical framework of multilayer networks. We apply it to survey models of multimodal infrastructures, as well as measures used for quantifying multimodality, and related empirical findings. We review modeling approaches and observational evidence in multimodal mobility and public transport system dynamics, focusing on integrated real-world mobility patterns, where individuals navigate urban systems using different transport modes. We then provide a survey of freely available datasets on multimodal infrastructure and mobility, and a list of open-source tools for their analyses. Finally, we conclude with an outlook on open research questions and promising directions for future research.
Keywords
Introduction
Urban mobility takes place across a range of transport modes. The most basic mode is walking, allowing individuals to cover short distances. With the historic growth of cities and transport technologies, new forms of mobility have emerged that are able to cover larger distances, keeping the growing city a coherent unit: from horsecar, bicycle, streetcar, to rail, bus, and automobile. These different modes of transport fulfill different roles that cater to the heterogeneity of urban trips which unfold across a range of different spatial scales, from individual buildings to city blocks, neighborhoods, up to large urban agglomerations (Alessandretti et al., 2020; Batty, 2006). Due to their heavy-tailed distribution (Alessandretti et al., 2017; Brockmann et al., 2006; Gonzalez et al., 2008; Song et al., 2010), the majority of urban trips are short and can therefore be taken by foot (Varga et al., 2016). The second largest fraction of trips is medium distance optimally taken by bicycle or bus. Finally, the smallest fraction of urban trips is long distance, for which travel via metro/rail or via car is reasonable (Varga et al., 2016). It is therefore important for any large enough city to have a balanced mix of modes that reflects the distribution of how far its citizens need to move. The multimodality perspective has an application in long-term transportation system analysis planning and design, allowing engineers and planners to analyze strategies for multimodal urban mobility at different scales (Arentze and Molin, 2013; Arentze and Timmermans, 2004; Fu and Lam, 2014; Loder et al., 2019, 2022).
The subject of this review, multimodal mobility (or combined mobility), is then the idea to combine multiple modes in one trip. On one hand, this is happening naturally just by the existence of multiple modes—for example, a trip with a bus includes walking to and from a bus station. On the other hand, multimodal mobility can also be intentionally designed, such as bicycletrains (Geurs et al., 2016) or shared mobility solutions to the last mile problem (Shaheen and Chan, 2016). Given the different ranges, densities, and frequencies of transport modes, their combination can thus be advantageous.
Potential and challenges of multimodal mobility
The biggest potential of multimodal mobility is to improve urban sustainability through the combination of sustainable forms of transport, especially with complementary reach such as bicycles and trains (Rietveld, 2000), to make the whole system more sustainable, that is, able to last for a long time. The incorporation of unsustainable forms of transport into multimodal trips, such as automobile transport, together with mass transit—for example, in a first-last mile scenario—could yield a more sustainable transport system under certain conditions but has to be assessed carefully (Hoehne and Chester, 2017). The potential of multimodal mobility therefore lies in creating an interplay between different modes that is more effective than a single mode alone, and in a direct or indirect reduction of unsustainable transport modes. In particular, with the growth of cities, it has become clear that the monolithic focus on cars that has emerged in the 20th century, together with a car-centric approach of transport planning to minimize travel time for individual motorized transport, is unsustainable (Banister, 2005). The economic burden of automobility alone is € 500 billion yearly in the EU while walking and cycling provides a yearly benefit to society worth € 90 billion due to positive health effects (Gössling et al., 2019). From this perspective, sustainable urban transport development therefore views mobility as a means to provide people social access to the city rather than an optimized traffic flow for vehicles. In this modern view, all transport modes co-exist in a hierarchy that prioritizes walking, cycling, and transit over other forms of transport (Banister, 2005; Gössling, 2016). This co-existence naturally provides opportunities for combinations. However, designing a well-functioning multimodal transport system comes with challenges.
Apart from deep political challenges related to car dependence (Mattioli et al., 2020), each mode of transport has its own underlying infrastructure network and schedule of operation that has unique structural and dynamical properties depending on its function. An example of this structural complexity is provided in Figure 1 showing the interconnected transport system of Manhattan given by the interplay of a pedestrian, a bicycle, an underground, and a car network, characterized by different size, connectivity, and velocity. Considering individual transport, the network of pedestrian sidewalks and walking areas must be at least as dense as the bicycle network, as almost every location in the city should be reachable by walking but not necessarily by bicycle. Similarly, considering mass transit, a bus network will generally be more dense and more frequently operated than an urban rail network which serves larger distances. Multiplex network representation of Manhattan, New York City, with four layers of transport infrastructure (pedestrian paths, bicycle paths, rail lines, and streets), with data from OpenStreetMap (OpenStreetMap contributors, 2017b). (Right) Network information for each layer, number of nodes, links, and average degree ⟨k⟩. Figure adapted from Natera Orozco et al. (2020b).
The new multimodal nature of large cities raises further planning challenges. From the traveler’s view, choosing convenient travel solutions within multimodal systems requires to process large amounts of information (Gallotti et al., 2016b). From the transport provider’s perspective, multimodal systems are more difficult to manage, due to the large number of aspects that have to be taken into account in order to achieve synchronization between different agencies and modes (Barthelemy, 2016). In practice, these difficulties have created novel business opportunities for both proprietary and public stakeholders, for example, mobile phone applications for multimodal urban mobility (Willing et al., 2017), and a plethora of research questions which are the subject of this review.
Aim and structure of this review
The primary aim of this article is to offer a comprehensive review of multimodal transportation and mobility research focusing on recent approaches from complexity science. Within this framework, a city is understood as a complex system (Batty, 2013; Lobo et al., 2020), and its transport infrastructures, such as streets, sidewalks, bicycle lanes, and public transport, are modeled as networks (Barthelemy, 2011). The field of Network Science has extensively studied such spatial networks, first from a single-layer and more recently with a multilayer perspective, with special attention to transportation networks (Barthelemy, 2011; Ding et al., 2019; Lin and Ban, 2013), focusing on topological properties (Barthelemy, 2011; Batty, 2008; Barthelemy and Flammini, 2008; Boeing, 2020; Cardillo et al., 2006; Jiang and Claramunt, 2004; Louf and Barthelemy, 2014; Strano et al., 2013), centrality metrics (Boeing, 2018; Crucitti et al., 2008; Kirkley et al., 2018), and growth processes (Makse et al., 1995; Strano et al., 2012; Szell et al., 2022). Other topics include the impact of the street networks on pedestrian volume (Hajrasouliha and Yin, 2015), accessibility and vitality of cities (Biazzo et al., 2019; De Nadai et al., 2016; Natera Orozco et al., 2019), and resilience of transportation networks (Baggag et al., 2018; Ferretti et al., 2019; Natera Orozco et al., 2020a). These recent approaches can be seen as the beginning of emerging fields like a Science of Cities or Urban Data Science which exploit new large-scale urban datasets with quantitative tools from physics, geoinformatics, and data/network science (Batty, 2013; Resch and Szell, 2019).
In this review, we survey the literature on urban multimodal mobility and on urban transportation infrastructure as multilayer networks. Here, we focus on the primal approach to networks (Porta et al., 2006) where streets and mobility infrastructure constitute the network links, and intersections (bus stops, subway stations, etc.) constitute the nodes of the network. Other approaches such as space syntax are not common in complexity science.
The remainder of this review is arranged as follows. Section 1 introduces the mathematical concept of multilayer networks. In Section 2, we discuss research on urban transport infrastructures, including their empirical characterization (see Subsection 2.2) and theoretical modeling (see Subsection 2.1) as multilayer networks. In Section 3, we instead focus on mobility and navigation across these multimodal systems. In Section 4, we cover the relevant open datasets and the main software tools which can be used to analyze multimodal transportation systems. We conclude with an outlook and a summary of open questions for the research community in Section 5.
Multilayer networks: a framework for multimodality
Over the last decades, networks have emerged as a versatile tool to understand, map, and visualize the interconnected architecture of a wide range of complex systems (Albert and Barabási, 2002; Boccaletti et al., 2006; Dorogovtsev and Mendes, 2002; Newman, 2003), in particular spatially embedded ones (Barthelemy, 2011, 2018). Formally, a network, or graph,
More recently, network scientists have put effort into characterizing the structure of systems which are formed by different interconnected networks. Such interconnected structures are natural for transportation systems. Think, for instance, of the largest transportation hubs in worldwide cities, where stations are routinely served by bus, underground, and railway infrastructures. Indeed, most urban transportation systems systemically rely on the interplay between different means of transportation. These systems can be conveniently described by multiplex or multilayer networks. In multilayer networks, links of different types, describing, for instance, different transportation infrastructures (e.g., subway lines, bus routes, or bicycle lanes), are embedded into different layers. Each layer α, α = 1, …, M, is described by an adjacency matrix
In the case of transportation networks, simply identifying nodes of different networks as the same station might not always provide the most complete description of the multimodal network. Take, for instance, the largest stations in mega-cities, like King’s Cross–St Pancras in London, Grand Central Station in New York, or Hongqiao transportation hub in Shanghai. All of these are identified by a unique location (node index) i across the different transportation layers. Yet, switching from one mean of transportation to another within the same station might require a non-negligible fraction of time and effort, given the complexity and size of the overall infrastructure. For this reason, it is often relevant to complement the description of the intra-layer connections present in the system, with inter-layer links associated to the cost, spatial distance, or time required to switch layers. Inter-layer links between layers α and β at a node i can be encoded through the inter-layer matrix
Multilayer networks are a natural framework for multimodal transport networks. Indeed, the concept of multilayer networks in the transportation engineering field goes back to the 1970s when Dafermos (1972) proposed a formulation for the traffic assignment problem for multiclass-user networks. Since then, the term “multilayer” has been interchangeably used with “multiclass,” “multimodal,” and “multiuser.” Shortly after, Sheffi (1978) proposed the notion of “hypernetwork” that was later redefined as “supernetwork” (Sheffi, 1985), in which decision-making can be modeled as a route selection over a multilayer or multimodal network. In the field of network science, one of the pioneering works introducing the framework and concept of “layered complex networks” (Kurant and Thiran, 2006b) explicitly focuses on the case of transportation systems, where a first layer encodes the physical infrastructure of the system, and the second one describes the flows on such infrastructure. Other early works on the topic also dealt with interconnected systems at the worldwide level, focusing on different modes of transport such as the multiplex airline networks (Cardillo et al., 2013). Noticeably, multimodal infrastructures seem to possess exclusive characteristics different from other multilayer networks. For instance, when tools to assess the redundancy of the different layers are considered, transportation networks are often found to be irreducible (De Domenico et al., 2015). Differently from many biological systems, where layers often duplicate information to guarantee the interconnected system a high level of robustness, the layers of multiplex transportation systems are purposedly engineered to be different in order to maximize efficiency (Latora and Marchiori, 2001). As a byproduct of this feature, multimodal systems are also often highly fragile (Buldyrev et al., 2010) and sensitive to disruptions or failures (De Domenico et al., 2014). For the reader interested in further material on the topic, we refer to the early reviews (Boccaletti et al., 2014; Kivelä et al., 2014) and textbook (Bianconi, 2018) covering the field. Aleta and Moreno (2019) provide a more recent eye-bird view of the field. A thorough review of the measures and models used to analyze such systems can be found in Battiston et al. (2017), whereas De Domenico et al. (2016) give a theoretical overview of spreading and diffusive processes on such systems. In the following sections of this review, we focus on findings of more direct relevance to the research community working with multimodal transportation and urban mobility. The division of related research across these two themes is not meant to be rigid but rather serves as an indication of the core topic treated in the different works.
Multimodal infrastructures
As cities grow and add different transportation modes, understanding the transportation infrastructure and its interconnected nature is crucial to capture patterns of urban mobility. Since the 1950s, fields ranging from Architecture to Urbanism and Transport Planning have grown a large body of literature studying the structure of cities and their transport systems. With the growth of the Complex Systems and Network Science fields, new models have been developed to study the complexity behind urban systems, and specifically mobility infrastructure. Modeling infrastructures is of great importance for understanding how urban systems work, and for the design of new sustainable mobility options.
In the following, we describe recent findings related to how layers of multimodal transport networks are coupled and grow. First, we review the main models developed in the area of complex systems for urban transportation. Then, we describe empirical findings related to the transport infrastructures.
Modeling multimodal infrastructures
The design of efficient single-layer transportation networks is a classical problem in the domain of transport optimization (see for example the comprehensive review by Farahani et al. (2013)). The design of multimodal transportation network, instead, has been less studied (Farahani et al., 2013). Multimodality brings new challenges to the transportation network design problem, including issues related to the integration of the street network with public transit and active transportation networks (walking and cycling networks) in which travelers can choose to take multiple modes for a single journey (Huang et al., 2018; Zhang et al., 2014). In this section, we describe complex systems approaches for modeling multimodal transport systems. A first line of research has dealt with the representation of transport systems as networks, a second one with the generation of synthetic networks (both static and dynamic). In the following, we review the advancements in these areas.
Representation of public transport systems
How many layers should one consider when modeling transport system as a multilayer networks? A widely used approach considers each bus/train/metro line in the public transport system as a different layer, to account for the transfer time between lines (Alessandretti et al., 2016; Aleta et al., 2017). A different approach considers only few layers, for example, a two-layer system capturing the public and private transport networks Gil (2014), or several layers capturing the different transport modes (bus/metro/train) (De Domenico et al., 2014; Strano et al., 2015). For example, Gil (2014) proposed to model the infrastructure of a city as a combination of two layers (the street network and the public transport network), that connect areas with various land uses. Using open data from OpenStreetMap, this framework enabled the authors to study the accessibility of different areas in the Randstad city-region in the Netherlands. Using network centrality measures, such as closeness and betweenness centralities, the authors showed that betweenness centrality in the public transportation layer can be a good indicator of passenger flows.
Aleta et al. (2017) proposed a systematic comparison of the two approaches above. In their work, the authors investigated the public transport systems of nine European cities. The first approach, where each line is considered as a separate layer, enabled the authors to focus on some structural features such as the overlapping degree (sum of the node’s degree in all layers (Battiston et al., 2014)). Instead, the so-called superlayer approach, where each transport mode is considered as a separate layer, Aleta et al. (2017) showed that—surprisingly—the nodes with the highest overlapping degree are not necessarily the ones participating in the highest number of layers. Indeed, some transportation modes (and in particular the bus layer) have a tendency for hubs which might be disconnected to the other transportation modes, leading to high overlapping degree but low multilayer participation across different transport modes. The prevalence of such hubs is relevant when considering the robustness of the whole system. As the study highlights, it is often easier to move a bus stop to a street nearby, even if it is a local hub where multiple lines stop, than solving a disruption in a subway station. In conclusion, the study revealed that the two approaches are useful and complementary to extract information about the functioning of transportation systems and assess their vulnerabilities.
Static toy models
A second area of research focused on the generation of synthetic transport networks using simple static network models. Gao et al. (2017) proposed to couple a configuration model with power-law degree distribution to model high-speed layers with an Erdős–Rényi random network model for low-speed layers (Newman, 2018). Zhuo et al. (2011) proposed to couple Barabasi–Albert (Barabási and Albert, 1999) and Erdős–Rényi networks (Newman, 2018) as prototypical models for multimodal networks with two layers. These approaches, however, did not directly account for the fact that networks representing transport infrastructures such as streets and rails must be planar, for example, they can be drawn in the plane in such a way that its edges do not intersect (Barthelemy, 2011).
A typical approach to model networks embedded in space is to consider 2d regular lattices (Barthelemy, 2011; Du et al., 2016). Du et al. (2016) proposed a two-layer model consisting of a lower-speed layer, modeled as a lattice, and a higher speed layer, modeled as a Barabasi–Albert (Barabási and Albert, 1999) network (see more details about this study in Section 2.2). Morris and Barthelemy (2012), instead, proposed to use the Delaunay triangulation, which is a prototypical planar network (Barthelemy, 2011). Their model coupled different transport modes, according to the following rules. First, N nodes are placed at random within the unit circle, mimicking a spatial configuration typical of many cities, and are connected by a Delaunay triangulation. Second, to simulate another transport mode, a second layer is generated by drawing a subset of the previously generated nodes and a second Delaunay triangulation. Finally, the two layers are coupled with inter-layer links when a node is present in both layers. Using this toy model and a simulated Origin–Destination matrix, Morris and Barthelemy (2012) investigated how fragile the network is to changes in supply and demand. They found that increasing travel speed in one layer tends to concentrate trips in the fastest layer and also produces congestion in the nodes where is possible to change transport mode. These works revealed that simple spatial network models provide a useful tool for understanding real-world systems.
Non-static models
It is important to note that cities and their transportation systems are not static in time. This means new transport modes may be introduced or extended, such as when new bus/subway stops are added. Recently, to plan for open streets during the COVID-19 pandemic, Rhoads et al. (2020) proposed an investigation of the relation between streets and sidewalks. First, the authors used percolation theory to examine whether the sidewalk infrastructure in cities can withstand the tight pandemic social distancing. They then proposed an algorithm that takes into consideration both the sidewalk and street layers while improving the sidewalk connectivity. Despite spatial constraints, Rhoads et al. (2020) showed that it is possible to widen the sidewalks and improve the pedestrian connectivity with a minimum loss in the road network. In a similar fashion, Szell et al. (2022) explored different growth strategies for retrofitting streets to bicycle networks. Exploring the topological limitations of various bicycle network growth processes, Szell et al. (2022) found initially decreasing returns on investment until a critical threshold, posing fundamental consequences to sustainable urban planning: Cities must invest into bicycle networks with the right growth strategy, and persistently, to surpass a critical mass. Decreased directness for automobile traffic due to bicycle network growth was found which is a desirable effect from the perspective of urban sustainability and livability. Finally, Vybornova et al. (2022) developed a procedure based on a multiplex network approach between streets and protected bicycle infrastructure to define and prioritize gaps in bicycle networks.
The models discussed in this section show how the multilayer network framework has been used to investigate the structure, function, and vulnerabilities of a complex transportation system. In the next section, we cover some measures to quantify the multiplexity of these structures and their interconnections.
Characterizing multimodal infrastructures
The empirical study of urban transportation infrastructure has revealed some of the structural properties of specific transport networks (Barthelemy, 2011). But how to quantify the effectiveness of their interconnections? In this section, we describe a number of measures that have been used to capture the multiplexity of multimodal infrastructures, such as the importance of different nodes, the system’s resilience, and the similarity between layers.
Overlap census
Urban transportation networks present different degrees of multimodality and integration. How can we quantify such differences? The overlap census proposed by Natera Orozco et al. (2020b) is a method that helps to answer this question. Taking the multimodal mobility network of a city as an input, this measure calculates the fraction of nodes that are active in different multiplex configurations (Figure 2(a)). Given a multiplex transport network with Structural measures of multimodality. (a) Schematic of a city’s multiplex transportation network. The information of the different layers is integrated in the overlap census (b). We show the case of Amsterdam (b, mostly active in non-motorized layers) versus Los Angeles and Bogota (c). Spatial distribution of betweenness centrality in New York. In (d), we show only the street layer. The subway layer is added in (e), showing a reallocation of centralities from avenues to metro stations. (f) Superlayer interdependency for multiple transportation networks. The metro layer comprises of most shortest paths in the system, making such transportation layer the most beneficial for high interdependence. Figures adapted from Natera Orozco et al. (2020b); Strano et al. (2015); Aleta et al. (2017).
The overlap census was not the first measure to compare similarities between multiplex networks and layers. Indeed, similar approaches were developed for multiplex networks more in general. For a detailed view, see Nicosia and Latora (2015) who proposed measures capturing nontrivial correlations in multiplex networks and models to reproduce those correlations. More recently, Bródka et al. (2018) also presented an overview of different metrics to compute similarities between layers in multiplex networks.
Paths
At a more global scale, multimodality is often associated with the ability of an agent to navigate the system by using the available transportation modes. Journey or route planning in a multimodal setting refers to finding the shortest route in a multilayer network when multiple transportation modes such as private car, public transit, walking, and cycling are combined in a single and integrated journey (Botea et al., 2013; Zografos and Androutsopoulos, 2008). For this reason, the navigation of an agent in a transportation network can be measured through the available types of paths.
The literature on multimodal route planning has been mostly built around static multilayer networks with no time-dependent characteristics (Bast et al., 2016). Thus, a first possibility is to consider the quickest path, neglecting transfer and waiting times between transportation modes. This path is computed using the fastest speed associated to the edges, assuming a perfect synchronization between the different transportation modes. This is analogous to finding the shortest path between i and j in an aggregated weighted-single-layer network, where the weight describing the time to travel between two nodes is the minimum among those offered by the different transportation options.
Yet, this approach often falls short in capturing real patterns of mobility. Indeed, transportation networks also have a temporal dimension that has to be taken into consideration. In order to find a path that allows for a change between two or more different transportation options, we must find a time-respecting path (Gallotti and Barthelemy, 2014), defined as the shortest path between nodes i and j that considers the departures and arrivals constraints given by timetables. In order to find such time-respecting paths, Huang et al. (2019) proposed a time-expanded model to account for the dynamic nature of route planning in a multimodal network with fixed public transit and dynamic ridesharing vehicles. However, a comprehensive route planning algorithm that simultaneously considers multiple transportation modes in a truly time-dependent network is missing. Existing commercial algorithms are limited in their multimodality, as, for example, they often do not make it possible to choose multiple transport modes (e.g., bike and bus together, to find the best multimodal route with these two modes combined). Furthermore, for multimodal transportation networks, the walking transfer time between modes has to be taken into account when computing time-respecting paths.
Besides time constraints, in order to find a viable path between i and j, the logic of the proposed sequence of modes has to be assessed (Battista et al., 1996; Lozano and Storchi, 2001). For instance, in some cases, a path composed by subway-bus-subway-car-subway might solve the shortest path, but the presence of private transportation (car) as an intermediate option makes it an “illogical path” and thus an unlikely choice for a user. The validity of transportation sequences can in general be formalized in terms of cost associated to change of transportation mode.
Finding viable paths is one of the most important problems in urban transportation, as it has the potential to help users finding the most efficient paths in the city. Lozano and Storchi (2001) have proposed an efficient algorithm to find such paths when the agent establishes her limitations on the number of modal transfers.
The contribution of the different layers of a multiplex networks to shortest paths might be very unequal (Aleta et al., 2017). For instance, the rail systems (trams and subway) contribute to most of the shortest paths in a city, connecting distant points at a greater velocity and in straighter routes than bus or other transport modes. However, such layers have only few stations. Slower and more local transport modes often serve a complementary role, offering a deeper coverage of the city. This interplay between different and complementary transport modes has a direct impact on the congestion and shortest paths availability in the system (Morris and Barthelemy, 2012; Solé-Ribalta et al., 2016).
Spatial outreach
The availability of different transportation modes, such as subways or tramways, affects how easily it is to reach certain locations in the city. A way to measure this effect is to quantify the associated spatial outreach (Strano et al., 2015). The spatial outreach can be computed as the average distance from node i to all other nodes in the same layer α that are reachable within a given travel cost τ. Mathematically, it is defined as follows
Strano et al. (2015) modified the average speed (traversal time of a link) in the layers of multiplex systems to measure their effects on the corresponding travel outreach. Strano et al. (2015) found that when the metro speed increases compared to the street layer speed, a clear area of high-outreach nodes emerges in the city center and around the nodes that have connections to the high-speed layer. In other words, as the velocity in layer β increases, the nodes that are closer to the interchange nodes in layer α improve their accessibility, implying that a person can efficiently travel from this area to faraway places. This concept of travel outreach is similar to that of isochrones which quantify the accessible area from a given point within a certain time threshold, for example: What is the area that a user can reach traveling x minutes, in any direction, from a given point? Biazzo et al. (2019) used this approach to measure accessibility in different urban areas computing the isochrones as a combination of public transit and pedestrian infrastructure. With this method, scores were obtained that capture how well a city is served by the public transit and how accessible a specific area is to the rest of the city.
Betweenness centrality and interdependence
The relevance of the nodes in a network is characterized by centrality scores. Additional to single-layer networks, in multiplex networks, the overall centrality of a location or station also depends on the interplay of the different transportation options.
The simplest way to define the centrality of a node is to measure its degree or the number of locations directly connected to it. Such a measure, however, is not so relevant for systems embedded in space due to spatial constraints. For example, the number of possible connections of a given node (intersection) in the street layer is highly constrained by the physical space, as a single intersection can only have a limited number of intersecting streets/sidewalks. For this reason, other centralities than degree are typically used to assess the relevance of a location. One of such measures is the betweenness centrality (Freeman, 1977), measuring the number of shortest paths passing via a given node. This measure is also called “load” and can be seen as the simplest proxy for traffic flow in the system as it assumes uniform demand between each pair of nodes (McDaniel et al., 2014). In the absence of explicit mobility data, betweenness centrality can be used as a proxy to assess the areas at risk to become overcrowded and to identify potential bottlenecks in the system.
In multiplex networks, shortest paths go from one node to another and able to pass through two or more layers. This effect can be quantified by measuring the interdependence of a given node i as
This interdependence measure can be modified to obtain a score for a specific layer (Aleta et al., 2017). Then the layer interdependence for layer α is defined as
When applying this measure to multimodal transport networks, Aleta et al. (2017) found that the metro and tram layers play an important role in concentrating shortest paths, a direct consequence of Morris and Barthelemy (2012) and Solé-Ribalta et al. (2016) (Figure 2(f)). For Madrid, Aleta et al. (2017) found that more than 40% of the trips have at least one link in the metro layer, even if the metro layer has only 241 nodes while the bus layer has 4590 nodes.
As cities grow and new lines and transport modes are added into the mobility system, new interconnections between layers appear, changing the betweenness of the different nodes. Ding et al. (2018) studied how centralities evolved when the rail network of Kuala Lumpur grew from a tree-like structure to a more complex one. Their findings suggest that, as the network grows, the average shortest path in the multilayer network can decrease dramatically, especially as new nodes are able to serve as interchange between layers, thus enabling new shortest paths along the system.
The results from Ding et al. (2018) are in line with previous findings by Strano et al. (2015) on how the subway layer affect the distribution of nodes centrality in London and New York. Strano et al. (2015) show that the introduction of new interconnected layers affects the congestion of the street layer. In fact, the presence of the subway layer allows to move traffic from internal routes and bridges to the terminal points of the subway system (Figure 2(e)), which might be used as interchange locations for suburban flows into the city center. Theoretical work by Solé-Ribalta et al., 2016 confirms that one of the main drivers affecting traffic dynamics and congestion in multimodal transport networks is the interchange from the least to the most efficient layers. In their work, Solé-Ribalta et al., 2016 leverage multilayer betweenness centrality to prove analytically that the structure of the multilayer networks can induce congestion; as the different layers start to get connected, the most efficient layers concentrate a large portion of starting and ending routes.
Resilience
Evaluating the robustness of a transport system under failures is an important task with practical implications in urban planning. Notably, multimodality significantly improves the resilience of transportation systems (De Domenico et al., 2014).
In a single-layer network, the disruption of an infrastructure, that is, the removal of a link, can make a station or a part of the city disconnected. For example, imagine a transit station in a single-layer network: If the links connecting the station with the rest of the system are removed, the station is inaccessible. However, if such a station is part of a multimodal transportation network, it could still be accessed through other layers. To measure the impact of multimodality on resilience, De Domenico et al. (2014) used random walks (see Section 3.1) to mimic trips among locations and investigated the coverage time in the London’s transportation system under different scenarios, showing that the interconnected nature of the different transport modes dramatically enhances the overall system resilience to failure compared with the single layers. A similar approach was followed by Baggag et al. (2018), where the coverage time of random walks was used to measure the robustness of the multimodal transportation networks of Paris, London, New York, and Chicago. To mimic realistic trips, Baggag et al. introduced several constrains on the complexity of the trips, for instance, limiting the maximum number of transport mode changes. More recently, Ferretti et al. (2019) used the multiplex framework to model Singapore’s public transportation infrastructure and test its resilience against floods in the city in different scenarios, finding that the system is extremely resilient as it faces the first significant disruption only after the removal of 50% of it edges.
Multimodal mobility
Understanding travel is paramount for a range of applications, including planning transportation (Patriksson, 2015) and designing urban spaces. Starting from the 1950s, a large body of literature in the fields of Geography and Transportation has studied how people move and use transportation technology. In recent years, the scientific understanding of human mobility has dramatically improved, also due to the widespread diffusion of mobile phone devices and other positioning technologies, which allowed to gather large-scale geo-localized datasets of human movements and develop realistic behavioral models. Concurrently, these developments had benefited by the growth of the fields of Complex Systems and Network Science, which brought together ideal tools to study interconnected systems. For a comprehensive review of the recent literature stream of Human Mobility, see (Barbosa et al., 2018). Despite recent advancements, our understanding of multimodal mobility in urban systems remains limited, also due to the difficulties related to collecting comprehensive, high-resolution and high-quality behavioral data across multiple transportation modalities. Data collected from smartphones are widely used to study other aspects of human mobility but often suffer from limitations that hinder the study of multimodal mobility and routing. These limitations include poor spatial resolution, limited sampling frequency, and lack of labeled data to identify transport modality. For these reasons, behavioral data collected from phones have been little used to study routing and multimodal mobility.
In this section, we review the scientific literature on multimodal mobility. In Section 3.1, we briefly summarize existing models, focusing on latest advances driven by the Complex Systems literature. In Section 3.2, we review measures and empirical findings, with a focus on recent studies based on passively collected data sources.
Modeling multimodal mobility
Modeling travel in a multimodal system involves understanding how individuals make decisions in a constantly changing complex environment. The most common family of models for travel demand in the Geography and Transportation literature are the four-step models proposing that each trip results from four decisions(McNally, 2000): (1) whether to make a trip or not, (2) where to go, (3) which mode to use, and (4) which path to take. For simplicity, these steps have been largely considered as independent, sequential choices and correspond to four modeling steps: trip generation, trip distribution, mode choice, and route assignment. See the review of McNally (2000) for a comprehensive overview about this modeling approach.
In recent years, the field of Complex Systems has modeled travel behavior on multiplex networks using different approaches that we briefly review in this section. Complex Systems research has proposed novel individual (Alessandretti et al., 2020; Gonzalez et al., 2008; Jiang et al., 2016; Schneider et al., 2013; Song et al., 2010) and collective (Schläpfer et al., 2021; Simini et al., 2012; Szell et al., 2012) models that capture well the first two aspects of travel behavior: trip generation and trip distribution. In this review, we will focus largely on the last two of the four modeling steps, mode choice, and route assignment, because they are the most relevant in the framework of multimodality. It is important to remark that, also due to the lack of empirical data on the mechanisms driving human navigation, many models rely on simplistic assumptions, for example, that individuals are rational, homogeneous, or have unlimited knowledge. Research based on novel data sources will be key to develop mobility models on multilayer networks that include realistic elements such as limited knowledge and cognitive limitations (Manley and Cheng, 2018).
Random walks
The random walk is one of the most fundamental dynamic processes that have been widely studied in the Complex Systems literature as a prototypical model for numerous phenomena occurring upon networks, including human mobility. Importantly, in contrast to widely used models that assume individuals with global knowledge of the system thus choosing shortest routes (Wardrop, 1952), random walks assume that agents are only aware of the local connectivity at each node. A random walk on a graph is defined by a walker that, located on a given node i at time t, hops to a random nearest neighbor node j at time t + 1. In the case of multilayer networks, the walk between nodes and layers can be described with four transition rules accounting for all possibilities (De Domenico et al., 2014): (i)
Despite their simple formulation, random walks provide fundamental insights to many types of diffusion processes on networks and allow to measure a network’s dynamical functionality. For example, random walk processes were used to measure the navigability of multiplex networks (De Domenico et al., 2014). To this end, one can measure the coverage of the multiplex network ρ(t), defined as the average fraction of distinct nodes visited by a random walker in a time shorter than t (assuming that walks started from any other node in the network), and describing the efficiency of a random walk in the network exploration
Random walks have further been used to assign a measure of importance to each node in each layer, by measuring the asymptotic probability of finding a random walker at a particular node-layer as time goes to infinity, the so-called occupation centrality (Solé-Ribalta et al., 2016). Solé-Ribalta et al. (2016) provided analytical expressions for the occupation centrality in the case of multilayer networks.
Travel time minimization approaches
At the other end of the spectrum, agents are assumed to have global (or nearly global) knowledge of the system. This is one of the most widely used approaches in the transportation literature, rooted in Wardop’s user equilibrium principle (Wardrop, 1952) for traffic assignment. Under the user equilibrium principle, in a congested system, all agents choose the best route, for example, no user may lower his transportation cost through unilateral action. Uchida et al. (2005); Zhou et al. (2008); Verbas et al. (2015) are among many of the studies that proposed formulations and solution algorithms for traffic assignment under user equilibrium in a multimodal system.
Complex Systems research has developed models where agents aim at minimizing their individual travel times in congested (Bassolas et al., 2020; Çolak et al., 2016; Manfredi et al., 2018; Solé-Ribalta et al., 2016; Tan et al., 2014) or uncongested (Du et al., 2014, 2016) networks.
Bassolas et al. (2020) developed an agent-based models describing mobility of individuals through a multilayer transportation system with limited capacity. The routing protocol used by individuals for planning is adaptive with local information. In the absence of congestion, individuals follow the temporal optimal path of the static multilayer network calculated by the Dijkstra algorithm. If there are line changes, Bassolas et al. (2020) estimate besides the change walking penalty an additional waiting time of half the new line period (the real waiting time will be given by the vehicles location in the line when the individual arrives at the stop). An individual’s route is only recalculated when a congested node, whose queue is larger than the vehicle’s capacity, is reached. The work investigates analytically (for simple networks) and via numeric simulations the robustness of the network to exceptional events which give rise to congestion, such as demonstration concerts or sport events. The study revealed that the delay suffered by travelers as a function of the number of individuals participating in a large-scale event obeys scaling relations. The exponents describing these relations can be directly connected to the number and line types crossing close to the event location. The study suggested a viable way to identify the weakest and strongest locations in cities for organizing massive events.
Similarly, Manfredi et al. (2018) introduced a limit to the nodes capacity of storing and processing the agents. This limitation triggers temporary faults in the system affecting the routing of agents that look for uncongested paths.
Importantly, the assumption that individuals have global knowledge of the system and minimize travel time contrast recent findings in spatial cognition, showing that human spatial knowledge and navigation ability is limited (Bongiorno et al., 2021; Gallotti et al., 2016b). For example, a recent study on pedestrian navigation made clear that path choices seem to be affected by the orientation of street segments along the route (Bongiorno et al., 2021). Recent modeling approaches for single-layer networks (Manley and Cheng, 2018) incorporate these ideas in routing models where agents are characterized by bounded knowledge and limited rationality. Further research will be necessary to develop realistic multilayer routing models accounting for the limits of spatial cognition.
Characterizing multimodal mobility
Traditionally, multimodal mobility models are calibrated using data from travel surveys (Arentze and Molin, 2013). Studies based on survey data have provided insights into how multimodal travelers value aspects such as the different travel time components (in-vehicle time, walk time, access time, wait time, etc.), service quality, travel costs, and heterogeneities across socio-demographic groups (Arentze and Molin, 2013). Due to the high costs associated with data collection and inherent biases in self-reported data, these studies suffer of limitations, including small sample sizes, data inaccuracy, and incompleteness (Chen et al., 2016; Zannat and Choudhury, 2019). Covering empirical results from travel surveys is outside the scope of this review, and we refer the reader to Arentze and Molin (2013) for a comprehensive introduction to the topic.
In recent years, the empirical research on Human Mobility has taken new directions. A growing body of literature has focused on quantitative descriptions of human movements from large, automatically collected data sources, such as mobile phone records, travel cards, and GPS traces (Barbosa et al., 2018). In this section, we give an overview of recent empirical findings which focused on two important aspects: (1) the dynamics of public transport systems, whose study was driven by the availability of public transport data such as schedules and positions of stop and stations, and pioneered by Kurant and Thiran (2006a), and (2) individual multimodal behavior driven by the availability of data collected using “smart travel cards” and GPS data. These aspects are closely interrelated but capture different properties, with the former focusing on infrastructural and the latter on behavioral properties.
Public transport system dynamics
Over the last decade, the availability of detailed public transport schedules shared by public transport companies (see also Section 4) has allowed to better estimate travel times and characterize transport systems.
Efficiency
To satisfy the demand of large number of individuals while reducing energy and costs, multilayer transport systems must achieve high efficiency. One aspect concerns the synchronization between the network layers because the more the layers are synchronized, the less the users have to wait for vehicles. The synchronization inefficiency δ(i, j) (Barthelemy, 2016; Gallotti and Barthelemy, 2014) for nodes i and j can be measured as the ratio of the time-respecting travel time τ
t
(i, j), which accounts for walking and waiting times and the fact that the speed of vehicles varies during the day, and the minimal travel time τ
m
(i, j), assuming that vehicles travel at their maximum speed and that transfers are instantaneous
Using the synchronization inefficiency, Gallotti and Barthelemy (2014) showed that, on average in the UK, 23% of travel time is lost in connections for trips with more than one mode. Interestingly, across several urban transport system in the UK, the synchronization efficiency δ(i, j) obeys the same scaling relation with the path length ℓ(i, j) Characterizing public transport system dynamics. Efficiency (a) The synchronization efficiency d(i, j) between two nodes i and j against the path length ℓ(i, j) for different public transport systems in the UK. (b) The average synchronization efficiency 
Further, Gallotti and Barthelemy (2014) have shown that the average synchronization inefficiency
Other studies focused on the efficiency in terms of ability to satisfy users demand. Alessandretti et al. (2016) introduced a method based on non-negative matrix factorization to compare the network of commuting flows and the public transport network. This methodology, applied to various public transport systems in France, showed that, while in Paris, the transportation system meets the overall demands, it does not in smaller cities where people prefer to use a car despite having access to fast public transportation. Sui et al. (2019) proposed three topological metrics to quantify the interaction between public transport network and passenger flow and applied it to study differences between the cities of Chengdu and Qingdao in China. Holleczek et al. (2014) used data mining approaches to compare the use of public and private transportation and identify the existence of weak transportation connections. Bergermann and Stoll (2021) used multiplex matrix function-based centrality measures in order to assess the importance of bus stops and lines in the multiplex public transport network of multiple German cities.
Congestion
Congestion can dramatically alter travel time estimates for routes that use popular network links in an urban system, but adding network layers in a multilayer transport system can help reduce global congestion (Chodrow et al., 2016), while at the same time can induce local congestion in the layer that concentrates the majority of shortest paths (Solé-Ribalta et al., 2016). Given a network with edges e and flows j
e
, one can quantify the total time lost in congestion as
The authors showed that, as the subway speed increases, the global congestion is reduced but increases locally close to key metro station (Figure 3(d) and (e)).
Navigability
As cities and their transportation systems become increasingly complex and multimodal, it is important to quantify our difficulty navigating in them. It has been shown that multilayer transport system are characterized by limited navigability, implying that finding one’s way is cognitively challenging (Gallotti et al., 2016b). To quantify the difficulty of navigating between two nodes s and t in a network, one can compute the total information value of knowing any of the shortest paths to reach t from s
Gallotti et al. (2016b) quantified the amount of information an individual needs to travel along the shortest path between any given pair of metro stations, in the single-layer metro networks for 15 large cities. They found that this information has an upper bound of the order of 8 bits, corresponding to approximately 250 connections between different routes. Further, studying several among the largest multilayer transport networks (metro/buses/light rail), Gallotti et al. (2016b) showed that the amount of information necessary to know to travel between any two points exceeds the identified cognitive limit of 8 bits in 80% the cases, suggesting multilayer networks are too complex for individuals to navigate easily (Figure 3(c)).
Individual multimodal behavior
In recent years, data collected via smart travel cards has dramatically improved our ability to characterize multimodal behavior in urban transport systems, overcoming some of the limitations related to collecting and analyzing survey data (Chen et al., 2016; Zannat and Choudhury, 2019). Smart-card automated fare collection systems allow passengers to make journeys involving different transport modes using magnetic cards and automatic gate machines. As these systems identify and store the location and time where individuals board and, in some cases, exit public transport, they collect accurate descriptions of individual travel (Pelletier et al., 2011). Concurrently, advancements were made possible by the development of methodologies allowing to identify typical travel patterns (Ma et al., 2013).
Route choices
Smart-card data has allowed to quantify how individuals navigate multilayer networks. One of the key findings is that individuals do not choose optimal paths (those with shortest travel time), especially when the system is congested. Focusing on the bus and subway trips of 2.4 million passengers in Shenzen (China), Zheng et al. (2018) studied the coupling (see equation (2)) between the bus and subway layers. In contrast to previous studies (Strano et al., 2015), Zheng et al. (2018) characterized the coupling λ using passenger behavior rather than structural properties of the multilayer network. Under their definition, the coupling between layers is the fraction of multimodal trips actually undertaken by passengers, rather than the fraction of multimodal shortest paths (see equation (2)). The authors find that this “behavioral” coupling correlates weakly with the empirical speed ratio measured between the two layers over time (Figure 4(a)), implying that passengers choose unimodal trips even when multimodal trips may be preferred because one of the two layers is congested. This finding highlights that the speed ratio of different network layers, which was regarded as a key factor in determining coupling strength (Chodrow et al., 2016; Strano et al., 2015), may have a negligible effect on travelers’ route selections, possibly because passengers do not have a full view of the status of traffic. Instead, Zheng et al. (2018) showed that the coupling between layers is generated by long-distance trips originating from nodes served by a single transport layer (Figure 4(b)–(d)). Characterizing individual multimodal behavior. 
Power-law distribution of displacements
The availability of large-scale data sources has revealed that individual mobility patterns display universal properties. One key finding is that the distribution P(Δr) describing the probability of traveling a given distance Δr is characterized by a power-law tail P(Δr) ∼Δr−β, with 1 ≤ β ≤ 2 (Barbosa et al., 2018; Gonzalez et al., 2008) (Figure 4(e)). This finding is consistent across a range of studies that used different data sources, see Alessandretti et al. (2017) for an extensive review. It was recently shown that these observed scaling properties result from the aggregation of movements within and across characteristic spatial scales, corresponding to the sizes of buildings, neighborhoods, cities, and regions (Alessandretti et al., 2020). Further, the emergence of scaling properties was associated to the use of multimodal transportation: Zhao et al. (2015) used GPS data to show that mobility using a single mode can be approximated by a lognormal distribution, but the mixture of the distributions associated with each modality generates a power-law (Figure 4(f)–(i)); Gallotti et al. (2016a) found that a simple model where individual trajectories are subject to changes in velocity generates a distribution of displacements with a power-law tail. In fact, individuals using multimodal infrastructure are subject to drastic changes in velocity. Varga et al. (2016) showed that the travel speed v increases with travel distance according to the power-law functional form v ∼ r α , where α ≈ 0.5. This dependence is due to the hierarchical structure of transportation systems and the fact that waiting-times (parking, take-off, landing, etc.) decrease as a function of trip distance. More recently, Mizzi et al. (2021) developed a survival dynamical model based on three observables associated to three time scales of the model: the time cost, the convenience time, and the typical time. Mizzi et al. found that the model is able to reproduce the statistical behavior of the travel time distributions.
Open data and tools
Together with other fields, urban and transportation science are becoming more open, increasingly relying on datasets and computational tools freely available to the scientific community. In this section, we highlight some of these openly available datasets and computational resource developed for the analysis of multimodal transport networks.
Data
During the last years, new datasets have been made publicly available either from the public sector or from crowd-sourced data, allowing to go beyond simulations and synthetic data to understand how urban dynamics and mobility unfold, providing a better picture of the world. Along this section, we will showcase different data sources useful for the study of multimodal mobility from different perspectives: first by focusing on static data from transport infrastructure, then we will show the availability and use of dynamical datasets, and finally dedicated multimodal datasets.
Infrastructure data
In the last years, the study of transportation networks has benefited from the development of OpenStreetMap (OpenStreetMap contributors, 2017b), an open-source collaborative project focused on collecting and sharing worldwide high-quality spatial data (Barbosa et al., 2018; Ferster et al., 2019; Haklay, 2010; Girres and Touya, 2010). Data extracted from OpenStreetMap allows to build and analyze several transportation networks, of which the most common are single-layer street networks (Boeing, 2020, 2021).
However, the data available at OpenStreetMap is also useful to extract information about multiple infrastructures, including transportation systems, such as subways and railways. All such data can be combined to build multimodal transport networks. One example of the use of different OpenStreetMap datasets is the work by Gil (2015) where data from multiple transport infrastructures were used to build a multiplex network for the Randstad region of the Netherlands linking the layers through the intersections between transport modes. More recently, Natera Orozco (2019) followed the same approach to analyze the multiplex transport of 15 cities in different development stages, including London, Los Angeles, and Mexico City.
As useful as the infrastructure of transport networks can be it comes with a drawback, it only encodes the physical and static elements of multimodal mobility. In order to capture how people actually move in a city, it is necessary to use dynamical data. In the next sections, we cover different datasets that allows researchers to build and analyze dynamics on top of multimodal transport networks.
Dynamical data
The transportation network of a city also encodes temporal dimensions, such as how people move in the city, or what is the frequency of buses, tramways, and subways. These temporal components are of great value to understand the mobility system and interactions. One possible way to capture such dynamics is the use of transportation timetables; an example of this method is the work by Gallotti and Barthelemy (2015) who constructed and shared the temporal network of public transport in Great Britain. This is a large dataset, where links, associated to flows from one location to another, only exist at specific times. Link information has to be properly combined to compute travel time from origin and destination, highlighting the importance of synchronization of the different transportation models. An interesting feature of this dataset is that it not only contains the public transport layers of several cities but also connections among them, for instance, through coaches, planes, and ferries operating at the national level. Having these different levels of transport information allows researchers to move through different scales and study the interplay between different mobility options in the national, regional, and local levels.
In general, the use of timetables and transit feeds has been enabling researchers to capture with increasing accuracy the dynamics of public transportation systems. A large collection of GTFS feeds in multiple locations has been collected and made freely available as a webpage and API (OpenMobilityData, 2015). These datasets include stops, routes, and timetables of public transport in multiple cities and providers from 667 locations around the world. As shown by Aleta et al. (2017), these data can be used to analyze the public transportation as a multiplex network, considering each bus line and/or transport provider as one layer and making inter-layer connections every time bus lines share a bus stop. Using data from GTFS feeds, Kujala et al. (2018) built and published a collection of 25 urban public transport networks covering cities from North America, Europe, and Oceania. This dataset is peculiar as it also includes the pedestrian layer of the cities, and public transport is further differentiated across the different public transport modes.
Public transport data and GTFS feeds provide researcher with the opportunity to improve the study of multimodal transportation, coupling the dynamics of different public transit modes together with the physical infrastructure. A further step is the use of data that contains multimodal mobility by users, either by aggregating users between areas or by identifying unique multimodal mobility patterns.
Multimodal data
A recent dataset published by Tenkanen and Toivonen (2020) contains aggregated multimodal trips information for the Helsinki region in Finland. This dataset includes multiple transport modes, such as walking, cycling, and public transportation options. To calculate the travel times, Tenkanen and Toivonen (2020) use a door-to-door principle. This means that travel time and distance are calculated considering every step of a journey, including walking legs and transfers between vehicles. An important feature of this dataset is the inclusion of travel time matrices for three distinct years, 2013, 2015, and 2018. This is a rare occasion to compare how travel times changed over the years, allowing a characterization of the evolution of human mobility.
Concerning mobility, the Geolife dataset (Zheng et al., 2011) consists of GPS trajectories collected by Microsoft Research Asia for 178 users in a period of over 4 years (from April 2007 to October 2011). 69 users labeled their trajectories with the corresponding transportation mode, such as driving, taking a bus, riding a bicycle, and walking. As such, the GeoLife data has allowed to investigate mobility behaviors using different transport modes (Zhao et al., 2015).
The data described above are freely available and represent an opportunity for further data-driven investigations of multimodal transportation networks. We note that individual-level mobility data is highly sensitive and often cannot be shared to protect the privacy of the study participants (Alessandretti et al., 2020; Gallotti et al., 2016a; Gonzalez et al., 2008; Zheng et al., 2018).
Tools
Over the years, computational tools have become more and more important for studying urban systems, and in particular transportation networks. For an overview of the available tools in geographic analysis in transport planning, see Lovelace (2021). Here, we focus on open-source tools, as they allow fully reproducible research or analysis.
In the union of the “Networks” and “Transport” spheres, there is a plethora of tools specialized on various purposes, including network analysis (single-layer or multilayer), routing and access (unimodal or multimodal), or mobility analysis. We attempted a visual classification in Figure 5. The classification is not straightforward; for example, OSMnx includes routing capabilities. However, we aimed to place each tool in a category that reflects its main focus, strength, or use case. Therefore, we placed OSMnx at the intersection of networks and mobility and not under routing/access as that is not its main focus. Overview of the main open-source analysis tools related to networks and transport. There are several network and multilayer network-focused tools, and there are several transport or mobility-focused tools. Further, some tools focus on routing or access. Some tools have multiple capabilities (e.g., OSMnx can do routing), but we attempted to place each tool in the intersection of categories that best represents their main focus or use case.
Transport tools
We first describe the most general tools for mobility and transport network analysis, on the right side of the “Transport” category in Figure 5.
To work with public transportation data Google (2018) developed transitfeed, a Python library to parse, validate and build GTFS files. This tool is particularly useful to those interested in the manipulation of the raw data. However, to convert the data into a network, some additional steps are needed, such as using Peartree (see below).
Movingpandas, developed by Graser (2019), is a Python package that provides trajectory data structures and functions for the analysis and visualization of mobility data. In a similar sense, and also developed in Python, scikit-mobility (Pappalardo et al., 2021) is a library that implements a framework for analyzing statistical patterns and modeling mobility, including functions for estimating movement between zones using spatial interaction models and tools to assess privacy risks related to the analysis of mobility datasets.
Network tools
In this section, we cover the most general network analysis tools on the top left side of the “Networks” category in Figure 5.
Multiple tools allow to work with graphs. A few examples of freely available software packages and tools are Networkx by Hagberg et al. (2008), igraph by Csádi and Nepusz (2006), and graph-tool by Peixoto (2014). These tools are freely available and constantly updated over time relying on contributions from an engaged community. Although these tools serve a general purpose, they can also be used for the study of transportation networks.
Transport network tools
In this section, we cover the most general intersection between the “Networks” and “Transport” categories, that is, the top part in Figure 5.
Multiple tools were developed to obtain data on transportation and multimodal infrastructures. One of the best known is OSMnx (Boeing, 2017), a Python package that downloads street networks from OpenStreetMap into Python objects. OSMnx can further be used to download other transportation networks, and build its multimodal transport networks.
Another reliable Python library to read data from OpenStreetMap and extract transportation networks is Pyrosm (Tenkanen, 2020). Differently from OSMnx, Pyrosm reads the data directly from OpenStreetMap’s Protocol Buffer Format files (*.osm.pbf), while OSMnx downloads the data from the Overpass API (OpenStreetMap contributors, 2017a). For this reason, Pyrosm is a particularly good alternative when working with large urban areas, states, and even countries, while OSMnx typically offers a more precise way to collect data from specific points in a city.
Finally, an alternative to transitfeed’s functionality of reading GTFS feeds is Peartree (Butts, 2021), a Python library allowing to convert GTFS feed schedules into the corresponding directed network graph.
Multilayer network tools
In this section, we cover the general multilayer network analysis tools, that is, the left part of the “Multilayer” category in Figure 5.
The so far mentioned tools were not built specifically with the purpose to work with multilayer networks. To cover this need, specific libraries have been developed. A first example is muxViz by De Domenico et al., 2015, a stand-alone front-end tool which allows the computation of several multilayer measures, from centrality to community detection. MuxViz is also an advanced visualization tool, providing an effective way to display edge-colored multigraph or multislice networks.
Several software options are available in Python, often built on top of NetworkX. A library originally designed for the study of multilayer networks, that can be easily adapted to multimodal networks, is MAMMULT by Nicosia and Battiston (2015). This library contains a collection of algorithms to analyze and model multilayer networks. The functions included in the collection cover a wide range of applications from structural properties, such as node, edge, and layer basic properties, to the analysis of dynamics on multilayer networks, such as random walks.
Another example is multiNetX by Kouvaris et al. (2015). This library extends Networkx allowing the creation of undirected weighted and unweighted multilayer networks from NetworkX objects. Once the multilayer networks are built, the library focuses on the spectral properties of the corresponding adjacency or Laplacian matrices. Such tool also provides nice visualization tools improving from NetworkX, allowing the user to better visualize multilayer dynamics through coloring the nodes and links over time.
A more recently developed Python library, not relying on NetworkX, is Pymnet (Kivelä, 2018). The package handles general multilayer networks, including multiplex networks with temporal variables. For this reason, it is possible to use it for the analysis of multimodal urban transport networks that incorporate transit schedules. This library also includes multiple network analysis methods, transformations, and models to analyze and visualize multilayer networks. Another alternative is the multinet library (Magnani et al., 2020), available both in Python and R. This package provides tools to work with multilayer networks, including community detection and visualizations. When visually working with multilayer networks, it is important to account for principles of visualization and cognitive overload (Rossi and Magnani, 2015). Another option in R is multiplex developed by Rivero Ostoic (2020). This library offers multiple functions to work with matricial representations and visualization of multilayer networks.
Tools for routing or access on transport networks
In this section, we cover the remaining “Routing/Access” category at the intersection of the “Networks” and “Transport” categories in Figure 5.
On one hand, there are established general, high-performance tools for unimodal routing such as graphhopper (GraphHopper, 2022) and OSRM (OSRM, 2021). These tools are developed for routing on one network type but could in principle be extended to multimodal routing. Explicit multimodal routing is provided by OpenTripPlanner (OTP) (OTP, 2022) and R5 (Conway et al., 2017). Both tools exist as fast R implementations, r5r (Pereira et al., 2021) and OTP for R (Morgan et al., 2019). The R package dodgr (Padgham, 2019), thanks to an efficient algorithm for computing distances between pairs of nodes in a street network, further enables the aggregation of flows on network links, given as input a set of origin and destination points, a matrix of pairwise flow densities, and the underlying street network.
Finally, several open-source packages focus on computing accessibility metrics, for example, the ease by which people can reach points of interests, such as those offering employment, shopping, medical care, or recreation (Rietveld, 2000). Pandana (Foti et al., 2012; Pandana, 2021) is a Python library that enables to compute the accessibility of places by retrieving points of interest and street network data from OpenStreetMap (OpenStreetMap contributors, 2017b) and by efficiently computing shortest paths along the street network. Access (Saxon et al., 2021) is a Python library that computes a wide range of spatial accessibility metrics given as input a set of origins and destinations, as well as travel times or distances between them. Built on top of Pandana, UrbanAccess (Blanchard and Waddell, 2017) integrates the creation of multimodal transport networks (transit and pedestrian) using GTFS data and the computation of accessibility metrics. Similar functionalities are offered in R by r5r (Pereira et al., 2021).
Complementary tools and future needs
The analysis tools discussed above are well complemented by tools such as mapbox (Mapbox, 2021), carto (Carto, 2021), kepler.gl (kepler.gl contributors, 2021), and studio unfolded (Unfolded, 2021), built on top of OpenStreetMap (OpenStreetMap contributors, 2017b), which allow to create and share geospatial interactive web visualizations. While these tools have not been designed to work specifically with networks, it is possible to leverage their geospatial visualization capabilities to create appealing visualization of urban systems.
Our survey on tools for Networks and Transport analysis in Figure 5 reveals a gap in the existing open-source libraries landscape. On the one hand, established libraries for general multilayer networks (e.g., MuxViz and MAMMULT) offer a wide range of functionalities related specifically to the study of multilayers. For example, they enable to compute multilayer network metrics, correlations across layers, and layer reducibility. These libraries can handle all multilayer networks (including multilayer transport networks); however, they are not specifically designed for the analysis of transport networks and might therefore be missing some transport-specific features. On the other hand, at the intersection between “Multilayer” and “Transport” in Figure 5, tools focusing on multimodal transport networks (UrbanAccess, r5r, etc.) currently serve a general routing or access purpose and offer limited functionalities in regard to specific aspects of multimodality or the structural/dynamic analysis of the underlying multilayer networks. With many of the surveyed tools still at the experimental stage (version number 0.X) or under heavy development, the necessary foundation for multimodal analysis tools simply has not been fully established yet. Given the growing interest for multimodal and sustainable transport, we anticipate that new open-source libraries will be built specifically for multimodal transport analysis.
Conclusions
In this review, we discussed the state-of-the-art in the field of multimodal mobility and multilayer transport networks from a complexity science perspective, focusing on urban environments. On one hand, we covered the science of the dynamics of mobility: How do people move? Which forms of transportation do they use? How do they find their paths or switch between modes? On the other hand, these dynamics take place on an underlying (infra)structure which can be well modeled by multilayer networks. In this context, a number of mathematical metrics have been developed in network science recently which allow the rigorous study of the topic. Parallel to the methodological developments we have witnessed a spur of new computational tools—many of them open source—and datasets which considerably facilitate and boost further research on the topic. Despite an explosion in geospatial data collection, it is still relatively difficult to access spatio-temporally fine-grained—and appropriately anonymized (De Montjoye et al., 2013)—mobility data openly. Such high-quality data are in danger of being siloed in by commercial stakeholders, obstructing transparent research on the topic. We must therefore push for the implementation of better systems by governments, academia, and industry to recognize and promote efforts for making datasets and tools openly available by and for researchers (Lovelace, 2021; Stodden et al., 2016).
The idea of analyzing individuals’ and collective travel patterns as a graph or network, in both unimodal and multimodal travel context, is not new. The classical transportation network modeling approaches such as traffic assignment focuses more on understanding the distribution of single class or multi-class trips across links in a network and estimating link and path flows. These methods do not often take into account the physical and structural properties of the underlying transportation network (e.g., betweenness centrality and node degree distribution), as well as the network dynamical functionality (e.g., random walk and diffusion processes). While complex network-driven approaches mostly explore the structural properties of transportation networks and their dynamical functionality from a statistical physics perspective. The integration of these two traditionally mostly separated fields offers promising directions for measurement, modeling, and description of urban travel behaviors and patterns. There is a growing number of studies that aim to connect the two fields and provide new modeling approaches including understanding the macroscopic dynamics of network traffic (Olmos et al., 2018), characterizing origin–destination travel demand as a graph (Saberi et al., 2017), planning bicycle infrastructure with percolation theory (Olmos et al., 2020), and analyzing congestion propagation as a reaction–diffusion system (Bellocchi and Geroliminis, 2020).
The existing research on multimodal travel based on passively collected data sources is far from being comprehensive. Most empirical studies on multimodal behavior have focused on the use of public transit, such that the interplay between public and forms of private transportation such as walking, driving, and cycling has been poorly characterized. Studies focusing on public transportation network are often based on public transport schedules instead of real-time data and thus neglect important effects deriving from congestion. We anticipate that the increasing availability of high-resolution GPS trajectories collected by individual mobile phones and sensors installed on private and public transport vehicles (Barbosa et al., 2018) will be key to fill these gaps in the literature. The continuous growth and developments of urban transport infrastructures is also raising new research challenges. One critical issue relates to the modeling of shared mobility services, such as shared bicycles and vehicles (Shaheen and Chan, 2016). Multimodal frameworks that integrate shared services with traditional public and private transport infrastructures are becoming necessary to ensure real-time and user-centered solutions for planning, forecasting, and managing services, while increasing safety, reducing congestion and emissions.
Despite the currently exploding research on multimodal mobility, there exists a wide frontier of topics to tackle and new approaches to explore. For instance, there is ample potential for future work to develop growth models for multimodal infrastructures explicitly considering densification and exploration, as previously done for street networks (Strano et al., 2012). An alternative view can be obtained by investigating the optimal growth and design of the multiplex structure of the different layers, as was done for the multilayer airline network composed by routes of different airline companies (Santoro et al., 2018).
Important advances on multimodal mobility and transportation have been shown to be interdisciplinary and have clearly benefitted from the large variety of scientific fields and practices. Indeed, synergies between disciplines such as urban planning, geoinformatics, computer science, and physics have increased in the last few years, giving rise to new interdisciplinary approaches such as a Science of Cities or Urban Data Science (Boeing et al., 2021; Resch and Szell, 2019). We envision that research on multimodal mobility and transportation to maintain a highly interdisciplinary character also in the future. For example, the study of human mobility has recently benefited from novel scientific advances from other fields such as deep learning. Luca et al. (2020) offers a comprehensive overview of the topic and its applications to human mobility, surveying data sources, public datasets, and deep learning models, and we anticipate the possibility that this area will soon make an impact in unveiling new features of multimodal mobility and transportation.
Finally, understanding multimodal mobility and its underlying infrastructure is of paramount importance for developing sustainable urban transport, as it relies on the central role of public transport modes. Indeed, studying multimodal mobility is one piece in the puzzle towards reversing the global societal threat of climate change which is caused to a considerable extent by car-centric transport monocultures (Mattioli et al., 2020). For this reason, we hope that our review can serve as a starting point to develop a more modern, sustainable, and integrated idea of mobility worldwide.
Footnotes
Acknowledgments
The authors would like to thank Riccardo Gallotti for useful feedback.
Author Contributions
L.A. and L.N. contributed equally to this work. F.B. coordinated the project. All authors contributed to the organization and writing of the paper.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Consent for publication
We confirm that the manuscript has been read and approved for submission by all the named authors.
