Abstract
In this paper, we make an attempt to increase our understanding of the urban scaling phenomenon. The aim is to investigate how superlinear scaling emerges if a network increases in size and how this scaling depends on the occurrence of elements that constitute the network. To this end, we consider a city as a complex network structure and simulate this structure by the network of all publications of a research intensive university. In this simulation, the publications take the role of the city inhabitants and the concepts (terms and keywords) in the publications represent all kinds of abilities and qualities of the inhabitants. We use in this experiment all author- and database-given terms of the scientific publications of Leiden University from 2022. We calculate the co-occurrence of terms, and on the basis of these connections, we create a network and let this network grow by successively adding publications from the total set of publications. In this way, we get a series of networks with different sizes and this simulates a series of cities with different number of inhabitants. This procedure is performed for different values of the term occurrence threshold. We then analyze how four important network parameters, namely, number of terms, number of clusters, number of links, and total link strength increase with increasing size of the network. Particularly the number of network links and the total network linkage strength are in our opinion the parameters that dominate the scaling phenomenon and can be considered as a simulation of the socioeconomic strength of a city, that is, its gross urban product. We find a significant power law dependence of these network parameters on network size and the power law exponents for the lowest occurrence threshold are within the range that is typical for urban scaling. In our approach, the number of clusters can be interpreted as a measure of complexity within the network. Since the occurrence threshold determines the diversity of terms, we may expect a special relation between the occurrence threshold and the number of clusters. This is indeed the case: whereas for the three other network parameters the scaling exponent increases with increasing occurrence threshold, the number of clusters is the only network parameter of which the scaling exponent decreases with increasing occurrence threshold. Finally, we discuss how our publication term network approach relates to scaling phenomena in cities.
Introduction
Recent studies show a more than proportional (superlinear) increase of the socioeconomic performance of cities (measured by the gross urban product) in relation to population size (Bettencourt et al., 2007, 2010; Lobo et al., 2013; Bettencourt and Lobo, 2016; Cebrat and Sobczyński, 2016; Barthelemy, 2019a, 2019b). If individual cities i with population
This urban scaling relation implies a power law dependence of the gross urban product with population size. The coefficient Scaling of the gross urban product G(P) with population size P for the major cities in the Netherlands. The central cities are indicated with blue circles (CC) and their urban agglomerations with red squares (A). Data are the average for the years 2020–2022.
Urban scaling behavior is also found for other city characteristics such as employment in terms of number of jobs, number of crimes, total road length, and knowledge production activities in cities (Alves et al., 2013; Arbesman et al., 2009; Bettencourt et al., 2007; Schläpfer et al., 2014).
A simple way to understand the phenomenon of urban scaling is by seeing cities as a complex network. The larger the city in population size, the more network nodes. The nodes in the urban system are abilities and capacities of the inhabitants, social and cultural institutions, centers of education and research, firms, etcetera. The number of nodes has a linear dependence on size, but the links between nodes depend on size in a superlinear way. The links between these (clustered) nodes represent many types of exchange activities such as movement of people, transportation of goods, transmission of information (Shutters et al., 2018), exchange of ideas, and expertise. Therefore, these links are crucial for new developments, reinforcement of urban facilities, and innovation. One can expect that the productivity and efficiency of an urban economy increases as the level of connectivity among the constituent units increases. Because they increase superlinearly, the socioeconomic strength of cities increases more than proportionally with increasing population size. In this way, cities can be conceived as a complex network structure (Qian et al., 2020; Stier et al., 2022) following scaling laws (Batty, 2008, 2009).
The measurement of urban scaling is complicated by a number of problems. Scaling characteristics may depend on the definition of cities in terms of population size and population density of the urban area, and even on the size of a city measured in terms of residents and of working population size (Strumsky et al., 2021). These different definitions of cities can lead to finding linear scaling in situations where other researchers find superlinear scaling (Arcaute et al., 2015; Cottineau et al., 2017). The scaling exponent can even change from superlinear to sublinear (Louf and Barthelemy, 2014a). The chosen set of cities may also lead to different scaling exponents (Cebrat and Sobczyński, 2016; Sahasranaman and Bettencourt, 2021; Van Raan, 2020). For cities in the richer western part of the EU, a lower scaling exponent was found as compared to cities in the less rich eastern part of the EU (Strano and Sood, 2016). Scaling exponents may also vary in time (Bettencourt et al., 2020; Cebrat and Sobczyński, 2016; Depersin and Bathelemey, 2018; Strano and Sood, 2016; Strumsky et al., 2021). Scaling exponents differ for the various variables such as for instance gross domestic product, employment, and number of patents (Bettencourt and Lobo, 2016). Ramaswami et al. (2018) show that the economic structure of cities may considerably affect the measured urban scaling exponents. An Australian study (Sarkar, 2019; Sarkar et al., 2018) shows that scaling exponents may also differ within a broader variable: for lower income, the scaling is about linear but for high income, the scaling is superlinear. Leitão et al. (2016) discuss the problems in the statistical estimation of scaling exponents. Nevertheless, an extensive study on urban scaling in Brazil (Meirelles et al., 2018) using a large number of variables and different statistical approaches shows that socioeconomic variables (e.g., GUP) scale superlinearly with city population size, whereas infrastructural variables (e.g., length of street network) scale sublinearly and individual basic services (e.g., number of houses) scale linearly.
In understanding urban scaling, it is generally assumed that the scaling of urban economic performance is driven by social and economic interactions that increase disproportionally with urban population size. Ribeiro et al. (2017), for instance, discuss a model of urban scaling based on the distant-dependent interaction range between the citizens and on the spatial structure of a city. Altmann (2020) studies interactions between individuals in different cities and shows that including these inter-city interactions lead to better scaling models.
There are several important studies on the theoretical derivation of urban scaling based on mathematical frameworks (Bathelemy, 2019a, 2019b; Bettencourt, 2019), often related to statistical physics. Lobo et al. (2020) use a theoretical settlement model to explain urban scaling. The growth of cities is studied, for instance, by a diffusion-limited aggregation model (Makse et al., 1995) and by a population dynamics model (Reia et al., 2022). A theoretical framework of local interactions was used to study scaling properties of cities (Bettencourt, 2013; Bettencourt and West, 2010). Our approach is not a theoretical, mathematical construction, but an experiment with a “real-world” growing network of which the parameters are measured as a function of network size. Thus, our study poses a specific question about scaling, namely, can we discover scaling properties similar to urban scaling if we analyze the linkage structure of a growing network and can we find how this scaling depends on the occurrence of elements that constitute the network. We think that this is a novel approach in research on urban scaling.
There are many studies on the properties of network structures, cluster formation, community detection, and connectivity of nodes (Barabasi and Albert, 1999; Fortunato and Newman, 2022; Jin et al., 2001; Newman, 2004, 2006; Newman and Girvan, 2004; Traag et al., 2011). Network approaches have been applied to study how cities evolve (Batty, 2008, 2013), and, for instance, Lenormand et al. (2015) use an network approach to develop clustering methods for the study of scaling in land use organization in cities. However, there is little research on the enlargement of real-world networks (i.e., constructed with elements from existing systems as opposed to theoretical constructions) and its relation with connectivity (number of links and total link strength) and clustering (number of clusters), and, in particular, the scaling of these network parameters as a function of network size. Thus, to the best of our knowledge, there are no studies that relate urban scaling phenomena to scaling properties of growing networks.
As we stated earlier, urban scaling is fundamentally based on social interactions. In this approach, a city is a network of social interactions and the total number of interactions is directly related to the socioeconomic output of cities. This directly relates to research on economic models of agglomeration effects that are based on an increasing productivity derived from social interactions (Glaeser et al., 1995; Meijers et al., 2016; Strumsky et al., 2023). Cottineau et al. (2019) analyze urban clusters to detect agglomeration economies.
The structure of this paper is as follows. The following section addresses our method and data, and in particular why we simulate a city with a university. In the third section we present and discuss our results. We conclude this paper in the final section with a discussion of the implications for urban scaling.
Method and data
Simulating a city with a university
In an earlier study (Van Raan, 2013), we found that also universities show a scaling behavior similar to cities. Whereas for cities, the gross urban product is an important measure of socioeconomic strength, one can say that for universities the extent to which all publications of a university are cited worldwide is an important measure of the scientific strength of the university. It is this analogy that we use to simulate the urban network in order to find and analyze scaling properties. Our aim is to investigate how superlinear scaling emerges if a network increases in size, and how this scaling depends on the occurrence of elements that constitute the network. To this end, we consider a city as a complex network structure and simulate this structure by the network of all publications in a research intensive university. In this simulation, the publications take the role of the city inhabitants and the concepts (terms and keywords) in the publications represent all kinds of abilities and qualities of the inhabitants. We then assume that the total links strength of the publication terms network is an analogon of the socioeconomic performance of a city. In this way, we investigate scaling of growing networks in an empirical approach.
We use in this experiment all author- and database-given keywords of the scientific publications of Leiden University from 2022. Using publications of a large university means that one covers all fields of science, and this also resembles a city: a very heterogenous content within one coherent system. Just like a city, a university is a not an isolated system but a system of which the content is strongly related with the outside world.
We calculate the co-occurrence of terms, and on the basis of these connections, we create a network and let this network grow by successively adding publications from the total set of publications. In this way, we get a series of networks with different, increasing sizes and this simulates a series of cities with different number of inhabitants. This procedure is performed for different values of the term occurrence threshold. We then analyze how four important network parameters, namely number of terms, number of clusters, number of links, and total link strength increase with increasing size of the network. Particularly the number of network links and the total network linkage strength are the parameters that dominate the scaling phenomenon and can be considered as a simulation of the socioeconomic strength of a city, that is, its gross urban product.
Shutters et al. (2018) find that the increase in network density is for an important part driven by the presence of rare, but highly interdependent occupations in a city. This supports our approach to particularly include rare terms (i.e., by using low occurrence thresholds, see next section) in the network construction. For examples of bibliometric (e.g., co-citation and term co-occurrence) networks, we refer to Price (1965); Braam et al. (1991); Lehmann et al., 2003; Ren et al. (2012); Martin et al., 2013; Golosovsky and Solomon (2017); Muppidi and Reddy (2020). An overview of bibliometric network analysis is given by Perianes-Rodriguez et al. (2016).
A particularly interesting paper for our study is the work on the spatial structure of mobility networks in cities (Louail et al., 2014, 2015; Louf and Barthelemy, 2013, 2014b, 2016). These authors find that the number of hotspots (urban activity centers) scales sublinearly with city population size and they remark that this finding may serve as a guide for constructing a theoretical model. As we will see further on in this paper, we find a similar scaling behavior in our simulation network.
Creating a network
Given that publications can be characterized by a number of terms, a matrix of publication-to-term relations can be constructed. Using linear algebra, from this matrix, a new matrix of term-to-term relations can be deduced (Van Raan, 2019). The off-diagonal elements of this term-to-term (term co-occurrence) matrix indicate the number of publications in which the two terms involved are both present, that is, the co-occurrence frequency of this pair of terms. The larger the number of publications in which two terms co-occur, the stronger the terms are considered to be related to each other, that is, the stronger their link. The network grows with the number of publications, but the network itself is a structure of interconnected terms. This is similar to what we want to simulate: the size of cities is determined by the number of inhabitants, whereas the real basis of the urban scaling phenomenon is the network of interactions between all possible abilities and qualities of these inhabitants. The size of a city therefore is a kind of an easy measurable, proxy independent variable.
In order to construct and to visualize this bibliometric network, we apply the software tool VOSviewer (Van Eck and Waltman, 2010). Publications can be uploaded from large publication databases—such as the Web of Science or Scopus—into the VOSviewer system on the basis of their full database records including author- and database-given terms (keywords). The VOSviewer algorithm calculates how many co-occurrence links there are in the uploaded set of publications, the strength of the links (number of publications in which two terms occur together) and the total link strength. The terms and their links constitute the bibliometric network. In addition, terms with a high link strength are grouped into clusters. The matrix of term co-occurrence frequencies serves as input for the VOSviewer mapping technique. This technique determines for each term a location in a two-dimensional space. Strongly related terms tend to be located close to each other in the two-dimensional space, whereas terms that do not have a strong relation are located further away from each other. For brevity, we will use in the remainder of this paper “term map” instead of term occurrence map. The VOSviewer mapping technique is related to multidimensional scaling, but for the purpose of creating term maps, it has been shown to yield more satisfactory results. The VOSviewer clustering technique is a modularity-based community detection procedure which is discussed in detail by Van Eck et al. (2010), Waltman et al. (2010), Waltman and Van Eck (2013).
It is possible to set a threshold value for the minimum number of term occurrences. We can use this threshold parameter to specify the minimum number of occurrences that a term must have within the set of publications to be included in a term map. We experiment with different occurrence thresholds, beginning with threshold value γ = 1, that is, all terms identified in the set of publications are taken into account in the construction of the network, and further threshold values up to γ = 10. Thus, the occurrence threshold determines the diversity of terms: at a low threshold, most or all of the terms are included in the network formation, whereas at a high threshold, only the more frequently occurring terms are included.
As mentioned above, the VOSviewer methodology also includes a technique in which terms with relatively high co-occurrence links are grouped into a cluster. What is the meaning of these clusters in the context of our simulation of cities? As discussed previously, the terms in the network correspond to the abilities and capacities of the inhabitants and the co-occurrences are the connections between these abilities and capacities. This will give rise to organizations and institutions in the educational, research, business, cultural, and social sectors, which become visible as clusters in the network. In this sense, the number of clusters is a measure of complexity within the network. We may expect a relation between diversity (value of occurrence threshold) and complexity (number of clusters). In the following section we will see that our analyses indeed show a relation between the occurrence threshold and the number of clusters.
Network data and network parameters
We collect from the Web of Science (WoS) all 7,653 publications of Leiden University published in 2022. Leiden University is a research intensive university, oldest in the Netherlands (founded 1575), 34,000 students, invariably in the top-100 worldwide of the international rankings, and 15 Nobel Laureates affiliated to the university (Leiden University, 2023). The full records of the Leiden publications including author- and database-given terms (keywords) are uploaded in the VOSviewer. All 7,653 publications are put in alphabetic order on the basis of the name of the first author. We assume that this assures a random composition of the set of publications as far as the subjects of research (and with that the relevant terms) concerns. The occurrence frequency of terms may differ substantially for the various terms. Some terms occur only in one or a few publications (these publications relate to a very specific topic such as supermassive black holes), and other terms occur in many publications (often these publications deal with important ad general topics in medical research, for instance, cancer or quality-of-life).
A representative sample of 7653 is 366 (confidence level 95% and margin of error 5%) so we start with alphabetically the first 400 Leiden publications and successively double the number: 800, 1600, 3200, and so on. In order to have more measuring points, we also take the first 1000, and then 2000, 3000 up to 7000 publications. In this way, we let the network grow: we have enlarged the network structure from 400 to 7000, an enlargement of nearly a factor 20.
To give an impression of the visualization of the Leiden publications by the VoS viewer mapping procedure, we show in Figure 2 the term map representing the network for the (alphabetically) first 6,000 publications with occurrence threshold γ = 4 (we take a higher occurrence threshold in order to avoid overloading the map with a very large amount of terms, we now have in total 2222 terms after application of the occurrence threshold). We see a large medical research cluster (left side) and an astronomy and astrophysics cluster (right side). In the network maps, the different clusters are indicated with different colors, and the size of the circle of a term is determined by its weight, that is, the number of publications in which this term occurs. For details of the network and mapping procedures, we refer to the VOSviewer manual (Van Eck and Waltman, 2020). Term map of Leiden University publications (2022), 
Results and discussion
Network parameters at different occurrence thresholds for a growing network
For each network structure, the VoSviewer calculates four parameters: the number of terms involved in the construction of the network, the number of clusters, the number of links, and the total strength of the links. These are our network parameters of which we want to find the dependence on the size of the network. We perform these calculations for occurrence thresholds γ = 1 to 10.
In Figure 3, we show our results for occurrence thresholds γ = 1, 2, and 3; the results for γ = 1 up to 10 are presented in the Supplemental Material Table S1. We find for our four network parameters p
i
(total link strength p1, number of links p2, number of terms p3, and number of clusters p4), a significant power law dependence on network size, so we can write Network parameters as function of network size (number of publications) and occurrence thresholds γ = 1 (upper panel), 2 (middle panel), and 3 (lower panel).
We also performed our calculations after removing those terms that have no or a very small link strength (link strength lower than 10; this value is reasonable given the link strength distribution of all terms in the network, see Supplemental Material Figures S1 and S2). For example, in the case of network size 6000 and γ = 1, 27% of the terms is removed and that leads to 13% less total link strength of the entire network. Generally, for most of the network parameters, the measured scaling exponents do not change much (see Supplemental Material Table S1). The scaling exponent for γ = 1 increases from 1.04 to 1.08. However, what changes drastically by removing terms with no or a very small link strength is the number of clusters. In the case of network size 6000 and γ = 1, the number of clusters decreases from 245 to 88: removing terms with no or a very small link strength means that many small clusters disappear.
In the previous section, we discussed that the number of clusters can be seen as a measure of complexity and that a specific relation with the occurrence threshold can be expected. We see in Figure 3 this specific relation: whereas for the three other network parameters, the scaling exponent increases with increasing occurrence threshold, and the number of clusters is the only network parameter of which the scaling exponent decreases with increasing occurrence threshold. In fact, the number of clusters increases sublinearly with network size particularly for the lower occurrence thresholds. This finding is interesting because the network clusters can be seen as activity centers and other studies show a similar sublinear increase of activity centers (“hotspots”) in cities (Louail et al., 2014, 2015; Louf and Bathelemy, 2013, 2014b, 2016). Particularly with occurrence thresholds above 2, the number of clusters hardly increases with increasing size. This suggests that in a growing network, the complexity does increase but not “explode” and this in turn may be an indication that a growing network tend to decrease complexity, which may be a sign of effective self-organization. Further research, however, is necessary to rule out effects of the VOSviewer clustering algorithm at a higher number of publications.
To the best of our knowledge, the only empirical work on the increase of the number of links for a growing network is the study of Li and Lu (2009) and Li (2009). These authors find values between 1.12 and 1.31 which are close to our values, see Supplemental Material Section 3.
Influence of term occurrence and of occurrence threshold
Our measurements on the growing network as presented in Figure 3 are performed with specific occurrence thresholds γ, for instance, γ = 1 means all terms with occurrence 1 and higher are included. It is interesting to find out how specific occurrence values play a role in the structure of the network. In the Supplemental Material Figure S3, we present the relation between the number of terms and the occurrence of a term, and the relation between the occurrence of a term and its link strength in the network. We see that the number of terms decreases as a function of occurrence with a power law exponent −2.33. On the other hand, the higher the occurrence of a term, the larger its link strength in the network, with a linear relation. Thus, the higher the occurrence of a term, the lower the number of these terms, but the larger its contribution to the total link strength of the network.
In the previous section, we discussed the dependence of the network parameters on network size for different occurrence thresholds. We now analyze the data as a function of the occurrence threshold. With the data used in Figure 3, we can analyze the relation between the scaling exponents and the occurrence threshold more explicitly. The results are presented in Figure 4. We find that the higher the term occurrence threshold, that is, the more common terms are involved, the higher the power law exponents, except for the number of clusters. Relation between occurrence threshold and power law exponents of the network parameters.
Figure 4 shows the relatively large difference between the scaling exponent for γ = 1 and the scaling exponents for the higher occurrence thresholds. This has an interesting consequence. In the case of small networks, the total link strength is the largest for γ = 1 (highest term diversity). But if the network becomes larger then, notwithstanding its small scaling coefficient, the total link strength for γ = 5 (which means a lower term diversity) will exceed the total link strength for γ = 1 because of its high scaling exponent. It can be easily shown by equating the scaling equations for γ = 1 and 5 that this will happen around n = 25,000. This would imply that for very large networks term diversity plays a declining role in the total link strength.
With the results shown in Figure 4, we can analyze the relation between the scaling exponents and the occurrence threshold more explicitly. We find that the higher the term occurrence threshold, that is, the more common terms are involved, the higher the power law exponents, except for the number of clusters, and this relation is with high significance logarithmic:
where the coefficient a(i) is positive for the number of terms, the number of links, and the total link strength, and negative for the number of clusters, and the intersection b(i) is around 1.0 for the number of terms, the number of links, and the total link strength, and around 0.2 for the number of clusters. Inserting equation (3) into equation (2) yields
This means that the scaling behavior of our university network is in fact dictated by the occurrence threshold, that is, by the extent a specific term is present (total number of occurrences) in the set of publications. In the following section, we discuss the relevance of our findings for urban scaling research.
Discussion of the findings in the context of city simulation
Increasing the term occurrence threshold means increasing the minimum number of publications in which a term must occur, which implies that the selection of terms relates more and more to the frequent terms, that is, to research on current topics. Taking the lowest threshold, γ = 1, that is, all terms are included, ensures that also less actual topics will be included in the network structure. Particularly less actual topics may represent new, unexpected, and original research. In the context of an urban community, this relates to rare abilities and qualities within the population. Shutters et al. (2018) find that the increase in the total network interconnectivity is for an important part driven by the presence of rare, but highly interdependent occupations in a city.
The choice of the occurrence threshold is a choice of diversity. From our observations follows that the occurrence threshold not only determines the total number of terms in the network but also the number of clusters: the higher the occurrence threshold (less term diversity), the less the number of clusters increases with increasing network size, whereas the total link strength of the network increases with a high superlinear scaling exponent. This could indicate that in a growing network the terms with a high occurrence (occurrence higher than 1) lead to a decrease of the complexity of the network measured in number of clusters.
What are the characteristics of our network clusters and how does this relate to cities? We can investigate this by looking at the clusters in Figure 2. The clusters are broad common focus areas that more or less remain the same as the size of the network increases, mostly corresponding to large fields of research. In the Leiden network structure, we see large medical and astrophysical clusters, and particularly the medical clusters are characterized by general terms with a high frequency, such as “behavior,” “survival,” “risk,” “management,” “mortality,” and “children.” For lower term, occurrence thresholds are also more specific, and smaller clusters become visible such as DNA expression, aging and dementia, stellar evolution, and black holes. This fits well with the idea of the city: there are a more or less fixed number of problems/areas of attention that arise in each city, and some cities have a special clusters such as a port, or a specific industry, or a university. The clusters represent activity centers and we find that they scale sublinearly with increasing network size. We here see a similarity with the work on spatial structure of mobility networks in cities (Louail et al., 2014, 2015; Louf and Bathelemy, 2013, 2014b, 2016). These authors find that the number of hotspots (urban activity centers) scales sublinearly with city population size and they remark that this finding may serve as a guide for constructing a theoretical model.
Conclusions
In this paper, we make an attempt to increase our understanding of the urban scaling phenomenon. In our approach, we simulate a city by a set of publications from a large university where the publications take the role of city inhabitants and the terms (keywords) in the publications represent abilities and qualities of the inhabitants. We use in this experiment scientific publications that are connected by co-occurrence of terms to create a network on the basis of these connections, and let this network grow by successively adding publications. We analyze how four important network parameters, namely, number of terms, number of clusters, number of links, and total links strength increase with increasing size of the network. This procedure is performed for different values of the term occurrence threshold. This occurrence threshold determines the diversity of terms: at a low threshold most or all of the terms are included in the network formation, whereas at a high threshold only the more common, that is, the more frequently occurring terms are included.
We find a significant power law dependence of these network parameters on network size. We remark that the network grows with the number of publications, but the network itself is a structure of interconnected terms. This is similar to what we want to simulate: the size of cities is determined by the number of inhabitants, whereas the real basis of the urban scaling phenomenon is the network of interactions between all possible abilities and qualities of these inhabitants. The size of a city therefore is a kind of an easy measurable, proxy independent variable. Publications take a similar role in our simulation.
The larger the network (with respect to number publications), the larger the number of clusters, but interestingly this number of clusters is the only network parameter that increases sublinearly (exponent β < 1) with increasing size of the network. This could be an indication that a growing network tends to decrease complexity by effective self-organization. Professional diversity is an important element for the socioeconomic strength of cities (Bettencourt et al., 2014). In our approach, the number of clusters can be interpreted as a measure of complexity within the network. Since the occurrence threshold determines the diversity of terms, we may expect a relation with network complexity, and thus a relation between the occurrence threshold and the number of clusters. This is indeed the case: whereas for the three other network parameters, the superlinear scaling exponent increases with increasing occurrence threshold, and the number of clusters is the only network parameter of which the sublinear scaling exponent decreases with increasing occurrence threshold. The sublinear scaling of the number of clusters shows a similarity with work on the sublinear scaling of urban activity centers.
If we assume that all terms, but particularly the rare terms, that is, those with a low occurrence frequency are important for the structure of the network, then the networks with a low occurrence frequency are the most appropriate simulation of the urban complex system and the scaling of this system. Particularly the number of network links and the total network linkage strength are in our opinion the parameters that dominate the scaling phenomenon and can be considered as a simulation of the economic strength of a city, that is, its gross urban product. The measured power law dependence of these network parameters on network size, and particularly the power law exponents are within the range that is typical for urban scaling.
A low occurrence threshold means a high diversity of terms. We find that highest total link strength for the lowest occurrence threshold. In other words, the more diversity of terms, the stronger the network structure. The phenomenon in cities that may correspond to these observations is that at a high diversity of abilities and capacities, the larger the city the stronger it socioeconomically is, and that this socioeconomic strength increases slightly superlinear. Another important finding is that the higher the occurrence threshold, the stronger the power law of the number of links and the total link strength increases. In the context of an urban community, this finding is intuitively understandable: the more a specific, strongly present ability or quality is present in the urban population (in our simulation higher γ), the faster the number of links related to that ability will grow for an increasing population. For relatively rare abilities within the urban population (in our simulation low γ), however important these may be, one can expect that the number of links will grow less fast for an increasing population. We conclude with the key finding if this study: if our simulation model is an appropriate approach to understand urban scaling, the scaling exponent is strongly related to the presence of common but in particular also rare abilities and qualities in the urban population.
Supplemental Material
Supplemental Material - Simulating urban scaling with a term linkages network of a university
Supplemental Material for Simulating urban scaling with a term linkages network of a university by Anthony FJ van Raan in Environment and Planning B: Urban Analytics and City Science.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Data availability statement
The data that support the findings of this study are available on request from the author.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
