Abstract
It is very popular in bibliometrics to present results on institutions not only as tabular lists, but also on maps (see, for example, the Leiden Ranking). However, the problem with these visualisations is that institutions are frequently spatially clustered in larger cities whereby institutions are positioned one above the other. In this Brief Communication, we propose as an alternative to visualise bibliometric data on the city rather than the institution level to avoid this problem.
Spatial bibliometrics has attracted increasing attention in recent years [1,2]. An overview of studies using bibliometric data for spatial analyses has been published by Frenken and Hoekman [3]. A good indication for the recent popularity of spatial bibliometrics is the tendency in university rankings to present the results not only as tabular lists but also on maps. Bornmann et al. [4] visualise institutional ranking data at www.excellencemapping.net: for each institution, the estimated probabilities of publishing highly cited papers and publishing in the most influential journals are shown on maps. The Leiden Ranking (see www.leidenranking.com) provides the map view in which the ranking positions of universities are presented on maps. The users of these and similar Internet-based visualisations receive a regional overview of performance distributions, which is not possible on the base of institutional lists. For example, the visualisation of the institutional data by Bornmann et al. [4] points out a west-east gradient of scientific performance in several subject areas. However, the problem with these visualisations is that institutions are frequently spatially clustered in larger cities whereby institutions are positioned one above the other. Thus, the user is not able to inspect the results in important areas which are characterised by many institutions on the map.
In this Brief Communication, we show two example maps with the object of improving the perception of spatial results. The underlying data are from the Scimago Institutions Ranking (see https://www.scimagoir.com), which is based on Scopus data (Elsevier). As with the Leiden Ranking, the ranking results are visualised on maps. To avoid the problems described above, the publication data for the maps, which are presented in this Brief Communication, have not been aggregated on the institutional level, but on the city level. The city level has the advantage that corresponding shape files (popular geospatial vector data formats for geographic information system software) are available for spatial visualisations. By visualising the data on the city level, the data can be presented on maps without overlaps. Maps on the city level are also interesting in terms of content because ‘science thrives in cities. With two-thirds of the global population projected to live in urban areas by 2050, cities will become even more distinctly the domain of knowledge and innovation’ (see natureindex.com/supplements/nature-index-2017-science-cities/index).
We use data for German cities to demonstrate the city approach for science mapping. Table 1 shows those cities in Germany with the most papers belonging to the 1% most frequently cited papers within their subject area and publication year. The use of the top-x% field-normalised indicators for performance analyses is recommended in the Leiden Manifesto including 10 principles to guide research evaluation [5]. By considering the top-1% papers, the study focuses on the most highly cited papers. Since we base the study with the choice of the city level on larger units, this focus might be justified. However, considering all papers – and not only the top-papers – might lead to different results. The papers (including all subject areas) in this study refer to the period from 2011 until 2015. Only substantial papers have been considered (articles and reviews); thus, papers with document types such as editorial material have been excluded. The 16 cities in Table 1 are a selection from 200 German cities which published at least one paper (considering all papers, not only highly cited papers) in the publication period.
Number of papers as well as number and proportion of papers belonging to the 1% most frequently cited papers for German cities with more than 500 top-1% papers (sorted by number of top-1% papers in decreasing order).
In this study, we used the full counting approach to assign papers to cities. This means that a paper was fully assigned to a city if at least one author was from this city. An alternative would have been the fractional counting approach, whereby each paper is fractionally assigned to corresponding cities. For example, if authors of a paper are from three cites, the paper would be assigned one-third to each city. The advantage of the fractional counting approach is that multiple assignments of single papers are considered in the calculations. The disadvantage of the approach is that there exist not only one but also numerous methods of how papers can be fractionally assigned to units [6].
Figure 1 shows the number of papers belonging to the 1% most frequently cited papers on the city (and municipality) level. The cities have been assigned to four performance classes by using the Characteristic Scores and Scales (CSS) method – ‘a parameter-free solution for the assessment of outstanding performance’ [7]. This method was introduced by Glänzel and Schubert [8] for assigning papers in a field and publication year to meaningful impact classes (as a rule four classes). ‘Characteristic scores are obtained from iteratively truncating samples at their mean value and recalculating the mean of the truncated sample until the procedure is stopped or no new scores are generated’ [9].

Number of papers belonging to the 1% most frequently cited papers for German cities (see the list of cities in Table 2 in Appendix 1). White areas indicate cities without any paper. Some cities are so small that they do not appear as coloured (white) areas, but are masked by black borders.
In the first step, the mean number of top 1% papers has been calculated for all German cities. Cities with numbers below the mean have been classified as ‘below average number’ (light-pink cities in Figure 1; n = 157). Cities with paper numbers above the mean have been used for further calculations in the second step. For these cities, the mean number has been calculated again and the cities with numbers above the mean have been assigned to the category ‘above average number’ (dark pink in Figure 1; n = 27). In the third step, the procedure of mean calculation and separation of two classes has been repeated, which resulted in two further city classes labelled ‘far above average’ (light-red cities in Figure 1; n = 12) and ‘outstanding’ (dark-red cities in Figure 1; n = 4).
The results in Figure 1 reveal that only a few regions in Germany are involved in the production of (highly cited) papers. Out of more than 11,000 German cities (and municipalities), only 150 are associated with the publication of highly cited papers (200 cites have at least one paper published). The largest concentration of cities with scientific activities (on the top-cited level) can be found in western Germany. The cities with the highest numbers of papers belonging to the 1% most frequently cited are especially big German cities: Berlin, Hamburg and Munich. However, this result changes if the size-independent perspective in Figure 2 is used instead of the size-dependent perspective in Figure 1.

Proportion of papers belonging to the 1% most frequently cited papers for German cities. Only cites with at least 50 papers are considered (see the list of cities in Table 3 in Appendix 1). White areas indicate cities without any paper. Some cities are so small that they do not appear as coloured (white) areas, but are masked by black borders.
Figure 2 shows the proportion of top-1% papers for cities with at least 50 papers. Here, those cities are in the top-group of German cities with a proportion of papers between about 10% and 25% belonging to the top-1% papers. The results reveal that Berlin, Hamburg and Munich belong to different performance classes, whereas Munich and Hamburg are ‘above average’ and Berlin is ‘below average’.
Although the size-dependent perspective in Figure 1 favours larger cities, such as Berlin or Munich, the focus on the size-independent perspective in Figure 2 favours small cities, such as Worms or Neu-Isenburg.
The spatial maps produced for this Brief Communication demonstrate that the analysis of bibliometric data on the city level produces overviews of the performance of countries with high information content. The maps avoid the overlay of performance information from institutional maps, but are more detailed, for example, than the density maps proposed by Bornmann and Waltman [10]. These authors used a raster with 500 rows and 1000 columns and a kernel width parameter of 100 km to visualise the affiliations of authors.
An important disadvantage of our city approach and the regions approach by Bornmann and Waltman [10] is the information loss. If a city has many institutions, the performance differences between these institutions are ignored. A good example is Harvard University, Massachusetts Institute of Technology and Boston University in the Boston (area).
Footnotes
Appendix 1
Number of papers as well as proportion of papers belonging to the 1% most frequently cited papers for German cities (sorted by proportion of top-1% papers in decreasing order).
| City | Number of papers | Proportion of top-1% papers | CSS class |
|---|---|---|---|
| Worms | 106 | 24.53 | 3 |
| Großhansdorf | 54 | 18.52 | 3 |
| Neu-Isenburg | 151 | 16.56 | 3 |
| Bad Krozingen | 94 | 12.77 | 3 |
| Ingelheim am Rhein | 1160 | 10.17 | 2 |
| Schwerte | 71 | 9.86 | 2 |
| Weimar | 452 | 9.51 | 2 |
| Dessau-Roßlau | 240 | 7.5 | 2 |
| Monheim am Rhein | 67 | 7.46 | 2 |
| Bad Homburg vor der Höhe | 255 | 7.45 | 2 |
| Offenbach am Main | 240 | 7.08 | 2 |
| Duisburg | 71 | 7.04 | 2 |
| Leverkusen | 1816 | 6.83 | 2 |
| Müncheberg | 575 | 6.61 | 2 |
| Ehningen | 77 | 6.49 | 2 |
| Bad Nauheim | 575 | 6.43 | 1 |
| Wachtberg | 115 | 6.09 | 1 |
| Planegg | 1523 | 6.04 | 1 |
| Böblingen | 100 | 6 | 1 |
| Blankenfelde-Mahlow | 67 | 5.97 | 1 |
| Hattersheim am Main | 69 | 5.8 | 1 |
| Lüneburg | 1128 | 5.67 | 1 |
| Siegen | 2236 | 5.14 | 1 |
| Wilhelmshaven | 99 | 5.05 | 1 |
| Oberschleißheim | 5140 | 4.98 | 1 |
| Wuppertal | 2910 | 4.98 | 1 |
| Ludwigshafen am Rhein | 1645 | 4.86 | 1 |
| Haar | 65 | 4.62 | 1 |
| Pfinztal | 131 | 4.58 | 1 |
| Garching bei München (Munich) | 7607 | 4.57 | 1 |
| Heidelberg | 32,163 | 4.56 | 1 |
| Radolfzell am Bodensee | 659 | 4.55 | 1 |
| Mülheim an der Ruhr | 1010 | 4.46 | 1 |
| Potsdam | 12,995 | 4.37 | 1 |
| Landstuhl | 69 | 4.35 | 1 |
| Düsseldorf | 10,361 | 4.27 | 1 |
| Munich | 46,917 | 4.24 | 1 |
| Mainz | 14,234 | 4.24 | 1 |
| Penzberg | 546 | 4.21 | 1 |
| Herne | 313 | 4.15 | 1 |
| Homburg | 2204 | 4.13 | 1 |
| Hamburg | 24,375 | 4.12 | 1 |
| Sülfeld | 561 | 4.1 | 1 |
| Bonn | 20,694 | 4.1 | 1 |
| Ulm | 8797 | 3.98 | 1 |
| Lübeck | 2911 | 3.95 | 1 |
| Würzburg | 10,910 | 3.92 | 1 |
| Frankfurt am Main | 16,703 | 3.91 | 1 |
| Bremerhaven | 2543 | 3.89 | 1 |
| Kiel | 12,729 | 3.87 | 1 |
| Schwerin | 78 | 3.85 | 0 |
| Leipzig | 16,870 | 3.82 | 0 |
| Vallendar | 395 | 3.8 | 0 |
| Wiesbaden | 662 | 3.78 | 0 |
| Frankfurt an der Oder | 716 | 3.77 | 0 |
| Essen | 10,544 | 3.76 | 0 |
| Nuthetal | 887 | 3.72 | 0 |
| Tübingen | 16,019 | 3.69 | 0 |
| Aachen | 17,543 | 3.67 | 0 |
| Münster | 13,571 | 3.66 | 0 |
| Friedrichshafen | 191 | 3.66 | 0 |
| Freiburg in Breisgau | 16,765 | 3.61 | 0 |
| Erlangen | 15,601 | 3.59 | 0 |
| Mannheim | 5732 | 3.58 | 0 |
| Jülich | 7402 | 3.51 | 0 |
| Dresden | 20,692 | 3.5 | 0 |
| Russelsheim | 58 | 3.45 | 0 |
| Karlsruhe | 14,122 | 3.44 | 0 |
| Berlin | 56,898 | 3.38 | 0 |
| Cologne | 18,231 | 3.36 | 0 |
| Dortmund | 5906 | 3.32 | 0 |
| Göttingen | 17,423 | 3.3 | 0 |
| Cottbus | 1154 | 3.21 | 0 |
| Plön | 187 | 3.21 | 0 |
| Marburg | 7227 | 3.21 | 0 |
| Bad Saarow | 94 | 3.19 | 0 |
| Stuttgart | 14,171 | 3.11 | 0 |
| Gießen | 8941 | 3.06 | 0 |
| Regensburg | 7938 | 3.05 | 0 |
| Paderborn | 1974 | 3.04 | 0 |
| Hildesheim | 66 | 3.03 | 0 |
| Jena | 13,872 | 3.03 | 0 |
| Greifswald | 6116 | 3.01 | 0 |
| Ladenburg | 137 | 2.92 | 0 |
| Geesthacht | 1663 | 2.89 | 0 |
| Erfurt | 450 | 2.89 | 0 |
| Oldenburg (Oldb) | 3058 | 2.84 | 0 |
| Hannover | 17,583 | 2.84 | 0 |
| Saarbrucken | 6035 | 2.82 | 0 |
| Bamberg | 768 | 2.73 | 0 |
| Nuremberg | 184 | 2.72 | 0 |
| Darmstadt | 8909 | 2.64 | 0 |
| Konstanz | 3910 | 2.63 | 0 |
| Augsburg | 2214 | 2.57 | 0 |
| Tautenburg | 117 | 2.56 | 0 |
| Langen in Hessen | 472 | 2.54 | 0 |
| Halle at Saale | 7351 | 2.53 | 0 |
| Kaiserslautern | 3041 | 2.47 | 0 |
| Braunschweig | 6798 | 2.47 | 0 |
| Reutlingen | 167 | 2.4 | 0 |
| Bergisch Gladbach | 85 | 2.35 | 0 |
| Neuss | 171 | 2.34 | 0 |
| Rostock | 7117 | 2.25 | 0 |
| Chemnitz | 2401 | 2.25 | 0 |
| Bochum | 11,010 | 2.23 | 0 |
| Bayreuth | 4308 | 2.21 | 0 |
| Bremen | 8724 | 2.2 | 0 |
| Flensburg | 92 | 2.17 | 0 |
| Ilmenau | 1528 | 2.16 | 0 |
| Kassel | 2239 | 2.14 | 0 |
| Passau | 377 | 2.12 | 0 |
| Trier | 1321 | 2.12 | 0 |
| Magdeburg | 5787 | 2.11 | 0 |
| Koblenz | 1093 | 2.1 | 0 |
| Bielefeld | 4885 | 2.09 | 0 |
| Osnabrück | 2005 | 2.09 | 0 |
| Walldorf | 96 | 2.08 | 0 |
| Erkner | 52 | 1.92 | 0 |
| Fulda | 158 | 1.9 | 0 |
| Neubiberg | 950 | 1.89 | 0 |
| Witten | 2238 | 1.83 | 0 |
| Freising | 331 | 1.81 | 0 |
| Krefeld | 278 | 1.8 | 0 |
| Wolfsburg | 112 | 1.79 | 0 |
| Sankt Augustin | 291 | 1.72 | 0 |
| Emmerthal | 117 | 1.71 | 0 |
| Ainring | 61 | 1.64 | 0 |
| Wismar | 65 | 1.54 | 0 |
| Hagen | 430 | 1.4 | 0 |
| Clausthal-Zellerfeld | 1183 | 1.35 | 0 |
| St. Ingbert | 224 | 1.34 | 0 |
| Oberhausen | 80 | 1.25 | 0 |
| Ingolstadt | 82 | 1.22 | 0 |
| Freiberg | 2217 | 0.9 | 0 |
| Eggenstein-Leopoldshafen | 443 | 0.9 | 0 |
| Großbeeren | 227 | 0.88 | 0 |
| Furtwangen in Schwarzwald | 156 | 0.64 | 0 |
| Bad Oeynhausen | 163 | 0.61 | 0 |
| Dummerstorf | 577 | 0.52 | 0 |
| Eichstätt | 80 | 0 | 0 |
| Hilden | 57 | 0 | 0 |
| Kühlungsborn | 167 | 0 | 0 |
CSS: Characteristic Scores and Scales.
Only cities are considered with at least 50 papers. The CSS classes are defined as follows: 3 = outstanding, 2 = far above average, 1 = above average and 0 = below average.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship and/or publication of this article.
