Abstract
The implementation of infrastructure projects is one element that drives economic development. When considering the high costs and complexity of these projects, the importance of analysis is highlighted a little more than traditional value for money when such projects are carried out in a public-private partnership. This article examines the state of the art of research on evaluating public-private partnership performance considering evidence of value for money. For this purpose, a systematic literature review was conducted based on the scientific production available in the Scopus and Web of Science databases. The main contributions of this study consist of presenting (1) a set of papers on performance evaluation that considers VfM as a complement to its evaluation systems, comparatively or inclusively in its analyses; (2) a methodological framework; and (3) guidelines for future research. This study provides insights for developing empirical studies in a wide range of contexts or sectors.
Introduction
Public-private partnerships (PPPs) are defined as contracts between the private and public sectors to provide services to a particular community (Yescombe, 2007). In a document from the World Bank, the public-private partnership is defined as “a long-term contract between a private party and a government entity, to provide a public asset or service” (Bank, 2017, p. 5). This definition is based on the British model, established in 1992, whose central character is the private sector financing and building the asset, and selling the flow of services generated to the public sector, which has a direct or indirect obligation to acquire these services during the contract (Grout, 1997). Since PPPs have become popular around the globe, studying their performance from different perspectives is important, as their use tends to be more constant (Hodge & Greve, 2017). In the recent literature, Dolla and Laishram (2019) argued the need for effective procurement, since infrastructure projects contracted through partnerships should generate Value for Money (VfM), while at the same time focusing criticism of the PPP institution that results in higher costs.
Infrastructure projects involve installations necessary for society to function, such as transport, sanitation, and public lighting, in addition to schools, hospitals, and prisons, as well as other essential arrangements for the structure of a society (Yescombe, 2007). It is known, therefore, that the government is not always a specialist in all areas and agreement with private companies is a way of guaranteeing these services for society and, moreover, with quality, speed and at a lower cost. Despite the criticism that PPPs may not generate better results than traditional contracts (Verweij & van Meerkerk, 2021), it is through monitoring and evaluation methods in the various stages of a project that greater security of the proposed objectives is obtained. Assessing performance is a necessary condition to ensure the delivery of the objectives established in the PPP and guarantee VfM (Love et al., 2015). In this context, the following research question is asked: What is the state of the art of research on the evaluation of PPP performance using VfM?
To answer the research question, it was necessary to conduct a systematic literature review (SLR), based on the research protocols suggested by authors such as de Almeida Biolchini et al. (2007), Kitchenham (2004), and Tranfield et al. (2003). The SLR was conducted in the Scopus and Web of Science databases on issues related to performance evaluation of partnership projects between the government and the private sector, considering the cost-benefit – VfM of this undertaking.
VfM is a presumptive estimate of performance in the delivery of public sector works and services through private contracting, in which the higher the measure, the better the works and the lower the expenses (Hu et al., 2014; Peterson, 2019). To determine VfM, the Public Sector Comparator (PSC) is usually used to compare the cost of the proposed PPPs and the reference cost of a traditional acquisition (Grimsey & Lewis, 2005). Thus, according to the authors, VfM can be described as an ideal combination of cost and quality for the project's entire life that meets the user's needs. Therefore, seeking the best result in terms of cost-benefit makes sense, because if it does not occur, other forms of government takeover are preferable. Thus, performance evaluation methods must take this philosophy into account, since the PPP institution exists to achieve better results. In this context, this article assumes that PPPs incorporate principles of efficiency, speed and lower costs into the public sector, supported by the recommendations of Hodge and Greve (2007) regarding a way to gauge and provide advice in advance on partnership projects between government and companies.
This article contributes to the literature by presenting a typology emphasizing (1) a methodological framework, (2) the scope of PPPs and (3) VfM components. In this respect, authors like Grimsey and Lewis (2002, 2005) addressed relevant issues related to PPPs and VfM, as did Song et al. (2016), who influenced the field by highlighting trends for studies on PPP performance evaluation. It is hoped that this article can help researchers and PPPs with regard to the creation and application of evaluation methods, especially in the bidding phase for the selection of new partners and public policy makers in decision making to minimize future problems.
Methodological Procedures
This article presents the state of the art in relation to methods of evaluating the performance and cost-benefit of public-private partnerships. The SLR is a technique that allows the mapping and analysis of relevant research, enabling discussions based on those provided by traditional reviews (de Almeida Biolchini et al., 2007; Kitchenham, 2004; Tranfield et al., 2003).
The SLR protocol is shown in Figure 1. For a better analysis and replicability of this study, the SLR was divided into three stages, as suggested by Tranfield et al. (2003): (1) SLR planning, (2) SLR conduct and (3) SLR knowledge dissemination. Research protocol systematization.
Stage I: Planning the Systematic Literature Review
Strings and Number of Articles Resulting From Searches for Reviews.
aThe data exportation/collect occurred on July 19th 2020.
Table 1 presents 34 previously selected articles, six of which focus on the literature review. Among the 34 documents, the six that focus on reviews are those by Jayasuriya et al. (2019) and Zhang et al. (2020), developed from a specific set of periodicals identified a priori, focused on public-private partnerships in the context of construction. On the other hand, Ma et al. (2019) provided an overview of the state of the art of public-private partnerships between 2008 and 2018. Sanni and Hashim (2016) conducted a review of the practice of public-private partnerships procurement in Malaysia and the study of Ullah et al. (2016) was directed at aspects involving the concession of public-private partnerships. Liu et al. (2014) provided a narrative review of the literature on measuring the performance of public-private partnerships.
This study represents a significant contribution to the literature on Public-Private Partnerships, considering the important role of the PPP institution in providing public services to society with regard to risk sharing with the private sector, as well as taking advantage of the experience, effectiveness and resources that the private sector has to offer in its area of expertise.
Stage II: Conducting the Systematic Literature Review
Number of Articles Found in Scopus and Web of Science.
aThe data exportation/collection occurred on July 19th 2020.
As shown in Figure 1, after the selection of the articles, the textual corpus selection process was begun. The first was to exclude duplicate studies (17) from the 471 articles that were identified, leaving a total of 454 articles.
Regarding reputation, the articles were selected from journals in the first and second citation quartiles (Q1 and Q2) of the Scimago Journal & Country Ranking (SJR, 2019). Given the lower acceptance rates of journals ranked in the first quartiles, compared to journals ranked in the third and fourth quartiles, according to Gu and Blackmore (2019), these journals are more demanding, reflecting on the quality of the articles. Thus, of the 454 articles, 207 are Q1 and 157 are Q2.
In the remaining 364 articles, the presence of the string words in the title, abstract and keywords fields were then analyzed. Thus, owing to the functions of the OR and AND operators and adherence to the research theme, the final textual corpus of 117 articles was selected.
Stage III: Reporting and Dissemination
In the third and last stage, the analyses and discussions were presented in three stages: (1) descriptive analysis; (2) typology analysis; and (3) bibliographic coupling analysis. These analyses were performed with the support of RStudio 1.0.143, IRaMuTeQ 0.7 and Gephi software (Bastian et al., 2009; Perianes-Rodriguez et al., 2016; Souza et al., 2018; Team, 2021).
Analysis of the Results and Discussions
This section presents (i) a descriptive analysis of the textual corpus, (ii) typology, and (iii) a bibliographic coupling analysis with a focus on the research proposals.
Descriptive Analysis
Figure 2 shows the annual distribution of the textual corpus composed of a sample of 117 articles. The selected articles are part of the collection of a total of 65 journals and 263 authors and co-authors. The average number of citations per article was 25.95 in the twenty-year period under analysis (2000 to July 2020), the period in which PPPs grew around the world, given the laws and regulations that emerged in the 1990s and in the United Kingdom, where they were named Private Finance Initiatives (PFIs) (Grout, 1997). In the early 2000s, the 2003 African Treasury Regulation established rules for their conduct. In Brazil, Law 11.079/2004 instituted norms for contracting PPPs nationwide. In Chile, Law 20,410 of 2010 regulated Public-Private Partnerships. The period ended in July 2020, when the collection began, and it was not possible to collect data after the research had begun. Annual distribution of the textual corpus.
Between 2008 and 2018 a growing trend can be seen in the volume of articles published, which may indicate the growing interest of the international scientific community in the subject. From 2010 onwards, the variation in quantity constantly increased year by year. In this respect, considering that the inclusion of articles in the corpus is a reflection of the selection criteria defined a priori, it is noteworthy that in the years 2010, 2013, 2015 and 2018, the number of articles from journals with publication areas aligned with the research in question and classified in the first and second citation quartile of the SJR (2019) was higher. Among the areas of publication are: Development, Urban Studies, Energy Engineering and Power Technology, Health Policy, Medicine, Public Health, and Environmental and Occupational Health.
Top 10 Most Cited Articles.
Source: Research data.
Based on Table 3, Grimsey and Lewis (2002) analyzed the principles involved, with regard to fundraising by the government, in contractual actions with the private sector, taking as an example a project for installation of a wastewater treatment plant in Scotland. In similar terms, the same authors Grimsey and Lewis (2005) conducted a comparative study involving 20 countries, focusing on VfM in PPPs, as well as professional and academic perspectives of the subject. Yuan et al. (2009) pointed to a set of organizational objectives and key performance indicators (KPI) to assist in obtaining the VfM of PPPs. Xu et al. (2010) developed a fuzzy synthetic assessment model to measure the risk level of a given critical risk group (CRG) and the overall risk level associated with PPP projects in China. Shaoul (2005) analyzed the process, the financial methodology, its assumptions and the data used in the evaluation system of hospitals built with resources from the private financing initiative (PFI) in the UK. Song et al. (2016) discussed global PPP standards from the literature, while Ng et al. (2012) highlighted the critical success factors for the initial phases of PPP projects with regard to feasibility, according to the perception of the public sector, private consortia and the community in general in order to achieve a “triple win” scenario. Zhang (2005) suggested measures to remove barriers to PPP, with a focus on development and infrastructure. In the same perspective, Love et al. (2015) proposed a model for measuring the performance of PPPs, based on their life cycle, considering VfM for the public sector. Siemiatycki and Farooqi (2012) emphasized the importance of ex ante evaluations of PPPs, considering the political aspect and the main planning concerns regarding limitations on community consultation, as well as contractual restrictions, which can make it difficult to consolidate plans for the future.
Proposed Typology
For a robust analysis of the textual corpus, the Reinert Method, also known as Descending Hierarchical Classification (DHC), was used to present a typology when interpreting words grouped into classes.
A typology is a systematic grouping of words that, in addition to the frequency of occurrence, considers the lexical field. When considering the meaning of words, the interpretation of the classes that make up the typology enables a contextualization and debate of the themes in question and, in this case, contributes to the development of the literature on public-private partnerships (Loo, 2020).
The textual corpus composed of 117 abstracts was submitted for classification, comprising 684 text segments. Of these 684 text segments, 625 were classified, which is equivalent to 91.37%. It is noteworthy that the classification must be at least 70% of the text segments for the DHC to be considered statistically significant (Camargo & Justo, 2013).
Regarding categorization, a total of three classes were obtained (Figure 3), which were renamed from the content analysis: Class 1 includes 261 text segments (41.76%); Class 2 contains 149 text segments (23.84%); and Class 3 includes 215 text segments (34.40%). Classes of the typology.
The discussion of each typology class is based on the occurrence of words whose chi-square value is greater than 3.80 (χ2 > 3.80), or alternatively, where the probability value is less than 5% (p-value < .05), eliminating those words that do not meet this statistical criterion. We emphasize that a p-value < .05 portrays a significance level in terms of the association between words and classes (Reinert, 1990). Words included in Class 1, Class 2, and Class 3 are shown in Figure 3, as well as, an author associated with each word.
Class 1 presents the methodological procedures considered significant in terms of chi-square (χ2) and p-value statistical metrics. However, in order to provide a complete methodological framework, we conducted a more detailed survey with regard to data collection and analysis techniques (Figure 4). The studies were categorized according to the nature of the research: (1) quantitative research, which represents 41.03% of the textual corpus; and, (2) qualitative research, which represents 58.97% of the textual corpus. In addition, the first authors of each study were considered based on the frequency of occurrences in which the methods were used in the articles. It is also noteworthy that some studies used more than one method of collection and analysis in the same research to achieve better results. Methods used in the corpus articles.
Qualitative approaches were predominant. Of the qualitative data analysis methods, the recurrent ones were descriptive analysis, prescriptive analysis, VfM analysis, comparative analysis and exploratory data analysis. Despite the variety of methods used, no evidence was found of studies of a bibliographic nature that focused on the same topic discussed in the scope of this work and that used metrics of citation, co-citation networks or bibliographic coupling networks together, or even, that used both the Scopus and Web of Science databases as sources of data. With regard to quantitative studies, the recurrent methods of analysis were structural equation modeling and exploratory factor analysis, in addition to the relational method of regression analysis.
Regarding the data collection methods used in qualitative research, the most frequently used metrics were document analysis and interviews. On the other hand, in the case of quantitative collection methods, structured questionnaires through survey research were predominant.
The words that comprise Class 2 (Figure 3) address terms regarding the elements of the PPP contract. In this class, we highlight the work of Javed et al. (2013), in which the authors reinforced the importance of establishing robust specifications for output requirements, particularly in the initial acquisition phase. Moeover, Willems et al. (2017) showed that it is necessary to recognize the diversity of projects and develop a common vocabulary, and contracting authorities must have a clear vision of their objectives before beginning the competition.
Meanwhile, Love et al. (2010) developed a conceptual model represented by the public sector based on trust between alliances in which the use of incentive payments through risk/reward models provide the impetus for better project performance. To these authors, the growing search for a better cost-benefit ratio for infrastructure projects raises the need to identify Success Factors (SF) for an alliance with competitive prices.
In the Class 3 (Figure 3), the words portray the VfM components. The Value for Money measurement can estimate the ideal cost allocation for the project life cycle, comparing it with another contracting modality, that is, the PSC. In this class, Hu and Han (2018) discussed a quantitative VfM assessment of PPP mode road projects in China using the PSC. With the basis of the construction cost and the operating cost, as well as the toll revenues and the revenues from the services and advertising area, the authors identified that a higher discount rate is profitable for the private sector, whereas, in contrast, a lower discount rate is favorable for the government.
This class also highlights the work developed by Aldrete et al. (2012), which provided details on the incorporation of risk factors in the VfM analysis, with this analysis as the central point for decision-making by managers. According to the aforementioned authors, the use of objective measures of costs and quantification of risks was recommended to maximize VfM, while they recommended more methodological approaches through an assessment framework from the VfM perspective.
Bibliographic Coupling Analysis
The bibliographic coupling analysis (BCA), presented in Figure 5, measures the similarity between pairs of documents from shared references (Kessler, 1963) and, according to Grácio (2016), makes it possible to study the development of research lines, as it allows the identification of the most important research centers, researchers and articles in a scientific domain. Coupled documents network.
Figure 5 shows an excerpt of the textual corpus from the period between 2016 and 2020, with each article (node) represented by the main author. The average clustering coefficient is 0.607 and the modularity index of 0.194 suggests that the network has a modular structure (Newman, 2006) and, in this case, it can be analyzed from three clusters.
Using the betweenness centrality, it is possible to highlight some authors in the network. This index measures the mediation intensity in order to enable the connection between pairs of nodes, establishing the network information flow. According to Zhang and Luo (2017), as a node represents the only way to connect with other pairs of nodes, this node possibly has high betweenness centrality.
Details of the Coupled Documents.
Source: Research data.
Table 4 shows that Song et al. (2016) occupy a prominent place in the network, which can be explained by considering that their research provides theoretical support for PPPs.
In Cluster 1, we also highlight the research by Liu, Love, Smith, Irani et al. (2018), Liu, Love, Smith, Sing et al. (2018) and Swanson and Sakhrani (2020), who, based on the developed PPP performance measurement system, suggested greater efforts to adjust payment mechanisms through an explicit function to enable the use of the system in practice. According to the authors, this adjustment to the model structure, combined with the monitoring of service delivery, increases the chances of PPP contracts to provide long-term value for interested parties.
Liu et al. (2016) recommended validating the proposed theoretical framework, which encompasses the real life cycle VfM assessment, stakeholder-oriented performance measures and a systematic review mechanism of operational key performance indicators (KPI). Budayan et al. (2020) suggested studies focusing on the validation of specific KPIs for build-operate-transfer projects.
In general, in Cluster 1, suggestions for future research focus on validating the developed structure or models, inserting variables and developing specific indices. The content of the articles reflects the importance of the review that was conducted, as they demonstrate performance appraisals and how they can aid the VfM assessment. For Hossain et al. (2019), the PSC analysis, on which the VfM assessment depends, is not frequently conducted in Bangladesh. The authors' contribution in the context of the study was to identify a prioritized set of performance indicators and key performance areas (KPAs) for Bangladeshi PPPs by applying the AHP method, establishing weights for indicators identified by a review of the related literature.
This work is a constant example in Cluster 1, developing a method that helps the decision process without disregarding VfM. The results show that “funding” is the most significant KPA with regard to measuring PPP performance, followed in order of importance by “planning and initiation,” and “transparency and accountability” (Hossain et al., 2019).
In the same perspective, the study of Liu, Love, Smith, Sing, et al. (2018) reported that there are very few ways of evaluating the performance of PPPs, which makes evaluations difficult, especially in the initial phases, and restricts the verification of VfM. The authors proposed a Performance Prism that can be applied throughout the lifecycle to fill this gap. This prism extends the assessment of VfM with qualitative and quantitative approaches. This mechanism for reviewing performance measurement systems throughout the lifecycle helps to ensure project success.
In Cluster 2, the studies suggest a more in-depth search for greater accuracy of models, practical applications and validity tests in different types of PPP, distinct regions and different sample sizes (Chou & Leatemia, 2016; Kavishe et al., 2019; Simon et al., 2020; Wang et al., 2019).
Regarding the use of VfM to choose between PPPs and traditional acquisitions of infrastructure projects, it is recommended that the PPP format be chosen when it provides the best cost-benefit ratio. Therefore, it is important to choose indicators that go beyond the mathematical and financial definition of present value, according to Cui et al. (2019), who studied the impacts of VfM drivers and developed structural equation model that indicated that cost and effectiveness drivers contribute to the project's sustainability and performance and the better the cooperation environment, the better the participants' capacities. In Cluster 3, the research recommends a wider range of variables and the development of new tools (Burke & Demirag, 2019; Kweun et al., 2018; Makovšek & Moszoro, 2018; Opara et al., 2017; Santandrea et al., 2016). This review contributes by finding works that analyze risk factors for PPP assessment, for example, despite the controversy over the discount rate to be used for financial estimation between the PSC and PPP. Zwalf et al. (2017), from Cluster 3, claimed that PSC-based valuations must guarantee a discount rate that reflects the real VfM. Hence, an adequate discount rate, especially concerning risk inclusion, is an important issue to consider so that distortions do not occur.
Final Considerations
This article aimed to investigate the state of the art of research on the evaluation of PPP performance using VfM. To achieve the proposed objective, an SLR was performed based on the research protocol developed by Tranfield et al. (2003).
The state of the art was analyzed based on a set of 117 articles from QI and Q2 SJR (2019), available in the Scopus and Web of Science databases. The results showed the behavior of the annual distribution of publications and revealed the growing interest of the academic community in the topic in question, with this interest being more focused on infrastructure projects (construction and highways) and less focused on the area of health and the environment. Among the most cited articles are “Evaluating the risks of public private partnerships for infrastructure projects” (Grimsey & Lewis, 2002) and “Are Public Private Partnerships value for money?: Evaluating alternative approaches and comparing academic and practitioner views” (Grimsey & Lewis, 2005). Both works addressed the evaluation of PPPs with VfM integration. In particular, the latter article highlighted in the title the question of whether Public-Private Partnerships really generate VfM, suggesting measurement alternatives from both academic and practical viewpoints.
In relation to the typology, three classes showed the methodological procedures, elements of the PPP contract and the VfM components. Among the methods, those that seek to model PPP performance assessment instruments through structural equations of exploratory or confirmatory analysis were evident. This purpose, of creating an assessment instrument or applying an assessment tool, was evident in the articles that were analyzed. It is recommended that the VfM approach encompass criteria that address different stakeholders and thus improve the analysis of PPP performance compared with other forms of contracting.
This article can help researchers in future empirical studies, both quantitative and qualitative, through theoretical support for the formulation of research problems, especially when considering the set of possible trends provided in the coupling analysis. In addition, we suggest the development of modeling studies of the multicriteria evaluation methods of Public-Private Partnerships, considering the life cycle phases, especially in the bidding phase, to evaluate private proposals, thereby optimizing long-term results through more assertive choices. Another suggestion is the possibility of bidding proposals using indicators that represent the different stakeholders involved in the public-private partnership.
Although this work presents a systematic and exhaustive review of the literature, there are some limitations in relation to our findings, which can be considered in future research: (1) the defined search strings limited the results to articles published in English; (2) only journals placed in the first and second citation quartiles according to the SJR (2019) were included. Therefore, it cannot be ignored that relevant research has been published in other formats, such as books and conference proceedings, or in other languages.
Footnotes
Acknowledgments
The authors wish to express their gratitude to the editor Richard G. Little and reviewers for their constructive input and kind feedback.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Author Biographies
.
.
.
.
