Abstract
Governments have struggled with addressing problems that cross agency boundaries. Since 2012, the New Zealand Government has achieved significant success by holding groups of agencies collectively responsible for achieving intermediate outcome targets (the ‘Results Programme’). The Results Programme has been described as the most important change in how public services are delivered in New Zealand in 20 years. This article uses a mixed methods approach to triangulate 10 features of the Results Programme that appear to contribute to its success. Collaboration literature typically focuses on reducing barriers, often expressed in terms of transaction costs; in contrast, the successes of the Results Programme are explained here as methods for engineering a sense of joint goal commitment, that provides the sustained impetus to succeed despite the barriers encountered.
Introduction
Supporting interagency collaboration is a major area of interest for governance literature and practice (Carey and Crammond, 2015). Regardless of how government is divided, some problems will inevitably span the boundaries between agencies (Scott and Boyd, 2016). Addressing such problems has been described as both the ‘holy grail’ (Peters, 1998) and the ‘philosophers stone’ (Jennings and Krane, 1994) of public administration. The barriers to interagency collaboration are generally believed to be high (Altshuler, 2003; Glendinning, 2003; Hudson et al., 1999), and these barriers are frequently expressed in terms of transaction costs (Scott and Bardach, 2019; Thomson and Perry, 2006, Warner and Hefetz, 2008). Efforts to improve interagency collaboration have focussed on reducing transaction costs to make collaborative work easier and to progress more quickly – how to ‘lower the barriers’ to collaboration (Edmondson and Roloff, 2009). This article takes a different approach, and asks whether goal commitment might be used to clear those barriers, or else to smash through them.
From 2012 until 2017, the New Zealand government operated an interagency collaboration programme called the Better Public Services Results Programme (the ‘Results Programme’). The Results Programme combined target setting, public reporting, shared responsibility, and various collaborative governance and management practices. It achieved process success, in that it was held up by public servants as an example of successful collaborative behaviours (Scott and Boyd, 2017). More importantly, it achieved programmatic success (Marsh and McConnell, 2010), in that the Results Programme improved outcomes in all 10 problems to which it was applied (State Services Commission, 2017).
This article considers why the Results Programme was successful, when other interagency collaboration programmes in New Zealand had often failed (Scott and Boyd, 2016). This article uses a mixed methods analysis to triangulate findings from four sources. The analysis identifies features that we interpret as contributing to the success of the Results Programme. These features seem to contribute to a shared sense of commitment felt by participating agencies to achieving the desired goal.
This article includes five parts following this introduction. First, there is a description of the New Zealand context, the Results Programme, and why this should be of scholarly interest. Second, the literature on interagency collaboration and goal commitment are reviewed and summarised. Thirdly, the methodology is described. Fourth, findings are presented. Finally, there is a discussion of how these findings contribute to goal commitment.
The New Zealand context
New Zealand is a parliamentary democracy modelled after the Westminster style of government, with the public sector divided into a large number of single-purpose agencies. Ministers enjoy a high level of autonomy over policy within their warrant and are incentivised to defend their own policy priorities rather than the collective good (Gregory, 2006). New Zealand is regarded as the purest example of ‘New Public Management’ (Pollitt and Bouckaert, 2011) with a focus on single-point accountability (Gorringe, 1994; Gregory, 2006), and this may make collaboration even more difficult (Boston et al., 1996). Public servants have become accustomed to this system, and react to calls to work together by trying to read implicit signals about who will really be held accountable for the success of failure of the nominally joint activity (Schick, 2001; Scott, 2001). While barriers to interagency collaboration have been reported from many jurisdictions (Carey and Crammond, 2015), New Zealand’s New Public Management means these barriers are likely to be raised further.
The results programme
After several failed attempts at encouraging collaboration (Scott and Boyd, 2016), in 2012 the New Zealand Government tried a new approach, which combined lessons from literature on collaboration (Carey and Crammond, 2015), prioritisation (Kohli, 2010), and target setting (Bevan and Hood, 2006). This was described in popular media as the most significant change to how government services are delivered in New Zealand in 20 years (Shelton, 2013). The government chose 10 important and long-standing problems where progress required collaboration between multiple agencies, and set a challenging five-year target for each (listed in Table 1). The targets were declared publicly, with a pledge to report on progress every six months. Small groups of chief executives were collectively answerable to Ministers for progress towards each target. The Results Programme was implemented in a context of significant fiscal constraint, with most participating agencies operating under flat or declining nominal baseline budgets. The Results Programme had no dedicated budget and agencies were expected to make better use of resources already allocated to addressing these 10 problems.
BPS results and targets.
After four years, all 10 outcomes were measurably improved both in absolute terms and relative to trend data where available (see State Services Commission, 2017). We consider this evidence of programmatic success (Marsh and McConnell, 2010), but should we? McConnell (2010: 354) describes a programmatic success as the ‘achievement of desired outcomes’. At the time that the analysis presented in this article was conducted (early 2017), the programme was not complete. Later in 2017, there was a general election resulting in a change of government. The programme was discontinued by the incoming government and no final report was issued, although a newspaper article at the time indicated that ‘about two thirds of the targets were met while others fell short’ (New Zealand Herald, 2018). If the ‘desired outcomes’ of the programme were the targets, then the programme was a partial success. If the ‘desired outcomes’ were to improve performance, above baselines and trends, in 10 stubborn crosscutting problems, then the programme was a success in all 10. The authors take the latter interpretation and assume that the targets are a design feature intended to drive performance improvements. We take certain events as suggestive that the government saw the programme this way as well. In 2014, it became apparent that three of the targets would be achieved earlier than anticipated. If the purpose was only to achieve the targets, the government could have responded by claiming victory. Instead the government responded by resetting those three targets to be more difficult (State Services Commission, 2014), suggesting that the true purpose of the programme was to use targets drive performance improvements.
In discussions with public servants involved with the Results Programme, one theme was repeated – progress was never easy (Orange, 2016; Scott and Boyd, 2017a). It involved continued effort to overcome the administrative barriers of a system designed around single-point accountability, siloed working and patch protection. While the Results Programme was a (process and programme) success, it was a hard-won success borne of persistence and dedication.
Interagency collaboration and goal commitment
Interagency collaboration
Working across agency boundaries has become a core part of contemporary arrangement for public service provision (Esteve et al., 2015; O’Leary and Vij, 2012). Despite extensive literature (Carey and Crammond, 2015; Page, 2003), working across agency boundaries remains a fraught and costly endeavour (Bryson et al., 2015; Doberstein, 2016; Minson and Mueller, 2012; Prebble, 2015; Scott, 2015).
‘Coordination’ or ‘collaboration’?
While some authors use the term ‘collaboration’ in a general sense to describe a range of practices for interagency working (see for example, Axelsson and Axelsson, 2006), other authors use collaboration to refer to one category of practice among others like ‘cooperation’ and ‘coordination’ (Eppel et al., 2008; Gadja, 2004; Gregson et al., 1992; Howarth and Morrison, 2007; Huxham and Macdonald, 1992; Keast et al., 2007; Marrett, 1971; Sadoff and Grey, 2005; Thomson and Perry, 2006). Several of the earlier references in this article describe practices that they call coordination (Jennings and Krane, 1994; Peters, 1998), though it would be inaccurate to suggest that clear boundaries exist (Thomson and Perry, 2006). There is a general sense in the literature that, when compared to cooperation or coordination, collaboration involves greater ‘depth of interaction, integration, commitment, and complexity’ (Thomson and Perry, 2006: 23), but unfortunately these terms are not used precisely or consistently. Gray (1989) suggests that collaboration is distinguished in part because it involves integration over a longer period, with cooperation and coordination being perhaps early developmental stages. Keast et al. (2007) suggest that collaboration is distinguished by the actions that are performed together; collaboration involves shared working, rather than merely shared information or shared planning. Eppel et al. (2008) distinguish collaboration by its responsibilities; collaboration involves shared responsibility for a common goal, with other forms of interagency working involving cooperating or coordinating in areas of common interest and overlap while maintaining separate responsibilities. It is this latter distinction that is most relevant for this article, and we define interagency collaboration as the practice whereby administrative divisions of government take shared responsibility for achieving a common goal. The methods employed in achieving this outcome might include sharing information, jointly developing plans and strategies to better align activities (sometimes called coordination) or working together to jointly develop and deliver solutions (sometimes called collaboration), or a combination.
A related term, popularised by the UK’s Blair government, is ‘joined-up government’, which is also used inconsistently and ambiguously in the literature (Carey and Crammond, 2015), and refers to a range of practices (Ling, 2002). Ling (2002) presents examples of ‘joined-up government’ from different jurisdictions, includes practices characterised as both ‘coordination’ (p. 628) and ‘collaboration’ (p. 620, 621), and suggests collaboration as a means to achieve coordination (p. 629).
The field of interagency collaboration is sometimes included within broader collaborative literature alongside ‘cross-sectoral collaboration’ (Bryson et al., 2015). In cross-sectoral collaboration, the public sector is one actor among others. While some of the practices in interagency collaboration may be similar to those in cross-sectoral collaboration (such as the use of governance structures and processes), there may also be important differences. Interagency collaboration is frequently imposed from above (by politicians), whereas cross-sectoral collaboration more regularly involves the recognition by participants of overlapping or aligned interests or capabilities (Provan and Kenis, 2008), and the recognition of collaborative opportunity (Bryson et al., 2015).
Context and generalisability
Carey and Crammond (2015) provide a recent synthesis of empirical evidence specific to collaboration in an interagency context, and identified the following as features found to promote effective collaboration: political support, leadership at all levels, a combination of top-down and bottom-up approaches, decentralised control, and designing for context. While some ‘instruments’ (management structures and processes) were seen as useful in some contexts, there was little commonality across multiple studies. It is therefore with some trepidation that we consider which contextual or contingent factors (Donaldson, 2001) may explain the findings of this New Zealand case.
One defining characteristic of the Results Programme was that the problems selected were those that had been resistant to past attempts at remedy. They were stubborn problems (Scott and Boyd, 2017), and nothing about their context in 2011 suggested that they represented a particular ‘collaborative opportunity’ (Bryson et al., 2015). Nonetheless, they were selected as priorities by the Government of the day. The global financial crisis meant that there were no additional resources available. The New Zealand context was one of individual accountability without a strong history of collaborating successfully. For these reasons, we consider this a case study not of how to make interagency collaboration easy, but how to succeed when it is hard.
Costs and barriers
Interagency collaboration is hard for many reasons. Rather than the efficiency of a hierarchical chain of command, interagency collaboration is an ongoing negotiation (Thomson and Perry, 2006) with significant coordination costs (Van Huyck et al., 1990). Decisions taken by an individual may not be supported by others, and so decision-making often occurs by consensus (Ren et al., 2005), which can be very slow. Goals may be imperfectly aligned, and agencies may act opportunistically to pursue their own goals (Laan et al., 2011). Working together may mean more optimal use of resources for production, but the transaction costs tend to be much higher (Krueger and McGuire, 2005).
Goal commitment
A commonly considered construct for explaining task performance is ‘goal commitment’ (Klein et al., 2013), based on a body of psychology literature known as ‘goal setting theory’ (Locke and Latham, 1990). Goal commitment is defined by Klein et al. (2013: 67) as ‘a volitional psychological bond reflecting dedication to, and responsibility for, a particular target’.
Goal commitment is well established as a contributor to performance in management literature from the private sector (for example, Erez and Judge, 2001; Schweitzer et al., 2004), with a small but growing consideration in public administration literature (for example, Bronkhorst et al., 2015; Favero and Bullock, 2015; Jung and Ritz, 2014; Latham et al., 2008; Pedersen, 2016; Rutherford and Meier, 2015). While important caveats and limitations to the measurement of goal commitment remain, there is general agreement on the positive relationship between goal commitment and performance (Klein et al., 1999). This is explained by the finding that, compared to less goal-committed individuals, highly goal-committed individual exerts greater effort and persistence towards goal attainment (Seijts and Latham, 2000); and are more likely to develop task-related strategies (Earley et al., 1992). Studies on the antecedents of goal commitment identify these as the expectancy of goal attainment and the attractiveness of goal attainment (Hollenbeck and Klein, 1987; Klein and Wright, 1994). Individuals were more highly goal-committed when the goal was perceived as difficult, but with a sharp drop in commitment if the goal was perceived as impossible. Expectancy was higher in cases of high self-esteem, and in the presence of social comparison where others had been successful. The attractiveness of goal attainment was in turn influenced by the anticipated reward structure, when there was a credible expectation of consequences (rewards and sanctions) associated with goal achievement. In the public sector, Wright (2007) linked the attractiveness of goal attainment to concepts of mission valence (the public servants’ perception of the social contribution of the goal) and public service motivation (the desire to serve the public interest).
Goal commitment literature is primarily concerned with individual performance. In those cases where team performance is considered (Aube and Rousseau, 2005; Hoegl and Parboteeah, 2006), these are teams within the same organisation, rather than spanning multiple organisations or agencies. Interagency collaboration has historically been considered unsuitable for goal setting because of ‘the problem of many hands’ (Thompson, 1980), or the inability to disentangle the contributions made by individuals to the observed collaborative outcomes. Several recent cases, such as Virginia Performs, GMAP Washington, Scotland Performs, Victoria Ahead (see Kohli, 2010), and now New Zealand’s Results Programme, suggest that interagency goals may be more useful than previously considered.
Methodology
The Results Programme has been evaluated separately using four evaluation methods, each described below and available online. The combination of these evaluations through a mixed methods approach is likely to result in more adequate explanations than any one method alone (Creswell, 2013). This section describes the use of triangulation and complementarity to analyse the results of these evaluations, and limitations of this approach.
Evaluation methods
The first evaluation was published in a peer-reviewed journal (Scott and Bardach, 2019), and featured a comparative study (Yin, 1981). The relative progress towards each of the 10 goals was attributed to differences between them (for example, target type, relationship history, and governance settings). This method had limitations that may reduce both the reliability and completeness of the findings: the method relied on source documents written by public servants involved in the Results Programme, and their decisions to include or exclude material represent unavoidable selection bias (Heckman, 1990); the methodology only analyses features that differ between the 10 goals, and cannot draw findings on the importance of the many areas in which work towards the 10 goals was alike; and the analysis assumes that success is equivalent to the degree of progress towards the goal, which does not allow for the possibility that the goals may differ in their degree of challenge.
The second evaluation (Scott and Boyd, 2016a) compared programme features against process success, drawn from the normative ‘collective impact’ framework (Kania and Kramer, 2011; Preskill et al., 2014). The use of process measures reduced confounding errors associated with the relative difficulty of individual goals, but is limited by the assumption that the process measures in the Collective Impact framework were valid. The Collective Impact framework was originally designed for use in cross-sector collaboration between organisations in the government, corporate, and community sectors but was adapted for government use with help from the original authors (Hallie Preskill, 26 August 2015, personal communication; Mark Kramer, 25 August 2015, personal communication).
The third evaluation (Scott and Boyd, 2016b) used a multiple case study method to explore individual success stories (George and Bennett, 2005). Public servants involved in the Results Programme identified 89 cases that they believed best illustrated the success of the programme, and described these in short vignettes of 200–800 words (State Services Commission, 2016). The evaluation analysed these case vignettes through exploratory coding (Manning and Cullum-Swan, 1994) and identified a number of recurring themes about government practices that were shown to make a difference. The main strength of this method is that it performs analysis at a more granular level (with 89 rather than 10 elements). However, this method introduces a new type of selection bias, as public servants submit case vignettes that conform to their perspective on success and include only elements that they believe are pertinent to the story.
The fourth evaluation (Scott and Boyd, 2016c) explored behavioural change (Simons, 2013). A literature search was conducted of government documents and media reporting, and analysed thematically (Fereday and Muir-Cochrane, 2006), looking for descriptions of how behaviours had changed in response to the Results Programme and specifically the introduction of performance targets. This provided an additional point of comparison, of behaviours before and during the programme.
Triangulation and complementarity
This paper uses a mixed methods approach to improving the reliability of the above evaluations and the depth of substantive understanding of the phenomena being studied (Greene, 2007). Mixed method research combines qualitative and quantitative data, methods, and/or paradigms from related studies (Teddlie and Tashakkori, 2009). This article uses mixed methods for two purposes: triangulation and complementarity. Triangulation seeks to corroborate findings from different methods to overcome the limitations of each of the preceding evaluations, in order to improve the validity of inferences about constructs and the strength of the warrant for inquiry results (Campbell and Fiske, 1959). Complementarity seeks to elaborate or illustrate findings from one method using findings from another method, or to develop a more comprehensive understanding of the construct(s) being studied (Rossman and Wilson, 1985). This article begins by using triangulation to identify the best-supported findings from the previous evaluations, and then uses complementarity to explain and elaborate on each of the best-supported findings, explored further below.
The four evaluations are good candidates for mixed methods study because they represent reasonably diverse methods for understanding the same phenomenon from different perspectives (Greene, 2007). The four evaluations use a mix of developmental, formative and summative evaluation (Patton, 2011), normative and positive reference (Jensen, 1983), and qualitative and quantitative data (Teddlie and Tashakkori, 2009), as shown in Table 2. It is difficult to objectively weight the relative status or value of each evaluation, hence they have been weighted equally for the purpose of analysis.
Summary of evaluation methods.
Triangulation
Each of the four evaluations describes aspects of the Results Programme that may have contributed to its success. These findings were grouped into three categories: strongly supportive, weakly supportive and opposing. The selection of these three categories was largely opportunistic, as two of the four evaluations were already structured in this manner. In the case of the other two evaluations, findings were assessed as strong if they were described in the evaluation reports as supported by strong evidence, or evidence that suggested a strong effect size. Findings were assessed as weakly supportive if they had mixed evidence (supporting/opposing) with the majority supportive, if there was weak supportive evidence only, if the evidence came from only a small number of cases, or if the evidence suggested a small effect size. As the findings of each evaluation were mostly qualitative, there was some subjectivity in the assessments. The research designs of the original evaluations were such that only two evaluations reported findings that were associated with less success (described as ‘opposing’).
The findings from the four evaluations were then compared, to look for features that appeared in more than one evaluation method. Findings that were consistently supported or opposed by more than one method are more reliable than those identified by one method only (Greene, 2007). Some features are supported by one method and opposed by another, or described by only one evaluation method; these cannot be corroborated using the research design, and are excluded from the findings. The results of the triangulation method are shown in ‘Findings’ section of this article.
Complementarity
Each of the findings that were corroborated using the triangulation method above were then analysed further to provide descriptive accounts of these findings. Each evaluation report was re-read to find any descriptive information relating to the corroborated findings, and a summary of this analysis forms ‘Actor selection’ to ‘Goal monitoring’ sections of this article.
Limitations
This study is limited in completeness, external effects and the extent to which findings can be generalised or applied in other contexts, as discussed below. Triangulation and complementarity are used to compare multiple descriptions of the same phenomena (Creswell, 2013). Therefore it is only useful when that phenomenon is described in multiple evaluations, and any features identified in only one evaluation method are ignored. Some of these features identified in single evaluations may represent real success factors in the Results Programme that may be important in trying to replicate the approach elsewhere. Triangulation is therefore likely to be more reliable than the individual evaluations and produce less spurious associations, but the findings that can be corroborated from multiple evaluations are likely to be less complete and omit several real associations.
While the four evaluations are diverse in their methods, they focus their study on the Results Programme itself and ignore broader questions of performance in the New Zealand public service. The Results Programme has been successful in improving performance in all 10 cases, but it is unknown whether this improved the performance of the New Zealand public service as a whole. None of the contributing evaluations are able to assess any changes in performance in areas outside the Results Programme. It is plausible that the Results Programme created focus on a small number of problems and improved performance in those areas, but at the expense of performance in other areas.
Each evaluation uses publicly available data, and may therefore miss other important sources of information. In a research synthesis, Carey and Crammond (2015) describe the importance of informal, and often subversive, managerial craft (as described in Bardach, 1998). Such elements are typically important in New Zealand (Lips et al., 2011), but are unlikely to be recorded in the source documents used in this article (Scott and Bardach, 2019). This is one limitation to the methods used and an opportunity for further research.
This article relies on secondary data, in the form of the prior evaluations. These evaluations were not conducted for the purpose of exploring goal commitment, and the subsequent link to goal commitment drawn in this article was generated inductively. As with all inductive reasoning, this has the advantage that the original (primary) data collection and analysis was not conducted with a particular goal in mind, and this reduces one potential source of bias. The disadvantage is that the prior evaluations, by not looking specifically for data relating to goal commitment, may have ignored potentially useful data that would have led to a deeper understanding of the phenomena. At least one of the authors of this article was involved in each of the prior evaluations, and this too has advantages and disadvantages. Prior involvement means that the authors had a good understanding of the case and methods used in these evaluations, and had access to the primary data as reference when this was useful for verifying or expanding upon a particular inference from the evaluation reports. One disadvantage of prior involvement is that any biases or assumptions made by the authors in the prior evaluations may be perpetuated in this article. The use of secondary sources, the inductive approach, and prior involvement of authors, are each therefore further sources of limitations to this study.
Each of the 10 cases in the Results Programme shares many of the same contextual factors: the same time period (2012–2017), the same Government (the fifth National Government), and the same jurisdiction (New Zealand) with its somewhat unusual public management system. Attempts to implement the recommendations from this article in other jurisdictions should be cognisant of the impacts of contextual factors, and potential implementers of this approach are encouraged to contact the authors to discuss how the Results Programme may be adapted to fit their problem setting.
This article earlier considered the difficulty and ambiguity involved in terms like ‘coordination’ and ‘collaboration’, and these ambiguities create difficulties in grounding the findings within the broader literature. The case study describes a collaboration in the sense of joint working with shared responsibility for a common goal. It describes a range of activities performed by agencies in service to these goals, including some that could be described as both coordination and collaboration.
Findings
The four previous evaluations included a total of 45 findings; 10 of these were supported by more than one evaluation, listed in Table 3 and discussed below. For the purpose of discussion, these are grouped into three themes: actor selection, goal selection and goal monitoring. Table 3 demonstrates the value of the mixed methods approach; no single evaluation had identified all 10 features described in this study, and each evaluation included several associations that could not be corroborated.
Key findings of previous evaluations.
Strong: this feature was strongly supported as contributing to the success of the BPS Results approach; Weak: this feature was weakly supported as contributing to the BPS Results approach; NA: not applicable, this feature was not described in this evaluation.
Actor selection
Three of the features identified through more than one evaluation method related to the selection of responsible parties or actors: Maximum of three agencies Building on existing relationships Cascading levels of governance
Achieving each target involved contributions from many agencies, but the most successful approaches limited core decision-making to two or three agencies, utilising tiers of involvement when necessary. In some cases, more than three agencies had an interest in the outcome, but greater progress was made when two or three critical partners formed a core group, and involved others on an as-needed basis for information sharing and coordination. Network theory notes the tension between efficiency and inclusion (Provan and Kenis, 2008), with the assumption that coordination (transaction) costs of a group rise as membership increases (Van Huyck et al., 1990; Weber, 2000). An alternate explanation is that felt responsibility and individual effort diminish as group size increases, as observed experimentally in the Ringelmann effect (Ingham et al., 1974).
Collaboration appeared more successful when built upon trusting relationships between contributing parties, ideally with a track record of succeeding together even if on smaller projects. Thomson and Perry (2006) explain that trust reduces transactions costs by reducing the need for monitoring. An additional explanation is that where agencies value the relationship, they feel a psychological contract to deliver on their promises to each other (Ring and Van de Ven, 1994), raising commitment (see ‘Goal commitment in collaboration literature’ section for further discussion of commitment, commitments and relationships).
Agencies worked together with cascading governance (Kania and Kramer, 2011) at ministerial, chief executive, policy, operational and secretariat level. New Zealand public servants had been socialised in a system of single-point accountability (Gregory, 2006) and therefore reportedly looked for signals as to where that single point lay. The most successful governance arrangements appeared to go to great lengths to reinforce the idea of shared and equal responsibility. One example was in the resourcing of secretariats, where successful groups jointly resourced a secretariat including staff from each agency. In contrast, where a secretariat group was resourced by a single agency, this appeared to act as a signal to other agencies that they would be held less responsible for achieving the goal, resulting in those agencies feeling less committed.
Goal selection
Four features that were identified by more than one evaluation method related to the selection of goals: Selecting a small number of problems Public commitment and transparent reporting Specifying results at an intermediate-outcome level Alignment between results and targets
While it seems unlikely that 10 is a ‘magic’ number, limiting the approach to a small number of problems appears important in maintaining the attention of senior leaders. Several authors report an inverse relationship between the number of targets and the effectiveness of a performance regime (Barber et al., 2010; Boyne and Gould-Williams, 2003; Chun and Rainey, 2005; Weiss and Piderit, 1999). This may be because having a small number of problems raises the relative perceived importance of each problem, raising the stakes (or ‘attainment attractiveness’ – Klein and Wright, 1994) for success or failure in each goal.
A notable component of the Results Programme was the Government’s promise to regular and transparent reporting over five years. The New Zealand public sector has a high degree of churn, with many programmes cancelled or altered before their completion (Norman and Gill, 2011). When it is not clear that a programme will persist long enough for leaders to be held responsible for success or failure, this potentially acts to limit the felt commitment to achieving the desired outcome. Publicly setting and reporting on target progress and achievement creates a sense of commitment by increasing a programme’s potential exit costs (Cels et al., 2012). Additionally, public reporting of performance against a target can be a strong incentive to perform (Bevan and Fasolo, 2013), as public servants are typically strongly motivated by reputation – that is, being perceived as doing a good job (LeGrand, 2007). These reputational consequences may increase the extrinsic motivation towards goal attainment.
Great care was taken to choose goals that had robust synecdoche with the problem targeted and yet relatively strong attribution to the changes in service delivery. The New Zealand government describes these as ‘intermediate outcome’ measures (Baehler, 2003). By contrast, long-term outcomes, typically with poor attribution, are described as ‘end-outcomes’. Progress measures, output measures and end-outcome measures have each been criticised as poor methods of managing collaborative performance (Carey and Harris, 2016; Scott and Boyd, 2016). The use of intermediate-outcome measures in the Results Programme may have supported goal commitment by balancing the meaningfulness of the goal and the instrumentality of the actors in achieving that goal.
Participants in the Results Programme distinguish between a result (the outcome sought), a target (the degree of change) and the measure (how that change will be assessed). Results, targets and measures tend to be conflated in other target literature (for example, Smith, 2009). Separating these three components also creates risks – where there was poor fit or alignment particularly between the result and the target this drove confusion and distraction. Goal ambiguity tends to lessen feelings of commitment (Chun and Rainey, 2005).
Goal monitoring
Some goals are qualitative and subjective in nature. In contrast, the Results Programme set quantitative goals where there was little subjectivity in whether the goal had been achieved. Triangulation corroborated a final three features that related to the monitoring of the achievement of these quantitative goals: Aligned accountability systems Fast feedback and learning Sharing administrative data
Public servants are required to pursue multiple competing objectives. The Results Programme aligned goals across several accountability systems, with the same goals and actors included in reports to Cabinet, agency planning documents presented to Parliament, and in the assessment of performance by the State Services Commission (Scott, 2016). This consistency likely further increases the attractiveness of goal attainment.
The Results Programme has been characterised as data driven (Jensen et al., 2014). The most ‘successful’ of the 10 goals involved measurement systems that minimised the delay between actions and observed effects. These delays can occur in two places: between the action and its impact on societal outcomes (‘performance lag’ in Figure 1); and between the change in outcome and its measurement (‘measurement lag’ in Figure 1). Measurement delays can cause many problems: cancelling successful initiatives because they haven’t yet produced measurement results; continuing with unsuccessful initiatives because their impact is not yet known; and demotivating public servants and ministers because the impacts of recent actions cannot be seen. One interpretation of the data is that low delay allowed more adaptive management (Carey and Harris, 2016) because public servants could see what was working and make a change (‘adaptive response’ in Figure 1). Another interpretation is that up-to-date data allowed the pressure of goal attainment to be felt more acutely; all parties (public servants, ministers, and the public) had accurate and timely information on progress towards the goal, and this supported the accountability systems described above.

Delays between action and measurement.
The most successful of the 10 cases created a shared measurement system that allowed the use of data to support decision-making. This made it easier for all parties to understand both what was happening and the underlying causes. In public administration, where causal relationships can be indirect and less easily identified (Wilson, 1989), this deeper data analysis may be necessary to better target services. Access to this data also made performance more transparent, and therefore may have contributed to a greater sense of felt responsibility.
Discussion
The Results Programme consists of 10 previously intractable problems, and in each case outcomes have improved above baselines and trend data (where available) without the addition of new resources. The section above describes 10 features of the Results Programme that may have contributed to its effectiveness.
Some of these features can be found in other literature on interagency collaboration, either in empirical studies (see Carey and Crammond, 2015 for an overview), or theoretical frameworks (see in particular Ansell and Gash, 2008; Bryson et al., 2015; Emerson et al., 2012, Thomson and Perry, 2006), where they are described as ‘reduc(ing) complexity and transaction costs’ (Thomson and Perry, 2006: 28). Interagency collaboration is generally thought to be difficult, and efforts to identify how the barriers to collaboration can be lowered are commendable.
In contrast, in this article we explain these same features and others, not as making interagency collaboration easier, but instead as mechanisms for maintaining persistence and determination when collaboration is hard, through goal commitment. Some examples of how the same feature can be explained using different theories are shown in Table 4. The goal commitment theory was more consistent with the experiences of public servants, who reported that collaborating was hard. In a memoire on working in the Results Programme, Deputy State Services Commissioner Ryan Orange commented: ‘All of the Results teams met significant opposition to breaking from previously established approaches to governance, decision-making, funding, partnerships and delivery’ and ‘the passion of the leaders and teams for improving the lives of New Zealanders was essential in sustaining their efforts to overcome resistance to working differently’ (Orange, 2016: 17).
Observed features of successful collaboration can be explained as reducing transaction costs or increasing goal commitment.
In this section, we propose a framework for creating and sustaining goal commitment in interagency collaboration, as an explanation of the effectiveness of New Zealand’s Results Programme. We begin by exploring how concepts similar to goal commitment have been used in collaboration literature before, and then attempt to construct a novel framework to accommodate the findings of this article.
Goal commitment in collaboration literature
The concept of commitment is not new to collaboration literature, though different authors have used the term in slightly different ways. Thomson and Perry (2006) describe a multi-dimensional framework that involves the cyclical and reciprocal relationship between negotiation, commitment, implementation and assessment (based on Ring and Van de Ven’s, 1994 ‘process framework of collaboration’). Thomson and Perry explain that these processes work together to reduce complexity and transaction costs between agencies. It is apparent from a general reading of the article however, that Thomson and Perry (2006) intend ‘commitment’ to stand for the ‘commitments’ or promises that agencies make to each other (for example ‘commitment to future action’, p. 22). This same sense of commitment as promises is used by other authors, such as Imperial (2005), and differs from the sense of being committed to achieve a target or goal. Indeed, the parliamentary system of government in New Zealand meant that public servants were limited in the explicit ‘commitments’ that they could make to each other because these might be overruled by the priorities of individual ministers. Instead, they could only continually renegotiate provisional agreements and adapt to changing circumstances, motivated by a sense of commitment to a shared goal.
Ansell and Gash (2008) describe the importance of commitment to collaborative process, and propose that collaborative will be more successful if the participating parties feel obliged to adhere to the agreed governance mechanism. This gets closer to the ideas discussed in this article, but still describes a slightly different phenomenon. Between 2004 and 2012, New Zealand public servants were committed to working together (see ‘the sector approach’, Scott and Boyd, 2016) but with mixed results. The Results Programme differed because it was no longer sufficient only to work together, but instead public managers had to be resourceful and creative to reach the goal to which they felt committed.
Much of the collaborative governance literature notes the importance of trusted relationships (Ansell and Gash, 2008; Huxham and Vangen, 2000). This too can be related to the word ‘commitment’, in the sense of being feeling a volitional bond or felt obligation to an other person or persons as necessary or implied in the nature of the interpersonal relationship. Such a sense of obligation may be specific (the promises discussed above) or tacit and general, in the sense of feeling obliged to supporting each other, which Ansell and Gash (2008) relate to the importance of commitment to collaborative process. The nature of much of the data used in this article makes it difficult to comment on the relationships within the Results Programme; the primary sources in three of the four evaluations were written by public servants, with an understandable reluctance to comment on specific relationships in publicly available documents. The relationship between goal commitment and interpersonal or relationship commitment would be an interesting area for future study, and may potentially augment the framework proposed later in this article.
Huxham and Vangen (2013) identifies several themes that may influence collaborative success, and notes that practitioners frequently identify ‘commitment and determination’ as an important factor, but then omits this from further discussion. Without further description it is hard to be certain, but we suspect that this concept of ‘commitment and determination’ might be similar to the ‘commitment to goal achievement’ described in this article. Goal commitment has been described elsewhere as having two dimensions: intensity and duration (Nutt, 1983; Scott, 2014, 2018; Scott et al., 2015), and these two dimensions may be what was intended by the dual name ‘commitment and determination’. Similarly, Provan and Milward (2001: 416) mention the importance of ‘commitment to network goals’ as a determinant of network success, without providing guidance on how this may be engineered or supported.
If the collaboration literature is not particularly forthcoming on the concept of commitment to goal achievement, perhaps there are lessons from related concepts, like ‘motivation’ – the desire or willingness to do something. Motivation has been an area of considerable interest in collaboration literature (see Bresnen and Marshall, 2000; Burkhardt, 2001; Gazley, 2014; Mulroy and Shay, 1998). However, in these articles, motivation is seen as an initial condition – that is, individuals or organisations being motivated to begin to collaborate with others. Past attempts at collaboration in New Zealand have begun with great enthusiasm only to fade after a couple of years (see Baehler, 2003; Jensen et al., 2014). The Results Programme has differed in that commitment has been sustained over time.
While constructs related to goal commitment can be found in collaboration literature, none are sufficient for describing the sustained determination and perseverance that characterised the Results Programme.
A framework for creating and sustaining goal commitment in interagency collaboration
We interpret the case study as an example of goal commitment being used to achieve and sustain interagency collaboration, and note above that goal commitment does not appear sufficiently explained in the interagency collaboration literature. This section proposes a theoretical framework for understanding goal commitment in an interagency collaboration context that relates goal commitment theory, mission valence theory, and the data from the case study (as recorded in prior evaluations).
The ‘Findings’ section of this article identifies and describes 10 features of the Results Programme that multiple prior evaluations found contributed to its success. Even assuming that each of these 10 features are indeed truly contributive, it is likely that some are specific to the context of the case study, or that these features are specific examples of how a more general principle can be expressed. Therefore, in this section we try to group these 10 further, and relate these to goal commitment theory, in the hope that at this more generic level it is possible to identify antecedents to goal commitment that have more generalised relevance.
The findings were earlier grouped into three themes: actor selection, goal selection and goal monitoring (‘Actor selection’, ‘Goal selection’, ‘Goal monitoring’ sections, respectively, of this article). These groupings were subjective, but appeared to the authors to represent reasonable thematic categories for the 10 findings.
In the goal commitment literature, antecedents to goal commitment are described as having two primary components, consistent with expectancy theory (Vroom, 1964): expectation of goal attainment and attractiveness of goal attainment (Klein and Wright, 1994). Thus as a basis for generating a theoretical framework, the authors were able to combine the three groups of findings in this article (actor selection, goal selection, and goal monitoring), with the two primary components of expectancy theory (expectation and attractiveness), which made possible a three-by-two grid (see Table 5). Then, the findings were reexamined to determine if any of the contributive features of the Results Programme could be related to categories in Table 5, and further (below), whether these could be related to relevant theory identified earlier. We propose this framework as six principles for programme design to build and sustain goal commitment to interagency collaboration.
A framework for creating and sustaining goal commitment in interagency collaboration.
Instrumentality of actors
In goal commitment literature, attainment of the goal is something that is usually assumed to be the direct result of the performance of the individual (see review article by Latham et al., 2008). In public administration, the direct activities of public servants are frequently only indirectly related to the intended social outcome (see the production model of public performance, Van Dooren, 2006). In the Results Programme, the goals were selected at an ‘intermediate-outcome’ level, where the achievement of the goal was at least partially attributable to the actions of those public servants held responsible.
Likelihood of attainment
The likelihood of goal attainment is recognised as a contributor to goal commitment, with effort increasing as goals become more difficult and then dropping sharply if attainment becomes very unlikely (Klein and Wright, 1994). The goals in the Results Programme were selected such that there was an expectation that the level of effort of public servants would contribute to whether the targets would be achieved. For most of the goals, the achievement of the goal was in question, possible but not inevitable, for the duration of the Results Programme.
Likelihood of programme completion
Experimental studies on goal commitment assume the completion of the activity (Klein and Wright, 1994). In the public sector, this cannot be assumed. Complex social problems may require sustained attention over multiple years. Programmes that span many years run the risk that Government priorities will change and goals set now may not be relevant in five years time. In the Results Programme, the Government effectively tied its own hands, by publicly announcing the targets and promising to report every six months. More generally, this can be considered as an example of engineered exit costs (Cels et al., 2012).
Distribution of responsibility
While the goals frequently examined in goal commitment literature are individual goals (Locke and Latham, 2002), public administration goals often involve the problem of many hands (Thompson, 1980). Goal commitment diminishes as the number of responsible parties increases (Ingham et al., 1974), and New Public Management typically emphasises single-point accountability, where blame inevitably falls to one person and diminishes goal commitment of others. In the Results Programme, small groups of public managers’ were held collectively responsible for the success of the collaboration. The groups needed to be small, so that each member retained a high-level of responsibility. But they also needed to be designed to promote equal shared responsibility so that each member had an incentive to remain committed to the target.
Intrinsic motivation
Instrumentality and expectancy (both described above) are two of the three components of mission valence, along with ‘meaningfulness’ (Wright and Pandey, 2011). Rainey and Steinbauer (1999) describe mission valence as a public servant’s perception of the attractiveness of a mission. In this context, meaningfulness refers to the perception of the goal as being valuable. The targets in the Results Programme were meaningful, in that they related to problems that were important to New Zealanders, were priorities of government, and were framed at an intermediate-outcome level such that they could be seen as intrinsically valuable (in contrast to output or activity measures). Target regimes elsewhere have been criticised for decreasing intrinsic motivation (Deci, 1971; Eisenberger et al., 1999; Fisher, 1978), though this did not appear to be the case in the Results Programme. New Zealand public servants report that the targets have had a ‘galvanising’ effect on public servants and served as a ‘rallying call’ (both excerpts from interviews conducted as part of Scott and Boyd, 2017). Orange (2016: 17) reported ‘the Results spoke to the key drivers of why many of the people working on the Results had become public servants in the first place.’ The motivation of public servants differs from that in the private sector (Houston, 2000), with a larger role for public service motivation (Perry, 1996). Thus the attractiveness of goal attainment must be viewed both in the context of the extrinsic motivations traditionally assumed in goal commitment literature, and also intrinsic motivation of public servants to meaningfully serve the community.
Extrinsic motivation
Extrinsic motivation, too, may work differently in the public sector, with a typically lesser role for performance-related pay (Scott, 2016). Instead, public reputation tends to be a very strong motivator for public servants (Bevan and Fasolo, 2013). The Results Programme featured easy to understand public reporting that provided reputational rewards and sanctions for public servants. Prospect theory suggests that a fear of reputational loss is more powerful than the potential for reputational gain (Tversky and Kahneman, 1991). However, a system based purely on fear of loss of reputation may not be sustainable (LeGrand, 2007). The Results Programme featured a combination of quantitative and qualitative reporting, including ‘success stories’ that may have contributed to ongoing enthusiasm and self-esteem (two determinants of goal commitment in the private sector; Klein and Wright, 1994). Additionally, the small number of targets (10) increased the relative importance of success of failure in each target.
Implications for practice
While other studies describe ideal conditions for collaboration (see ‘collaborative opportunity’, Bryson et al., 2015), this article describes how collaboration was sustained and ultimately successful without these ideal conditions. Interagency collaboration is frequently imposed upon public servants at times and circumstances not of their choosing: political announcements, crises, or increased media attention might all put pressure on public servants to act to solve a problem that cannot be solved by agencies acting separately. In these cases, governments lack the luxury of waiting for collaborative opportunities to arise. This article points to how public servants may design solutions for solving those persistent and ‘wicked’ problems (Head and Alford, 2015) that cross agency boundaries and don’t have easy answers.
This article contributes to public administration practice through implications of three categorical types (see Table 6). The first and most notable implication for practice is to consider this case study as a proof of concept – evidence that sustained commitment and determination to achieving a shared goal is possible, even when the ‘problem of many hands’ (Thompson, 1980) suggests otherwise. The second category of implication is to consider the features and artefacts of the case study – the 10 findings identified by more than one evaluation method – may be treated as exemplars and potential candidates for policy transfer (Dolowitz and Marsh, 2000). The third category of implication is to consider the framework developed in this article to be an initial and tentative theory, with principles for designing context-specific programmes to build and sustain commitment to shared interagency goals. In this last regard, a single case study is not sufficient for confirming a generalisable theory, and would benefit from additional study as addressed below.
Implications for practice, in three types.
Implications for research and theory
This study identifies 10 features that appear to have contributed to the success of a large government programme in New Zealand. New Zealand has a strong focus on accountability (Gregory, 2006; Scott and Macaulay, forthcoming) and a history of abandoned reforms and cancelled programmes (Scott and Boyd, 2016) that shape the expectations of public servants, and likely influence how they react to the various design aspects of the Results Programme. Hence the importance and relevance of each these features is likely to be context dependent, or even specific to the case described. However, each feature can be explained as contributing to goal commitment, consistent with the experiences of public servants involved in the Results Programme, who report that they remained committed to achieving the goals despite high transaction costs. The expectancy theory of goal commitment (Klein and Wright, 1994) and various public administration literature on public service motivation (Perry, 1996) and mission valence (Wright and Pandey, 2011) provide relevant associations and constructs that have been used to develop a more theoretically grounded framework.
This framework requires further testing in different contexts, and such research could take the form of new case studies, comparative research, or secondary research on past data. As noted in the methodology section, this article relies on secondary data from earlier evaluations. The relationship between the case study and the concept of goal commitment was generated inductively, and the original evaluations were not conducted from the perspective of exploring or testing goal commitment or its antecedents. This suggests an opportunity to conduct similar re-examinations of other case data through the lens of goal commitment. For example, the various prioritisation schemes (for example, Virginia Performs, GMAP Washington, Scotland Performs, Victoria Ahead; Kohli, 2010), or target/goal schemes (for example, the US federal Cross Agency Priority Goals; Putansu, 2015) may provide additional data on which to understand the goal commitment construct. In each case, the goal would be to explore whether the framework has descriptive, explanatory, or predictive power, and whether it may be refined further.
One limitation in drawing conclusions across the collaboration literature lies in imprecision of the language used, as discussed earlier. While we characterise the case study as collaboration, in the sense of shared responsibility for a common goal, we acknowledge that some of the actions in service of those goals would elsewhere be described as cooperation or coordination, in that some solutions involve actions that were jointly conceived but separately delivered. The lack of precision (across multiple dimensions) of these terms makes it difficult to conduct like-comparisons in the form of meta-analyses or evidence syntheses. The field would benefit from more consistent use of more precise language that can distinguish between different types of action and different types of responsibilities.
This article broadly aims to contribute to our understanding of interagency collaboration. So many pages of academic journals have been filled with discussions of collaboration that we can no longer refer to it as a ‘black box’ (Thomson and Perry, 2006), and yet goal commitment remains an unexplored aspect. Ideally, research and theory would inform practice that worked on both ‘ends’ of the collaboration problem: how to improve goal commitment and how to reduce barriers. Public servants would be more committed to succeeding, and the barriers they faced would be lower, though some contexts would inevitable warrant more focus on one or the other. Refining our understanding of the antecedents and consequences of goal commitment in an interagency collaboration context would have numerous benefits, from better predicting, understanding and designing for collaborative success, to discriminatorily matching programme designs to different contexts.
In tackling this underexplored aspect of interagency collaboration, this article aims to ‘zig’ where other articles ‘zag’; while other authors explore how to lower the barriers to collaboration and make it easier, goal commitment is proposed here as an explanation for how effort can be created and sustained where interagency collaboration is hard.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
