Abstract
In a step-level public-good experiment, we investigate how the order of moves (simultaneous vs. sequential) and the number of step levels (one vs. two) affects public-good provision in a two-player game. We find that the sequential order of moves significantly improves public-good provision and payoffs, even though second movers often punish first movers who give less than half of the threshold contribution. The additional second step level—which is not feasible in standard Nash equilibrium—leads to higher contributions but does not improve public-good provision and lowers payoffs. We calibrate the parameters of Fehr and Schmidt’s model of inequality aversion to make quantitative predictions. We find that actual behavior fits remarkably well with several predictions in a quantitative sense.
Keywords
Public goods often have a step-level character; that is, the public good is provided only if some minimum threshold of contributions (or provision point) is met. Examples include the building of a bridge or a dike. More generally, teamwork where the team has to meet a specific goal has step-level public-good character. Also, charities may have properties of step-level public goods if the underlying production of the public good is subject to nonconvexities (see Andreoni 1998).
Our article makes two contributions to the literature on public goods with step levels. First, we analyze whether sequential contributions as opposed to simultaneous decisions improve public-good provision. Second, we analyze if an additional threshold, which is not feasible in standard Nash equilibrium and where the public good is provided at a higher level, improves public-good provision.
The issue of sequential versus simultaneous contributions is the subject of a growing literature. Following the theoretical works by Andreoni (1998, 2006) and Hermalin (1998), researchers have analyzed leading by example in experiments (Gächter et al., 2010b). If a first mover gives an example that is mimicked by the followers, sequential contributions to the public good may be superior to simultaneous decisions. This will particularly be the case when a first mover is better informed about the return to contributions allocated to the common endeavor (Hermalin 1998) or about the quality of a charity Andreoni (2006). 1
We study sequential versus simultaneous decisions in a step-level game with two players and with complete information. For such a setting, one would at first expect a sequential-move game to seem superior to a simultaneous-move setting. A threshold public-good game is foremost a coordination game. With simultaneous moves there are multiple equilibria; coordination failures may occur, and moreover, the public good is not provided in all equilibria. With sequential moves, there is a unique subgame perfect equilibrium in which the public good is provided. Hence, coordination and therefore public-good provision should be more frequent with sequential moves. There is, however, an aspect of sequential decision making that may reduce its alleged superiority. In the unique subgame perfect equilibrium with selfish players, the first mover contributes such that a best responding follower merely breaks even by meeting the threshold with his or her contribution. In other words, the first mover actually gives a bad example by contributing less than the followers. In an experiment, this may reduce the efficiency of the sequential-move setting: players who try to exploit this first-mover advantage risk being punished by second movers who do not best respond but contribute zero to the public good. 2 If such behavior occurs frequently, the higher efficacy of the sequential-move game will not materialize. Based on a calibrated model (see the later discussion), we hypothesize that the efficiency-enhancing effect dominates so that sequential moves improve public-good provision.
Now consider our second extension, the introduction of a second threshold. The general logic of multiple threshold public goods is that no return is obtained unless contributions meet the first level, and after this level, no additional return is earned until the second provision point is met. Multiple step levels have rarely been analyzed before (Chewning, Coller, and Laury 2001; Hashim, Maximiano, and Kannan [2011], which we discuss later), but they seem realistic in many circumstances. For example, the successful development of a new product in teamwork will typically require a minimum level of efforts of the team members. Adding a further feature or quality level of the product may be subject to an effort threshold just as developing the main product and, accordingly, the management may set two threshold levels. Further examples include a public radio or TV station that may transmit more than one program, corresponding to multiple thresholds. Public bridges or highways may be built with one, two, or more lanes. Finally, any kind of public good may be provided at various quality levels and the production of these quality levels may be subject to nonconvexities, suggesting multiple thresholds.
The interaction of the two thresholds and the order of moves can be hypothesized as follows. In our experiments, first movers in the sequential-move game may aim for the second threshold since this yields higher payoffs—provided the threshold is met. Since such first-mover contributions must be higher than those required to meet the first threshold, second movers do not feel exploited and therefore do not punish first movers by making zero contributions. However, the second threshold is not a Nash equilibrium with selfish players. Given one player aims at the second threshold by contributing a high amount, the best response of a second player is to contribute low such that the first level only is met. Thus, with standard preferences the second level is not an equilibrium (with either simultaneous or sequential moves). However, when players have Fehr and Schmidt (1999) preferences, the second threshold is a Nash equilibrium—and meeting the second threshold is of course efficient. In any event, even if some second movers exploit those first movers who aim for the second level, the public good is still provided at least at the first level, and so no efficiency loss occurs. In other words, behaviorally, the existence of a second threshold might make it more likely that the first threshold will be met. We thus hypothesize that the second step level improves public-good provision.
Our main findings regarding the two treatment variables are as follows: Sequential contribution decisions significantly improve public-good provision, even though second movers regularly punish first movers who contribute too little. This is in contrast to Gächter et al. (2010a), who find the opposite result, however, in an entirely different setting (see the later discussion). Coordination rates and payoffs are higher, whereas contributions are not higher with sequential moves. The existence of a second threshold causes significantly higher contributions, but this does not result in higher public-good provision or higher payoffs.
Our article also makes quantitative predictions for our experiment employing a fully calibrated Fehr and Schmidt (1999) model. Whereas Fehr and Schmidt’s model of inequality aversion has been used frequently in the literature, the predictions are almost always of a qualitative nature (“if players are sufficiently inequality averse, abc is an equilibrium”). We calibrate Fehr and Schmidt’s model on a (joint) distribution of the inequality parameters, and we make exact quantitative predictions (“w percentage of the first movers will contribute x,” or “given a first-mover contribution of y, the public good will be provided in z percentage of the cases”). 3
We find that the calibrated Fehr and Schmidt (1999) model not only makes remarkably accurate quantitative predictions, but it also fails in two cases. The calibrated Fehr and Schmidt model predicts second-mover behavior (given first-mover behavior) in the sequential variant extremely well. Specifically, it accurately predicts the frequency of second-mover decisions (contribute such that the step level is met vs. punish first movers by contributing zero). The prediction regarding the first movers fails. First movers should anticipate or learn that second movers punish low contributions and thus always make the payoff-equalizing contribution. However, only slightly more than one-third of them do so. First movers behave “too greedily,” as has been observed in previous experiments (e.g., Huck, Muller, and Normann 2001). The calibrated Fehr and Schmidt (1999) model also predicts well in the case with simultaneous-move contributions where some players contribute whereas others do not. Finally, the model rather precisely predicts the share of first movers who trust second movers by making a high contribution in the sequential two-threshold case. Here, the prediction regarding the second movers fails, as they exploit first movers significantly more frequently than predicted.
Literature Review
There are two major strands of the literature pertinent to our article. The first literature is about simultaneous versus sequential order of moves in public-good games. The second literature concerns public-good experiments with step-level character in general and, specifically, the small literature on experiments with more than one threshold.
As mentioned in the Introduction, several researchers have analyzed leading by example theoretically. Andreoni (1998) examines the efficiency of leadership giving. The article provides an explanation of how seed money, from a group of “leadership givers,” generates additional donations. In Hermalin (1998), a first mover may be better informed about the return to contributions allocated to the common endeavor. Therefore, a first mover may plausibly give an example to followers who rationally mimic the first mover’s behavior.
An increasing experimental literature has been triggered by these theory contributions. Following Hermalin (1998), Potters, Sefton, and Vesterlund (2005) study an experimental voluntary contribution mechanism (VCM), where some donors do not know the true value of the good. The authors conclude that sequential moves result in higher contributions of the public good. They also have a treatment where the sequencing of choices emerges endogenously. Moxnes and Van der Heijden (2003) and Van der Heijden and Moxnes (2012) also highlight the importance of leaders in a public bad experiment and show that followers invest less in the public bad when leaders give a good example. Potters, Sefton, and Vesterlund (2007) report that the leading-by-example approach depends on whether there is incomplete information in the experiment. Levati, Sutter, and Van Der Heijden (2007) also find that incomplete information crowds out the effects of leadership. This explains why some experiments have not found sequential moves to be superior (Andreoni, Brown, and Vesterlund 2002), while Potters, Sefton, and Vesterlund (2005) did. Our experiments differ from those of Potters, Sefton, and Vesterlund (2005, 2007) in that we do not include information asymmetries, and we do not employ the standard VCM. 4
Gächter et al. (2010a) is also related to our study. They experimentally study the effects of a simultaneous versus a sequential choice mode in a test of Varian’s (1994) VCM model. Varian models a public-good setting where sequential contributions are predicted to be lower than simultaneous contributions. 5 In their experimental test of Varian’s model, Gächter et al. find that sequential contributions are indeed lower than simultaneous-move contributions, although the difference in aggregate contributions across the two move structures is not as large as predicted, in part because second movers punish first movers who free ride in the sequential variant. While this is in contrast to our results, note that one of the major differences in our approach is that Gächter et al. test the Varian model, whereas we study a step-level setup. Even though we observe similar punishing behavior, the sequential-move variant is more efficient in our data.
Erev and Rapoport (1990) were the first to study simultaneous versus sequential moves in a step-level public-good game with discrete choices. In their experiments, at least three of the five players must contribute their endowments for the public good to be provided. Actions are minimal contribution sets (MCS) such that players either zero contribute or invest their whole endowment. They find that, with sequential-move choices, information about previous noncooperative choices only is more effective in public-good provision than information about previous cooperative choices. The main differences in our experiment are the discrete action space and the number of players (two in our case).
Schram, Offerman, and Sonnemans (2008) study a similar binary “all or nothing” setup and vary several relevant factors. They find that participants contribute significantly more when the value of the public good is increased. Increasing group size from five to seven players decreases the average contribution level, but the public good is provided more often when groups are large. Whether subjects play with random or fixed matching does not systematically affect behavior. Finally, subjects contribute significantly more in a “public-good” frame compared to a “public bad.” Coats, Gronberg, and Grosskopf (2009) analyze threshold public-good games with both simultaneous and sequential contributions and investigate how refund policies interact with the mechanism. Given either refund mechanism, efficiency is greater with sequential contributions. A full refund achieves higher efficiency but only for simultaneous contributions.
Cadsby and Maynes (1999) analyze a two by two factorial design with MCS (binary contributions) versus continuous contributions as one factor and refunds versus no refunds as the other. Continuous contributions turn out to significantly increase contributions and public-good provision. A money-back guarantee further encourages provision. Cadsby and Maynes (1998b) also study the impact of binary versus continuous contributions but here the focus is on gender: female groups coordinate more closely on an equilibrium (whether it is a free riding or a threshold equilibrium) than male groups do. Cadsby and Maynes (1998a) find that, with continuous contributions, business and economics students make contributions converging to the free-riding equilibrium; by contrast, the contribution of nurses cycled around the efficient threshold equilibrium.
The literature on public-good games with multiple step levels is much less. 6 Chewning, Coller, and Laury (2001) have a five-player experiment with one, two, three, or five step levels. Their experiment involves a simultaneous-move order. Compared to the baseline with one-step level, treatments with multiple levels sometimes keep the social optimum constant and lower the Nash equilibrium contributions, sometimes—as in our case—they increase the group optima contributions but leave the Nash equilibria unchanged. We discuss their design and results in detail subsequently. Recently, Hashim, Maximiano, and Kannan (2011) analyze a game with five levels and five players. The authors vary information feedback about other members’ contributions to a subsample of group members.
Experimental Design and Procedures
In our experiments, there are two players, player 1 and player 2, who each have a money endowment e = 10. They can make a voluntary contribution, ci
, to the public good, where
In two of our four treatments, there is one threshold for the provision of the public good. If the sum of contributions is at least 12, this yields an additional payoff of 10 to both players. Any contributions between 1 and 11 and beyond 12 are wasted. More formally, if xi
denotes player i’s monetary payoff, then
The other two treatments involve an additional second threshold of 18. If
Since 2e > 18, both thresholds of the public good are feasible, but, due to e < 12, no player can meet the threshold on his or her own. Further, because 2 × 10 > 12 and 2 × 15 > 18, the provision of the public good at both provision points maximizes joint payoffs. Note that the return on contributing one euro at each of the two levels is the same.
We have four treatments labeled S
Treatments.
Subjects play this game over ten periods. The payoffs of the previously mentioned game were denoted in euros in the experiments (so that the exchange rate was one to one). In each period, subjects were endowed with e = 10 euros. The final payoff at the end of the experiment was determined by the earnings of one randomly chosen period. (See also the instructions in the Online Appendix.)
We have three entirely independent matching groups per treatment. Each experimental session contained only one matching group. The size of the sessions or matching groups varied between ten and eighteen subjects. (We control for session size in our data analysis subsequently.) In each session and each period, subjects were randomly matched into groups of two players. In the S
The subject pool consists of students from various fields from the University of Frankfurt. In total, we had 191 participants. For the step-level public-good experiments, we had 160 participants who earned on average 11.3 euros. Further, we employed 31 subjects to replicate the results of a previous study we used for calibration (see the next section). They earned on average 12.3 euros. The experiment was programmed in z-Tree (Fischbacher 2007). Sessions lasted about sixty minutes.
Predictions
Assumptions
We now derive the one-shot Nash equilibrium predictions for this public-good game. In addition to standard Nash predictions (selfish players who maximize their own monetary payoff), we will use Fehr and Schmidt’s (1999) model, henceforth F&S. In their model, players are concerned not only about their own material payoff but also about the difference between their own payoff and other players’ payoffs. Assumption 1 defines the two-player variant of their model.
Here, xi and xj denote the monetary payoffs to players i and j and α i and β i denote i’s aversion toward disadvantageous inequality (envy) and advantageous inequality (greed), respectively. Standard preferences occur for α = β = 0. Following F&S, we assume 0 ≤ β i < 1.
Using the specific functional forms of the step-level public-good game for xi
previously, we can write the F&S utilities as a function of contributions directly, so that we obtain
whereas, for the two step-level treatments, we get
Using this model, we will make quantitative predictions. We fully calibrate the F&S model using the joint distribution of the α and β parameters observed in Blanco, Engelmann, and Normann (2011). For each subject, they derive an α i from rejection behavior in the ultimatum game and a β i from a modified dictator game. 7 The distribution is reproduced in Table S1. On average, α = 1.18 and β = 0.47.
There are several reasons to follow Blanco, Engelmann, and Normann (2011) here. First, while Fehr and Schmidt (1999) derive distributions for the α and β parameters based on data from previous ultimatum game experiments,
8
here, we need the joint distribution of the parameters. We are not aware of any joint distribution of inequality-aversion parameters for the F&S model with the exception of Fehr, Kremhelmer, and Schmidt (2008), who assume that there are 60 percent players with α = β = 0 and 40 percent fair types with α = 2 and β = 0.6, which seems too coarse for our purposes. Second, the joint α − β distribution has been successfully replicated in, for instance, Dannenberg et al. (2007), Teyssier (2012), and Kölle, Sliwka, and Zhou (2011). Since our subject pool differs from the one used in Blanco, Engelmann, and Normann (2011) and the aforementioned experiments, we elicited a joint α − β distribution using thirty-one participants from the current subject pool (not necessarily the same subjects). We find no significant differences between the two α and β distributions according to Kolmogorov–Sminov tests (α: D = 0.123, p = .872; β: D = 0.150, p = .663). So, we successfully replicate the data of Blanco, Engelmann, and Normann. Third, the use of this joint distribution (henceforth “α − β data set”) seems promising, as it successfully predicts outcomes in several games (ultimatum game, sequential-move prisoner’s dilemma, public-good game) in Blanco, Engelmann, and Normann, which have a similar complexity as the present game.
Sequential Moves, One Threshold
We start with the sequential-move variant with one threshold (S
Next, consider players whose preferences and beliefs are consistent with Assumptions 1 and 2. Even if
where we drop the S subscripts of the inequality parameters for simplicity. The
Based on our Assumptions 1 and 2, we now predict the frequencies of public-good provision as a function of cF
. For each player in that data set (see Table S1), we determine the
Table 2 shows the results of this calibration for the S
Predicted Second-mover Responses Conditional on First-mover Choices and the Resulting Expected First-mover Monetary Payoff in the S
Sequential Moves, Two Thresholds
Next, consider the sequential-move variant with two thresholds (S
Players with standard preferences will not provide the public good at the second level in the subgame perfect equilibrium. Given
Now assume F&S players and begin with the second movers. With
Consider next the first movers. cF
= 10 will never be chosen in a perfect Bayesian equilibrium by first movers because cF
= 9 triggers the same second-mover response as cF
= 10 (in terms of public-good provision) but cF
= 9 yields a higher expected payoff and higher F&S utility than cF
= 10. As for the choice between cF
= 8 or cF
= 9, we find that cF
= 8 yields a lower expected monetary payoff than cF
= 6 (see Table 2) and accordingly an even lower F&S utility. Hence, a risk-neutral first mover will never choose cF
= 8 in a perfect Bayesian equilibrium. The remaining possibilities are that first movers will choose either cF
= 6 or cF
= 9. Contributing cF
= 6 yields an expected utility of 14 and cF
= 9 gives an expected utility of 15.015 – 1.182α. Now 15.015 – 1.182α > 14 iff α < .859. This is predicted to hold for 36 percent of the subjects in the α − β data set.
Taking second- and first-mover predictions together, we finally derive the prediction for the frequencies of public-good provision. We expect the public good to be provided at step level 1 with a frequency of 0.64 + 0.36 × 0.197 = 0.711 and at step level 2 in the rest of the cases.
Simultaneous Moves, One Threshold
With simultaneous moves, there are multiple equilibria both in the standard model and in the F&S model. With standard preferences, both players contributing nothing and all allocations where c
1 + c
2 = 12 are the pure-strategy equilibria.
9
Perhaps somewhat surprisingly, all of these equilibria are also Nash equilibria with calibrated F&S preferences except for those where
We believe that it is unlikely that entirely symmetric players will coordinate on asymmetric equilibria and we therefore focus on symmetric equilibria. The two symmetric pure-strategy Nash equilibria are
With the calibrated F&S model, the symmetric pure-strategy (Bayesian Nash) equilibria
There is, however, also a qualitative difference to the standard case. With standard preferences, all players have the same best reply: if less than 60 percent of the players are expected to contribute, nobody will contribute (and vice versa if more than 60 percent contribute). With the calibrated F&S model, it is not the case that all players have the same best response. If less than 64 percent of players are expected to contribute ci
= 6, some players will still contribute. Learning is slower, and the shape of the best response correspondence differs from the standard case. We discuss this in detail subsequently.
Simultaneous Moves, Two Thresholds
We turn to the variant with simultaneous-move game with two thresholds (S
We now look for a Bayesian Nash equilibrium of players with F&S utilities where the second level of the public good is provided. Suppose that some types choose c = 9. Previously, we have seen that, given ci = 9, 80.3 percent of all types will reply with cj = 9 whereas the rest plays cj = 3. Hence, there cannot be a Bayesian Nash equilibrium where all types choose cj = 9. We therefore look for a Bayesian Nash equilibrium, where p percent of all F&S types choose ci = 9 whereas 1 − p choose ci = 3.
The expected utility from playing c = 9 is
For F&S players with α = β = 0, this condition is never met (as seen previously); that is, selfish own utility maximizers will always choose c = 3. If p is sufficiently large, however, inequality-averse players prefer c = 9. In the α − β data set, we find that for p = .72 exactly 72 percent of the players (forty-four players) have
It remains to check, though, whether it pays to deviate to any contribution other than nine or three. The only possible deviation is to contribute c = 0 since any other contribution is dominated by either c = 0 or c = 3. Contributing c = 0 yields an expected F&S utility of 10 − 3β – 0.72 × 6β. But the equilibrium action c = 3 yields 0.72(17 − 6β) + (0.28)7 that is strictly larger for all
Hypotheses
Based on Propositions 1 through 4, we now derive two hypotheses regarding the impact of our two treatment variables. Comparing the predicted public-good provision in S
Our second hypothesis, though, does depend on assuming F&S preferences. Propositions 1 through 4 show that public-good provision can be improved if there is the second threshold. There are multiple equilibria in the S
Overview of the Results
We present our results in three parts. In this section, we present a brief overview of the results. The next section presents tests of Hypotheses 1 and 2. The subsequent section presents a more detailed analysis of the predictive power of the calibrated F&S model.
Table 3 presents a summary statistics of the averages of our main variables of interest. (session-level data and variability measures can be found in Table S2). The second threshold level leads to a higher (sum of) contributions than the one-level variant both in the simultaneous and in the sequential treatments. The sequential-move order leads to a higher sum of payoffs compared to the simultaneous treatments. Public-good provision at the first level is most effective in the treatments with sequential moves. Public good level 1 is provided most frequently (85.56 percent) S
Summary Statistics of Our Four Treatments.
Note: Note that the public good is provided at level 2
Table 3 thus suggests that we do find tentative support for Hypothesis 1. Regarding Hypothesis 2, the second step level improves contributions; it also improves public-good provision (at level 1) in the S
Main Treatment Effects
We now report tests of Hypotheses 1 and 2. We mainly apply regression analysis where we take possible dependence of observations into account by clustering at the session level. We additionally report, in footnotes, nonparametric tests in which case we count each randomly matched session as one observation. 12 In these cases, we report two-tailed p values.
As dependent variables we use sequential (a dummy which is equal to 1 if the move order is sequential), twolevel (a dummy which is equal to 1 if there are two levels), the interaction sequenial × twolevel; furthermore we control for period and the sessionsize. We typically report three regressions. Regression 1 reports the impact of the treatment variables sequential and twolevel only. Regression 2 includes the interaction sequential × twolevel, and Regression 3 adds period and sessionsize. We ran further regressions where we add the interactions of sequenial, twolevel, and sequenial × twolevel with period. We briefly report whether these additional regressions (not fully reported here but are available upon request) lead to qualitatively different results in each of the following four sections.
Sum of Contributions
We first analyze the sum of contributions of the (randomly matched) two-player groups. The left panel of Table 4 reports a linear regression suggesting that the sum of contributions is not significantly influenced by the order of moves. Consistent with our Hypothesis 2, adding the second threshold leads to a significantly higher sum of contributions. 13 The interaction of a sequential-move order and two levels does not lead to a further increased sum of contributions. Period is not significant, thus the sum of contributions is not affected by time dynamics. This still holds when we additionally employ the interactions of sequential, twolevel, and sequential × twolevel with period.
OLS Regressions of Sum of Contributions and Sum of Payoffs.
Note: OLS = ordinary least squares. Robust standard errors in parentheses.
*p < .1, **p < .05, ***p < .01.
The variable sessionsize is significant; that is, in sessions with more participants, contributions are lower. While the coefficient is small, we note that this is consistent with findings in Botelho et al. (2009). In their article, repeated settings with “random strangers” and “perfect strangers” matching protocols are compared. The authors find that the assumption that subjects treat random stranger designs as if they were one-shot experiments is false. Our results indicate that the session size, and hence the likelihood of meeting a random stranger again, has an impact on contributions.
Sum of Payoffs
In the right panel of Table 4, we report the results of a linear regression on the sum of payoffs of the two players. The table shows that the sequential contribution mechanism significantly improves payoffs. 14 The second step level insignificantly reduces the payoffs. This can be explained by the fact that two thresholds increase contributions but, as we will see, the second level is rarely actually achieved. The size of the sessions is weakly significant, but, again, the coefficient is small.
When we add the interaction sequential × twolevel, we find that it significantly increases subjects’ payoff compared to the baseline S
Public-good Provision
Table 5 presents probit regressions of the frequency of public-good provision where the left panel is about public-good provision at level 1. The dependent variable equal to 1 if and only if the first threshold is met (i.e., if and only if
Probit Regressions of Public-good Provision.
Note: Robust standard errors in parentheses.
*p < .1, **p < .05, ***p < .01.
The regressions in the left panel show that sequential significantly improves the public-good provision at the first threshold. This supports Hypothesis 1. 15 The implementation of a second threshold does not lead to a higher frequency of public-good provision. Interacting sequential with two thresholds suggests borderline significant support for an increased public-good provision which, however, disappears once we control for period and sessionsize. Overall, we do not find support for Hypothesis 2, which predicts that the second threshold leads to more public-good provision. In regression 3, we find that the coefficient of sessionsize is negative and weakly significant. That is, sessions with a higher numbers of subjects exhibit lower public-good provision. We note, however, that the coefficient of sessionsize is small. Adding the interactions with period does not change the results qualitatively, although the treatment variable sequential turns out to be highly significant in this regression.
Table 5 also presents a probit regression of the frequency of public-good provision of level 2. (Here, twolevel cannot be part of the regression analysis, of course.) Sequential is again significant, that is, sequential-move contributions also stimulate the provision of the second level which is additional support for Hypothesis 1. The dummy sessionsize is not significant. Regression 2 reveals that public-good provision at level 2 moderately decreases over time. Adding the period interactions in an additional regression (not reported here) show that the negative time trend is driven by sequential. In this regression, period is insignificant but period × sequential is.
Coordination Rates
We define
Figure S1 compares coordination in the simultaneous and the sequential treatment with one threshold. In S
Figure S2 compares coordination rates in the treatments with two thresholds. This plot again documents the superiority of the sequential- over the simultaneous-move variant. In S
However, two levels are efficient in the environment with sequential moves that explains the significance of our interaction term sequential × twolevel. The result is driven by first movers contributing higher amounts in S
What Improves Payoffs in the Sequential Case?
We saw that both public-good provision and coordination are significantly better in the sequential treatments. We also saw that higher payoffs occur in S
In order to investigate this question, we analyze the sum of payoffs conditional on the public-good provision. First, conditional on
Now we condition on
To sum up, if the public good is provided, payoffs and coordination are similar in S
The Predictive Power of the Calibrated F&S Model
We now discuss the quantitative predictions of the F&S model in more detail. We begin with Proposition 1. Figure 1 contrasts the predictions made in Table 2 to the observations of the frequency of second movers who contribute
In S

Predicted frequencies (based on the calibrated Fehr and Schmidt model) and observed frequencies of second movers contributing such that the public good at level 1 is provided in S
Figure 2 is a bubble plot of first and second movers in S

Frequencies of first- and second-mover choice combinations where the bubble size corresponds to frequency. The Pareto frontier can be found where
Intriguingly, the second-mover prediction of Proposition 2 fails (whereas it was the first-mover prediction of Proposition 1 that failed). The first mover in the two-level case is in a trust game–like situation. If the first mover chooses cF = 9, he or she can be exploited by second movers. While the calibrated F&S model predicts that more than 80.3 percent of the second movers will be trustworthy, it turns out only 50.9 percent are. Predicted and observed share differ significantly (binomial test, p < .05). The failure of the theory seems surprising since the costs of being trustworthy are low here: second movers gain only one additional euro by exploiting the first mover, but this costs the first mover five euros. (See the discussion at the end of the section).
We finally turn to Proposition 3, the S
Figure S3 (see the online supplement) illustrates what might be going on. It shows the best reply correspondences for standard selfish players, for F&S players and also for players with standard preferences but with a degree of risk aversion according to the findings in Holt and Laury (2002).
With selfish and rational players, the best reply correspondence has a “bang-bang” property. If the belief is that player j chooses ci = 6 less than 60 percent, all players will best respond with ci = 0 and vice versa for a belief of more than 60 percent. With the calibrated F&S model, this is not the case. For beliefs between (roughly) 40 percent and 80 percent, the best replies of the various F&S types differ. For example, given a belief that 70 percent of all players choose ci = 6, only 75 percent of the players will best respond with ci = 6, where 25 percent still choose ci = 0.
As mentioned in Proposition 3, the share of players choosing cF = 6 required such that cF = 6 is a best reply that is slightly larger with F&S players. Inequality aversion has an effect similar, in fact a stronger effect, than does risk aversion (on average, players in Holt and Laury [2002] are slightly risk averse). We also see that the best replies differ from the case with standard preferences. Around the mixed-strategy equilibrium, the best replies are not vertical but somewhat “flat,” implying that not all players will best reply once the fixed point of the mixed strategy is exceeded. This is what we see in the data.
What can we conclude from the analysis of the calibrated F&S model? First, we find remarkable confirmations of the predictions of the model. One may argue that, regarding S
How about the two contradictions to the calibrated F&S model then? First, we found that first movers behaved too greedily to be consistent with Assumptions 1 and 2, providing cF < 6 too often. This finding is not new. For standard ultimatum-game experiments, it can be argued that offering the equal split may be payoff maximizing (assuming risk neutrality), but about half the proposers offer less than the equal split. 19 Huck, Muller, and Normann (2001) show that, in quantity-setting duopoly, Stackelberg followers are inequality averse, but the Stackelberg leaders still choose too high an output. The payoff maximizing (and inequality minimizing) output in that data set was the symmetric Cournot-Nash solution. In ultimatum games, the Stackelberg game, and this study, risk-loving behavior can explain the first-mover behavior. However, it could also be that first movers feel entitled to more than 50 percent of the pie, whereas second movers regard the equal split as fair. Social norms may be perceived differently by first and second movers.
We secondly saw that second movers exploit first-mover trust (that is, cF
= 9) too often in S
Conclusion
How should, say, two academics organize their joint work when the goal is that a certain threshold in terms of quality of the journal has to be met? Our experiments indicate two answers to this question. First, we find that an additional second step level (say, targeting an A- rather than a B-level journal) does lead to significantly higher contributions (efforts in the case of teamwork), although the impact on payoffs is ambiguous and insignificant. The effect on public-good provision is positive, especially in the sequential case (where the effect is significant). The logic is as follows: first movers often contribute high 20 such that the second step level can be met. Second movers may exploit this by contributing less, however, they still contribute enough so that the probability of meeting the first step-level increases. For academic teamwork, say, a strategy like “we need to invest this much effort for a B journal, but with that much more work we could go for an A-level journal” might work out. With simultaneous moves, however, coordination failure becomes more frequent.
Second, we find that the sequential-move variant yields more frequent provision of the public good and higher payoffs. This confirms the literature on leading by example where, in our setup, it is mainly the better coordination that renders the sequential mechanism superior in the threshold public-good game. Even though some low-contributing first movers (who actually give a bad example) are punished by second movers, higher provision rates and payoffs emerge. The finding is in contrast to Gächter et al.’s (2010a) test of Varian’s (1994) model. They find that sequential contributions are lower with sequential moves, but the difference is not as big as predicted. One reason for this is that, as in our setting, second movers sometimes punish first movers. In our setting, the sequential-move variant is more efficient.
As mentioned in the literature survey, Chewning, Coller, and Laury (2001) run five-player step-level public-good games with simultaneous moves and different numbers of provision points (ranging from 1 to 5). Their treatments with two and three thresholds can be compared to our experiments with simultaneous moves because in these treatments, the Nash equilibria are the same (0 and 7.5 in their case), whereas the Pareto optimum is higher with three than with two thresholds. This is the same in our experiment.
21
Comparing two and three thresholds, Chewning, Coller, and Laury find that contributions increase in the first five periods with three thresholds. However, in periods eleven to fifteen, they are below the two thresholds case. This is consistent with the findings in our treatment with two levels where two levels initially lead to higher contributions. We also find that contributions decrease over time. They are, however, higher than that in the one-level treatment throughout. Chewning, Coller, and Laury conclude that the decrease in the treatment with three thresholds is due to coordination problems. Thus, the high degree of coordination failure in our S
Based on a fully calibrated Fehr and Schmidt’s (1999) model, we make accurate ex ante predictions. We find that actual behavior fits quantitatively well with these predictions. Specifically, the F&S model predicts the second-mover responses amazingly well. While the predictive power on first-mover behavior is less impressive, similar findings have been observed before in other sequential games. The calibrated Fehr and Schmidt model also predicts behavior well in the sequential treatment with two step levels, and in the simultaneous-move case with one level.
Footnotes
Acknowledgments
We are grateful to Volker Benndorf, Dirk Engelmann, Nikos Nikiforakis, Theo Offerman, Martin Sefton, and Marta Serra Garcia for detailed comments on our article. We are also indebted to audiences at ESA Innsbruck, University of Frankfurt, University of Heidelberg, Centre for European Economic Research Mannheim, Hohenheimer Oberseminar, the Annual Congress of the European Economic Association in Oslo, and the Annual Meeting of the Verein für Socialpolitik in Frankfurt for helpful comments.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
