Registered Replication Report

Abstract

Finkel, Rusbult, Kumashiro, and Hannon (2002, Study 1) demonstrated a causal link between subjective commitment to a relationship and how people responded to hypothetical betrayals of that relationship. Participants primed to think about their commitment to their partner (high commitment) reacted to the betrayals with reduced exit and neglect responses relative to those primed to think about their independence from their partner (low commitment). The priming manipulation did not affect constructive voice and loyalty responses. Although other studies have demonstrated a correlation between subjective commitment and responses to betrayal, this study provides the only experimental evidence that inducing changes to subjective commitment can causally affect forgiveness responses. This Registered Replication Report (RRR) meta-analytically combines the results of 16 new direct replications of the original study, all of which followed a standardized, vetted, and preregistered protocol. The results showed little effect of the priming manipulation on the forgiveness outcome measures, but it also did not observe an effect of priming on subjective commitment, so the manipulation did not work as it had in the original study. We discuss possible explanations for the discrepancy between the findings from this RRR and the original study.

Keywords

commitment rejection relationships replication preregistration

Protocol vetted by: Eli J. Finkel

Protocol edited by: Daniel J. Simons

Multilab direct replication of Study 1 from: Finkel, E. J., Rusbult, C. E., Kumashiro, M., & Hannon, P. A. (2002). Dealing with betrayal in close relationships: Does commitment promote forgiveness? Journal of Personality and Social Psychology, 82, 956–974.

Data and registered protocols: https://osf.io/s3hfr/

Citation: Cheung, I., Campbell, L., LeBel, E., . . . Yong, J. C. (2016). Registered Replication Report: Study 1 from Finkel, Rusbult, Kumashiro, & Hannon (2002). Perspectives on Psychological Science, 11, 750–764.

Even in the closest relationships, people sometimes betray their partner’s trust. Such betrayals introduce stress, and the way partners handle such threats to their relationship can have lasting consequences. Offering forgiveness can be more constructive than blaming or retaliating. What motivates partners to forgive?

Many studies report an association between relationship commitment and willingness to forgive transgressions (e.g., Cann & Baucom, 2004; Fehr, Gelfand, & Nag, 2010; Karremans, Van Lange, Ouwerkerk, & Kluwer, 2003; McCullough et al., 1998; Tsang, McCullough, & Fincham, 2006). As with any correlation, though, disentangling the direction of causality can be challenging. Using interdependence theory as a guiding framework, Finkel, Rusbult, Kumashiro, and Hannon (2002) provided one of the only experiments addressing the causal relationship between commitment and responses to betrayal. They used a priming task to experimentally manipulate commitment (low or high) and then assessed forgiveness responses.

In Study 1 of their article, the focus of this Registered Replication Report (RRR) project, participants were primed by writing responses to open-ended prompts that guided them to think about either their dependence and commitment to their partner (high commitment) or their independence and lack of commitment to their partner (low commitment). High commitment prompts included five items such as “Describe two ways in which you feel that your life has become ‘linked to’ your partner” and low commitment prompts used items such as “Describe two ways in which you are independent of your partner.” Then, in an ostensibly unrelated study, participants read descriptions of 12 hypothetical betrayals committed by their partner and indicated how they would react. These potential reactions corresponded to four response tendencies: exit, voice, loyalty, and neglect (Rusbult, 1993; Rusbult, Zembrodt, & Gunn, 1982).

The four possible reactions to the hypothetical betrayals differ along two dimensions (Rusbult, 1993; Rusbult et al., 1982): constructive/destructive and active/passive. Exit responses are actively destructive: People deliberately respond in a way that harms the relationship, such as trying to get even with their partner. Neglect responses are passively destructive: People respond in a way that weakens the relationship rather than actively addressing the betrayal, such as giving their partner the “cold shoulder.” Voice responses are actively constructive: People respond in a way that benefits the relationship, such as discussing the betrayal with their partner to understand why it happened. Loyalty responses are passively constructive: People respond by waiting for the situation to improve, such as by maintaining the view that their partner is a good person or believing that their partner’s behavior was unintentional.

Finkel et al. (2002) predicted that participants exposed to a high-commitment prime would exhibit greater forgiveness, reacting to betrayal with “lesser exit and neglect along with greater voice and loyalty” (p. 960) than would those exposed to a low-commitment prime. They observed the predicted effects for exit and neglect responses, but not for voice and loyalty responses. The effect of the commitment prime on exit and neglect responses builds on earlier correlational research, showing that increased commitment may help partners to respond to betrayals of trust in less destructive ways.

This highly cited paper serves as a cornerstone for the theoretical importance of relationship commitment as a predictor of relationship outcomes, including forgiveness. The findings have important implications for the theoretical understanding of forgiveness and, assuming the self-report response measures predict actual responses to betrayal, for practical implications such as couple therapy. No direct replications of this study have been published. This RRR is designed to provide a direct replication of this influential finding and to provide a more precise estimate of the size of the effect of this commitment prime on how people report that they would respond to betrayals from a romantic partner.

The authors of the original study noted the need for replication of the pattern in which the prime affected negative responses (exit and neglect tendencies) but not positive responses (voice and loyalty), especially given that they had originally predicted effects for all four measures. They explained: “. . .to the extent these results are replicated in future work. . .such findings suggest that commitment exerts its motivational effects precisely where such effects are most critical”—on the inhibition of “potentially devastating destructive impulses”—and speculated that it may be “. . .less important that close partners enact constructive behaviors. . .” (Finkel et al., 2002, p. 970).

In this project, 16 labs completed independent, preregistered direct replications of the original study, all following the same vetted protocol. The primary analysis specified in the protocol involved testing the influence of commitment prime (high vs. low) on exit and neglect forgiveness (outcomes that showed an effect in the original study); auxiliary analyses tested the influence of commitment on voice and loyalty forgiveness (outcomes that, despite Finkel et al.’s hypotheses, did not show an effect in the original study).¹ The meta-analytic results of these separate analyses are presented in the Results section.

Protocol Development and Requirements

Cheung, Campbell, and LeBel proposed this RRR project and developed the protocol in consultation with Eli Finkel, the lead author of the original study. Finkel provided all of his original materials and reviewed the protocol and scripts. The protocol is available on the OSF project page for this RRR project (https://osf.io/s3hfr/).

Perspectives on Psychological Science publicly announced a call for laboratories interested in participating in this replication project on March 2, 2015. The deadline for laboratories to submit their application to participate was March 23, 2015. A total of 21 labs applied to join the replication project, and 16 labs from 5 countries completed the study.

Each lab preregistered a detailed plan for implementing the approved protocol prior to conducting their study, and the editor reviewed each plan to ensure that it met the requirements of the protocol. These preregistered implementation plans are linked from the OSF project page. Labs noted any deviations from the standard protocol in their preregistration, and they noted any departures from their preregistration that occurred during data collection on their OSF page. All of the researchers involved in conducting replications as part of this project are coauthors on this report.

Participants

Each study tested a minimum of 50 participants in each prime condition, with approximately the same proportion of men and women in each condition (each study included between 20% and 80% women). As in the original study, participants were recruited exclusively from undergraduate psychology participant pools or from an equivalent population recruited in other ways. All participants were 18–24 years old, with an average age of approximately 18–21 years. Participants were required to be currently involved in a dating relationship of at least 1 month in duration (see Table 1 for demographic information about each laboratory’s sample).

Table 1.

Demographic Information for Participating Labs

Lab	TotalN	MaleN	FemaleN	Other or unreported N	Meanage	SD age	Exclusion 1 Male	Exclusion 1 Female	Exclusion 1 Total	Exclusion 1 & 2Male	Exclusion 1 & 2Female
Original	89	22	67	0	19.13	2.31	0	0	0	0	0
Aykutoglu	142	46	95	1	21.59	2.13	15	19	35	15	19
Bredow	160	52	108	0	18.79	0.98	4	3	7	17	36
Caprariello	127	66	61	0	19.73	2.74	4	4	8	5	5
Carcedo	100	20	80	0	20.22	1.46	0	0	0	0	3
Carson	201	47	154	0	20.28	2.08	5	20	25	21	78
Cheung	160	46	114	0	18.48	0.96	1	1	2	8	29
Cobb	173	88	85	0	19.6	1.59	12	7	19	14	12
Collins	102	25	77	0	18.84	0.95	0	0	0	0	0
DiDonato	105	19	86	0	19.67	3.48	1	3	4	7	24
Fuglestad	124	28	94	2	22.19	4.97	8	11	20	14	35
Goldberg	111	37	74	0	19.41	1.63	0	0	0	0	0
Hoplock	209	46	161	2	19.01	1.55	7	16	24	11	43
Sucharyna	187	81	105	1	19.91	3.74	19	17	36	33	53
Tidwell	101	28	72	1	21.31	6.90	5	9	14	7	26
Vranka	162	37	125	0	21.38	1.61	3	6	9	13	38
Yong	120	54	66	0	21.5	1.49	1	6	7	3	10
RRR Total	2284	720	1557	7	-	-	85	122	210	168	411

Note: RRR = Registered Replication Report.

Testing setting

Participants were tested in-person either individually or in small groups, and those tested in groups could not see the responses of other participants. Each testing station was set up so that participants could complete both the paper-and-pencil and computer-based components of the study. Each participating laboratory uploaded photographs of their testing setting to their OSF page. The person conducting the experiment had to be at least 20 years old and needed to have experience collecting experimental psychology data and interacting with participants. Assignment of participants to conditions was randomized by the experimental script so that the experimenter could remain blind to the condition assignment.

Materials

In the original study, all of the questionnaires were administered using paper and pencils rather than computers. For the RRR protocol, the commitment prime was administered using a paper-and-pencil questionnaire, but the forgiveness measures were collected via a computer-based Qualtrics (www.qualtrics.com) survey. The original study used verbal debriefing to assess suspicions about the link between the prime and the forgiveness measures. The RRR study instead used a 6-item, computer-based funnel debriefing questionnaire as a more systematic way to test for suspicion. All study materials are available from https://osf.io/s3hfr/.

Data collection

When the subject pool required a description, the study was described as being about the participant’s dating relationship. If participants had broken up with their partner since signing up for the study, they were instructed to describe their most recent dating relationship. R scripts (R Development Core Team, 2008) were used to generate a randomized order of condition assignments for men and for women. These scripts ensured that approximately equal numbers of people were assigned to each condition and that similar proportion of men and women were assigned to each condition (https://osf.io/s3hfr/).

As in the original study, the experimenter told participants that they would be participating in two separate studies (so that they would be unaware of the link between the commitment prime and the forgiveness measures). The use of paper-and-pencil responses for the commitment prime and computerized presentation for the forgiveness survey reinforced this cover story. Participants were told that the primary study would take only 20 minutes and were asked to help the experimenter’s mentor or supervisor with his or her research program by first filling out a short open-ended questionnaire (the commitment prime). The cover page of the questionnaire was placed face up, and it consisted of a letter from the experimenter’s mentor or supervisor. The letter included a description of the purpose of and instructions for the questionnaire, as well as an expression of gratitude to the participant. The commitment prime questionnaire was on the back of this letter. The items in the priming questionnaire appear in Table 2.

Table 2.

Priming Items for Each Commitment Prime Condition

Condition	Priming items
High commitment	1. If your relationship were to end in the near future, what would upset you the most about not being with your partner anymore? 2. What is the number one reason why it would be nice to grow old with your partner? 3. Describe two ways in which you feel that your life has become “linked to” your partner. 4. What two characteristics of your partner make you think that you could be happy living together in the long run? 5. Describe two reasons why you are (or could become) committed to your relationship.
Low commitment	1. Describe one of the activities that you enjoy engaging in when your partner is not around. 2. What is one trait that your partner will develop as he/she grows older? 3. Describe two ways in which you are independent of your partner. 4. What are the two most salient characteristics of your partner? 5. Describe two reasons why people (in general) become involved in romantic relationships.

Condition

Priming items

High commitment

1. If your relationship were to end in the near future, what would upset you the most about not being with your partner anymore?
2. What is the number one reason why it would be nice to grow old with your partner?
3. Describe two ways in which you feel that your life has become “linked to” your partner.
4. What two characteristics of your partner make you think that you could be happy living together in the long run?
5. Describe two reasons why you are (or could become) committed to your relationship.

Low commitment

1. Describe one of the activities that you enjoy engaging in when your partner is not around.
2. What is one trait that your partner will develop as he/she grows older?
3. Describe two ways in which you are independent of your partner.
4. What are the two most salient characteristics of your partner?
5. Describe two reasons why people (in general) become involved in romantic relationships.

After they completed the priming questionnaire, participants turned it over and proceeded to the computer questionnaire. Participants read descriptions of 12 hypothetical betrayals committed by their partner (e.g., “Your partner talks to friends about private issues in your relationship”; “Your partner makes fun of you when you talk about your deepest fears”) and indicated how likely they would be to react in a variety of ways using a 9-point rating scale from 0 (not at all likely to react this way) to 8 (extremely likely to react this way). The various reactions contributed to exit, voice, loyalty, and neglect index scores, with one item assessing each tendency in response to each scenario. Table 3 shows the list of scenarios and the response items corresponding to each forgiveness measure.

Table 3.

List of Scenarios and the Response Items Corresponding to Each Forgiveness Measure

Scenario	Response items
1. Your partner talks to friends about private issues in your relationship.	I would tell my partner that it will take a long time to make it up to me. (E) I would calmly tell my partner why I’d prefer that our private life remain private. (V) I would assume that my partner probably didn’t mean to expose our private life. (L) I would dwell on how angry I feel, but wouldn’t talk to my partner about it. (N)
2. Your partner makes fun of you when you talk about your deepest fears.	I would assume that my partner must feel very uncomfortable about the issue underlying my fears. (L) I would imagine ways to obtain revenge in the future. (N) I would make fun of my partner at the next available opportunity. (E) I would talk about how important it is that we understand each other’s weaknesses. (V)
3. Your partner becomes sexually intimate with another person.	I would retaliate, becoming sexually intimate with someone myself. (E) I would imagine breaking up because there are “other fish in the sea.” (N) I would suggest that we have a positive talk about sexual monogamy. (V) I would remind myself that in general, my partner treats me very well. (L)
4. Your partner deliberately says something that hurts you badly.	I would ask my partner why he/she had hurt my feelings. (V) I would say something equally mean right back to my partner. (E) I would try to understand that my partner may not have intended to hurt me. (L) I would give my partner the cold shoulder for awhile. (N)
5. Your partner tells friends about an embarrassing secret from your past.	I would imagine ways to get revenge. (N) I would yell at my partner not to do that again. (E) I would assume that my partner didn’t mean to embarrass me in front of friends. (L) I would nicely explain that I’d prefer that we keep embarrassing events to ourselves. (V)
6. Your partner forgets your birthday.	I would calmly remind my partner that it’s my birthday. (V) I would keep my anger bottled up inside me. (N) I would assume that my partner would eventually remember. (L) I would consider ending the relationship. (E)
7. You find out that your partner kissed someone else at a party.	I would understand that things got out of hand, and that my partner behaved in a very unusual manner on that occasion. (L) I would yell at my partner about how horrible he/she has behaved. (E) I would tell my partner I’m glad things didn’t go further than “just kissing.” (V) I would feel irritated at my partner for awhile. (N)
8. Your partner fails to support you when you’re really upset.	I would recognize that my partner’s life is busy, and deal with the situation myself. (L) I would decide to quit supporting my partner so much in the future. (N) I would ask if my partner is upset about something, and whether that caused him/her to let me down. (V) I would tell my partner I’m going to cut off the relationship unless things improve fast. (E)
9. Your partner flirts with a classmate.	I would insist that my partner apologize to me over and over again. (E) I would suggest that we go out to dinner and have a constructive talk about flirting. (V) I would recognize that I sometimes flirt, so I shouldn’t be too hard on my partner. (L) I would behave in a cold manner toward my partner for awhile. (N)
10. Your partner lies to you about something important.	I would feel angry that my partner can’t be honest with me. (N) I would tell my partner that I’d like us to try and resolve the situation. (V) I would try to understand the situation from my partner’s point of view. (L) I would come up with ways to get even with my partner. (E)
11. In a disagreement with a third person, your partner takes the other person’s side.	I would quit arguing, but would feel angry that my partner didn’t take my side. (N) I would imagine that my partner has strong feelings about this issue, because usually he/she is on my side. (L) I would do the same thing to my partner the next chance I had. (E) I would exert extra effort to understand my partner’s perspective on the issue at hand. (V)
12. Your partner says something bad about you behind your back.	I would feel so irritated that I wouldn’t be able to deal with the situation. (N) I would forgive my partner because I’ve done similar things in the past. (L) I would tell my partner that I hope we can work out this problem. (V) I would get even by saying bad things about my partner behind his/her back. (E)

Note: E = exit, V = voice, L = loyalty, and N = neglect.

Following these forgiveness items, participants completed two other measures: the 7-item subjective commitment subscale of the Investment Model Scale (Rusbult, Martz, & Agnew, 1998) to assess their commitment to their partner and the 40-item Balanced Inventory of Desirable Responding (Paulhus, 1984) to assess self-deception (20 items) and impression management (20 items). For the subjective commitment scale, participants indicated the extent to which they agreed with statements like “I want our relationship to last for a very long time” or “I am committed to maintaining my relationship with my partner” using a 9-point rating scale from 0 (do not agree at all) to 8 (agree completely). A single commitment score was calculated for each participant by reverse coding 1 item and then averaging the scores across the 7 items, with higher scores representing greater subjective commitment. For the self-deception and impression management scales, participants responded using a 7-point rating scale ranging from 1 (do not agree at all) to 7 (agree completely) to items like “I am fully in control of my own fate” or “I am a completely rational person.” Indices of self-deception and impression management were calculated following Paulhus’ scoring key: The negatively keyed items were reverse scored (10 items for each scale), and then extreme scores (i.e., a 6 or 7) were recoded to a value of 1 and all other scores were recoded to 0. Thus, the number of 1s for each scale corresponded to the number of extreme responses, and the sum of these recoded scores for each scale could range from 0 to 20, with higher scores representing greater self-reported self-deception and impression management.

Next, participants provided their age and gender, the current status of their relationship, how long they had been involved with their partner, the exclusivity of their relationship, how often they saw their partner, and how far away their partner lived at that time. Finally, participants completed a 6-item funnel debriefing questionnaire to assess whether they believed that the priming and forgiveness tasks were related and whether they realized that other study participants had received different primes (see Table 4). After participants were debriefed, the experimenter entered the priming condition and name of the lab into the Qualtrics survey to ensure that the prime and forgiveness measures were linked in the data file.

Table 4.

Debriefing Questionnaire Items

Items
1. Did anything about the study seem strange to you, or was there anything you were wondering about? 2. What was the purpose of the pilot study (administered on paper)? 3. What was the purpose of the main study (administered on the computer)? 4. Were the two studies related in any way? If yes, how? 5. Did completing the pilot study (administered on paper) influence your responses on the main study (administered on the computer)? If yes, how? 6. In the pilot study (administered on paper), did everybody receive the same writing prompts? If not, how might these prompts have differed?

Items

1. Did anything about the study seem strange to you, or was there anything you were wondering about?
2. What was the purpose of the pilot study (administered on paper)?
3. What was the purpose of the main study (administered on the computer)?
4. Were the two studies related in any way? If yes, how?
5. Did completing the pilot study (administered on paper) influence your responses on the main study (administered on the computer)? If yes, how?
6. In the pilot study (administered on paper), did everybody receive the same writing prompts? If not, how might these prompts have differed?

Stopping rules and exclusions

As part of the OSF preregistration, each lab indicated its stopping rule to end data collection, and the editor approved these procedures prior to pre-registration. The rules were designed to ensure that each lab would meet the minimum data collection requirements for the protocol and that the decision to end data collection would not be influenced by the results of the study.

Data from participants were excluded from analyses for any of the following reasons: participants were not in the required age range (18–24 years old), participants were not currently involved in a romantic relationship, participants did not follow instructions, participants did not complete all tasks, participants were aware of the different conditions of the study or suspected that the two studies were part of the same study (based on responses from the funnel debriefing questionnaire), or the experimenter did not administer the instructions or tasks correctly.

We created a set of guidelines for two levels of data exclusions due to the number of open-ended questions during the debriefing portion of the study. The first level of exclusions included cases that clearly met the criteria outlined in the protocol, and the second level of exclusions included cases that required judgment calls, such as participants indicating that the two studies may be related because they both look at dating relationships. The analyses reported here correspond to the first level of exclusions. An equivalent set of analyses taking the second level of exclusions into account is available on the OSF project page.

Results

The goal of an RRR is to provide a precise measure of the size of an effect by combining the results of multiple, independently conducted direct replications. The results of all studies are included regardless of their outcome, providing an unbiased meta-analysis of the effect. The analysis does not focus on null-hypothesis significance testing. Instead, we report the meta-analytic effect size for each outcome measure, along with the confidence interval around that effect size. Each individual laboratory was provided with an R script to analyze their data in a way that is consistent with the preregistered protocol. The script is available on the main OSF project page, and each laboratory’s results are available on their OSF project page, linked from the main project page.

The output of the script—following the preregistered plan—includes a measure of the overall effect, ignoring gender, on subjective commitment (manipulation check), and the primary outcomes that showed a priming effect in the original study (i.e., exit and neglect measures). It also provides the results for the voice and loyalty measures that, despite Finkel et al.’s hypotheses, did not show a differential effect of high and low commitment primes in the original study. Additional analyses provided on the OSF project page consist of models that include impression management and self-deception as covariates and a mediation analysis of the effect of subjective commitment for all four outcome measures. The analysis plan for the individual lab analyses was preregistered in the official protocol. These R scripts were written by Edison Choe and reviewed by Courtney Soderberg at the Center for Open Science. We verified the accuracy of the scripts by reproducing the original statistical results reported by Finkel et al. (2002) from their raw data.

A separate R script, using the same analysis functions, was written to conduct the meta-analysis across labs. The scripts provide meta-analyses for the subjective commitment manipulation check as well as for the two outcome measures that showed a priming difference in the original study (exit, neglect) and the two that did not (voice, loyalty). For each outcome measure, we provide a forest plot showing the overall difference between high and low commitment primes in each laboratory result (see Figs. 1 –5). At the top of each forest plot we show the original result from Finkel et al. (2002), and below the forest plot we provide the results of a random-effects meta-analysis across laboratories for that measure (the meta-analysis does not include the original Finkel et al. result). Below that meta-analytic result, we also provide meta-analyses for a model that includes gender as a moderator and a model that includes gender as a moderator and impression management and self-deception as covariates. Forest plots corresponding to all of the reported meta-analytic results are available on OSF, and Table 5 reports both the reliabilities of each outcome measure and the meta-analytic correlations between outcome measures.

Fig. 1.

Forest plot of the effect of the commitment prime manipulation on subjective commitment, with negative effects indicating lower scores for participants in the high commitment prime condition than the low commitment condition (high–low). The figure also shows the meta-analytic effect of the commitment prime on subjective commitment when gender was included as a moderator and when gender was included as a moderator and impression management and self-deception were included as covariates. The data are listed in alphabetical order by the name of the first author from each replicating team. For each team, the figure shows the mean subjective commitment score for the high and low commitment prime condition and a forest plot of the raw mean difference score. Bigger effect size markers indicate that the study has greater weight in the meta-analysis, where the weight is the inverse of the standard error. The High–Low column provides the values used in the forest plot.

Fig. 2.

Forest plot of the effect of the commitment prime manipulation on exit, with negative effects indicating lower scores for participants in the high commitment prime condition than the low commitment condition (high–low). The figure also shows the meta-analytic effect of the commitment prime on exit when gender was included as a moderator and when impression management and self-deception were included as covariates. The data are listed in alphabetical order by the name of the first author from each replicating team. For each team, the figure shows the mean exit score for the high and low commitment prime condition and a forest plot of the raw mean difference score. Bigger effect size markers indicate that the study has greater weight in the meta-analysis, where the weight is the inverse of the standard error. The High–Low column provides the values used in the forest plot.

Fig. 3.

Forest plot of the effect of the commitment prime manipulation on neglect, with negative effects indicating lower scores for participants in the high commitment prime condition than the low commitment condition (high–low). The figure also shows the meta-analytic effect of the commitment prime on neglect when gender was included as a moderator and when impression management and self-deception were included as covariates. The data are listed in alphabetical order by the name of the first author from each replicating team. For each team, the figure shows the mean neglect score for the high and low commitment prime condition and a forest plot of the raw mean difference score. Bigger effect size markers indicate that the study has greater weight in the meta-analysis, where the weight is the inverse of the standard error. The High–Low column provides the values used in the forest plot.

Fig. 4.

Forest plot of the effect of the commitment prime manipulation on voice, with negative effects indicating lower scores for participants in the high commitment prime condition than the low commitment condition (high–low). The figure also shows the meta-analytic effect of the commitment prime on voice when gender was included as a moderator and when impression management and self-deception were included as covariates. The data are listed in alphabetical order by the name of the first author from each replicating team. For each team, the figure shows the mean voice score for the high and low commitment prime condition and a forest plot of the raw mean difference score. Bigger effect size markers indicate that the study has greater weight in the meta-analysis, where the weight is the inverse of the standard error. The High–Low column provides the values used in the forest plot.

Fig. 5.

Forest plot of the effect of the commitment prime manipulation on loyalty, with negative effects indicating lower scores for participants in the high commitment prime condition than the low commitment condition (high–low). The figure also shows the meta-analytic effect of the commitment prime on loyalty when gender was included as a moderator and when impression management and self-deception were included as covariates. The data are listed in alphabetical order by the name of the first author from each replicating team. For each team, the figure shows the mean loyalty score for the high and low commitment prime condition and a forest plot of the raw mean difference score. Bigger effect size markers indicate that the study has greater weight in the meta-analysis, where the weight is the inverse of the standard error. The High–Low column provides the values used in the forest plot.

Table 5.

Correlation Matrix for Subjective Commitment and the Four Outcome Measures

Measures	1	2	3	4	5
1. Subjective commitment	.91	–.18	–.13	.22	.10
2. Exit		.79	.71	–.14	–.18
3. Neglect			.76	–.13	–.19
4. Voice				.76	.56
5. Loyalty					.72

Notes. Italicized values on the diagonal represent coefficient alpha internal consistency estimates and the values above the diagonal represent the correlation among the measures. All values reported are analyzed meta-analytically across labs.

The purpose of an RRR is to evaluate—in a confirmatory manner—the size of an effect observed in an original study. Although Finkel et al. (2002) predicted effects of priming on all four outcome measures, they only observed significant effects for exit and neglect. If the RRR precisely replicated the results of the original study, it would observe a similarly sized difference for exit and neglect, and a similar lack of a difference for voice and loyalty. Given that only exit and neglect showed effects in the original study, those measures are the primary focus of the replication effort.

We note, though, that the original study predicted effects for all four outcome measures. And other patterns of results could support the broader theory. For example, if all four outcome measures produced effects in the directions hypothesized by Finkel et al. (negative for exit and neglect, positive for voice and loyalty), such a pattern would differ from the results of the original study, but it would be consistent with the predictions made for the original study.

Manipulation check: Subjective commitment

In the original study, subjective commitment ratings were 1.33 points higher (statistically significant) in the high commitment than in the low commitment prime condition. Our meta-analysis showed an average difference of .02 points (95% confidence interval: −.07 to .10) between the two priming conditions (see Fig. 1). The difference between the high and low prime conditions ranged from −.21 to .29 across the included studies. The variability in the effect size among the studies (i.e., heterogeneity) was consistent with what would be expected by chance (τ = 0, I² = 0%, H² = 1.00, Q₁₅ = 5.35, p = .989).

Figure 1 shows that the overall effect of the priming condition on subjective commitment was not substantially moderated by gender. Figure 1 also shows the meta-analytic effect of the difference between high and low priming conditions in a model that includes both gender as a moderator and self-deception and impression management as covariates. The pattern in the model including covariates did not differ substantially from the model without the covariates.

Overall, across the 16 labs, there was no evidence that the commitment prime manipulation influenced subjective commitment.

Exit forgiveness

In the original study, exit ratings were .65 points lower (statistically significant) in the high commitment prime condition than in the low commitment prime condition. Our meta-analysis yielded a difference of −.06 points (95% confidence interval: −.17 to .05) with similar ratings of exit for the low and high commitment prime conditions (see Fig. 2). The difference between the high and low prime conditions ranged from −.42 to .29 across the included studies. The variability in the effect size among the studies (i.e., heterogeneity) was consistent with what would be expected by chance (τ = 0.0380, I² = 3.06%, H² = 1.03, Q₁₅ = 14.39, p = .496).

The overall effect of the priming condition on exit forgiveness was not substantially moderated by gender as shown in Figure 2. Figure 2 also shows the meta-analytic effect of the difference between high and low priming conditions in a model that includes both gender as a moderator and self-deception and impression management as covariates. The pattern in the model including covariates did not differ substantially from the model without the covariates.

In sum, across the 16 independent labs, there was a lack of evidence that priming commitment decreased exit forgiveness.

Neglect forgiveness

The findings from the original study showed that neglect ratings were .42 points lower (statistically significant) in the high commitment prime condition than in the low commitment prime condition. Our meta-analysis yielded a difference of −.06 points (95% confidence interval: −.18 to .07) with similar ratings of neglect for the low and high commitment prime conditions (see Fig. 3). The difference between the high and low prime conditions ranged from −.42 to .38 across the included studies. The variability in the effect size among the studies (i.e., heterogeneity) was consistent with what would be expected by chance (τ = 0.1052, I² = 19.09%, H² = 1.24, Q₁₅ = 18.09, p = .258).

The overall effect of the priming condition on neglect forgiveness was not substantially moderated by gender as shown in Figure 3. Figure 3 also displays the meta-analytic effect of the difference between high and low priming conditions in a model that includes both gender as a moderator and self-deception and impression management as covariates. The pattern in the model including covariates did not differ substantially from the model without the covariates.

The findings from these studies show that there was no effect of the commitment prime manipulation on neglect forgiveness.

Voice forgiveness

In the original study, voice ratings were .46 points higher (not statistically significant) in the high commitment prime condition than in the low commitment prime condition. Our meta-analysis yielded a difference of .03 points (95% confidence interval: −.08 to .13) with similar ratings of voice for the low and high commitment prime conditions (see Fig. 4). The difference between the high and low prime conditions ranged from −.33 to .44 across the included studies. The variability in the effect size among the studies (i.e., heterogeneity) was consistent with what would be expected by chance (τ = 0, I² = 0%, H² = 1.00, Q₁₅ = 11.19, p = .739).

The overall effect of the priming condition on voice forgiveness was not substantially moderated by gender as shown in Figure 4. Figure 4 also displays the meta-analytic effect of the difference between high and low priming conditions in a model that includes both gender as a moderator and self-deception and impression management as covariates. The pattern in the model including covariates did not differ substantially from the model without the covariates.

Overall, the findings from these studies show that there was no effect of the commitment manipulation prime on voice forgiveness.

Loyalty forgiveness

In the original study, loyalty ratings were .29 points higher (not statistically significant) in the high commitment prime condition than in the low commitment prime condition. Our meta-analysis yielded a difference of .00 points (95% confidence interval: −.08 to .09) with similar ratings of loyalty for the low and high commitment prime conditions (see Fig. 5). The difference between the high and low prime conditions ranged from −.37 to .32 across the included studies. The variability in the effect size among the studies (i.e., heterogeneity) was consistent with what would be expected by chance (τ = 0, I² = 0%, H² = 1.00, Q₁₅ = 8.81, p = .887).

The overall effect of the priming condition on loyalty forgiveness was not substantially moderated by gender as shown in Figure 5. Figure 5 also shows the meta-analytic effect of the difference between high and low priming conditions in a model that includes both gender as a moderator and self-deception and impression management as covariates. The pattern in the model including covariates did not differ substantially from the model without the covariates.

Overall, the findings from these studies show that there was no effect of the commitment prime manipulation on loyalty forgiveness.

Exploratory analyses

Given that the commitment prime manipulation was not effective across the studies, exploratory analyses examined whether there was a correlation between subjective commitment and each of the four response tendencies. The original study reported a negative correlation between subjective commitment and each of the destructive responses—exit (r = −.30, p < .01) and neglect (r = −.29, p < .01). Similarly, in the RRR, greater subjective commitment was associated with less exit and neglect forgiveness (see Table 5).

The original study did not report the correlation between subjective commitment and the constructive responses, but we should expect those correlations to be positive. As expected, in the RRR, greater subjective commitment was associated with more pro-relationship responses, with subjective commitment positively correlated with both voice and loyalty (see Table 5).

General Discussion

The results of this large-scale, multilab direct replication of Study 1 of Finkel et al. (2002) are not consistent with the original result that a high commitment prime leads to greater subjective commitment or less exit and neglect forgiveness responses than a low commitment prime. For both exit and neglect, most labs reported an effect of the commitment prime that was close to zero, with some labs (6 for exit, 8 for neglect) finding effects that were numerically in the opposite direction. For both exit and neglect, the meta-analytic effect remained near zero when including gender as a moderator and when including impression management and self-deception as covariates.

Although Finkel et al. (2002) had predicted an effect of the commitment prime on constructive responses (voice and loyalty), they did not observe those effects. The results from this RRR are consistent with that original result. Most labs found effects of the commitment prime on both voice and loyalty that were close to zero. And, the meta-analytic effect remained near zero when including gender as a moderator and when including impression management and self-deception as covariates.

One reason why the results from this RRR might have differed from the original study is due to the failure of the manipulation check. In the original study, participants in the high-commitment prime condition rated their subjective commitment 1.33 points higher on average (on a 9-point Likert scale) than did those in the low-commitment prime condition. In contrast, none of the RRR studies observed an effect of the priming condition on subjective commitment, with a meta-analytic effect near zero and little heterogeneity across labs. Given that the priming manipulation did not yield different subjective commitment between the conditions, it is not surprising that we failed to replicate the pattern of findings for the target outcomes (exit and neglect).

It is unclear why the RRR studies observed no effect of priming on subjective commitment when the original study observed a large effect. Given the straightforward nature of the priming manipulation and the consistency of the RRR results across settings, it seems unlikely that the difference resulted from extreme context sensitivity or from cohort effects (i.e., changes in the population between 2002 and 2015).

The findings from this RRR provide no evidence for (or against) the causal role of commitment in the forgiveness process. Although many studies have observed a correlation between subjective commitment and forgiveness (e.g., Cann & Baucom, 2004; Fehr et al., 2010; Karremans et al., 2003; McCullough et al., 1998; Tsang et al., 2006), Finkel et al. (2002) was the clearest evidence for a causal role. This RRR did not find a causal effect of subjective commitment on forgiveness. But, it also found the commitment priming manipulation to be ineffective in changing subjective commitment. The failure of the commitment priming manipulation to induce a change in subjective commitment leaves open the possibility that a different manipulation might reveal a causal effect of subjective commitment on forgiveness. Future research should use other manipulations of relationship commitment as well as more sensitive experimental designs (e.g., a Highly-Repeated Within-Person; see Molenaar & Campbell, 2009; Whitsett & Shoda, 2014) that measure the causal influence of commitment on the forgiveness process within rather than across individuals. Such within-person designs can also reveal heterogeneity across people in how commitment influences forgiveness.

Footnotes

Appendix—Participating Laboratories

Lead Lab

Irene Cheung, Huron University College at The University of Western Ontario

Lorne Campbell, The University of Western Ontario

Etienne P. LeBel, Berkeley Initiative for Transparency in the Social Sciences (BITSS), University of California, Berkeley

https://osf.io/mfjv8/

A total of 160 students (high commitment n = 80; low commitment n = 80) were recruited from the introductory psychology subject pool at the University of Western Ontario. Participants were tested in groups of 1 to 4 at a time using the provided materials, following the official protocol. All participants were compensated with course credit.

Participating Labs

Bülent Aykutoğlu, Middle East Technical University

Elçin Gündoğdu-Aktürk, Middle East Technical University

Ahmet Uysal, Middle East Technical University

https://osf.io/es9ur/

A total of 142 students (high commitment n = 72; low commitment n = 70) were recruited from the psychology subject pool at Middle East Technical University in Turkey. After exclusions, there were 107 participants (high commitment n = 55; low commitment n = 52). Participants were tested in groups of 1 to 2 at a time using the provided materials. Our study materials were translated into Turkish, but in all other respects, we followed the official protocol. Our preregistered plan specified that minimum sample size would exceed 100 (50 per condition) after exclusions, however, we were unable to recruit enough people to meet our target sample size for one condition, so we recruited 9 more participants.

Carrie A. Bredow, Hope College

Lindsey M. Root Luna, Hope College

https://osf.io/h5rgy/

A total of 160 students (high commitment n = 81; low commitment n = 79) were recruited from the psychology participant pool at Hope College. Participants were tested in groups of 1 to 4 at a time using the provided materials. We followed the official protocol in all respects.

Peter A. Caprariello, Stony Brook University

https://osf.io/cgbhn/

A total of 127 students were recruited from the business subject pool at Stony Brook University (high commitment n = 64; low commitment n = 62; condition information for 1 participant was missing due to experimenter error). Participants were tested individually using the provided materials. We deviated from the official protocol in one respect: At the end of the study, experimenters entered conditions by code (e.g., “A” or “B”) instead of by label (e.g., “high commitment prime”). One participant, for whom condition information was missing, was suspected to have completed the study twice, and both entries for this individual were excluded from analyses. All participants were compensated with course credit.

Rodrigo J. Carcedo, University of Salamanca

Noelia Fernández-Rouco, University of Cantabria

https://osf.io/sdnzg/

A total of 100 students (high commitment n = 50; low commitment n = 50) were recruited from the psychology subject pool at University of Salamanca. Participants were tested individually or in small groups no bigger than 10 at a time using the provided materials. Our study materials were translated into Spanish, but we followed the official protocol in all other respects. All participants took part in this study voluntarily. They did not receive any kind of compensation.

Kevin J. Carson, The University of Texas at Dallas

Conrad A. Corretti, The University of Texas at Dallas

Heidi S. Kane, The University of Texas at Dallas

Robert A. Ackerman, The University of Texas at Dallas

https://osf.io/n7wqs/

A total of 201 students (high commitment n = 101; low commitment n = 100) were recruited from the psychology subject pool at the University of Texas at Dallas. Participants were tested in groups of 1 to 4 at a time using the provided materials. Deviating from the official protocol, this replication attempt was a joint effort by two separate labs (i.e., the PAIR Lab and the Close Relationships and Health Lab) at the University of Texas at Dallas. In addition, because two of the researchers were faculty, we modified a portion of the script from the cover story to make it more believable to participants (instead of saying “one of my professors” when referencing the ostensibly unconnected study, we changed it to “one of our colleagues.”). On our application, we had said we would screen the demographics of our sample midway through data collection, and if we found that our sample was older, we would restrict future participants to be between the ages of 18 to 19. We further said that we would pursue the same strategy to ensure that our gender breakdown matches what is required by the protocol. Because we were concerned that sticking with this exact plan could hurt our chances of collecting enough people for the study, we asked the editor if we could deviate from this protocol. With the editor’s approval (who was blind to the actual data), we restricted the age range of participants to 18 to 21 years. In addition, we modified the posting of study sessions so that a smaller proportion of the time slots (or openings with those time slots) were available for women.

Rebecca J. Cobb, Simon Fraser University

Jennifer C. Pink, Simon Fraser University

Roanne D. Millman, Simon Fraser University

Jill M. Logan, Simon Fraser University

https://osf.io/w5gt9/

Of the 176 students who were recruited from the psychology subject pool at Simon Fraser University, 173 allowed their data to be included in the study (high commitment n = 89; low commitment n = 84). Participants were tested in small groups (range n = 1–9) using the provided materials and were compensated for their time with course credit. We followed the official protocol as instructed.

Nancy L. Collins, University of California, Santa Barbara

Jeffrey D. Bowen, University of California, Santa Barbara

Lauren A. Winczewski, University of California, Santa Barbara

Christopher Bromberg, University of California, Santa Barbara

https://osf.io/qhs5e/

A total of 102 students (high commitment n = 52; low commitment n = 50) were recruited from the psychology subject pool at the University of California, Santa Barbara. Participants were tested in groups of 1 to 6 at a time using the provided materials, and were compensated with course credit. Although our preregistered plan specified that we would obtain equal numbers of men and women, our final sample was composed of more women than men (77 women, 25 men). However, within gender groups, equal numbers of participants were assigned to the high and low (H/L) commitment conditions (men: n = 13/12; women: n = 39/38). Based on the exclusion criteria provided, no participants were excluded from data analysis.

Theresa E. DiDonato, Loyola University Maryland

Frank D. Golom, Loyola University Maryland

https://osf.io/2ijkx/

A total of 105 students currently in romantic relationships (high commitment n = 57; low commitment n = 48) were recruited from the psychology department’s participant pool at Loyola University Maryland. Participants were tested one or two at a time using the provided materials, following the official protocol. Although we attempted to recruit males who were also in relationships, this proved more difficult than expected: Our preregistered plan was to recruit 20%−80% females, and our final sample had 82% females (n = 86) with only 18% males (n = 19).

Paul T. Fuglestad, University of North Florida

Christopher T. Leone, University of North Florida

John S. Kim, Lesley University

https://osf.io/wj2uf/

A total of 124 students (high commitment n = 61; low commitment n = 63) were recruited from the psychology participant pool at the University of North Florida. Participants were tested in groups of 1 to 4 at a time using the provided materials. In all respects we followed the official protocol. Although our preregistered plan specified that participants would be compensated with course credit, we were unable to recruit enough men to meet our target sample size with that method. As a result, one man participated in exchange for $5 and 10 men participated in exchange for $10.

Rebecca M. Goldberg, Mississippi State University

H. Colleen Sinclair, Mississippi State University

Taylor Ritchey, Mississippi State University

Chelsea Ellithorpe, Mississippi State University

https://osf.io/cij64/

A total of 111 students (high commitment n = 57; low commitment n = 56) were recruited from Mississippi State University. Participants were tested in groups of 1 to 4 at a time using the provided materials. Participants were compensated in the form of course participation credit if in the subject pool or $15 if not. Official protocol was followed during the administration of this study and there was no deviation from the preregistered plan.

Lisa B. Hoplock, University of Victoria

Danu Anthony Stinson, University of Victoria

https://osf.io/hq5xc/

A total of 209 students (high commitment n = 103; low commitment n = 105; unknown condition due to computer error n = 1) were recruited from the psychology subject pool at the University of Victoria. Participants were tested in groups of 1 to 6 at a time using the provided materials. We followed the official protocol. Participants were compensated with course credit.

Tamara A. Sucharyna, University of Manitoba

Marian M. Morry, University of Manitoba

http://osf.io/d43kn/

A total of 187 students, (high commitment n = 97; low commitment n = 90) were recruited from the psychology subject pool at the University of Manitoba. We followed the official protocol in all respects. Participants were tested in groups of 1 to 10 at a time in a computer lab using the provided materials. Participants were compensated with course credit.

Natasha D Tidwell, Fort Lewis College

Sue Kraus, Fort Lewis College

https://osf.io/ayfet/

A total of 101 students (high commitment n = 51; low commitment n = 50) were recruited from lower division psychology courses at Fort Lewis College. Participants were tested in groups of 1 to 10 at a time using the provided materials. We do not have a traditional subject pool, so participants were recruited verbally in classes; in all other respects, we followed the official protocol.

Marek Vranka, Charles University in Prague

Štěpán Bahník, University of Würzburg

Petr Houdek, University of Economics, Prague

A total of 162 students (high commitment n = 81; low commitment n = 81) were recruited from the student subject pool of the PLESS laboratory. Participants were tested in groups of 6 to 15 at a time using the provided materials. Our study materials were translated into Czech. Due to logistic reasons, we distributed prime questionnaires after participants were seated and we wrote down IDs on the prime questionnaires after the participants left. In all other respects, we followed the official protocol. We did not deviate from our preregistered plan in any way.

Jose C. Yong, Singapore Management University

Norman P. Li, Singapore Management University

https://osf.io/n3cdk/

A total of 120 students (high commitment n = 61; low commitment n = 59) were recruited from the psychology subject pool at Singapore Management University. Participants were tested in groups of 1 to 8 at a time using the provided materials. The official protocol was followed precisely. Although our preregistered plan specified that participants would be compensated with course credit, we were unable to recruit enough people to meet our target sample size. Therefore, 81 participants participated in exchange for $10.

Declaration of Conflicting Interests

The authors declared that they had no conflicts of interest with respect to their authorship or the publication of this article.

Notes

References

Cann

Baucom

T. R.

(2004). Former partners and new rivals as threats to a relationship: Infidelity type, gender, and commitment as factors related to distress and forgiveness. Personal Relationships, 11, 305–318. doi:10.1111/j.1475-6811.2004.00084.x

Fehr

Gelfand

M. J.

Nag

(2010). The road to forgiveness: A meta-analytic synthesis of its situational and dispositional correlates. Psychological Bulletin, 136, 894–914. doi:10.1037/a0019993

Finkel

E. J.

Rusbult

C. E.

Kumashiro

Hannon

P. A.

(2002). Dealing with betrayal in close relationships: Does commitment promote forgiveness? Journal of Personality and Social Psychology, 82, 956–974. doi:10.1037/0022-3514.82.6.956

Karremans

J. C.

Van Lange

P. A. M.

Ouwerkerk

J. W.

Kluwer

E. S.

(2003). When forgiving enhances psychological well-being: The role of interpersonal commitment. Journal of Personality and Social Psychology, 84, 1011–1026. doi:10.1037/0022-3514.84.5.1011

McCullough

M. E.

Rachal

K. C.

Sandage

S. J.

Worthington

E. L.

Jr. Brown

S. W.

Hight

T. L.

(1998). Interpersonal forgiving in close relationships: II. Theoretical elaboration and measurement. Journal of Personality and Social Psychology, 75, 1586–1603. doi:10.1037/0022-3514.75.6.1586

Molenaar

P. C. M.

Campbell

C. G.

(2009). The new person-specific paradigm in psychology. Current Directions in Psychological Science, 18, 112–117. doi:10.1111/j.1467-8721.2009.01619.x

Paulhus

D. L.

(1984). Two-component models of socially desirable responding. Journal of Personality and Social Psychology, 46, 598–609. doi:10.1037/0022-3514.46.3.598

R Development Core Team. (2008). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from http://www.R-project.org

Rusbult

C. E.

(1993). Understanding responses to dissatisfaction in close relationships: The exit, voice, loyalty, and neglect model. In Worchel

Simpson

J. A.

(Eds.), Conflict between people and groups: Causes, processes, and resolutions (pp. 30–59). Chicago, IL: Nelson-Hall.

10.

Rusbult

C. E.

Martz

J. M.

Agnew

C. R.

(1998). The Investment Model Scale: Measuring commitment level, satisfaction level, quality of alternatives, and investment size. Personal Relationships, 5, 357–391. doi:10.1111/j.1475-6811.1998.tb00177.x

11.

Rusbult

C. E.

Zembrodt

I. M.

Gunn

L. K.

(1982). Exit, voice, loyalty, and neglect: Responses to dissatisfaction in romantic involvements. Journal of Personality and Social Psychology, 43, 1230–1242. doi:10.1037/0022-3514.43.6.1230

12.

Tsang

J. A.

McCullough

M. E.

Fincham

F. D.

(2006). The longitudinal association between forgiveness and relationship closeness and commitment. Journal of Social & Clinical Psychology, 25, 448–472. doi:10.1521/jscp.2006.25.4.448

13.

Whitsett

D. D.

Shoda

(2014). An approach to test for individual differences in the effects of situations without using moderator variables. Journal of Experimental Social Psychology, 50, 94–104. doi:10.1016/j.jesp.2013.08.008