Abstract
In the present study, we aimed to replicate and extend findings by Mehl, Vazire, Holleran, and Clark (2010) that individuals with higher well-being tend to spend less time alone and more time interacting with others (e.g., greater conversation quantity) and engage in less small talk and more substantive conversations (e.g., greater conversation quality). To test the robustness of these effects in a larger and more diverse sample, we used Bayesian integrative data analysis to pool data on subjective life satisfaction and observed daily conversations from three heterogeneous adult samples, in addition to the original sample (N = 486). We found moderate associations between life satisfaction and amount of alone time, conversation time, and substantive conversations, but no reliable association with small talk. Personality did not substantially moderate these associations. The failure to replicate the original small-talk effect is theoretically and practically important, as it has garnered considerable scientific and lay interest.
Keywords
It is now a well established and consensually acknowledged fact that social relationships are key to well-being (Argyle, 2001). What is much less clear, however, and what remains a source of considerable debate is whether it is primarily the quantity or the quality of our social encounters that matters. Several gold-standard experience-sampling studies show a linear relationship between how much time people spend interacting with others and how happy they tend to be (Lucas & Dyrenforth, 2006). However, other researchers have argued that the quality of everyday social encounters may be more important than the frequency with which people engage in social contact (e.g., Deci & Ryan, 2000; Myers, 1999).
In a recent study, Mehl, Vazire, Holleran, and Clark (2010) used the Electronically Activated Recorder (EAR; Mehl, 2017) to investigate how well-being relates to observational indicators of real-world conversation quantity and quality. Participants wore the EAR for 4 days while the device intermittently and unobtrusively recorded snippets of ambient sounds as they went about their days. Two measures of interaction quantity (time alone and time in conversation) and quality (amount of small talk and amount of substantive conversations) were derived from the sampled ambient sounds and linked to well-being. Further strengthening the evidence for the association with quantity, happier participants spent objectively less time alone and more time talking with others. The analyses also provided evidence in favor of quality: Happier participants engaged in more substantive conversations and had a lower ratio of daily small talk accounting for conversation quantity. Importantly, the effects were robust in the sense that they held for weekday and weekend conversations and across multiple measures of well-being.
However, with 79 participants, the sample size, though sizable for labor-intensive naturalistic observation research, was ultimately modest and by current standards too small to yield effects that can be expected to replicate (Maxwell, Kelley, & Rausch, 2008; Schönbrodt & Perugini, 2013).
Beyond statistical power, the generalizability of the sample was also limited in consisting only of college students. Although undergraduate samples are common and valuable in psychology, it is empirically and theoretically important to test whether effects found in a sample that is relatively homogeneous with respect to important social-context variables extend to other populations (Peterson, 2001). For example, undergraduate students are in an environment (i.e., college) that maximally affords getting to know new people and in a life stage where establishing new friendships is normative, relative to middle adulthood, when family and job demands might render the formation of new relationships more difficult and less important for social integration (Havighurst, 1981). In addition, later life stages often include critical life events (e.g., personal illness, trauma or death of a close friend or loved one), which can affect the habitual equilibrium between conversation quantity and quality.
Hence, the question of whether one’s well-being is first and foremost a function of the quantity of one’s daily conversations or rather a function of their quality remains unanswered in important ways. Well-powered replication studies using age- and context-heterogeneous samples are needed to establish the robustness and generalizability of effect parameters in single studies (Asendorpf et al., 2013). In addition, researchers are increasingly calling for changes in analytical strategies and recommend estimating posterior model probabilities (i.e., a Bayesian approach) in addition to (or instead of) standard frequentist parameters (Carlsson, Schimmack, Williams, & Bürkner, 2017; Johnson, Payne, Wang, Asher, & Mandal, 2017; Marsman et al., 2017). The Bayesian analytical approach seems particularly well suited for replication studies as it “provides a more flexible modeling framework, allows more appropriate quantification of the uncertainty around effect estimates” (Pitchforth & Mengersen, 2012, p. 118), and permits the integration of prior information.
Finally, Mehl and colleagues (2010) took into consideration that participants’ personalities might explain differences in well-being (cf. DeNeve & Cooper, 1998; Lucas & Fujita, 2000) by controlling for the Big Five personality domains in their analyses. However, as research shows, participants’ personalities exert organizational forces on how individuals act in, and interact with, their social environments (Mehl, Gosling, & Pennebaker, 2006). For example, researchers have suggested that extraverts might benefit more from social interactions than introverts (e.g., Harris, English, Harms, Gross, & Jackson, 2017). Hence, personality might also moderate the link between well-being and conversation quantity and quality, which was not tested in the original study because of limited statistical power.
Therefore, in the present study, we sought to replicate and extend the main findings reported by Mehl and colleagues (2010) and tested whether associations between life satisfaction, as a key component of subjective well-being, and conversation quantity and quality generalize beyond a student sample. To obtain robust overall effect estimates, we used Bayesian integrative data analytic methods by pooling life satisfaction and EAR-observed daily conversation information from three large and diverse samples of working adults in addition to the original data (N = 486). For daily conversation quantity, we expected a negative association between life satisfaction and spending time alone as well as a positive association with talking with others. For conversation quality, we expected a negative association between life satisfaction and having small talk as well as a positive association with substantive conversations. Given the known link between personality and life satisfaction, replicating the exploratory analyses conducted by Mehl and colleagues (2010), we examined whether the associations between life satisfaction and the conversation variables hold even after controlling for personality. In addition, we tested whether the associations between life satisfaction and daily conversation patterns were moderated by personality but without proposing specific hypotheses (including all Big Five dimensions for the sake of empirical completeness).
Method
In all studies, (a) the total number of excluded observations and the reasons for making those exclusions have been reported, (b) all independent variables have been reported (see also the Supplemental Material available online), (c) all (available) dependent variables or measures that were analyzed for this article’s target research question have been reported, and (d) how sample size was determined has been reported.
Participants and procedures
We report how we determined our sample size, all data exclusions, and all available happiness and conversation measures for each study below. Information about the study samples and measures is summarized in Tables 1 through 4.
Sample Characteristics
Note: EAR = Electronically Activated Recorder.
For the Big Five measures, N differs because of missing values (Study 3: N = 180; Study 4: N = 120).
Descriptive Statistics for Key Study Measures
Note: Higher values indicate higher levels of the specific trait (range = 1–7).
Descriptive Statistics for All Observational Measures
To account for individual differences in number of conversations, we computed normalized variables as the number of small talk or substantive conversations relative to a person’s total number of recorded conversations. bData were single coded; intercoder reliabilities were computed on the basis of a set of training sound files that all coders coded, and intraclass correlation coefficient (ICC)[2, k] reliabilities were computed at the sound-file level. cData were double coded; intercoder reliabilities were computed across the two coders on the aggregated average sound-file level (ICC[1, 2]).
Pearson Correlations Between All Study Variables
Note: All correlations in boldface are significant at p < .05, without correcting for multiple comparisons.
Study 1
This sample constituted the original sample reported in Mehl and colleagues (2010). Eighty undergraduate students recruited primarily from introductory psychology courses at the University of Texas at Austin completed a series of questionnaires and wore the EAR for 4 days. One participant failed to provide valid EAR data and was excluded from all analyses.
Studies 2a and 2b
As part of a larger study, breast cancer patients and their cohabiting partners were recruited from the Arizona Cancer Center (University of Arizona, Tucson) during regular visits to an oncologist. Couples were eligible if the female partner had a primary diagnosis of Stage I, II, or III breast cancer, they were living together in a marriage-like relationship, and spoke primarily English in their daily conversations. Fifty-six couples gave their consent to participate in the study. Fifty patients (Study 2a) and 51 caregiving partners (Study 2b) provided usable data for the current analyses. Both partners completed a series of questionnaires and wore an EAR device for 3 consecutive days from Friday afternoon to Monday morning (for more details, see Robbins, López, Weihs, & Mehl, 2014).
Study 3
As part of a larger study, 261 medically healthy adults living in Atlanta, Georgia, were recruited by the Emory University Center for Health and Well-Being to participate in a randomized controlled trial of a meditation intervention (ClinicalTrials identifier NCT01643369). A total of 184 participants provided usable data for the current analyses. They completed a series of questionnaires and wore the EAR for 3 days (Friday night through Monday morning) before and after an 8-week meditation intervention (for more details, see Kaplan et al., 2017).
Study 4
As part of a larger study about coping with divorce, 133 adults recently separated from their marital partners were recruited from the larger Tucson, Arizona, area. In all, 122 participants provided usable data for the current analyses by completing sets of questionnaires at Time 1 (initial time of assessment), at Time 2 (3-month follow-up), and at Time 3 (5-month follow-up). At each time of assessment, they also wore the EAR for 3 consecutive days (typically Friday to Monday; for more details, see Hasselmo et al., 2018).
Measures
Life satisfaction
All participants completed the Satisfaction With Life Scale (SWLS; Diener, Emmons, Larsen, & Griffin, 1985), a five-item instrument measuring global evaluations of life satisfaction (from 1, strongly disagree, to 7, strongly agree). Internal consistency was high in all studies (αs ranged from .84 to .93).
Personality
All participants completed the Big Five Inventory (BFI; John, Donahue, & Kentle, 1991), a 44-item instrument assessing personality at the global level of the Big Five personality dimensions (from 1, strongly disagree, to 7, strongly agree). Internal consistencies for Extraversion, Agreeableness, Conscientiousness, Neuroticism, and Openness to Experience were satisfactory in all studies (αs ranged from .69 to .87).
Observational measures
To obtain an estimate of the objective frequency of participants’ daily conversations, we asked participants to wear the EAR from the time they woke up in the morning until they went to bed at night. The EAR is a digital audio recorder that unobtrusively samples daily behavior by intermittently recording snippets of ambient sounds. It captures samples between 5% and 10% of participants’ waking hours without them knowing exactly when it is recording. The EAR has been successfully used, with good acceptance and adherence, in various samples that are diverse with respect to age, gender, ethnicity, and location (Mehl & Holleran, 2007). Established privacy-protection and data-confidentiality guidelines (Mehl, 2017; Robbins, 2017) were followed in all studies. Specifically, it was ensured that participants had an opportunity to review their recordings and delete sound files they preferred to remain private. All participants received financial compensation at the end of the study.
The EAR sampling rates varied across studies (30 s or 50 s recordings every 9 min or 12.5 min). For each recording, two trained research assistants (except in Study 1) independently coded whether, in any given sound file, the participant was alone or talking with other people (conversation quantity) and whether a captured conversation was small talk or a substantive conversation (conversation quality). We used the conversational-purpose coding system (adapted from the original study, i.e., Mehl et al., 2010). Codings were mutually exclusive but nonexhaustive; that is, for some conversations, neither category applied (e.g., conversations could also be coded as practical conversations, personal/emotional disclosure, or gossip; for more detail, see Table S1 in the Supplemental Material or the Open Science Framework EAR Repository at https://osf.io/74x3c/). Small talk was defined as an uninvolved, banal conversation in which only trivial information was exchanged (e.g., “I stepped on something”; “What are you up to?”). Substantive conversations were defined as involved conversations in which meaningful information was exchanged (e.g., “There are a lot of high stress A-type personalities there,” “They have already raised 10 million dollars for Haiti”). Because the data were coded by only one coder in Study 1, intercoder reliabilities were computed on the basis of a set of training sound files that all research assistants coded, and intraclass correlation (ICC)[2, k] reliabilities were computed at the sound-file level. For all other studies, ICCs were computed across the two coders on the aggregated, average sound-file level (ICC[1, 2]). ICCs were satisfactory for all categories, ranging from .67 to .98 (Table 3), except for small talk in Study 4 (ICC[1, 2] = .36). For each participant, we converted the EAR codes into relative frequencies (i.e., percentage of valid waking recordings in which a category applied). Because Study 3 was an intervention with implications for conversation behavior, we included only the preintervention data; because Study 4 was an observational study, we averaged across the three assessments to obtain maximally reliable estimates of participants’ (general) life satisfaction and conversation patterns. To account for individual differences in the amount of daily conversations, we also computed the percentage of conversations that were small talk or substantive (labeled as normalized).
Statistical analysis
Following recommendations for the “new statistics” (Cumming, 2014; Kruschke & Liddell, 2018), we opted against a hypothesis-testing approach (e.g., interpreting p values or Bayes factors) in favor of an effect-size-estimation approach. The Bayesian estimation framework appeared particularly suited for a replication analysis, as it allows incorporating prior information and appropriately quantifies the uncertainty around effect estimates. Data, R code, and supplementary analyses are available on the Open Science Framework at osf.io/hp2wx.
First, we conducted a series of Bayesian linear regression analyses (five chains, 10,000 iterations, burn-in period of 1,000 steps) separately for each study and for each predictor using the brms R package (Buerkner, 2017). Visual examination of the chain trajectories suggested converging results. To facilitate interpretation and to be compatible with the analyses in the original publication, we transformed posterior Bayesian point estimates of the beta weights as well as the corresponding credible intervals into a correlation metric. The outcome measure was normally distributed, and we therefore used a standard Gaussian regression model. We ran the models using the parameter estimates from Study 1 as priors for Study 2 to Study 4. However, the parameter estimates from the Mehl et al. (2010) study may have been biased because of the small sample size. In the absence of an established literature to guide our choice of priors, we reran the models using noninformative flat priors (the brms default) for all estimated parameters. The posterior distributions were relatively robust to changes in the specification of the prior distributions. We therefore report only the set of analyses with flat priors (supplementary analyses including Study 1 priors are provided at osf.io/hp2wx).
Second, to derive overall estimates for the associations between life satisfaction and the EAR-derived variables for conversation quantity and quality, we pooled the raw data of all four samples. Pooling raw data across samples rather than aggregating sample-based summary statistics has the advantage of increasing variability and, hence, statistical power (Scheibehenne, Jamil, & Wagenmakers, 2016; see also Curran & Hussong, 2009). However, when choosing such an approach, one needs to consider nonindependence because participants are nested within studies. Failing to account for between-study variability and nonindependence can lead to biased results (e.g., Carlsson et al., 2017). Conceptually, random-effects models are the gold standard for this type of data structure. Practically, however, with only four studies 1 and in the absence of strong prior knowledge, Bayesian random-effects models failed to reliably converge. Note that our analyses on the pooled data, therefore, assume a fixed effect. To nevertheless account for between-sample sources of variability, and to formally test whether associations differed between studies, we included effect codes for study membership and the corresponding interaction terms (i.e., target predictor by study membership). The Watanabe-Akaike information criterion (WAIC) and leave-one-out (LOO) cross-validation (Vehtari, Gelman, & Gabry, 2017) were used to compare model fit of different models applied to the same data on a conventional scale of “deviance” similar to the Akaike information criterion (AIC; lower WAICs and LOOs denote better model fit). Model comparisons revealed that models with and without interaction terms had a very similar fit, indicating little benefit to allowing effects to vary between studies. This was true for all six conversation variables. Consequently, we excluded the interaction terms from the final models and controlled only for study differences in average life satisfaction.
These final models then also served as reference to test (a) whether participants’ personalities accounted for the associations between life satisfaction and daily conversation quantity or quality and (b) whether personality 2 moderated these associations. Theoretically, we were most interested in extraversion as a potential moderator (because of its known association with social relationships and life satisfaction), but we included all Big Five dimensions for empirical completeness.
Finally, to complement the Bayesian approach, we also conducted a traditional frequentist random effects meta-analysis (REMA). We separately calculated effect sizes for all studies and used the Exploratory Software for Confidence Intervals (Cumming, 2014) to obtain the overall maximum likelihood correlation estimates and the corresponding confidence intervals.
Results
Daily conversation quantity
As shown Figure 1a, when considered separately, two out of three new studies (Studies 2 to 4) provided evidence that a negative association between life satisfaction and spending time alone was among the 95% most credible values. When pooled across all studies, the overall posterior Bayesian point estimate of the correlation (r) was −.19, 95% confidence interval (CI) = [−.28, −.09], approximately half the size of the original study. Following the analytic procedures of the original study, we also tested whether personality differences accounted for this correlation. When simultaneously controlling for participants’ Big Five scores in the models, we found that the effect of spending time alone was essentially unaffected (r = −.18, 95% CI = [−.26, −.09]). The traditional frequentist REMA produced a mean effect-size r of −.18 (95% CI = [−.34, −.02]).

Posterior Bayesian point estimates for the associations between life satisfaction and (a) spending time alone and (b) spending time talking with others. The posterior Bayesian point estimates (PBPEs, in standardized correlation metric) are indicated as dots or diamonds (size corresponds to sample size), and 95% credible intervals are indicated as horizontal lines. Grand totals include pooled data from Study 1, Study 2a, Study 3, and Study 4 (N = 429). The parallel analysis including data from Study 2b produced similar results. B5resid refers to the effect after we controlled for participants’ Big Five personality domain scores. In the random effects meta-analysis (REMA), the effect represents the frequentist maximum likelihood estimate and confidence intervals.
The sample-specific posterior correlation estimates for well-being and talking with others were positive for all but Study 2b (Fig. 1b). When pooled across studies, the data indicated that life satisfaction was moderately positively associated with talking with others (r = .22, 95% CI = [.13, .31]), less strongly than in the original study. Again, controlling for participants’ Big Five scores did not substantially alter the mean posterior correlation estimate (r = .18, 95% CI = [.10, .26]), and the frequentist REMA resulted in a similar mean effect-size r of .24 (95% CI = [.14, .34]).
Daily conversation quality
The sample-specific results for the associations of life satisfaction with small talk (Fig. 2) and with substantive conversations (Fig. 3) demonstrated that posterior Bayesian point estimates and credible intervals varied among the new studies to different degrees for the two conversation quality variables. Posterior Bayesian correlation estimates (rs) for the association between small talk and life satisfaction ranged from −.16 (95% CI = [−.44, .12], Study 2a) to .15 (95% CI = [−.03, .32], Study 4). Similarly, for normalized 3 small talk, a correlation of zero was among the 95% most credible correlations for all but the original study. Results for Studies 2 to 4 remained robust even when the posterior distribution from Study 1 was set as a prior for the regression coefficient. Hence, when collapsed across samples, the most likely posterior point estimates (rs) were .05 (95% CI = [−.04, .14]) for small talk and −.08 (95% CI = [−.17, .01]) for normalized small talk. The credible intervals of the two indicators for small talk also contained zero after accounting for the Big Five scores (frequency: r = .08, 95% CI = [−.001, .16]; normalized: r = −.01, 95% CI = [−.09, .07]). The traditional REMA linking life satisfaction to small talk resulted in a mean correlation estimates (rs) of .06 (95% CI = [−.07, .18]) for small talk and −.09 (95% CI = [−.23, .05]) for normalized small talk.

Posterior Bayesian point estimates for the association between (a) small talk and (b) normalized small talk. The posterior Bayesian point estimates (PBPEs, in standardized correlation metric) are indicated as dots or diamonds (size corresponds to sample size), and 95% credible intervals are indicated as horizontal lines. Grand totals include pooled data from Study 1, Study 2a, Study 3, and Study 4 (N = 429). The parallel analysis including data from Study 2b produced similar results. B5resid refers to the effect after we controlled for participants’ Big Five personality domain scores. In the random effects meta-analysis (REMA), the effect represents the frequentist maximum likelihood estimate and confidence intervals.

Posterior Bayesian point estimates for the association between (a) substantive conversation and (b) normalized substantive conversation. The posterior Bayesian point estimates (PBPEs, in standardized correlation metric) are indicated as dots or diamonds (size corresponds to sample size), and 95% credible intervals are indicated as horizontal lines. Grand totals include pooled data from Study 1, Study 2a, Study 3, and Study 4 (N = 429). The parallel analysis including data from Study 2b produced similar results. B5resid refers to the effect after we controlled for participants’ Big Five personality domain scores. In the random effects meta-analysis (REMA), the effect represents the frequentist maximum likelihood estimate and confidence intervals.
In contrast, posterior correlation estimates for substantive conversations were positive for all but Study 2b. When pooled across studies, the data revealed that life satisfaction was moderately associated with having more substantive conversations (frequency: r = .23, 95% CI = [.14, .32]; normalized: r = .15, 95% CI = [.06, .24]), similar to the original study. The magnitude of the associations was only slightly diminished when we controlled for participants’ Big Five scores (frequency: r = .18, 95% CI = [.10, .26]; normalized: r = .10, 95% CI = [.02, .18]). The frequentist REMA linking life satisfaction to substantive conversation yielded effect sizes numerically similar to the Bayesian results (frequency: r = .25, 95% CI = [.15, .34]; normalized: r = .15, 95% CI = [.06, .25]).
Personality as a moderator
Although we refrained from formulating distinct a priori hypotheses about how personality might moderate the observed effects (but, theoretically, were most interested in extraversion as a potential moderator because of its relationship to both participants’ social lives and their life satisfaction), we tested whether the slope of the relationship between conversation quantity or quality and life satisfaction differed for people who had different personality traits (e.g., whether introverted people benefitted more or less from small talk than did extraverted people). The additional moderation analyses revealed that individual differences in personality did not substantially moderate the strength of the links between life satisfaction and any of the six conversation variables. Model comparisons using the WAIC and the LOO (Vehtari et al., 2017) revealed that models with interaction terms did not yield better model fit than models without them, providing no convincing evidence for moderation. Either (a) the WAICs or LOOs were substantially smaller in the reference model or (b) the estimated difference (ΔLOOIC) of the expected LOO prediction errors between the moderation model and the reference model included zero in its confidence interval, suggesting no difference in model fit between the models (so the more parsimonious model is to be preferred). In addition, all credible intervals for the posterior point estimates of the interaction terms contained zero (see Tables S2 and S3 in the Supplemental Material). This suggests that, for example, small talk or substantive conversations seemed not to be appreciably more (or less) associated with life satisfaction for extraverted than for introverted participants in our sample.
Discussion
We aimed to replicate and extend previous findings about how conversational properties of social interactions relate to life satisfaction in a heterogeneous adult sample. For daily conversation quantity, we replicated the results reported by Mehl and colleagues (2010) and found evidence for medium-sized associations between life satisfaction and spending time alone (negative) or talking with others (positive), although the negative association for spending time alone was smaller than in the original study.
For daily conversation quality, not all effects were replicated. In accordance with the original findings reported by Mehl and colleagues (2010), we found solid evidence that engaging in substantive conversations was moderately associated with life satisfaction, above and beyond personality characteristics. Our analyses indicated that participants who reported higher life satisfaction than one would expect on the basis of their personality had not only more, but also more substantive, conversations than their less satisfied counterparts with similar personalities.
In contrast, the medium-sized negative association between life satisfaction and the percentage of small-talk conversations—our second indicator of conversation quality—did not replicate: Whether people had more or less small talk in daily life was not reliably associated with life satisfaction. This was true for the overall frequency of small talk as well as the normalized small talk–conversation ratio. The negative correlation suggested by the underpowered original study seems to have been a false-positive finding.
The associations between life satisfaction and conversation quantity and between life satisfaction and substantive conversation were reliably detected in two out of the three new studies. The findings are particularly compelling, as there is no method overlap between the observational conversation codes and the self-report measures of life satisfaction that could explain the links. Mean effect estimates were somewhat smaller than the ones originally reported. This can be expected even in well-powered replication studies and is likely a result of publication bias inflating effect sizes in original studies (Lakens & Etz, 2018). Indeed, given its small sample size and lack of preregistered study and analysis plan, the original study might have been capitalizing on chance, overestimating the true effects. In fact, three out of the four studies—taken individually—were underpowered. This fact underscores the observation that isolated results from small studies should be interpreted with caution and that pooling multiple smaller studies (assuming the absence of a selection bias for inclusion) can be an effective way of increasing power and more reliably estimating population effects (cf. Curran & Hussong, 2009).
The current study further illustrates how the EAR can provide a window into everyday social experiences. Recently, other research has shown how digital devices (e.g., smartphones, cameras) can successfully be used to collect data to examine psychological, behavioral, and health-related phenomena as they naturally occur in everyday life (e.g., Brown, Blake, & Sherman, 2017; Lathia, Sandstrom, Mascolo, & Rentfrow, 2017). Whereas past research often focused on between-persons comparisons, these new devices are particularly suited to disentangling the within- versus between-persons associations between social activity and well-being. Combining the EAR method with mobile sensing and experience-sampling assessments would enable researchers to compare participants’ happiness on days when they have a lot of small talk with days when they predominately engage in substantive conversations. For example, a recent study found evidence for within-persons as well as between-persons links between social interactions and momentary health indicators in daily life (Bernstein, Zawadzki, Juth, Benfield, & Smyth, 2018).
Interestingly and somewhat surprisingly—and in contrast to our expectations specifically with regards to extraversion—we did not find evidence of personality moderating the identified effects. Absence of evidence, however, should not be mistaken for evidence of absence, and future research should follow up on this aspect. Empirically, although our study was well-powered to detect small moderation effects according to conventional power analyses, our effective power may ultimately have been considerably lower considering that it is becoming clear that the field has historically operated on an optimistic consensus of what constitutes small, medium, and large effects (Aguinis, Beaty, Boik, & Pierce, 2005). Hence, it is possible that our study was underpowered to detect small or even medium-sized moderation effects. Theoretically, extraversion might not have emerged as a moderator for the small-talk effect because only the cognitive component of subjective well-being—life satisfaction—was assessed in the current study. Because small talk occurs more often with strangers than with close others, this type of conversation may be an emotionally unpleasant experience for introverts who are reserved but an emotionally pleasant experience for extraverts who are outgoing. Therefore, moderation might be more apparent for the emotional component of well-being that assesses positive and negative affect. Future research should integrate the affective component of well-being, model personality as a latent rather than an observed variable, and examine whether effects of naturalistically observed conversation quantity and quality extend beyond psychological well-being to physical well-being.
The findings reported here should be considered in light of several further limitations. First, even though our explicit goal was to capitalize on the commonalities of four unique samples, results might still be limited with respect to generalizability to the general population. Second, our data are correlational and therefore causally ambiguous: Whether it is the satisfied person who attracts more substantive conversations or whether having substantive conversations makes people more satisfied with their lives is still to be clarified in future (experimental) research. Knowledge about the mechanisms of whether and how substantive conversations relate to better well-being could inform health counselors and inspire innovative interventions. Third, we did not account for the broader social context of the sampled interactions, that is with whom participants were having small talk or substantive conversations (e.g., a stranger vs. a friend). Future research should clarify whether small talk and substantive conversations may relate differentially with well-being in different social or normative contexts. For example, Sandstrom and Dunn (2014) showed that more daily weak-tie interactions (e.g., a small chat with a coffee barista, work colleague, yoga classmate) predicted greater average well-being.
Overall, our findings are consistent with prior intensive longitudinal studies showing that the quantity as well as quality of social interactions matter for well-being (e.g., Carmichael, Reis, & Duberstein, 2015). However, it might be that one of the “active ingredients” of having a conversation is, in fact, that the conversation meets a certain level of meaningfulness. As Baumeister and Leary (1995) argue, mere social contact does not satisfy the basic need to belong. Having close relationships and—as our results suggest—meaningful, substantive conversations might be a key ingredient to a satisfied life.
Supplemental Material
MilekOpenPracticesDisclosure – Supplemental material for “Eavesdropping on Happiness” Revisited: A Pooled, Multisample Replication of the Association Between Life Satisfaction and Observed Daily Conversation Quantity and Quality
Supplemental material, MilekOpenPracticesDisclosure for “Eavesdropping on Happiness” Revisited: A Pooled, Multisample Replication of the Association Between Life Satisfaction and Observed Daily Conversation Quantity and Quality by Anne Milek, Emily A. Butler, Allison M. Tackman, Deanna M. Kaplan, Charles L. Raison, David A. Sbarra, Simine Vazire and Matthias R. Mehl in Psychological Science
Supplemental Material
MilekSupplementalMaterial – Supplemental material for “Eavesdropping on Happiness” Revisited: A Pooled, Multisample Replication of the Association Between Life Satisfaction and Observed Daily Conversation Quantity and Quality
Supplemental material, MilekSupplementalMaterial for “Eavesdropping on Happiness” Revisited: A Pooled, Multisample Replication of the Association Between Life Satisfaction and Observed Daily Conversation Quantity and Quality by Anne Milek, Emily A. Butler, Allison M. Tackman, Deanna M. Kaplan, Charles L. Raison, David A. Sbarra, Simine Vazire and Matthias R. Mehl in Psychological Science
Footnotes
Acknowledgements
The present analyses reproduce the findings reported by Mehl, Vazire, Holleran, and Clark (2010) but otherwise do not overlap with findings reported in other articles published from the included data sets.
Action Editor
Brent W. Roberts served as action editor for this article.
Author Contributions
A. Milek and M. R. Mehl developed the study concept. A. Milek and E. A. Butler performed the analyses with input from M. R. Mehl. A. Milek, E. A. Butler, and M. R. Mehl drafted the manuscript. A. M. Tackman, D. M. Kaplan, C. L. Raison, D. A. Sbarra, and S. Vazire provided critical revisions. All the authors approved the final version of the manuscript for submission.
Declaration of Conflicting Interests
C. L. Raison has served as a consultant for Alkermes, Novartis, Usona Institute, and Emory Healthcare. C. L. Raison also serves on the steering committee for the North American Center for Continuing Medical Education. The other authors declared that they had no conflicts of interest with respect to their authorship or the publication of this article.
Funding
A. Milek’s work on this article was supported by the Swiss National Science Foundation (P2ZHP1_164959). The research was supported by National Institutes of Health Grants 3R01AT004698, 5R01AT004698, R01HD069498, and R03CA137975.
Open Practices
All data and materials have been made publicly available via the Open Science Framework and can be accessed at osf.io/hp2wx. The design and analysis plans for the studies were not preregistered. The complete Open Practices Disclosure for this article can be found at https://journals-sagepub-com.web.bisu.edu.cn/doi/suppl/10.1177/0956797618774252. This article has received badges for Open Data and Open Materials. More information about the Open Practices badges can be found at
.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
