Abstract
This study introduces the concept of acoustic-prosodic entrainment (ways people speak similarly). We review prior research on entrainment theory and methods from computational linguistics, and then apply this concept to team research by examining the relationship between team personality composition and subsequent entrainment in an exploratory case study. With 62 teams playing a cooperative board game, team average Agreeableness and team Agreeableness diversity positively, and Openness to Experience diversity negatively, preceded different kinds of entrainment. This study suggests entrainment is not a singular construct. Small group researchers could leverage technological, methodological, and conceptual advances in computational linguistics to study emergent team processes.
Keywords
The analysis of conversations has emerged and persisted as one method for studying team emergent states and processes (e.g., Ervin et al., 2017; Kane & van Swol, 2022; Lehmann-Willenbrock & Chiu, 2018; Paletz et al., 2017). One aspect of the back-and-forth nature of communication between individuals within teams is that individuals may speak similarly to each other over time. This phenomenon has been described variously across disciplines as mimicry (e.g., in psychology, Chartrand & Lakin, 2013), accommodation and adaptation (in communication, Giles et al., 1991), or entrainment (in computational linguistics, Levitan et al., 2012). Entrainment research from computational linguistics, which is our focus, has involved both the automation of measurement—an increasingly important technique to speed up communication research—and the development of conversational robots and tutors with the hope that their entrainment to their human partners will increase human rapport and learning (e.g., Gessinger et al., 2021; Kory-Westlund & Breazeal, 2019; Lubold et al., 2018). Connecting this research to questions and methods within small group research can benefit both the computational linguistics study of communication and team science.
We have two goals for this paper. First, we introduce and explain how acoustic-prosodic entrainment is conceptualized and measured within computational linguistics to the Small Group Research readership. We contribute by conceptualizing acoustic-prosodic entrainment as a team construct, which is rarely done in its current form and could be a benefit to team science. We detail how entrainment as we define it differs from entrainment as described in Ancona and Chong (1992, 1996). In addition, our field of small group research too rarely measures behavioral processes directly (Lehmann-Willenbrock & Allen, 2018) or conducts language analysis (Kane & van Swol, 2022). Measures of dynamic processes can be difficult and/or rare (Luciano et al., 2018). By drawing on new measures of acoustic-prosodic entrainment, we heed calls to bridge computer science, particularly sensor-based research, and social science to examine interactions (Lehmann-Willenbrock & Hung, 2023; Lehmann-Willenbrock et al., 2017; Müller et al., 2019). While developed in a different field for different purposes, acoustic-prosodic entrainment involves concepts and measures of team conversations to the benefit of theorizing and hypothesis-testing research within small group research.
The second goal of this paper is to demonstrate how using the concept and measurement of entrainment can achieve new insights via a case study. This case explores potential relationships between team personality composition and subsequent entrainment. Research on why and under what circumstances team entrainment may occur is still in its infancy. Small group research often presumes an input-process-output-input model (e.g., Ilgen et al., 2005), even though teams are increasingly considered to be multilevel, dynamic, networked, and complex (Mathieu et al., 2019). In team science, there are a variety of relationships between team personality composition (as inputs) and team outcomes such as performance (e.g., Barrick et al., 1998; Bell, 2007; Mohammed & Angell, 2003; Peeters et al., 2006; Prewett et al., 2009)—with emergent team processes presumed to link the two. While much research on entrainment has examined subsequent outcomes (e.g., Beňuš, 2014; Kory-Westlund & Breazeal, 2019), only some studies have examined potential antecedents to entrainment (e.g., Danescu-Niculescu-Mizil et al., 2012; Kämpf et al., 2018; see discussion later). These studies also generally examine entrainment as the behavior of one individual toward another (focusing on the individual level, e.g., a participant entraining to someone speaking another language) or interacting dyads (e.g., dyadic level of two people converging), rather than as a team level emergent process. From the perspective of those who wish to increase entrainment in human teams due to its theorized connection to rapport and other positive social outcomes, the literature on team composition antecedents is limited. Thus, our study explores whether and what types of team personality composition might impact what types of acoustic-prosodic entrainment, as communication emergent processes. In addition, this case study gives us a greater understanding of what types and measures of acoustic-prosodic entrainment are distinct from each other, and their potential association with different team personality precursors.
As a quantitative, multidisciplinary, exploratory study that can help to build new theories about processes (Bamberger & Ang, 2016), this research tackles the relationships between variables from two fields: Our entrainment measures were created, and corpus collected, using one approach—computational linguistics/intelligent systems—but we ask questions based in the social/personality psychology of teams. For this case study, we leverage the Teams Corpus (Litman et al., 2016), lab data collected for multidisciplinary purposes. Within this paper, we also reflect on the challenges and opportunities of conducting cross-disciplinary team science.
Acoustic-Prosodic Entrainment Theory and Research
Sociolinguistics researchers have long suggested that individuals in dyads and teams automatically start to speak more similarly to each other (Giles et al., 1973). Communication Accommodation Theory suggests that this similarity occurs, such that “individuals adapt to each other’s communicative behaviors in terms of a wide range of linguistic-prosodic-nonverbal features” (Giles et al., 1991, p. 7). This theory proposes that (even) unconscious entrainment 1 occurs due to social identity and social integration motives, based on research suggesting that similarity leads to greater liking and social attractiveness (Giles et al., 1991) and that similarity indicates social closeness versus distance (Beňuš, 2014). Not all people speak similarly all the time on all dimensions, however, as entrainment may not be present due to social norms, status, identity maintenance through language/accent use, communication goals (e.g., a parent trying to calm a child), or complementarity (e.g., women on dates in past research both entraining and diverging by enacting “feminine” communication styles; Giles et al., 1991).
Accommodation or entrainment used here is not the same as Ancona and Chong’s (1992, 1996) definition in organizational theory, which was borrowed from biology: “the adjustment or moderation of one behavior to synchronize or to be in rhythm with another” (Ancona & Chong, 1992, p. 166). Specifically, Ancona and Chong examined tempo entrainment, or when processes operated at the same speed or pace; synchronic entrainment, which involved similar rhythms/patterns and pace; and harmonic entrainment, or when observers consider two processes to be in harmony even if their pace or rhythm were different (Ancona & Chong, 1992). Their conceptualization of entrainment is broader than the one we describe from computational linguistics; the Ancona and Chong (1992, 1996) conception refers to an array of synchronized activities, not just communication, over longer and larger time frames (e.g., academic semesters, product development cycles, Zellmer-Bruhn et al., 2003). As such, entrainment in their theory is a sociological concept of choreography aligned with multiple intergroup rhythms at once (e.g., with top management, organizational division, and customers; Ancona & Waller, 2007). For example, “teams may become entrained to the pace of activity occurring on other parts of the organization or other levels of the team context” (Zellmer-Bruhn et al., 2003, p. 137). In other words, the concept championed by Ancona and colleagues is related to, but not the same as, the concept of entrainment within computational linguistics, where entrainment is conceptualized as how individuals communicate, including speaking similarly over time—and is usually examined in shorter time frames, such as a meeting.
While studies drawing on communication accommodation or entrainment have used a variety of research paradigms, disciplines, and communication media (e.g., face to face, phone, online), most research in sociolinguistic settings involves dyadic interactions or what we would consider individual-level entrainment—an individual’s entrainment toward others (e.g., immigrants and their speech patterns; Giles et al., 1991). Entrainment has been examined on a variety of dimensions including nonverbal (e.g., smiles, gaze), linguistic and lexical (e.g., specific words), and acoustic-prosodic signals such as loudness (e.g., Anzalone et al., 2015; Chartrand & Lakin, 2013; Giles et al., 1991; Lubold et al., 2018; Tomprou et al., 2021; Van Swol & Kane, 2019). Moment-by-moment entrainment between infants and mothers on a variety of nonverbal aspects is considered important for secure attachment, and entrainment between adults can promote perceptions of being a social unit and build rapport (Delaherche et al., 2012). Despite the variety in measures and potential explanations for the phenomenon, the overarching theory remains consistent: people often unconsciously match how others speak and behave over time, and this matching generally has positive implications for liking and social outcomes (Chartrand & Lakin, 2013; Delaherche et al., 2012; Doré & Morris, 2018; Sinha & Cassell, 2015; Van Swol & Kane, 2019). Regardless of how it is measured, entrainment is a dynamic behavioral construct that involves changes over time, rather than being a static or episodic phenomenon (Luciano et al., 2018). Even when it is measured as a single number, that number is meant to capture the degree of change that has occurred.
Recent advances in computational linguistics have enabled the creation of new acoustic-prosodic measures of entrainment over the past decade (e.g., Gálvez, Gauder et al., 2020; Levitan & Hirschberg, 2011). Acoustic-prosodic dimensions include the rhythm, frequency, loudness (intensity), and intonation of speech, such as pitch. Pitch is the perceived height of tone (high vs. low notes), and intensity is the loudness of the sound of speech (Lubold & Pon-Barry, 2014). This literature also defines different types of entrainment such as proximity, convergence, and synchrony. Proximity is similarity between the speakers’ dimension (e.g., pitch) compared to a baseline of non-conversational partners, whereas convergence is similarity change over time, which is a more truly dynamic measure as it captures whether speakers become closer to each other in specific dimensions toward the end of a conversation versus the beginning (Levitan & Hirschberg, 2011). Synchrony is moment-by-moment coordination between speakers (Levitan & Hirschberg, 2011). In other words, while convergence studies yield a single measure of the extent of convergence occurred over a conversation or meeting, synchrony involves the moment-by-moment changes in behavior matching in a dynamic and reciprocal manner (Delaherche et al., 2012).
Another aspect of entrainment is what features of a given acoustic-prosodic dimension are being examined, such as the standard deviation of pitch (i.e., the variety of high/low tones as sampled within a time period) and mean intensity (average loudness across samples within a time period). Acoustic-prosodic features are typically extracted automatically and separately for each speaker from high-quality audio recordings of conversations using software tools such as OpenSMILE (Eyben et al., 2010) or Praat (Boersma & van Heuven, 2001). Features can be extracted from any segment of speech as input. Speech segments can be associated with linguistic constructs such as speaker turns, or with temporal windows such as each minute of a recording. While research in dyadic entrainment has often examined short-term entrainment across speaker turns, in multi-party (group or team) conversations, a response to a speaker is not guaranteed to be in the next turn of the conversation. Thus, longer-term measurements based on temporal windows may be more appropriate units of analysis for team research, and this is the approach we adopt, as discussed below.
In sum, taking these three aspects together—dimensions, types, and features—researchers can examine any combination of the three to capture acoustic-prosodic entrainment, such as convergence on standard deviation of pitch, or synchrony of mean intensity. Drawing on an existing corpus (Litman et al., 2016) and promising directions from prior acoustic-prosodic research on dyad entrainment (e.g., Levitan & Hirschberg, 2011; Lubold & Pon-Barry, 2014), this study focuses on pitch and intensity (dimensions), convergence (type), and mean, max and standard deviation (features).
Entrainment as a Measurable Team Process
Research and measurement on acoustic-prosodic entrainment is still under development, with mixed findings. To measure acoustic-prosodic team (larger number of discussants than dyadic) convergence with respect to each potential combination of feature and dimension, our research builds upon a dyadic measure that calculates the difference between the dissimilarity of speakers in two nonoverlapping time intervals (Levitan & Hirschberg, 2011). The size of the time interval is a parameter of the convergence measure. If the dissimilarity in the second interval is less than in the first, the pair is considered to be converging, with the difference value indicating the strength of the convergence.
In the beginning of our research effort using our corpus, we initially proposed a team-level version of this dyadic measure via a non-weighted average across each dyad’s convergence (Litman et al., 2016). The partner difference for a speaker in a dyad within a given time interval was the absolute difference between the feature values for the speaker and their partner (Levitan & Hirschberg, 2011). For each team, we averaged these absolute values for all team members (see Equation 1, Litman et al., 2016). In addition, since differing time intervals had been examined in the dyadic entrainment literature, we explored the impact of comparing features extracted from the first versus last 3, 5, and 7 min of each conversation, as well as from the two conversational halves as in Levitan and Hirschberg (2011). Our results (Litman et al., 2016) suggested that when teams in our corpus converged on a feature, they did so earlier in the conversation; evidence of convergence was strongest when comparing the more distant temporal intervals (i.e., first vs. last 3 min).
However, as we continued our research, we realized we needed to consider aspects of the communication dynamics in our measurements of team entrainment. We thus modified the weighting scheme for the entrainment variables to better reflect group dynamics as compared to simply averaging dyad convergence. This modification was done by decreasing the contribution of speakers whose convergence behaviors differed from the rest of the group (e.g., Speaker 3 in Figure 1 for an illustration). In other words, the weight for a speaker should be the percentage of individuals who have the same convergence behavior as the speaker (see Methods for details). This decision improved performance on predicting social outcomes and differentiating between real and permuted conversations, two types of benchmark tasks previously used by the computational linguistics community to evaluate convergence measures (Rahimi & Litman, 2018).

Illustration of a four-person team where all speakers except one are converging; lines connect the feature value for each individual.
Antecedents and Outcomes of Entrainment
Entrainment in general has been examined in human interaction processes such as social integration, perceived likability, dominance, and friendship formation (e.g., Beňuš, 2014; Kovacs & Kleinbaum, 2020). Measured in various ways, entrainment has generally shown positive effects on social (and occasionally task) outcomes, in line with Communication Accommodation Theory. For example, social robotics researchers have examined the potential effects of acoustic-prosodic entrainment by a robot on human outcomes (e.g., Lubold et al., 2021). Social robots that used entrainment (pitch and rate) led children to express more positive and fewer negative emotions in a storytelling task (Kory-Westlund & Breazeal, 2019), and robot pitch entrainment combined with social language resulted in greater mathematics learning in the human students than with non-social robots (Lubold et al., 2018). Gálvez, Gravano, Beňuš, et al. (2020) had mixed findings when a conversational avatar used entrainment: Intensity entrainment had a positive effect on trust, but pitch entrainment had a negative effect on trust, suggesting that entrainment on pitch may not be beneficial in the context of social robots. However, designing robot-to-human social interactions is different from naturally occurring human teamwork (e.g., Van Swol & Kane, 2019).
In human dyads, acoustic-prosodic entrainment has been linked to positive social outcomes, but the results depend on the type and dimensions of entrainment (e.g., Gálvez, Gauder et al., 2020). Acoustic-prosodic entrainment in dyads is positively related to conversational quality (Michalsky et al., 2018) and rapport (Lubold & Pon-Barry, 2014). However, there are mixed findings, as the relationship between entrainment and these outcomes depends on the acoustic dimensions (e.g., pitch but not intensity) and, as suggested by Communication Accommodation Theory, their relevance to social identity (e.g., accents) and perceptual salience to listeners (Levitan, 2020).
In addition, most of this research has examined dyads rather than teams, and further studies are needed to explore antecedents, such as personality, in addition to the presence and consequences of entrainment (e.g., Levitan & Hirschberg, 2011). The Teams Corpus (Litman et al., 2016) provided an opportunity to do both. Past research with this corpus found that both intensity maximum and standard deviation features of convergence entrainment were positively related to social outcomes, as measured by a composite of self-reported team emergent state variables (e.g., cohesion, group efficacy; Rahimi & Litman, 2018). A lexical measure of entrainment based on the Linguistic Inquiry and Word Count (LIWC, Pennebaker et al., 2015) had negative associations with both task and process conflict; there were also relationships between lexical entrainment, team demographic diversity, team size, and social outcomes (M. Yu et al., 2019). For instance, teams with higher gender diversity had higher levels of unweighted minimum lexical convergence (M. Yu et al., 2019). Different versions of the team acoustic-prosodic metrics of entrainment (e.g., proximity measures of intensity standard deviation vs. proximity pitch mean vs. different convergence measures) were positively and negatively related to self-reported social outcomes; were positively and negatively correlated with lexical (non-LIWC) entrainment metrics; and when used in a multi-modal (combined) manner, were better able to predict social outcomes (Rahimi et al., 2017). In a separate set of analyses, a more local lexical (using LIWC) measure of entrainment also predicted social outcomes (Rahimi & Litman, 2020).
One challenge in most entrainment research is that, while entrainment is examined using many types of dimensions, types, and features, the predictions tend to be the same despite empirical evidence suggesting differences between them (Levitan, 2020). In other words, many studies take a high-level view of Communication Accommodation Theory and suggest that entrainment on any dimensions, features, and types will predict of positive social outcomes, despite results supporting this hypothesis only for certain measures (e.g., Beňuš, 2014; Levitan, 2020; Rahimi et al., 2017).
Case Study: Linking Team Personality Composition to Team Entrainment
While research on Communication Accommodation Theory examines when and why individuals entrain (Giles et al., 1991), research on under what circumstances team entrainment may occur is still in its infancy. This study explores the potential associations of team personality composition on a set of new measures of team (multi-party, rather than dyadic) acoustic-prosodic entrainment.
Personality is a set of relatively stable individual differences that impacts behavior across situations (John & Srivastava, 1999). Most studies of team personality composition draw on the Big Five Model, a rigorously-developed model of personality that distinguishes five main dimensions of personality (John & Srivastava, 1999; John et al., 2008). Extraversion is the degree to which the person is gregarious, assertive, talkative, enthusiastic, and seeks excitement and stimulation. Agreeableness involves being trusting, compliant, kind, and altruistic. Conscientiousness involves how reliable, thorough, and organized a person is. Openness to Experience includes being curious, artistic, imaginative, and having wide interests. Neuroticism (also termed Negative Emotionality) is the degree to which a person experiences high levels of anxiety, depression, and general emotionality/emotional volatility. The Neuroticism dimension is sometimes called Emotional Stability, which is the opposite (or flipped) version of this dimension and indicates high level of calmness and even-temperedness (John et al., 2008). Team personality composition can be examined in different ways (e.g., Barrick et al., 1998), but is commonly examined as an amount, level, or elevation (e.g., average, maximum, minimum) and as its diversity or variance (e.g., measured via standard deviation, Neuman et al., 1999).
Relationships between team personality and outcomes (e.g., team performance) tend to be small, whereas it may be “easier to find empirical support for relations between team personality and team processes” (Prewett et al., 2009, p. 289)—and entrainment is a type of team emergent process. Although multiple meta-analyses examined the effects of team composition on outcomes (e.g., Bell, 2007; Peeters et al., 2006), the effects of team personality composition on communication dynamics are understudied in terms of both empirical knowledge and theory. If particular personality composition variables are related to certain entrainment variables but not others, it helps us unpack differences between various acoustic convergence phenomena and suggests which dimensions, types, and features could be pursued for further investigation regarding entrainment as a potential team process mediator between team personality and outcomes. In addition, while this study does not make strong causal claims, there is no possibility for reverse-causal effects.
A challenge in our current theorizing is that the extant theory does not describe why or how teams entrain. Prior research describes why one individual might entrain to others (Lewandowski & Jilka, 2019), why dyads might entrain with each other (e.g., mother/child pairs, Delaherche et al., 2012), or how dyadic pairs entrain (e.g., Levitan & Hirschberg, 2011), but not how team entrainment comes about. There is, however, research on the associations between individual personality and tendencies to entrain (see Table 1). There are individual differences in the tendency to mimic, and initial liking may also lead to mimicry in interactions (Kämpf et al., 2018). Individual Agreeableness is likely related to individual entrainment behaviors because affiliative motivation is related to mimicry (Chartrand & Lakin, 2013). According to Communication Accommodation Theory (Giles et al., 1991), people who want to be liked are more likely to entrain to others, and those who resist conformity would have a divergent communication style to highlight their individual identities. In one study of humans entraining to robots, Agreeableness was not found to be associated with entrainment (Brandstetter et al., 2017), although this finding may be because the targets in this study were robots rather than humans. Conscientiousness may or may not be related to entrainment. While entrainment has been occasionally associated with task performance (e.g., Lubold et al., 2018), and Conscientiousness (both individual and team-level) are task-oriented and related to team performance (e.g., Prewett et al., 2009), these findings do not imply much in terms of predictions about entrainment and Conscientiousness.
Traits and Entrainment Literature Summary.
Individual Neuroticism, however, has been found to be positively related to individual entrainment: Individuals higher on Neuroticism were more likely to converge with a programed tutor by using rising intonation (Gessinger et al., 2021), and nonnative/German speakers higher on Neuroticism were more likely to entrain on the English pronunciation of their interacting partner (Lewandowski & Jilka, 2019). Individuals high on Neuroticism are more affected by distress and so may seek social approval more, and thus entrain because they are unconsciously trying to be positively evaluated (Lewandowski & Jilka, 2019).
On the individual level, Openness to Experience has been found to be positively related to entrainment via adopting words introduced by peers (Brandstetter et al., 2017) and positively related to voice onset time imitation, or entrainment on the length of time between the end of a word’s consonant and the beginning of the next word (A. C. L. Yu et al., 2013). Lewandowski and Jilka (2019) also found that Openness was positively associated with pronunciation entrainment. Those with high Openness to Experience seek out novelty and are also more open to different cultures and settings, and they thus may also be more perceptive and competent with communication in general (Lewandowski & Jilka, 2019).
Extraversion may include dominance/assertiveness, enthusiasm, and talkativeness (John et al., 2008). Power may be related to low Agreeableness and high Extraversion. In particular, power may be related to the Extraversion subdimension of dominance. Based on research that examined entrainment using linguistic style matching (LSM) or the matching of specific lists of function words using LIWC over conversations (Pennebaker et al., 2015; e.g., Gonzales et al., 2010; Kovacs & Kleinbaum, 2020), people may entrain toward someone who is in a higher position of power (Danescu-Niculescu-Mizil et al., 2012; see Van Swol & Kane, 2019 for a review). High-power individuals are more likely to feel they can openly express their opinions compared to low-power individuals (Berdahl & Martorana, 2006), and greater individual power leads to disinhibition in general (Keltner et al., 2003). Individuals with lower assertiveness may entrain to their more assertive teammates (Van Swol & Kane, 2019). However, dominance is only one dimension of Extraversion, which also includes talkativeness and enthusiasm (John et al., 2008).
Our study is exploratory, as this reviewed research does not offer clear predictions for team personality composition and entrainment. Of note, this study is about convergence entrainment, not vocal behaviors in general. For instance, while work stressors are associated with increased rate of speech and vocal intensity (Langer et al., 2022), our study examines convergence, regardless of the overall rates or levels of the acoustic-prosodic features. A team might or might not converge on low or high intensity. We examine potential relationships between team personality composition involving elevation (levels or mean) and diversity (standard deviation) of the Big Five personality dimensions as precursors, and specific aspects of acoustic-prosodic entrainment as dependent variables. Our overarching research question is thus as follows:
Are Big Five personality dimensions (Agreeableness, Extraversion, Conscientiousness, Neuroticism, and Openness to Experience), when measured as elevation/level (mean) and diversity (standard deviation), related to subsequent team convergence entrainment, as measured using pitch and intensity?
The Teams Corpus
To explore the effects of team personality composition on team entrainment, we drew on the Teams Corpus. 2 The goals of the Teams Corpus were to create a dataset that would enable the development of team, rather than dyad, acoustic-prosodic entrainment measures and to examine the relationships between entrainment and variables typically studied in small group research (Litman et al., 2016; Rahimi & Litman, 2018). 3 The Teams Corpus entailed face-to-face teams of three or four participants playing a cooperative board game, Forbidden Island™, twice, under experimental conditions, after being trained on how to play the game (Litman et al., 2016). Each individual was audio-recorded through headset microphones and videorecorded as well. The project was a collaboration between a computational linguist/computer scientist and a social/personality psychologist. The computer scientist was interested in developing measures that could help detect entrainment in real groups, but also could be deployed by intelligent agent tutors in team situations. The psychologist was interested in whether entrainment could be a behavioral mediator for existing findings between team composition and outcomes, and whether entrainment, as a behavioral measure, would correlate with common self-report measures of team processes and emergent states (e.g., cohesion). The original corpus therefore includes a variety of before- (e.g., personality, demographics, experience playing games) and after-game self-report measures (e.g., cohesion, shared mental models, perceived team efficacy, team potency). Task success in terms of winning the game ended up being of limited variability (everyone won the second game) and so has not been used.
The Teams Corpus thus serves as a useful case study for exploring the relationships between these variables, and it has been used in several studies. Indeed, one creative use of the corpus involved examining speaking overlap dynamics as a way of identifying individual personality (M. Yu et al., 2019). Timing of analyses was such that the computational questions and development proceeded first, and the final versions of the acoustic-prosodic entrainment variables are thus included in this study (i.e., the convergence measures that performed best on benchmark evaluation tasks in Rahimi & Litman, 2018).
At this time, the corpus has been used for a variety of research questions (see prior findings), but not whether team-aggregated personality is an antecedent of entrainment, which was one of the original questions to explore and why personality variables were included in data collection. Research using this corpus has mainly focused on developing a series of lexical and acoustic-prosodic measures of team entrainment (proximity and convergence, but not synchrony), a previous rarity in the literature (Litman et al., 2016; Rahimi, 2019; Rahimi et al., 2017; Rahimi & Litman, 2018, 2020; M. Yu et al., 2019). Over the course of the project, novel versions of entrainment measures were developed that weighted for small group dynamics for acoustic-prosodic (Rahimi, 2019; Rahimi & Litman, 2018; see Measures, below) and lexical (Rahimi & Litman, 2020) measures. The current study tackles the research question, using the novel, more sophisticated versions of the acoustic-prosodic entrainment measures, and specifically those for convergence, which are measures of dynamic phenomena rather than slices of time similarity (i.e., proximity).
Methods
Participants
The Teams Corpus consisted of 216 participants, comprising 63 teams of three or four players (62.5% female; 37.5% male). Because the recruitment sought out non-students via flyers both around a college campus and commercial districts in a city on the U.S. East Coast, the average age was 25 years old (SD = 11.3; Minimum = 18, Maximum = 67). In terms of ethnicity, 166 participants reported being Caucasian, 31 Asian/Pacific Islander, 24 Black, 10 Latino/Hispanic, and one Native American (multiple ethnicities could be reported). Participants read and signed consent forms before the study and received compensation after. One audio file was not saved properly, resulting in 62 teams of 213 participants, with 35 (56.5%) 3-person teams and 27 (43.5%) 4-person teams. One individual did not answer the question about cooperative game knowledge, so the maximum team knowledge of cooperative games variable was calculated from the other team members.
Procedure
Teams played the game Forbidden Island™ twice. Forbidden Island™ was chosen because it requires cooperation, communication, and strategy in real time; each player takes on different roles and there is game-specific terminology, making the game analogous to real-world multifunctional and multidisciplinary teams. Each player goes in turn, but they can freely discuss each other’s actions. Half the teams had a minimal team training intervention before the first game, which involved a review of teamwork concepts based on a needs analysis (Gregory et al., 2013) and a reflection exercise after the first game (Schippers et al., 2014). Neither this experimental intervention nor a balanced order variable for the two games had significant effects on any of the entrainment variables, so they were not included in these analyses. In this study, we examined entrainment during the first game only.
Measures
Acoustic-Prosodic Entrainment
Each participant was video- and audio-recorded using separate headsets and cameras. Focusing on features and dimensions found to be promising in prior computational linguistics research, we used the Praat software program (Boersma & van Heuven, 2001) to extract different types of convergence measures (e.g., Levitan, 2020), specifically the mean, maximum (max), and standard deviation (SD) of both pitch and intensity (e.g., pitch-SD, intensity-mean) for two time slices from the conversation from each individual (not across the teams). First, we calculated the feature values for individuals and for each time slice—in this case the first and last (fourth) quadrants of each of the first games (Rahimi & Litman, 2018). As noted, this decision to compare first and last quadrants was was based on our prior research demonstrating that convergence was strongest when feature value differences were compared between more distant temporal intervals (Litman et al., 2016). Then, we computed the difference in feature values between two individuals, and then the difference of those feature value differences for the two time slices (see Equation 1, Litman et al., 2016). Then, we applied a weighting scheme (detailed below) to calculate a weighted average of such pairwise/dyadic convergence, creating the final team convergence metric used in our prior research (Rahimi, 2019; Rahimi & Litman, 2018) as well as in this study.
For example, convergence on intensity maximum was calculated by (1) determining the maximum loudness of each speaker for the first and fourth time quadrants (see Figure 2 for an illustration of sampling periods)—the stars and dots show each individual’s acoustic-prosodic feature (intensity) maximum value in the first and fourth time quadrant, respectively; (2) creating a measure of the difference of maximum loudness between the two individuals in each dyad during the first and last quadrants separately; (3) subtracting the difference score between the dyads’ differences in the last period from the difference score between the speakers in the first period; and (4) taking a weighted average of those differences to more accurately measure trends in entrainment across the team. For intensity standard deviation (or SD), we examined each individual’s variation in loudness. For interpreting these measures, higher difference numbers indicate more convergence over the game session, and thus higher entrainment.

Illustration of first and last quadrant sampling for convergence measures.
Previous work with this corpus suggests that there was significant entrainment within the first game on several features (Litman et al., 2016), but also that simple averaging across dyadic similarity measures would be insufficient to capture small team entrainment dynamics (Rahimi, 2019; Rahimi et al., 2019; Rahimi & Litman, 2018). Because different patterns were observed in our data when creating plots such as Figure 2 (which is illustrative), we developed the novel weighted averaging scheme used here, which was introduced and fully detailed in Rahimi and Litman (2018). Instead of simply averaging across dyadic similarities (i.e., average of person 1 to person 2, and person 2 to person 3, etc.) as in Litman et al. (2016), this weighting scheme takes into account small team dynamics by considering relative (converging or diverging) behavior of each speaker with respect to the common behavior in the team (but did not involve any adjustments for gender; see Rahimi, 2019). The weight for a speaker was the percentage of individuals who had the same team convergence behavior (either converging, diverging, or mixed) as the speaker, normalized by the group size. More specifically, if all conversational dyads that a speaker was involved in had positive or negative convergence values, the speaker would be assigned converging or diverging behavior, respectively. Otherwise, the speaker’s behavior was mixed. Thus, the current study uses the novel, more sophisticated versions of the acoustic-prosodic convergence entrainment measures.
We note that entrainment measure names such as entrainment on standard deviation (SD) of pitch, or entrainment pitch (SD), do not indicate the standard deviation across the team (unlike the personality measures, e.g., standard deviation of Extraversion). These entrainment measures are created using the standard deviation of an acoustic-prosodic feature within a time slice per individual, which is then combined to create an over-time convergence measure.
Team Personality Composition
All team members filled out individual difference questionnaires before training and before playing the game. This study took the two common ways of measuring team personality composition: means for levels or elevation and standard deviation for diversity (Halfhill et al., 2005). Measures of z-scored mean and standard deviation team composition were created for each of the Big Five dimensions for ease of comparison. We measured the Big Five using the 44-item Big Five Inventory measure (BFI, John et al., 1991). 4 Participants rated the degree to which they agreed or disagreed with each item with the stem of “I am someone who. . .” on a 5-point Likert scale: 1 (disagree strongly), 2 (disagree a little), 3 (neither agree nor disagree), 4 (agree a little), or 5 (agree strongly). Each Big Five dimension has eight or nine items. For instance, for Extraversion (Cronbach’s α = .89), items include “Is full of energy,” “Is talkative,” and “Is reserved” (reverse scored). For Agreeableness (Cronbach’s α = .79), items include “Is helpful and unselfish with others,” “Is considerate and kind to almost anyone,” “Is sometimes rude to others” (reverse scored), and “Is cold and aloof” (reverse scored). Examples of Conscientiousness (Cronbach’s α = .79) include “Does a thorough job,” “Makes plans and follows through with them,” and “Tends to be lazy” (reverse scored). Items for Neuroticism (Cronbach’s α = .83) include “Worries a lot,” “Can be moody,” and “Is relaxed, handles stress well” (reverse scored). Examples of Openness to Experience (Cronbach’s α = .80) include “Is curious about many different things,” “Likes to reflect, play with ideas,” and “Has few artistic interests” (reverse scored).
Team Demographic Composition and Other Covariates
Prior studies using this corpus or other data suggested that demographic composition was significantly related to some entrainment variables (e.g., Levitan, 2020; M. Yu et al., 2019), so we examined the following team-level measures as potential covariates: percent female, standard deviation of age, ethnic diversity using Blau’s index, team size (three or four-person team), and the mean and standard deviation of each participant’s highest level of education (e.g., high school, some college, college, etc. on an ordered scale; Ouyang et al., 2017). Blau’s index is a commonly used metric for racial/ethnic diversity, where higher numbers indicate a greater variety of ethnic/racial categories represented more equally (Solanas et al., 2012). Average age and standard deviation of age within teams were highly correlated (rs = .93), so only standard deviation of age was used. The standard deviation of education, or educational diversity, was included as a covariate because prior analyses with this corpus found that it was positively related to a task-specific measure of team performance (Ouyang et al., 2017). We also asked the participants to describe their cooperative board game experience, which could plausibly be a relevant covariate, on a 5-point frequency of play scale: 1 (never), 2 (play rarely, once or twice), 3 (play sometimes), 4 (play often), and 5 (play very often). These individual responses were combined to be mean and maximum team level of cooperative board game experience, and then those two variables were z-scored.
Analyses
All analyses were conducted using SPSS 28.0.0.0 at the team-level, because the covariates, entrainment, and personality composition variables were all at the team level. Given the novelty of the entrainment variables, we used correlations to determine if data reduction indices could be created (Table A1, Appendix A). Because many variables were not normally distributed, Spearman correlations were used. The set of entrainment variables were not highly correlated (all |rs| <.30, see Table A1). 5 While some of these correlations were statistically significant, they were too modest to suggest that the entrainment variables represented the same construct, thus not justifying combining entrainment variables or choosing just some entrainment variables.
First, we examined which covariates were significantly related to each entrainment variable using linear regression. All covariates were entered simultaneously, and non-significant covariates for each entrainment variable were eliminated using backward stepwise elimination, an exploratory, data-driven analytic approach. Second, we then conducted regression analyses for each of the six entrainment variables that included only the significant covariates (if any) and all 10 of the team personality variables, and we eliminated non-significant variables also using backward stepwise elimination. We present the final regression models with the retained variables. We could thus transparently explore which team personality and entrainment variables were significantly related while controlling for the other team personality variables and any significant covariates in the model.
Results
We first present the descriptive statistics for continuous variables (Table 2), and then present the results of our regression analyses.
Descriptive Statistics (N = 62 teams).
Note. N = 62; The Big Five Inventory raw data were measured on a 1 to 5 scale.
No team covariate or personality composition measures were found to be significantly associated with entrainment on mean or maximum pitch. Team education diversity had a significant negative relationship with entrainment in terms of intensity standard deviation (unstandardized B = −0.13, SE = 0.06, lower bound = −0.24, upper bound = −0.01], p = .029). However, no personality composition variable was retained in the backward stepwise elimination for intensity standard deviation.
For convergence entrainment for standard deviation of pitch, team education diversity had a significant positive relationship as a covariate, so it was retained for the analyses with the personality composition variables. The final model after backward stepwise elimination, R2 adj = .23, F(3, 58) = 7.04, p < .001, had a positive association with the team education diversity (standard deviation of education level) and a negative one with team Openness to Experience diversity. The model also retained team Average Conscientiousness, although this variable was not statistically significant (Table 3).
Final Model for Pitch Standard Deviation (Entrainment) After Backward Stepwise Elimination.
Unstandardized beta and standard error for unstandardized error.
z-scored.
The final covariate models for entrainment on intensity mean and intensity maximum both had a significant covariate, the age diversity of the team. The 10 personality composition variables, along with age diversity, were then analyzed for both dependent variables. Of these, for intensity mean, age diversity, team Average Agreeableness, and team Agreeableness diversity (SD) were all retained by the model, R2 adj = .23, F(3, 58) = 7.04, p < .001, with Agreeableness diversity and age diversity having significant (positive and negative, respectively) associations with entrainment on intensity mean (Table 4).
Final Model for Intensity Mean (Entrainment) after Backward Stepwise Elimination.
Unstandardized beta and standard error for unstandardized error.
z-scored.
Just as with the analyses of intensity mean entrainment (Table 4), we conducted a backward stepwise elimination regression with team age diversity as the initially retained covariate and the 10 personality composition variables as predictors for entrainment intensity maximum. In this case, both age diversity and team average Agreeableness were retained, with the latter having a positive association, final model R2 adj = .11, F(2, 59) = 4.84, p = .011 (Table 5).
Final Model for Intensity Maximum (Entrainment) after Backward Stepwise Elimination.
Unstandardized beta and standard error for unstandardized error.
z-scored.
Table 6 shows a summary of our findings. We found no personality composition associations with entrainment measured via pitch mean, pitch maximum, or intensity standard deviation, suggesting that different entrainment features may indicate different constructs. For entrainment measured via pitch standard deviation, we found a negative association with diversity in Openness to Experience when controlling for a positive relationship with team education diversity (and team average Conscientiousness). Further, Agreeableness diversity was positively related to entrainment on intensity mean, and team average Agreeableness was positively related entrainment on intensity maximum, both when controlling for team age diversity (but that covariate was significant in opposite directions). These findings warrant further discussion.
Summary of Findings: Relationship between Entrainment and Team Personality.
Note. + indicates a positive relationship and – indicates a negative relationship.
Discussion
This paper makes a contribution by describing the concept and measurement of acoustic-prosodic entrainment in computational linguistics and then demonstrating how it can be used in a team research study. By conducting multidisciplinary team science, we identified a new problem space, that being the relationships between team personality composition and acoustic-prosodic speaking convergence. Our key empirical findings suggest that different kinds of team composition may be related to subsequent entrainment, but the relationship varies depending on the exact type of personality composition and entrainment. We found that average Agreeableness was positively related to entrainment on intensity maximum, and diversity in team Agreeableness was positively related to entrainment on intensity mean—both measures of convergence in terms of loudness. Diversity in team Openness was negatively related to convergence on pitch standard deviation. Previous research suggests that team personality compositions have different implications for team processes and outcomes. Our quantitative exploratory study (Bamberger & Ang, 2016) further implies that acoustic-prosodic convergence entrainment is similarly not a single phenomenon. As small group researchers, we can benefit from both the utility of entrainment as a possible variable of interest and by incorporating relevant constructs from a larger variety of disciplines. This study leveraged data and metrics generated within computational linguistics, bringing them to questions, theory, and disciplinary practices in social science (Allen et al., 2017). By mixing disciplines, we identified a space to sketch out knowledge gaps and theory.
Implications
Team researchers have long sought to better measure and understand team processes and emergent states (Kozlowski, 2015; Marks et al., 2001), and many argue that we should measure behavior directly (e.g., Baumeister et al., 2007). It is common in the history of science to develop and then widely use a measurement paradigm, but this tendency often results in the exclusion of other potential metrics, which in turn limits theorizing. One example is survey research on team processes such as conflict, where the conceptualization, study, and theorizing of conflict becomes that of a large or lingering memorable phenomenon rather than moment-by-moment conversational events (Paletz et al., 2011). While some might exclude moment-by-moment events from conflict definitionally, there is no theoretical reason to do so—it is due to limitations imposed by methodological norms. A theory of conflict could be more comprehensive by encompassing and distinguishing between both types of conflicts. Similarly, language use and conversations have long been recognized as key to small group processes, and language analysis is a promising method for understanding these processes (Kane & van Swol, 2022). Such studies can open the “black box” of team processes, so that these processes can be measured and conceptualized beyond what has traditionally been researched (Kane & van Swol, 2022; Lehmann-Willenbrock & Allen, 2018; Lehmann-Willenbrock & Hung, 2023).
Further research is necessary to understand whether entrainment is correlated with and/or reflective of other group processes and emergent states that are typically measured via surveys, such as cohesion. If different kinds of entrainment correlate highly with cohesion or other constructs (e.g., lexical entrainment with shared mental models), it may be possible to use automated measures of entrainment as a proxy for those other variables when questionnaires are impossible or difficult to obtain. Mixing social science methods and computational, sensor-based methods can be challenging given the different levels of granularity in terms of how to align on different and similar constructs (Lehmann-Willenbrock & Hung, 2023; Müller et al., 2019). Nevertheless, as we expand our measurement paradigms, we can also extend our theorizing about team processes.
Even if entrainment is not correlated with other constructs, it could still be studied as an important phenomenon in its own right, standing with participation and information sharing among other communication phenomena as a variable of interest for small group researchers. Both the concept and the measurement of acoustic-prosodic entrainment have been understudied within small group research but can be integrated into our existing models. For example, in a new integrative model of team effectiveness research that takes into account complexity and advocates for a “dynamic, multilevel” perspective, entrainment is not mentioned at all, nor is any other accommodation-like construct like synchrony (Mathieu et al., 2019). Perhaps entrainment serves as a mediator for the relationships between team personality and perceived social outcomes such as cohesion, but we will not know without theorizing, measurement, and empirical studies, all of which should lead to improved models that incorporate entrainment.
In terms of theoretical implications, our study raises the question of why certain measures of entrainment are related to personality composition while others are not. Just as creativity and emotion need to be measured and conceptualized in highly specific ways to truly understand how each construct works and the relationships between them (Ivcevic, 2022), so too does entrainment need to be unpacked and better conceptualized. We found three types of team personality composition (two regarding team Agreeableness) were related to three types of convergence entrainment. Of note, these six constructs were mostly distinct. Agreeableness diversity and team average Agreeableness were negatively related—modestly, but significantly (rs = −.35, p < .01). Personality dimensions, while correlated modestly, are discrete constructs (John et al., 2008), and prior research suggests that entrainment metrics on different acoustic-prosodic dimensions and features may similarly be distinct, as they are not correlated with each other (Levitan, 2020; Schoenherr et al., 2019). Entrainment findings and correlates may vary depending on tasks, time slices, and types of features measured (Levitan, 2020). For instance, the maximum loudness (intensity) that people might express in different time slices, versus their average loudness, could be indicative of different social communication processes. Mean intensity could reflect general levels of excitement and engagement with the task and team, whereas maximum intensity could indicate intermittent levels of excitement.
To better understand the nature of different types of acoustic-prosodic entrainment, we may need to return to qualitative research. Researchers could watch and compare the interactions in videos of teams with high versus low entrainment (e.g., on pitch standard deviation) to understand what is happening in those conversations over time in terms of both what acoustic features can be perceived by an observer and the content of the communication. In addition, whether entrainment can be perceived by an observer has interesting implications as to whether entrainment must be perceived by the speakers themselves, or if it is an entirely unconscious team phenomenon.
One fruitful way to incorporate entrainment into broader theory is through associating it with other communication constructs such as participation. The relative participation of individuals with different personality patterns could explain a team’s overall entrainment. In other words, entrainment may be constrained or even enacted by participation dynamics within a group over time. Future work to develop a theory of team acoustic-prosodic entrainment should consider team dynamics that might result from not only team personality, but from the interaction between personality, the presumed motives involved in entrainment (e.g., affiliation, Giles et al., 1991), task demands, and settings, and how those factors impact both participation and entrainment. It is possible that the personality composition variables have an impact on different kinds of entrainment dimensions, such as lexical entrainment, or in different settings (Levitan, 2020).
While suggestions for practice are premature from our case study findings, we provide some potential practical insights for small group researchers in their study of entrainment. Most literature on entrainment has been dominated by examinations of lexical measures (see Van Swol & Kane, 2019 for a review), and studies of acoustic-prosodic entrainment have largely focused on dyadic entrainment or on individual tendencies to entrain (e.g., Gessinger et al., 2021; Lubold & Pon-Barry, 2014; Tomprou et al., 2021). Our study suggests the utility of expanding beyond lexical measures and dyads to acoustic-prosodic measures and teams. It is not without potential logistical or professional difficulties, however. Measuring entrainment using sensor and language data requires high-fidelity audio-recordings that can capture the different elements (e.g., specific acoustic-prosodic dimensions and features). Further, researchers must calculate the convergence of team members into one number for each meeting or conversation, or track moment-by-moment matching of changes if measuring synchrony. While our research team worked collaboratively and was able to communicate internally the benefits and limits of these entrainment metrics, situating our research for the psychology/organizational behavior audience has been difficult. In our experience with other multidisciplinary work, importing a research methodology to a new discipline can be more difficult than importing constructs and relevant research questions. Despite calls to be creative by mixing disciplines (e.g., with creativity research, Berg et al., 2023), combining research paradigms comes with the risk of being so counter-normative as to have difficulty finding a disciplinary home or at least difficulty in finding reviewers or audiences who can evaluate all the different aspects of a study.
Limitations and Future Research
Given the novelty of the convergence measures and the sample size of teams for the large number of potential variables, the case study findings are necessarily exploratory. In addition, backward stepwise regression is a data-driven exploratory technique, such that these findings should be replicated before being cited as established science. While considered a large sample for computational linguistics and computer science studies of group interactions (Allen et al., 2017), our sample was not large enough for more complex statistical analyses used in psychology and organizational behavior (e.g., structural equation modeling). These measures should be used in a larger dataset and these findings replicated with different tasks. Because team composition effects may change over time (Mathieu et al., 2014), additional research could examine longer-term teams, such as work teams with a variety of task types. Researchers could test more complex compilation models, which are only possible to examine with a huge variety of team personality combinations in large samples. For instance, the kind of team likely to have the highest entrainment in this task might have a combination of moderately high Agreeableness, high average Openness to Experience, low Extraversion diversity, and moderate-to-high Extraversion. With a larger dataset, we could also explore different combinations with different sub-dimensions of the Big Five (e.g., talkativeness, dominance). In addition, some effects may be curvilinear. Perhaps moderate average Extraversion, rather than high or low average Extraversion, would be predictive of greater team entrainment. This possibility may explain why our linear models did not find a significant association between Extraversion and entrainment. Also, weighting entrainment by team dynamics raises the question as to whether it would be useful to weight team personality composition by team dynamics, such as by participation. Future research could examine the utility and construct validity of weighting personality composition by particular individuals in teams, which to our knowledge has rarely been done in studies of team personality composition.
We also argue for additional research to conduct a fuller convergent-divergent correlational study of different entrainment dimensions, types, and features across many team settings and tasks, and with different team processes and emergent states. Those dimensions—be they pitch and intensity, gestural, lexical, phonetic, and so on—that correlate together could help us build theory around which types of communication are likely to be entrained on by which teams and on which tasks. These types of entrainment, measured in different ways, could also be associated with the typical self-report measures of team constructs—shared mental models, team cohesion, and so on—to determine which aspects of entrainment correlate with them, if any.
In addition, while our measures captured convergence, measures that are on a finer-grained time scale and multilevel could be fascinating to deploy and build theory around. The current convergence measures compare the first and last quadrants of conversations, developed by comparing different potential first and last segments. Future research can explore the optimal time slices and durations of conversations for detecting convergence and other types of entrainment. Do conversations need to be 15 min, 30 min, or an hour to detect convergence? Further, while quadrants were the best time slice for 30-minute conversations, would the first and last 5 min be sufficient for longer or shorter conversations? Future work could also examine multiple time slices rather than just two, such as a 5-min time slice compared to the next 5 min compared to the third set of 5 min, and so on, to calculate change over time in a different manner. For example, in a study of lexical entrainment, instead of comparing just two fixed intervals, we iterated over all two intervals of a given time-slice in chronological order and computed convergence based on properties such as maximum difference across all comparisons (M. Yu et al., 2019).
Finally, while convergence measures are not the same as synchrony measures, discovering the relationship between them could also be useful for conceptual clarity of entrainment. Theory on team dynamics suggests that dynamic processes may have feedback loops, and also that some variables may have asymmetric influence: for the latter, higher levels of one variable can increase a dependent variable, but lower levels of that variable do not result in the decrease of the dependent variable (Cronin & Vancouver, 2019). When examined at a finer-grained time scale, entrainment processes may be built via feedback loops with individual and team personality, participation, and silence.
Conclusion
This study used new advances in measures of team convergence entrainment to explore whether personality composition influences entrainment. By introducing these constructs and measurements to this audience, we offer an integrated, creative (Berg et al., 2023) perspective on team research. Before building theory, exploratory studies like this one are necessary to even realize we need to ask new questions. We recommend that the conceptual and measurement clarity of entrainment should be improved, and then small group researchers can associate those entrainment variables with other constructs of interest. Through iterative studies of quantitative and qualitative exploration, theory-building, and empirical testing, we can better understand the role of entrainment as a communication phenomenon within improved models of team processes.
Footnotes
Appendix
Spearman Correlations of All Variables Entered into the Models.
| Variable | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1. Team size | — | ||||||||||||||||||||||
| 2. Percent female | −.08 | — | |||||||||||||||||||||
| 3. Age SD | .08 | -.30* | — | ||||||||||||||||||||
| 4. Ethnic/ racial diversity | -.29* | .10 | −.16 | — | |||||||||||||||||||
| 5. Education Avg. | .19 | −.18 | .65** | -.31* | — | ||||||||||||||||||
| 6. Education SD | .10 | −.18 | .61** | −.10 | .64** | — | |||||||||||||||||
| 7. Play Coop Board Games Avg. (Z) | −.07 | .00 | -.26* | .23 | −.19 | −.11 | — | ||||||||||||||||
| 8. Play Coop Board Games Max (Z) | .20 | .01 | −.13 | .06 | −.01 | .01 | .66** | — | |||||||||||||||
| 9. Agreeableness Avg. (Z) | .04 | −.05 | .10 | −.01 | .18 | .04 | .07 | −.22 | — | ||||||||||||||
| 10. Agreeableness SD (Z) | .13 | .03 | .07 | .10 | .06 | .05 | .17 | .39** | -.35** | — | |||||||||||||
| 11. Extraversion Avg. (Z) | .11 | −.06 | .20 | −.11 | .18 | .21 | .11 | .17 | .14 | .06 | — | ||||||||||||
| 12. Extraversion SD (Z) | .20 | .02 | .13 | −.10 | .11 | .04 | .11 | .22 | −.19 | .21 | −.24 | — | |||||||||||
| 13. Conscientiousness Avg. (Z) | .14 | .05 | −.21 | −.15 | .13 | −.20 | .14 | .07 | .15 | .17 | .04 | .22 | — | ||||||||||
| 14. Conscientiousness SD (Z) | .10 | .06 | −.13 | −.03 | −.12 | −.17 | .04 | .16 | −.07 | .02 | −.03 | −.07 | -.33** | — | |||||||||
| 15. Neuroticism Avg. (Z) | −.16 | .33** | −.09 | −.09 | −.20 | −.13 | −.15 | −.12 | -.36** | .06 | -.27* | .05 | -.26* | .08 | — | ||||||||
| 16. Neuroticism SD (Z) | .15 | .28* | .03 | .11 | .13 | −.04 | −.00 | .09 | −.07 | .12 | −.20 | .09 | −.06 | .15 | .29* | — | |||||||
| 17. Openness Avg. (Z) | .02 | −.15 | .12 | .14 | −.04 | .16 | −.08 | −.08 | −.15 | .14 | .04 | −.14 | −.12 | −.09 | −.05 | −.01 | — | ||||||
| 18. Openness SD (Z) | −.12 | .02 | .00 | −.12 | −.08 | .01 | −.01 | −.11 | −.03 | −.20 | −.25 | −.02 | .02 | -.37** | .13 | −.11 | -.32* | — | |||||
| 19. Pitch Mean | .08 | .10 | −.19 | .04 | −.15 | −.10 | .02 | −.01 | .07 | .13 | .06 | −.07 | .23 | −.09 | .09 | .10 | −.00 | −.02 | — | ||||
| 20. Pitch Max | .11 | .01 | .05 | −.01 | .17 | .13 | .03 | .10 | .11 | .05 | .20 | .03 | .11 | −.10 | −.09 | −.01 | .01 | −.14 | .14 | — | |||
| 21. Pitch SD | .14 | −.09 | .13 | −.07 | .12 | .30* | .22 | .13 | .22 | .10 | .19 | −.06 | .09 | −.00 | .06 | −.01 | .15 | −.16 | .29* | .27* | — | ||
| 22. Intensity Mean | −.01 | .17 | -.44** | .01 | -.38** | -.33** | .09 | .06 | .07 | .21 | −.15 | .02 | .24 | .03 | .01 | −.09 | .01 | −.18 | .25 | .28* | .22 | — | |
| 23. Intensity Max | −.06 | -.32* | .15 | .07 | .18 | .07 | .03 | −.14 | .11 | −.17 | −.09 | −.05 | .06 | −.09 | 0 | −.19 | .02 | −.02 | −.10 | −.09 | −.01 | −.19 | — |
| 24. Intensity SD | −.08 | −.07 | −.08 | −.05 | −.18 | −.19 | −.07 | −.06 | −.06 | −.16 | -.30* | .15 | .08 | −.02 | .07 | −.13 | .17 | .08 | −.13 | −.01 | −.16 | .09 | .18 |
Note. N = 62. Uncorrected p-values: *p < .05, ** p < .01. Correlations are rounded to two digits.
Acknowledgements
The authors are grateful to Stefani Allegretti and Caitlin Rice for their assistance in iterating the study materials, data collection, and data processing, as well as Mingzhi Yu for tabulating the ethnic diversity measure. We are also grateful for the reviewers and, especially, the editor for their helpful comments on earlier drafts of this paper. Some of this work was conducted when Susannah Paletz was at the University of Maryland Center for Advanced Study of Language. Zahra Rahimi was at the University of Pittsburgh. Susannah Paletz is currently also affiliated with the Social Data Science Center (University of Maryland) and the Applied Research Laboratory for Intelligence and Security (University of Maryland). Diane Litman is faculty at the Intelligent Systems Program and a Professor in the Department of Computer Science, as well. Some versions of the analyses were presented at the 2018 and 2022 Interdisciplinary Network for Group Research annual convention. Anonymized data and analysis scripts are available upon request.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by National Science Foundation (NSF) grants IIS-1420377 to the first author and 1420784 to the second author. This project was also supported by internal funds through the University of Maryland Applied Research Laboratory for Intelligence and Security and start-up faculty funds to the first author from the University of Maryland College of Information Studies. The opinions expressed are those of the authors and not necessarily those of the University of Maryland or the U. S. government.
