Abstract
Several studies show that a concept or object is more popular when it is easier to process. The present research applies this notion of processing fluency to the lyrics of all 271 top 5 songs on the United Kingdom chart for each week from 1999 to 2014. The processing fluency of the lyrics was computer scored for readability, presence of rhyme, and complexity, and popularity was assessed in terms of peak chart position and duration of tenure on the chart. After controlling for the energy of the musical component of the song, analyses showed that factors relating to the processing fluency of the lyrics predicted peak popularity but not duration of chart tenure. Significant relationships were observed between peak popularity and both rhyme saturation and basic lyric readability properties.
Several theories indicate that the subjective ease with which perceptual information is processed is related positively to the popularity of the object or concept in question. The notion of processing fluency is arguably the most prominent account of this in the literature (Oppenheimer, 2008), and research has considered this in the context of media, such as advertisements, visual art, and text (e.g., Leder, 2013; Mayer & Landwehr, 2018; Schwarz, 2004), and in consumer behavior (e.g., Aydin, 2018; Liebers et al., 2019). On this basis, the present research investigated whether the processing fluency of song lyrics can account for the commercial popularity of those songs.
Processing fluency has previously been measured using a variety of methods, including media priming, visual clarity, and language complexity (e.g., Alter & Oppenheimer, 2009; Im et al., 2010; Schwarz, 2004; Winkielman & Cacioppo, 2001). For example, Song and Schwarz (2009) found that instructions written in easier-to-read fonts (e.g., Calibri, 12pt) were related to participants’ perceptions of their competence to carry out these instructions, and Hansen et al. (2008) found that statements written on paper with higher color contrast to the font, which makes them easier to read, were regarded as more trustworthy than those printed on paper with lower color contrast. This ease of processing also has positive implications for various aspects of people’s evaluations and affective responses to the stimuli in question. For instance, Winkielman and Cacioppo (2001) found that when images were visually primed and exposed for a longer time, participants processed them more easily and enjoyed them more than images that were either not primed or exposed for a shorter time; Song and Schwarz (2009) found that food additives that were harder to pronounce were perceived as less familiar and riskier; and Dragojevic and Giles (2016) found that voices that were easier to understand gave rise to more positive emotional reactions.
A smaller number of studies show specifically that processing fluency is also associated with consumer’s retail decision-making. For example, Im et al. (2010) found that improved image quality on online shopping sites lead to greater customer satisfaction. Herrmann et al. (2013) found that the use of basic scents (one scent note) in comparison with complex scents (multiple scent notes) correlated with the amount of money spent in a store. Jaud and Melnyk (2020) found that wine bottle labels with pictures were processed more easily than labels with text only and were also associated with higher purchase intent and perceived product quality.
The recent growth of media streaming and subscription models represents a fundamental shift in how businesses approach advertising as well as audio, text-based, and visual mass media. Therefore, it is unsurprising that there is renewed research interest in predicting consumers’ responses to these media (Raza & Nanath, 2020). The findings of this are consistent with those obtained in other domains, namely, those improving processing fluency by using priming, specific literary devices, and ease of comprehension are all associated with greater liking and preference judgments (Filkuková & Klempe, 2013; Liebers et al., 2019; Newman & Schwarz, 2018). Aydin (2018) found a significant relationship between processing fluency and increased purchase intention in response to advertising. Liebers et al. (2019) found that when participants saw competing product commercials, conceptual priming of selected products resulted in more favorable judgments, and increased purchase intent. Filkuková and Klempe (2013) explored rhyming as a measure of advertisement slogan processing fluency: rhyming slogans were perceived as more likable, original, easy to recall, trustworthy, and convincing than nonrhyming slogans. With regard to other media, Newman and Schwarz (2018) showed that poorer quality radio broadcast audio was associated with the material being perceived as less fluent and less pleasant, and with the presenter being judged as less intelligent. Various other studies have demonstrated that readability scores are associated positively with the popularity of text-based media (Basyuk, 2018; Gao et al., 2019; Pancer et al., 2019; Wright, 2003). For instance, Pancer et al. (2019) observed a positive relationship between the readability of social media postings and subsequent popularity and engagement (as measured by the number of shares and likes).
A number of other recent studies have attempted to predict the sales chart performance of songs based on various structural and acoustic features of the music (e.g., Askin & Mauskapf, 2017; Herremans et al., 2014; Interiano et al., 2018; Ren & Kauffman, 2017; Zangerle et al., 2020) and lyrics (Dhanaraj & Logan, 2005; Putter et al., 2021; Raza & Nanath, 2020). However, two other studies of music popularity have considered variables similar to processing fluency. Margulis (2013) explored participants’ aesthetic responses to varying repetitive sequences of classical musical samples. Repetitive samples were judged as more interesting, enjoyable, and artistic than the nonrepetitive samples. Bannister (2018) analyzed data from the top singles on the Billboard Hot 100 from 1958 to 2018. Current top-performing songs featured time signatures and key signatures that were typical of past top-performing songs, suggesting that audience familiarity with these features may be a factor underlying song popularity. Findings such as these are consistent with Martindale et al.’s (1988) cognitive theory of aesthetics, which suggests that liking for naturalistic artistic objects is related positively to their meaningfulness, which has been operationalized as how easily the person can interpret the stimulus in question. Several researchers have provided supporting evidence (Martindale et al., 1990; North et al., 2017; Nunes et al., 2015; Whitfield, 1983), which again implies that ease of processing is associated with greater liking.
In the light of the evidence reviewed here, it is unsurprising that a handful of studies suggest indirectly that the processing fluency of music lyrics can predict the popularity of those songs. North, Krause, and Ritchie (2020) considered 1,414 sets of lyrics which were scored according to the extent to which each was typical of the corpus. The typicality of the lyrics was related positively to the chart performance of the songs, and while typicality does not necessarily equate with processing fluency, it is arguable that the two concepts are related. In a similar vein, Nunes et al. (2015) demonstrated that lexical repetition (the repetition of words without regard for grammar) in song lyrics influenced a song’s likelihood of reaching the top of the Billboard Hot 100 singles chart. Increased repetition, and thus potentially greater processing fluency, was associated with greater song popularity. Similarly, Hahn et al. (2018) found that preliterate children preferred rhymed song lyrics over nonrhyming song lyrics, and Singhi and Brown (2015) showed that the rhyming features of lyrics could predict hits better than could musical features. Schedl (2019) investigated the relationships between objective measures of processing fluency (repetitiveness, readability, and duration) and the popularity of song music and lyrics, showing that popular songs are repetitive but not always simple regarding readability. Varnum et al. (2021) reported that lyrical simplicity was related positively to the novelty of the music in a song and to the popularity of that song. Varnum et al. (2021) propose that as more novel music is produced, the lyrics of these songs become increasingly simple to compensate for the greater processing load imposed by the music. In short, several studies seem to indicate that various aspects of the processing fluency of lyrics are implicated in the popularity of the song in question.
The need for research concerning processing fluency and music lyrics is further indicated by several studies which show that the specific nature of the impact of processing fluency is influenced by several properties common to music lyrics. These studies highlight the need to determine whether processing fluency is related to listener responses to lyrics, and if so what form this relationship might take when different forms of processing fluency are considered. For instance, Miall and Kuiken (1994) showed that rhyme and alliteration not only increased reading time, and so by implication appear to hamper processing, but also improved affective responses. The authors argue that there are several explanations of this, such as rhyme encouraging the perceiver not only to reflect on the text (increasing reading time) but also to understand the emotional connotations better (increasing affective responses). In a similar vein, Menninghaus et al. (2015) found that rhymes facilitate prosodic processing but hamper semantic comprehension, so that it may be appropriate to distinguish between perceptual (or prosodic) fluency and conceptual (or semantic) fluency.
Schotanus et al. (2018) found that the popularity of psalms could be predicted by “balanced motion” (e.g., line length, lines per stanza) and the degree of repetition they contained, and by implication their processing fluency. However, the authors also note that some aspects of balanced motion and repetition were related to popularity more closely than were others. In conjunction, findings such as these indicate the need to consider different forms of processing fluency when studying artistic products and suggest that different measures of processing fluency may give rise to different relationships with popularity. Similar arguments are made by Schotanus (2021) concerning analysis of the popularity of two Bob Dylan songs.
Previous research findings indicate that specific aspects of what we might collectively term processing fluency can predict positive affective responses and commercial popularity, and that lyrics contribute to song popularity. Moreover, several variables indicative of processing fluency may underlie the relationship between lyrics and song popularity, although there is also evidence that different measures of processing fluency may give rise to differing relationships with popularity and enjoyment. The current research therefore aimed to directly assess the relationship between various manifestations of lyric processing fluency and song popularity. Various aspects of rhyming, readability, and word complexity were employed as measures of the processing fluency of song lyrics due to their prominence in the literature. Four measures of the songs’ commercial success were used: “peak popularity” represented the song’s peak chart position and “hit duration” represented the duration of the song’s tenure on the charts, and values for both were employed for the United Kingdom only and also for the United Kingdom and United States combined. There were four hypotheses:
Hypothesis 1 (H1). There will be a positive relationship between various aspects of the processing fluency of the lyrics and U.K.-U.S. Peak Popularity.
Hypothesis 2 (H2). There will be a positive relationship between various aspects of the processing fluency of the lyrics and U.K.-U.S. Hit Duration.
Hypothesis 3 (H3). There will be a positive relationship between various aspects of the processing fluency of the lyrics and U.K. Peak Popularity.
Hypothesis 4 (H4). There will be a positive relationship between various aspects of the processing fluency of the lyrics and U.K. Hit Duration.
To test these hypotheses, this research utilized a computerized data set of 271 popular songs from 1999 to 2014, accounting for all those songs that reached the top 5 singles sales charts in the United Kingdom chart each week throughout the time period in question.
Method
Research design
The research utilized a correlational design. Six predictor variables were used to capture distinct aspects of processing fluency, namely, Rhyme Saturation, Multisyllabic Rhymes, Short Unique Rhymes, Basic Lyric Readability Properties, Computerized Reading Difficulty, and Complexity. Four criterion variables were measured, namely, U.K.-U.S. Peak Popularity, U.K.-U.S. Hit Duration, U.K. Peak Popularity, and U.K. Hit Duration. A significant amount of research in the field of experimental aesthetics shows that various aspects of responses to music are influenced by the extent to which the music possesses so-called arousal potential, energy, or high information content (e.g., Berlyne, 1971; Heyduk, 1975; Vitz, 1966). It is also reasonable to suspect that lyricists write words that will complement the music in question (see North , Krause, & Ritchie 2020), suggesting that some attempt to control for variations between each song’s musical components is warranted. Therefore to control for any confounding effects of the song’s acoustic components, a control variable, “Energy,” was utilized.
Data set and lyrics
The raw data set is a subset of a larger data set used in previous studies (e.g., North et al., 2019; North, Krause, & Ritchie, 2020). Music industry professionals cataloged song metadata such as genre, artist, and record company from more than 400,000 record labels to produce this master data set containing more than 38 million pieces of music. The top 5 songs in the United Kingdom official chart for each week from 1999 to 2014 (N = 271; www.officialcharts.com) were employed as the subset of the master data set used for this research. A song appeared in the data set only once, even if it spent multiple weeks in the top 5. The lyrics were collected from web-based sources (primarily www.azlyrics.com) and were cross-checked with a second internet source to verify consistency. In the instance of discrepancies between versions (often due to multiple versions of songs, such as remixes), a third source was consulted, and the version intended for radio airplay was identified. Various processes were employed to clean the lyrics and to ensure that the computer could interpret them consistently: these involved spell checking; and batch search and replace word processing operations, so that instances of vernacular such as “cudda” were replaced with “could have,” standard contractions were expanded (so that, for instance, “couldn’t” was replaced with “could not”), and instances of “chorus,” “repeat verse,” or “x3” were replaced with the full relevant text.
For the hierarchical multiple regression analyses used in the present research, Tabachnick and Fidell (2013) suggest that the total population (N) should ideally be 50 + 8(k) when testing a full regression model, and alternatively 104 + k when testing individual predictors. It is common to calculate both and use the larger N of the two. An a priori power analysis was conducted using G*Power 3.1 (Faul et al., 2009) based on the findings of North, Krause, et al. (2020), Nunes et al. (2015), and Varnum et al. (2021). The G*Power analysis suggested a sample size of 103 based on a medium Cohen’s f2 effect size and a target power of .80. Following Tabachnick and Fidell (2013), 50 + 8(7) = 106 and 104 + 7 = 111, and so the present data set of 271 songs was deemed sufficient to identify any significant relationships (Cohen, 1988).
Measures
Multiple computerized data analysis programs were utilized to produce processing fluency data for the lyrics, as detailed below.
Rhyming variables
The rhyming aspect of processing fluency was measured using the Rhyme Analyzer program (Hirjee & Brown, 2010). This quantifies different rhyming qualities of songs such as rhymes per line, end rhyme scores, and complex rhymes. A full list of the variables is shown in Appendix A in Supplementary Materials online. Given the number of variables measured, the data for all the Rhyme Analyzer variables were subjected to exploratory principal component analysis with oblique rotation, which identified three factors explaining 44.49% of the variance, namely, Rhyme Saturation, Multisyllabic Rhymes, and Short Unique Rhymes (see Appendix B in Supplementary Materials online). Factor scores were calculated and used in subsequent analyses. The three sets of factor scores had small inter-correlations (ranging from r = −.29 to .30), indicating that different constructs of rhyme were identified. Rhyme Saturation describes the extent to which rhymes appear in the song. A higher score suggests more rhyming in the lyrics, and thus greater processing fluency. The variables loading onto this factor included rhymes per syllable, line internals per line, chaining per line, rhyme density, rhymes per line, perfect rhymes, and links per line. Multisyllabic Rhymes recorded instances of rhymes with two or more syllables. An example of this is “the best rhyme is that/a cat rhymes with hat.” A higher score indicates the presence of more Multisyllabic Rhymes in the lyrics, indicating greater processing fluency. Short Unique Rhymes concerns average word length and the proportion of words in a rhyming pair’s second line that do not appear in the first line. An example of a smaller Short Unique Rhyme would be “The cat in a hat/The cat on a mat,” where more similar one-syllable words are present. A larger score for Short Unique Rhymes indicates that the rhymes contain more syllables and unique words, indicating lower processing fluency.
Readability variables
The readability aspect of processing fluency was measured using the online application Readable (https://readable.com; Readable.com by Added Bytes Ltd, n.d.). This assesses popular readability formulae, such as the Flesch Reading Ease Score and The Gunning Fog Index, and readability features, such as Sentence Count and Letters per Word. A full list of variables is shown in Appendix C in Supplementary Materials online. There is currently no clear consensus on which readability formula is (a) the most valid method of capturing readability and (b) the most appropriate for different research designs (van Oosten et al., 2010; Wang et al., 2013). Given this lack of consensus and the number of variables measured by Readable, the data were subjected to an exploratory principal component analysis with oblique rotation, which produced two factors, Basic Lyric Readability Properties and Computerized Reading Difficulty, which explained 45.49% of the variance (see Appendix D in Supplementary Materials online). Factor scores were calculated and used in subsequent analyses. The correlation between the two sets of factor scores was small (r = .10), indicating that they assessed different aspects of readability. Basic Lyric Readability Properties captured the presence of conventional readability characteristics in the lyrics. A lower factor score for Basic Lyric Readability Properties indicates that the lyrics are easier to comprehend and require less reading expertise. Thus, a lower score indicates greater processing fluency. The specific variables loading onto this factor included word count, syllable count, words with one syllable, words with six letters, unique word count, and grammar error count. Computerized Reading Difficulty captured overtly computerized, formula-driven measures of the lyrics’ reading difficulty. A lower factor score for Computerized Reading Difficulty indicates that the lyrics require a lower objective reading level/age, and hence, higher processing fluency.
Complexity
The complexity of the lyrics was assessed using Diction 7.0 (Hart et al., 2013), which has been used previously in several studies (e.g., Cook & Krupar, 2010; see extensive list at http://www.dictionsoftware.com/published-studies). The program measures 36 semantic variables, although only complexity is clearly related to processing fluency, representing a simple measure of the mean number of characters per word. The variable reflects Flesch’s (1951) complexity assumption that more complex text increases abstractedness and reduces comprehensibility. A higher score indicates more characters per word and therefore lower processing fluency.
Energy
Energy scores captured the music’s acoustic and structural properties, and were used as a control variable. Scores were included in the master data set and are calculated via a process described by North et al. (2018, 2019). In brief, this involves a trained artificial intelligence system analyzing each piece of music using a series of algorithms based on music perception research which utilize factors such as beats per minute, tone, and pitch to determine how energetic the music is. Numerous classic theories (e.g., Berlyne, 1971; Osgood, 1952; Russell, 1978) highlight that factors analogous to “energy” are influenced by structural and acoustic properties of music and underlie evaluative responses across a broad range of domains.
Popularity scores
The research used two measures of popularity, namely, Peak Popularity and Hit Duration, each of which was calculated separately for both the United Kingdom only and United Kingdom and United States together, leading to four variables in total, namely, U.K.-U.S. Peak Popularity, U.K.-U.S. Hit Duration, U.K. Peak Popularity, and U.K. Hit Duration. Greater popularity is indicated by higher scores on each measure. The popularity scores were included in the project’s master data set using a process detailed in North et al. (2017). In brief, the Peak Popularity measures represented the highest position the song reached on a variety of sales and airplay charts. The respective charts were weighted according to their geographical range of coverage and generality, so that, for instance, an appearance on the U.K. singles chart was assigned a weighting of 1, whereas an appearance of the song on an album that appeared in the U.K. album chart was assigned a weighting of 0.5. For each song per chart, the Peak Popularity score was calculated as 1 divided by the sum of (peak chart position multiplied by chart weighting). Hit Duration scores were determined by the total number of weeks that each song appeared on the charts, irrespective of chart position. Note that although the songs included in the research were selected on the basis of their appearance in the top 5 singles chart, the popularity data included information from both singles and albums charts, since the appearance of a song on a successful album is clearly also relevant to its popularity.
Results
Examination of the Mahalanobis distance scores (minimum = 0.90, maximum = 58.24; Tabachnick & Fidell, 2013) identified multivariate outliers. This was expected because the research explicitly concerns songs with extremely high levels of popularity. High skewness and kurtosis for all of the criterion variables were identified, and so following statistical conventions (Allen & Bennett, 2012; Tabachnick & Fidell, 2013), the four criterion variables were subjected to logarithmic (log10) transformation to minimize the skewness of the data. Collinearity statistics for tolerance and variance inflation factors (tolerance > 0.1, VIF 10) were all within acceptable bounds (Allen & Bennett, 2012; Tabachnick & Fidell, 2013). The normality of residuals was assessed visually through the normal Q-Q plots of standardized residuals. The criterion variable plots indicated that the data points followed a normal distribution. However, as expected, some slight violations of assumptions were detected for the predictor variables. Linearity and homoscedasticity were visually assessed through the scatterplot of standardized residuals against standardized predicted values. Overall, there were no discernible patterns within the scatterplots, indicating that the assumptions of linearity and homoscedasticity were met.
A hierarchical multiple regression analysis was conducted to test whether the lyric processing fluency scores could predict U.K.-U.S. Peak Popularity. Energy was entered in Step 1 and was nonsignificant, R2 < .001, F(1, 269) = 0.03, p = .875. In Step 2, the lyric processing fluency variables were added, accounting for an additional significant 11% of the variance in U.K.-U.S. Peak Popularity, ∆R2 = .110, ∆F(6, 263) = 5.42, p < .001. In combination, all of the predictor variables explained a significant 11% of the variance in U.K.-U.S. Peak Popularity, R2 = .110, F(7, 263) = 4.65, p < .001. Further details are presented in Table 1. Specifically, Rhyme Saturation scores were related positively to U.K.-U.S. Peak Popularity, whereas the Basic Lyric Readability Properties were related negatively to U.K.-U.S. Peak Popularity, indicating that more easily processed lyrics were more popular.
Hierarchical Regression Results for U.K.-U.S. Peak Popularity.
CI: confidence interval; LL: lower limit; UL: upper limit.
Note. Overall model: R2 = .110, F(7, 263) = 4.65, p < .001.
p < .05. **p < .01. ***p < .001.
A second hierarchical multiple regression analysis was conducted to test whether the lyric processing fluency scores could predict U.K.-U.S. Hit Duration. Energy was entered in Step 1 and was nonsignificant, R2 = .005, F(1, 269) = 1.34, p = .249. In Step 2, the lyric processing fluency predictor variables were added, accounting for a nonsignificant additional 4.4% of the variance in U.K.-U.S. Hit Duration, ΔR2 = .044, ΔF(6, 263) = 2.03, p = .062. In combination, all of the predictor variables explained a nonsignificant 4.9% of the variance in U.K.-U.S. Hit Duration, R2 = .049, F(7, 263) = 1.94, p = .064 (see Table 2).
Hierarchical Regression Results for U.K.-U.S. Hit Duration.
CI: confidence interval; LL: lower limit; UL: upper limit.
Note. Overall model: R2 = .049, F(7, 263) = 1.94, p = .064.
p < .05. **p < .01. ***p < .001.
A third hierarchical multiple regression analysis was conducted to test whether the lyric processing fluency scores could predict U.K. Peak Popularity. Energy was entered in Step 1 and was nonsignificant, R2 = .001, F(1, 269) = 0.25, p = .614. In Step 2, the lyric processing fluency predictor variables were added, accounting for an additional significant 10.6% of the variance in U.K. Peak Popularity, ΔR2 = .106, ΔF(6, 263) = 5.15, p < .001. In combination, all of the predictor variables explained a significant 10.7% of the variance in U.K. Peak Popularity, R2 = .107, F(7, 263) = 4.49, p < .001. Further details are presented in Table 3. Rhyme Saturation scores were related positively to U.K. Peak Popularity, whereas the Basic Lyric Readability Properties were related negatively to U.K. Peak Popularity, indicating that more easily processed lyrics were more popular.
Hierarchical Regression Results for U.K. Peak Popularity.
CI: confidence interval; LL: lower limit; UL: upper limit.
Note. Overall model: R2 = .107, F(7, 263) = 4.49, p < .001.
p < .05. **p < .01. ***p < .001.
A final hierarchical multiple regression analysis was conducted to test whether the lyric processing fluency scores could predict U.K. Hit Duration. Energy was entered in Step 1 and was nonsignificant, R2 < .001, F(1, 269) = 0.02, p = .886. In Step 2, the lyric processing fluency predictor variables were added, accounting for an additional nonsignificant 4.6% of the variance in U.K. Hit Duration, ΔR2 = .046, ΔF(6, 263) = .2.11, p = .053. In combination, all of the predictor variables explained a nonsignificant 4.6% of the variance in U.K. Hit Duration, R2 = .046, F(7, 263) = 1.81, p = .086 (see Table 4).
Hierarchical Regression Results for U.K. Hit Duration.
CI: confidence interval; LL: lower limit; UL: upper limit.
Note. Overall model: R2 = .046, F(7, 263) = 1.81, p = .086.
p < .05. **p < .01. ***p < .001.
Discussion
This study investigated whether various factors relating to the processing fluency of lyrics could explain song popularity. Both measures of Peak Popularity were related to variables that would influence the processing fluency of the lyrics, so that lyrics that were easier to process were associated with songs reaching higher chart positions. This supports H1 and H3. Both measures of Hit Duration were not related to the processing fluency of the lyrics, so that lyrics that were easier to process were not associated with songs spending longer periods of time in the charts. This does not support H2 and H4.
The significant results concerning Peak Popularity are attributable primarily to two of the processing fluency variables, namely, Rhyme Saturation and Basic Lyric Readability Properties. As noted earlier, Rhyme Saturation describes the extent to which rhymes appear in the song and captures features such as rhymes per syllable, rhyme density, rhymes per line, perfect rhymes, and links per line. Basic Lyric Readability Properties captures features such as word count, syllable count, words with one syllable, words with six letters, unique word count, and grammar error count. Lyrics that are simpler in these terms achieve a higher peak position on the charts. It is also interesting that popularity was not predicted by the remaining processing fluency variables, which included several measures used extensively in previous research: the present results show that specifically peak popularity is related to rhyme saturation and basic readability properties, rather than that all aspects of popularity are related to all aspects of processing fluency. This is consistent with the research cited above showing that it is important to consider specific aspects of processing fluency and popularity rather than treating “processing fluency” and “popularity” as single variables.
The differing results concerning Peak Popularity versus Hit Duration suggest that easier readability and higher rhyme saturation in the lyrics are related to short-term spikes in popularity rather than popularity over more extended periods of time. It is tempting to speculate that this may arise because these properties of the lyrics lead to the songs being quickly perceived as boring or repetitive, so that while the lyrics are quickly understood, they are also quick to lose the high level of popularity that results from this processing fluency. Future research could investigate the mediating role of processing fluency in the effect of repetition on liking for lyrics (cf. Heyduk, 1975; Johnston & Childers, 2021).
In this context, it is particularly interesting to relate the present findings to those of two of the studies cited above. First, Nunes et al. (2015) showed that lyric processing fluency, as measured by lexical repetition, contributed to a song’s higher chart position and faster chart success, but that excessive lyric repetition was associated with decreased popularity. Our speculative interpretation of the differing findings here concerning Peak Popularity and Hit Duration is arguably consistent with Nunes et al.’s findings. Note that such an argument is also consistent with Huron’s (2013) habituation-fluency theory of repetition, which argues that internal repetition within a piece of music leads to processing fluency, and in turn a positive hedonic response, but that this repetition also leads to habituation which itself reduces a listener’s responsiveness to the music. These ideas may inform future research on processing fluency in the context of lyrics, particularly given Huron’s (2013) argument that “With the possible exception of dance and meditation, there appears to be nothing else in common human experience that is comparable to music in its repetitiveness (p. 7).” In the context of music and lyrics, “processing fluency” has the potential to be an extremely complex concept that interacts with several other musical features, so that, for example, the repetitiveness of the music interacts with processing fluency to influence the point in time at which peak popularity occurs.
Second, we observed a significant relationship between some aspects of lyric processing fluency and Peak Popularity: in contrast, North, Krause, and Ritchie (2020) identified a positive relationship between the typicality of lyrics relative to the remainder of the cohort and the duration of the chart tenure of the song in question, and that the relationship between typicality and duration of tenure was stronger than that between typicality and peak chart position. If we consider the present findings in the context of North et al.’s, they suggest that peak popularity is related to the simplicity of the lyrics (at least in terms of rhyme saturation and basic readability), whereas the duration of popularity is related more closely to lyrics that are typical. There may be a subtle distinction between typicality, on one hand, and rhyme saturation and basic readability, on the other, but this distinction may be important in explaining whether a song enjoys a longer period of popularity versus a higher peak in popularity.
Before concluding we should acknowledge several limitations of the present research. First, for the sake of clarity, we should stress that there is no suggestion that processing fluency could or should be the primary determinant of the commercial success of a song. In the light of the numerous psychological and marketing factors (e.g., advertising, radio airplay, artist management) that influence commercial success, however, it is interesting that rhyme saturation and basic readability can predict the peak chart position that a song will reach. Second, the present data do not allow us to determine the extent to which any given individual’s liking for a song is influenced by processing fluency. Similarly, the present research concerned only very successful pop songs, and we cannot draw any conclusions concerning whether the findings reported here generalize to the millions of less popular tracks within other genres or cultures to which someone might listen, or whether the present findings can be generalized to future chart performance. Third, our conclusions concerning processing fluency are only as valid as the variables and computer software used to operationalize this. With regard to readability, for example, the use of complex metaphors within lyrics could mean that even simple words and grammatical structures can nonetheless be found in lyrics that are difficult to comprehend. Similarly, lyrics inevitably use different forms of sentence structure and grammar compared with conventional prose, whereas computerized measures of processing fluency are typically developed for analysis of the latter. The inability of computers to cope well with metaphors and similar artistic devices clearly hampers all current research on music lyrics.
This leads to a fourth point, namely that the use of computerized measures throughout makes it impossible for the present research to distinguish between objective properties of the lyrics that might be expected to influence processing fluency and a listener’s subjective experience of finding it difficult to comprehend or otherwise process those lyrics as a consequence of the properties in question. Computerized measures make the implicit assumption that objectively complex lyrics are subjectively difficult to process, and that objectively simple lyrics are subjectively easy to comprehend. It would be extremely interesting to conduct further research that compared measures of the processing fluency of lyrics obtained by computerized methods against similar measures obtained from human participants. We suspect that the two sets of data may differ when considering lyrics containing metaphors, which make reference to broader cultural events and movements, or which otherwise require comprehension beyond the literal meaning of the words and use of standard grammar. Finally, the present results do not tell us why specific aspects of processing fluency influence popularity. For example, do rhymes make the lyrics more memorable, easier to understand and interpret, or have some other specific effect altogether?
The current findings demonstrate that basic readability and rhyme saturation of lyrics can predict a song’s peak chart position, and so provide evidence that one form of popularity is related to two aspects of processing fluency. This finding contributes to our understanding of processing fluency and the prediction of music popularity, and of course indicates more generally that lyrics themselves influence the popularity of music. Future research may attempt to replicate or extend these findings to different samples of music, to explore the more precise relationships between particular lyrical features and particular measures of popularity, or to investigate the relationship between processing fluency and repetition in predicting differing forms of popularity.
Supplemental Material
sj-docx-1-pom-10.1177_03057356221118400 – Supplemental material for Song popularity and processing fluency of lyrics
Supplemental material, sj-docx-1-pom-10.1177_03057356221118400 for Song popularity and processing fluency of lyrics by Sorcha Melvill-Smith, Amanda E Krause and Adrian C North in Psychology of Music
Footnotes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
