Abstract
A computer-aided semantic analysis (using Linguistic Inquiry and Word Count [LIWC]) examined how newspaper coverage of air pollution from 2014 to 2017 may affect the public agenda in four cities—Hong Kong, London, Pittsburgh, and Tianjin. Results show that after controlling for the real-time air quality, the agenda-setting effect was found in Hong Kong, London, and Pittsburgh, but not Tianjin. Tianjin’s reports also contained more future-framed words but fewer present-framed words than other cities.
Air pollution has become a global environmental calamity that has long-term adverse effects on human health. Exposure to air pollutants results in a variety of illnesses, including asthma, birth defects, bronchitis, leukemia, liver cancer, heart disease, and strokes (see De Marco et al., 2019). More than 92% of the global population was affected by air pollution and each year more than 6 million deaths worldwide were directly related to exposure to air pollutants, especially PM2.5 (i.e., inhalable particles with diameters smaller than 2.5 µm) and PM10 (i.e., inhalable particles with diameters smaller than 10 µm) (World Health Organization [WHO], 2016). Despite the identified negative outcomes of air pollution, it remains unclear how the air pollution coverage affects the public agenda, especially in a cross-cultural context.
This study uses the Linguistic Inquiry and Word Count (LIWC), a computer-aided content analysis software, to conduct semantic analyses of the newspaper reports on the air pollution in Hong Kong, London, Pittsburgh, and Tianjin from January 2014 to March 2017. The findings should provide insights into the news decision-making process in reporting environmental issues and contribute to understanding how the public agenda may be affected by the news coverage of air pollution coverage, which is, to the best of our knowledge, the first of its kind in environmental studies.
Literature Review
Four global cities’ air pollution reports were selected for this research, two of which are in China (i.e., Hong Kong and Tianjin), one in Europe (i.e., London), and one in the United States (i.e., Pittsburgh). The four cities were selected for this research because all of them are or had been heavily air-polluted. Due to the shared pollution history, residents in these cities could be sensitive to air pollution information and thus might have a strong information need regarding such environmental issues. Another reason for choosing the four cities is that Hong Kong and Tianjin are currently experiencing severe air pollution, while London and Pittsburgh have significantly decreased their environmental problems. The current air conditions in Hong Kong and Tianjin are far worse than that in London and Pittsburgh. New insights might be generated by comparing how news media report different levels of urban air pollution. Finally, the four cities have different economic, political, and media systems, ranging from relatively little government control in London and Pittsburgh, to some control like in Hong Kong, to tight control like in Tianjin. Hence, the same reality—air pollution—might be perceived differently by the news media outlets.
Air Pollution in Four Cities
Air pollution has been a major environmental concern in China, including Hong Kong and Tianjin. Hong Kong, with a population of more than 7 million, is among the most densely populated cities in the world (Hong Kong Census and Statistics Department, 2017). As a global financial center, Hong Kong’s economy heavily relies on finance and service sectors, but its air pollution has started to endanger the health of residents and drove away some foreign businesses. Levels of cancer-causing pollutants in Hong Kong peaked at 5 times higher than the acceptable level, surpassing the WHO standards for more than 15 years (Haas, 2017). Tianjin’s air pollution was even worse than that in Hong Kong. For years, Tianjin had been among China’s most polluted cities (“The 10 Polluted Cities in China,” 2016).
Comparatively, London and Pittsburgh have better air quality after they went through severe air pollution problems just decades ago. London had a long history of air pollution, which remains an environmental challenge today. Starting in the 1700s, London’s air quality had turned worse each year and reached the peak at the end of 19th century (Ritchie, 2017). In December 1952, the Great Smoke of London resulted in unprecedented morbidity and mortality, leading to more than 12,000 deaths (Bell, Davis, & Fletcher, 2004). The environmental crisis directly led to the Clean Air Act in 1956 that restricted coal-burning and gradually replaced coal with alternative heating sources (Klein, 2012).
In the United States, Pittsburgh had been a heavily air-polluted city for over a century from the 1870s to the 1980s, which was known as the “Smoky City” or “Hell with the Lid Off” (“Pittsburgh, the ‘Smoky City,” 2015). Since its war against air pollution started in the 1940s, Pittsburgh’s industries and residents began to adopt smokeless fuels, such as anthracite coal and natural gas, resulting in better air quality (Davidson, 1979). Significant industrial transformation happened during the 1980s when commercial, retail, and residential districts gradually replaced industrial sites (“Pittsburgh, the ‘Smoky City,” 2015). Other U.S. cities also have decreased their air pollution, however, and Pittsburgh still routinely ranks among the 10 worst U.S. cities for air pollution (and often the only one outside of California). But today, a variety of industries, such as health care, technology, and financial services, have been developed and the steel industry became the city’s history.
Although the air quality in London and Pittsburgh is better than decades ago, their history of air pollution has been regularly mentioned and reported in the news. Thus, the current air quality conditions may shape how air pollution is reported in each of the four cities and the news coverage’s agenda-setting effects may vary due to the different levels of air pollution.
Agenda-Setting Theory
The agenda-setting theory should be an appropriate theoretical framework for this study. The theory assumes a significant correlation between the news media agenda and the public agenda as the importance attributed to certain issues by news media would considerably influence the salience placed on these issues by mass audiences (McCombs & Shaw, 1972). Since the publication of the classic Chapel Hill study (McCombs & Shaw, 1972), the agenda-setting effect has been consistently confirmed in hundreds of empirical studies conducted for numerous media types and public issues in a variety of contextual settings (McCombs & Reynolds, 2009). The agenda-setting effect was also found in the field of environmental communication. Prior studies had discovered a significant relationship between news media’s focus on the environmental issues and the salience of those issues in the public agenda (Shanahan, Morgan, & Stenbjerre, 1997).
For this study, the four cities’ air quality data were collected from local air quality monitoring stations, which provide the real-time air quality data to the public through the Internet. Real-time conditions, defined as “the actual prominence of the specific issue in reality, the indicators of current conditions, or the measure of the importance of an issue in reality, usually based on statistics” (Ader, 1995, p. 300), are important factors to consider when evaluating environmental news effects. In his study of the environmental pollution coverage in The New York Times from 1970 to 1990, Ader (1995) found a negative correlation between the news media agenda and the real-time pollution conditions, suggesting that when the air quality improved, news media covered more about air pollution. In this longitudinal study, Ader (1995) argued that the environmental pollution was an unobtrusive issue that would manifest a significant agenda-setting effect. In the cases of Hong Kong and Tianjin, the air pollution was so obtrusive that it interrupted local people’s daily lives and endangered their health. In this sense, it is important to examine if the real-time air quality data could be a confounding variable that influences the public agenda through the coverage of air pollution. To researchers, an objective assessment of the reality is difficult (McLeod, Becker, & Byrnes, 1974), which may explain why few prior studies included real-time conditions into the content analysis research. Hence, the first research question was proposed:
Second-Level Agenda-Setting
News media may not only tell us what to think about, but also how to think. The transfer of the attribute salience from news media to the public is known as second-level agenda-setting effects (McCombs & Shaw, 1993). When the traditional agenda-setting centers on the “issue” salience, the second-level agenda-setting focuses more on the “attribute” salience of an issue (Kim & McCombs, 2007). On one hand, news media have the power to direct our attention to an object, while, on the other hand, they can guide our attention to certain characteristics and properties of the object, which are known as “attributes” (Kim & McCombs, 2007).
This research thus explores the impact of real-time air quality conditions on the tone of air pollution reports, that is, the positivity or negativity of the reports. Positive and negative evaluations are the fundamental aspects of any human language (Osgood, 1959). Both air pollution index (API) and temperature were found to have a significantly positive relationship with the negativity of the U.S. journalists’ coverage of the Beijing Olympics (Zhong & Zhou, 2012). Findings suggested that journalists’ decision-making was influenced by factors that were previously ignored, such as the environmental cues like weather conditions (Zhong & Zhou, 2012). The rationale was that weather conditions might impact journalists’ mood, which in turn would alter their positive and negative word choices (Zhong & Zhou, 2012). However, the study rested only on a particular context, that is, U.S. journalists’ sports reporting in a foreign city (Beijing), and no study was found to investigate such an association in other settings. Given the different air pollution conditions in the four cities, this study explores the possible differences of emotional words used in reporting the four cities’ air pollution. Hence, two more research questions were formulated:
Chinese news media often linked the issue of air pollution to the country’s economic development. A 2013 content analysis of air pollution reporting showed that 35% of articles in China Daily linked air pollution to societal and economic factors, whereas only 17% of articles mentioned its impact on human health, and only one article linked it to the ecosystem (Zu, 2015). We were unable to find any study that analyzed if air pollution reporting is correlated with the economic reporting, so this study thus poses the following research question:
When covering environmental concerns, journalists tend to view time orientation as another important attribute. The time orientation is important as it discloses the news media’s implicit attention to certain dimensions of the news issues. Chyi and McCombs’s (2004) content analysis of the Columbine school shooting reports showed that the vast majority of stories used a present frame. Following the same rationale, Houston, Pfefferbaum, and Rosenholtz (2012) examined the disaster coverage in U.S. newspapers and network evening news broadcasts, whose results confirmed that U.S. news media mainly focused on the present in reporting disaster events. Many such studies have been about the U.S. or other Western journalistic practices, with little effort devoted to analyzing the time orientation in other countries’ news reporting. Hence, the final research question was proposed to explore the time orientation in the four cities’ air pollution coverage:
Method
Four air pollution reporting corpora were collected for this research, namely Hong Kong’s corpus collected from the South China Morning Post, London’s corpus from The Times, Pittsburgh’s corpus from the Pittsburgh Post-Gazette, and Tianjin’s corpus from Beijing-based China Daily, which has a Tianjin bureau (but not a Tianjin edition). LexisNexis analyzed and assigned index terms that helped identify the attributes of news articles in its database, including industry, subject, geography, and people being related to and mentioned in the articles (LexisNexis, 2017). All articles were then downloaded from the LexisNexis database after they were identified by a subject index of “air pollution.”
Several advantages emerged when the sample was collected by using the subject index of “air pollution,” rather than relying on keyword searching. First, journalists often used interchangeable words referring to air pollution in the news reports, such as air quality, air pollutant, haze, and emission. Some news articles that did not contain the keyword “air pollution” were included in the sample as their themes were about air pollution. This was particularly evident in the London and Pittsburgh corpora, where the current air pollution situation was no longer as bad as before, and as a result journalists reported more on other aspects of air pollution. For example, one Times story reported on pollution caused by diesel car taxes was included in the sample based on the subject index approach, though none of the keywords about air pollution was used. Similarly, a Pittsburgh Post-Gazette article introduced the “Particle Falls”—an installation that can stream colorful light that looked like rainfall burst into flames when pollution particles reached a certain level—to draw residents’ attention to possible air pollution. The article was also included in the sample as it reported local air pollution. In other scenarios, a report mentioning the keyword “air pollution” was excluded as the article centered on other urban issues. Therefore, data gathered through the subject index approach could be more relevant and accurate than keyword searching.
Finally, a location index was also used in combination with the subject index to search the reports on each city’s air pollution (e.g., “air pollution” + “Tianjin” in searching for Tianjin’s air pollution reports). All news articles that met the search criteria were included in the sample, which were those published from January 2014 to March 2017. This time period was chosen because, in LexisNexis, the earliest China Daily reports on Tianjin’s air pollution were available starting in January 2014 and ended in March 2017. The 39-month period thus became the cut-off phrase for all four corpora under study, which consists of 1,982 newspaper reports.
The air pollution salience in a given newspaper (i.e., the news media agenda) was operationalized as the average keywords in each report in each month. Specifically, the LIWC counted the number of air pollution-related keywords, that is, air pollution, air pollutant, smog, and so on, and divided it by the total number of the words used in each article on air pollution. The quotient in each article was added up and the sum was then divided by the total number of articles published by the respective newspaper in each month to get the average number of keywords in a given month for that newspaper. Therefore, the higher the number was, the more salient the air pollution issue was in that month in the city. A separate set of corpora were collected that reported the four cities’ economic issues by searching the index term “economy & economic indicators.” These articles were also downloaded month by month for the same period (January 2014 to March 2017). A comparison could thus be made between the reports on air pollution and economic activities in each city.
The LIWC Analysis
All corpora were processed by the LIWC software (Pennebaker, Boyd, Jordan, & Blackburn, 2015). Compared to manual content analysis procedures featured by a series of time-consuming and tedious human-coded methods, computer-aided content analysis offered several distinct advantages, such as the stability and comparability of coding rules leading to more accuracy in the research findings, perfect coder reliability in applying the rules to texts, and exceptional speed in processing huge quantities of text data (Zhong & Zhou, 2012). The LIWC user can input any text files into the software and it then counts and calculates the percentage of words based on a variety of pre-programmed linguistically and psychologically meaningful categories. Currently, LIWC has 93 word categories, such as positive emotions, negative emotions, and future-framed words (Tausczik & Pennebaker, 2010). The validity and reliability of LIWC calculations have been well demonstrated in a series of studies (Tausczik & Pennebaker, 2010).
For our research, the percentage of words in each category was computed by the LIWC with the unit of analysis being all air pollution–related articles published in 1 month. For instance, China Daily had 24 articles that covered Tianjin’s air pollution issues in March 2017, and the LIWC calculated the percentage of each word categories accounted for the total number of words in the articles. A higher number in one category would indicate that articles in that month were more prone to this psychological state that this word category referred to. Thus, a report containing a higher number of positive words was regarded as more prone to a positive psychological state. Finally, the LIWC computed the six key variables by the following formulas:
Based on the above variables, four more variables were computed in the LIWC:
To avoid semantic biases in processing the words in the news reports, some words were particularly processed by LIWC. For example, all the words “May” were replaced by the word “Mei” in the articles published in the May months before being processed. The procedure would prevent the LIWC from mistakenly categorizing the word “May” as a future-focused word, resulting in a disproportionately high percentage of future-framed words in each of the May month samples.
The Web Search Index
The ubiquity of Internet search among the public has made the search records an accurate, real-time, and easily accessible indicator of the public attention to specific issues (Desjardins, 2018; DMR, 2019). Researchers used to measure the salience of a public issue by conducting polls, monitoring public opinions, or analyzing public attention and concerns (Kwak, An, Salminen, Jung, & Jansen, 2018). Now they are increasingly assessing the public agenda by mining Internet search data, which has been repeatedly tested and validated in numerous studies. Scharkow and Vogelgesang (2011), for instance, used data from Google Insights for Search to measure the public agenda and concluded that the approach was a promising tool for agenda-setting research. Mellon (2014) compared Google Trends data with the Gallop’s “Most Important Problem” poll data from 2004 to 2010 and found that Internet search data were a reliable way to measure the issue salience as part of the public agenda. Following them, the public agenda of air pollution in the four cities was measured by the Internet search data.
The keywords searched by the Internet users in the four cities from January 2014 to March 2017 were used as the indicator of the public agenda. For Hong Kong, London, and Pittsburgh, the English keyword “air pollution” was used and for Tianjin the Chinese keyword of 空气污染 (air pollution) was used. The search results of “air pollution” were collected from Google Trends (https://trends.google.com/trends), while the search results of “空气污染” came from a Chinese search engine, Baidu Index (www.index.baidu.com). A monthly average search index was thus created for each city’s air pollution issues, reflecting how frequently local residents in the four cities searched the keywords “air pollution” in English or Chinese.
Air Quality Indicators
Three measures of air quality data were collected for each city, namely the monthly average PM2.5, PM10, and Air Quality Index (AQI), which are the primary sources for health concerns (U.S. Environmental Protection Agency [EPA], 2016). While Mainland China and the United States employed the same AQI standards (some studies suggested differences exist between the two countries; see Andrews, 2014), Hong Kong and the United Kingdom used different criteria for the AQI. Hong Kong used the Air Quality Health Index ranging from 1 to 10+ and London employed the Daily Air Quality Index ranging from 1 to 10. The AQIs from the four cities were not comparable directly, but they provided useful information on the air conditions in these cities. In addition to the AQI, this study uses two more air quality indicators—PM2.5 and PM10—for measuring air pollution. The three indicators are significantly correlated, thus providing more comprehensive information on the air quality. In this study, the AQI, PM2.5, and PM10 data came from four sources: the Hong Kong data were from the websites of Hong Kong Environmental Protection Department: https://cd.epic.epd.gov.hk/EPICDI/air/station/?lang=en, the London data from the website: https://data.london.gov.uk/dataset/london-average-air-quality-levels, the Pittsburgh data from the U.S. Environmental Protection Agency: https://aqs.epa.gov/aqsweb/airdata/download_files.html#AQI, and the Tianjin data from the website aqistudy.cn .
Results
Table 1 reports all the means and standard deviations of PM2.5, PM10, AQI, news media agenda, public agenda, and the percentage of emotional words in the sample.
Descriptive Statistics of Air Quality Indicators, Media Salience Score, Public Salience Score, and the Percentage of Emotional Words for Four Cities, by Month, Mean (SD)
Note. All sample sizes N = 39. AQI = Air Quality Index, Media agenda = number of keywords/number of articles in 1 month, Public agenda = public web search index in a month.
Agenda-Setting and Real-Time Conditions
The Pearson correlation coefficient analyses showed that both the news media and public agendas of air pollution issues were significantly correlated in each of the four cities, for Hong Kong, r(39) = .52, p < .001, for London r(39) = .56, p < .001, for Pittsburgh, r(39) = .51, p <.001, and for Tianjin, r(39) = .51, p < .001. The results indicate that the news media agenda had a significant impact on the public agenda in the four cities. To answer
After controlling for the real-time PM10 data, the analysis showed a similar trend: for Tianjin, the news media’s agenda-setting effects no longer existed, r(36) = .26, p = .119. But the effects remained significant for Hong Kong, r(36) = .41, p < .01, London, r(36) = .46, p < .01, and Pittsburgh, r(36) = .43, p < .01. Since the AQI from the four cities were measured in different scales, no analysis could be conducted based on the absolute values of AQI. Considering the overall significant correlation between air quality indicators of PM2.5, PM10, and AQI in the four cities, it can be inferred that the same results would show a similar trend after controlling for the AQI.
Air Quality and Affective Tone
Again,
Correlations Between Air Quality Indicators and Affective Word Frequencies
Note. AQI = Air Quality Index. *p < .05, **p < .01, ***p < .001, df = 39.
Emotional Words in the Air Pollution Reports
To answer
Air Pollution Reporting and Economic Reporting
Again,
Correlations of Affective Word Frequencies Between Air Pollution Reporting and Economy Reporting
Note. *p < .05, **p < .01, ***p < .001, df = 39.
Time-Orientation Comparison
The time-orientation variable reflects the amount of news media attention devoted to one type of time-orientation words compared with the other types of time-orientation words regardless of any systematic bias. To answer
The results also reveal that the Tianjin sample (M = 49.54, SD = 4.46) also had significantly lower present relativity score than the Hong Kong sample (M = 55.39, SD = 6.72), Pittsburgh (M = 55.05, SD = 8.60) or the London sample (M = 57.36, SD = 7.09), F (3, 152) = 9.29, p < .001,
Discussion
A number of points from the data require some comment. First, the time-orientation analysis showed that Tianjin’s air pollution reports focused more on the future and less on the present than other cities under study. These results imply that the Tianjin reports tried to paint a rosier picture of air quality in the future so to alleviate local residents’ concerns on the imminent air pollution at hand.
Second, again, after controlling the air quality condition, the agenda-setting effect disappeared for Tianjin. The previous significant relationship between media agenda and public agenda could be largely accounted for by the real-time air quality conditions in Tianjin. This indicated that the impact of Tianjin’s media agenda on the public opinion was not as strong it seemed, which may be partially due to the tight control of Chinese news media or, more likely, due to China Daily being a national, English-language newspaper that probably is not well read by Tianjin’s Chinese residents.
Third, that non-significant agenda-setting effect in Tianjin certainly deserves more research, but the result may reflect concerns about trustworthiness of Chinese news media outside of Hong Kong. When public opinion was less affected by news media than by personal experiences, the credibility of Chinese news media among the public could be assumed to be low, which Tsfati (2003) called “media skepticism.” Affective word frequencies results in Tianjin further strengthened the argument that real-time air quality influenced both the news media agenda and public agenda in Tianjin.
Fourth, that air pollution reporting only in Tianjin had significantly more future-framed words and fewer present-framed words than the other three corpora requires analysis. It may result from differences in media systems between Tianjin and other cities (see “Limitations and Future Studies” section).
Previous studies (e.g., Chyi & McCombs, 2004; Houston et al., 2012) found that at least U.S. news media focus more on the present. China Daily’s reporting on Tianjin’s air pollution focused significantly more on the future and less on the present than the other three newspapers. This indicated that China Daily tried to avoid covering the Tianjin air pollution in detail, but focused more on the government’s determination to search for future solutions.
Limitations and Future Studies
Several limitations should be noted when interpreting the findings. First, all relationships found in this study may not be interpreted as causal relationships. The salience of air pollution in the news media was correlated with the public salience in the same month without including any time lag. This was mainly because the time lag for an issue like air pollution would likely be as short as a couple of days. Thus, the unit of analysis, ideally, should be weekly or daily air pollution data, not the monthly data used in this study. Further analysis may explore the question of causality by using weekly or daily data.
Second, the air pollution coverage period from January 2014 to March 2017 was selected based on the availability of archives in LexisNexis, in which the reports on Tianjin were archived starting in January 2014. Future studies could include different time periods or even compare the historic data to more recent data. For example, a study could compare news reports in the 1950s when Pittsburgh air pollution was severe to those in recent years from Hong Kong and Tianjin. Such a study should generate more in-depth findings and thus make a significant contribution to the understanding of news media portrayal of air pollution. Finally, this study identifies and explores three variables that could potentially moderate the agenda-setting effect: the real-time air pollution condition, the affective tone, and the media trustworthiness. Future research may also examine how and in what contexts these variables may alter agenda-setting effects.
Third, the newspapers were not directly comparable: the Pittsburgh Post-Gazette is local while the others are national, even international, in readership and influence; China Daily is not based in Tianjin while the other newspapers are based in the cities studied; and neither China Daily nor South China Morning Post are published in Chinese (and nearly 50% of Hong Kong residents speak English compared with less than 1% of China’s residents outside of Hong Kong).
Conclusion
An important theoretical contribution of this comparative study is shedding light on issue obtrusiveness in agenda-setting theory. In Ader’s (1995) study of agenda-setting, air pollution was regarded as an unobtrusive issue, and the correlation between the news media agenda and public agenda was strengthened when the effect of reality was controlled. Contrary to that result, this study chose four cities where air pollution was or had been an obtrusive issue. The result indicated that the correlation between the news media salience and public salience of air pollution was weakened when real-time air quality was controlled. This was particularly noticeable in Tianjin, where air pollution was the worst and most obtrusive, and the agenda-setting effect disappeared as long as the real-time air quality was controlled. Thus, real-time air quality was found to affect both the news media and public agendas in Tianjin.
The analyses discovered the scale of the negative tone of air pollution reports from high to low was in the order: London, Tianjin, Hong Kong, and Pittsburgh. Interestingly, the ranking of the strength of agenda-setting correlations for the four cities from strongest to the weakest followed the same order. This suggested that affective attributes had agenda-setting effects, which is consistent with prior studies (e.g., Wu & Coleman, 2009). Specifically, when news media covered air pollution more negatively, the public would pay more attention to the issue as it was made more salient in the news media agenda.
This study also identifies the relationship between the air pollution reporting and economic reporting. Emotional words used in the air pollution and economic reports were significantly correlated for both the Tianjin and Hong Kong samples, but not for the London and Pittsburgh samples. This might be due to the fact that many news articles in Tianjin and Hong Kong corpora discussed air pollution in the context of the economy, which seldom happened in the London and Pittsburgh corpora. The close relationship between air pollution and the economy can be interpreted as framing the air pollution as a problem resulting from humans’ economic activities rather than natural factors. In this study, both China Daily and South China Morning Post linked the two issues closely, but not so much in The Times or Pittsburgh Post-Gazette.
Generally, the professional and academic fields of environmental communication require a better understanding of news media portrayal of, and public attitudes toward, air quality because both public involvement and news media perceptions are crucial to successfully curbing air pollution. As long as urban air pollution continues to be a major global risk, more comparative studies like this are needed to systematically investigate how news media and the public perceive air pollution risks. Then, the results of such studies need to get into the hands of policymakers and the news media. These studies’ findings should empower residents to play a more significant role in local pollution-control strategies, which are also relevant to policymakers in regulating air pollution. This is never an easy task, and this study represents such an effort in achieving it.
