Abstract
Previous research has analyzed the impact of air pollution on life satisfaction (“happiness”) based on both subjective perceptions of environmental quality and objectively measured pollution data. This article combines these two types of environmental data in life satisfaction regressions and investigates using an instrumental variable (IV)-ordered probit approach whether perceived air quality is endogenous with respect to happiness. We find combining German 2004 socioeconomic panel data with annual readings for sulfur dioxide (SO2), NO x , and PM10 by the Umweltbundesamt (UBA; German Environmental Protection Agency) for counties that people bothered by air pollution feel less happy but that simultaneously unhappy people feel more disturbed by air pollution. Controlling for this simultaneity in an IV-ordered probit approach reveals that perceived air pollution does not have a statistically significant effect on life satisfaction anymore. We also find econometric evidence that air pollution is fully capitalized in the housing market.
Keywords
Introduction
Life satisfaction approaches as a nonmarket valuation method for environmental quality have in recent years become increasingly more popular (van Praag and Baarsma 2005; Ferrer-i-Carbonell and Gowdy 2007; Levinson 2012; Luechinger 2009; MacKerron and Mourato 2009). Welsch (2002, 2006), Welsch and Kühling (2008), Ferreira and Moro (2010) and Frey, Leuchinger, and Stutzer (2010) give an overview of how to apply happiness research to environmental valuation. This popularity is related to the weaknesses of traditional economic valuation approaches. One of the traditional valuation approaches is the hedonic pricing approach, where the value of a public good is inferred from market transactions in private goods, in particular the housing market. The methodology is based on an equilibrium assumption in the housing market where rent compensates people for environmental disamenities (e.g., the costs of self-protection measures, higher risk premiums for insuring oneself against damages as well as for all noninsurable and nonavertable losses). In the market equilibrium, housing prices adjust to equalize utility across locations. Otherwise, some individuals would have an incentive to move from places with lower utility to places with higher utility.
If the equilibrium assumption holds and people are fully compensated, we would not find any environmental disamenities effects in life satisfaction regressions. However, the assumption of a spatial equilibrium in housing markets may be flawed, and, therefore, the benefits of a clean environment may be incompletely capitalized in housing markets. First of all, moving costs and other transaction costs may be high so that spatial equilibrium would not easily be achieved, which would lead to downward-biased coefficient estimates in hedonic housing price models. Second, government regulations in the housing market may constrain market compensation, either because housing prices are not allowed to adjust rapidly (e.g., as a result of rent control) or because there are constraints on the amount and variety of housing available (e.g., due to building regulations). Third, incomplete information or a distorted perception of the risk of pollution and its health effects may easily bias hedonic estimates. In particular, pollution levels and effects are not reflected in private markets if individuals are ignorant about their existence. Another traditional valuation approach, contingent valuation, directly asks people about their willingness to pay for environmental improvements or their willingness to accept payment in exchange for bearing a particular loss. Yet the approach is criticized among others, for not measuring revealed preferences, but asking hypothetical questions and for giving rise to problems of scope or embedding (Hausman 2012). The life satisfaction approach may be considered complementary to the hedonic pricing approach: it ideally measures the residual shadow costs for which people are not already compensated in the housing or labor markets and accounts for the additional disutility of environmental costs due to the nonexisting spatial equilibrium. It is often assumed that the combined willingness-to-pay figures derived from the regression coefficients, the one from the hedonic housing price model and the other one from the life satisfaction approach, represent a more accurate valuation of environmental (dis-)amenities. Yet Frey, Leuchinger, and Stutzer (2010) argue that under certain circumstances, the hedonic method and the life satisfaction approach can also measure similar phenomena and be considered substitutes.
We directly build on previous research that has suggested that some public goods or bads may not be entirely capitalized in market prices. However, if we follow the theory, then we could use the happiness regression approach also to test whether air pollution is fully capitalized in rent and the housing market is actually in spatial equilibrium with respect to the environmental quality variable of interest. To do so properly, it is imperative that the regression model is specified correctly. Luechinger (2009), for example, used an instrumental variable (IV) approach to account for the simultaneity of local sulfur dioxide (SO2) improvements and regional economic conditions, both having an influence on well-being, however, with the opposite sign. Rehdanz and Maddison (2008) use reported subjectively perceived air pollution for their happiness regression, however, they fail to account for the endogeneity of this measure.
Inspired by the van Praag and Baarsma (2005) approach and further informed by MacKerron and Mourato (2009) and Ferreira and Moro (2010), the main contribution of this article is to extend their ideas to account for the endogeneity of a subjective perception of an externality and apply it to the happiness model by Rehdanz and Maddison (2008). Our primary interest is therefore the endogeneity between perceived air pollution and happiness. For that purpose, we use the IV-ordered probit approach developed by Rivers and Vuong (1988), which also allows explicit testing for endogeneity. Our rationale goes as follows: if air pollution affects life satisfaction above and beyond what is already capitalized in market rents (which can be analyzed using a hedonic housing price model), then ceteris paribus it should be the subjectively felt proportion of air pollution (perceived air quality) rather than the scientifically measured one. However, any subjective measure of air pollution could, in turn, be influenced by the person’s general happiness level. In other words, lower air quality may not only make people unhappy but also, simultaneously, unhappy people may feel more bothered by more air pollution.
Incomplete information and distorted risk perceptions can create the impression that people are strongly affected by air pollution, even if this is not reflected by measured pollution data. Yet Frey, Leuchinger, and Stutzer (2010) argue that the life satisfaction approach is less affected by distorted risk perceptions than the hedonic method because the welfare consequences of risk are primarily identified when the risk actually materializes. Moreover, the life satisfaction approach can capture the welfare consequences of health effects from air pollution even if individuals are ignorant about the causes, whereas incomplete information on health effects often leads to a downward bias in hedonic estimates (e.g., Smith and Huang 1995; Pope 2008).
A secondary interest of this study is, following the approach of Ferreira and Moro (2010), to identify whether or not air pollution is fully capitalized in rents, leading to the housing market being in spatial equilibrium with respect to air quality.
As in Rehdanz and Maddison (2008), we take life satisfaction data from the 2004 German socioeconomic panel (SOEP 2013), and as in Luechinger (2009), we take the air pollution measures from the 2004 Federal Environment Agency air quality monitoring data (UBA 2011). The article proceeds as follows. The second section provides a brief overview of related research. The third section elaborates on the methodology and the estimation strategy and describes the data. Results are presented in the fourth section. The fifth section concludes.
Previous Literature
A number of previous studies have explained differences in subjective well-being as a function of environmental (dis-)amenities and perceptions or attitudes. An international approach investigating NO x emissions across many countries was followed in earlier research by Welsch (2002, 2006). Using aggregate, cross-sectional data for fifty-four countries, he found a negative effect of lower air quality on happiness. The main disadvantage of these studies is that, due to lack of data, life satisfaction and pollution levels are only assessed on the most aggregate level with no indication of within-country variability or real pollution exposure of individuals. Unobserved heterogeneity across individuals is therefore assumed to even out, and unobserved heterogeneity across countries needs to be controlled for. Moreover, country borders would hinder spatial equilibrium through migration.
Most of the more recent literature uses individual-level data and is restricted to one country or region. Luechinger (2009) uses individual-level panel data from the German SOEP and combines them with SO2 data at the county level over a nineteen-year period (1985–2003). 1 To control for the simultaneity of local air quality and regional economic conditions, he uses exogenous environmental regulation, which mandates SO2 emissions control equipment as an instrument. He finds that SO2 emissions are capitalized in market rents only to a limited extent. In fact, Luechinger’s life satisfaction approach estimates a willingness-to-pay value for SO2 reduction of five to fifty times higher than compatible hedonic housing price models.
Using SOEP data for 2004, Rehdanz and Maddison (2008) incorporate into a model estimating life satisfaction subjective measures of local environmental quality, including air pollution. Their results again support the idea that air pollution is not entirely capitalized in market rents. However, this study has one main shortcoming: the endogeneity of subjective air pollution measures is not accounted for.
Both Luechinger (2009) and Rehdanz and Maddison (2008) present some evidence that there may be a degree of spatial disequilibrium with respect to air quality in Germany. Levinson (2012) undertook a study comparable to Luechinger’s (2009) for the United States. Using the General Social Survey data, he found significantly negative effects on happiness only for PM10, but not for ozone, SO2, or carbon monoxide (CO). These results could indicate that in the United States, there is spatial equilibrium for some pollutants. However, Levinson (2012) suggests that his study covers many areas without significant ozone and SO2 problems so that significantly negative impacts on happiness would not be measured correctly.
A weakness of the above-mentioned studies is the high spatial aggregation of the environmental quality variables, such as it is the case for, that is, county-level data. This issue is addressed by MacKerron and Mourato (2009) who analyze willingness to pay for air quality in London, UK, focusing on NO x . They collect original survey data for 400 Londoners and match them with pollutant concentrations in the immediate vicinity of their homes. Two happiness models are used, one with a subjective air quality measure as an independent variable and another one with scientific measurements of air quality. Both regressions indicate a statistically significant and negative impact of air pollution on life satisfaction. The authors even discussed the possibility that the subjective air quality measure variable is endogenous; however, they did not push the argument further and just compared both the subjective and scientific measurement coefficients but did not address the issue econometrically.
In the case of Ireland, Ferreira and Moro (2010) find that PM10 reduces subjective well-being. Furthermore, they discuss the limitation of environmental valuation based on just hedonic models in the presence of spatial disequilibrium, a discussion that was initiated by van Praag and Baarsma (2005). This important study addresses airport noise in Amsterdam, the Netherlands, to supplement hedonic housing price valuation results for a cost–benefit analysis. They assume spatial disequilibrium in housing market from the onset, and, unlike other authors, explicitly discuss the problem from the theoretical point of view in order to develop empirical solutions. They first regress an objective indicator of noise on life satisfaction (using measured noise levels, the frequency of flights, and a correcting factor for day and night traffic) but do not find a significantly negative influence. Similar to MacKerron and Mourato (2009), they use disaggregated noise data at the postcode level, which is reasonable given the highly localized influence of noise. Second, they replace the objective noise indicator by an ordinal indicator of subjective noise nuisance that is corrected for individual characteristics (like housing attributes, family size, and presence at home during the day). In this specification, the noise variable has a significantly negative impact on well-being. Van Praag and Barsma (2005) also compute the residual shadow costs for changes in the noise level that are not compensated in the housing market. In contrast to Rehdanz and Maddison (2008), they model subjective noise perception explicitly to include the fitted values into the final happiness regression. Yet they still fail to explicitly account for the endogeneity issue in the econometric model.
Ferrer-i-Carbonell and Gowdy (2007) study the relationship between individual environmental attitudes and subjective measures of well-being, an issue that is not treated explicitly in the above-mentioned literature using perception data. The former are conceptualized on the basis of subjective ratings and questions regarding concerns about ozone destruction and species extinction. According to their baseline regression, they find a negative coefficient for concern about ozone destruction on individual’s well-being and a positive one for concern about species extinction. They notice that environmental attitudes may correlate with excluded explanatory variables or unobserved individual personal traits that influence happiness. Ferrer-i-Carbonell and Gowdy add dummy variables to capture the natural environment where individuals live. They find that the negative effect of being concerned about the ozone layer and the positive effect of caring about animal extinction on happiness in their baseline regression still holds. To account for psychological traits, a set of ten individually self-reported mental health characteristics are included (e.g., feeling of strain, worrying a lot, etc.). Again, no major changes are observed for the environmental attitude variables. While they addressed the omitted variable problem, they fell short of identifying and explicitly modeling the endogeneity of personal traits with respect to happiness.
Endogeneity in life satisfaction approaches have been discussed from early on, mainly with respect to the income variable (Frey and Stutzer 2002). The basic argument is that not only higher income makes a person happier but also simultaneously reversed causation holds as well, and happier people are also more likely to have higher incomes. In fact, both Luechinger (2009) and Levinson (2012) do account for the endogeneity of income in their respective analyses. However, the endogeneity of subjective air pollution measures is so far not econometrically addressed in happiness research. This is the starting point of our study.
Data and Model
The principal source of data in this study is the SOEP, which is a representative, longitudinal annual survey of socioeconomic characteristics for individuals and households in Germany that was started in 1984. Repeating waves of the survey focus on different topics, such as life satisfaction and perceived air pollution. Geocoded data can be used via remote access at the level of the 439 German counties. 2
For our happiness model, we use 2004 personal data of the household heads, such as life satisfaction, income, employment status (unemployed), education (more than high school), age, gender (male), marital status, the existence of at least one child in the same household, and perceived air pollution (SOEP 2013). As the question on perceived air pollution is only asked every five years, 2009 is the only other year with available data. However, the year 2009 is marked by the economic crisis distorting the potential impact of perceived air pollution on happiness relative to 2004. Life satisfaction is ranked on an 11-point scale describing the degree of happiness. Perceived air pollution is surveyed on a 5-point scale. Table 1 exhibits the summary statistics of the final data set with all the included model variables.
Summary Statistics.
Note: max = maximum; min = minimum; SD = standard deviation.
The average life satisfaction score is about 6.7 and therefore in the upper half of the 11-point scale. The mean of ln(income) is about 10.2, which corresponds to about 27,000 Euros, and 7.8 percent are unemployed. These income and employment figures coincide closely with official statistics. About one-quarter of the participants have more education than high school. The mean age is about fifty-one years old. A little less than half of the people are male, while a little more than half are married. About 14 percent of the people have at least one child in their household. The average environmental worry score is around 1.1, pretty much in the center of the 3-point scale.
We estimate an ordered probit model to estimate a person’s level of life satisfaction. To account for endogeneity in the analysis, we apply a two-stage conditional maximum likelihood (2SCML) estimation method, developed by Rivers and Vuong (1988). The method calls for a first-step ordinary least squares (OLS) regression with the endogenous effect as the dependent variable on the IVs. The residuals of the first step together with the measured endogenous effect are then used in the second-step probit model (Wooldridge 2002). The model system looks as follows:
where y
1i
is the hypothesized endogenous effect (air pollution perceived by individual i); y
2i
* is the ordered happiness score for individual i; y
2i
is the latent happiness level for individual i; μ
j
is the happiness level threshold values for 1 ≤ j ≤ 9;
We assume (ν i , ui ) are independent and identically distributed, with a mean of zero. The parameters in equation (2) are not identified without a further normalization. Following Wooldridge (2002), we choose var(u) ≡ 1 and set ui ≡ ν i ′λ + η i with λ = cov(ν, u)/var(ν). Under these assumptions, we can now rewrite equation (2) as:
Rivers and Vuong’s 2SCML estimation method proceeds as follows: Since ui
is not observed, we first estimate this by running the OLS regression in equation (1) of the hypothesized endogenous variable y1i
on the observed variables
We include a total of two instruments. As the first instrument, we use county-based air quality readings of 2003 to 2005 annual mean concentration for SO2, NO x , and PM10 collected by the monitoring network of the 16 state environmental agencies and provided by the Federal Environmental Agency of Germany for the years 2000 to 2009 (UBA 2011). In total, there are 475 SO2 monitoring stations, 652 NO x monitoring stations, and 688 PM10 monitoring stations, which are all geocoded and attributed to their respective counties. To match pollution data and survey data, we use the mean value of concentrations per county (in case there is more than one monitor per county, which is true in about 20–25 percent of all the counties with monitors). In all, 34–40 percent of the 439 German counties do not have any monitor (149 for NO x , 177 for SO2, and 153 for PM10). However, these were all rural counties and thus had only a few SOEP participants who were typically not exposed to a lot of air pollution. 3 This process reduced the original SOEP data set only by about 10 percent from 8,810 to 7,874 observations.
Since the three air quality variables are highly correlated with each other, mainly because of them being coproduced, we summarize the information from all pollution measures into one variable using factor analysis. This has the added benefit of minimizing the danger of overidentification. In Table 2, it can be seen that the air pollution factor is more correlated with the endogenous variable than each of the three components.
Correlation Table.
As the second instrument, we take the subjectively stated variable, environmental worries, which is reported on a 3-point scale. Table 2, which reports the correlations between the depend variable, the instruments, and the endogenous variable, shows that both the air pollution factor and the environmental worries are correlated with the endogenous variable, perceived air pollution, but not with each other, which makes both instruments complementary. Furthermore, neither of the two instruments is correlated with the dependent variable, life satisfaction, which makes them both good instruments.
Results and Discussion
We ran a total of five models: The first two models were straightforward-ordered probit models assuming no endogeneity. Model (1) uses just the perceived air pollution variable; Model (2) replaces the perceived air pollution variable with the factor of SO2, NO x , and PM10. Model (3), the main model, is the IV-ordered probit model that accounts for the endogeneity of perceived air pollution by using the air pollution factor together with the environmental worries measurement as instruments. Models (4) and (5) are estimated for a sensitivity analysis, where the data set was split into urban and rural observations to check for systematic differences. The regression results for all models are displayed in Tables 3 and 4, with all error terms being clustered by the Kreise/county.
Life Satisfaction Regression Models for Air Pollution.
Note: Standard errors are clustered by county and reported in parenthesis; constant terms and cut values are not reported. Adj. = adjusted; AIC = Akaike Information Criterion; BIC = Bayesian information criterion; IV = instrumental variable.
aFactor includes SO2, NO x , and PM10. The larger the factor, the higher is the air pollution.
*p ≤ .05.
**p ≤ .01.
Sensitivity Analysis for Life Satisfaction Regression Models.
Note: Standard errors are clustered by county and reported in parenthesis; constant terms and cut values are not reported. Adj. = adjusted; IV = instrumental variable.
aFactor includes SO2, NO x , and PM10. The larger the factor, the higher is the air pollution.
*p ≤ .05.
**p ≤ .01.
Model (1) is similar to the one of Rehdanz and Maddison (2008), with perceived air pollution as an exogenous variable. All regression coefficients for the personal characteristics have the expected signs, documented in previous literature, and all but one (male) are also statistically significant at the 1 percent level: income increases happiness at a diminishing rate, unemployment depresses it, as does a disability, and more education than high school improves life satisfaction. People are the most happy when they are young and again when they are older, but life satisfaction is the lowest during the mid-life period. Men are happier than women. Although married people have a higher life satisfaction, the existence of at least one child in the household makes a person less happy. Finally, perceived air pollution decreases life satisfaction, just as it was reported in Rehdanz and Maddison (2008).
If perceived air pollution is replaced by the actual measurements of annual mean concentration for SO2, NO x , and PM10 and environmental worries, as it is done in Model (2), we find that none of the regression coefficients for the personal variables change their sign or magnitude nor do they change their significance level. However, the air pollution factor becomes positive and significant at the 5 percent level, whereas the environmental worries coefficient is not significant at all. The contradicting results in Models (1) and (2) indicate the strong possibility of the perceived air pollution variable being endogenous, which we will test in Model (3).
Model (3) is the IV-ordered probit model, which consists of two different regressions. The first regression is an OLS estimation of perceived air pollution, displayed in the left column. Both instruments, the air pollution factor and the environmental worries measurement, have the expected positive sign and are significant at the 1 percent level. This means that people living in areas with lower air quality and/or with more environmental worries do in fact feel more bothered by air pollution. Although there is no formal test for weak instruments for instrumental probit models with clustered errors, a first approximation would be Staiger and Stock (1997), who claim that for a model with one endogenous variable, an F value higher than 10, which is definitely the case for Model (3) (F value 19.02), would ensure instruments to be not weak.
The second regression of the IV-ordered probit model, which is the ordered probit estimation, and reported in the right column, shows that perceived air pollution is no longer statistically significant, while all the other regression coefficients neither change their signs, magnitude, nor their significance level. In addition, the sign of the perceived air pollution variable is now positive, just as for the air pollution factor in Model (2). The test of the null hypothesis that perceived air pollution is exogenous is rejected at the 1 percent significance level. Both the lack of significance for the instrumented perceived air pollution regression coefficients and the result of the endogeneity test are strong evidence that the variable is indeed endogenous.
In addition, we use several diagnostic tools to determine whether Model (3) is superior to the previous two models. It turns out that Model (3) outperforms both models along most measures: The (more liberal) Akaike Information Criterion weight and the (stricter) Bayesian information criterion weight are by far the highest, both indicating clearly that Model (3) (with a probability of 99.5 percent and 85.5 percent, respectively) is comparatively the best specification (Wagenmakers and Farrell 2004). Although the prediction difference from the mean is lower in Model (2), Model (3) has more correct predictions than the other two models. In addition, the mean of the absolute deviation and the square root of the squared deviation, which weights higher deviations more, are both lower for Model (3) compared to Model (2), and are the same as Model (1).
Taking everything together, it can be said that Model (3) is the strongest one, which is further evidence in support of causality between happiness and perceived air pollution running both ways. This means that once simultaneity is controlled for, perceived air pollution does not have any statistically significant impact on life satisfaction anymore.
The results of Models (4) and (5) confirm the findings of Model (3). There are only a few marginal changes in the regression coefficients and none of significance for the instrumented perceived air pollution. Therefore, we have shown that model (1) can be improved upon by accounting for the endogeneity of the perceived air pollution measure. Also, we have shown for Model (2) that using the perceived air pollution measure as an instrumented endogenous variable corrects for the counterintuitive sign of the air quality factor coefficient.
As a secondary finding, our results exhibit some evidence that subjective air quality is capitalized in rents and a spatial equilibrium exists in the housing market with respect to air quality. With this analysis, we follow the approach by Ferreira and Moro (2010), which does not require any data on housing characteristics. The basic idea is the following: a utility-maximizing agent faces the trade-off of any characteristics in question, such as air pollution, which households are willing to pay for through housing location, and all other goods and services, based on the their tastes and preferences, their income, and relative prices. Although tastes and preferences are fixed and relative prices are determined in markets, only income should matter for happiness (which would be represented as the level of the indifference curve). This means in our case, that as long perceived air pollution is not significant, happiness cannot be improved by moving to another location, which can only happen if air pollution is fully capitalized in rents with the result of spatial equilibrium, but only for the one characteristic, air quality. Of course, our results neither state that air pollution does not reduce utility nor the possibility of spatial disequilibrium along another housing characteristic. Rather it just claims that the disutility stemming from air pollution, and only that characteristic, is fully compensated by lower housing cost. Therefore, the willingness to pay for reduced air pollution can only be identified through a hedonic housing price regression and not through a happiness model.
Conclusion
Based on a life satisfaction approach with German SOEP data and air quality measurements, we could show using an IV probit model that (1) the subjective variable “perceived air pollution” is endogenous and (2) air pollution is capitalized in market rents, indicating spatial equilibrium with respect to air quality. Our result is somewhat different to what Luechinger (2009) finds; Luechinger presented evidence of spatial disequilibrium for SO2 emissions. However, he used data over a longer time period (1985–2003) during which new emissions regulation was introduced and air pollution decreased significantly. This change in air quality due to the new regulation would explain the interim spatial disequilibrium. Our findings, on the other hand, point to the fact that in the meantime, the housing market has adjusted and spatial equilibrium has been reached again. Our study also extends Rehdanz and Maddison (2008) who ignored the endogeneity of the same subjective air quality measure we use.
There are some limitations to our research: we did not have air pollution measurements for all counties causing the loss of about 10 percent of our observations. We considered generating the missing values with the help of a regression model but did not pursue this avenue for three reasons: first, modeled values are never as good as measured ones. Second, the benefit of adding 10 percent of the observations did not justify the cost of generating air quality values. And, third, all the missing measurements occur in rural areas, where there is generally much less of an air pollution issue. In fact, the regression results do not change at all if we just replace missing emission figures with the ones from the closest neighboring county.
The next limitation is that we assume linearity of the ordinal variable perceived air pollution in the first step of the IV estimation (OLS regression). In further robustness checks, which we did not report, we also ran the model with the ordinal variable perceived air pollution converted into a binary one but did not find any significantly different results.
Finally, the regression coefficients seem also robust, when each of the measured emissions (SO2, NO x , and PM10) are included separately, as well as with different conversion definitions of the ordinal scales of the perceived air pollution variable (not reported, as well). Therefore, we are confident about the validity of our findings.
Footnotes
Acknowledgments
This article benefited greatly from comments by two anonymous referees. We also thank the German Institute for Economic Research (DIW) for providing us with the remote access to the SOEP data set.
Authors’ Note
All errors or omissions remain our responsibility.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
