Recent interest in modern regression modelling has focused on extending available (mean) regression models by describing more general properties of the response distribution. An alternative approach is quantile regression where regression effects on the conditional quantile function of the response are assumed. While quantile regression can be seen as a generalization of median regression, expectiles as alternative are a generalized form of mean regression.
Generally, quantiles provide a natural interpretation even beyond the 0.5 quantile, the median. A comparable simple interpretation is not available for expectiles beyond the 0.5 expectile, the mean. Nonetheless, expectiles have some interesting properties, some of which are discussed in this article. We contrast the two approaches and show how to get quantiles from a fine grid of expectiles. We compare such quantiles from expectiles with direct quantile estimates regarding efficiency. We also look at regression problems where both quantile and expectile curves have the undesirable property that neighbouring curves may cross each other. We propose a modified method to estimate non-crossing expectile curves based on splines. In an application, we look at the expected shortfall, a risk measure used in finance, which requires both expectiles and quantiles for estimation and which can be calculated easily with the proposed methods in the article.
Quantile regression allows the estimation of the effect of covariates on the distribution of a response variable. The idea has been suggested by (Koenker and Bassett, 1978) and is well elaborated with numerous extensions in (Koenker, 2005). The underlying regression model for the -quantile with is specified as
with as response variable, and as the -quantile which may depend on covariates , say, e.g., through the linear model . Unlike classical regression where a zero mean is assumed for the residuals, in quantile regression one postulates that the -quantile of the residuals is zero, i.e., . Estimates for are obtainable through the minimizer of the weighted sum
where
are asymmetric weights. Numerically, (1.1) can be minimized by linear programming, see, e.g., (Koenker, 2005).
As an alternative to the quantile regression, (Aigner et al., 1976) and (Newey and Powell, 1987) proposed to replace the distance in (1.1) by a quadratic term leading to the asymmetric least squares
where the minimizer , say, is called (estimated) expectile. The underlying regression model now is
under the assumption that the -expectile of the error terms is zero. The expectile may again depend on covariates, e.g., through the linear expectile model . Expectile estimation is thereby a special form of M-quantile estimation Breckling and Chambers, 1988 and expectile regression has seen some increasing interest in the last years (Schnabel and Eiler, 2009b; Pratesi et al.,, 2009; Sobotka and Kneib, 2012; Guo and Härdle, 2013). An overview about methods focusing on estimation procedures regarding more features of the data than its centre (including semiparametric expectile and quantile regression) can be found in (Kneib, 2013).
In this article, we contrast quantile and expectile regression and propose some extensions to expectile estimation to link it to quantiles. A comparison of the two routines might remind us of the story of David and Goliath just by comparing the number of citations: about 1850 for (Koenker and Bassett, 1978) referring to quantiles and about 100 for (Newey and Powell, 1987), as of November 2013. Quantiles are certainly more dominant in the literature due to the fact that expectiles lack an intuitive interpretation while quantiles are just the inverse of the distribution function. Numerically, as seen by comparing (1.1) to (1.2), quantiles ‘live’ in the world while expectiles are rooted in the world. This by itself has several consequences. Quantiles need linear programming for estimation while expectiles are fitted using quadratic optimization. Beyond all discrepancies between quantiles and expectiles, it is important to note that both are related in various ways. (Jones, 1994) shows that expectiles are in fact quantiles of a distribution function uniquely related to the distribution of . (Yao and Tong, 1996) give a similar result by showing that there exists a unique bijective function such that , where is defined through
with as the partial moment function and as cumulative distribution of (see also De Rossi and Harvey, 2009). Note that . In this article, we will make use of relation (1.3) and relate quantile estimates to expectile-based quantile estimates , where is an estimated version of in (1.3). One of the key findings of the article is that are numerically more demanding than quantile estimates, but, as simulations show, they serve as quantile estimates which can be even more efficient than the empirical quantile itself.
In quantile regression, a numerical problem in applications are the so-called crossing quantile functions. These occur if for estimated quantiles, one gets for for some value (in the observed range of the covariate), where . Several methods, algorithms and model constraints have been proposed to circumvent the problem. Bondell et al., 2010 made use of linear programming. They also gave a good overview about earlier proposals including (Koenker, 1984), (He, 1997), (Wu and Liu, 2009) or (Neocleous and Portnoy, 2008). Chernozhukov et al. (2010) rearranged the fitted (linear) curves into a set of non-crossing curves, whereas (Dette and Scheder, 2011) used marginal integration techniques and monotone rearrangements. (Schnabel and Eilers, 2013b) proposed the so-called quantile sheets where crossings are circumvented by penalization. The problem of crossing curves occurs in principle in the same way for expectile regression. We demonstrate with simulations that crossing expectiles occur less frequently. This implies that less attention is needed to avoid crossing expectiles compared to crossing quantiles.
Quantile regression, as well as expectile regression, can be extended to non-parametric functional estimation. For quantile estimation, (Koenker et al., 1994) proposed spline-based estimation. (Bollaerts et al., 2006) made use of penalized B-splines with an penalty. Recently (Reiss and Huang, 2012) suggested quantile estimation based on penalized iterative least squares (see also Yuan, 2006). For expectiles, smooth estimation has been pursued by, e.g., (Pratesi et al. 2009), and (Schnabel and Eilers 2009b). Other semi- or non-parametric extensions of quantile regression which allow for flexibility are varying coefficient models. Here smoothness is achieved by letting the quantile not only depend on (unknown) coefficients which are constant, but on (smooth) coefficient functions (see, e.g., Honda, 2004; Noh et al., 2012 or Andriyana et al., 2014). The idea of smoothing can be extended by assuming that a ‘set’ of -quantile curves smoothly depends on both, the covariate and . Using B-splines this easily allows us to incorporate non-crossing conditions, as in (Bondell et al., 2010) or (Muggeo et al., 2013), for quantile estimation. The estimation of quantile sheets is proposed in (Schnabel and Eilers, 2013b) (Schnabel and Eilers 2009a) give a description of non-crossing smooth expectile curves (see also Eilers, 2013). Both approaches are based on penalized spline smoothing. Our approach to estimate expectile sheets is very similar to an extension formulated in Schnabel and Eilers, 2009a, who propose the estimation of expectile sheets using a tensor product of B-splines combined with a penalty in direction of the asymmetry parameter and covariate. But it is different in the sense that we are using a linear B-spline in direction of the asymmetry parameter and avoid the use and choice of a smoothing parameter in that same direction.
Expectiles might not gain popularity as much as quantiles, but we think they deserve their niche. For example, (Aigner et al., 1976) construct expectiles to estimate production frontiers and give an additional argument for using expectiles by stating that expectile regression is a way to treat asymmetric consequences as it places different weights on positive and negative residuals. But there are other fields which demand for expectiles as well, for example the field of risk measures for financial assets. (Ziegel, 2013) argues for the use of expectiles as a risk measure as they have desirable properties. Another frequently used risk measure is the ‘expected shortfall’, which needs the calculation of both, quantiles and expectiles. The expected shortfall (ES) is a trimmed mean, that is, the mean of a random variable conditional that its value is above (or below) a certain quantile. The ES can be written as a function of both, the quantile and the expectile for a level . Estimation of the ES has been recently proposed by (Leorato et al., 2012) by employing the integrated (conditional) fitted quantile regression function (see also Wang and Zhou, 2010). We extend an idea of (Taylor, 2008) and use the fitted quantiles and expectiles for the estimation of the ES. This connection becomes extremely useful for calculating the ES as it depends both on expectiles and their corresponding quantiles (as described by Taylor, 2008).
The article is organized as follows. In Section 2, we compare and contrast quantiles and expectiles, both theoretically and based on simulation. In Section 3, we look at quantile and expectile regression before Section 4 provides extensions and examples. Section 5 concludes the contest of David and Goliath.
Expectiles and quantiles
Quantiles from expectiles
Quantiles as well as expectiles uniquely define a distribution function. Let denote the continuous distribution function of a univariate random variable , which for the sake of simplicity for now is assumed to not depend on any covariates . The distribution is uniquely defined by the quantile function for or by the expectile function for . First we show how to numerically derive the quantile function from the expectile function . With other words we demonstrate how the transfer function in (1.3) can be derived numerically, which in practice allows us to calculate the quantile function from a fitted expectile function , say. Note first that expectiles are defined through
which needs to be solved numerically for . Let, therefore, be a dense set of knots covering the interval. In the following we denote to simplify the notation. In principle, a fine grid of expectiles is all we need to estimate the distribution function or quantiles. If the original data is still at hand, one can set and just as well , where and serve as tuning parameters. In our simulations in Section 2.2 and in the examples in Sections 2.4 and 4.2, we set , . If one chooses to be close to zero (and analogously to be close to 1), then there is obviously just a small difference between the minimal expectile and the minimal observed value of the data.
We now solve (2.1) for , and denote the resulting estimator of the cumulative distribution function with . To obtain , we estimate the distribution function at the estimated expectiles through
for non-negative steps and . Making use of linear interpolation between adjacent values of leads to
with and as (linear) constraint. This setting now allows to calculate from a set of estimated expectiles. Details are given in the Appendix.
Defining the linear interpolation for allows us to invert to obtain quantile estimates based on estimated expectiles. We define these as
Note that with the definition of for , we get an explicit estimate of as a by-product. This is derived by interpolating and , which defines , and by taking the inverse we get .
We need that which must be fulfilled since . Numerical inaccuracy may yield negative values for , in particular, for close to 0 or 1. Estimation under the linear constraint circumvents the problem. Moreover, the estimation can get numerically unstable, which is easily eliminated by imposing a small penalty on the calculated values . In fact, defining the density corresponding to as with for , we want to be ‘smooth’. In other words, should be small for small. Given that is a step function, this translates to imposing the penalty
Details are provided in the Appendix. Note that the calculation of quantiles from expectiles is somewhat numerically demanding. Alternative approaches to estimate quantiles from expectiles are described in (Efron 1991) and (Schnabel and Eilers 2013a). (Schnabel and Eilers 2013a) propose to estimate non-crossing expectile curves using a so-called expectile bundle.Within the expectile bundle crossing curves are prevented but, as a location scale model is assumed, one also loses flexibility. Schnabel and Eilers (2013a) also give a description of how to estimate the density (and therefore, quantiles) from a set of expectiles by using penalized least squares. The approach by (Efron 1991) is a more naive way to get an estimation for quantiles on a bases of expectiles. (Efron 1991) proposed to estimate a high number of expectiles and to count the number of observations lying below each expectile. He calls the resulting estimates percentiles. Taylor (2008) also uses this method to estimate quantiles from expectiles to calculate the ES. The method proposed in (Efron 1991) clearly has the advantage that it is simple and easy to perform, but, as one can imagine, it is not very efficient. Especially for extreme values of , our method is to be preferred as it leads to more precise estimates.
Conclusion: From a set of expectiles we can numerically obtain the quantile function. The method can also be applied in the regression scenario by conditioning on the explanatory variable as will be demonstrated in the article later.
Empirical evaluation
Evidently, the resulting fitted distribution function is continuous but has non-differentiable edges. In principle, one can set large to , but this may require heavy and numerically unstable calculations. In our experience, a sequence from usually is sufficient for deriving the quantile in the range between 1% and 99%, but for large sample sizes, it may be sensible to choose such that is proportional to .
The procedure allows us now to derive quantiles from expectiles and the question arises how they perform in terms of efficiency. We, therefore, run a small simulation study where we estimate a number of expectiles slightly smaller than (for , we estimated 459 expectiles; for , we set ). We simulate (a) from the standard normal distribution, (b) from the Chi-squared distribution (df = 2) and (c) from the t-distribution (df = 3) with sample sizes and , and each simulation is replicated 1000 times. We use odd sample sizes to guarantee unique quantiles, for e.g., . We compare our quantiles from expectiles with ordinary quantiles for . The calculation of quantiles from expectiles is a part of the R-package ‘expectreg’ (as all R-packages available from cran.r-project.org). Quantiles are calculated using the function rq from the R-Package ‘quantreg’ by (Koenker 2013b). We also look at smooth quantiles denoted by and calculated using the method proposed by (Jones, 1992). For a moderate number of , numerical instability does not seem to be a problem within the estimation of quantiles from expectiles. In total, the simulation includes times the calculation of quantiles from expectiles, a procedure which failed in none of the 6000 cases. Penalty (2.3) does not only lead to a smooth distribution function and, therefore, to smooth quantile estimates, but also improves the numerical stability of the calculations. For and a number of 459 expectiles, the implemented function needs around seven seconds to calculate (using one kernel of an ordinary computer).
In Figure 1, we show for one (randomly chosen) sample of each distribution with the fitted transfer function .
Source: Authors' own (prepared with GNU R).
The true function is provided for comparison and apparently the fit looks acceptable. The function itself is of secondary interest for this simulation study, but we see that in fact can be estimated. Moreover, we will need the transfer function later in the example of Section 4.3 where we estimate the ES.
In Figure 2, we visualize the result of our simulations. Here we compare with (solid lines) and, to make a fair comparison, with (dotted lines). As the results for and are very similar, Figure 2 concentrates on . The first plot of Figure 2 shows the simulation-based relative root mean squared error (RMSE) of the estimated quantiles for the standard normal distribution. Results for the Chi-squared distribution can be found in the second plot and in the third plot, findings for the student t-distribution are visualized. Results above the unit line stand in favour for expectiles, and Figure 2 mirrors surprisingly satisfactory performances for . We notice the gain of efficiency for the two symmetric distributions and inner quantiles: The RMSE for quantiles from expectiles is 5–10% lower than the RMSE for smooth quantiles. Not surprisingly, the difference between quantiles and smooth quantiles becomes stronger when looking at extreme quantiles. This is also mirrored in the relative RMSE as we see that for quantiles reflecting extreme observations the smoothing leads to an improvement. Generally, it occurs that the expectile-based quantile estimators behave sound and are (for most values of ) more efficient than the direct quantile estimates . This holds as well in terms of relative RMSE as in terms of relative mean absolute error which is not reported here as the results were quite similar.
Source: Authors' own (prepared with GNU R).
Conclusion: All in all, we see, that the calculation of quantiles from a set of expectiles is a sensible thing to do also in terms of efficiency. The numerical burden is of course not ignorable.
Expectiles and quantiles in the tail
As can be seen from Figure 1, we have for small values of that and accordingly for close to 1, , unless the distribution is heavily tailed. For instance, the quantile of the standard normal distribution corresponds to the expectile. This raises the question if and how well extreme expectiles can be estimated. To tackle this question formally, we look at expectiles and quantiles in the tail of the distribution by setting
for some . Moreover, we assume that the tails have a reasonable interpretation in that the second-order moment of the underlying distribution is finite. The expectile estimate is defined as a minimizer of (1.2) for and we get
where for and for . Note that (2.5) is not an analytic definition, since the iterated weights depend on the fitted value . We simplify (2.5) by replacing the ‘fitted’ weights by their ‘true’ weights defined through for and otherwise. This allows us to approximate (2.5) to
Note that as shown in the Appendix, we have for close to 0 and for close to 1. Therefore, we find , so that with (2.4), we may approximate (2.6) through expansion to
The first component in (2.7) has mean zero and variance
Note that with the assumption of finite second-order moments, we have which implies that for . This in turn yields that and so that overall .
Looking at the second component in (2.8), we find that its mean equals while its variance is of order Arguing that , see the Appendix, we can conclude that the second term in (2.8) is of ignorable asymptotic order for . The same holds by simple calculation for the remaining components not explicitly listed in (2.7) and (2.8). Hence, we may approximate the distribution of through
In particular, with (2.10), we get the (asymptotic) unbiasedness . One may even derive asymptotic normality from (2.10) by showing that higher order moments vanish. We can, therefore, conclude that even for extreme expectiles, i.e., or very small, respectively, we achieve asymptotic unbiasedness and normality.
Source: Authors' own (prepared with GNU R).
We now pose the same question to quantiles, i.e., what can be said asymptotically about quantile estimation in the tails of the distribution. Following (Koenker and Bassett, 1978) and (Koenker, 2005 pp. 71–72 ), we can derive the distribution of the quantile estimate as follows. Let with as a indicator function, then is defined through and for all . Hence
where is a binomial random variable with parameters . Note that and with (2.4) we have that the distribution of (for small ) converges to a Poisson distribution. As a consequence, for extreme quantiles, we do not achieve asymptotic normality and, therefore, unbiasedness is not guaranteed. We can easily calculate the limit of which equals for . For instance, for , this equals 0.26, which mirrors skewness of the distribution of extreme quantiles. Note, of course, that we may use extreme value theory to derive the asymptotic distribution of .
We run a small simulation to study the performance of tail expectile and tail quantile estimation. We simulate data and look at the distribution of extreme quantiles. In Figure 3 we show the distribution of (solid line) and (dashed line) for a sample size of . We look at the (top row) and the (bottom row) quantiles and the corresponding expectile. We simulate from (a) a normal distribution (left column), (b) a Chi-squared (with two degrees of freedom, middle column) and (c) a t-distribution (with three degrees of freedom, right-hand side column). The vertical line indicates the mean value of (solid line) and (dashed line) which should be zero to indicate unbiasedness. There is apparently a bias occurring for quantile estimation for close to 1 (or close to zero).
Conclusion: Overall we may conclude that expectiles estimates behave stable even for very small or very large values of . This is of course important to know if one uses a sequence including even extreme expectiles to estimate quantiles as suggested in the previous subsection.
Example
To illustrate expectiles and the conversion of expectiles to quantiles we give a short example. We apply our methods to data collected 2012 in Munich to construct the Munich rent index. The full data set consists of 3080 observations, i.e., rented apartments in Munich, Germany, and we here analyze the variable giving the net rent per squared meter for each apartment. For illustration, we restrict our attention to apartments between 45 and 55 and examine the net rent per square for the resulting 421 apartments in the data set of that size.
First we calculate a fine grid of sample expectiles and quantiles for the variable net rent per and plot them in Figure 4 on the left. The estimated expectiles naturally form a smooth curve while the estimated quantiles mirror some variability. In a next step, we use the set of expectiles to calculate quantiles from expectiles and plot these against the empirical quantiles (see the right part of Figure 4). When zooming into the figure, one can notice, that the estimated quantiles for the inner range of nearly coincide with their empirical counterpart, whereas for extreme values of there is some minor fluctuation around the identity line. This behaviour is also supported by our simulation results in Section 2.2. All in all, we see the applicability of the approaches. The example will be further discussed in subsection 4.2 by regressing the rent on the floorspace.
Source: Authors' own (prepared with GNU R).
Quantile and expectile regression
The problem of crossing quantiles and expectiles
So far we have considered the simple scenario with no explanatory variables involved. We extend this now to quantile and expectile regression. To do so, we assume a continuous covariate and define the quantile and expectile regression functions through and . Estimation of and is carried out using the weighted sum (1.1) and the corresponding version (1.2), respectively, with and .
A central problem occurring in quantile and expectile regression is crossing of fitted functions. For , by definition, we have and , for all . This inequality can, however, be violated for some (observed) values in the fitted functions, which is called the crossing quantile or expectile problem. Several remedies have been proposed to circumvent the problem, some of which will be used later in the next section. Before turning to that, we want to explore empirically in simulations how frequently one is faced with crossing quantile and expectile functions. We run a small simulation study and count the number of crossings between neighbouring fitted expectiles and quantiles. Therefore, we select a set of and simulate data from the following simple linear regression setting
The covariate is drawn from a uniform distribution, and the random error is added from either (a) a normal , (b) a Chi-squared or (c) a t-distribution . Data sets are generated with sample sizes of and for each combination of settings 1000 replications are created. A data set is then analyzed by computing the set of -quantiles using the R-package ‘quantreg’. For expectiles, we compute the resulting expectiles using the R-package ‘expectreg’. Function is computed separately for each error distribution according to (1.3), which makes the estimates comparable. For each data set and every generated covariate value, all neighbouring pairs of and are checked for crossing regression lines within the range of observed covariates.
Number of crossings between two neighbouring expectiles/quantiles in the linear model (3.1) from 1000 data sets, starting with . Crossing counts are given for all 10 pairs of expectiles or quantiles, respectively, sample sizes of and the three error distributions for , as defined previously. Quantiles smaller than are omitted and indicated as in the table
expectiles
t(3)
/
49
199
499
49
199
499
49
199
499
0.01–0.02
21
5
56
1
49
48
0.02–0.05
54
2
0
207
3
0
22
35
24
0.05–0.1
8
0
0
44
0
0
7
18
3
0.1–0.2
1
0
0
4
0
0
5
5
2
0.2–0.5
0
0
0
0
0
0
1
0
0
0.5–0.8
0
0
0
1
0
0
0
0
0
0.8–0.9
0
0
0
5
0
0
10
3
1
0.9–0.95
8
0
0
19
0
0
14
11
2
0.95–0.98
48
2
0
42
0
0
23
27
23
0.98–0.99
27
3
27
1
53
45
quantiles
t(3)
/
49
199
499
49
199
499
49
199
499
0.01–0.02
564
210
593
226
612
296
0.02–0.05
714
123
10
758
143
16
679
208
27
0.05–0.1
443
46
0
439
46
0
377
78
3
0.1–0.2
144
6
0
167
3
0
156
2
0
0.2–0.5
5
0
0
5
0
0
3
0
0
0.5–0.8
3
0
0
13
0
0
4
0
0
0.8–0.9
168
2
0
154
5
0
166
7
0
0.9–0.95
433
37
1
432
40
1
411
78
3
0.95–0.98
680
130
12
718
175
18
704
214
37
0.98–0.99
593
213
599
199
647
267
Source: Authors' own (prepared with GNU R).
The resulting number of crossings within the 1000 replications is summarized in Table 1. Not surprisingly, crossings occur in the tail of the distribution and become less frequent with increasing sample size. However, the numbers show that there are generally fewer crossings of expectiles, in particular, within the central of the distribution, while for quantile regression we obtain a large proportion of crossings for small samples even in the inner part of the distribution, i.e., between the 0.2 and 0.8 quantiles.
Conclusion: We may conclude from the simulation that expectiles seem less vulnerable for crossing problems than quantile estimates.
Non-crossing spline-based estimation
Several remedies have been suggested to circumvent or correct for crossing quantiles with references given in the introduction. We here extend the idea of Bondell et al., 2010 who fit non-crossing quantiles using linear programming. We pick up the idea and generalize it towards non-crossing spline-based expectile estimation. To do so, we first present spline-based quantile and expectile estimation by replacing and with bivariate functions
where both functions are smooth (or just linear) in the direction of and smooth (or just linear) in the direction of . The bivariate functions may be called quantile sheets or expectile sheets. The setting (3.2) transfers the estimation exercise to bivariate smoothing, as proposed in (Schnabel and Eilers, 2013b) and Schnabel and Eilers (2009a). We replace (or approximate) through
where is a (linear) B-spline basis set up on knots , is a B-spline basis built upon some knots covering the range of observed values of and is the vector of coefficients. If is assumed to be linear in , one may take as linear B-spline and set in this case. Let be the indices of columns of , and the indices of columns of . Vector may then be indexed by for , , and let . Non-crossing quantiles are now guaranteed by linear constraints on the parameter vector of the form
for all in the observed range of the covariates. If is a linear B-spline basis, this simplifies to for and . In general, we can formulate (3.4) as a linear constraint by inserting for the observed values .
We can now fit by replacing (1.1) with its multiple version
which is minimized with respect to subject to the linear constraints (3.4) using linear programming. Alternatively, one may work with iterated weighted least squares by using the fact that . (Schnabel and Eilers, 2013b) change the weight from to and apply iterated weighted least squares to fit the function .
gives a weighted least-squares criterion which allows to estimates the expectile sheet , where again the linear constraints (3.4) need to be fulfilled. Estimation can be carried out by iterative quadratic programming.
Source: Authors' own (prepared with GNU R).
In Figure 5, we show an example for the simulation model (3.1) of the previous subsection. We plotted the resulting quantile and expectile sheets under the assumption of normally distributed residuals. We use a linear B-spline basis for with . For every value, we have increasing (or nondecreasing) functions and in . A simple visual impression shows that the quantile sheet is more variable compared to the fitted expectile sheet. Note that we can now calculate for each value of a set of expectiles , which allows to apply the results of Section 2 to derive quantiles based on expectiles. The code for calculating the linear non-crossing quantile curves by (Bondell et al., 2010) is available from the homepage of Howard Bondell (see http://www4.stat.ncsu.edu/̃bondell/Software/NoCross/NoCrossQuant.R, last date of access 10 January 2014). The programme for fitting non-crossing expectiles is part of the R package ‘expectreg’ by (Sobotka et al., 2013).
Conclusion: For both expectiles and quantiles, we can fit sheets guaranteeing non-crossing functions. Overall, the expectile sheet provides a more smooth surface compared to the quantile sheet, in particular, in the direction of .
Extensions and examples
Penalized smooth expectile sheets
Following the expectile sheet , we may assume that is smooth in , but without any parametric (linear) assumption. This can be fitted with a B-spline basis as in Section 3.2, but now with being large. In order to control for a smooth and numerically stable fit, one may impose a penalty on the coefficients in the style of penalized spline regression (see Ruppert et al. 2003, 2009). In other words we supplement (3.6) by the quadratic penalty
where is an appropriately chosen penalty matrix, and is the smoothing parameter which is chosen data driven. We give an example in the next subsection. This was first proposed by Schnabel and Eilers (2009a) with an additional penalization in direction of . For a specific value of this has been proposed in Sobotka et al., 2012 for expectile smoothing or in Bollaerts et al., 2006 for quantile smoothing where the latter use a different penalization. The smoothing parameter can be chosen by asymmetric cross-validation or the Schall algorithm for mixed models as described in (Schnabel and Eilers 2009b).
Source: Authors' own (prepared with GNU R).
Rent index of Munich
To see how the method performs in practice, we again take a look at the Munich rental data from Section 2.4. As a reminder, the data consists of 3080 observations, i.e., rented apartments in Munich, Germany. We consider two variables in our example: net rent per as response and living space measured in as covariate. First, we perform both a non-crossing and non-parametric expectile regression as described in Sections 3.2 and 4.1, respectively. Our underlying model is given with the expectile sheet . The smoothing parameter was chosen automatically by using the Schall algorithm (see Schnabel Eilers, 2009b). The sheet resulting from the estimation procedure is shown in Figure 6. As one can see the dependency of the two variables is obviously of non-linear nature. The amount of smoothing done for the expectile sheet seems appropriate. The sheet serves as a basis to calculate quantile estimates for certain values of and . We apply the algorithm as described in Section 2 and obtain the mid-panel of Figure 6. The calculated values for the quantiles are indicated by points which are connected by lines. All in all, the quantiles from expectiles seem to behave well. We can see that there is a decrease in net rent per square metre as the apartment size grows. This continues for apartments up to size 100 , but then net rent remains, more or less, constant. A nice feature of our conversion is that non-crossing of quantiles is guaranteed.
As an alternative to the expectile-based analysis, we apply smooth non-crossing spline-based quantile fitting as described in (Muggeo et al., 2013) and implemented in the R-package ‘quantregGrowth’. Here, we choose cubic B-splines with a penalization through second-order differences. (Muggeo et al., 2013) guarantee non-crossing of quantile curves by imposing inequality constraints on the spline coefficients. As the R-package allows us to estimate growth charts, there is also the possibility to enforce monotonicity in the direction of the dependent variable. The resulting fit is shown in the right part of Figure 6. All in all, spline-based quantiles exhibit similar features as the quantiles from expectiles, although the quantile smoothing spline, due to its nature, is angled. For the non-crossing quantile smoothing, we decided to pick a smoothing parameter which would result in a smoothness comparable to the amount of smoothing mirrored in the second panel of Figure 6.
Expected shortfall
Investment risks are frequently measured using the ES, a stochastic risk measure, for the lower tail defined as for a continuous random variable with -quantile . It measures the expectation given that the random variable does not exceed a fixed value and is often applied to financial time series. A naïve estimate would calculate the mean beyond a previously estimated quantile and would, therefore, be rather inefficient. (Taylor, 2008) presents a possibility to estimate the ES using expectiles and their connection to quantiles.
Note that the -expectile is implicitly defined through arg min so that the expectile satisfies
where, as above, . That is, the expectile is determined by the expectation of the random variable conditional on . Rewriting (4.1) and using the fact leads to
for the lower tail of . Depending on whether the random variable describes a win or a loss, we define the ES for the upper tail as . In order to determine the appropriate to a given , (Taylor, 2008) estimates a dense set of expectiles and then constructs an empirical distribution function on the basis of the expectile curves. Here, we make use of the results derived in subsection 2.1 and estimate the distribution function from expectiles. As introduced in Section 2.1, we estimate a dense set of expectiles (i.e., we set ) and compute the cumulative distribution function at each observed covariate value. The estimated distribution allows us to conclude the value for a given -quantile. We then calculate the ES explicitly for certain values of , e.g., .
We apply the idea and estimate the ES for a serially drawn time series from the daily yields of the French stock index CAC40 in the time period between 1991 and 1998. All in all, there are 1860 observations/trading days and we take time as the covariate influencing the ES. For estimation, we, therefore, construct the expectile sheet . As basis in , we use a cubic B-spline basis with 20 inner knots to account for the variability in time. However, in order to give a risk prediction for the next observations, we have equidistant knots from to . That way, we get an estimated risk for the upcoming day(s), i.e., we pursue out of range prediction. To achieve smoothness in time, we add a penalty of first-order differences to (3.6), where has rows for and . The optimal smoothing parameter is chosen via asymmetric cross-validation; see (Sobotka et al., 2012) for a more extensive description. Next, we apply the algorithm presented in Section 2 to all observed covariate values and also to the added time points beyond the data. This delivers the estimated -quantile and its corresponding -expectile. In turn we are able to estimate the expected shortfall (4.2) for each point in time. This is done sequentially, that is, observation by observation, to gather information about the changes in the distribution.
The result of the estimation is presented in Figure 7 (left part). For comparison, we also fit the ES based on the empirical distribution as suggested by (Taylor, 2008) (the right part of Figure 6). As can be seen, the volatility of the data is captured by the curves of the ES, for gains, as well as for losses. A generalization over the range of time can also be observed. When using the empirical distribution function on the other hand, especially the curves for tend towards overfitting. This is particularly visible for the time of low volatility around day 1300. The small amount of prediction incorporated by the splines turns out to be just a linear extension of the last fits. For accurate predictions, one should aim to combine conditional autoregressive expectiles (CARE, Kuan et al., 2009) that are able to account for the autocorrelation in the data with the methods introduced in this article. Still, the example shows that an improvement in ES estimation is possible when using the efficient distribution estimation introduced in Section 2.1.
Source: Authors' own (prepared with GNU R).
Discussion
In this article, we looked at quantiles, as Goliath, and expectiles, as David, and explored how their connection can be used in practice. An algorithm was presented to estimate quantiles from a (fine) grid of expectiles. We noticed and examined properties of extreme quantiles and expectiles and discussed the crossing issue of quantile and expectile regression. Even so, as crossing of neighbouring curves is an issue, we proposed a modified method to circumvent this problem. All methods regarding expectiles which were described in detail in this article can be found in the R-package ‘expectreg’.
Apparently, referring again to the comparison of expectiles and quantiles to David and Goliath is undissolved. There is no final fight, and research on both ends continues. It is certainly true that quantiles are dominant in the literature but we wanted to show that expectiles are an interesting alternative to quantiles and that their combined use is helpful, in particular, for the estimation of the ES. We also demonstrated the use of quantile and expectile sheets as smooth variants to quantile and expectile regression, respectively. This accommodates quite naturally the constraints of non-crossing quantile and expectile curves and the latter allows for smooth expectile regression based on implemented software, as mentioned above. Also, expectile regression now can be performed without loosing interpretability, since quantiles can be estimated from expectiles.
All in all, we hope to have convinced the reader that expectiles do not immediately ‘belong in the spittoon’ as (Koenker 2013a) provocatively postulates. We think that expectiles provide an interesting and worthwhile alternative to the well-established quantile regression.
Footnotes
Acknowledgments
Financial support from the German Research Foundation (DFG) grant KA 1188/7-1 is gratefully acknowledged. We thank two anonymous reviewers for constructive remarks which led to an improved version of this article.
Relation of α and α ̃ for α → 0
We assume that has finite second moments, then with (1.3)
Since the nominator and denominator both tend to zero for , we apply the rule of de l'Hospital. Observing that for , we get
Hence for . Note that since which follows due to the existence of second-order moments, we find again that nominator and denominator of (A.2) converge to zero for . Assuming now that is proportional to for which is required to guarantee finite second-order moments, we get, again with the rule of de l'Hospital applied to (A.2), that , while if is proportional to for some , we get with the same arguments that .
Estimation of F ̂ m ( . )
Let be a dense set of knots covering and containing and define with the index with . First note, that (2.1) for gives a redundant information as it states that . That is to say that we need an additional constraint. This is found by observing that
with the approximation of from Section 2.1. Remembering the definition of the expectiles (2.1), we define function by
We now need such that which in principle can be seen as a root finding problem. We implemented a version where we minimize the sum of squares of under certain restrictions: We face the minimization problem
under the constraints that and which is solved by Newton's method in optimization and also implemented in the R-package ‘expectreg’ by Sobotka et al., 2013. Penalty parameter , which ensures numerical stability and smoothness of the distribution function, may be set equal to the squared empirical variance of the data from which the expectiles are estimated. In our simulations, we set equal to five times the squared empirical variance of the data (for each of the three distributions considered and for both sample sizes and ).
References
1.
AignerDJAmemiyaTPoirierDJ (1976) On the estimation of production frontiers: maximum likelihood estimation of the parameters of a discontinuous density function.International Economic Review, 17 (2), 377–96.
BollaertsKEilersPAertsM (2006) Quantile regression with monotonicity restrictions using p-splines and the l1-norm.Statistical Modelling, 6(3), 189–207.
ChernozhukovVFernández-ValIGalichonA (2010) Quantile and probability curves without crossing. Econometrica, 78(3), 1093–125.
7.
DeRossi GHarveyA (2009) Quantiles, expectiles and splines. Journal of Econometrics, Nonparametric and Robust Methods in Econometrics (Special issue). 152(2), 179–85.
8.
DetteHSchederR (2011) Estimation of additive quantile regression. Annals of the Institute of Statistical Mathematics, 63 (2), 245–65.
EilersPHC (2013) Discussion: The beauty of expectiles. Statistical Modelling, 13(4), 317–22.
11.
GuoMHärdleW (2013) Simultaneous confidence bands for expectile functions. AStA –Advances in Statistical Analysis, 96 (4), 517–41.
12.
HeX (1997) Quantile curves without crossing. The American Statistician, 51(2), 186–92.
13.
HondaT (2004) Quantile regression in varying coefficient models. Journal of Statistical Planning and Inference, 121(1), 113–25.
14.
JonesM (1992) Estimating densities, quantiles, quantile densities and density quantiles. Annals of the Institute of Statistical Mathematics, 44(4), 721–27.
15.
JonesM (1994) Expectiles and M-quantiles are quantiles. Statistics & Probability Letters, 20(2), 149–53.
16.
KneibT (2013) Beyond mean regression (with discussion and rejoinder). Statistical Modelling, 13(4), 275–303.
17.
KoenkerR (1984) A note on l-estimates for linear models. Statistics and Probability Letters, 2(6), 323–25.
18.
KoenkerR (2005) Quantile regression, econometric society monographs. Cambridge: Cambridge University Press.
19.
KoenkerR (2013a) Discussion of ‘beyond mean regression’ by T. Kneib. Statistical Modelling, 13(4), 323–33.
20.
KoenkerR (2013b) Quantreg: quantile regression. R package version 4.97.
KuanCMYehJHHsuYC (2009) Assessing value at risk with care, the conditional autoregressive expectile models. Journal of Econometrics, 150(2), 261–70.
24.
LeoratoSPeracchiFTanaseAV (2012) Asymptotically efficient estimation of the conditional expected shortfall. Computational Statistics and Data Analysis, 56(4), 768–84.
25.
MuggeoVSciandraMTomaselloACalvoS (2013) Estimating growth charts via nonparametric quantile regression: a practical framework with application in ecology. Environmental and Ecological Statistics, 20, 519–31.
26.
NeocleousTPortnoyS (2008) On monotonicity of regression quantile functions. Statistics and Probability Letters, 78(10), 1226–29.
27.
NeweyWKPowellJL (1987) Asymmetric least squares estimation and testing. Econometrica, 55(4), 819–47.
28.
NohHChungKVanKeilegom I (2012) Variable selection of varying coefficient models in quantile regression. Electronic Journal of Statistics, 6, 1220–38.
29.
PratesiMRanalliMSalvatiN (2009) Nonparametric M-quantile regression using penalised splines. Journal of Nonparametric Statistics, 21(3), 287–304.
30.
ReissPHuangL (2012) Smoothness selection for penalized quantile regression splines. International Journal of Biostatistics, 8(1).
31.
RuppertDWandMPCarrollRJ (2003) Semiparametric Regression. Cambridge: Cambridge University Press.
32.
RuppertDWandMPCarrollRJ (2009) Semiparametric regression during 2003–2007. Electronic Journal of Statistics, 3, 1193–256.
33.
SchnabelSKEilersPHC (2009a, July 20–24) Non-crossing smooth expectile curves. In JG Booth, ed.Proceedings of the 24th International Workshop on Statistical Modelling. Ithaca, NY, USA, 330–36.
34.
SchnabelSKEilersPHC (2009b) Optimal expectile smoothing. Computational Statistics and Data Analysis, 53(12), 4168–77.
35.
SchnabelSKEilersPHC (2013a) A location-scale model for non-crossing expectile curves. Stat, 2(1), 171–83.
36.
SchnabelSKEilersPHC (2013b) Simultaneous estimation of quantile curves using quantile sheets. Advances in Statistical Analysis, 97(1), 77–87.
37.
SobotkaFKauermannGWaltrupLSchulze and KneibT (2012) On confidence intervals for semiparametric expectile regression. Statistics and Computing, 23(2), 135–48.
38.
SobotkaFKneibT (2012) Geoadditive expectile regression. Computational Statistics and Data Analysis, 56(4), 755–67.
39.
SobotkaFSchnabelSWaltrupL Schulze (2013) Expectreg: expectile and quantile regression. With contributions from P. Eilers, T. Kneib and G. Kauermann, R package version 0.39.
40.
TaylorJ (2008) Estimating value at risk and expected shortfall using expectiles. Journal of Financial Econometrics, 6(2), 231–52.
41.
WangHZhouX (2010) Estimation of the retransformed conditional mean in health care cost studies. Biometrika, 97, 147–58.
42.
WuYLiuY (2009) Stepwise multiple quantile regression estimation using non-crossing constraints. Statistics and Its Interface, 2, 299–310.
43.
YaoQTongH (1996) Asymmetric least squares regression estimation: A nonparametric approach. Journal of Nonparametric Statistics, 6( 2–3), 273–92.
44.
YuanM (2006) Gacv for quantile smooting splines. Computational Statistics and Data Analysis, 50, 813–29.
45.
ZiegelJF (2013) Coherence and elicitability. arXiv:1303.1690v2.