In this paper we introduce approximations of the cumulative conditional expectation function (CCEF) based on Bernstein polynomial copulas. These approximations converge pointwise and uniformly to CCEF. In addition some examples of the behavior of these approximations are shown. We also built estimators for the CCEF, which were constructed through Bernstein polynomial estimators for copulas. It is shown that the estimators are asymptotically normal and biased for the CCEF. We exemplify the use of these estimators through real data from the educational field.
In this paper we study an extension of the concept of directional dependence in regression setting, which was designated by Sungur [15] as copula regression function. Formally, let be a random pair with uniform marginals on , with associated 2-copula , the expected value which is a function of , is referenced as copula regression function of on . It is proposed in Sungur [15] as a way of summarizing the relation between response and explanatory variables by separating the marginal and the joint behavior. The copula regression function is used to identify the directional dependence in joint behavior of the random pair , as introduced in Sungur [14] (see also Muddapur [11]). In addition, the directional dependence describes the likely direction of influence between two variables, from the joint behavior between them. Advances in this line [14, 15] have been applied to real data by several authors. Here we highlight the recent work of D. Kim and J. Kim [9] where it is proposed a generalized multiple-step procedure for the full inference of the directional dependence in joint behavior based in some asymmetric copula families.
We explore as a function of , we denote this quantity as cumulative conditional expectation function. One of the motivations of working with this quantity is its use for making decisions in real problems. In addition, as noted by Hua and Joe [6], the conditional expectation can capture the strength of dependence of a bivariate random vector and helps in developing graphical techniques for visualizing the dependence in a region of .
Our target is to give the foundation for the construction of approximations and estimators for the cumulative conditional expected function. Also, we present in the first case the rate of approximation and in the second, the asymptotic distribution of the empirical process related to the estimator.
In Section 2 we show a tractable representation of the cumulative conditional expectation function. Also, we present how we can take advantage of certain structures of copula functions to calculate the cumulative conditional expectation function, as for example in the cases of convex mixture of copulas and copulas with polynomial cross sections. In Section 3, we assume the copula model is known and propose a polynomial approximation for the corresponding cumulative conditional expectation function. Also, we obtain a rate of convergence for the approximation. In Section 4, by means of the Bernstein polynomial estimator for copula functions, we propose estimators for the cumulative conditional expectation function and we show its asymptotic normality. We conclude the section with an application to real data. This application is in the educational field, previous works regarding copula models to educational data are recent, see Fernández and González-López [2] and Fernández et al. [4]. Finally, the conclusions are exposed in Section 5 and the proof of our results are given in the Appendix.
Cumulative conditional expectation
For some applied problems, it is helpful knowing the mean performance of one variable conditioned to another variable restricted to a region and not just to a single observation. This concept is formally presented in the next definition.
.
Let be a random vector with associated 2-copula . Then the cumulative conditional expectation function (CCEF) of on is given by .
.
The cases and are excluded because they restrict themselves to the copula regression function. Indeed is trivial and has been studied in Sungur [15].
We note the previous concept was explored by Fernández and González-López [3] with focus on some bounds of the measure . In addition, an equivalent concept was studied by Hua and Joe [6] for the purpose of investigating levels of strength of dependence in the tails: asymptotically linear, sub-linear and constant. The following theorem gives a representation for that let us to simplify its computation.
.
Let be a random vector with associated 2-copula . Then , .
Proof. See the Appendix.
Then, it can be seen that the conditional tail expectation, , can be written as , for , where is the survival copula of and , .
In order to simplify the notation, hereafter we denote the partial derivatives as done in Janssen et al. [8], this is , , and . It can be seen that is the average of the copula regression function .
The last term is equal to according to Sungur [15].
By means of Theorem 1 we can compute the cumulative conditional expectation function for several cases, as it was explored in Fernández and González-López [3]. Now we present below how we can take advantage of certain structures of copula functions to calculate the function .
The following result shows that the cumulative conditional expectation of a convex mixture of co-pulas is the convex mixture of the cumulative conditional expectations. It is clear that the mixture of copulas is also a copula, when and . The usefulness of this result can be justified by the applications of convex combination of copulas found in the literature, for example, in financial market [5], in dynamical clustering [17] and to represent different experts’ information about the parameters under a Bayesian perspective [1]. A complete explanation of this last recent approach can be found in Fernández et al. [4].
.
Under the hypotheses of Theorem 1, let be the convex combination copula, where is a copula function, for and . Then, .
Proof. It is a direct consequence of Theorem 1, since
In the next result we study a specific class of copulas, the polynomial ones. Recalling that a copula has polynomial cross sections in if it can be written as for each , to suitable functions . More details about these copulas can be found in Nelsen [12] Subsection 3.2. As an immediate consequence of Theorem 1 follows the next corollary.
.
Under the hypotheses of Theorem 1, if . Then, is a polynomial of degree given by .
.
Quadratic and cubic cross sections copulas.
The Farlie-Gumbel-Morgenstern (FGM) family contains all the copulas with quadratic cross sections in both variables. with then .
The Lin’s iterated FGM family is defined by with and real numbers satisfying and . has cubic cross sections in both variables. For where
Then,
The difference for the cubic sections family of copulas when and the parameter takes different values
Spearman’s rho
2
0.0416
0.1500
0.1000
0.0250
0.0160
0.0250
1
0.0416
0.0900
0.0600
0.0150
0.0100
0.0150
0
0.1250
0.0300
0.0200
0.0050
0.0030
0.0050
0.25
0.1458
0.0150
0.0100
0.0020
0.0010
0.0020
0.5
0.1666
0
0
0
0
0
0.75
0.1875
0.0150
0.0100
0.0020
0.0010
0.0020
1
0.2083
0.0300
0.0200
0.0050
0.0030
0.0050
We then explore the behavior of a concept of directional dependence that can be constructed based on the curves . For this we use a straightforward extension of the directional dependence defined in Sungur [14]. In Sungur [14] the authors compare this concept with another, showing in several scenarios their performance strongly linked to the underlying copula. In particular it is observed its adequacy in the presence of asymmetry, that is, when is different from . For this reason we chose for our exercise a biparametric family of asymmetric copulas, which include among the possible cases the symmetry, . In D. Kim and J. Kim [9] is proposed a procedure for the inference of the directional dependence in joint behaviour, introduced in Sungur [14]. D. Kim and J. Kim [9] emphasize the way of choosing the asymmetric copula used in the modeling, since this choice is fundamental for the interpretability of the results on the directionality of the dependence. The Corollary 3 let us to easilly compute the CCEF for the asymmetric cubic sections family , where , and . We get and . Recalling that Sungur [14] defines a pair as directional dependent in joint behavior if and has a functional relation with given by Corollary 1, it is natural to explore about its relation with directionality, through the difference . The parameters and of the asymmetric cubic sections family determine the asymmetry, when the copula is symmetric and asymmetric in any other case. At Table 1 we compute this difference for the parameters and . When there is no difference, but the other cases show directional dependence. For instance, observe the cases and , the Spearman’s rho is not the same however the values of the difference are equal in absolute value-with opposite signs-, showing its directional aspect. The same happens when and . Actually for and we have copulas and respectively, such that, the Spearman’s rho coefficients and the functions and are different, but the difference between the cumulative conditional expectations is constant and measured by the asymmetry given by the parameters, i.e. . Besides, the case has the same Spearman’s rho, in absolute value, than but the directions are not equal. Also, for this polynomial family, it can be seen that for and . At Table 1 can be seen the signal changing in the difference at .
In the next section it is introduced the concept of Bernstein copula, which allows to derive approximations and estimates of .
Bernstein copulas
Hereafter we explore the properties of the cumulative conditional expectation function for Bernstein copulas (BC). These copulas belong to the family of polynomial copulas. The bivariate BC family was introduced by Li et al. [10], they are defined in terms of Bernstein polynomials. Li et al. [10] defined a stronger concept of convergence between copulas, through Markov operators, and showed that Bernstein copulas approximates any copula . We will use this approximation to obtain an approximate analytical expression of the function for any copula .
.
Given a 2-copula and a grid of points with , the Bernstein copula approximation of of order is given by
where .
Even though our work focuses on bivariate copulas, it is relevant to cite the work of Sancetta and Satchell [13]. In that paper the concept of Bernstein copulas is extended to the multivariate case, and explored them as an approximation to known copulas and estimation to unknown copulas.
The function approximates the copula and is a copula itself. Now we concentrate on the approximation of known copulas, and in the next section of unknown ones, as a tool for approximating the cumulative conditional expectation of any copula. The usefulness of the next theorem is to numerically simplify the computation of , through solving polynomial integrals, when is a parametric copula described by a complex analytical expression.
.
Consider a 2-copula , and a grid of points with . Set the Bernstein approximation of of order as
where .
.
Definition 3 comes from Theorem 1, by means of approximation of given by Definition 2, it holds
For each and with , . Then, corresponds to Definition 3.
.
Under the hypotheses of Theorem 1, converges to pointwise, when .
Proof. See the Appendix.
For appropriate copulas it is possible to give a rate of the approximation for , and also we observe that this rate is improved when increases.
.
Let be a continuous copula with first order partial derivatives being Lipschitz and . For it holds for a constant . Then, converges uniformly to in , when .
Proof. By Theorem 1 we know that . In addition, in Sancetta and Satchell [13] (Theorem 2), it is proved that a continuous copula with Lipschitz partial derivatives satisfies for a constant . Then, it we can seen that
for . The last inequality tends to when goes to infinity uniformly in .
We exemplify the approximation of through by exploring the case reported in Sancetta and Satchell [13] (Table 1).
.
Consider the Kimeldorf and Sampson (KS) copula given by
In Figs 1 and 2 we show the function for being the KS copula and its approximation for Bernstein copulas with , for several values of . It can be seen that, for all values of , the approximation improves for higher values of . This is a consequence of Theorem 3, since the copula KS is a continuous copula with first order partial derivatives satisfying Lipschitz condition.
We note that in the example, the evaluation of was done by solving numerically the integration given in Theorem 1 through the “NIntegrate” function of Mathematica 9.0 software.
Approximation of the cumulative conditional expectation function of the Kimeldorf and Sampson (KS) copula by (Definition 3) for .
Approximation of the cumulative conditional expectation function of the Kimeldorf and Sampson (KS) copula by (Definition 3) for .
Estimation
Consider a bivariate random sample of the vector with associated unknown 2-copula . Let be rank of in and rank of in . Denote the empirical copula as
By using the Bernstein estimator of the copula function proposed in Sancetta and Satchell [13], we built consistent estimators of the cumulative conditional expectation function.
.
Given a 2-copula the estimator of is
where is the empirical copula.
.
The estimator of is coming from the Bernstein estimator of order , , by applying the transformation . Note that the order depends of and if .
In order to explore the asymptotic behavior of the estimator we make use of some known results about the Bernstein estimator. Janssen et al. [7] (Theorem 2 and Remark 3) proved that for a copula with bounded third order partial derivatives on , if , , when , then the process in the space of bounded functions, where denotes weak convergence. In addition, the limiting process is a tight Gaussian process with mean function where
and covariance function given by for with
This time we consider the process , with , which is the process that defines the asymptotic behavior of the proposed estimator of .
.
Let be a 2-copula with bounded third order partial derivatives on . If and are positive integers such that with when , then for ,
is a Gaussian process with mean function and variance function
where .
Proof. See the Appendix.
.
The covariance function of the process is
with
for and . See the Appendix for details of the calculus.
.
The variance of the process also can be written in terms of and for being the copula regression function, this is . By using the relations , the variance is
For the sample of each course is reported the pairwise Spearman’s rho coefficient, the sample size and the Bernstein polynomial degree used in Fig. 3
Engineering course
Spearman’s rho
Sample size ()
Agricultural
0.261
586
25
Civil
0.203
699
27
Computer
0.284
771
28
Electrical
0.353
597
25
Mechanical
0.288
1204
35
Cumulative conditional expectation function for each of the Engineering courses where was obtained through Definition 4, with taken as reported by Table 2 (left) and through the emprirical copula (right).
Cumulative conditional expectation function for each of the Engineering courses where was obtained through Definition 4, with . The legends are: for agricultural, for civil, for computer, for electrical and for mechanical course.
Application
We estimate to quantify the impact on Calculus undergraduate student’s performance of the Mathematics admission test score. In this application we work with the students of the undergraduate courses of Agricultural, Civil, Computer, Electrical and Mechanical Engineering of University of Campinas, Brazil. There were considered the ones who were admitted at the university from 2003 to 2011. For each student it was taken the Mathematics admission score, denoted by , and the Calculus I score obtained after admission, denoted by . An annual standardization per course was used to avoid the effect of different tests applied each year. Some characteristics of the data are displayed in Table 2. Regarding the polynomial degree we follow an analogous reasoning as Janssen et al. [8] (Remark 1), but for the copula function. In Janssen et al. [7] it is shown that has mean function and variance where and
Now, let us assume . Because of the hypothesis condition of Theorem 4 and the reduction of the bias, it should be . On the other hand, when diminishes, so does the variance. Then, an ideal degree could be . In Fig. 3 the estimator is shown for each engineering course where the vector is obtained by transforming and by scaling ranks to . As a benchmark we also plot the CCEF for the empirical copula of each course given by .
Figure 3 shows that there are some differences between the courses, this is valuable information for academic management. For instance, suppose that the university wants to offer extra lessons to improve the performance in math of the young students. Then, it would be more efficient to use different rules for defining the group for extra lessons, because, there may be several types of behavior of . For example in our data we observe two types of situations, (i) composed by Civil, Electrical and Mechanical Engineering and (ii) composed by Agricultural and Computer Engineering. In the first case, the behavior of is approximately linear, and in the second case has a tendency of second order polynomial. Quadratic forms type (ii) require strategies to efficiently define the groups of students who would need assistance in math. For example, if we set in computer engineering, there are two points that could be interesting to take in consideration and since and . Finally, we empirically explore the performance of for different degrees , since the listed in Table 2 is quite high and this could over-fit the model. In Fig. 4 we can see the degree does not impact the courses classification in two groups: one almost linear and the other almost quadratic. At Table 3 we exemplify these two behaviors showing some values for the electrical course with linear CCEF and for the computer course with quadratic CCEF. In addition, it can be seen that, for each , decreases when gets higher and get closer to at regions where is increasing, this is for electrical and for computer. This highlights a possible criteria for chosing the parameter by means of analysing the monotonical behavior of . In any case, more studied needs to be done regarding the performance of several criteria for setting , in the context of the cumulative function as well as in the more general context of Bernstein copulas.
Electrical and computer engineering courses values of and
Electrical
Computer
Empirical
0.3290
0.3408
0.3990
0.4403
0.4380
0.4286
0.4122
0.4463
0.3413
0.3626
0.4089
0.4441
0.4452
0.4288
0.4292
0.4507
0.3461
0.3706
0.4138
0.4459
0.4452
0.4311
0.4332
0.4526
0.3591
0.3798
0.4176
0.4490
0.4372
0.4319
0.4377
0.4564
0.3832
0.4046
0.4335
0.4580
0.4374
0.4392
0.4496
0.4648
Conclusion
From the results derived in Section 2, it can be studied the shapes and properties of the function , under the assumption of stochastic structures. Some progress in this line can be seen in Fernández and González-López [3]. Even more, Proposition 1 of Fernández and González-López [3] allows to understand theoretically why the cumulative conditional function of each course (in our application) converges to when gets closer to 1, and this explains also why the difference between the functions diminishes when increases (see Fig. 3).
Theorems 2 and 3 show how under the assumption of knowledge of the copula in a given grid of points, the Bernstein copula can be used to obtain approximations of , which converge pointwise and uniformly to . Illustrations of those results are shown in the case of Kimeldorf and Sampson copula family. Our results may contribute to the study of convergence properties in particular cases of copulas. For situations where the underlying copula is unknown, we propose an estimator of , based on the Bernstein estimator of the copula function. Theorem 4 shows that the estimator is an asymptotically biased normal estimator of , with a variance that vanishes, when and goes to infinity. We exemplify the use of this estimator using real data, revealing the practical potential of using to make decisions that could impact educational policies.
Footnotes
Acknowledgments
The authors gratefully acknowledge the financial support for this research provided by FAPESP Post-doctoral Grant 2011/18285-6 to M. Fernández. Also, we wish to thank three referees and an associate editor for their many helpful comments and suggestions on an earlier draft of this paper. Special thanks to Prof. Marcelo Knobel for making the data used in this paper available to us.
Appendix
Proof of Theorem 1
Let us take . We recall that and, for every , there exits for almost (Nelsen [12], Theorem 2.2.7). Here the word “almost" is in the sense of Lebesgue measure. So , for each . Let us denote the set of point where is not derivable as and assume it nondecreasingly ordered. To simplify the notation we set . In addition, if is not derivable on a finite set , we define for all . Now we apply integration by parts and obtain the relation stated,
Proof of Theorem 2
For each let us define the functions , where is the Bernstein copula approximation of of order given by Definition 2, and for . As shown by Li et al. [10], converges strongly to , and they also show that the strong convergence in copulas is strictly stronger than uniform convergence. In addition we have the dominance for all . Then, by the Lebesgue Dominated Convergence Theorem, it holds . Hence, recalling that and , for each we get , when .
Proof of Theorem 4
In order to simplify the notation, let us introduce the following two processes
From Janssen et al. [7], the process , in the space of bounded functions. The limiting process is a tight Gaussian process with mean function , with being a constant value and function, following Eq. (1), with covariance function where and function, following Eq. (2). The continuous mapping Theorem, see, for example, Van der Vaart [16] Theorem 18.11, let us to get the following convergence
for every . Since a continuous linear transformation of a tight Gaussian process is normally distributed, (see Van der Vaart [16], Example 20.11), then, for each , has normal distribution.
By integration by parts of function, we obtain that the expected function is
Now we compute the covariance function.
Expanding the integrand, we get
Then, recalling that and , we can find that
The expression above can be simplified by introducing the functions , and , defined in Remark 4, as follows
Finally, by taking and in the covariance’s expression we get the variance
By means of the relation , we get and . Then, the variance can be reduced to
References
1.
FernándezM. and González-LópezV.A., A Bayesian approach for convex combination of two Gumbel-Barnett copulas, in: AIP Conference Proceedings, 1558, 2013, pp. 1491–1494.
2.
FernándezM. and González-LópezV.A., A copula model to analyze minimum admission scores, in: AIP Conference Proceedings, 1558, 2013, pp. 1479–1482.
3.
FernándezM. and González-LópezV.A., Bounds for the cumulative conditional expectation function, in: AIP Conference Proceedings, 1648, 2015, 060002.
4.
FernándezM. et al., A note on conjugate distributions for copulas, Mathematical Methods in the Applied Sciences38 (2014), 4797–4803.
5.
HuL., Dependence patterns across financial markets: a mixed copula approach, Applied Financial Economics16 (2006), 717–729.
6.
HuaL. and JoeH., Strength of tail dependence based on conditional tail expectation, Journal of Multivariate Analysis123 (2014), 143–159.
7.
JanssenP. et al., Large sample behavior of the Bernstein copula estimator, Journal of Statistical Planning and Inference142 (2012), 1189–1197.
8.
JanssenP. et al., A note on the asymptotic behavior of the Bernstein estimator of the copula density, Journal of Multivariate Analysis124 (2014), 480–487.
9.
KimD. and KimJ.M., Analysis of directional dependence using asymmetric copula-based regression models, Journal of Statistical Computation and Simulation84(9) (2013), 1990–2010.
10.
LiX. et al., Strong approximations of copulas, Journal of Mathematical Analysis and Applications225 (1998), 608–623.
11.
MuddapurM.V., A Remark on the Paper “A Note on Directional Dependence in Regression Setting” By Engin A. Sungur, Communications in Statistics – Theory and Methods37(3) (2008), 386–387.
12.
NelsenR., An Introduction to Copulas, Lecture Notes in Statistics 139, Springer-Verlag, 2006.
13.
SancettaA. and SatchellS., The Bernstein copula and its applications to modeling and approximations of multivariate distributions, Econometric Theory20 (2004), 535–562.
14.
SungurE.A., A note on directional dependence in regression setting, Communications in Statistics – Theory and Methods34(9–10) (2005), 1957–1965.
15.
SungurE.A., Some Observations on Copula Regression Functions, Communications in Statistics – Theory and Methods34(9–10) (2005), 1967–1978.
16.
Van der VaartA.W., Asymptotic Statistics, Cambridge University Press, 2000.
17.
VracM. et al., Copula analysis of mixture models, Computational Statistics27 (2012), 427–457.