Abstract
According to modern finance theory and increasing need for efficient investments, we evaluate the portfolio performance based on the data envelopment analysis method. By the fact that stock market’s return distributions usually exhibit skewness, kurtosis and heavy-tails, we consider some appropriate underlying distributions that affect the input and output of the model. In this regard, the multivariate skewed t and the multivariate generalized hyperbolic as the heavy-tailed distributions of Normal mean-variance mixture are applied. The models are inspired by the Range Directional Measure (RDM) model to deal with negative values. The value-at-risk (VaR) and conditional VaR (CVaR) as risk measures are used in these optimization problems. We estimate the parameters of such distributions by Expectation Maximization algorithm. Then we present an empirical investigation to measure the relative efficiency of two sets of seven groups of companies from different industries of Iran stock exchange market. By comparing the results of introduced models with previous RDM approach, we show that how well the distribution of assets affect the performance evaluation.
Keywords
Introduction
The mean-variance portfolio selection model is to minimize the variance subject to achieving a prescribed mean target in the investment [34, 36]. Its purpose is to allocate the wealth amongst a basket of financial assets to reach a satisfactory trade-off between the return of the investment and the associated risk. The risk measure and the mean return are two important items in portfolio selection theory. By implementing the investment strategy and measuring the portfolio performance, an investor can construct a profitable portfolio. Data envelopment analysis (DEA) technique has been found useful to measure relative efficiencies and to enhance the portfolio performance [13]. In finance, the DEA models have been frequently applied to measure the performance of the portfolio and operational efficiency. First, a DEA portfolio efficiency index was introduced to measure the relative performance with various transaction fees [27] and its generalization that considers different risk measures as the inputs was proposed [4]. There have been a number of approaches to the development of DEA to nonlinear form by quadratic constraints over a multiple time-horizons [26]. The multi-horizon mean-variance portfolio analysis and diversification was modified [6]. For a better decision making, a mean-variance-skewness model has been proposed into the evaluation of portfolio performance which has an additional constraint for skewness besides the mean and variance. In other words, a non-linear DEA-like framework has been developed by using higher moments in portfolio performance measurement in a three-dimensional space [20]. When the sample size is large enough, it is shown that the DEA frontiers converge to the portfolio efficient frontier [21]. A new DEA-based indices are introduced based on new risk measures concepts into fund performance evaluation [10]. Inspired by multi-objective optimization, an approach to take into account the shape of the distribution of returns using several risk measures at the same time has been proposed [5]. As a most common approach, solving fuzzy programming problem has been used for an uncertain multi-objective mean-variance-skewness-kurtosis portfolio optimization model [9]. Original DEA schemes involve crisp information of inputs and outputs that may not be accessible in real world applications. Also undesirable outputs may be present in the manufacturing system and to obtain a reliable measurement, a neutrosophic DEA model has been proposed [23].
A rate of return in a loss of an investment over a specified period of time may turn into negative. So an approach based on the directional distance function as Range Directional Measure (RDM) model was presented [7, 29]. The RDM model applies positive directions to measure a necessary improvement in the inputs and the outputs to reach the efficient frontier. RDM provided a DEA model which can handle the cases where inputs and outputs take negative values.
Risk is one of the important factor in investment policy and the variance of return is traditionally considered as the risk measure in portfolio management. But it is not appropriate because of its penalizing symmetrically both in profit and loss. Furthermore, it ignores the tail risk and skewness of the distribution. There are alternative measures of risk that have many theoretical and practical advantages. One of them is value-at-risk (VaR) that has been approved by bank regulators as a valid approach for calculating risk charges. However, it is not always sub-additive, and it is generally unable to detect diversification of a portfolio [37]. By employing VaR to measure the risk associated with uncertain random return, some corresponding portfolio optimization models have been studied [31]. Despite its deficiencies, VaR, is still preferred risk measure in portfolio [2, 37]. Another representative popular risk measure which is always sub-additive is conditional VaR (CVaR) and is more informative than VaR about extreme losses (for more details see [12, 22]). To cope with skewed return distributions, CVaR as a risk measure has been introduced. This measure is called Mean-Excess loss, Mean Shortfall or Tail VaR. This coherent risk measure has better computational characteristics and consequently is well adopted in finance field [1]. CVaR is proved to be stable with respect to the choice of the confidence level [33]. It was demonstrated that linear programming with CVaR constraints can be used for portfolio optimization problems [32]. It should be noted that VaR and CVaR are two tailed-related risk measures. Two fuzzy portfolio selection models integrated with DEA have been proposed by these measures under a credibilistic programming [15].
Empirical evidence shows that many of financial return series are heavy-tailed and possibly skewed [18]. So, the construction of an optimal portfolio depends on the probability distribution used to model returns. Furthermore, the effectiveness of the underlying distribution is not restricted to the financial markets, but it is applicable to the energy commodity markets and insurance companies. For instance, energy commodity markets’ returns are directly influenced by the volatility of energy prices that are closely related to the economic and financial environment and energy supply-demand situation [34, 39]. Also, the deterministic inputs with stochastic noise as the outputs were introduced according to a family of heavy-tailed stable distribution in DEA framework in insurance companies [28]. The Normal mean-variance mixture are a class of flexible multivariate distributions that can capture heavy tails and skewness [3]. These distributions are well behaved under the linear transformation with nice properties in Monte Carlo simulation and portfolio selection. The multivariate skewed t (mST) and multivariate generalized hyperbolic (mGH) distributions are subfamilies of these distributions. Choosing the proper initial values of the parameters make different skewness and heavy-tailedness in return distributions. Especially mST is the most-heaviest tailed among these distributions [3, 16]. The expectation maximization (EM) algorithm is a preferred numerical method in estimating the parameters of these distributions [18]. This algorithm is a two-step iterative process that obtains the maximum likelihood estimates of parameters. Using current parameter values, and then the function is maximized to produce updated parameter values [11].
In real-world data, due to the non-normality of returns [14, 30], skewness and leptokurtosis are two important parameters that are taken into account of return distributions. Therefore, the underlying probability distribution effects on each asset performance assessment. The return distributions have impact on the input and output amounts of optimization problem, too. Asset returns are assumed to be normally distributed, while the probabilities of tail events were not considered into the performance assessment process [2, 27]. So, we use some multivariate distributions that capture heavy tails and skewness and make better fit to real returns. In this paper, we are concerned with the underlying Normal mean-variance mixture distributions. To evaluate the performance of assets by DEA method, we consider the risk and return. We choose risk measure VaR or CVaR as the input and mean return as the only output in our models.
Empirical studies on portfolio performance evaluation with DEA structure showed that risk measure, mean returns and sometimes higher order moments of returns have been computed directly by the sample real data without considering any distribution. While in current study, the return distributions are taken into account to assess the asset performance in DEA framework that makes a proper tool to get accurate results. We focus on the family of Normal mean-variance mixture as the underlying distribution to model the return and use the risk measure of the asset to evaluate the performance. The mean return and risk measure are simulated by the Monte Carlo method with the estimated parameters of the underlying distribution.
To deal with negative values, the RDM model is applied which provides efficiency scores as well as radial efficiencies traditionally used in DEA. This model is one of the common models for evaluating performance of DMUs that we do not focus on the underlying distribution in its inputs and outputs. In addition, two models are proposed based on the mST and mGH distributions inspired by RDM which have impact on input and output. These multivariate continuous distributions are able to cover the characteristics of returns. So, we do not need to add further constraints to the models. In this regard, a two-dimensional mean return-risk space is identified in our models. Comparing the performance of the models indicates that, considering skewness and kurtosis leads to more interpretable efficiency evaluation. Moreover, improper underlying distribution makes underestimated risk measure and mean return. Also, the efficiency scores measured in such models based on mST and mGH distributions are more accurate than the RDM model. In addition, these results help the manager to be aware of company’s performance and can change his policies. Also, if the company is inefficient, the manager finds how much the risk have to be decreased and the mean return be increased. In real data analysis, to apply proposed models, first the parameters of the distributions are estimated by the EM algorithm. Then, the input and output of the models are computed by the simulated log-returns with Monte Carlo technique. All models are applied to Iran stock exchange market that includes two sets of seven groups of companies where each firm named as an asset (financial asset). So, each asset is considered as a DMU that we evaluate the efficiency of it.
We have organized the paper as follows. Section 2 is devoted to the preliminary concepts of CVaR, skewed t and mGH distributions and an introduction to the RDM model. In section 3, the effect of the underlying distribution into the evaluation of the asset performance is provided. The proposed models are based on the RDM model, by considering the mST and mGH as underlying distributions. Empirical illustrations using two sets of seven groups of companies are provided in section 4, and the last section concludes the paper.
Preliminaries
In this section, we present some definitions and concepts that are used in the following sections. First, we concentrate on the concept of coherent risk measure, introduced in [1]. Then, definitions of VaR and CVaR are provided. Next, the Normal mean-variance mixture distributions are defined. Finally, the RDM model is represented, briefly.
Monotonicity: If X ⩽ Y, then ρ (Y) ⩽ ρ (X) Subadditivity: ρ (X + Y) ⩽ ρ (X) + ρ (Y) Translation Invariance: For all α ∈ R, ρ (X + α) = ρ (X) - α Positive homogeneity: For all λ ⩾ 0, ρ (λX) = λρ (X).
Value at Risk (VaR) as one of the risk measure is a benchmark standard for firm-wide measures of risk.
The risk measure VaR at confidence level β ∈ (0, 1) is the smallest value l such that the probability that the loss exceeds l is no larger than (1 - β), the other hand
VaR is a coherent measure when the underlying distribution is elliptical [18].
The CVaR which is additionally coherent, at confidence level β ∈ (0, 1) is defined as
In other words, the mGH distributions can be represented as a Normal mean-variance mixture where the mixture variable has GIG distribution. The index parameter λ and concentration parameters χ and ψ are inherited from the mixing distribution that remains the same. Moreover, μ represents the location vector, γ the skewness vector and ∑ is the dispersion matrix. The mGH class offers a natural generalization of the multivariate Gaussian class. Potential distributions from mGH are hyperbolic, normal inverse Gaussian (NIG), variance gamma (VG), student t and skewed t distributions [18]. It should be noted that the mGH is more skewed and has heavier tails than the normal distribution. The parameter λ plays an important role in the mGH distributions.
Both distributions have some parameters in which three of them as λ, χ and ψ are the same. By choosing proper initial value of each parameter at beginning of the procedure of estimation, we can show the behavior of the distributions such as the skewness, kurtosis and heavy-tailedness. So the mGH distributions are appropriately used for cases with lighter tails than the mST. For this purpose,
For this purpose, the effects of parameters on distributions are illustrated in Figs.1 4. Figure s1 and 3 show a comparison among different values that parameters λ and ψ have taken and Figs.2 and 4 have depicted the right tails of them [17].

The density of generalized hyperbolic and Gaussian.

The density of Generalized and Gaussian at right tail.

The density of Gaussian, Skewed t and Hyperbolic.

The density of Gaussian, Skewed t and Hyperbolic at right tail.
In Fig. 1, it is shown that how the parameter λ influences the tails and kurtosis. Let λ varies from –10 to 10, set μ = 0, γ = 0, ψ = 1, χ = 1 and σ to be a constant so the variance of the generalized hyperbolic distribution is 1 with mean zero.
When |λ| is small, the tails are heavy but when |λ| becomes larger, the tails become thinner and by increasing it, the symmetric mGH distributions tend toward the normal distribution, Fig. 2.
In Fig. 3, for
It is shown in Fig. 4 that the skewed t has the heaviest tail among those tested distributions.
The RDM model was proposed by [29] and inspired by the Directional Distance Function model by [7] which can be applied for computing efficiency in the presence of negative data. In the present paper, RDM model is used, since some mean returns are negative.
The above model is a non-oriented case, where the input contraction and output expansion improve simultaneously. For a given data set, when some of them are negative, an ideal point is defined as
At the ideal point I the range of possible improvement can be seen as a surrogate for the maximum improvement that DMU o could achieve on each input and output. Such an improvement can never be negative [29].
In this section, we apply RDM model to indicate that the input and the output of a model are affected by the return distributions. As previously mentioned, the return distributions exhibit skewness, kurtosis and are heavy-tailed and they impact on the performance evaluation that leads to more precise efficiency scores. The DEA is used as an efficiency assessment tool, but the traditional DEA models are restricted to non-negative data. So because the variables in financial field such as returns take positive and negative values, we employ one of the DEA-based model that deals with negative values as RDM. In the empirical example, we show that how the data underlying distribution influences the asset performance assessment while in RDM model, it is ignored.
First, we apply RDM model as one of the existing models for evaluating efficiency on data from the stock market. It evaluates the asset performance without considering the type of return distributions. Also, in this model, risk and mean return are the only input and output, respectively. Let’s assume Y1, Y2, . . . , Y
n
be the log-returns of the n assets’ prices in stock market. For a specific asset return Y
o
where o∈ { 1, 2, . . . , n } and regarding to the negative returns value, the vector
Where
In this framework, CVaR and the mean return are the only input and output of the model, respectively. But significant value of skewness and kurtosis, indicating that the data are not normally distributed. The results reveal that VaR and CVaR tend to underestimate the risk. Therefore, this problem is carried over into the asset performance assessment.
In contrast, the mST and mGH distributions are flexible in their tails behavior. Since the return distributions exhibit skewness and leptokurtosis, the mST and mGH are appropriate candidates for return distributions. In the following two models, the mST and mGH distributions are considered as the underlying distribution, in models (2) and (3) respectively with n financial assets.
In model (2) called RDM-mST model, Y
j
is the j-th asset where Y
j
∼ mST (ν, μ
j
, γ
j
, ∑
jj
) , j = 1, . . . , n and ν, μ
j
, γ
j
and ∑
jj
are parameters of mST distribution. According to the directional vector, we solve the following model
It should be noted that the proper choice of the initial values of parameters λ, χ and ψ in mGH distributions, we can control their tails behavior and show the skewness and kurtosis values of the distributions. So the mGH distributions are appropriately used for cases which have less heavy tailed than the mST. When ψ goes to zero, a subclass or limiting distribution of mGH is asymmetric or skewed t distribution [17]. As it is cited, the RDM model underestimates the probability of the skewness and tail events, while the heavy tail properties of the mST and mGH distributions describe them well. Therefore, if we do not employ appropriate distributions including tails behavior, risk measures will be underestimated and efficiency scores will be inaccurate.
The models (2) and (3) are in two-dimensional mean return-risk space and cover the higher moments of returns, so it is not needed to consider additional constraints for them.
We remind that the optimal solution α* in models (2) and (3) indicates the inefficiency score of asset under evaluation and the asset is efficient when the inefficiency score is zero.
In order to solve the models (2) and (3), EM algorithm and Monte Carlo simulation are applied according to the following steps
The above models, CVaR can be substituted by VaR as a risk measure.
In investment policy, it is concerned to have the highest return with lowest risk. So, these items have the main role on the assets performance assessment as the input and the output in our models. In addition to the type of input and output which are effective in asset portfolio evaluation, the type of distribution also affects. So in Table 1. we highlight the novelties of the proposed models in compare with commonly used RDM model.
Comparing proposed models by RDM model
We compare the introduced models for some stock companies of Iranian financial market. Each company is considered as a financial asset. We have daily logarithmic returns of two sets of seven groups of different kinds of industries. The public information of the companies is given from Tehran Stock Exchange (TSE) market. The daily price is considered as the closed price of each asset. The first seven groups data being recorded from 18/07/2016 to 19/07/2017, and the second seven groups is 25/03/2018 to 29/04/2019 which are illustrated in Figs.5 and 6.

Stock price of first seven groups.

Stock price of second seven groups.
As in practice, real data for stock prices returns are often characterized by skewness and kurtosis and have heavy tails, first we find these numerical measures of the shape of these two data sets. As it is shown in Table 2, the skewness and kurtosis of each asset are meaningfully different from the normal distribution. So we employ other probability distributions that can be efficiently captured heavy tails and skewness in the return distributions.
Skewness and Kurtosis of the first and second groups
The input and output of models (2) and (3) come from the simulated log-returns which they are obtained by estimated parameters of mST and mGH distributions and then simulated by Monte Carlo technique. Tables 3, 4, 5 and 6 represent the mST and mGH estimated parameters of two sets of seven groups assets by EM algorithm.
Estimated parameters of mST of the first seven groups for λ = -2.5, χ = 5 and ψ = 10-6
Estimated parameters of mGH of the first seven groups for λ = 8, χ = 9.714 and ψ = 13.766
Estimated parameters of mST of the second seven groups for λ = -1.5, χ = 3 andψ = 10-2
Estimated parameters of mGH of the second seven groups for λ = 0.2, χ = 1.518 and ψ = 6.432
As it is shown in Table 2, the skewness and kurtosis values of each group are different from normal distribution. We set the initial values of the parameters of mST and mGH distributions.
Tables 7 and 8 represent the values of VaR, CVaR and mean return of both seven groups of assets. In model (1), the calculated risk measures and the mean returns depend on the use of log-returns of stock prices whereas in models (2) and (3), they are obtained by simulated returns on estimated parameters. According to the VaR and CVaR definition, we arbitrarily choose the confidence level β = 0.90. These data are used to compute the inefficiency scores in models (1) to (3).
VaR, CVaR and mean-Return of models (1), (2) and (3) of the first seven groups
VaR, CVaR and mean-Return of models (1), (2) and (3) of the second seven groups
In order to show the effect of the underlying distribution on the assets performance, we find the inefficiency scores of each asset by models (1), (2) and (3), recorded in Table 9.
The inefficiency scores under mean return-VaR and mean return-CVaR framework in models (1), (2) and (3) for each seven group at β = 0.90
From the results of Table 2, the skewness and kurtosis values of the first seven groups are less heavy tailed than the second groups. So, the normal distribution is not well-fitted for this group and the inefficiency scores are inaccurate. Furthermore, the mST distribution is not an appropriate underlying distribution, because it is one of the most-heaviest tailed distribution among the mGH distributions. Therefore, by proper choice of the initial values, the mGH distribution is well-fitted for the first seven groups. For instance, asset 4 is efficient based on model (3) but it has the high inefficiency score under mean return-VaR framework in model (1) and also it is inefficient in model (2). The underlying distributions are not normal and mST, therefore the VaR, CVaR and mean returns values are underestimated with these distributions. Moreover, the small skewness and kurtosis makes mGH to be an appropriate underlying distribution in this group. As a result, the inefficiency scores gained by model (3) are reliable to models (1) and (2).
The skewness and kurtosis of each asset in the second seven groups are significantly different from zero and 3 as the skewness and kurtosis of the normal distribution. As it is shown in Table 9, the inefficiency scores provided by model (1) are only zero and 1. For more details, the forth asset’s mean return is the highest one among these seven assets and its risk measures are the lowest among others. For the reasons indicated, asset 4 is an efficient asset based on model (1). Therefore, the mST and mGH distributions are well-fitted to the second seven groups and the inefficiency scores are precise and interpretable. Moreover, asset 6 is completely inefficient by model (1), while it is efficient in models (2) and (3). Since mST is a heavy-tailed distribution among the mGH distributions family, from the inefficiency scores, we conclude that mST is a well-fitted distribution to the second seven groups. In models (2) and (3), asset 7 is inefficient under the mST distribution, has the highest mean return and under mGH distribution has the fourth highest mean return among others. However, at the same time it has a high risk measures. So, it leads to higher inefficiency score in model (3) rather than model (2).
Comparing the models (2) and (3) with the results of model (1) we find that the inefficiency scores of these two seven groups of assets in models (2) and (3) are more accurate than model (1). Therefore, considering the different distributions into the asset performance evaluation makes different results of inefficiency scores and may effect on risk measures and mean returns values of assets. It should be noted that the initial values of λ, χ and ψ in mGH distributions depend on the skewness and kurtosis of the distributions.
Empirical studies show that the asset return distributions are leptokurtic with nonzero skewness and are not taken into account in performance evaluation. In order to evaluate the financial assets performance, we have applied the DEA method. In portfolio performance assessment in DEA framework, there is no study that focus on return distributions. Therefore, the main idea is to demonstrate how the types of return distributions affect the portfolio evaluation. For this purpose, we have introduced some models where the underlying distributions such as the mST and mGH are able to cover the skewness and heavy-tailedness of data sets. The VaR and CVaR risk measures are applied as the input of the models and the mean return is as the only output. The optimal objective value of each model indicates different maximum proportionally changes in the risk measure and the mean return of the asset under evaluation. We have given an example of two groups of assets to compare the models. For return distributions which they exhibit skewness and kurtosis, the mST and mGH distributions describe the performance of assets much better than the normal distribution and more accurately than the RDM model. In other words, the inefficiency scores measured by DEA models based on mST and mGH distributions are more accurate than the RDM model. We have observed from the first seven groups that because of the small skewness and kurtosis values, the mGH distribution is well-fitted to those data sets. It is shown that how the evaluation of the asset performance based on the return distributions in DEA framework depends on the underlying distribution. The multivariate continuous distributions mST and mGH capture more characteristics of financial data and it is not needed further constraints for them in the models. To describe the assets’ return distributions, we can apply the other distributions of Normal mean-variance mixture class and also the other classes of risk measures instead of VaR and CVaR to evaluate the asset performance with DEA method for future work. Moreover, by taking only a single input and a single output in the proposed model we are able to find the efficiency score but it can be extended to some inputs and outputs.
In addition, we can employ the traditional portfolio performance measures Treynor [38], Sharpe [35] and Jensen’s alpha [19] indices as the outputs of the model for more accuracy in asset performance.
