Abstract
The present study investigates how efficiently India-domiciled exchange traded funds (ETFs) replicate the returns of their underlying indices and analyses the factors that determine the tracking performance. We use a three-pronged approach involving Capital Asset Pricing Model (CAPM) regression, cointegration-Vector Error Correction Model methodology and tracking errors (TEs) to assess tracking efficiency. Random-effects panel regression is employed to evaluate how fund-specific factors influence tracking ability. We find that ETFs carry significantly lower exposure towards their indices than what their objective would suggest. Long-run linkages with benchmarks exist for most ETFs, but the price deviations from the indices are fairly persistent. The TEs for the majority of the funds are large and non-trivial. Bid-ask spreads, price-net asset value deviations, fund’s age and, to some extent, its size are the primary factors that influence tracking performance. ETFs in developed markets such as the USA and Europe have been found to exhibit superior benchmarking abilities. The study is expected to assist investors in developing a more efficient ETF portfolio and to help fund providers improve the quality of their offerings.
Executive Summary
The phenomenal expansion of the exchange traded funds (ETF) markets worldwide has attracted the attention of researchers and academics. Consistent underperformance by active funds, especially in the large-cap space, is strengthening the case for passive investing and thus ETFs. The ETF industry in India grew at a compounded annual growth rate (CAGR) of 50.84 per cent in the last decade, which is among the fastest in the world. The literature on Indian ETF markets is limited, moreover, with very few exceptions; most studies are based on small samples and employ primitive analytical techniques. The present article attempts to fulfil these research gaps. The objective of this article is to assess the tracking performance of equity ETFs listed on the Indian stocks exchanges and to identify the factors that determine how well these ETFs replicate the returns of their corresponding benchmarks. The sample includes 18 funds which have been analysed over a five-year period from 1 January 2014 to 31 December 2018. The tracking performance has been assessed on the basis of three approaches. First, a Capital Asset Pricing Model (CAPM) regression is run using daily ETF and index returns to understand the extent of under or over performance by the fund and to determine if the ETF managers follow a full-replication strategy. Second, using Johansen’s method we test whether the price and index series are cointegrated, that is, whether they exhibit a long-run equilibrium, additionally, we employ the Vector Error Correction Model (VECM) to investigate how persistent are the deviations from such an equilibrium. Finally, we compute three variants of tracking errors (TEs) to quantify the magnitude of the tracking inefficiency. The second part of the analysis pertains to the identification of fund-specific factors that have an impact on the fund’s tracking performance. We use the random effects method with tracking error as the dependent variable, the explanatory variables include an estimate of fund’s spread, asset size, age, volume, premium and the number of constituents in its index.
The empirical findings reveal that ETFs in India exhibit sub-optimal tracking performance; the fund managers fail to follow a full replication strategy. However, there is no indication of under or over performance. Most of the sample funds share a long-term equilibrium relationship with their corresponding indices but the deviations from this equilibrium are persistent; for the average fund, it takes approximately eight days for half of the deviation to disappear. The TEs are non-trivial and are larger than those reported for ETFs in developed regions. The panel regression shows that funds with higher spreads and premiums are likely to be poor trackers while older and to some extent, larger funds are expected to track their indices more closely. The study is expected to assist investors in developing an ETF portfolio with superior benchmarking abilities and the fund providers in improving the quality of their offerings.
Introduction
Ever since their launch in 1993 with the S&P 500 Depository Receipts, ETFs have become one of the most successful financial innovations in recent times. The primary objective of a traditional ETF is to closely mirror the returns of its underlying benchmark which may be a stock or bond index or a commodity.
An ETF is primarily a vehicle of passive investment which is continuously tradeable throughout the day and which can be created or redeemed in-kind by exchanging it with its underlying securities. 1 Their growth has been phenomenal in developed financial markets such as the US and Europe where active funds find it increasingly difficult to consistently generate above-average returns that justify their high expenses and commissions.
The objective of ETFs is similar to that of index mutual funds, that is, to provide the investors with an indexing option that allows them to hold a diversified portfolio of assets at low cost (Kostovetsky, 2003); however, ETFs are structured differently which allow them to offer some key advantages. Unlike index funds, ETFs provide intraday liquidity, they may be short sold and traded on margin. Moreover, ETF units are convertible and redeemable in-kind which allows the fund to manage its inflows and outflows without incurring liquidity costs (Gastineau, 2010). The in-kind creation and redemption mechanism also allows the fund to defer its capital gains thereby making it more tax efficient (Poterba & Shoven, 2002).
ETFs have been shown to have a significant impact on the attributes of their stocks and the financial markets in general (Ben-David et al., 2018; Da & Shive, 2018; Madura & Ngo, 2008); moreover, such impact is not just limited to country’s domestic funds but also applies to foreign funds which are traded outside the country’s boundaries but are benchmarked to its domestic stock and indices (Narend & Thenmozhi, 2019).
By May 2019, the assets invested in the Global ETF/ETP industry had touched a record high of USD 5.57 trillion (ETFGI, 2019). According to Kealy et al. (2017), if the current growth rates continue then the passive ETF industry is expected to surpass the actively managed funds by 2027. The United States is by far the biggest market for ETFs in the world. Out of the USD 4.8 trillion of assets invested in ETFs at the end of 2017, USD 3.4 trillion or about 72 per cent are in the US markets alone.
The size of the ETF market in India has historically been small but is now growing rapidly. The ETF industry registered a CAGR of 50.84 per cent in its assets under management (AUM) during the last decade which is among the highest in the world (Business Standard, 2017). Especially now with the Central Government choosing the ETF route for meeting its divestment targets, the Employees’ Provident Fund Organisation opting for them as the preferred mode of investing in the equity segment of the market and with the launch of India’s first corporate bond ETF, it is clear that ETFs are here to stay.
The key question that the present work seeks to answer is how successful are ETFs in their primary objective of tracking their underlying index closely. Further, we examine if a long-term equilibrium relationship exists between ETF price and index series and how persistent are the short-term deviations from such an equilibrium. The final objective is the identification of elements that may be driving TEs in funds and measuring their impact on tracking efficiency.
We find that ETFs in India do not track the movements of their underlying indices perfectly and generate non-trivial TEs, such a finding is expected as the index returns do not take into consideration the cost of trading and holding stocks. However, the TEs are larger in comparison with the funds operating in developed regions such as the US, Europe and Australia. Cointegration and VECM analysis reveal that the deviation between ETF prices and its index can be quite persistent. The panel regression indicates that market transaction costs such as the bid-ask spreads, deviations between ETF prices and its net asset value (NAV), the age of the fund and the size of its asset corpus are the determinants of TEs in funds.
Literature Survey
The phenomenal growth of the ETFs in the last two decades has led to an increased interest in this segment of the financial market. The literature on ETFs, especially from the developed markets, has expanded significantly. However, ETF-related research in emerging markets is still limited.
Studies from around the world have examined how effectively ETFs replicate the returns of their corresponding financial assets. Shin and Soydemir (2010) reported significant and persistent TEs and underperformance vis-à-vis the benchmark in a sample of 26 US-listed ETFs, in addition, the tracking performance of ETFs which were based on Asian indices were found to be substantially more inefficient. Similarly, Chen et al. (2017) in their analysis of New Zealand ETFs observed non-trivial TEs and low benchmark exposures. The volume and spread of the funds and the index constituents, as well as the volatility of the index, were found to be the main factors contributing to TEs. However, Johnson et al. (2013) found that Europe domiciled funds are efficient at tracking their indices and generate small TEs. Gallagher and Segara (2006) arrived at a similar conclusion for Australian ETFs.
In a multi-country analysis, Wong and Shum (2010) examined 15 ETFs from seven markets and found that ETFs always produced higher risk-adjusted returns. The authors also posit that market conditions may also impact the fund’s tracking performance; Jensen’s alpha and risk premiums were found to be higher when market expectations were bullish. Rompotis (2011) made a similar observation wherein a sample of 50 iShares ETFs was found to exceed the returns of their corresponding benchmarks. The TEs were substantial and persistent and were found to be affected by the fund’s expenses, volatility and age. Singh and Kaur (2016) who undertook a similar study for India-listed ETFs observed the presence of significant TEs; fund’s size, volume and volatility were found to be the primary determinants of tracking efficiency.
Another related area of research relates to pricing efficiency and arbitrage. Pricing efficiency implies minimization of the difference between the fund’s trading price and its NAV, that is, premium or discount. In theory, the arbitrage mechanism based on in-kind creation and redemption is expected to ensure that ETFs remain price efficient. Ackert and Tian (2008) who studied domestic and country ETFs trading in the US market found the premiums to be small but varying over time. The deviations were relatively larger and more persistent for the country funds. Kayali (2007) examined the performance of Istanbul’s first ETF, the Dow Jones Istanbul 20 (DJIST). He observed that price and NAV series are highly linked and move close together. Marshall et al. (2013) found that arbitrage opportunities in US and Swiss ETFs are exploited fairly quickly, with the median length of five minutes. Sethi and Tripathi (2019) showed that ETF returns in India are more volatile than their underlying assets and the majority of the excess volatility is attributable to factors that inhibit arbitrage.
Overall, we observe that ETFs often fail to track their benchmarks closely and generate substantial TEs, this is especially true for ETFs which track foreign and/or illiquid securities. Moreover, ETFs in developing markets are more prone to tracking inefficiencies than funds from developed regions. ETF markets in developed regions such as US and Europe have an effective arbitrage mechanism which ensures that ETF prices do not exhibit large and persistent deviations from their underlying assets. TEs are a popular metric used to ascertain the tracking effectiveness and the performance of ETFs. However, using TEs as the sole measure of an ETF’s performance can often produce misleading results (Lin & Mackintosh, 2010). More recently, authors have also employed CAPM methodology, cointegration analyses and long-term return performance to analyse ETFs. Furthermore, examination of contemporary literature reveals funds’ volume, age, expenses, volatility and size as well as the liquidity of its constituents to be the chief determinants of their tracking efficiency.
ETF related literature based on emerging markets in general and the Indian markets, in particular, is fairly limited. Few studies such as Shanmugham and Zabiulla (2012), Purohit et al. (2014), Singh and Kaur (2016) and Saji (2017) have analysed tracking efficiency of Indian ETFs. However, the analyses have mostly been based on TEs only. Moreover, most studies could examine only small samples as the market has started expanding during the last few years only.
We aim to fill this research gap by employing more standardized econometric techniques such as cointegration and error correction models, in addition to CAPM regression and TEs to examine tracking performance. The methodology adopted here allows us to investigate not just the extent of tracking inefficiency but also its persistence. The present work also attempts to identify factors that contribute to TEs; an area that has remained under-explored. Furthermore, the rapid growth of ETFs permits us to examine a larger sample than those of the above-mentioned studies which would improve the generalizability of our results.
Research Methodology
Sample and Data Description
To arrive at the sample for the study, we began by identifying all equity-index ETFs which are listed on either the National Stock Exchange or the Bombay Stock Exchange. Since the focus of the present work is on ETFs which consist of only stocks in their underlying portfolio, we exclude commodity-based funds (primarily Gold ETFs) and funds whose underlying portfolios include fixed-income or money-market securities. To ensure that all the ETFs have at least five years of trading data, we consider only those funds which had been listed before 1 January 2014 and were still in operation at the end of 2018. A few funds had to be dropped due to lack of continuous data. The final sample consists of 18 index ETFs, including sector, broad-market as well as foreign-index ETFs. This sample is one of the largest amongst studies focused on Indian ETF markets. The study period extends from 1 January 2014 to 31 December 2018. Table 1 presents a snapshot of all ETFs which form part of the sample.
Profile of Sample ETFs
The data for the study has been obtained from multiple sources. Basic characteristics such as underlying benchmark, date of inception, fund size and expense ratios have been sourced from reports of the asset management companies which operate the fund. The pricing and volume data has been obtained from the exchange where the fund is listed while reports of Association of Mutual Funds of India provide historical AUMs for the sample ETFs.
Methodology
The methodology section is divided into two parts. First, we explain the approaches employed to evaluate how efficiently ETFs replicate the movements and returns of their corresponding benchmarks. The second section discusses the possible sources that may cause the fund’s performance to deviate from that of its target index.
Evaluating Tracking Performance
We use the following three approaches to assess tracking efficiency:
CAPM Regression Model We run the following linear regression equation based on CAPM:
where rt,ETF and rt,index are the returns on the ETF and on the underlying index, respectively, on day t while rf is a proxy for the daily risk-free rate based on the yields of three-month government bonds. If the funds track their benchmarks closely then we should expect to see α, which measures the extent to which an ETF outperforms or underperforms its index, to be close to zero. Similarly, β should be close to one, as it captures ETF’s exposure towards its index. Cointegration and Error Correction Model Cointegration between two variables implies that they share a common stochastic trend (Hanck et al., 2019). Cointegrated variables are characterized by a long-term equilibrium and the time paths of such variables are influenced by the extent of deviation from this equilibrium (Enders, 2008). We use the cointegration methodology to determine the existence of a long-term equilibrium relationship between the prices of ETFs and their underlying indices. Johansen’s (1991) procedure is employed as a formal test of cointegration between the two series. As per this method, the existence of a cointegrating relationship is determined on the basis of the maximum eigenvalue (λmax) and the trace statistic (λtrace) which are expressed as follows:
where λ
i
is the value of the characteristic root or the eigenvalue obtained from the estimated π matrix. ‘T’ refers to the numbers of usable observations available. Furthermore, the existence of a cointegrating relationship would allow us to model the ETF and index prices in the form of an error correction model. Specifically, we estimate the following VECM:
where αe,0 is the intercept and αe,i and be,i are the lag operator coefficients. Of primary interest, here is Zt–1, that is, the error-correction term (ECT) which measures the rate at which ETF prices adjust to correct any imbalance from the long-run equilibrium. The speed of adjustment coefficient (δ) indicates the rate at which ETF prices adjust to correct any imbalance from the long-run equilibrium between the ETF and the index. Following Chen et al. (2017), we also compute the half-life
2
of the price deviation which is an estimate of the number of days required to eliminate half of the ETF-index deviation. The optimal number of lags have been chosen on the basis of a combination of lag-length criterions, namely the Final Prediction Error (FPE), Schwartz Information Criterion, Akaike Information Criterion and the Hannan–Quinn Information Criterion. Tracking Errors The final method used to assess the tracking performance of ETFs is TEs. We follow Frino and Gallagher (2001) and Shin and Soydemir (2010) to compute three variants of TEs. TE1 is computed as the average of the absolute difference between the daily returns on the fund and the index.
TE2 is based on the standard deviation of the differences between the ETF and index returns.
where ept = rt,ETF – rt,index Finally, TE3 may be calculated as the standard error of the residuals from the following regression:
Determinants of Tracking Efficiency
In this section, we identify factors that may be affecting the TEs and thus leading to sub-optimal replication. These factors have been selected on the basis of general economic reasoning and existing literature.
The following panel regression model captures both cross-sectional and time effects in TEs:
sprdi,t is the estimated bid-ask spread of ETFs which has been computed using daily high, low and close prices using the formula developed by Abdi and Ranaldo (2017), abspremi,t is the absolute difference between ETF price and NAV expressed in percentage terms. lnaumi,t represents the AUM of the fund while lnvoli,t is average daily traded volume, lnagei,t indicates the number of years since the fund’s inception. Lastly, lnindexconsi is the number of constituents in the fund’s index. The last four variables have been expressed in natural log terms to adjust for skewness. The time unit for the model is a year.
The spread of the fund is expected to have a positive relationship with TEs as funds with higher spreads are costlier and hence more difficult to trade. Consequently, the fund’s price may not always closely represent the value of the underlying index. Similarly, a higher premium indicates that fund prices vary widely from the value of its underlying securities and is likely to be associated with higher TEs. Daily traded volume measures liquidity; liquid funds are expected to track their indices more closely. Likewise, higher AUM is likely to enhance tracking performance as the fund’s expenses are borne by a larger asset base. We expect the tracking performance to improve with the age of the fund as the manager and authorized participants become more efficient in managing inflows and redemptions; moreover, the initial time period in the fund’s life is usually characterized by demand and supply imbalances which worsens tracking ability. Finally, lnindexconsi is expected to be positively related to TEs as a higher number of underlying securities may translate to higher trading, holding and rebalancing costs. The description and the relevant literature for the variables used in the regression analysis is presented in Table 2.
Description of Variables Used in the Tracking Error Regression
The dependent variable is the average TE. There are three specifications of the model: one with each variant of TE. We impose the random effect assumption on the model on the basis of the results suggested by the Sargan–Hansen test of overidentifying restrictions, which is a robust Hausman type test used for choosing between fixed and random effects when cluster-robust standard errors are employed.
Findings
The descriptive statistics for the main variables used in the analysis are given in Table 3. We observe that the value of TE2 and TE3 is greater than one (in percentage terms) while TE1 is slightly lower at 0.93 per cent. The high values of TEs provide preliminary evidence which indicates that non-trivial tracking inefficiencies exist in the sample funds.
Summary Statistics of Main Variables
Next, we regress the excess returns of the ETFs on the excess returns of their underlying benchmarks. The regression is based on daily data; the results are presented in Table 4.
Regression of ETF and Index Return
*, ** and *** denote significance at 10%, 5% and 1% levels, respectively.
We find that the intercept term for all the funds is very close to zero and is statistically insignificant for all funds except two (where it is weakly significant). Thus, the funds do not appear to underperform or outperform their underlying indices. This finding is in line with the passive nature of ETFs whose primary objective is simply to produce returns that are commensurate with the returns of their benchmarks. The beta coefficients of all funds exhibit statistical significance; however, all the values are smaller than one. With a mean value of 0.6196, the coefficient of the average fund is substantially lower than unity which suggests that ETFs fail to follow a full replication strategy (Rompotis, 2012) and have far lower exposure to their benchmarks than their objectives would suggest (Chen et al., 2017). The average R2 of the regression is 0.3374, implying that only about a third of the variation in ETF prices is explained by the index movements. Overall, the low values of the coefficient and the R2 are suggestive of poor tracking efficiency. These results may be attributable to low levels of trading and liquidity in the ETFs as well as a possible cash drag as fund managers are required to maintain some cash balance for dividend payouts (Chen et al., 2017). Purohit and Malhotra (2015) suggest that a demand and supply imbalance in the ETF market may also be responsible for such findings.
Next, using the ADF and the KPSS tests we find the log ETF prices and log index series to be integrated of order one, hence a test of cointegration may be suitably applied. 3 We present the results of the Johansen’s test of cointegration between log ETF prices and log index values in Table 5. The results of the VECM and the half-life measure are also reported alongside.
The trace and max-eigenvalue test statistics from the Johansen’s test reveal that for all but two ETFs the price and index series are cointegrated. Hence, nearly all the ETFs in our sample share a long-term equilibrium relationship with their underlying benchmarks. In order to examine the error correction mechanism between the ETF and its index in case of a deviation, we employed the VECM to those funds whose price and index series are cointegrated. Table 5 reports the ECTs from VECM. Eleven of the funds generate ECTs which are negative and statistically significant indicating that the price-index system is stable and exhibits long-run convergence. The average ECT (based on significant values) is −0.1711 implying that about 17 per cent of the disturbance in the ETF-index equilibrium is corrected by the movement in ETF prices.
Cointegration in ETF and Benchmark Prices and the VECM
*, ** and *** denote significance at 10%, 5% and 1% levels, respectively.
The mean half-life of ETFs with significant ECTs is 8.05 days; hence, it takes about eight days for half of the disequilibrium between fund prices and its index to be corrected. The half-life measures of the ETFs which track foreign indices (N100 and HNGSNGBEES) are much higher than average which is a likely consequence of asynchronous trade timings and higher costs of arbitrage associated with such funds. However, the maximum half-life value of 51 days is generated by the KOTAKPSUBNK which is surprising and points towards critical issues in the tracking ability of the fund. On the whole, the tracking inefficiencies appear to be quite persistent.
In Table 6, we present the mean TEs based on daily data. TEs are a commonly reported statistic as they help in quantifying tracking inefficiencies. We report the values for all the three measures of TEs. There are slight differences in the values of the variants. The standard deviation based TE2 produces the highest values while TE1 which is based on the absolute differences between the ETF and index returns generates the lowest estimates. TE3, which has been computed as the standard error of the regression between ETF and index returns lies between TE1 and TE2. Based on TE3, the TEs range from 0.20−2.73 per cent; with half of the funds producing a mean TE in excess of 1 per cent. For four funds, the TEs cross the 2 per cent mark. At 1.36 per cent, the TE of the average fund in the sample is fairly substantial.
Daily Average Tracking Errors (TEs) of Sample ETFs (in%)
Rompotis (2011) in his analysis of 50 iShare ETFs domiciled in the US reported a mean TE of 0.63 per cent, Johnson et al. (2013) found the European ETFs to be generating an annualized TE between 0.04−0.21 per cent while Chen et al. (2017) in their study on New Zealand-based ETFs report TEs in the range of 0.65−0.94 per cent. Thus, the tracking performance of India domiciled ETFs appears to be less efficient as compared to the funds from more developed regions. The higher TEs may be attributable to the emerging nature of this market and consequently the low levels of trading and liquidity. We discuss these issues in more detail now.
Next, we use the panel regression methodology to examine the role of fund-specific factors, which were discussed earlier, in generating TEs. Table 7 reports the results of the random-effects panel regression of TE determinants. The spread variable is highly significant and positive in all the specifications which suggest that the cost involved in trading ETFs is a major factor that contributes to TEs. As the ETF market in India is still characterized by sparse trading and low levels of liquidity, the bid-ask spreads tend to be relatively high which in turn magnifies the TEs of the funds. The absolute premium variable is also of the expected sign and exhibits statistical significance for all variants of tracking error, thus funds whose price diverges widely from their NAVs are poor trackers of their underlying benchmarks. This finding implies that pricing efficiency and tracking efficiency are inter-connected. The AUM variable which is an indicator of the fund’s size is significant and negative in the case of TE1 and insignificant (and negative) for the other two measures; thus, larger funds do seem to generate lower TEs but the effect might be spurious. Fund’s age appears to be negatively related to tracking inefficiency, that is, older funds track their indices more closely. This finding may be attributable to large fluctuations in prices caused by a demand–supply imbalance in the early phase of the fund’s life. lnvol and lnindexcons variables were insignificant in all the specifications.
Determinants of Tracking Errors
*Significant at 10%, **significant at 5% and ***Significant at 1%.
The model is most successful in the case of TE1, all the factors taken together account for about 92 per cent of the variation in it. It explains about 65–66 per cent of the variation for the other two TE measures.
Conclusion
The present work studies the effectiveness of India-domiciled ETFs in achieving their primary objective of benchmark replication. Low-benchmark exposure and high TEs suggest that the funds exhibit serious inefficiencies in their tracking mechanism. The tracking performance of the sample ETFs is significantly poorer in comparison with their counterparts in more developed regions.
The ETF-index disequilibrium has been found to be quite persistent in the short run, implying that tracking inefficiency is not only large but also non-transitory. As evidenced by low beta values in the CAPM regression, the funds do not seem to be employing full-replication strategies. Panel regression analysis shows that TEs are positively affected by the fund’s spread and premium. Moreover, older funds and, to some extent, larger funds are expected to be associated with better tracking efficiency.
Implications and Recommendations
The sub-optimal tracking performance is a cause of concern for the ETF markets as well as the investors who hold such products as they may earn returns which are significantly different from the benchmark which the fund is expected to mimic. Since the index represents only a paper portfolio, thus TEs can only be minimized and not eliminated. We believe that investors who base their investment decisions on factors identified in this study should be able to minimize the TEs of the ETFs in their portfolio. At the same time, fund providers should take these factors into account and adopt measures that would improve the tracking performance of their products.
Footnotes
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.
Funding
The authors received no financial support for the research, authorship and/or publication of this article.
