Abstract
The goal of this research was to investigate the determinants that influence foreign tourism arrivals to the African continent, firstly as a collective and secondly in different regions, with the aim to foster a greater understanding of how African countries and regions can grow their tourism economies. Using static and dynamic panel estimators, two key findings were identified from this research: (1) tourism to the continent is influenced by income in developed countries, prices, telecommunication infrastructure and geographical factors as well as conservation efforts and (2) the regions in Africa do not all react the same to changes in these factors.
Keywords
Introduction
In recent years, the African tourism economy has seen a steady growth in the number of foreign tourist arrivals; however, this growth is still only prominent in a few African countries. This is in contrast to the potential that tourism development offers the continent in terms of its impact on employment, exports, tax income generation and the stimulation of infrastructure provision. Fayissa et al. (2008), therefore, correctly point out that the potential for economic development through tourism growth has not been harnessed effectively in Africa. This is echoed by Sahli and Nowak (2007) who found that tourism-led growth is especially relevant for developing countries, while Christie et al. (2013) argue that many African economies still fail to fully understand tourism’s potential as a driver of economic development and sustained economic growth. The need for improvement and development is particularly evident when considering regional cooperation and integration of economic policy, infrastructure and management.
This is unfortunate, as developing African countries could procure the greatest benefits from tourism, since Africa offers something that few other destinations can in terms of scenery, natural and cultural resources and adventure. However, these particular characteristics are difficult to quantify and are often not included in traditional tourism demand models, contributing to the ignorance of the drivers of tourism in Africa. For example, during the late 1990s, attention was focused on economic drivers or price differentials as the main determinants of arrivals to a destination. Within the African context, Naudé and Saayman (2005) – as one of the earliest studies on tourism to Africa – noted that price competitiveness was not the primary inhibitor of African tourism growth. Analysing the factors influencing tourism demand in Africa as a whole, and also in various regions of the continent, would therefore foster a greater understanding of how African countries and regions can grow their tourism economies. This article aims to make a contribution within this context and distinguishes it from previous research on Africa using more recent data over a longer period of time and by recognizing that the African continent is diverse.
The goal of this research is therefore to investigate the determinants that influence foreign (non-African) tourism arrivals to the African continent, firstly as a collective and secondly in different regions. Using balanced panel data for 25 African countries over 10 years and for five regions in Africa, the research allows for regional analysis and comparison, which provide greater insight into tourism to Africa and, for the first time, the regional differences in African tourism are explored. This may contribute to better policy formulation as well as destination management and regional development.
Literature review
In determining the demand for a certain destination or country, one should consider how demand will be measured. According to Wu et al. (2017), the most prevalent measures are tourist arrivals, followed by tourist expenditure and number of nights stayed in a particular destination. Tourist arrivals are also used to measure tourism demand in this research. Concerning research on tourism demand, the following aspects are reviewed: (1) the standard variables and the evolution to ‘new’ variables; (2) common countries and regions investigated as well as the differences; and (3) the methods and data applied.
Standard and ‘new’ variables
According to Lim (1997), tourism demand can be determined by a few standard tourism flow factors, which, in general, include income from the country of origin, price differentials, travel costs, exchange rates, competing destinations and marketing expenditures, to name a few. Income of origin is the most popular measure used in tourism demand studies (Eilat and Einav, 2004; Lim, 1997; Naudé and Saayman, 2005; Witt and Witt, 2995), and gross domestic product (GDP) is most commonly used as a proxy for income in tourism demand studies (Dogru, Sirakaya-Turk & Crouch, 2017). During the early 2000s, most tourism demand studies relied on variables such as income, GDP, tourism prices, transportation costs, exchange rates, substitute prices, relative prices, hotel prices and various dummies controlling for adverse events such as the 1974 oil crises, 1991 Gulf war and 11 September 2001 or 9/11 (see Bonham et al., 2009; De Mello and Nell, 2005; Dritsakis, 2004; Garín Muñoz and Montero Martín, 2007; Li et al., 2006; Lim, 2004 as prime examples of tourism demand studies that included similar variables). The main differences were in the data ranges, the methods used and the origin and destination countries.
The influence of seasonality, weather patterns and religious holidays has also contributed to a wealth of information. Here, quarterly or monthly data are often used to illustrate the differences of seasonality or weather-permitting holiday periods. Studies related to holidays/Easter (González Gómez et al., 2011), seasonality (Deng and Athanasopoulos, 2011; Zhang and Kulendran, 2017) and weather patterns/climate change (Li et al., 2017; Priego et al., 2015; Ridderstaat et al., 2014; Saayman and Saayman, 2008) have contributed to an enriched understanding of cycles and trends in travel flows.
While standard determinates are still largely relied upon, more recently, studies have included variables such as user-generated data sources from search engines or Google Trends (Li et al., 2017; Rivera, 2016; Yang et al., 2015) and understanding the influence of acts of terrorism on tourism (Buigut and Amendah, 2016; Hamadeh and Bassil, 2017; Karl et al., 2017; Samitas et al., 2018). There is also a growing trend to investigate regional tourism economies (Bo and Ningqiao, 2018; Guizzardi and Stacchini, 2015; López et al., 2016; Luo, 2017).
For the purpose of this investigation, the inclusion of traditional determinants such as income from origin countries (GDP of Organisation for Economic Cooperation and Development (OECD)) and price are used. With reference to African arrivals from origin countries, the majority are comprised of the 36 OECD countries. The OECD countries are distributed within Western (17) and Eastern and Central Europe (9) also known as the European Union (EU)-28, North America (3), Asia (2), Middle East (2) and Australasia (2). Africa’s traditional source markets include Western Europe and North American countries, with the exclusion of Mexico, while the remainder of OECD countries are classified as emerging countries for African arrivals. Within the EU-28, it is estimated that around 57% of Europeans have the propensity to travel outbound (International tourism trends, 2016), furthermore, Europe (France and United Kingdom) and the United States are the largest inbound source markets to Africa. Therefore, the OECD GDP is an appropriate proxy for income of origin country, since traditional and emerging source markets are accounted for.
In addition to these, health, safety, telecommunication infrastructure, level of development and qualitative and geographic dummies are included. Research related to the African continent and determinants of tourist arrivals are scare with the exception of Naudé and Saayman (2005), Kareem (2008) and Fourie and Santana-Gallego (2013). Some variables used by Naudé and Saayman (2005) include political stability, communication infrastructure and marketing, level of development, health risks, tourism infrastructure and geographic dummies such as sharing a border with South Africa. Kareem (2008) made use of both income and consumer price index (CPI) as well as political stability, crime rate, exchange rate appreciation and infrastructure (number of telephone lines). Fourie and Santana-Gallego (2013) also made use of standard determinants such as income (both origin and destination countries), land size, regional trade partners, common borders, language, religion and former colonial ties. Therefore, from these articles, a list of variables were created based on available data, as subsequently discussed in the following sections.
Origins and destinations
The origin countries in many tourism demand studies are usually more sophisticated countries or regions, such as the OECD countries or the EU-28 countries. However, within these regional/economic compositions, some outliers are present. Origin countries usually investigated include the United Kingdom, United States, France and Germany (De Mello and Nell, 2005; Dritsakis, 2004; Li et al., 2006; Shahrabi et al., 2013); however, gradually, Asian-Pacific countries, such as China, Japan, Korea and Australia have attracted more attention (Bonham et al., 2009; Deng and Athanasopoulos, 2011; Lim, 2004; Zhou Grundy and Turner, 2014). Much of the country choice would be dictated by data availability, author preference or specific international events.
Destination countries are in many cases also part of the OECD or EU-28 countries, usually with a warmer climate such as Greece, Spain, Portugal and Italy, or island economies, including Hawaii (state) and Aruba (Bonham et al., 2009; De Mello and Nell, 2005; Dritsakis, 2004; Garín Muñoz and Montero Martín, 2007; Li et al., 2006; Nelson et al., 2011; Ridderstaat et al., 2014). Recent research has also focused on countries with other attractions, including countries such as Macau and Hong Kong for shopping and casino visits (Kim et al., 2011; Lee, 2011).
Concerning Africa, tourism demand studies have mainly focused on individual countries such as South Africa (Burger et al., 2001; Durbarry et al., 2009; Gil Alana, 2011; Moyo and Ziramba, 2013; Saayman and Saayman, 2008; Seetanah et al., 2010), Zimbabwe (Chigora and Vutete, 2015a; Chigora and Vutete, 2015b; Mpofu, 2009; Muchapondwa and Pimhidzai, 2011), Egypt (Faragalla, 2018; Ibrahim, 2013; Wahba and Elanani, 2017), Tunisia (Choyakh, 2008; Choyakh, 2009; Gasmi and Sassi, 2015; Hathroubi, 2011; Ouerfelli, 2008; Ouerfelli, 2010; Sassi, 2014), Mauritius (Gooroochurn and Sinclair, 2005; Gopaul and Cheeneebash, 2015; Seetanah et al., 2015) and Nigeria (Awaritefe, 2007; Bankole and Babatunde, 2010; Eja et al., 2012). Exploring more African countries or the continent as a whole or regionally has attracted less attention, with the exception of Naudé and Saayman (2005), Kareem (2008) and Adeola et al. (2018).
Methods and data
Data and the methods/procedures employed in tourism demand studies are vast. Goh and Law (2011) state that, in the 1990s, econometric approaches to tourism demand modelling focused on the use of single equation regression models. However, as noted by Crouch (1994) and Lim (1997), limitations in these early studies include the existence of multicollinearity and serial correlations as well as possible spurious results due to the lack of unit root tests. This resulted in more sophisticated econometric approaches, such as systems of demand equations, vector autoregression, cointegration, error correction model and panel data approaches (Goh and Law, 2011). The more sophisticated approaches incorporate dynamics of tourism demand such as time lags, information asymmetry, repeat visits, word-of-mouth recommendations, supply rigidities and long-term adjustments (Song et al., 2012). According to Lim and McAleer (2001), the performance of forecasting models can be affected by the type of data (monthly, quarterly and annually) used.
Methodology
The use of panel data is especially popular because it addresses the problem with omitted or unobserved variable bias more effectively than cross-sectional data regressions alone (Kirchgässner et al., 2013). The use of cross-sectional data limits the causal relationship due to (1) unobserved variable bias, (2) endogeneity bias and (3) indeterminacy of the causal mechanism (Berrington et al., 2006). For this reason, the use of panel data is more appropriate as it provides more reliable estimations and greater leverage on causality due to the time component and not the sequencing of the variables as is the case with cross-sectional data alone.The complete panel used in this article consists of 25 African countries (see Table 1) and 10 years of data (i.e. N = 25 and T = 10). It is therefore not a large panel in the time dimension, which causes some constraints in using time-series techniques in the analysis. The panel, therefore, calls for methods that can accommodate a relatively large cross section (when the total panel is analysed) and a relatively small-time dimension. Apart from the regular fixed-effects estimation – also referred to as the least square dummy variable (LSDV) method – these types of panels are also modelled using generalized method of moments (GMM) and bias-corrected LSDV to estimate reliable results (Williams, 2015).
Countries used in the analysis.
Generalized method of moments
According to Blackburne and Frank (2007), small T-panel estimations rely on random or fixed effects or a combination of fixed-effect estimators and instrumental-variable (IV) estimators such as the GMM estimator. These methods require pooling individual groups and allowing only the intercepts to differ across the groups. As a result of bias problems in dynamic panels, two solutions are proposed: (i) introduce exogenous variables (not viable with small T-samples due to bias) and (ii) use IV methods such as the GMM estimator (widely used in panels with small T) (Asteriou and Hall, 2015). According to Garín-Muñoz (2006), unless the time period in a panel is large (close to infinite), random-effect estimators are biased and inconsistent. A solution is to use fixed effects or the dynamic GMM procedure of Arellano and Bond.
Arellano and Bond (1991) state that the GMM estimator optimally exploits all linear moment restrictions that follow from the assumption of no serial correlation in the errors in an equation that contains individual effects, lagged dependent variables and no strictly exogenous variables. One of the main advantages of GMM is that it allows estimation in systems where the number of unknowns is smaller than the number of moment conditions and to test whether the moment conditions hold (Sheppard, 2005).
Two forms of the GMM are available: (1) difference GMM and (2) system GMM. According to Roodman (2009), the GMM estimation starts by changing all regressors, usually by differencing, therefore referring to the differenced GMM, where the first-differenced instrument variables are uncorrelated with the fixed effects. The system GMM derives from the differenced GMM by introducing more instruments for efficiency and building a system of two equations – the original and the augmented one (Roodman, 2009).
In determining GMM estimators, a one-step or two-step approach can be followed, with the latter being the most popular (Drukker, 2010). The main advantage of the two-step approach is that the number of equations and parameters in the nonlinear GMM step does not grow as the number of perfectly measured regressors grows. However, the one-step GMM does not offer the estimating simplicity of the two step but is asymptotically more efficient (Erickson and Whited, 2002). Bond and Windmeijer (2002) state that one-step GMM estimators are asymptotically equivalent to optimal two-step GMM estimators.
According to Mileva (2007), certain problems might arise from using panels with small T and moderate N (such as in this article T = 10; N = 25) including (1) causality might run in both directions, where regressors might correlate with the error term; (2) time-invariant country fixed effects might correlate with explanatory variables; and (3) the presence of a lagged dependent variable gives rise to autocorrelation, while the Arellano–Bond (AB) estimator was designed for small T and larger N panels (Mileva, 2007). This estimator is, therefore, applicable to the larger African panel, but not to the regional panels, where the cross-section dimension (N) is considered to be too small. For these panels, an alternative method should be considered.
Bias-corrected LSDV
Observed bias in panels characterized by small N, small T or both (such as with this panel) necessitates comprehending the properties of different estimators on the estimated fixed effects (Buddelmeyer et al., 2008). One solution to counter the bias within short panels is to make use of a bias-correction technique. In recent years, the bias-corrected LSDV method has become popular, especially the techniques based on Kiviet (1995) and Judson and Owen (1999), which have been shown to outperform GMM and IV estimators (Bun and Carree, 2005).
According to Bruno (2004), the bias-corrected LSDV is more accurate than traditional GMM estimators, especially when small panels are considered, or at least where N is only moderately large. According to Kiviet (1995), LSDV estimators are also biased, although the standard deviation is often much smaller compared to the first-differenced Anderson–Hsiao (AH) IV estimator models and various GMM estimator models. Moreover, the bias-corrected LSDV is not outclassed and is still reasonably easy to calculate, whereas the IV and GMM estimations deliver poor results, especially in restricted conditions (Kiviet, 1995). Panels with small time dimensions (T) deliver the most appropriate results by means of the bias-corrected LSDV.
It is due to these properties that the smaller panels in this research will be estimated using the bias-corrected LSDV. This estimator, together with the AB GMM, will also be applied to the total African panel, for robustness.
Case study
Within the context of this research, inbound tourism refers to tourism activity attracted from other continents to Africa and does not include inter- or intra-African travel. According to Rogerson (2007), the systematic colonization of Africa by Europeans contributed to the establishment of Africa’s major tourism source markets. The United Nations Economic Commission for Africa (TII, 2011) states that the traditional source markets (Britain, Europe and the United States) have been the backbone of the tourism industry to Africa, and the pace of African growth, especially tourism receipts, is heavily dependent on the welfare of these regions (presumably based on income of origin countries and consumer prices in Africa).
Traditional source markets have been sustained very successfully over the years, yet are dwindling and ageing rapidly (WEF, 2013a, 2013b). Emerging markets such as China, Latin America and Russia have started to travel more extensively, also to Africa (Christie et al., 2013; UNWTO, 2010). For example, China held a 9.5% global market share in 2012, which constitutes a 40% increase from 2011. This makes China the largest and most affluent travel market, spending over US$107 billion (UNWTO, 2013a). It may be argued that much of the expansion into Africa is due to trade between the BRICS (Brazil, Russia, India, China and South Africa) countries. This might mean that Africa has positioned itself to benefit from increasing tourism from emerging markets. International tourism arrival statistics show that the majority (52%) of international tourists still travel to Europe, followed by Asia and the Pacific (23%), the Americas (16%), Africa (6%) and lastly the Middle East (5%) (UNWTO, 2013b). However, the demand for destinations in Asia and the Pacific (+6%) and Africa (+6%) is growing rapidly. The growing importance of Africa as a tourism destination is not matched by an increase in research though, with Xiao and Smith (2006) describing the state of research on Africa as ‘low’ compared to tourism research on North America, Europe, Asia and Australasia.
However, Africa differs from other continents due to various difficulties such as a lack of infrastructure, lack of available data and in many cases political and social unrest. The fact that the majority of least developed countries worldwide (34 of 48) are African countries is very worrisome and could be a potential contributor to low inbound travel flows to the continent, since crime is the social cost of poverty and inequality (Bourguignon and Des Hautes, 2001). According to Christie et al. (2013), Africa’s tourism is threatened by land availability, investor access to finance, taxes on tourism investments, low levels of tourism skills, lack of security and safety, high crime, public health, visa requirements, red tape and bureaucracy. Fundamental developments by governments regarding air transport infrastructure, appropriate accommodation and access to tour operators are therefore needed.
Bloom et al. (1998) state that Africa’s geography makes cultural interaction between Africa and the rest of the world, as well as within Africa, very difficult. This is further complicated by the vast area (30,221,532 km2) to be travelled as well as the goods that need to be transported to destinations in and across Africa (Turvill, 2013). Moreover, the constraints that certain African countries have with respect to transport services and infrastructure make it increasingly difficult to accommodate economic activity such as travel and trade (Rodrigue et al., 2013). Seddighi et al. (2001) emphasize that political instability influences the tourism industry because of its sensitive nature, the increasing competition between destinations and the narrow profit margins.
Model specification, data and variables
This research followed the economic approach to demand, where the quantity demanded depends mainly on income, price and other factors that influence tastes and preferences. The following equation is a representation of the simplified model and forms the basis of the estimated models in this article
where
The models are estimated using the complete panel of data for all 25 African countries (i.e. balanced panel of N = 25 and T = 10) as well as for the five regional areas of the continent (i.e. Northern, Eastern, Southern, Western and Central Africa). For Northern Africa, the panel, therefore, consists of four countries (N = 4) and 10 years’ data (T = 10); for Eastern Africa N = 7 and T = 10, for Western Africa N = 5 and T = 10, for Southern Africa N = 6 and T = 10 and for Central Africa N = 3 and T = 10. In this research, demand is measured using foreign arrivals. Although demand can also be measured as real spending by tourists, this option is not available, since tourism receipt data for African countries is even scarcer than arrivals data. According to Sharma (2004), some problems are associated with arrivals data, especially concerning the accuracy.
Using the basic demand specification above (equation (1)), it is evident that the demand for a destination depends on income at the origin, prices at the destination and other tastes and preferences. To measure preferences and tastes for Africa as a destination, this research relies mainly on previous research done on the African continent, specifically that of Naudé and Saayman (2005). Factors to consider, according to these authors, are political stability and personal safety, health risks, the level of development, available infrastructure and tourism marketing. For all the panels, the basic model specification is
with L indicating that natural logs of all variables were taken to ensure standardization and ease of interpretation. Systematically defining the variables employed in equation (2) provides clarity. LARRIVE is the dependent variable, which includes all arrivals to Africa of non-African countries, ensuring that intra-African arrivals are not included, while the independent variables include the following:
LAGDP is the advanced economy GDP, which is expressed by the GDP of the OECD. GDP only has variation in the time dimension and not in the cross-sectional dimension, which implies that time-fixed effects cannot be used together with the income proxy. Since tourism is often classified as a luxury good, a positive sign is expected as well as a coefficient larger than 1.
LRELCPI is the relative CPI, which is expressed as the ratio of the African country CPI divided by advanced economy (OECD) CPI. Again, OECD is used due to the importance of OECD countries as African arrival source markets. According to this calculation, an increase in destination CPI causes the relative CPI to increase and due to the law of demand, it is expected that arrivals will decline. A negative sign is therefore expected.
LTB accounts for health risks, the incidence of tuberculosis per 100,000 inhabitants of an African country. It therefore measures tourists’ exposure to dangerous diseases. Higher incidence is expected to result in less tourism, and therefore, a negative sign is expected.
LSAFETY measures four dimensions of governance (personal safety, rule of law, accountability and corruption and national security) on a scale from 1 to 100. A lower score indicates that a country has more safety concerns and less rule of law, and therefore, a negative sign is expected.
LINTERNET is the measure of Internet penetration and subsequently refers to telecommunication infrastructure, the number of Internet users per 1000 people. A positive sign is expected with more Internet users indicating higher levels of communication infrastructure (i.e. accessibility and marketability the destination).
LURBAN and LDEATH are two proxies for the level of development of the African country as expressed by urbanization rate and the death rate. A more urbanized economy and an economy with a lower death rate are both indicators of a more developed economy. Therefore, a positive sign is expected for the urbanization rate and a negative sign for the death rate.
This basic specification is also expanded by including a number of variables accounting for specific qualitative characteristics (QDUMMY) and to control for the geographical attributes (GDUMMY) of African countries.
In terms of the specific qualitative characteristics (QDUMMY), Africa is often visited for its natural beauty, wildlife and unspoiled marine life. This is added to the basic model by including two variables that were not used previously within the African context, namely: LPROTECT measures the percentage of the total territory, that is, a terrestrial protected area. LMARINE measures the percentage of the total territory, that is, a terrestrial or marine protected area – both obtained from the World Bank Development Indicators (WBDI) database.
Since the variable has little to no variation over time, it is constant in the time dimension, but varies in the cross-sectional dimension of the panel. Both, therefore, serve as fixed-effect dummies and are included in the regressions separately. All analyses were done using Stata 14.
Furthermore, as in the research by Naudé and Saayman (2005), geography is expected to influence tourism arrivals in Africa and three dichotomous (dummy) variables were coded to account for this (GDUMMY): Landlocked countries (LLC) is a landlocked dummy, since being landlocked not only indicates lower accessibility but also excludes access to beaches. In the tourism environment, a negative sign is therefore expected. NADUM is a Northern African dummy that takes the value of 1 for countries that are located in the northern part of the continent. Geographical proximity to Europe may create an advantage in terms of tourism flows to these countries and a positive sign is expected. SABORDER is a dummy variable that measure closeness to South Africa – one of the main destinations in Africa. The variable is coded to take the value of 1 if the country borders South Africa; and 0 otherwise. It is expected that the relationship will be positive, since being close to a major destination may improve accessibility.
Results
The results are discussed for Africa as a whole and are then shown for each region, namely, Northern Africa, Eastern Africa, Southern Africa, Western Africa and Central Africa.
The models were estimated using the standard pooled estimator as well as fixed and random effects, and the two tests were conducted to determine the appropriate estimator: the Breusch–Pagan Lagrange Multiplier (LM) test for random effects and the Hausman test to determine whether random or fixed effects are more appropriate. The null hypothesis of the Breusch–Pagan LM test is that the pooled ordinary least square (OLS) is consistent, while the null hypothesis of the Hausman test is that the random-effects model is consistent.
The Breusch–Pagan LM test only rejects the pooled OLS estimator in favour of random effects in one instance, namely, for the total African panel. For the remainder of the panels, the pooled OLS estimator is consistent. The Hausman test subsequently indicated that for all the African panels, random effects could be rejected in favour of fixed effects (see Online Supplemental Appendix, Table A1). Therefore, the pooled OLS estimator together with the LSDV and bias-corrected LSDV estimators were used for all the smaller panels, while the larger African panel was estimated using fixed effects, the LSDV and bias-corrected LSDV estimators as well as the AB estimator.
Results for Africa as a whole
For the total African panel (i.e. 25 countries and 10 years), the basic regression (equation (2)) was first estimated using the LSDV to account for fixed effects as well as the robust option to account for heteroscedasticity in the error term. The results are shown in column (1) of Table 2. Since the fixed-effects estimator cannot accommodate any other dummy variable, the remainder of the static models was estimated using an LSDV approach, with a dummy variable for each destination country. This estimator is equivalent to fixed effects because it allows for different constants for each cross section, that is, αi and not a constant intercept (α) as in equation (1) (Asteriou and Hall, 2015). Using the LSDV, it is also possible to include other dummy variables. The five dummy variables accounting for various geographical and conservation (or tourism-specific) effects were included separately to test their significance in explaining tourism flows to Africa. The results are reported in columns (2) to (6) of Table 2.
Static estimation results.
Source: Compiled by author from STATA output.
Note: LSDV: least square dummy variable; NADUM: Northern African dummy; SABORDER: South African border. Standard errors are given in parentheses.
*p < 0.01; **p < 0.05; ***p < 0.001.
The static regression results in Table 2 indicate that arrivals to Africa as a whole are sensitive to income of origin countries (LAGDP) and the sign is positive, as expected, and greater than 1. This indicates that tourism to Africa is a luxury good and that an increase in income in the developed world will lead to a relatively larger increase in tourism to Africa. This finding is contradicting to results by Khadaroo and Seetanah (2008) stating that Africa is considered a non-luxury destination.
Telecommunication infrastructure (LINTERNET) is also positively and moderately significantly (at a 10% level of significance in some regressions) related to tourist arrivals in Africa. Again, this is expected and indicates that African countries that are more easily accessible and marketable receive more tourists.
Concerning the dummy variables, it is noteworthy that the geography dummy, landlockedness (LLC), is not significant, but both the NADUM and South African border (SABORDER) are significant. As expected, the NADUM is positive, indicating that countries closer to Europe (one of the main markets for tourists to Africa) attract more tourists. Contrary to previous research by Naudé and Saayman (2005), the SABORDER dummy is found to be negative. This implies that closeness to the largest southern African tourism destination is not creating positive spillover effects to neighbouring countries.
Quite disappointingly, none of the conservation (or tourism-specific) dummy variables, namely, protected marine areas (LMARINE) and protected terrestrial areas (LPROTECT), were significant. This could indicate that overall (because of Africa’s size), the ratio of protected marine and terrestrial areas is insufficient, and therefore, more conservation attempts have to be made to either increase the size of protected areas or establish additional protected areas.
While Table 2 shows the results of the static regression models, the dynamic model results are presented in Table 3. Since the panel has a relatively large cross-sectional dimension (N = 25), but a small time dimension (T = 10), bias in a dynamic fixed-effects model, which results from the correlation of the lagged dependent variable with the individual effects, is addressed using two alternative estimators, namely, the AB estimator and the bias-corrected LSDV estimator.
Dynamic panel data regression results (GMM and bias-corrected LSDV).
Source: Compiled by author from STATA output.
Note: GMM: generalized method of moments; LSDV: least square dummy variable; AB: Arellano–Bond; BB: Blundell–Bond; AH: Anderson–Hsiao; NADUM: Northern African dummy; SABORDER: South African border.
* Significant at the 10% level; **Significant at the 5% level; ***Significant at the 1% level.
The reason for estimating dynamic econometric models is that it allows the time path of the dependent variable to be included in the model. According to Gujarati and Porter (2009), one of the main reasons for including lagged variable in econometric models is psychological, or in other words, habit formation (inertia). The inclusion of the lagged dependent variable due to aspects such as habit formation, explains a large portion of the variance in the dependent variable and therefore often leads to insignificance of other variables (that were significant in the static model) due to a reduction in their contribution to explaining the variance in the dependent variable. For this reason, there might be differences between the static and dynamic model estimates. The dynamic model estimates shown are then also the short-run coefficients, with the long-run coefficients that can be calculated as
The system GMM was estimated because the models include dummy variables (which render them zero in the difference GMM). Furthermore, the robust one-step estimator was used, as in previous studies on tourism (e.g. see Naudé and Saayman, 2005; Seetaram, 2010). The results were obtained using the xtabond2 procedure in STATA, which allows more control over the instrument matrix. As in the static models, the dummy variables were not introduced all at once, and therefore, the first five columns, (1) to (5), report the system GMM results. The last three columns, (6) to (8), report the bias-corrected LSDV results. For the bias-corrected LSDV estimator, three underlying models were estimated, namely, the AB GMM, the Blundell–Bond (BB) GMM and the AH IV approach. The bias was corrected using the bootstrapped method with a variance–covariance of 100 (similar to that employed by Seetaram, 2010).
Concerning the dynamic regression results, it is evident that the lagged dependent variable is positive and significant in all specifications regardless of whether GMM or the bias-corrected LSDV estimator is used, thereby confirming results by Fourie and Santana-Gallego (2013). This indicates persistence in tourism flows to Africa, with the possibility of repeat visits and habit formation. This is contrary to the results found by Naudé and Saayman (2005) and Khandaroo and Seetanah (2008), which found a negative lagged dependent variable for Africa.
Relative prices become significant in the dynamic regression. The negative relationship is as expected and shows that tourists are becoming increasingly sensitive to prices in Africa, although the elasticity is still less than 1. Wakimina et al (2018) also found that the Association of Southeast Asian Nations (ASEAN)-5 countries are price sensitivity destinations, even though Khandaroo and Seetanah (2008) point out that travelling to low-income destinations (Asia and Africa) is less price-sensitive. This is also evident when the price elasticities in this study are compared to that of the United States (Yazdi and Khanalizadeh, 2017).
In the bias-corrected LSDV estimator, income is also positive and significant, similar to the results found in the static regression. The coefficient is larger than unity, confirming that tourism to Africa is viewed as a luxury product. Again, the income variable was also supported by Wakimina et al (2018) in similar research, however, contradicted by Yazdi and Khanalizadeh (2017) for research based on the United States. The insignificance of GDP in the Arrelano–Bond estimations may be due to the inclusion of the lagged dependent variable, as explained above.
While safety and health concerns for Africa as a whole seem to play a lesser role in influencing tourist arrivals, telecommunication infrastructure (as measured by Internet users) is a quite robust determinant of tourism. In almost all specifications, LINTERNET is positive and significant confirming results by Ghalia (2016) stating that Internet is essential to enhance the tourism industry.
The development proxies, that is, the urbanization and death rates, are not significant. However, geographical closeness to Europe (as measured by the NADUM) is clearly an advantage, confirming the results of the static regressions. In the dynamic GMM models, the tourism-specific dummy variables marine and terrestrial protected areas are positive and significant, indicating that African countries that increase their conservation efforts gain more from tourism.
In terms of the diagnostics, it is evident that the null hypothesis of the first-order autocorrelation cannot be rejected, although this is expected in GMM models (Roodman, 2009). The test for the second-order autocorrelation indicates that at a 5% level of significance, the null hypothesis of no autocorrelation cannot be rejected in all instances. The Sargan test of over-identifying restrictions indicates that the null hypothesis can be rejected in all instances. However, the Hansen test indicates that the null hypothesis cannot be rejected and that the instruments are valid. The difference-in-Hansen test results confirm the exogeneity of instruments. According to Baum et al. (2003), the Hansen J-statistic is consistent in the presence of intra-cluster correlation, which causes the standard over-identifying restriction test to over-reject the null.
Static results for African regions
Since the diagnostic tests rejected the random effects estimator for the various African regions, the pooled OLS results are shown together with the robust LSDV estimator in order to control for country-specific effects as well as differences between countries due to conservation efforts (i.e. LMARINE and LPROTECT variables). These static estimates are first discussed and summarized in Tables 4 and 5, before the dynamic estimator results follow.
Static panel data regression results.
Source: Compiled by author from STATA output.
Note: OLS: ordinary least square; LSDV: least square dummy variable. Standard errors are given in parentheses.
*p < 0.1; **p < 0.05; ***p < 0.01.
Static panel data regression results.
Source: Compiled by author from STATA output.
Note: OLS: ordinary least square; LSDV: least square dummy variable; SABORDER: South African border. Standard errors are given in parentheses.
*p < 0.1; **p < 0.05; ***p < 0.01.
The static regression results for Northern Africa (Table 4, columns (1) to (4)) indicate that the region is sensitive to income changes of origin countries, and the relatively large positive coefficient confirms that this is a luxury destination. Contrary to expectations, Northern Africa is not a price-sensitive destination, with the sign of the relative price variable indicating that an increase in prices in this region does not cause a decline in tourism. This is an indication that the region is still viewed as a value-for-money destination. Like the results for Africa as a whole, telecommunication infrastructure (proxied by LINTERNET) is positive and significant. It is also interesting that the development proxies are all significant and have the correct sign, indicating that when tourists choose between Northern African destinations, more developed countries are preferred. Finally, the two conservation proxies are not significant, which shows that tourists to Northern Africa might be less concerned about conservation and natural beauty.
Eastern Africa is the largest region in the analysis and the panel comprises seven countries: Burundi, Ethiopia, Kenya, Mauritius, Seychelles, Tanzania and Uganda. The static regression results in Table 4, columns (5) to (9), indicate that most of the variables are not significant predictors of tourism arrivals to the region. This indicates that tourism to this region is less concerned with price, health and safety concerns and telecommunication infrastructure. Two exceptions are the urbanization and death rates. The urbanization rate shows a fairly large and negative coefficient, indicating that tourists travelling here may be especially interested in rural tourism. The death rate is also significant and negative, which might indicate that tourists choose between countries in this region on the basis of levels of development. Interestingly, and contrary to the significant urbanization variable, the protected terrestrial and marine conservation measure is not significant, indicating that the natural environment might not be the main drawcard for tourists to this region.
The Central African panel consists of only three countries: Angola, Central African Republic and Zambia. The results are indicated in Table 4, columns (10) to (13). The regression results for Central Africa (column (10)) indicate that tourism depends positively on telecommunication infrastructure (LINTERNET), with countries that are more advanced benefiting more from tourism. This result is, however, only significant for the pooled regression. Both the development variables are significant in the fixed effects models, with an increase in the death rate associated with decreased arrivals, as expected. However, the urbanization rate also has a negative sign, indicating that more urbanized economies receive less tourists. The health risk (LTB) also show results contrary to expectations, which may indicate that health concerns are not hampering tourism in the region. The tourism and conservation proxies (LMARINE and LPROTECT) are both negative and significant, indicating that more protected areas are not necessarily associated with increased tourism flows in this region.
Western Africa consists of five countries: Benin, Gambia, Niger, Nigeria and Sierra Leone. The static results (Table 5, columns (1) to (5)) show that income (LAGDP) from origin countries is positive and significant, indicating that increased income in developed countries will lead to increased tourism arrivals in this region. The prevalence of the tuberculosis (LTB) variable indicates a significant negative coefficient, implying that arrivals to this region are deterred by concerns over health issues, such as tuberculosis. Furthermore, there is again a positive and significant relationship between telecommunication infrastructure (as measured through LINTERNET). The protected area dummy (LPROTECT) has a negative coefficient, indicating that arrivals again do not bear a positive relation to conservation efforts in the region.
Southern Africa, the second-largest region in the analysis, consists of six countries: Lesotho, Madagascar, Malawi, Mozambique, Namibia and South Africa. The static results (Table 5, columns (6) to (11)) show, by the significant positive sign of the coefficient, that the Southern African region is considered a luxury destination, similar to Northern Africa. The LSDV estimator shows that the prevalence of tuberculosis and the issue of safety and the rule of law in this region are not deterrents to tourism arrivals, although the pooled estimator does not confirm these results. This is surprising as Southern Africa has one of the highest concentrations of Tuberculosis (TB) in Africa. As in most other regions, the telecommunication (LINTERNET) variable is positive and significant. In the pooled estimation, the positive urbanization rate indicates that tourism is positively associated with higher levels of development. This is echoed by the negative and significant coefficient for death when the LSDV estimator is used.
The geographic dummies offer an interesting insight into the region. The significant and negative SABORDER proxy indicates that arrivals will not necessarily increase to the countries directly adjacent to South Africa. The negative and significant geographic proxy (LLC) indicates that tourism to this region is dependent on access to coastlines, although the protected marine areas (LMARINE) coefficient shows that this does not necessarily imply that conservation efforts will lead to increased tourism in this region.
Dynamic results per region
The dynamic regression results for the various African regions are indicated in Table 6. The dynamic models were estimated using the bias-corrected LSDV estimator and the results are summarized in the table for all the regions. The estimator used was developed by Kiviet (1995) to correct for bias in small samples, and Bun and Kiviet (2001) showed that this estimator is superior to any other estimator when the panel has both small N and T dimensions.
Dynamic panel data regression results (bias-corrected LSDV estimator).
Source: Compiled by author from STATA output.
Note: LSDV: least square dummy variable; AB: Arellano–Bond; BB: Blundell–Bond; AH: Anderson–Hsiao.
* Significance at the 10% level; **Significance at the 5% level; ***Significant at the 1% level.
In the dynamic model, a lagged dependent variable is included, and it is evident that, as in the total African panel, there is habit formation and persistence in tourism to Northern Africa (Table 6, columns (1) to (3)) which confirms results by Fourie and Santana-Gallego (2013). The dynamic specification also confirms that Northern Africa is an income-sensitive destination and that changes in origin country incomes would have a large effect on tourism to this region. The influence of telecommunication infrastructure is also confirmed in the dynamic regression. However, the estimates do not confirm the idea that prices in Northern Africa can increase and still have a positive influence on tourism. Instead, it shows no effect of price changes on tourism. Furthermore, once the effect of repeat visitation (or persistence) is controlled for, the differences in the levels of development between Northern African countries become insignificant for tourism.
The dynamic regression results for Eastern Africa are indicated in Table 6, columns (4) to (6). As in the case of Africa as a whole and Northern Africa, the habit formation and persistence of tourism to Eastern Africa are evident through the significance of the lagged dependent variable. The region is, however, not income sensitive; this could be due to the large variety of expensive lodges and resorts in the region, indicating that very affluent people travel to this region. An attraction, which was not measured, could be cultural phenomena such as the Masai people of southern Kenya and northern Tanzania.
The dynamic regression results for Southern Africa are indicated in Table 6, columns (7) to (9). The lagged dependent variable again shows habit formation and persistence of tourism to the region. Southern Africa is not income or price sensitive. The positive effect of telecommunication infrastructure (LINTERNET) is confirmed in the dynamic regression. It is confirmed that higher levels of development (as measured by lower death rates) are associated with an increase in arrivals to Southern Africa. Like the static LSDV estimator, the dynamic regression shows that the prevalence of tuberculosis (LTB) does not have a negative influence on tourism arrivals.
The dynamic regression results for Western Africa are indicated in Table 6, columns (10) to (13). The results indicate that the dependent variable (LARRIVE) is positive and significant, but only when the BB estimator is the underlying technique, indicating limited evidence of habit formation and continuous travel to Western Africa. The income proxy (LAGDP) is the only other significant variable, which confirms that tourism to this region is dependent on income growth in developed countries.
The dynamic regression results (Table 6, columns (13) to (15)) confirm that the prevalence of tuberculosis does not deter arrivals to Central Africa. As in the static results, urbanization is negative and significant, implying that countries with lower levels of urban development attract more tourists to this region. However, tourism is adversely affected by increase in overall death rates.
Discussion
The first finding that follows from this research is that five determinants that influence travel to Africa as a whole were identified, namely, income, price, infrastructure, geographical and conservation factors. This research confirms that travel to Africa remains a luxury good, with an income elasticity greater than unity. A previous study on the determinants of tourism to Africa by Naudé and Saayman (2005) found price to be an insignificant predictor of arrivals. However, in this updated study with more African countries and a more recent time period, it has been found that tourism to Africa is becoming more price-sensitive. As in Naudé and Saayman’s (2005) findings, infrastructure remains a significant determinant of tourism to Africa. In terms of the significance of the different regions in Africa, the current research shows that the Northern African region and not the Southern African region, as found by Naudé and Saayman (2005), creates positive spillovers for tourism. In addition, conservation efforts have a positive effect on tourism to the continent.
Since this research shows that arrivals to Africa are becoming price sensitive, African countries have the opportunity to increase tourism by focusing on two distinct strategies. These strategies should be aimed at attracting: (1) tourists who are less price-sensitive and are willing to pay for exceptional five-star service at luxury lodges, and (2) attract tourists who are price sensitive by offering services at reduced rates, for example, backpacking or rustic resorts. A number of African countries follow mainly the first strategy and it can be worthwhile to develop the alternative lower cost strategies.
Telecommunication infrastructure (as proxied by Internet connectivity) remains a key determinant of African tourism. The results suggest quite robustly that the rapid expansion of communication infrastructure would benefit tourism in Africa. In future, it could be even more beneficial if communication operators in countries within the same region were connected to offer seamless integration to people travelling to multiple countries within the same region. This will also contribute to regional economic cooperation and integration.
Furthermore, it is confirmed that proximity to Europe creates benefits for tourism in Northern African countries. Since distance is reduced, travel costs are decreased, making Northern African countries ideal destinations for price-sensitive tourists. Moreover, Northern African countries have the ability to offer mass tourism products such as sun, sea and sand (beach) tourism. More affordable resort-style infrastructure could create even more benefits for the region.
Africa is home to many rare and indigenous species, which represents a key draw card for tourism. To ensure sustainable tourism, African countries are in an advantaged position where active conservation efforts may lead to increased tourism. However, it is imperative that developments and investments be made in a sustainable manner that takes into account both biodiversity and societal impact.
The second key finding is that the five regions in Africa are clearly very different and the variables that influence tourism to these regions vary significantly between them. This is largely explained by the fact that each region offers unique experiences and therefore faces unique challenges to attract more arrivals. An encouraging result is that lagged arrivals are positive and significant for almost all African regions, indicating that there is persistence in tourism to all regions of the continent.
A striking result in all the regions, except Western Africa, is that the death rate is almost always significant and have a negative influence on tourism arrivals. Since the death rate is a proxy for the level of development of the country, it clearly shows that more developed economies (i.e. lower death rate) attract more tourists. Not only does low death rates indicate more development, but it also shows better healthcare and less epidemics in a country, which is also important for tourists. Effective and efficient allocation of funds for health services in Africa would therefore also benefit the tourism sectors in these economies.
Northern African arrivals are susceptible to origin-country income and telecommunication infrastructure. Since these determinants are manageable, Northern African countries should look at more inclusive holiday destinations that can suit every budget. Merging with telecommunications operators in Europe might also be beneficial, because tourists will have access when visiting.
Different levels of development between Eastern African countries (as measured by the death rate) may account for differences in tourism between countries in this booming tourist region. The countries included in the region are quite diverse, ranging from islands to recently war-torn countries. This diversity might explain why the average coefficients estimated by panel models are not significant.
Infrastructure is a robust determinant of tourism to Southern African countries, which implies that the region can benefit from improving telecommunication infrastructure. Regional investment in the Southern African Development Community must therefore be promoted. Tourists to Southern Africa are informed and not deterred by the health, development and safety concerns that plague the region. Improvements in the region in terms of development, rule of law and safety can, however, be more beneficial to tourism.
As in the Northern African region, tourism to Western African countries is income elastic, indicating that this region is considered a luxury destination. With the exclusion of Niger, all the Western African countries have access to beaches and could offer cost-effective resort-style tourism. These countries are also much closer to Europe, resulting in reduced travel costs. However, the region still has a long way to go in establishing itself as a tourism destination.
As in Southern Africa, tourists to Central African countries seem to be informed travellers who visit the region despite its low levels of development and health concerns. In fact, tourists to Central Africa may prefer more rustic and natural settings. Greater government investment in public health and social welfare may, however, improve conditions in this region, where there is not yet persistence in tourism arrivals.
Conclusion
The goal of this research was to investigate the determinants that influence foreign tourism arrivals to the African continent, firstly as a collective and secondly in different regions, with the aim to foster a greater understanding of how African countries and regions can grow their tourism economies. The second aspect makes this research unique, since previous research has focused either on the entire African continent or on individual African countries. This research finds that inbound tourism to Africa is dependent on the income of source markets in developed countries, the price at the destination in Africa, challenges faced due to telecommunication infrastructure and geographic factors, as well as conservation efforts of African countries. Additionally, the regional analysis indicates that each region will react differently to the set of determinants that influence tourism arrivals to Africa as a whole.
The contribution that the research makes is firstly in identifying the determinants of tourism to Africa, using the most complete panel of African countries to date, and secondly in investigating and comparing tourism to different regions in Africa, which have not been done to date. Based on the results of the analysis, several recommendations have also been made to increase tourism arrivals to the African continent.
This research shows that although panel data are useful in analysing continents where there are limited data availability, one should also take note that the coefficients estimated represent the average effect and may not apply equally to all countries or regions. A closer analysis reveals that the African regions are diverse and, although some results are generalizable, each region and country should invest in assessing the opportunities (and threats) to benefiting more extensively from tourism. A simple example is conservation: in the complete African panel results, the conservation proxies are positive and significant, indicating that increased conservation efforts are associated with more tourist arrivals. However, conservation is not considered to influence the choice between destinations within a specific region. Therefore, African countries should not solely focus on conservation efforts to attract tourists, but rather expand current offerings to include various products and services, while still promoting conservation and increasing investment in conservation efforts. Another example is price competitiveness: while Africa as a whole is considered to have become more price sensitive, a closer inspection reveals that this result may be attributed to price increase in Northern African countries and not necessarily in all African countries.
Variables that consistently influence the total African panel, as well as most regions, include telecommunication infrastructure and, to a lesser extent, income in the source markets. The development of infrastructure, and specifically telecommunication infrastructure, could lead to increased tourism to the entire continent. Since most African destinations are considered luxury offerings, more could be done to market the continent as a value-for-money destination that offers more-than-luxury safaris. The research also highlights the influence that economic and social development in African countries has on tourism.
The limitations of the research are mostly attributed to the availability and quality of the data. Although data availability has certainly increased over the past decade, a lack of detailed records of tourism and economic data for African countries still hampers research. Future research could segment the continent into the different countries and employ other data analysis approaches.
Supplemental material
Supplemental Material, Table_A1 - Determinants influencing inbound arrivals to Africa
Supplemental Material, Table_A1 for Determinants influencing inbound arrivals to Africa by Armand Viljoen, Andrea Saayman and Melville Saayman in Tourism Economics
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
