Abstract
We analysed the determinants of the length of stay for tourists arriving in a Mediterranean coastal destination by means of high-speed rail (HSR) service. This study is based on data obtained from a survey completed by HSR passengers returning from holiday in Costa Daurada (Catalonia). The empirical analysis is based on estimations made using a survival model. The influence of the availability of HSR service on tourists’ destination choices together with the tourists’ profiles, party structure or accommodation characteristics was used as explanatory variables. Results revealed that the existence of HSR services played a minor role in tourists’ decision of whether to visit the Costa Daurada. Also, evidence suggests that the existence of the HSR station would only affect the length of stay of those tourists who stay overnight in second residences.
Introduction
The extension of a high-speed rail (HSR) network has introduced new facilities for medium-distance trips. It has shortened travel times and made trips more comfortable between the regions connected by this network. Within this context, there is growing literature analysing the contribution of HSR to tourism development (mostly in China, Spain, France and Italy).
The analysis of the effects of travel time reduction and the improvement in accessibility to tourists is a pre-eminent research field within theoretical and ex ante studies (Campa et al., 2016; Masson and Petiot, 2009). In the case of ex post evaluation studies, the main topics are its influence in tourists’ destination choice (Delaplace et al., 2014; Gutiérrez et al., 2018; Pagliara et al., 2015) and in the arrival of new tourists (Albalate and Fageda, 2016; Pagliara et al., 2017). These ex post studies based on empirical data have obtained results that tend to differ. Some studies have demonstrated clear positive effects of HSR on tourism development (Chen and Haynes, 2015; Pagliara et al., 2017), while others noted no single direct contribution of HSR to tourist outcomes (Albalate and Fageda, 2016). This shows that it is a complex field of study and that a wide range of factors related to the territorial context and destination characteristics need to be taken into account (Gutiérrez and Ortuño, 2017).
Despite the increasing number of analyses of tourists’ length of stay and its worldwide emergence as a key demand variable, to the best of our knowledge, none of these studies have focused on HSR passengers. This is a relevant issue because it is a common concern that rapid transport modes, such as HSR, increase short-term visits. Moreover, the availability of HSR services could facilitate opening new markets and diversifying the profiles of tourists. Thus, the effects of the availability of HSR services on destination choice is a relevant research line. This scope has been addressed previously by Delaplace et al. (2014) in the cases of Paris and Rome and by Pagliara et al. (2015) in the case of Madrid. The study by Delaplace et al. (2014) concluded that 28% and 49% of tourists who arrived in Rome and Paris, respectively, by means of HSR were positively influenced in their choice by the possibility of reaching their destination by HSR. In contrast, the study by Pagliara et al. (2015) based on tourist arrival by any transport mode (12.8% by HSR) highlighted that only 0.8% of tourists arriving in Madrid indicated that the availability of HSR services was the primary reason for their choice of destination.
Taking this into account, the relationship between destination choice and length of stay among tourists travelling by means of HSR has been identified as a research challenge. Consequently, the research objective of this article in identifying the determinants of length of stay at destination for tourists arriving by means of HSR is to analyse the role of destination choice as a determinant of the length of stay. In particular, whether tourists who are more strongly influenced by the availability of HSR in their choice of destination tend to have different lengths of stay than other tourists are analysed.
The study is based on two initial research hypotheses. The first hypothesis was that, in addition to the tourist profiles, party structure and accommodation characteristics, the influence of the availability of HSR services on tourists’ destination choices is a key variable that determines their length of stay. The second research hypothesis was that those tourists for whom the existence of HSR services played a significant role in their decision to visit are also those who tend to have a shorter stay. This assumption is in line with previous studies demonstrating that travel time is a key determinant in shorter visits.
The study is based on a survey of tourists visiting Costa Daurada (a coastal destination in Southern Catalonia) who arrived at the Camp de Tarragona HSR Station. The influence of the availability of HSR service on tourists’ destination choices was addressed by means of a specific indicator put forward by Young et al. (2010) which was used as explanatory variable, together with tourists’ profiles, party structure or accommodation characteristics for the empirical analysis of the determinants of length of stay among tourists arriving by HSR. The estimations were made using a survival model.
The rest of this article is organized as follows. Following the introduction, the second section reviews existing literature pertaining to three main topics: (1) HSR and tourism, (2) length of stay at a given destination and (3) the approaches used for modelling the determinants of tourists’ lengths of stay. The third section introduces the area of study. The fourth section explains how the data have been obtained and how the causal indicator (CI; the influence of HSR services on destination choice) has been developed. The fifth section presents the descriptive statistics. In the sixth section, the econometric model specification is introduced and justified. The results of the model are shown and discussed in the seventh section. Finally, we present our key conclusions in the eighth section.
Background
HSR and tourism
HSR services improve accessibility and, in some cases, shorten the travel time between the origin and destination of a trip (Masson and Petiot, 2009; Wang et al., 2018). Moreover, several studies have highlighted the trend towards a diversification of HSR passenger profiles and trip purposes that correlates with the increasing use of HSR services for tourism and leisure (Bazin et al., 2013; Delaplace et al., 2014; Santos et al., 2006). Therefore, HSR services are creating new opportunities for tourism, especially national tourism (Campa et al., 2016; Sun and Lin, 2018). This has been shown to be the case for urban destinations by Delaplace et al. (2014) and Pagliara et al. (2015) (for Paris and Rome, along with Madrid, respectively). Furthermore, Gutiérrez and Ortuño (2017) have demonstrated this for coastal destinations in Spain, as have Delaplace et al. (2016) for the Disneyland Paris and Futuroscope theme parks. At the same time, HSR services also extend tourist markets (Wang et al., 2014).
For some destinations, the primary impact of the availability of HSR services is a change in the mode of transport (Dobruszkes et al., 2014; Saladié et al., 2016), and a growing number of studies have investigated the competition between HSR and air transportation, underscoring the substitution effects of the expansion of HSR networks (Givoni and Dobruszkes, 2013). Nevertheless, the arrival of new tourists to a destination could be less than expected by stakeholders and policymakers (Bellet et al., 2012). As a consequence, the effects of HSR services on tourism development at a local level appear controversial in recent studies. Thus, new studies are required to enrich the debate.
Length of stay
There is a wide range of factors influencing tourists’ destination choices, which are mainly linked to the individual tourist (Um and Crompton, 1991) and to the destinations (Crouch and Ritchie, 2005). Transportation, which is the way that allows the tourists move from their place of residence to the chosen destination, plays a key role in the positioning and competitiveness of tourist destinations (Lumsdon and Page, 2004). Consequently, this becomes one of the more important determinants of a tourist’s destination choice (Decrop and Snelders, 2004). Moreover, improvements in transportation are one of the key reasons that contribute to the fact that people are taking more trips each year. This ultimately can result in a decline in the length of stay. For example, see Alegre and Pou (2006) for a study of the Balearic Islands; Gokovali et al. (2007) in Turkey; Gomes de Menezes et al. (2008) in the Azores; Martinez-Garcia and Raya (2008) on tourists arriving at Girona airport (northeastern Spain); Barros and Machado (2010) on golf tourism in the Algarve (Portugal); Salmasi et al. (2012) in Italy and Ferrer-Rosell et al. (2014) on international tourists arriving by air in Spain. This is also one of the reasons why the tourist industry has begun to show interest in analysing the determinants of length of stay.
Length of stay is an interesting research topic for several reasons. First, it influences overall spending, a sense of affiliation and satisfaction with a given destination (Davies and Mangan, 1992). It is also a key factor in research of sustainable tourism in general because tourism activities stress local resources (Saarinen, 2006). Moreover, it is fundamental to tourists’ decision-making process (Decrop and Snelders, 2004), which may be strongly affected by socio-demographic factors (Alegre and Pou, 2006). Economic impact studies also show that the length of stay strongly correlates with expenditures at a given destination (Alegre and Pou, 2006; García-Sánchez et al., 2013; Thrane, 2015).
From a destination management perspective, length of stay is essential for effective planning and management (Martinez-Garcia and Raya, 2008). How tourists decide on the duration of their stay and which factors determine this are key questions for public and private practitioners. Since the pioneering studies of Archer and Shea (1975), research has shown that longer stay tourists visit a wider range of attractions, explore more peripheral regions and generate more diverse economic, social and environmental impacts. Analysing the determinants of length of stay in different contexts and for different markets and types of tourist destinations has become an issue of the utmost theoretical and practical importance. Ultimately, it is critical for designing marketing policies and achieving higher occupancy rates and revenue streams (Alegre and Pou, 2006).
Modelling the length of stay
Several studies have addressed the determinants of tourists’ length of stay. Researchers have approached this issue using a variety of estimation strategies that range from the simplest ordinary least squares (OLS) to the more complex and frequently used duration models. Even though OLS is the most restrictive estimation procedure, it is advocated by Thrane (2012) as the most suitable technique in this area of research, provided that the data are cross-sectional, there is no right-censoring and there are no time-invariant dependent variables. In fact, OLS was the first approach used for analysing tourists’ length of stay (Mak and Nishimura, 1979; Paul and Rimmawi, 1992), and more recently, it has been favoured due to its lack of complexity (Thrane, 2016; Mortazavi and Cialani, 2016). Another option includes discrete-choice models, which treat the dependent variable as a decision made by the tourist based on a set of independent variables. For instance, multinomial logit has been applied by Nicolau and Más (2009) and Grigolon et al. (2014), whereas ordered logit has been used by Ferrer-Rosell et al. (2014). As demonstrated by Cameron and Trivedi (1998), estimators lose efficiency as the number of alternatives grows. This problem can be mitigated by grouping alternatives (Ferrer-Rosell et al., 2014), although this would result in a loss of information. Other authors have favoured count models (Adongo et al., 2017; Alegre et al., 2011; Brida et al., 2013; Rodríguez et al., 2003). Thrane (2015) argues that, aside from their complexity, these kinds of models are not meant to deal with a time dimension. Finally, since Gokovali et al. (2007) used survival models (also known as duration models) for the first time, this sort of model has become the most popular estimation technique in this field (see Santos et al., 2015, for a list of contributions using this type of model).
Concern for how to deal with the potential bias caused by unobserved heterogeneity has also been pointed out (Barros et al., 2008). Strategies to cope with unobserved heterogeneity are diverse. Researchers have opted for continuous survival models that allow unobserved heterogeneity (Barros et al., 2008; Santos et al., 2015; Thrane, 2012), the random-logit model (Nicolau and Más, 2009), the mixed-logit model (Grigolon et al., 2014) or the utilization of latent class models (Alegre et al., 2011; Yang and Zhang, 2015).
Finally, another econometrical concern is the potential sample selection bias. This has been addressed by Barros and Machado (2010) and Santos et al. (2015). However, as Thrane (2012) argues, data gathered through surveys at tourist destinations are always selected samples because non-tourists are excluded.
Study area
The Costa Daurada (Figure 1) is a coastal tourist destination in Catalonia. The main attractions in this area include its 14 km of beach, the city of Tarragona (once the capital of the Roman province of Hispania and now a UNESCO World Heritage site) and Port Aventura (one of the largest theme parks in Europe). According to data from regular surveys conducted by the Science and Technology Park for Tourism and Leisure of Catalonia almost 5 million tourists visited Costa Daurada in 2014, 57% of whom were Spanish. Those tourists were accommodated in hotels (41%), second residences (33%), rented apartments (18%) and campsites (8%).

Study area.
The destination is well connected to Madrid and western Spain as well as with the Spanish Mediterranean regions and the French border via toll motorways. Reus airport is close to the main destinations, and Barcelona airport is roughly 100 km north-east of Salou. According to the same source, the most common modes of transport for tourists arriving at the Costa Daurada are private vehicles (59%) and planes (26%), followed by buses (7%), trains (5%) and other types of transportation (3%). This distribution varies when only Spanish tourists are considered: private vehicles (75%), trains (9%, which include both conventional and HSR services), buses (8%), planes (7%) and other modes of transportation (1%) (Gutiérrez and Miravet, 2016a, 2016b). Only 4% of Spanish tourists arrived on the Costa Daurada via HSR services in 2014. Nevertheless, this figure has increased year on year. It was 1.5% in 2010. Thus, the previsions are that this figure will continue growing in the forthcoming years.
HSR services have been available in Costa Daurada since 2007. The HSR station, called Camp de Tarragona, is an intermediate node of the Madrid–Barcelona–French border high-speed line, and it is located at a notable peripheral location. A taxi service is available, and the station is connected to the Central Costa Daurada and the cities of Tarragona and Reus by bus. According to RENFE (a Spanish rail company), the highest volume of passengers is reported in the summer. It is the eighth largest Spanish HSR station in terms of volume of passengers, but it is the largest among the peripheral stations.
Data
Data collection
Between July and August 2014, 1225 passengers were interviewed at the departure lounge of the Camp de Tarragona HSR station. For this specific study, we disregarded passengers who (a) were not on holiday in the Costa Daurada and (b) were not from Spain. Thus, our results are based on 574 Spanish passengers who were waiting to take the train home after their holiday at the Costa Daurada. These passengers were asked about their socio-demographic characteristics, including their gender, age and region of origin. They were also asked about their trip’s characteristics, including their length of stay, the number of days in advance that they had decided on their trip before departure, whether this was their first visit to the destination, whether they had used the Camp de Tarragona HSR station on previous occasions, how they booked their trip, their party structure, how they transferred to their accommodations, their type of accommodations and their final destination at the Costa Daurada. As one of the main goals of the present work focusses on assessing to what extent the capacity of the HSR station to attract tourists influences their length of stay, tourists were also asked about the availability of HSR services in relation to their decision to visit the Costa Daurada for their holidays. To this end, a causal analysis aiming to identify the probability of HSR passengers visiting the Costa Daurada due to the existence of the Camp de Tarragona HSR Station was applied. This approach has been applied previously using the same sample by Gutiérrez et al. (2018).
Causal analysis
Young et al. (2010) applied a causal chain approach in order to examine empirically the impact that cultural events exert on tourists’ probability of visiting a town or village, which can easily be extrapolated to the influence of the HSR station. More recently, this approach has also been applied in the context of HSR services by Saladié et al. (2016) and Gutiérrez et al. (2018). In addition, it has been used in cases involving other modes of transportation, such as low-cost flight routes (Anton Clavé et al., 2015). This approach was used to identify the upward trend in the probability of visiting the destination.
This can be expressed as:
where
These conditional probabilities are very difficult to observe or measure. As a result,
It is assumed that there are two (and only two) circumstances that would prevent the probability of visiting the Costa Daurada being increased by the existence of the HSR station The existence of the HSR station causes no attraction at all to the tourist to visit the Costa Daurada. The tourist would have visited the Costa Daurada anyway, regardless of the existence of the HSR station.
The occurrence of either of these two circumstances will imply that
Young et al. (2010) incorporated a feasible measurement for the probability that circumstance 1 would not occur
Adapted from their survey, the question that has allowed us the measurement for k is: Q1. On a scale from 0 to 10, to what extent do you agree with the following statement: ‘I chose the Costa Daurada as a tourist destination because of the existence of the Camp de Tarragona HSR Station’ (0 means ‘false’, and 10 means ‘absolutely true’).
Similarly, the question that provided us with Q2. On a scale from 0 to 10, to what extent do you agree with the following statement: ‘I would have chosen the Costa Daurada as a tourist destination even if there was no Camp de Tarragona HSR Station’ (0 means ‘false’, and 10 means ‘absolutely true’).
Descriptive statistics
Table 1 presents descriptive statistics. All variables (with the exceptions of the length of stay and the CI) are dichotomous. Thus, each observation can only equal 1 or 0. According to these data, the Madrid area (36%) is the main origin among tourists arriving by the HSR, followed by the Aragon region (21%). Women are predominant (making up 58% of the sample), while the most important age group is the one ranging from 31 to 45 years old (38%). Up to 64% of the sample has a tertiary education. The most preferred accommodation option is staying with family or friends (42%), whereas hotel accommodations are chosen by 25% and second residences by 19%. The destination of almost half of the sample is the central coastal area of Costa Daurada (48%). Only 12% of the tourists have organized their trips using a travel agency. Most of them have already used the railway station (57%), and this is a repeat tourist destination for them (89%). Regarding the travel party, 49% have travelled alone, 1 whereas 20% have had family trips with children and adult family trips.
Descriptive statistics.
Note: CI: causal indicator. Sample size: n = 588.
The CI of the HSR station is a continuous variable. Given the fact that it is calculated as the product of two probabilities, its values are circumscribed between 0 and 1, both inclusive. Its mean and standard deviation were 0.19 and 0.22, respectively, indicating that the tourists’ reported degree of attraction of the HSR station to the Costa Daurada was relatively limited.
The mean length of stay of tourists arriving at the HSR station is 8.2 nights with a standard deviation of 10.5 nights, which reflects the considerable fluctuation in the number of nights spent in the area. These fluctuations are reflected in the histogram presented in Figure 2. It becomes apparent that the density function is clearly left-skewed because 70% of the tourists interviewed reported stays of seven nights or less. In fact, the median number of nights is 6, while the maximum number reported is 100. Despite the large number of two-night stays, the variable reaches the absolute mode at the seventh day. The number of stays then suddenly drops. Notwithstanding this reduction, several modes can be identified. Local maximums reached at the 10th, 15th and 30th nights are particularly remarkable. Beyond the 31st night, spells tend to be isolated.

Density function of length of stay.
Method
Time is a continuous phenomenon. By contrast, the length of stay is measured as the number of nights and has a discrete nature. Following Chalita et al. (2002), the number of nights spent at the tourist destination can be regarded as a continuous variable if the number of ties among event times is small, or in other words, if the number of failures (tourists leaving) coinciding in each of the intervals is small enough. They suggested a ratio to discern whether the number of ties is low enough to consider the variable of interest as continuous:
where
From this perspective, OLS and traditional specifications of duration models where the variable of interest is continuous are discarded because they do not meet the discrete nature of our dependent variable. Discrete-choice models are also dismissed due to the fact that their efficiency is compromised as the number of distinct spells increases. Finally, the decision between count data models and continuous duration models is not straightforward. The first issue to consider is the fact that count data models are meant to be applied to ‘how-many-times-something-happened’ phenomena instead of time phenomena (Thrane, 2015), whereas survival models were specifically designed to deal with time phenomena that occur at a maximum of once to each of the individuals, as failure implies that the individual is no longer observed in the data set. From a conceptual perspective, the examination of the determinants of tourists’ length of stay is much closer to the latter definition than the former. From a more technical point of view, Thrane (2015) also argues that count data distributions are intended to suit data whose minimum values are zero. As an example, a study that analyses the number of car accidents in different areas requires a distribution whose minimum value is zero. Given that the minimum number of nights spent at a tourism destination will never equal zero, survival models provide a more accurate fit than do count data models. This relative advantage of duration models is also signalled by Santos et al. (2015). Taking these elements into consideration, discrete duration models align better with both the characteristics of our data and the object of our research in comparison with count data models. These advantages do not imply that duration models are free of drawbacks, for instance, their complexity (Thrane, 2012). To the best of our knowledge, none of the previous contributions to the analysis of the determinants of length of stay have adopted a discrete duration model.
Duration models were developed to consider time as continuous. A discrete perspective is adopted by assuming that each night constitutes an interval j, which comprises time within
where
Thus, the probability that the tourist ends the stay at interval j is represented as follows:
Following Prentice and Gloeckler (1978), it is assumed that
The baseline hazard is defined non-parametrically by
The next step involves introducing unobserved heterogeneity into the expression, which is also known as frailty. At this stage, assumptions for how frailty enters the equation are required. We suggest a first specification, which assumes an individual heterogeneity term
An alternative and more flexible approach implies non-parametrical unobserved heterogeneity. Following Heckman and Singer (1984), the existence of a diversity of mass points or types is assumed. The membership of each class is unobserved, and each individual has a probability to be associated with each of the mass points.
For instance, an individual belonging to type 2, thus, would be facing the following hazard function:
According to this parametrization, mass point 2 equals
Results and discussion
Estimation results are presented in Table 2, which contains hazard rates arising from the cloglog model without frailty (1), the cloglog model with gamma-distributed frailty (2) and the cloglog model with non-parametric frailty (3). As previously set forth, the main difference between the specifications of models (2) and (3) lies in the fact that model (2) imposes a specific distribution for the unobserved heterogeneity (gamma distribution), whereas frailty enters model (3) non-parametrically. As a result, model (3) enables a higher degree of flexibility. Discrepancies between the three models in terms of the magnitude of the hazard rates are, in some particular cases, not negligible. Changes in the significance of the variables are reduced. A first issue to disclosure relates to whether the inclusion of frailty in the models is appropriate. Departing from model (2), for the one considering gamma-distributed frailty, whether the unobserved heterogeneity variance component (σ 2) is equal to zero is tested. The null hypothesis is rejected because Chibar2 (01) is equal to 14.066 with a p value of 0.0001. Regarding the model with non-parametric frailty (model 3), the constant associated with the mass point 2 and the probabilities belonging to both mass points are also significant, which also sustains the inclusion of unobserved heterogeneity within the survival models. Thus, models (2) and (3) are much more adequate than model (1) as it has been demonstrated that unobserved heterogeneity is a critical issue. In model (3), frailty is added through the consideration of two different mass points or tourist types. The probability of belonging to mass point 1 is 0.15 and 0.85 for mass point 2. Figure 3 presents the predicted survivor functions for both types of individuals at the mean of covariates. It becomes apparent that each type of tourist behaves differently. Indeed, individuals within mass type 1 are more prone to enjoying long stays, as their predicted mean stay reaches a total of 23.9 nights. Visitors within type 2, in contrast, have a predicted average stay of 5.5 nights. According to these results, substantial differences in terms of the length of stay stem from the impact of unobserved heterogeneity.
Maximum likelihood estimates for the discrete time proportional hazard models.
Note: CI: causal indicator. Standard errors are within parenthesis.
*Significant at 10%.
**Significant at 5%.
***Significant at 1%.

Predicted survivor functions for (a) m1 and (b) m2 types at mean of covariates.
With respect to the impact of the CI, the variable has been introduced in the equations linearly and considering its quadratic value as well. The introduction of the squared value in the equations allows us to assess the existence of non-linear relationships. Both models (2) and (3) concur that there is no evidence of non-linear effects as the hazard rates associated with the squared value of CI are not significant. Focusing on the non-exponential effect of CI, we depart from the results obtained by Nicolau and Más (2009) and Martinez-Garcia and Raya (2008), who concluded that the availability of days is crucial for determining the length of stay. In accordance with their findings, it could be assumed that visitors attracted by the HSR station should have a preference for shorter stay periods, and hence, it is hypothesized that the hazard rate associated with the CI should be significant and larger than 1. Evidence regarding the impact of CI in models (2) and (3) is nonetheless mixed. While the hazard rate in model (2) is not significant, the hazard rate in model (3) is significant and signals, as expected, that higher CI values are linked to a higher risk of ending the stay at each interval and hence shorter stays. According to the definition of the CI, visitors with null CI values are those who firmly intend to spend a holiday in a specific tourist destination, and at the same time, they should be those who do not have a preference for reducing the travel time to their destination by means of HSR services. In contrast, CI values larger than 0 are related to factors that push visitors towards the HSR station to some extent. In the context of HSR services, time constraints that limit visitors’ length of stay should be the most decisive element among the restrictions. From this perspective, the higher the time constraints, the higher the CI. As a result, if the results of model (3) are accepted, the existence of an HSR station that shortens travel times would emerge as an incentive that would make the destination more attractive for short trips.
However, it must be taken into consideration that the results of models (2) and (3) oppose each other. This discrepancy could be simply the result of the more flexible, and as a consequence preferable, specification of the unobserved heterogeneity allowed in model (3) compared to model (2). It could happen as well that some underlying factors could be behind the mixed evidence obtained regarding the effect of the CI. In this sense, some attention should be placed on second residences, as the decision of visiting the Costa Daurada, in the case of this segment of visitors, is undoubtedly determined by the ownership of a property. In order to test to what extent the effect exerted by the CI is influenced by visitors staying in second residences, an interaction term between both variables was created and introduced in models (2) and (3). The results regarding both variables and the interaction term are shown in Table 3.
Hazard rates estimates of the interaction term: CI × second residences.
Note: CI: causal indicator. Standard errors are within parenthesis.
*Significant at 10%.
**Significant at 5%.
***Significant at 1%.
In the model where frailty follows a gamma distribution, neither the CI nor its interaction term with second residences is significant. Conversely, the hazard rate associated with second residences continues to be highly significant, and its effect (increasing the expected length of stay) is consistent with the previously estimated model without the interaction term (model 2 of Table 2). On the other hand, the evidence obtained with the non-parametric frailty model depicts a different scenario in which the previous significance of the CI vanishes, while the interaction term is significant and its hazard rate acquires a similar magnitude to the hazard rate associated with the CI when the interaction term was not considered (model 3 of Table 2). Second residences, similarly to the model with gamma-distributed frailty, remain highly significant, and its hazard rate is slightly smaller than in the model without the interaction term. The interpretation of the results obtained when non-parametric frailty is incorporated into the specification is interesting, as those results suggest that the effect of the CI only appears for the owners of second residences. In this sense, the availability of HSR in the area would foster additional short stays, although only for that segment that travels to the Costa Daurada with a certain degree of regularity as they own an apartment there. In contrast, the length of stay of visitors who stay overnight in alternative sorts of accommodations would not be impacted by the value of the CI; however, these conclusions should be taken with caution, given the discrepancies between the models.
Back to Table 2, the inclusion of unobserved heterogeneity slopes upwards with the duration-dependence dummies, which present a growing trend. The fact that all tourists in the sample finish their stay at some stage results in an increasingly positive relationship with the hazard rate. Furthermore, hazard rates associated with the number of nights that refer to periods of time (such as a week (7 nights), fortnight (15 nights) or a whole month) are larger.
Longer travel times contribute to increasing the number of nights spent. In fact, those areas with the longest travel times (Madrid, Basque Country and the rest of Spain) are associated with longer stays. Neither gender nor education level attained exert significant impacts, whereas age extends length of stay for the oldest population group, given the fact that they face fewer time constraints. 2 This latter result is consistent with most of the research in this area (Santos et al., 2015), although some previous work suggests the opposite results (Barros et al., 2008; Yang and Zhang, 2015). The type of accommodation emerges as a decisive factor. Hotel stays reduce the number of nights, whereas the longest stays are attributable to second residences. The hazard rates associated with apartment rentals and stays with family and friends are also clearly below 1, although their impacts on the length of stay are more modest. Previous studies have obtained similar results (Alegre et al., 2011; Mak and Nishimura, 1979; Martinez-Garcia and Raya, 2008; Santos et al., 2015). The daily price of each of the accommodation options is the underlying factor that accounts for this highly significant impact of the different types of accommodations (Fleischer and Byk, 2009). According to Menezes and Moniz (2011), Grigolon et al. (2014) or Yang and Zhang (2015), party size has been analysed from a party composition perspective. All three models signal that the greatest number of nights spent at the destination corresponds to trips accompanied by adult family members, followed by trips undertaken with friends. Thus, this result gives support to the vision that it is not only a matter of the party size but also its composition. Finally, no significant impact is found to be associated with the location of the tourist destination, or whether the tourists have visited the destination previously. This latter result can be shocking because it opposes the conclusions reached by most of the research in the field. However, it must be taken into consideration there might be variables related to the accommodation options that could capture the effect of a repeating destination, mainly second residences. Some past studies have neither observed a significant influence of repeating a visit to a destination (Yang and Zhang, 2015, for one of their latent classes) nor even found a negative impact on the length of stay (Thrane, 2012; and Santos et al., 2015, for subsequent visits after a second trip to Brazil).
Conclusions
This research article develops an analysis of the determinants of the length of the stay at a given destination for specific tourists arriving by means of an HSR. The survival model has used the usual explanatory variables associated with tourist profiles and trip and stay characteristics. Additionally, it has also used the uneven incidence of the availability of HSR services on destination choice as an explanatory variable. This incidence has been calculated using a CI. This novel approach has allowed us to obtain new data on the relationship between the capacity of HSR services to attract tourists and the length of the stay of those tourists.
The study findings unveil, in the first place, that the capacity to attract tourists associated with the availability of HSR services in the area is relatively limited. In the second place, we have obtained weak evidence of an impact of this capacity of attraction on the tourists’ length of stay as the imposition of a more rigid specification for unobserved heterogeneity results in a non-significant effect of the incidence of HSR services at all. On the contrary, when unobserved heterogeneity enters the estimation equation in a non-parametric form, which allows a higher degree of flexibility, HSR tends to reduce the number of nights spent by tourists at the destination. According to these results, tourists who reported a higher incidence of the HSR station on their decision to visit the Costa Daurada tended to undertake shorter stays. However, it has also been shown that this effect is restricted to those tourists who are lodged in second residences. This profile, which comprises recurrent visitors due to the ownership of a property within the area, would benefit from the travel time reduction yielded by the HSR to increase their number of visits; additional visits would be nonetheless shorter. In the third place, the results have also highlighted a large and significant impact of unobserved heterogeneity on the number of nights spent.
This study also highlights other characteristics of the profiles of tourists who arrive by HSR and their stay at the destination, which significantly affects their length of stay. Older tourists, those arriving from farther away, those travelling in groups and, especially, those hosted in second residences, are the ones who tend to spend longer time in the destination. On the other hand, education level or gender has no significant effects on the length of stay of tourists arriving by means of the HSR.
The findings of this study provide evidences of interest for tourism-destination management, marketing, strategic planning, and for the planning of transportation facilities for destinations with HSR services (or those who plan to be served by HSR). They are also of interest for railway transportation study and management. Although less than 5% of all tourists vacationing at the Costa Daurada arrived by means of HSR in 2014, this is enough for the Camp de Tarragona Station to achieve its highest level of passengers in during the summer months, while the opposite occurs in the average Spanish HSR network. Nevertheless, the study aligns with those developed in Spain (that coincide in highlighting the moderate effects of HSR services on attracting new tourists to destinations). This contrasts with the overestimated expectations linked to this infrastructure (Bellet et al., 2012). Moreover, as Saladié et al. (2016) noted, the availability of HSR services contributes to the diversification of the modes of transport used to reach a destination, which is also a key issue for tourism destinations and for transport planning and management.
The study also opens a door to future research. There is common agreement that the improvement of transportation systems has contributed to increasing the number of visits to some destinations while reducing the length of stays (Alegre and Pou, 2006; Barros and Machado, 2010; Lumsdon and Page, 2004). The availably of HSR services could be aligned to this context particularly when analysing repeat tourism, which has been identified as specific subgroup of interest. Considering this, a study based on repeat tourists as a target group could provide new evidence about whether a reduction in the length of the stay is associated with an increasing of number of visits to this or other destinations.
Finally, as some authors noted, the profiles of users of HSR services experience an evolution after the first years of entry into service of the high-speed line. However, as Gutiérrez and Ortuño (2017) highlighted, there is no specific profile of traveller using HSR for tourism, not even in the case of a coastal destination. The most expected trend is the diversification of user profiles and the growing numbers of travellers due to tourism and leisure (Bazin et al., 2013; Santos et al., 2006). Taking these factors into consideration, a new study in the area in the coming years would offer evidence on the evolution of the effects of HSR stations with regard to the visitors attracted, their profiles and their lengths of stay at their destinations.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Spanish Ministry of Economy and Competitiveness (POLITUR – CSO2017-82156-R).
