Abstract
This paper presents adjustments and validation of a model for the process of passenger boarding and alighting of BRT (Bus Rapid Transport) and express bus systems. Such models are of fundamental importance for the implementation of strategies to control the operation of those systems, which allow for quality and efficiency gains in the service. First, a generalized disjunctive programming model is presented in order to adapt a bus headway control formulation according to the number of onboard passengers. Later, as a way to improve the model, real data were collected from the boarding/alighting times of passengers in the Trunk line 10 system in the city of Blumenau, Brazil. These data were used in a two-stage fuzzy regression to take into account the uncertainties associated with the dwell time of the passengers. The actual data collected were used in simulations of the enhanced bus headway control system, showing that even in the presence of disturbances, the system can still operate properly without bunching.
Introduction
Mass public transport, and more generally urban mobility, has been a topic of constant discussion and mobilization of society, which also draws attention in Brazil. Among the main topics discussed, besides the costs of transport tariffs, we can cite the quality and efficiency of the services. A negative factor observed in a large number of bus-based public transportation systems is the deviation of the schedule and/or bus headway (tendency of bunching). These deviations increase the waiting time of users and the costs of operating the transportation system [1–3].
A solution to the problem of real time operating of bus public transport systems is the implementation of control strategies for their operation. Given the need for differentiated quality of express bus and BRT (Bus Rapid Transport) systems, the application of real-time operation strategies in these systems is of paramount importance, highlighting the bus holding strategies at stops [4–7].
These real-time strategies make use of prediction models for boarding and alighting times of passengers at stops and/or terminals and stations. However, those commonly used models can be overly simplistic, implying a representation that does not match the actual observed behavior or is too complex for real-time applications. In a developed country scenario, with buses that do not operate crowded and with adequate road and station infrastructure, these simplifications typically used may not be significant. In the case of underdeveloped countries, in the everyday reality of many Brazilian cities, the real behavior of the boarding and alighting process is more significant and necessary.
The boarding and alighting process of passengers or “dwell time” is defined as the time spent by a bus at stops [8]. This time typically represents a considerable portion of the total travel time, which justifies the importance of a suitable model for representing this process. For example, [9] found that in American cities from 1957 to 1980 the average dwell time represented about 20% of total travel time within urban areas and increased to 26% in central areas. From data collected in Sydney, considering onboard billing, [10] concluded that the average “dwell time” is about 15% of total travel time.
As a way to improve the representation of the boarding/alighting process, this paper uses as a base the bus headway control strategy model presented by [5], which contemplates characteristics typically present in Brazil bus public transport systems. However, the model presented by [5] requires a more detailed study of the most appropriate value of some parameters, which reflect the behavior of the passengers during the boarding/alighting processes.
To improve the model, our approach uses a series of parameters that describe the behavior of the passengers, both in the boarding and alighting process, as well as parameters associated with the BRT system, like the number of passengers already boarded the bus. Thus, suitable values for these parameters are of fundamental importance for the model to adequately represent the real behavior of the system.
Also, in order to calibrate the model values, this paper makes use of real data collected from the passengers boarding/alighting process of the express bus Trunk line 10 located in the city of Blumenau, Brazil. These data were used in a two-stage fuzzy regression to develop a model that best represents the times associated with “dwell time”, taking into account the number of onboard passengers.
Everything considered, this paper contributes to the state-of-the-art by proposing a novel methodology for modeling and calibrating passenger boarding/alighting time of BRT systems. This methodology, when used in tandem with methods already established in the literature, is able to represent the behavior of a public transport system more accurately. As a consequence, the methodology can improve the performance of control methods at decreasing passenger waiting time.
Other proposed dwell time models make use of nonlinear functions [11, 12], details associated with the passenger’s location preference in the bus [13], door width [14], bus floor height [15], ticketing technique [16], passenger age [17], and grouping of people [18]. Some of these particularities are present directly and indirectly in the model to be used here; others imply in a model too complex, infeasible for real-time applications. Thus, it is considered that the model used here has the potential to improve the representation that describes the dwell time of the Trunk line 10 system, with applicability in other urban public transport systems found in Brazilian cities and similars.
Therefore, this paper is divided as follows. Section 2 presents the model used in the development of the work. Section 3 presents a review of fuzzy regression. The results are presented in Section 4. Finally, Section 5 presents the conclusion of the work.
Methodology
To understand the BRT control strategy employed in this paper, this section first presents the existing control strategy already available in the literature, and later proposes a modification of its boarding and alighting process.
Existing literature model
The original BRT and express bus model used in this paper for analysis and simulation is based on the one presented in [5], which was extended to multiple BRT lines in [6] and to bidirectional segments in [19]. The baseline model [5] allows the real-time control of BRT and express bus systems, more specifically, the control of bus headway through the holding of buses at stops. The model is deterministic and presented in the form of a frequently updated mathematical program with a rolling horizon. Before presenting the mathematical programming model, some preliminary definitions are necessary. First, the sets and indices used are presented in Table 1.
Sets and indices
Sets and indices
Considering a circular itinerary, the following lap arithmetic is used: ω = (k, m); 〈i, ω-〉 represents for bus i, the stop immediately preceding stop k, which can be stop (k - 1) on the same lap m, or else stop n K of lap (m - 1) if the current stop is the first in the circuit (k = 1); 〈i-, ω〉 corresponds to the previous bus that arrived at bus stop k before bus i in lap m, which can be a bus in the same lap or in the previous one. Given those considerations, the rest of the systems variables are presented in Table 2.
Variables
The total passengers’ waiting time for the horizon considered is given by:
Constraint (2) gives initial conditions of the bus departure time from a stop, bus onboard passengers when departing from a stop by (3), and initial residue of passengers at a stop by (4). The constraint (5) represents the bus arrival time to the stop k; (6) the bus remaining capacity and (7) the number of passengers to board the bus. Departure time is given by (8), and (9) guarantees that bus overtaking is not allowed. The number of onboard passengers after bus departure is given by (10), and the residue of passengers by (11).
The original formulation presented in [5], the boarding and alighting processes are defined as follows:
Constraint (12) gives the bus passengers alighting time. The boarding time is given by the time spent to board the passengers who occupy the remaining capacity of the bus or the time spent for boarding the accumulated passenger queue at the stop. This is given by means of a minimum operator as (13). Considering terminals and stations as bus stops of traditional BRT systems, the passenger alighting process occurs first. The boarding process, with pre-billing, proceeds through the same bus rear doors used for alighting. Thus, dwell time is calculated as the sum of the boarding and alighting processes as (14), Fig. 1 illustrates the process when the number of boarding passengers is smaller than the bus remaining capacity, and Fig. 2 illustrates the process when the number of boarding passengers is bigger than the bus capacity.

Boarding and alighting processes for terminal and stations with the number of boarding passengers smaller than the bus remaining capacity. The queue boarding rate is μ k .

Boarding and alighting processes for terminal and stations with the number of boarding passengers bigger than the bus remaining capacity. The queue boarding rate is μ k .
As can be seen in (12), (13) and (14), boarding and alighting times depend on the parameters C a and C b which are typically considered constant. However, what is observed in real systems is that both parameters, besides the typical variation around the average value, vary as a function of other parameters of the BRT system such as bus capacity. More insights on how to solve this model can be seen in [5, 6].
Given the characteristics and peculiarities of urban transport systems by buses in Brazil and other countries, this paper proposes an enhanced estimation of boarding and alighting of passengers that takes into account a more realistic behavior, with characteristics compatible with real-time applications.
In the new proposed boarding/alighting model, the time spent for each passenger boarding the bus (C b ) and the time spent for each passenger alighting the bus (C a ) can vary depending on the number of onboard passengers. Consider the following set containing the limits of the categorical variable “quantity of onboard passengers”:
The decision between the categorical variables for the alighting time can be represented as a Generalized Disjunctive Program (GDP):
Notice that the alighting time depends on the passengers that boarded the bus on the last stop (l〈i,ω-〉). Big-M and convex hull reformulations are the most common methodologies for transforming a GDP into a mixed-integer linear programming (MILP) problem. The proposed GDP can be reformulated as a MILP, using Big-M notation, in the following form (
The decision between the categorical variables for the boarding time can also be represented as a GDP, but the decision variables depends on the number of passengers that didn’t alight the bus on the present stop (l〈i,ω-〉 (1 - q k )), since the alighting occurs first:
With those propositions, there is no need to depend on a single deterministic value during the control estimation. The control system can automatically adapt to the number of passengers boarded. Note that although the example presented here has only three categories (namely low, medium, and large), it becomes simple to extrapolate the model to as many categories as necessary to better represent the system.
To populate C a and C b with more realistic data, the approach of this paper uses two-stage fuzzy regression coefficients (discussed in the next section) of real collected data as deterministic values in the control model. To do so, the measurements were made considering the number of passengers to board/alight, the total “dwell time”, and the number of passengers already boarded. During the holding control holding calculation stage, the equations (17) and (19) are responsible for identifying which categorical variable to be used in the process.
Test setup
In order to test the model, a stochastic scenario is used in a simulation (transit model) in which boarding/alighting times are given randomly. Therefore, two-stage fuzzy regression data were used to collect samples from a triangular distribution over the interval. The triangular distribution is a continuous probability distribution with lower limit left, peak at mode, and upper limit right. Unlike other distributions, these parameters directly define the shape of the probability density function (pdf). The pdf for the triangular distribution is:
For the stochastic scenario simulation, the setup presented in Fig. 3 will be used. Further details of the simulation parameters can be found in [5].

Test setup for the headway control, consisting of a deterministic predictive control and a stochastic transit scenario.
Fuzzy regression was proposed in the literature as a way to consider the uncertainties of the coefficients of linear regression, mostly used when the model is indefinite, the relationships between model parameters are vague, the sample size is small, or the data is hierarchically structured [21–24]. This kind of analysis was initially proposed by [21] based on the concepts of fuzzy functions presented by [25], which aimed to take into account the natural uncertainties present in a certain phenomenon, case of the Trunk line 10 scenario analyzed here.
While for traditional linear regression the deviation between the observed and estimated values is assumed to be due to observation errors, [21] assumed that these differences occur because of the vagueness of the system structure. The fuzzy parameters of a linear system measured by the model correspond to the system’s possibility distribution, being responsible for measuring its uncertainty (fuzziness).
Possibilistic regression
By generating parameters corresponding to the possibility distribution of a system, the model presented by [21] is known in the literature as possibilistic regression and with a general form given by the equation below:

Symmetric triangular membership function.
Differently from the ordinary least squares regression, the deviation between the data and the estimated model depends on the imprecision of the parameters and not on the measurement errors. In this case the model of [21] proposes that, in order to minimize the uncertainty of the estimated model, one can minimize the total spread of the system’s fuzzy coefficients [26].
This spread may also depend on a degree of pertinence known as the “h value”, defined by the operator, which specifies the degree of feasibility of the system conditions [27]. The higher the degree of feasibility, the higher the spreading of the system, i.e., the h value expands the confidence interval of the model. Fig. 5 illustrates the change in the lower and upper limits for h of 0.5 and 0.7.

Increase of fuzzy interval with a higher h value.
Fig. 6 illustrates the behavior of the spreading of the system with the increase of the h factor, both to the right and to the left of the central part of the system. Its increase is directly proportional to the expansion of the limits, allowing the inclusion of more or fewer data between the limits of the model. While the typical values are contained within the limits, even with a reduced h factor, the outliers are included with the factor expansion.

Triangular membership function with h value interval.
From these considerations, [21] show that the possibilistic regression is reduced to a linear programming problem whose objective is to minimize the spread of the system, or its uncertainties, as presented in (26) subject to the constraints given in (27).
However, although still present in the literature due to its ease of implementation and lower computational cost when compared to improved models [26–28], the possibilistic regression using linear programming has received several criticisms, some of which are remarked by [22]: The model proposed by Tanaka resembles linear regression, but there is no evidence of how it relates to the classical concept of least squares [29]; High sensitivity to outliers; Tendency to become multicolinear as more independent variables are collected; In this matter, the literature has presented alternatives for the possibilistic regression using linear programming, such as fuzzy regression using least squares [29], rigid fuzzy regression [30], piecewise fuzzy linear regression [31, 32], fuzzy regression with linear neural networks [33] and nonlinear neural networks [34].
The authors of [21] themselves continued to improve the model employing quadratic programming, which according to the authors, increases the tendency of the coefficients to become diffuse when compared to the initial model [23]. The possibilistic regression using quadratic programming starts from the quadratic minimization of the spreads of the function, as presented in (28).
The function can be rewritten as presented in (29), with the addition of a new term, where ξ is a sufficiently small positive number added to the objective function so that (29) is strictly convex. The constraints of the function remain the same, as shown in (30).
To allow an optimization capable of differentiating the possibility of left propagation of the possibility of right propagation, one can rewrite the quadratic possibilistic regression as in (31), subject to the restrictions presented in (32).
From this modification the optimization starts to consider a non-symmetric triangular membership function, as shown in Fig. 7.

Non-symmetric fuzzy triangular membership.
Criticism of the initial model presented by [21], including the uncertainty of how it relates to the traditional model of least squares regression [29], has led to the creation of hybrid models.
One of these hybrid models seeks to maintain the central tendency of the model; for instance, [35] introduced a two-stage fuzzy regression considered important by its results and low computational cost [28]. In the two-stage fuzzy regression, the central trends of the model are defined, typically by ordinary least squares, which are then used as known variables in the optimization.
Let a
c
be a central trend vector previously defined. The two-stage fuzzy regression can determine the spreads as in (33) subject to the conditions presented in (34). Notice that in order to allow an optimization capable of differentiating between the possibility of left and right propagation, one can rewrite the quadratic possibilistic regression decomposing a
w
in a
wL
and a
wR
.
This model can be augmented to use quadratic programming in order to increase the diffusion of the coefficients. Rewriting (33) in its quadratic form it is possible to arrive at (35), very similar to the model of [22]. Since the vector a
c
is already defined in the first stage of the regression and does not need to be optimized, the function is naturally strictly convex.
With two-stage fuzzy regression it is possible to maintain the original central tendency of the model, which can be replaced by other types of estimation (ridge, minimax) and minimize spreads based on these values to include the human uncertainties in the model.
All the problems presented in this section are linear programming (LP) problems or quadratic programming (QP) problems. The first can be written in the following canonical form:
Linear problems given by (36) can be solved for the global optimum by the use of efficient algorithms such as the simplex method [36], while the global optimum of a quadratic problem (37) can be reached by employing algorithms such the interior-point method [37]. Both algorithms are available in commercial and non-commercial solvers, like Gurobi [38] and IPOPT [39].
Index of confidence (IC)
As a way of interpreting the results of a fuzzy regression and contributting to the interpretation regarding the regression interval, [40] have proposed an index of confidence (IC) that measures the variation between the independent upper and lower limits of the variable, which is similar to the goodness-of-fitness R2 in statistical regression. Thus, we consider that y
L
(k) and y
R
(k) are respectively the lower and upper bounds of a fuzzy estimate
The index of confidence (IC) is calculated as (38), and the larger the value, the greater the system’s representation by the regression performed.
The SST (Total Sum of Squares) measures the total variation of y (k) to the left spread (y
L
(k)) and to the right spread (y
R
(k)), calculated as:
The SSR (Regression Sum of Squares) measures the variation of the center estimate (y
c
(k)) to both spreads (y
L
(k) and y
R
(k)), calculated as:
Since fuzzy regression aims to find a minimal range that covers all collected samples, the Average Fuzzy Spread (AFS), which considers the absolute difference between the longest and shortest interval of the model and the h value, can be used to estimate the range size and general diffusion of the model. The smaller the AFS, the better the model. So the AFS equation is given by [41]:
However, AFS considers only the model range and may not take into account the central approach, which differs from the previous IC and the following metric, the Mean Fuzzy Credibility (MFC).
The fuzzy regression can generate models with low membership and high diffusion, resulting in a low credibility model. In order to measure the model’s credibility, the MFC assesses the relevance of the collected data to the fuzzy regression model and the general diffusion of the model. Thus, it measures the credibility of
Since h directly influences the relevance of the model members, so the model’s credibility is also affected by its value. In order to obtain maximum credibility based on the h factor value, [43] proposed a systematic approach to obtain an optimized h value, as:
According to [42], we can directly obtain the optimized h value with the following formulation:
The BRT baseline model and the two-stage fuzzy regressions formulations were expressed in Python, using the Pyomo library [44]. They were solved with the MIQP solver Gurobi 8.1.1 in a Windows workstation, with an Intel Core i7-7820HK 2.9 GHz processor and 16 GB of RAM.
Studied scenario
For comparative purposes, the studied scenario is the same presented in [5], a high-frequency segregated transit system, more specifically the transit system Trunk line 10 of the city of Blumenau, Santa Catarina, Brazil. The Trunk line 10 system consists of 2 bus terminals, 32 bus stations, and 17 buses in a closed circuit of 17.7 km. Other data of the Trunk line 10 systems and simulation parameters are: n N = 5, C0 = 3 and Cmax = 120. The scenario is shown in Fig. 8.

Trunk line 10 scenario.
During the data collection, filming was carried out at bus stops in the city of Blumenau/SC for later determination of boarding/alighting times, taking into account the number of passengers to be processed, as well as the number of passengers already onboard.
With the collected data, the two-stage hybrid fuzzy regression presented in the previous section was used on each of the categorical variables (related to the number of passengers standing on the buses), considering the collected data.
First, the regressions were performed without optimizing the h value, i.e. with the parameter set to 0. The quality indicators for this case are presented in Table 3. Subsequently, h values were optimized as suggested by [42], which improved the quality indicators, as presented in Table 4.
Fuzzy regression quality indicators without h optimization
Fuzzy regression quality indicators without h optimization
Fuzzy regression quality indicators with h optimized
To illustrate the behavior or the IC variation depending on the h value, Figure 9(a) shows IC changes for the lower limit, while Figure 9(b) shows the same change, now related to the upper index. Notice that both plots also have a a L or a R axis, which grows according to h.

IC variation relative to h value for the case of bus boarding with low occupancy and its influence to the lateral spreads.
Considering the optimized h values, the regression parameters are presented in Table 5 as [a wL ; a c ; a wR ].
To illustrate the results of the fuzzy regression, Fig. 10(a) shows the regression obtained for
Fuzzy regression coefficients

Fuzzy regression for alighting/boarding time for passengers onboard of a bus with low occupancy.
In [5] and [6] the authors used a deterministic scenario in order to test holding control strategy for BRT systems. Although testing deterministic models is important to demonstrate the features of control algorithms, it is necessary to apply disturbances to the scenario in order to bring the model closer to reality. Also, it is vital that the quantities involved in the perturbations, as well as the deterministic data used in the model, approach the system operation.
Model robustness
The setup of Fig. 3 was used to inject system disturbances in the system. Thus, the system is composed of a deterministic control and a stochastic traffic model. The stochastic traffic model here presented uses the limit values found by fuzzy regression to populate a triangular distribution from which stochastic data will be sampled.
First, we analyzed the deterministic behavior of the control system, considering an ideal deterministic scenario. Second, we run simulations for the stochastic scenarios considering different random generator seeds. The bus trajectories for the simulations are shown below, namely Fig. 11 presents the ideal scenario, while Fig. 12 presents the results for the stochastic scenario. All simulation run for 15 bus stops ahead of the beginning of the process.

Bus trajectories for the deterministic scenario.

Bus trajectories for the stochastic scenario with different seeds.
Considering each case, the results of the passenger waiting time appear in Table 6.
Passengers waiting time after 15 simulation steps.
Although the waiting time of passengers is obviously longer in the stochastic scenario, it is possible to notice that the control managed to ensure the correct system operation (bus headway control) even in this kind of situation, keeping the headway between the buses and avoiding bunching situations. This is possible due to the predictive nature of the control proposed in [5], which is extended in this paper in order to enhance the boarding/alighting process, whereby real system disturbances are compensated by the feedback nature of the control strategy.
In order to test the quality of the proposed enhancement, a comparison was made with the baseline model from [5], and also with two traditional controls, namely Forward Headway Control (FH) and Two-Way Headway Control (TWH), as shown in [7]. The same setup of Fig. 3 was adopted in this analysis whose results are presented in Table 7.
Model Comparison.
Model Comparison.
As presented by [7], optimal and predictive control methods are expected to improve passenger waiting times. Owing to the nature of the predictive models embedded in such control systems, which rely on historical data, the model parameters need to be fined adjusted to better represent the dynamics of the BRT system of interest.
Notice that the baseline model [5] uses fixed values for C a and C b based only on historical records, without taking into account the number of boarded passengers. Thus, it is not surprising that, by adjusting the number of boarded passengers, the proposed methodology leads to a shorter waiting time compared to the original model. This is due to the fact that poorly adjusted parameters can lead, for example, to an overestimation of the waiting time of passengers, generating control actions (holding signals) above what is necessary. The opposite is also true.
Despite the original model from [5] bringing about significant gains when compared to traditional control methods (FH and TWH), it can be seen that the adoption of the methodology proposed here can further benefit the model, and consequently the users of the public transportation system.
The advance of computational intelligence technologies, together with infrastructures for system interconnection, can facilitate the tuning of the control parameters of the BRT lines, which must be carried out individually for each line. Despite the potentially high deployment and maintenance cost of the technologies and resources needed to implement an optimal control, such as APC (Automatic Passenger Counter), such cost can be diluted over time in the case of a well-adjusted control, reducing bottlenecks of the system.
This paper presented a methodology to take into account the number of passengers already boarded in the modeling of a BRT system. In order to populate and test this formulation, an approach using two-stage fuzzy regression to estimate the time of boarding/alighting of passengers was employed. This approach used data from the real Trunk line 10 scenario, which was collected in the city of Blumenau, Brazil, measuring the number of passengers to be processed, the time of boarding/alighting by passengers, and the number of passengers already boarded.
The fuzzy regression results were also used in the stochastic simulation model to represent the random behavior present in the transit system better. In this way, the methodology presented here can be used as a framework to test other forms of control in a most accurate scenario.
Simulation results suggest that, for the studied scenario, the predictive nature of the control is capable of maintaining the bus headways and avoiding bunching even in the presence of disturbances.
For future works, it is suggested to estimate other stochastic parameters of the model, bringing the estimated model closer to the system operation. One could, for example, consider investigating possible disturbances in travel times, and passengers boarding and alighting rates.
Footnotes
Acknowledgments
This research was funded in part by CAPES under Program PrInt #698503P.
