Madrid is the capital of Spain and has a population of 3.5 million, a metropolitan area of 6 million inhabitants, and an area of 8030 km2. About 12.93 million displacements take place in its metropolitan area every working day, so it is crucial to ensure an efficient transport network in a city of this size.
PT in the central core of Madrid includes buses, metro, light rail, suburban train and a bicycle sharing system. The urban bus service covers the entire central core with over 200 lines, 3562 km in length, and some 1900 vehicles. The metro network also plays a key role in this area, with 13 lines and a total length of 287 km. This study focuses on the bus and metro systems, as they are the main urban modes of transport, and in combination account for almost 85% of total trips in the central core (2.6 million trips every working day). Our estimates show that approximately 56% of PT users make a single transfer, while 21% transfer more than once, highlighting the importance of optimising transfers to achieve an efficient and high-quality PT system.
Focus groups conducted in Madrid
The aim of the FGs is to identify the factors affecting users’ perception of transfers in their multimodal trips. We conducted three FGs in Madrid: the first with university students, the second with middle-aged workers, and the third with retired people over 65. The decision to separate the respondents by age was due to the differences in the literature on the physical limitations of the elderly [19, 26, 27], which influence the perception of transfers.
A total of 20 people participated in the three FGs. A €10 gift voucher was offered as an incentive for participation in all cases. The FGs were conducted according to the recommended methodologies [27,29,29]. For detailed information on the process, see Cascajo et al. [5].
The results revealed several factors affecting users’ perception of the transfer penalty. The most important –mentioned by over 50% of the participants in the FGs– are “time” (in its different components: walking time, waiting time, transfer time, and total travel time); real-time information; crowding; mode; and different levels in the transfer. Two factors emerged from the FGs that did not appear in the literature, related to the pure transfer penalty [5]: mental effort and activity disruption. Mental effort refers to the extra work required by passengers when making a transfer and the need to remain alert throughout the whole journey in order not to miss their transfer stop. Activity disruption concerns the utility of in-vehicle time, especially on longer trips when the time on-board can be used for activities such as reading, listening to music or even sleeping. Activity disruption is in some way related to mental effort; travellers must be aware of the stop where they need to get off, and can therefore not immerse themselves wholly in their chosen on-board activity, which must also be interrupted when alighting to make the transfer.
Perceived transfer utility in Madrid
The three FGs conducted in Madrid identified the following quantitative and qualitative variables warranting inclusion in the preliminary utility functions:
-
Mode: takes value 1 if metro and 0 if bus.
-
In-vehicle time: time (min) elapsed while a person is inside a mode of transport.
-
Walking time: time (min) elapsed from the moment a traveller gets off a vehicle and walks to reach the next stop or station.
-
Waiting time: time (min) elapsed from the instant a traveller arrives at a stop or station and waits until the next mode.
-
Stairs: takes value 1 if there are stairs (or a difference in level) while transferring and 0 otherwise (it is always 0 in bus-bus transfers).
-
Real-time information: takes value 1 if there are panels with dynamic time arrival for the intended trip as a whole and 0 otherwise.
-
Crowding: takes value 1 if the transfer is overcrowded (involving walking and waiting stages) and 0 otherwise.
Utility functions also captured mental effort and activity disruption (pure penalty transfer) through their constants (α1 and α2), which refer not only to this phenomenon but to others not included in the functions. As both utility functions contain the most relevant variables related to transfers, constants will mainly capture the pure penalty transfer.
The first linear utility functions were based on these representative transfer variables. Utility functions had the structure of a MNL model. In our case study, we defined three utility functions depending on the number of transfers (T0 –no transfers–, T1 –one transfer– or T2 –two transfers–). As a 21% of PT users transfer more than once, we decided to show these three alternatives, which is another point of interest of this research.
$$ U\left({T}_0\right)={\beta}_{tveh00}\cdot tveh0+{\beta}_{mode00}\cdot mode0 $$
(4)
$$ U\left({T}_1\right)={\alpha}_1+{\beta}_{\mathrm{tveh}01}\cdot \mathrm{tveh}0+{\beta}_{\mathrm{mode}01}\cdot \mathrm{mode}0+{\beta}_{\mathrm{twalk}11}\cdot \mathrm{twalk}1+{\beta}_{\mathrm{twait}11}\cdot \mathrm{twait}1+{\beta}_{\mathrm{stair}11}\cdot \mathrm{stair}1+{\beta}_{\mathrm{tveh}11}\cdot \mathrm{tveh}1+{\beta}_{\mathrm{mode}11}\cdot \mathrm{mode}1+{\beta}_{\mathrm{info}1}\cdot \mathrm{info}+{\beta}_{\mathrm{crowd}1}\cdot \mathrm{crowd} $$
(5)
$$ U\left({T}_2\right)={\alpha}_2+{\beta}_{tveh02}\cdot tveh0+{\beta}_{mode02}\cdot mode0+{\beta}_{twalk12}\cdot twalk1+{\beta}_{twait12}\cdot twait1+{\beta}_{stair12}\cdot stair1+{\beta}_{tveh12}\cdot tveh1+{\beta}_{mode12}\cdot mode1+{\beta}_{twalk22}\cdot twalk2+{\beta}_{twait22}\cdot twait2+{\beta}_{stair22}\cdot stair2+{\beta}_{tveh22}\cdot tveh2+{\beta}_{mode22}\cdot mode2+{\beta}_{info2}\cdot \inf o+{\beta}_{crowd2}\cdot crowd $$
(6)
It should be noted that cost is not included in the utility functions. About 73% of PT users in Madrid have a monthly or annual flat-rate travel card (in the commuter group this number is even higher). The remaining 27% use single or multiple tickets which allow transfers within the metro network at no extra cost (except in the case of metro-bus or bus-bus transfer). There is therefore no extra cost to transfer in most cases, and nor was this cost variable found to be significant in the FGs. Commuters assume cost as an unchanging variable when travelling on PT, regardless of the number of transfers. These predicted and preliminary utility functions serve as the basis for designing the SP survey (both the pilot and final versions).
Pilot survey design
The pilot test is one of the most important components of the survey procedure. A pilot survey is a useful fail-safe precaution to take before conducting the main survey [30]. The main benefits of pilot surveys from our point of view are the following: they test the survey structure and the validity of the experimental design; they allow the fieldwork to be refined; and they identify certain behaviours by the respondents.
The pilot survey was web-based. The questionnaire was broadly divided into three main parts: a) trip characteristics regarding current travel behaviour (RP data); b) SP choice scenarios; and c) socio-economic/personal information. Below we explain parts a) and c) in the RP and socioeconomic questions, and part b) in SP choice situations.
RP and socioeconomic questions
Aside from the SP part of the survey, in part a) we asked participants about the characteristics of their regular journey, including all the variables described, plus others. They were asked their occupation (as participants have to commute), trip purpose, type of ticket used, trip origin and destination, trip start time, number of transfers, and trip features such as total travel time, waiting and walking time, modes of transport used, in-vehicle time, whether passengers used mobile apps to see the waiting time for the next vehicle, existence of real time information panels during transfers, and whether they engage in any activity during the trip (listening to music, reading, studying, sleeping and others).
Part c) gathered socio-economic and personal information. The questions concerned gender, age, level of studies completed, household income and household size. There were also some questions about the importance and satisfaction with certain aspects related to transfers (real-time information and mobile network coverage during transfers, or sheltered stops and seats at transfer points). There was an open question at the end for noting additional comments.
SP choice situations
Both our pilot and final SP questions were designed using Ngene software based on a multi-criteria approach, which compares a number of alternatives in different choice situations on the basis of attributes obtained from the literature review and FGs. All the variables described and included in the predicted utility functions are used to design choice situations.
We opted to use an efficient design to estimate MNL models, which involved introducing the predicted utility functions. Although the desirable final output of the study is a ML model, and an efficient design can be applied to estimate random parameter models, we chose the MNL option as Ngene strongly recommends first generating a non-Bayesian design with this model [24]. This identifies any potential problems with the design specifications more quickly.
In the particular case of Madrid there are no previous studies calculating prior components, so they were all unknown. We obtained the average values and signs for each one, and the common levels of each attribute from the literature review. The prior parameter values were then slightly modified to ensure the utility balance criterion [25]. Table 2 shows the values of the first prior parameters and the levels of attributes considered.
Some restrictions were also applied to avoid Ngene generating unreal alternatives and failing to ensure the principle of utility balance. For example, total trip time in T1 and T2 should be less than 3 to 10 min compared to T0 and T1, respectively.
After an iterative process, Ngene generated 18 choice situations (the number is a multiple of the levels of attributes) with three alternatives each (54 alternatives in total). Respondents took a long time to understand and choose between these choice situations, so there was a risk that the survey would be only half completed. To avoid this occurring, Ngene generated three blocks, each one of which would be completed by a different respondent; so three respondents would complete a whole SP survey. Each participant therefore answered the block of choices that had generated the fewest responses at that particular time. This has the added advantage of making the answers less correlated between individuals than if only one respondent had completed the survey. On the other hand, more respondents are required to comply with the value of the S-estimate.
Figure 2 shows a screenshot of a choice situation. Total trip time, total walking time and total in-vehicle time are indicated for each alternative. This is because of comments made by the participants in the first test of the survey; they declined to add these times together so they could be considered as a whole.
The way choice situations were shown to participants depends on the habitual trip revealed in the RP questionnaire:
-
If respondents did not usually transfer, they were given 6 simple choice situations between 0 and 1 transfer.
-
If respondents usually transferred once or more, they were given 6 double choice situations between 0 and 1 transfer, and then between the same alternative of 1 and 2 transfers.
These rules produce more realistic data, as participants did not have to choose between unfamiliar scenarios. Finally, the minimum number of required surveys was established at 24.60, corresponding to the maximum value of the S-estimate parameter for all the attributes considered. The minimum number of individuals surveyed is obtained by multiplying the rounded-up S-estimate parameter and the number of blocks; i.e. 75 people were required in our case study.
Conducting the pilot survey
Once the survey had been designed, it was uploaded to a web page. It was decided to use a web-based format as this has some advantages over paper surveys. The main benefits of web-based surveys are [31]: they are cheaper than other survey setups; they reduce the time required for implementation; data from web-based surveys can easily be imported into data analysis programs; and the respondents can answer at their own convenience (some online surveys even allow respondents to start and then return to the question where they left off earlier).
Respondents were mainly recruited by handing out flyers with all the information required for the survey: website address, personal password for filling in the survey, information about the project and the option of entering a draw for a gift voucher if the survey was completed. The flyers were distributed at metro and bus stations during the morning peak commuter period (7–10 am) on five consecutive weekdays, seeking to obtain a representative sample. As indicated before, target participants were limited to commuters. Participation was voluntary. The answer ratio using this method was around 15%. A €200 gift voucher was offered as an incentive for participation.
Ten respondents completed the survey with personal assistance to test whether there were comprehension problems and whether the icons used in the SP part of the survey (see Fig. 2) could be understood.
The pilot survey involved 79 commuters, of whom 65% were workers and the remaining 35% were students. The average age was 31, and 56% were women. All respondents regularly commuted via PT and their average number of transfers was 0.72 (44% of respondents did not transfer). The average door-to-door trip time was about 35 min, and initial and final walking times were around 5 and 7 min respectively. 20% and 80% of participants chose the no-transfer and one-transfer options respectively in the SP choice situations. When they were asked to choose between making one or two transfers, 64% opted for the first option and the remaining 36% for the second.
Limitations of the survey
It should be noted that the number of respondents surveyed is sufficiently representative to obtain prior parameters, but small enough to gather meaningful results. However the main aim of this paper is to describe the methodology to conduct a successful SP experimental survey. The MNL model suggests some preliminary policy recommendations, which must be confirmed by the results of the final survey.
MNL model calibration
After conducting the pilot SP survey and cleaning up the data base, we used Limdep NLogit software to run the utility functions (Table 1). The results in Table 1 offer great potential for analysing trends and providing preliminary policy recommendations, and for understanding users’ perception of transfers.
Table 1 Results of the MNL models of utility functions. Pilot survey
All significant variables have the expected signs and values. Time-related variables (walking time, waiting time and in-vehicle time) are negative as expected, and almost all significantly. In order to estimate comparisons in equivalent in-vehicle times (IVT), we set an average value of 0.3576 and 0.3472 when making one and two transfers respectively. This shows that on average, in-vehicle times are perceived almost the same, regardless of whether one or two transfers are made. Walking and waiting times, however, are more poorly perceived on average in U(T2) than in U(T1) in this pilot study.
Constants in U(T1) and U(T2) are also significant, and explain all the unobserved variables not included in the model, and particularly the pure transfer penalty phenomenon. The constant in U(T2) is clearly higher than in U(T1), so the penalty perceived on trips increases with the number of transfers. The impact of the pure transfer penalty is perceived as 10.9 and 16.7 equivalent in-vehicle minutes when making one and two transfers respectively. Stairs also have a negative sign, but this result must be treated with caution as it is close to –but not significantly different from– zero at the 90% level.
It is worth noting the disutility produced by crowding scenarios, which is higher in the case of two transfers than only one. This variable, which implies a large number of people gathered together in a limited space, influences transfer perceptions, and is one of the most significant in utility functions after constants. Its impact is comparable to an increase of 2.9 min in equivalent IVT when making one transfer, and more than double when making two transfers (6.8 min equivalent IVT), highlighting its importance. Nor is mode significantly different from zero at the 95% level, which can be explained by the way the variable was introduced in the model. This can be resolved by expressing the variable mode using dummy variables indicating the absence or presence of transfers between bus-metro, metro-bus and metro-metro (assuming transfer between bus-bus as the reference group). In the next step of the research, we separate out the effects of the different modal transfer combinations in the ECL model when analysing the results of the final survey.
Utility functions indicated that the most severely penalised time varied between alternatives. If only one transfer was made, in-vehicle time produced the maximum disutility (greater than walking or waiting times). However, when transferring twice, walking time was more poorly perceived in the second transfer, followed by in-vehicle time in the second vehicle, and waiting times. These results can be compared to those of other studies. Navarrete and Ortúzar [8] reported that waiting time was the most severely penalised, followed by initial and final walking times. These variations highlight the differences between cities, as stated by Iseki and Taylor [4].
Finally, parameter values from Table 1 were introduced as new prior components to design the definitive survey.