Introduction

Ride-pooling is a ride-hailing service which allows for the shared rides. Two or more trip requests submitted to the ride-hailing platform (like Uber, DiDi or Lyft) are pooled into a single vehicle, which picks consecutive travellers at their origins and takes them to their respective destinations, as illustrated in Fig. 1. Such pooled ride is typically longer (due to pick-up delay and detour) and yields a sharing discomfort, which is compensated by a lower ride fare.

Ride-pooling, despite being a promising emerging mode of urban mobility, still struggles to build up momentum. Pre-pandemic it reached critical mass and become profitable for providers and attractive for travellers (Shaheen et al. 2016). Yet, in the post-pandemic landscape of urban mobility, it needs to grow again. Ride-pooling services, when deployed successfully and on a bigger scale, can substantially contribute towards sustainability transitions. They offer an attractive alternative to the private car usage or the solo ride-hailing, yet with greater occupancy. When designed carefully, they complement the public transport offering attractive first-last mile services, or being an alternative in the areas and time-periods where high frequency mass transit is unnecessary. This is challenging, specifically when the perspective of four different parties need to be aligned: travellers (who want to arrive on time, with high comfort and at a low price), drivers (who want to maximise their earnings), platform (who wants to increase market share and benefit from high commission rates) and the city (who wants to reach the objectives of its transport policy). Such alignment is not trivial, specifically when we acknowledge that individual agents are heterogeneous and have different behavioural traits, leading to different perceptions of the new service. Here, we contribute to that alignment and demonstrate how we can assess the ride-pooling system’s performance with heterogeneous travellers.

Fig. 1
figure 1

Ride-pooling illustrated. Three travellers request to travel from their origins (marked with \(\times \)) to destinations (marked with a \(\triangledown \)). Apart from offering them three separate solo-rides (marked with grey), platform offers to pool them together and serve a single trip (red). This induces a discomfort (sharing trip with others), detours (route is longer) and delay (three travellers need to coordinate their departure times) which is compensated with a discounted trip fare. Will the travellers find the pooled option attractive? This depends on their behavioural profiles. So far travellers were assumed either captive, homogeneous and/or deterministic. Here, we utilise the recent findings on heterogeneous, non-deterministic ride-pooing behaviours. Each of three travellers may now have different value-of-time or willingness to share and decide to opt-out from the pool. We investigate how such heterogeneity impacts the expected system performance

While the early studies on the ride-pooling focused on the computational complexity and challenges of real-time operations, the perspective of travellers was often neglected. State-of-the-practice algorithms (Alonso-Mora et al. 2017) successfully match travellers into high order groups, served in real-time with ride-hailing vehicles. Though they often treat travellers captive, insensitive to delay and detour of the ride-pooling services as long as they fit into arbitrary time-windows. In reality, travellers are rational utility maximisers, extremely sensitive to the service performance. Naturally, opting-out from the systems which they perceive as not optimal. In that notion, the ExMAS (Kucharski and Cats 2020) algorithm was built, strictly limiting the ride-pooling to the rides attractive for the travellers, for which the utility is greater than for the alternative (e.g. private ride-hailing ride).

Here, we move further along this line and introduce the natural heterogeneity into the picture. For the new mobility modes, the perceptions may differ dramatically, from enthusiasts to sceptics. This is no different in the ride-pooling where some travellers may avoid crowding, be afraid of infectious diseases, while others aim to save time or save money. Expectedly, such heterogeneity was observed in empirical studies (Alonso-González et al. 2021), but hitherto not implemented into algorithms and experimental studies. As we demonstrate, neglecting the fact that the decision to the ride-pool is ride-specific and made by the heterogeneous agents with substantially different attitudes leads to false assessments.

Ride-pooling needs a critical mass to generate momentum and take-off, which leads to non-linearity in the system performance. However, not only the total number of clients in the system but also the presence and properties of some individual travellers may be crucial. Thus, the assessment based on the mean, average expected behaviour of ride-pooling individuals is likely to be not representative of the whole system’s performance. When we replace the average shareability traits of travellers (behavioural preferences regarding pooling) with individual properties, we will observe on one hand, isolated travellers unwilling to share with anyone and on the other, extremely shareable individuals happy to join any other pooled group. This has a highly asymmetrical impact on the structure of the underlying shareability graph and thus on the ride-pooling performance, we observe it e.g. through the maximal size of pooled rides, which increases when travellers become heterogeneous.

We set an experiment aimed to provide meaningful insights on expected ride-pooling system performance for the operator serving trips in NYC requested by heterogeneous travellers of unknown ride-pooling behaviour. We use findings from an empirical study (Alonso-González et al. 2021) conducted in Netherlands, which revealed four distinct classes of ride-pooling travellers to estimate their pooling decisions. With use of 1000 replications, we calculate system performance for different realisations of the behavioural configurations. Our results yield not only a mean assessment but also give pessimistic and optimistic boundaries to help the operator make informed decisions.

Related work

Ride-pooling is an environmental- and city-friendly service compared to a standard ride-hailing services offered by transport network companies (TNCs). Fagnant and Kockelman (2014) argues that one shared vehicle can replace around eleven privately owned vehicles. Benefits of a partial adoption of on-demand vehicles for the ride-pooling service are discussed in Engelhardt et al. (2019). Martinez et al. (2015) proposes a ride-pooling system which, while added to a standard ride-hailing system, is beneficial for users. Zhang and Nie (2021) discusses the up-sides and down-sides from travellers’ and operator’s perspective. Martinez and Viegas (2017) provides an insight into a CO2 reduction. The most common measure for environmental benefits is vehicle kilometres reduction, widely reported in the studies, like Bilali et al. (2020) which provides both a theoretical estimation and simulation results of ride-pooling efficiency in terms of reduced vehicle hours. Researchers document different values (dependent upon demand and city structure, matching algorithm): Zhu and Mo (2022) finds the total vehicle distance reduced by only \(8.21\%\), Chen et al. (2017) provides the value of \(16\%\), while the seminar paper by Santi et al. (2014) suggests savings exceeding \(30\%\).

Ride-pooling can be analysed on many levels. Ramifications of spatiotemporal demand structure are analysed for two major German cities by Zwick and Axhausen (2022). Soza-Parra et al. (2022) analyses how ride-pooling performance changes with the various synthetic demand distributions. Operational challenges on the supply side are analysed in the paper by Zwick et al. (2022). Zhang and Nie (2021) analyses pricing, finds market equilibrium and proposes congestion tax policy. Competition in the duopoly ride-sourcing market is analysed by Guo and Huang (2022). Li et al. (2022) analyses optimal fare and fleet size with congestion effects. Brodeur and Nield (2018) shows the impact of weather conditions on the demand level for ride-hailing services.

Variety of algorithms were proposed to solve the challenging ride-pooling problem. In the seminal work, Santi et al. (2014) introduces a real-time ride-pooling algorithm and invaluable shareability networks, however limits his solution to two sharing travellers. Alonso-Mora et al. (2017) proposed a state-of-the-art real-time algorithm which allows a higher-degree rides and is considered the benchmark for other approaches. Above authors, like many others (Ke et al. 2021; Shah et al. 2020; Bilali et al. 2020; Wang et al. 2021) use fixed constraints, typically on maximal pick up delay and detour time. The fixed constraints are irrespective of total travel time duration and individual preference. An alternative approach is proposed by Kucharski and Cats (2020). The proposed ExMAS algorithm (detailed in the Methodology section) is an offline algorithm which has relative constraints posed by travellers. The acceptable pick-up and detour time is determined by unified behavioural parameters (such as value of time, penalty for sharing) and travellers are pooling only if it is attractive for them.

To determine performance of the ride-pooling system, Ke et al. (2021) analyses three aspects: fraction of passengers who effectively share a ride, average detour of a traveller and average vehicle routing distance. Al-Abbasi et al. (2019) introduces a deep neural network with the objective function aiming to minimise four objectives: supply–demand mismatch, waiting time and detour distance, travel time extension and number of vehicles and resources used. Agatz et al. (2012) lists three system-wide performance indicators: vehicle distance, travel time, fraction of pooled passengers.

Many studies have been conducted to understand ride-pooling preferences (Krueger et al. 2016; Alonso-González et al. 2021; Lavieri and Bhat 2019). Chavis and Gayah (2017) examines preferences towards three services: fixed, flexible and individual, and estimates alternative specific constants of route type along with explanatory (travel time, cost, etc.), socioeconomic and attitudinal variables. Krueger et al. (2016) introduces three alternatives: non-shared ride-hailing, ride-pooling and public transit. The primary variable is the value of time, which is further split into in-vehicle time and waiting time. In a stated preference survey Geržinič et al. (2022) assesses the demand for on-demand and pooled mobility services in urban areas. Alonso-González et al. (2021) investigates in depth ride-pooling preferences in ride-hailing services. They design a stated preference study and (based on the sample of 1000 Dutch respondents) disentangle the sharing aspect from related time–cost trade-offs (e.g. detours), investigate preference heterogeneity and identify distinct market segments. The study, which results we apply in our experiments, identifies four classes with distinct behavioural characteristics, including value of time and willingness to share (penalty for sharing).

Study objectives and experimental setting

We take the side of the operator who wants to estimate the potential of introducing ride-pooling. We assume that the operator can predict the demand for her services in terms of trip requests (origin, destination and time), but she is not aware of the specific attitudes of particular clients. She does not know their individual value-of-time and penalty-for-sharing, but we assume that they can be either explicitly revealed by clients or inferred with the operations as the platform operator will observe their actual choices. Yet before that, the operator only knows the system-wide composition of the population (from empirical studies) and can decompose the population into distinct groups of more definite behavioural traits. Our objective is to provide the operator, who wants to act optimally in this unknown setting, with meaningful distributions of the key metrics.

We illustrate the method with an experiment in New York, where we use the actual travel demand observed for taxi trips. We assume that the operator already well-predicted the demand pattern in terms of number of travellers requesting trips along with their origins, destination and desired departure times. Moreover, the operator conducted a study like (Alonso-González et al. 2021) and knows the system-wide distributions of travel behaviour variables. Now, the operator wants to reliably predict the expected performance (measured with four indicators proposed in next section) and its distributions. This introduces two levels of uncertainty. First, we do not know the actual realisation of travel behaviour of each traveller. Second, even when the population-wide traits are already determined, they are not known to the operator. To address the former, we use replications, while the latter we assume can be learned by the operator along the process. This can be done either in one-shot, where each traveller is asked to declare own preferences, or iteratively, where platform observes revealed choices and can infer the behavioural characteristics. The former is unpractical, while the latter is doable, yet the exact method is out of scope of this study.

We start with the overly-optimistic benchmark, assuming that all travellers are pooling enthusiasts. This allows us to obtain the complete (maximal) set of possible ride-pooling combinations (optimistic shareability graph). We then simulate 1000 realisations of the of behavioural traits configuration. Each time we sample individual characteristic according to the empirical stated-preference study by Alonso-González et al. (2021), in particular the values of value-of-time and willingness-to-share (penalty for sharing). In each independent simulation, travellers already have their determined behavioural parameters and can tell which of the rides precomputed in the optimistic scenario remain attractive. Here, we need to assume that the platform knows which shared rides remained attractive in respective replication, i.e. knows the filtering of the optimistic shareability graph to the one with attractive rides only (in a given realisation). This may be understood as the case in which each traveller, pre-trip, assesses which ride-pooling options she finds attractive (the operator can mimic the process based on the preferences being revealed or inferred from the past client’s decisions). Such setting, despite being unlikely in practice, allows us to conduct a rich experimental with insightful results, since we may assume that the actual matching (assigning each traveller a pooled ride) is deterministic in a single replication.

Such heterogeneous perspective provides the provider with a far more meaningful estimation of the system performance and its distribution. Our results, apart from the expected values, reveal confidence intervals and tails. Estimates of the performance (e.g. vehicle kilometres travelled, fares collected) and perceived attractiveness for travellers are now more robust. Policymakers and system operators may now understand better the different points of view and tailor the system to the needs of varying target groups. The approach is of particular importance when the ride-pooling is considered as an auxiliary service.

Thanks to such an experimental setting, policymakers may understand when and where the ride-pooling system remains efficient, i.e. positively contribute towards sustainability transitions. Platform executives may use it to make optimal business decisions, which make the ride-pooling operations profitable. Based on such analysis, the platform provider may reasonably identify, e.g. the optimal discount rate at which ride-pooling service, for the expected demand, is both reliable, attractive and profitable. This is particularly timely post-pandemic. When reintroduced, ride-pooling services offer significantly lower discounts for ride pooling than pre-pandemic (like Uber: compare Lo and Morseman (2018) and UberX (2023)). Other platform providers may want to identify what kind of offer would be their best response. And this is crucial not only for the benefits of a particular provider but also system-wide—which we want to benefit from ride-pooling.

Study objectives and approach

Methodology

To reach the objectives of this study, we extend the previously proposed off-line utility-based ride-pooling algorithm ExMAS. We introduce non-deterministic utility formulas to integrate them with the previous deterministic approach and solve the ride-pooling problem for heterogeneous population. ExMAS solves the ride-pooling problem, i.e. for a given demand (set of trip requests) it first identifies the attractive pooled groups (rides) of any size (degree) and then optimally assigns each traveller to a particular ride (pooled or private).

In the heterogeneous scenario, we first assume that all travellers are ride-pooling enthusiasts. We create an overly optimistic set of feasible shared rides. Then, we sample travellers behaviour and accordingly filter optimistic rides only to the ones actually attractive. For the filtering, we assume the operator can accurately predict which rides are subjectively attractive for their clients. If for any of the co-travellers a ride is not attractive, the ride is removed from the set of feasible rides. From such filtered shareability graph, we identify a subset minimising vehicle mileage (matching). From this solution we estimate the four central KPIs of the system: mileage reductions, perceived attractiveness, passenger detour and profitability.

The uncertainty requiring replications lies in sampling individual traits. A single replication denotes a one configuration of those characteristics and the filtering conducted accordingly. The process is summarised in Algorithm 1.

Algorithm 1
figure a

The general overview of the algorithm

Ride-pooling problem solution with ExMAS

Exact matching of attractive shared rides (ExMAS, Kucharski and Cats 2020) is a strategic, utility-driven ride-pooling algorithm. In contrast to e.g. Alonso-Mora et al. (2017), ExMAS works offline, assuming a complete knowledge of the demand and (since it takes the strictly demand-oriented perspective) does not explicit consider individual vehicles. While such an approach is clearly not suitable for real-time operations, it is invaluable for strategic, demand-oriented analyses. Instead of fixed time windows or spatial constrains in which travellers are pooled regardless of their preferences, the focus is on the passengers’ utility.

More strictly, two or more passengers are assigned to a shared ride if the pooled ride utility is greater than a solo-ride utility for each of them. Utility is generically composed of travel times, costs and their relative weights (behavioural parameters). In the original ExMAS, the (dis)utility of a shared- (for passenger i participating in ride \(r_k\) denoted as \(U^s_{i, r_k}\)) and of the non-shared ride (denoted \(U^{ns}_i\)) were the following:

$$\begin{aligned} \begin{aligned}&U^{ns}_i = -\rho l_i - \beta _t t_i \\&U^s_{i, r_k} = -(1 - \lambda ) \rho l_i - \beta _t \beta _s ({\hat{t}}_i + \beta _d({\hat{t}}^p_i)) + \epsilon , \end{aligned} \end{aligned}$$
(1)

where \(\rho \) stands for per-kilometre fare, while \(\lambda \) denotes discount for sharing a ride. Both are controlled by the operator. \(\beta ^t\), \(\beta ^s\) \(\beta ^d\) are the behavioural parameters: value of time, penalty for sharing and delay sensitivity, respectively. \(t_i\) and \({\hat{t}}_i\) stand for travel time of non-shared and shared ride, respectively, \({\hat{t}}^p_i\) is a pick-up delay. Travel time with the shared ride is updated for each evaluated ride candidate and is typically greater than \(t_i\) due to both pooling detour (to pick-up and drop of other travellers) and delay (to wait for others). \(\epsilon \) is a random term. Hitherto, all behavioural parameters (\(\beta \)’s) were assumed homogeneous among the population and the problem was simplified by the assumption that \(\epsilon = 0\) which makes the problem purely deterministic. Consequently, a shared ride \(r_k\) was attractive by the i-th traveller if simply \(U^s_{i, r_k} > U^{ns}_i\).

ExMAS uses the above formulation to evaluate if any theoretically feasible group of travellers is attractive. Starting with pairs of travellers and then gradually increasing the degree (size) of the identified pooled rides. Sharing discount is assumed to be flat, regardless of number of co-travellers, hence attractive ride of higher order comprises attractive rides of lower order. Applying the recursion and exploiting the shareability graph properties, ExMAS identifies for potential attractive triples by combining pairs, later quadruple based on triples, etc. The search is exhaustive, yet the filtering for attractive rides only, prevents the search space explosion in practical settings.

Identified rides may be represented with a so-called shareability networks, one of the outcomes of ExMAS that we exploit in this study. Two networks are generated: in the shareability network two nodes (travellers) are linked when they can be attractively pooled, while in the matching network two nodes (travellers) are linked if they actually travel together in the final solution (bi-partite matching), as we illustrate in the results.

From the set of all feasible and attractive pooled rides we may identify the optimal solution. Solution is found with a bi-partite matching problem minimising the total vehicle hours with a constraint that each passenger is served with exactly one ride (pooled or private). The output is a set of rides with their schedules (subsequent pick-up and drop-off points and times served along the ride) from which we can measure the ride-pooling system’s performance.

Full details on the algorithmic steps in ExMAS are described in Kucharski and Cats (2020). Notably, ExMAS is demand-driven and thus the vehicles (fleet) is not considered explicitly with a reasonable assumptions that: a) the fleet size needed to serve pooled services is not greater than serving individual private rides and b) when the degree of attractive pooled rides starts exceeding the capacity of a single vehicle (three travellers) the service reached a critical mass advocating for bigger vehicles (e.g. 8-person vans).

Ride-pooling utility for non-deterministic heterogeneous travellers

In the study, we extend hitherto applied approach to account for the heterogeneous behavioural characteristics of individuals. Namely, any of behavioural parameters (\(\beta \)’s in Eq. 1) instead of being constants, can be now a random variable. We assume that preferences of an individual are unknown, while we know the distributions within the population. Moreover, to account for additional uncertainty, we introduce a panel noise \(\epsilon _i\) (traveller-specific) and noise \(\epsilon _{i, r}\) (ride-specific for specific traveller), leading to the following formulas:

$$\begin{aligned}&U^{ns}_i = \rho l_i + \mathring{\beta _t} t_i \end{aligned}$$
(2a)
$$\begin{aligned}&U^s_{i, r_k} = (1 - \lambda )\rho l_i + \mathring{\beta _t} \mathring{\beta _s} ({\hat{t}}_i + \mathring{\beta _d}({\hat{t}}^p_i)) + \epsilon _i + \epsilon _{i,r}, \end{aligned}$$
(2b)

where all notions introduced in the Eq. 1 still apply, yet \(\mathring{\beta }\)s are now random variables. Consistently with a discrete choice theory, we assume that the probability that a traveller i finds a shared ride \(r_k\) attractive is expressed with:

$$\begin{aligned} P^s_{i, r_k} = \Pr ( U^s_{i, r_k} > U^{ns}_i), \end{aligned}$$
(3)

which depending on the random variables distributions may become e.g. a logit or a probit model. While in general any distribution of random variable can be applied in the method, in the experiment we use multiple classes \({\mathcal {C}}\) and assume the random variables to follow a multimodal normal distribution (unimodal within the class):

$$\begin{aligned} \mathring{\beta _j} = \sum _{C \in {\mathcal {C}}} p_{C} X({\bar{\beta }}_{j,C},\sigma _{j,C}) \mathrm {\ for\ } j \in \{c, t, s, d \}, \end{aligned}$$
(4)

where \(p_{C}\) is the probability of belonging to the class C, \(X({\bar{\beta }}_{t,C},\sigma _{j,C})\) is a random variable following normal distribution with mean of \({\bar{\beta }}_{t,C}\) and a standard deviation of \(\sigma _{j,C}\) (coefficient- and class-specific).

The above formula may be applied for any behavioural parameter, yet in our experiment only \(\beta _t\) and \(\beta _s\) are random variables. Which is in-line with findings from recent studies on behavioural heterogeneity among ride-pooling travellers’ (Krueger et al. 2016; Alonso-González et al. 2021; Lavieri and Bhat 2019). Value of time (VoT) serves as a proxy between longer travel time and monetary savings. Penalty for sharing (PfS) reflects an individual’s attitude towards sharing (including infection awareness), which varies substantially. While VoT and PfS are collinear in \(U^s\) formula, VoT is the only one to impact \(U^{ns}\), which disentangles the impact of the two variables.

Mind that the two noise terms in Eq. 2 are applied differently, the first-one \(\epsilon _i\) is the traveller-specific and is drawn once for each ride-pooling problem. \(\epsilon _i\) can be interpreted as an attitude towards pooling, irrespective of the trip’s characteristics. While the traveller remains a rational decision maker, there is always uncertainty associated with choice which is not captured by the utility formula, expressed with \(\epsilon _{i,r}\) ( independent for each traveller and each potentially attractive pooled ride).

On the final note, we assume that class-membership probability \(p_C\) is an independent random variable, yet this can be easily enhanced e.g. with sociodemographic variables (when specific age, gender or income groups constitute respective classes). In the practical scenario, the platform may either successfully learn or predict the class membership of the specific i-th traveller \({\textbf{P}}({C} \text {|} i)\), effectively improving estimation. Unfortunately, such data was not included neither in the demand set that we used, nor in the choice model estimation of Alonso-González et al. (2021).

Subsequent to utility determination, the remainder of the ride-pooling algorithm remains the same as summarised in Sect. 2.1 and detailed in the Kucharski and Cats (2020).

Construction and filtering of the shareability graph

First, we construct the optimistic shareability graph. To do that, we assume all travellers exhibit a low level of value of time and penalty for sharing. The density of the graph is controlled with the population distribution percentile \(\alpha \). Let \(\mathring{\beta }_t, \mathring{\beta }_s\) denote variables following population distributions of VoT and PfS, respectively. We seek \(\beta _t^0\) and \(\beta _s^0\) such that:

$$\begin{aligned} \Pr (\mathring{\beta _t}> \beta _t^0) = \Pr (\mathring{\beta _s} > \beta _s^0) = 1-\alpha . \end{aligned}$$
(5)

Once determined, we apply \(\beta _t^0\) and \(\beta _t^0\) to Eq. 1 to construct the initial shareability graph with pooling-favourable homogeneous parameters. To guarantee that we do not discard any valuable combinations, we argue that the \(\alpha \) level should be low (in the study, we use \(1\%\)).

The second algorithmic stage is the filtering, based on the actual heterogeneous behavioural characteristics. This part of the process is stochastic, as it depends on the exact realisations of the random variables (VoT, PfS, \(\epsilon _i\)). We address it in our analysis via replications. Each replication is bound with the configuration of characteristics of travellers. To properly filter actually attractive shared rides in the given replication, we assume the operator learns those individual features. Features can be explicitly declared by travellers or iteratively inferred by the operator based on the observation of choices. Once the characteristics are known to the operator, she filters for individually attractive rides (attractive to each traveller under his exact behavioural profile) according to formula 2. As the result of filtering, the operator obtains a set of shared rides which are always perceived subjectively by clients as more attractive than their non-shared counterparts.

Once the filtered shareability graph is obtained, the operator conducts the deterministic matching maximising her objective (reduction of the vehicle mileage).

Performance indicators

We measure the performance of ride-pooling solution with the four following indicators. For the environment and city perspective we look at the vehicle-kilometres saved due to pooling \({\mathcal {D}}\). For travellers, we observe pooling costs (relative detours \({\mathcal {T}}\)) and benefits (relative improvement in utility \({\mathcal {U}}\)). For the platform, we look at the profitability of ride-pooling observations \({\mathcal {P}}\).

\({\mathcal {D}}\) expresses the relative mileage savings as follows:

$$\begin{aligned} {\mathcal {D}} = \frac{D_{ns} - D_{s}}{D_{ns}}, \end{aligned}$$
(6)

where \(D_{ns}\) denotes distance in the non-shared scenario (e.g. if everyone travelled alone) and \(D_s\)—total lengths of rides comprising the ride-pooling solution.

Mind that here, we use only distance with travellers and do not include deadheading (distance travelled without travellers)—which we assume do not increase due to sharing (compared to private rides).

From the traveller’s perspective, we measure two quantities: trip detour and utility gain. Similarly to \({\mathcal {D}}\), we introduce functions \({\mathcal {T}}\) and \({\mathcal {U}}\), given as:

$$\begin{aligned} {\mathcal {T}}&= \frac{T_{s}- T_{ns}}{T_{ns}}, \end{aligned}$$
(7)
$$\begin{aligned} {\mathcal {U}}&= -\frac{U_{s} - U_{ns}}{U_{ns}}, \end{aligned}$$
(8)

where \(T_{ns}\) and \(U_{ns}\) denote total travel time and utility of all the travellers in non-shared scenario while \(T_s\) and \(U_s\) corresponding values in ride-pooling scenario. While formally the travel time in shared option is longer due to both detour (longer route due to pooling with others) and delay (waiting for others), here, for brevity, we simply call it a detour. Mind the negative sign in Eq. 8, added since the utility (like usually in the transportation) is the disutility (composed of monetary costs and time) and is always negative.

We introduce the profitability \({\mathcal {P}}\), being greater than 1 when the provider’s mileage savings are greater than loss of revenues (due to sharing discount). First, we introduce it at the ride level:

$$\begin{aligned} {\mathcal {P}}_r = {\left\{ \begin{array}{ll} \frac{(1-\lambda ) {\rho } \sum _{i \in r} d^{ns}_i}{ {\rho } d_r} =\frac{(1-\lambda ) \sum _{i \in r} d^{ns}_i}{d_r}&{} \text {, if ride is pooled} \\ 1 &{} \text {, otherwise,} \end{array}\right. } \end{aligned}$$
(9)

where \(d^{ns}_i\) denotes the direct trip length (traveller i pays for the distance that would be served with the private ride, not with the detour), \(d_r\) is a total mileage of the ride r shared by travellers \(i \in r\), \(\lambda \) stands for sharing discount and \(\rho \) is fare. The second term reflects the travellers who were not pooled with anyone and paid the full price.

Profitability \({\mathcal {P}}\) on the system-wide level is then:

$$\begin{aligned} {\mathcal {P}} = \frac{\sum _{r \in {\textbf{R}}} {\mathcal {P}}_r d_r}{\sum _{r \in {\textbf{R}}} d_r}, \end{aligned}$$
(10)

where \(\sum _{i \in {\textbf{R}}} d_r\) is a total length of all rides in the solution. Profitability is essential for a scenario when the fleet is centrally controlled and drivers receive salary (or hourly rate) or the serving vehicles are autonomous. Then, the operating cost is proportional to the vehicle mileage.

The above indicators can be calculated not only at the system-level (summing over all travellers), but also for individual travellers (\({\mathcal {T}}\) and \({\mathcal {U}}\)) or individual rides (\({\mathcal {D}}\) and \({\mathcal {P}}\)). We will use subscript r or t for reference to individual rides’ (\({\mathcal {D}}_r\), \({\mathcal {P}}_r\)) or traveller’s (\({\mathcal {T}}_t\), \({\mathcal {U}}_t\)) performance.

Assessment

Introduction of heterogeneity when determining whether a pooling option is attractive or not, results in a non-deterministic solution and thus a system’s performance. Importantly, there are two intuitive perspectives on which one can assess the variability. One is to look at the solution of a single ride-pooling problem and apart from identifying the mean performance (e.g. utility gains \({\mathcal {U}}\)) we look at how this total is distributed across the individual travellers \({\mathcal {U}}_i\). In this study, however, we replicate the random process (due to non-deterministic behaviour) and see how the performance vary across the replications (as we report e.g. in Fig. 4). Moreover, two approaches can be joined when we look at individual travellers’ performance across the replications, like we report e.g. in Fig. 7.

Results

We illustrate how the proposed method enhances assessment of ride-pooling services with the case-study of New York City. Results are available to reproduce on the public repositoryFootnote 1.

We start with parametrising the behaviour of four distinct traveller groups revealed in empirical study in Sect. 3.2. Then, we visualise the underlying shareability networks (after the reduction accounting for filtering according to travellers’ actual preferences) and show how stable they are in the homogeneous setting (Fig. 3). We show the distributions of key ride-pooling performance indicators in Fig. 4 and compare with the deterministic benchmarks, further stratified for four behavioural classes in Fig. 5. We report the asymmetrical impact of individual willingness-to-share on system performance and we show, in Fig. 6, how the maximal number of travellers riding together increases in a non-deterministic case. Then, we report how varying behaviour of individuals impacts their ride-pooling performance in Fig. 7. We analyse how the variability of ride-pooling behaviour scales with the demand in Fig. 8. To conclude, we show what is the impact of incorrect assumption of homogeneity of travellers for the service’s attractiveness and performance (Table 4).

Experiment details

We experiment with the trips actually requested in the New York City on January 2016. We use the set of about 150 (exactly 147) trips requested in the 30 minute batch (which is beyond the critical mass needed to induce pooling). The density of requests is moderate, sufficient for pooling to reach critical mass and be effective. Half an hour batch allows for an efficient combination of trip requests into shared rides and reduce computation time compared to longer duration (Shulika et al. 2024).

Each trip request links origin with destination at a given time (as illustrated in Fig. 2). We used the fare \(\rho \) of 1.5 €/km (converted from $, consistent with NYC Taxi & Limousine Commission 2022), sharing discount \(\lambda \) is \(30\%\) (within the range suggested by Shaheen and Cohen (2019)). We assume \(\beta _d = 1\) while the rest of behavioural parameters is detailed in the following Sect. (3.2). For the initial filtering of requests, we use the 1st percentile, i.e. \(\alpha =1\%\) in Eq. 5.

Fig. 2
figure 2

Demand dataset for experiments: ca. 150 trip requests from Jan 2016 in Manhattan. Blue dots are origins, purple destinations

Behavioural component

Out of several empirical studies, we select the one which provides the richest insights on heterogeneity of ride-pooling behaviour. In the stated-preference survey on ride-pooling behaviour by Alonso-González et al. (2021), the Latent Class Choice Model (LCCM) was estimated to fit the empirical results. It indicates not only that people can be stratified into four distinct classes, but there is also a significant behavioural variability inside each class. The fusion of the New York demand dataset with Dutch ride-pooling preferences can be questioned. However, we argue that, while specific traits may differ from New York to Netherlands, the NYC travellers are undoubtedly heterogeneous. Nonetheless, the proposed method is generic and replicable whenever a consistent data becomes available.

Travellers were revealed to be divided into classes defined as “It’s my ride” (C1, constituting \(29\%\) of the sample), “Sharing is saving” (C2, \(28\%\)), “Time is gold” (C3, \(24\%\)), “Cheap and half empty, please” (C4, \(19\%\)). Participation in one of the classes is reflected in distinct behavioural parameters, namely the value of time (VoT) and penalty for sharing (PfS) varied among the classes.

The structure of the utility formulas in Alonso-González et al. (2021) differed from the ExMAS utility formulas, thus, the following interpretations were made to apply the results to our study. Firstly, we assumed that both PfS and VoT in formula 2 follow a multimodal normal distribution given as a mixture of four normal distributions (corresponding to each class), like in Eq. 4. The mean value of time is explicitly given in the study for respective classes as 16.98, 14.02, 26.25, 7.78 (€/h) accordingly. Yet, the penalty sharing needed to be derived from the reported willingness to share. We used PfS distribution reported for four additional passengers for class C1 as a reference (because such rides are admissible and appear in our experiments). Corresponding distributions for other groups were not reported, hence we retrieved them as scaled reference distribution. Applied proportions were following proportions of reported fixed WtS penalty for 2 additional passengers.

Besides the mean values, we need to estimate their variances. Since this was not reported explicitly, we derive it from reported parameters estimates and z-values. To retrieve it, we apply the formula given by Seltman (2012) assuming independent variables (covariance is null) and truncated normal distributions:

$$\begin{aligned} {\text {Var}}\left( \frac{X}{Y} \right) \approx \frac{\mu _x^2}{\mu _y^2} \left( \frac{\sigma _x^2}{\mu _x^2} + \frac{\sigma _y^2}{\mu _y^2} - 2 \frac{\text {Cov}(X, Y)}{\mu _x \mu _y} \right) . \end{aligned}$$

Which led to the variances for PfS of 0.082, 0.071, 0.06 and 0.076 and for VoT of 0.318, 0.201, 5.777 and 13.063 for the respective four classes. We clipped the huge variance for the last class to 1, to prevent a class with practically random behaviour. Such big standard deviation presumably resulted from a small sample in a study.

Eventually, we successfully obtained: class membership probabilities (\(p_{C}\)), mean values of value-of-time and penalty for sharing for four different classes (\(\beta _{s,C}\), \(\beta _{t,C}\)) and class-respective variances (\(\sigma _{s,C}\), \(\sigma _{t,C}\)) as reported presented in Table 1.

Table 1 Ride-pooling behavioural parameterization from Alonso-González et al. (2021)

The total variance is distributed in formula 2 into a random value of \(\beta \) and user-specific and ride-specific Gaussian noises. Since such distinction was not reported empirically, we assumed that standard deviation of the ride-specific noise \(\epsilon _{i, r_k}\) (drawn for every evaluation of \(U^s_{i,r_k}\)) constitutes \(10\%\) of standard deviation of user-specific noise (drawn once per replication) which follows standard normal distribution. Such noise is both consistent with empirical findings and mimics the additional uncertainty of the probit model (Cf. Ortúzar and Willumsen 2011).

Probabilistic shareability networks

To illustrate how the already complex shareability structures of ride-pooling change in the non-deterministic setting, we present them as the graph structures below. In Fig. 3, we present shareability and matching structures resulting from first, first two and all replications of ride-pooling problem. In all networks, nodes represent each of 150 travellers. Two travellers (nodes) are linked in the shareability graphs (Fig. 3a–c) if they can be attractively pooled together, i.e. formula 2. In the matching networks (3d–f), links represent the final realisation of rides, i.e. solution of the bi-partite matching yielding minimal total mileage (see details in Kucharski and Cats (2020)). Pair represents a ride comprising two travellers, a triangle—three, etc. While Fig. 3a–c reveal the algorithmic intermediate step of ExMAS, Fig. 3d–f visualise an actual physical realisation of ride-pooling.

Fig. 3
figure 3

Shareability and matching networks obtained in the first, two and all replications of ride-pooling problem. Link widths (proportional to weights) denote how frequent two travellers were connected, i.e. found travelling together attractive (top row) or actually travelled together in the optimal solution (bottom row)

Already replicating the random process twice (second column in Fig. 3), reveals a high variability of resulting networks. Many links appear only in one replication and travellers either do not find travelling together attractive (in Fig. 3b), or they are not part of the solution minimising the mileage (as on fig. 3e). This may be either due to their behaviour (high VoT and high penalty for sharing (PfS)) or due to their spatio-temporal incompatibility with others. The reason behind such variability lays in the assumption that we cannot predict the class memberships of individual travellers, thus an individual may be assumed to be a ride-pooling enthusiast (C2) in one replication and sharing-averse (C1) in another. While this is not true for individual travellers, this represents the perspective of the service provider who is not aware of the behaviour of a particular traveller.

This trend evolves, however, as we replicate the random process and observe diverse shareability and matching structures. We aggregate results from 1000 replications and introduce the edges’ weights (link widths) as linearly proportional to the number of occurrences among all replications. The weighted structures are presented in Fig. 3c and f. Comparing Fig. 3c and f to Fig. 3a and d, we observe a significant increase in the number of links. In the probabilistic shareability network (Fig. 3c), only one traveller remained isolated after 1000 replications (a quarter was isolated in one replication). The number of links increased nearly tenfold from about 600 to 6000. The matching network from a single replication has 62 links, while after 1000 replications there are over 500 links (Fig. 3f).

Above results confirm that the ride-pooling problem for non-deterministic and heterogeneous travellers yield significantly different results. In the deterministic setting, the links can be well predicted as we can reasonably determine whether two travellers find it attractive to pool a ride. Yet, as soon as we include their varying behaviour more and more links are formed, and the structure becomes on one hand more dense and connected (as we aggregate along the replications) yet also more variable (when we compare two independent replications). It is substantially harder to predict whether two (or more) travellers will be happy to share a ride together when we know they may belong to classes of substantially different behaviours. That is why the aggregated shareability network becomes less interpretable as the number of feasible pairs significantly increases.

On the other hand, some of the links are more frequently observed than the others. This is due to the fact that some trips are naturally shareable due to their spatio-temporal similarity (their origins and destinations are close to each other and the departure times are similar). Such similarity (or dissimilarity) can well compensate behavioural penalty for sharing and value of time. Disentangling the two factors (behaviour and spatio-temporal similarity) is not trivial, as they both drive the attractiveness of pooling.

This initially chaotic representation becomes more rigid when we look at the matching solutions. Mind that in the matching, we optimise for the distance (which is deterministic), rather than user-varying random attractiveness. This stabilises the shareability network and the link weights become more pronounced. Some pairs of travellers are connected with high probability, regardless of the varying behaviour of involved travellers. This demonstrates some regularity and predictability of ride-pooling, despite highly variable traveller behaviour, as we will demonstrate in more detail below.

Expected ride-pooling performance with non-deterministic travellers

In this section, we dive into details how the behavioural heterogeneity impacts the ride-pooling performance. We assess the ride-pooling system’s performance with the four indicators introduced in Sect. 2.4. Calculated first for a consecutive replications (realisations of random variables) and then accumulated over all 1000 replications (one value corresponds to one realisation). Results are presented in Fig. 4. Baseline for comparison is the deterministic ExMAS (with VoT and PfS being weighted average of means introduced in Table 1) which yields mileage reduced by \(30\%\), \(9.8\%\) detour, utility increased by \(4.5\%\) and profitability of 1.097.

Regardless of behavioural variability, pooling always reduces the travel distance (\({\mathcal {D}}\)). \(90\%\) of the observations range between \(24.3\%\) and \(29.7\%\) with the average of \(27.1\%\), which is significantly worse than 30% reductions obtained in the deterministic setting (Fig. 4a). While this is a significant reduction, which should be hugely appreciated from the city’s and environmental’s perspective, it also varies significantly, with observations ranging from \(20\%\) to \(32.5\%\). The right tail seems to be fatter, with occasional outliers greater than \(30\%\) vehicle kilometres reductions. The mean of average utility gain (which is user-subjective and depends on behavioural profile) is approximately \(6.1\%\), which, in turn, is now greater than in the deterministic benchmark. The \(90\%\) two-sided confidence interval spans between \(5.1\%\) and \(7.2\%\) utility gains.

A more physical measure of pooling performance, from the user’s perspective, is trip detour, which does not depend on traveller behaviour. Notably, the acceptable detour (and delay), unlike the fixed time windows used in other studies, depends on VoT and PfS and can vary significantly with the behaviour.

The mean travel time of pooled ride is approximately \(11\%\) longer than for the solo-rides only system (Fig. 4c) and \(90\%\) of the observations fit between \(8.6\%\) and \(13.6\%\), which, again, is worse than in the deterministic benchmark (\(9.8\%\)).

Regardless of the behavioural variability, the ride-pooling remains profitable, with average profitability of 1.082 and the \(90\%\) confidence interval from 1.05 to 1.11 (Fig. 4d). It is a very promising finding. It means that in \(95\%\) of scenarios, profitability gain was greater than \(5\%\) and we did not observe any unprofitable replication, meaning that the platform does not risk when offering a 30% discount for pooled rides.

Fig. 4
figure 4

Distribution of ride-pooling performance resulting from 1000 replications. Dotted lines show the deterministic benchmark

Class dependent ride-pooling performance

Considering the importance of the behavioural preferences of individuals on pooling performance, we analyse further the data with respect to individuals and their classes. First, we show the cumulative distributions of individual indicators observed in 1000 replications. Along with overall distributions, we present the profiles associated with four distinct classes. While the CDF profiles serve for illustrative purposes mainly, we enhance them with tables showing differences between classes in the four consecutive indicators as they reveal intriguing patterns among the introduced behavioural classes.

Here, we aggregate all the rides, irrespective of in which replication they appear. In Fig. 5, we present the cumulative distribution functions of the utility gains and detours. For classes C1 and C3, the utility gains are below average and detours are above average. For the class C4 the opposite is true, while class C2 in on average with the mean performance.

Fig. 5
figure 5

Cumulative distribution functions of utility gains and detours for the trips observed in 1000 replications, stratified into four classes of behaviour

In Table 2, we report the resulting trip detour for the passengers in the four respective behavioural classes classes. Apart from mean and standard deviation, we report values of \(75\hbox {th}\), \(90\hbox {th}\) percentile and \(95\hbox {th}\) percentile. First, we present the values obtained for all the rides and then, for illustrative purposes, only for shared rides (excluding the travellers who were not matched with anyone and travelled alone). While on average, pooled rides are 8% longer than private rides, it varies from 5% for class C1 to 15% for class C4. Notably, 5% of travellers in C4 decided to pool despite at least 61% longer travel time. Those trends are further pronounced when we restrict the analysis to the rides which were successfully pooled (right columns).

Table 2 Statistical properties of trip detour (relative) \({\mathcal {T}}_r\) for travellers of four respective behavioural classes

Similarly to travel time analysis in Table 2, we present data of relative utility gain in ride-hailing system with ride-pooling (relative to system without pooling). Surprisingly, the class which has the longest detours (C4) has also the greatest utility gains (11% on average). For the class unwilling to pool the benefits are only 4% (C1). For 10% of all the travellers ride-pooling led to at least 25% increase in perceived utility (Table 3).

Table 3 Statistical properties of travellers gains (in relative increase of utility) \({\mathcal {U}}\) for travellers of four respective behavioural classes

Asymmetrical impact of perceived sharing discomfort

Importantly, ride-pooling behaviour of an individual traveller may have diverse impact on system performance. While travellers unwilling to share form isolated nodes in the shareability graph, the travellers with high willingness to share become super connected and can share with many other travellers and join multiple groups. Thus, the importance of highly shareable travellers is way more important for the ride-pooling performance than single isolated solo-riders.

We illustrate impact of this this asymmetry with Fig. 6, showing a relation between the distance of the shared ride and its profitability. Colours show the degree of a ride, i.e. number of travellers in the ride. In the deterministic benchmark, the maximal observed degree was five, however in the probabilistic experiment, rides composed of six and seven travellers were observed. This is thanks to the asymmetrical impact of highly shareable travellers on the shareability networks. Which is profound also for the ride-profitability, as the strong positive trend between the ride degree and profitability is observable in Fig. 6.

Fig. 6
figure 6

Relation between shared ride’s distance (x-axis, in kilometres) and profitability (y-axis). Each dot denotes a ride observed in one of 1000 replications and is coloured accordingly to is degree (number of travellers)

Ride-pooling performance vs ride-pooling behaviour

In this section, we provide an insight into the relation between sampled behavioural parameters (VoT, PfS) and performance of the individual travellers (\({\mathcal {U}}_{r}\), \({\mathcal {T}}_{r}\)). In Fig. 7, we scatter the values of VoT and PfS (first and second row respectively) on x-axis against the performance indicators, i.e. \({\mathcal {T}}_r\) and \({\mathcal {U}}_r\) (first and second column) on y-axis. Each dot represent an individual traveller coloured accordingly to her behavioural class.

Fig. 7
figure 7

Scatter plots of travellers’ behavioural parameters (x-axis) against the resulting ride-pooling performance. Each dot denotes a single traveller, coloured accordingly to the class assigned to her in the respective replication. y-axis are scaled to represent relative change, e.g. 1 corresponds to increase of \(100\%\)

We can observe a clear trend of ride-pooling detours decreasing with an increasing value of time (Fig. 7a), despite the longer detours for travellers with low value of time, their benefits of pooling (relative increase in utility) remains high (Fig. 7b), whereas for travellers with high value of time utility gains are significantly lower. The three classes (C1, C2 and C4) have relatively low variance and do not overlap, while for class C3 the variance of VoT is very big. Mind that in our experiment each traveller was assigned to different classes across replications, nonetheless the class membership strongly correlate with the resulting ride-pooling performance, despite the fixed spatiotemporal trip characteristics.

Those clear trends become blurred when we plot against PfS (Fig. 7c and d) where the Gaussian shape is observed yet class memberships are indistinguishable. Both highest detours and utility gains are obtained for intermediate values of PfS, while extreme cases have the lowest benefits. The travellers with a low penalty for sharing (below 1 for a subset of class C3) seem to be often exploited by the system and their utilities are sacrificed to match preferences for travellers with lower preferences for sharing. Yet for those with high penalty the benefits are also limited, since they can hardly be pooled with others. The pooling seems to be best performing with the penalties ranging from 1.1 to 1.2 where it can reach the greatest expected benefits (Fig. 7d).

Scaling with the demand levels

The notion of critical mass is crucial to the ride-pooling and number of studies reported how the ride-pooling performance increases non-linearly with the demand size. In our recent study (Shulika et al. 2024), we analysed demand levels at which critical mass is reached in NYC in the deterministic, homogeneous setting, here we enrich it with results of our approach.

We extend our main experiment with closer analysis from the two sides: sub-critical (99 requests per 30 min) and super-critical (198 requests per 30 min). We report the present previously introduced performance measures for three levels of demand in Fig. 8. Unsurprisingly, we observe the critical mass effect as the performance significantly increases when the demand grows from 100 to 150 requests and then stabilizes when it reaches 200 requests (Fig. 8). Yet more importantly, we observe the trends with variabilities, which are alternative: either narrowing with growing demand (utility in Fig. 8b), widening (profitability in Fig. 8b), or remaining roughly constant (travel times and mileage savings in Fig. 8a, c).

Fig. 8
figure 8

Distributions of four performance indicators with the three levels of demand: subcritical, critical and supercritical

Ramifications of an incorrect parameter estimation

One of the main concerns of the operator is to attract and maintain new clients. In this section, we analyse how incorrect assumptions of the travellers’ heterogeneous behavioural features lead to both customers’ dissatisfaction and decline in the service performance.

The first question is how dissatisfied are customers with the service, if the operator falsely assumes they are homogeneous while, in the reality, they are heterogeneous. To answer, we first compute results under homogeneity assumption and then create 1000 different heterogeneous feature configurations. Our results indicate that approximately 18.5% of all travellers will be dissatisfied. While the individual satisfaction is important, the operator is also concerned whether the ride is rejected or cancelled. If we assume that any co-traveller who finds the shared ride unattractive cancels it, we find that ca. 38% of shared rides, considered attractive under homogeneity assumption, are cancelled.

The second question is how much potential lays with ride-pooling enthusiasts, which is not revealed in the homogeneous scenario. Here, we use our 1000 replications under the heterogeneity assumption. Rides in those results were subject to operator’s goal maximisation, yet all are considered attractive by the clients. We analyse the fraction of those rides which would be filtered out under the homogeneity assumption. We find that approximately 21% of shared rides would be discarded. The most significant percentage is when we consider shared rides comprising at least 4 travellers, then the number increases to 46%.

We aggregate our results in Table 4. In the first column, we present results in the first scenario, i.e. for heterogeneous travellers incorrectly assumed to be homogeneous. In the second column, we analyse the second scenario, when travellers are in truth homogeneous and assumed to be heterogeneous. We propose the following metrics: a fraction of people sharing who are dissatisfied with the service and a fraction of shared rides where any of the co-travellers is dissatisfied with the service. The last metric is also split with regard to the number of co-travellers.

Table 4 Ramifications of incorrect assumptions regarding travellers’ behaviour. In the first column, pooling system based on homogeneity assumption proposed to heterogeneous travellers. In the second column, heterogeneous system assessed with homogeneous preferences

Discussion and conclusions

Most ride-pooling algorithms rely on the fixed constraints (usually time windows). To more objectively assess the attractiveness of a shared ride from the traveller’s perspective, we follow the path started by ExMAS. In this study, we take the matter further. We no longer consider travellers homogeneous. Using recent empirical findings, we assume that while we still do now know preferences of the individuals; we know the general distributions.

To properly include this in our calculations, we introduced random variables representing the value of time and perceived sharing discomfort. Such reformulation allowed to reproduce the four distinct classes of travellers’ behaviour towards ride-pooling, as revealed in Alonso-González et al. (2021). Conducting sufficient number of replications, we obtain reliable estimations of the ride-pooling problem solution, which is no longer deterministic. Apart from mean values, we can estimate the lower and upper bounds of the performance indicators, which substantially differ from the deterministic benchmark.

The additional probabilistic layer introduces a variability at the level of individuals and at the system performance. We find that despite the high variability of individuals, pooling performance remains stable and within reasonably narrow confidence intervals. Notably, probabilistic results are shifted compared to the deterministic benchmark. While the primary objective of minimising mileage is better met in the deterministic scenario, we observe a much higher satisfaction with the service in the heterogeneous setting.

Our study also provides an insight into how ride-pooling performs for travellers of certain preferences. We analyse the impact of the value of time and perceived sharing discomfort on trip detour and satisfaction with the service. We find that people with low value of time can be considered both the most flexible and the most beneficial travellers in the pooling system. However, those with intermediate penalty for sharing not only benefit more than those with high penalties (who does not want to share in general), yet also more than those with low penalties (willing to share with anyone).

Similarly to the deterministic case, we observe the critical mass effect and pooling becomes effective only when the demand levels reach the so-called critical mass. Now we enrich this notion with findings on variability, which may either decrease (in terms of utility gains) or increase (in terms of profitability for the provider) with growing demand levels.

To conclude, we analyse how impactful false assumption of homogeneity is for the system. We find that 18.5% of heterogeneous travellers would be dissatisfied. From the operator’s perspective, we can see that it results in the cancellation rate of 38.3% increasing to 48.3% for the most complex rides. Inversely, we find that, from the optimal solution with the correct heterogeneous assumption, 21% of shared rides would not be found in the homogeneous scenario. For the rides comprising at least four travellers, the number reaches 46%. These results show that the false homogeneity assumption leads to both high cancellation rate and underutilisation of subjectively attractive shared rides.

The proposed method is general and can be easily applied to new cases, both for general demand patterns and different behavioural models. Also, the specific experimental setting used in this study may be reformulated, e.g. when we know the class membership, the demand is not predicted properly or when the behaviour is assumed fixed, but demand is varying (like in Kucharski et al. (2021)). In the future studies, those additional dimensions of variability may be included for even richer assessments.

Finally, the proposed method may be valuable in the pandemic-analyses, when virus-averse behaviour drives the pooling behaviour of individuals. With the methodology that we propose, one may better understand the performance of ride-pooling system under dynamically changing behaviour, which becomes highly variable during disruptive changes. This can be instrumental in the event of the new pandemic. One can now assess the potential of ride-pooling and balance the performance with the spreading contribution. Notably, the COVID-19 risk-awareness was heterogeneous and non-deterministic in the population. We argue that the penalty-for-sharing, with which we experiment in this study, may be interpreted as the virus-awareness and assess with the proposed method.