Skip to main content

Advertisement

Log in

The return on investment for taxi companies transitioning to electric vehicles

A case study in San Francisco

  • Published:
Transportation Aims and scope Submit manuscript

Abstract

We study whether taxi companies can simultaneously save petroleum and money by transitioning to electric vehicles. We propose a process to compute the return on investment of transitioning a taxi corporation’s fleet to electric vehicles. We use Bayesian data analysis to infer the revenue changes associated with the transition. We do not make any assumptions about the vehicles’ mobility patterns; instead, we use a time-series of GPS coordinates of the company’s existing petroleum-based vehicles to derive our conclusions. As a case study, we apply our process to a major taxi corporation, Yellow Cab San Francisco (YCSF). Using current prices, we find that transitioning their fleet to battery electric vehicles and plug-in hybrid electric vehicles is profitable for the company. Furthermore, given that gasoline prices in San Francisco are only 5.4 % higher than the rest of the United States, but electricity prices are 75 % higher; taxi companies with similar practices and mobility patterns in other cities are likely to profit more than YCSF by transitioning to electric vehicles.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Notes

  1. *This equation only applies when studying BEVs with switching stations. The different scenarios we study are given in Case study EV scenarios

References

Download references

Acknowledgments

The authors would like to acknowledge Jason Baek, Augustin Chaintreau, Earl Oliver, Lisa Patel, and Dr. Catherine Rosenberg for their assistance with this research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tommy Carpenter.

Appendices

Appendix A: Brief CLGN background

In this appendix we provide a brief background on CLGNs. We assume knowledge of standard Bayesian networks; an excellent reference text is (Koller and Friedman 2009).

We first present some necessary definitions.

  • The graphical model of a problem is a directed acyclic graph G(VE), where each vertex is a variable and each edge represents a causal effect. The variables may be known (we can directly observe or compute their values) or hidden (we estimate their value because we cannot observe their values directly).

  • The set of parents Pa(X) of a node X in a graphical model is defined as all nodes Y such that \((Y,X) \in E\) and YX.

  • Variables can be either discrete or continuous; discrete variables can only take values from a countable set of values, such as the integers, whereas continuous variables can be any real number.

  • A Bayesian network is a directed acyclic graph that defines the relationship P(X|Pa(X)) between every variable and its parents. The probability of any variable X is independent of all other variables in the network given its parents.

Hybrid models

Standard Bayesian networks contain only discrete variables. A hybrid model contains a mix of both continuous and discrete variables. Several different hybrid models exist; we chose to use conditional linear Gaussian networks (CLGNs). In linear Gaussian models, each variable X is modeled as a linear combination of its parents. CLGNs are extensions of linear Gaussian models that allow for both discrete and continuous variables.

In CLGNs, three types of relationships are defined:

  • A discrete child with only discrete parents

  • A continuous child with only continuous parents

  • A continuous child with a mixture of continuous and discrete parents.

Note that CLGNs do not allow for discrete variables with continuous parents. Other models address this issue, but we do not need these extensions for our application.

Querying conditional linear gaussian networks

To query a variable is to return its Gaussian distribution. To query each of the three types of variables, we use the following formulas.

We consider the simplest case first; a discrete variable with only discrete parents. To express this conditional relationship, we use a discrete conditional probability table (CPT) as in standard Bayesian networks.

Next we consider a continuous variable with only continuous parents. A continuous variable X can take on any real number in the domain of X. Therefore, we cannot have a finite CPT because it would be infinitely large. Instead, we maintain a function of its parents’ values that is used to generate a Gaussian over X. Let X have k parents with means \(pa_1,pa_2,\ldots pa_k\). Under the CLGN model, we specify k + 2 parameters \(\alpha_0, \alpha_1, \ldots, \alpha_k\), and a variance σ2 and compute \(P(X|pa_1,pa_2, \ldots pa_k)\) as

$$ P(X|pa_1,pa_2,\ldots pa_k) = {{\mathcal{N}}}\left( \alpha_0 + \sum_{i=1}^k\alpha_i(t)\cdot pa_i(t), \sigma^2 \right) $$
(22)

That is, the set of αs are linear combination constants; we are calculating a new Gaussian that is a linear combination of other Gaussians (its parents).

Before we examine the third case, we note how σ2 is obtained. There are two widely used versions of CLGN’s: those where the variance of each variable depends on the variances of its parents, and those where the variance is assumed to not depend on its parents (Koller and Friedman 2009). We are using the latter simpler model because we do not have data for the variables in Table 1. This model is not as accurate because it ignores covariance between variables and their parents, but is commonly used when the variances of variables in the network are not known, and still captures most of the meaningful relationships (Koller and Friedman 2009). Because we use this model, we do not present background on CLGNs where the variance of each variable X depends on Pa(X); but note these models rely on the theory of multivariate Gaussian distributions.

Finally, we consider the most complex case, a continuous variable with both continuous and discrete parents. Let X be a continuous random variable with j discrete parents and k continuous parents. Let \({\bf D} = \{ D_1, \ldots , D_j \}\) represent the discrete parents of X. Let \({\bf C} = \{C_1, \ldots , C_k \}\) represent the continuous parents of X; we denote the mean of the ith continuous parent c i . Together, \({\bf D} \bigcup {\bf C} = Pa(X)\). For every combination d chosen from D, we have a (possibly different) vector of k + 2 constants \(\alpha_{d_0}, \alpha_{d_1}, \ldots , \alpha_{d_k}, \sigma^2_d\), and a variance σ d 2 such that

$$ P(X|{\bf D = d}, {\bf C =c}) = {{\mathcal{N}}}\left( \alpha_{d_0} + \sum^k_{i=1} \alpha_{d_i}\cdot c_i ; \sigma^2_d \right) $$
(23)

Again, the set of αs are the linear combination constants.

The problem with this approach is that the set of all combinations of d may be massive; even if each discrete parent was binary, we would still have 2d combinations and would need to store (k + 2)2d constants for every variable. A better idea is to store a function for each variable that calculates these k + 2 constants based on its parents at any time. This also allows us to set the α values based on Xs discrete and continuous parents, if needed. Therefore, we introduce a function \({\phi_X({\bf d}, {\bf c}): {\mathbb{R}}^{j+k} \rightarrow {\mathbb{R}}^{k+1}}\). This function ϕ X takes in all of Pa(X) and generates the α values used in the linear combination. Creating the ϕ functions requires knowledge of the problem; we need encode our knowledge of how variables are dependent upon each other into the network via the ϕ functions. If we were instead storing the constants, then we would need to derive the constants for each variable based on our knowledge of the problem.

Having introduced the ϕ X function, we rewrite Eq. 23 as:

$$ P(X|{\bf D = d}, {\bf C =c}) = {{\mathcal{N}}}\left( \alpha_{d_0} + \sum^k_{i=i} \alpha_i\cdot c_i ; \sigma^2_{{\bf d}} \right) $$
(24)
$$ \{\alpha_0,\ldots, \alpha_k\} = \phi_X({\bf d}, {\bf c}) $$
(25)

Whenever we query a variable, we use Eqs. 24 and 25 to calculate the distribution.

Appendix B: General switching station optimization algorithm

As discussed in Related Literature, placing facilities to maximize the number of miles travelled or maximize the number of intercepted flows does not necessarily maximize revenue. We assume the taxi company’s objective is to maximize their overall revenue, and therefore introduce a new optimization framework based on the discretized locations of the taxis and their charge levels. It is also a variation of the flow based facility location model.

We now provide the details of our approach to computing locations for switching stations. First, we show that the problem is NP-hard, which implies that it is unlikely to be able to be solved by an algorithm that runs in polynomial-time. Then, we formulate it as an integer program, and propose an algorithm for the problem that works on small instances.

We outline a proof that the switching station location problem is NP-hard by a reduction to the facility location problem. The facility location problem is stated as: given a set of clients has some demand from a facility and a cost to build each facility, find the optimal placement of facilities to minimize the cost of the facilities and the cost of serving the clients. The switching station location problem can be reduced to the facility location problem by treating the taxis as clients whose demand varies over time and the switching stations as the facilities that can meet that demand. Therefore, finding the set of optimal switching station locations is also NP-hard.

We now formally describe the switching station location problem. First, we introduce the necessary notation. Let L be the set of locations where a switching station can be placed. We denote a taxi by x and the set of all taxis by X. We assume knowledge of a cost function cost(l) for each location \(l \in L\) that is the price of placing a station at l. We use Loc t (x) to be the location of x at time t and fare t (x) is True when x has a passenger and False when it does not. We use a binary variable y(l) to indicate if a location has been selected for a switching station. The charge level of \(x \in X\) at timestep t k is denoted by CL(xt k ). Let o t (xr) be the opportunity cost of an EV with charge level r at time t. For a taxi x, o t (xr) should be zero when x’s battery is sufficiently charged; however, as its charge level drops, there is some opportunity cost because the driver will not be able to complete trips over some length, and thus may lose revenue because some passengers cannot be transported to their destination. In our analysis, we define o t (xr) to be the sum of taxi x’s fares for the remainder of its shift, once it cannot complete a trip because its charge level r is too low. That is, if x cannot complete a trip at time t, then its opportunity cost is the fares for the trips it would have completed from time t until the end of its shift. Finally, we use tau to be the battery level at which a taxi will always swap its battery if is at the same location as a switching station.

The objective of our optimization problem is stated as given the set of taxis and their temporal mobility patterns, find the optimal location(s) for switching stations such that the taxi company’s profits are maximized. Our mathematical formulation of the switching station location problem is as follows:

$$ \hbox{min} \sum_{l \in L}{\rm cost}(l) \cdot y(l) + \sum_{x \in T} \sum_t {\rm o}_t(x, {\rm c}_t(x)) $$
(26)

subject to:

$$ \begin{aligned} y(l) &=\{ 0,1\} \forall l \in L \\CL(x,t_{k} ) &=\left\{ {\begin{array}{ll}{Full} & {{\text{if}}\,{{\text y}}({\text{Loc}}_{t}(x)) = 1} \\ {} & {{\text{and }}CL(x,t_{{k - 1}} ) < \tau } \\ {} & {{\text{and fare}}_{t} (x) = {\text{false}}} \\ {CL(x,t_{{k - 1}} ) - u(t_{{k - 1}} ,t_{k} ),} & {\text{otherwise}} \\\end{array} } \right.\\ u(t_{{k- 1}} ,t_{k} ) &= {\text{energy used from }}t_{{k - 1}} ,t_{k}{\text{ }} \\ \end{aligned} $$

when L does not contain too many locations (for example, as in our case study below), we can solve the switching station location problem optimally using brute force. That is, we find the value of Eq. 26 for all possible locations of \(1, 2, \ldots, k\) switching stations. The value of k is found by determining the number of switching stations sufficient so that no revenue is lost due to opportunity costs (i.e., we have \(\sum_{x \in T} \sum_t {\rm o}_t(x, {\rm charge}_t(x)) = 0\)). At this point, Eq. 26 is monotonically increasing when more switching stations are added, so we can safely conclude that Eq. 26 is minimized with k or fewer switching stations.

This brute force approach may not be feasible over larger areas with more locations. In this case, it is possible to use heuristic algorithms to find a solution, though these heuristics cannot guarantee the optimality of their solution. Algorithms such as simulated annealing, tabu search, and hill climbing are general optimization methods, and could be used to find approximate solutions to the switching station location problem (Glover and Laguna 1997; Kirkpatrick et al. 1983; Russell and Norvig 2003).

Our formulation of the switching station location problem relies on time series locations of the vehicles that will use the switching stations. Ideally, this location data is collected from multiple vehicles over multiple weeks; however, this data may not be obtainable in some situations. In this case, it is still possible to optimize the placement of switching stations using stochastic facility location algorithms [e.g., (Owen and Daskin 1998; Snyder 2004)]. Such algorithms are designed to optimize facility locations when there is a high amount of uncertainty in the input. These algorithms take a probability distribution of the amount of time vehicles spend at given locations as input. This distribution could be estimated from, e.g., road congestion statistics or logs of passenger pickups and drop-offs.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Carpenter, T., Curtis, A.R. & Keshav, S. The return on investment for taxi companies transitioning to electric vehicles. Transportation 41, 785–818 (2014). https://doi.org/10.1007/s11116-013-9486-1

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11116-013-9486-1

Keywords

Navigation