1 Introduction

In many places including many cities, demand for water is growing. Given the time it takes to obtain permission, fund and build water infrastructure, water planners are often tasked to establish plans long in advance for how they plan to address the future water supply–demand balance. In England this time period is generally 25 years.

Choosing an appropriate schedule of infrastructure upgrades and investments is a classic water resource systems analysis problem and several water ‘capacity expansion’ models are described in the literature (Olaoghaire and Himmelblau 1974; Loucks et al. 1981; Mays 2005; Loucks and Van Beek 2006). Capacity expansion planning includes not only choosing which schemes to implement, but also their timing and capacities (sizing). A typical objective function is the minimisation of total discounted financial costs. Financial costs include ‘expansion, shortage, maintenance, idle capacity and inventory’ costs (Luss 1982). Expansion polices are considered over time for a defined planning horizon (typically 20 to 100 years for civil infrastructure).

These problems exhibit economies of scale: the cost per unit of capacity decreases with expansion size. Due to the strictly concave shape of the relevant cost functions there is an incentive to build large facilities now rather than a sequence of smaller ones (Revelle 1999). However if expenditures are pushed into the future their present worth value decreases. A trade-off must therefore be found between economies of scale and the cost of installing capacity before it is needed (Manne 1961, 1967).

Capacity expansion models can be deterministic (as in this paper), assuming one version of the future or stochastic, explicitly considering future uncertainty of supply and/or demand. Different optimisation algorithms have been used including mathematical programming (Labadie et al. 1986; Kim and Hopkins 1995; Barros et al. 2008; Hsu et al. 2008), dynamic programming (Bellman 1957; Dandy et al. 1984; Braga et al. 1985; Mahmoud 2006; Luo et al. 2007; Hsu et al. 2008; Chou et al. 2013) and heuristic optimisation methods (Savic and Walters 1997; Deb 2001). Linear programming is popular because of its assured convergence to a globally optimum solution but is limited to using only convex and linear objective functions, linear constraints and continuous decision variables (Mahmoud 2006). This can be overcome by using mixed integer programming (MIP) where some variables take integer values. MIP conveniently allows representing yes/no decisions at each time step (Mays 2005) and has been used by Tu et al. (2003) and Labadie (2004).

The proposed model poses the capacity expansion problem in a form easily tractable by a mathematical program by boiling the problem down to a supply–demand balance per water resource zone. Such ‘water resource zones’ (WRZ) aggregate interconnected supply areas where residents face the same likelihood of supply shortfalls. Four contributions to capacity expansion optimisation modelling are proposed. User-defined annual water saving profiles for demand management (water conservation) schemes are enabled. Multiple water demand scenarios are considered simultaneously to ensure the supply–demand balance is preserved across high demand conditions and that variable costs are accurately assessed. A wide range of supplementary constraints are formulated to consider the interdependencies between schemes (pre-requisite, mutual exclusivity, etc.). Finally, a two-step optimisation process is introduced to prevent the infeasibilities that inevitably appear in real applications. This model was developed and used for the ‘Water Resources in the South East’ project led by the Environment Agency of England and Wales in collaboration with Ofwat, Defra and six English water companies that serve 17.6 million people.

The paper is structured as follows: Section 2 describes the model formulation, Section 3 describes the water planning context in England and Wales and the model’s application to a regional system, Section 4 presents model results. Discussion of model results and contributions are presented in section 5. Model limitations and benefits are discussed in Section 6 and followed by conclusions.

2 Model Formulation

2.1 Basic Model Formulation

We describe a minimum cost capacity expansion optimisation model formulation for large water supply systems which can be solved by off-the-shelf mixed integer linear program (MILP) solvers. The model identifies the least discounted economic cost annual schedule of capacity expansions to maintain the system’s supply–demand balance over a T-year planning horizon.

The model makes two kinds of decisions: the extent of annual use of supply and demand management options (for both existing and optional schemes) and annual investment decisions on optional schemes. The model has a network structure: water demand nodes (Water Resource Zones, WRZs) are connected to supply and demand management options within their WRZ, or to other demand nodes (to represent transfers). To represent supply schemes that can be shared between more than one WRZ, ‘source-junction’ nodes are used to connect supply options to the demand nodes. The model is formulated as a mixed integer linear programming (MILP) optimisation problem. Appendix A lists the model nomenclature. A single objective is used: minimisation of discounted capital (CAPEX), fixed (FOPEX) and variable (VOPEX) costs (Eq. 1). One-off financial costs are also included. These incur just once over the planning period, when an option is selected.

$$ min{\displaystyle \sum_{t=1}^{\mathrm{tmax}}\frac{1}{{\left(1+ dr\right)}^{t- 1}}\left[\begin{array}{l}{\displaystyle \sum_{i\in SUPPLYN}\left( afcc{s}_i+ sfi{x}_i\right)\times A{S}_{i,t}+\left( carb{s}_{i,t}+{ off}_i\right)\times F{S}_{i,t}+ vopex{s}_{i,t}}\\ {}+{\displaystyle \sum_{\left(i,j\right)\in \mathrm{CON}}\left( afcc{l}_{i,j}+ lfi{x}_{i,j}\right)\times A{L}_{i,j,t}+\left( carb{l}_{i,j,t}\right)\times F{L}_{i,j,t}+ vopex{l}_{i,j,t}}\end{array}\right]} $$
(1)

In the equation above dr is the discount rate, tmax the final year of the planning period, CON is a set which defines the connections among nodes i, j in the network, SUPPLYN is the set of all supply nodes (both existing sources EXDO and optional schemes OPTSOU). A discount factor equal to 1/(1 + dr) t assumes all costs are incurred at the year-end. afccs i , afccl i,j are respectively the sum of annualised CAPEX (capital financial, capital environmental and social costs, see Appendix B) for optional schemes (OPTSOU) and companies’ proposed optional links (LFT), while sfix i and lfix i,j are respectively the fixed annual costs (operating, carbon, environmental and social) for optional nodes and links. vopexs i,t and vopexl i,j,t are variable costs (operating, carbon, environmental and social) incurred at nodes and links respectively. Finally carbs i,t and carbl i,j,t are user-defined annual profiles of capital carbon costs for optional sources and links respectively, while off i are one-off costs applied to DM options. One-off costs include fixed financial, social, environmental and carbon emission related costs.

Environmental costs reflect the environmental impact that an option might have, whereas social costs are measured as a loss of consumer surplus (UKWIR 2002b). Carbon costs are related to the option emissions of carbon dioxide. To estimate carbon costs of the options projected emissions, water companies use the latest government guidance on the cost of carbon (Climate Change Economics and Dpt. of Energy and Climate Change 2009).

S i,t and Q i,j,t are non negative decision variables defining the extent of annual use of supply for sources and links respectively. AS i,t and AL i,j,t are binary decision variables that activate new supply options i and new links (i, j) when they change from 0 to 1 during any particular year. FS i,t and FL i,j,t are binary variables equal to one at the first year of activation of an optional node (OPTSOU) or optional link (LFT) respectively and equal to zero otherwise. FS i,t and FL i,j,t are defined by equations below:

$$ \begin{array}{cc}\hfill F{S}_{i,t}=\left(A{S}_{i,t}-A{S}_{i,t-1}\right)\hfill & \hfill \forall i\in OPTSOU,t\in T\hfill \end{array} $$
(2)
$$ \begin{array}{cc}\hfill F{L}_{i,j,t}=\left(A{L}_{i,j,t}-A{L}_{i,j,t-1}\right)\hfill & \hfill \forall \left(i,j\right)\in LFT,t\in T\hfill \end{array} $$
(3)

In the equation above T is the set of time periods (years). All decision variables are in volume/time except for binary variables which represent yes/no activation decisions. In this paper decision variables (binary and continuous) and set declarations are in upper case letters; parameters (input data) are in lower case.

Variable costs vopexs i,t and vopexl i,j,t are calculated multiplying unit variable costs (svar i,t for sources and lvar i,j,t for links) by the continuous usage decision variables S i,t and Q i,j,t . Capital costs can be included as a custom time-series of cash outflows or using simple cash flow methods (Appendix B). All costs are in real and not nominal terms to avoid having to forecast inflation (Belli et al. 2001).

The objective function equation is subject to a ‘mass balance’ constraint at all nodes of the network (Eq. 4) and to capacity constraints at nodes and links (Eqs. 5 to 8):

$$ \begin{array}{cc}\hfill {S_i}_{,t}\left|{}_{i\in \mathrm{SUPPLYN}}-s{r_i}_{,t}\right|{}_{i\in \mathrm{EXDO}}+{\displaystyle \sum_{j:\left(j,i\right)\in CON}{Q_{j,i}}_{,t}}-{\displaystyle \sum_{j:\left(i,j\right)\in CON}{Q_{i,j}}_{,t}}=d{i_i}_{,t}\left|{}_{i\in DEM}+ th{r_i}_{,t}\;\right|{}_{i\in DEM}\hfill & \hfill \forall i\in I,t\in T\hfill \end{array} $$
(4)
$$ \begin{array}{cc}\hfill \mathrm{smi}{n}_{i,t}\times A{S}_{i,t}\le {S}_{i,t}\le sma{x}_{i,t}\times A{S}_{i,t}\hfill & \hfill \forall i\in OPTSOU,t\in T\hfill \end{array} $$
(5)
$$ \begin{array}{cc}\hfill \mathrm{smi}{n}_{i,t}\le {S}_{i,t}\le sma{x}_{i,t}\hfill & \hfill \forall i\in EXDO,t\in T\hfill \end{array} $$
(6)
$$ \begin{array}{cc}\hfill \mathrm{lmi}{n}_{i,j,t}\times A{L}_{i,j,t}\le {Q}_{i,j,t}\le lma{x}_{i,j,t}\times A{L}_{i,j,t}\hfill & \hfill \forall \left(i,j\right)\in LFT,t\in T\hfill \end{array} $$
(7)
$$ \begin{array}{cc}\hfill \mathrm{lmi}{n}_{i,j,t}\le {Q}_{i,j,t}\le lma{x}_{i,j,t}\hfill & \hfill \forall \left(i,j\right)\in LEX,t\in T\hfill \end{array} $$
(8)

In the equations above DEM is the set of demand nodes (WRZs), LEX is the set of all existing links. Σj Q j,i,t is the sum of the flows entering node i during year t, Σj Q i,j,t is the sum of the flows leaving node i during year t. smin i,t , smax i,t are respectively the minimum and maximum capacities for schemes (EXDO and OPTSOU), while lmin i,j,t and lmax i,j,t are respectively the minimum and maximum capacities for links. Binary variables AS i,t , AL i,j,t are applied only to optional sources and links.

In Eq. 4 sr i,t is the outage while di i,t and thr i,t are respectively the distribution input (DI) and target headroom (THR). Terms di i,t and thr i,t are applied at WRZ level. Outage is defined as a temporary short-term loss in DO (EA et al. 2012a) while the DI is the amount of treated water entering the distribution system. THR is a margin of spare resource between forecast demand and water available for use (WAFU), used to take account of uncertainty around supply and demand estimates (UKWIR 2002c). Process losses and reduction in the firm yield (referred to in the UK as ‘deployable output’ or DO) might be added in the equation above to increase the WRZ DI when either one are greater than zero. Process losses are defined (EA et al. 2012b) as the summation of both raw and treatment work losses and operational use. Reduction of deployable output may include factors such as sustainability reduction (i.e. reduction in DO required by the EA to meet statutory and/or environmental requirements) and impact of climate change (EA et al. 2012a). Sustainability reductions can be either a fixed value or a proportion of annual use.

2.2 Multiple Demand Scenario Formulation

Supplies must meet demand in relatively infrequent periods of high demand when costs are typically higher. This implies the above model will over-estimate costs under normal conditions. To prevent this we introduce multiple simultaneous water demand scenarios in a single model run. For example four demand scenarios (identified by set SCEN) are considered in our application: dry year annual average, dry year critical period, minimum deployable output and normal year annual average (see section 3.2). Supplies must meet the most stringent scenario but variable costs are considered and weighted according to how frequently each demand ‘scenario’ is expected. Solving n demand scenarios requires n sets of continuous decision variables (in place of one set for S i,t and Q i,j,t ) to describe how much each scheme (supply, demand management or transfer) is used annually under each scenario.

2.2.1 Mass Balance and Capacity Constraints Under Multiple Water Demand Scenarios

We introduce two sets of mass balance equations: the first (Eq. 9) to make sure that the selected infrastructure is able to meet the most stringent scenario (demand is equal to DI plus THR), the second (Eq. 10) to evaluate variable costs based on schemes actual utilisation rather than as if peak demands were maintained all year long (THR is set equal to zero). Each mass balance equation has a different set of annual use variables (S i,t,scen , Q i,j,t,scen in Eq. 9 and Sut i,t,scen , Qut i,j,t,scen in Eq. 10). Setting THR to zero means that the required annual use variables (Qut i,j,t,scen , Sut i,t,scen ) will be lower thus incurring less variable costs (VOPEX). This ensures estimated VOPEX will reflect how much schemes are likely to be used over the 25 year period. This second set of use variables (Sut i,t,scen and Qut i,j,t,scen ) is the one that appears in the weighted VOPEX terms included in the objective function equation.

$$ \begin{array}{cc}\hfill \begin{array}{l}{S_i}_{,t, scen}\left|{}_{i\in \mathrm{SUPPLYN}}-s{r_i}_{,t,\mathrm{scen}}\right|{}_{i\in EXDO}+{\displaystyle \sum_{j:\left(j,i\right)\in CON}{Q_{j,i}}_{,t, scen}}-{\displaystyle \sum_{j:\left(i,j\right)\in CON}{Q_{i,j}}_{,t, scen}}\hfill \\ {}=d{i_i}_{,t, scen}\left|{}_{i\in DEM}+\mathrm{th}{r_i}_{,t, scen}\;\right|{}_{i\in DEM}\hfill \end{array}\hfill & \hfill \forall i\in I,t\in T, scen\in SCEN\hfill \end{array} $$
(9)
$$ \begin{array}{cc}\hfill \begin{array}{l} Su{t_i}_{,t, scen}\left|{}_{i\in SUPPLYN}-s{r_i}_{,t, scen}\right|{}_{i\in EXDO}+{\displaystyle \sum_{j:\left(j,i\right)\in CON} Qu{t_{j,i}}_{,t, scen}}\hfill \\ {}-{\displaystyle \sum_{j:\left(i,j\right)\in CON} Qu{t_{i,j}}_{,t, scen}}=d{i_i}_{,t, scen}\left|{}_{i\in DEM}+ th{r_i}_{,t, scen}\;\right|{}_{i\in DEM}\hfill \end{array}\hfill & \hfill \forall i\in I,t\in T, scen\in SCEN\hfill \end{array} $$
(10)

Equations above are applied to each node of the network (supply nodes, junctions, demand nodes and supply junctions) and also apply to multiple demand scenarios scen and each year t of the planning horizon. SCEN is the set of all demand scenarios.

Annual use variables Q i,j,t,scen , S i,t,scen , Qut i,j,t,scen and Sut i,t,scen also appear in capacity constraints Eqs. 5 to 8 (in place of variables S i,t and Q i,j,t ) where the minimum and maximum capacities (smin i,t,scen , smax i,t,scen , lmin i,j,t,scen , lmax i,j,t,scen ) are now defined over set SCEN.

2.2.2 Objective Function

The objective function is identical to Eq. 1 with the difference that now variable costs are weighted over four scenarios (set SCEN). We use one set of CAPEX and FOPEX costs and four sets of VOPEX costs. VOPEX costs are evaluated using Qut i,j,t,scen and Sut i,t,scen variables. If scen are the elements of set SCEN and tscen scen is the weight for each demand scenario (e.g. in number of weeks per year), then the weighted average is expressed by Eq. 11 (for nodes) and Eq. 12 (for links):

$$ \begin{array}{cc}\hfill \mathrm{vopex}{s}_{i,t}=\frac{{\displaystyle \sum_{scen\in SCEN} tsce{n}_{scen}\times sva{r}_i\times Su{t}_{i,t, scen}}}{{\displaystyle \sum_{scen\in SCEN} tsce{n}_{scen}}}\hfill & \hfill \forall i\in SUPPLYN,t\in T\hfill \end{array} $$
(11)
$$ \begin{array}{cc}\hfill \mathrm{vopex}{l}_{i,j,t}=\frac{{\displaystyle \sum_{scen\in SCEN} tsce{n}_{scen}\times lva{r}_{i,j}\times Qu{t}_{i,j,t, scen}}}{{\displaystyle \sum_{scen\in SCEN} tsce{n}_{scen}}}\hfill & \hfill \forall \left(i,j\right)\in LINKS,t\in T\hfill \end{array} $$
(12)

where t is the annual time index, LINKS is the set of existing (LEX) and optional (LFT) transfers, svar i , lvar i,j and vopexs i,t , vopexl i,j,t are respectively the variable costs in pence/m3 and the weighted average variable costs incurred at nodes and links. Scenario durations (tscen scen ) are weights estimated based on expected occurrence with no dimension. Unit variable costs on existing links (LEX) can be used to encourage the model to use WRZs own existing deployable output (local sources) before importing from other WRZs.

2.3 Interdependency Constraints

Other constraints have been added to describe interdependences between options in the water supply network representing practical, technological, social and/or environmental policies and restrictions (see Appendix C). These include: ‘mutually exclusive’ constraints which apply when only one supply or link from a set of alternatives should be implemented, ‘prerequisite AND’ constraints that assure a link/source form group A is implemented only if all links/sources from group B have been activated, ‘prerequisite OR’ constraints that assure a link/source form group A is implemented only if at least one option (link or source) from a set B of prerequisite schemes has been activated, ‘prerequisite with LAG TIME’ constraints that allow a scheme to be activated only a certain number of years (η) after the activation of another prerequisite scheme (node or link), ‘dependency’ constraints that force a node or link to be activated at the same time as another group of optional schemes are implemented. ‘Capacity connectivity constraints’ force the total supply (Ml/d) from a group of schemes in set A to be lower than the total supply from selected schemes in set B, plus an allowance which may vary depending on the scenario (set SCEN).

‘Ratchet’ constraints can be used to impose a monotonically increasing usage of schemes to avoid constructing schemes that are used only for few years or regularly used below their maximum capacity. Ratchet constraints are a good example of a policy or ‘political’ constraint. ‘Consistent use constraints’ (also referred to as ‘capacity constraints’, see Appendix B) are used to discourage the discontinuous use of future supply sources by forcing the binary variables of optional schemes and links to be one after their first year of activation). Finally, start date constraints limit the first use of any link or supply scheme and represent the time it takes to construct and receive benefits from the options.

2.4 Demand Management Option Constraints

Demand management options (set DM) have user-defined annual water saving profiles which start from their first year of activation. Capacity constraints on DM options can be implemented following to the two methods provided below: fixed start year or flexible start year.

2.4.1 Fixed Start Year for DM Options

In some cases it may be desirable to only allow activation of a DM option during a specific year. To ensure DM options are activated at their first available year (dm_avl i ) the following equation is used:

$$ \begin{array}{cc}\hfill {\displaystyle \sum_t\left(A{S}_{i,t}-A{S}_{i,t-1}\right)}\times t\le dm\_ av{l}_i- first\_ yr+1\hfill & \hfill \forall i\in DM,t\in T\hfill \end{array} $$
(13)

where first_yr is the first year of the planning horizon, dm_avl i is the first year of availability of each DM option.

Capacity constraints for DM options whose allowed year of activation is fixed are set to a user-defined profile using the following equations:

$$ \forall i \in dm,t,\theta \in \left[ {\theta + dm\_av{l_i} - first\_yr = t} \right] $$
(14)

where sav i,θ,scen is a table of input data containing the saving profile for each DM option. Θ is a generic set containing as many elements θ as the number of years included in the planning horizon.

2.4.2 Flexible First Year of Availability

When the first year of DM option activation is a decision to be optimised, then DM capacity constraints are written as:

$$ \begin{array}{cc}\hfill {S}_{i,t,\mathrm{scen}}={\displaystyle \sum_{\theta =1}^{\theta max} sa{v}_{i,\theta, scen}\times F{S}_{i,t\hbox{-} \theta }}\hfill & \hfill \forall i\in DM,t\in T, scen\in SCEN\hfill \end{array} $$
(15)

θmax is the last element of set Θ. Equation 15 ensures annual ‘supply’ from DM options is accounted from their first year of activation.

Unlike supply schemes that can be used in any given year less than maximum capacity (Eqs. 5 to 8), DM scheme contributions are fixed to the scheme’s forecasted ability to reduce demand (sav i,θ,scen , see Eqs. 14 and 15). This means that, for selected DM options, the extent of annual use variable S i,t,scen has to assume the same value to the one as variable Sut i,t,scen . This is ensured through the following constraint:

$$ \begin{array}{cc}\hfill {S}_{i,t,\mathrm{scen}}= Su{t}_{i,t, scen}\hfill & \hfill \forall i\in DM,t\in T, scen\in SCEN\hfill \end{array} $$
(16)

Equation 13 ensures that DM options are not activated before their first available year (dm_aval i ), while equation 14 forces any savings (variable S i,t,scen ) before year dm_aval i tobe zero. This prevents DM schemes from activating before they are available.

2.5 Dealing with Model Infeasibilities

Mathematical infeasibilities can arise when the sum of all available supply and DM schemes cannot meet the demand of one or more WRZs in one or more years. Two strategies are available to deal with network infeasibilities: 1) infeasibility flows, where strongly penalised flows can be injected at WRZ demand nodes during any year to allow feasibility, and 2) demand reductions, where demands are reduced just enough to ensure all WRZ supply–demand balances are feasible in each year. Although method 1 is frequently adopted in network optimisation, defining high unit (‘penalty’) costs for the infeasibility flows can make the model less sensitive to the real scheme costs and lead to sub-optimal results. We therefore adopted method 2 in this project.

2.5.1 Infeasibility Flow Procedure

If this method is implemented the mass balance constraint previously defined by Eq. 9 is modified into Eq. 17. The same modifications must be applied to mass balance Eq. 10.

$$ \begin{array}{cc}\hfill \begin{array}{l}\mathrm{INFEAS}\_{\mathrm{FLOW}}_{i,t, scen}\left|{}_{i\in DEM}+{S_i}_{,t, scen}\right|{}_{i\in SUPPLYN}\hfill \\ {}-s{r_i}_{,t, scen}\Big|{}_{i\in EXDO}+{\displaystyle \sum_{j:\left(j,i\right)\in CON}{Q_{j,i}}_{,t, scen}}-{\displaystyle \sum_{j:\left(i,j\right)\in CON}{Q_{i,j}}_{,t, scen}}\hfill \\ {}=d{i_i}_{,t, scen}\left|{}_{i\in DEM}+{{\mathrm{thr}}_i}_{,t, scen}\;\right|{}_{i\in DEM}\hfill \end{array}\hfill & \hfill \forall i\in I,t\in T, scen\in SCEN\hfill \end{array} $$
(17)

In the equation above INFEAS_FLOW i,t,scen is a continuous positive variable representing flows (fictitious supply in volume per year) that can be injected at the WRZ level (set DEM) during each year t and scenario scen. These flows have high (‘penalty’) costs to ensure they are used only in case the network is infeasible (the existing and proposed supply-side and demand-side options are insufficient to meet future demands).

The objective function Eq. 1 now becomes:

$$ \begin{array}{l} min\kern0.24em {\displaystyle \sum_{t=1}^{tmax}\left\{\frac{1}{{\left(1+ dr\right)}^{t-1}}\left[\begin{array}{l}{\displaystyle \sum_i\left( afcc{s}_i+ sfi{x}_i\right)\times A{S}_{i,t}+\left( carb{s}_{i,t}+{\mathrm{off}}_i\right)\times F{S}_{i,t}+ vopex{s}_{i,t}}\\ {}+{\displaystyle \sum_{i,j}\left( afcc{l}_{i,j}+ lfi{x}_{i,j}\right)\times A{L}_{i,j,t}+\left( carb{l}_{i,j,t}\right)\times F{L}_{i,j,t}+ vopex{l}_{i,j,t}}\end{array}\right]\right.}\\ {}\left.\kern2.4em +\left[ penalty\_ cost\times {\displaystyle \sum_{scen\in SCEN}{\displaystyle \sum_{i\in DEM} INFEAS\_ FLO{W}_{i,t, scen}}}\right]\right\}\\ {}\kern0.48em \end{array} $$
(18)

A limitation of this method is the definition of the unit cost penalty_cost. If this is too low, infeasible flows may be chosen over available optional schemes (OPTSOU), if it is too high the objective function value (Eq. 18) may become so large that the model becomes less sensitive to the real scheme costs and therefore produces suboptimal results. This issue led us to develop and use the procedure described below.

2.5.2 Demand Reduction Procedure

The demand reduction procedure deals with infeasibilities in the WRSE network through a 2-step process: 1. reduce demands in WRZs in years with an infeasible balance just enough to make those balances feasible, 2. run the feasible model with reduced demands.

Step 1: Solve the following problem:

$$ \max {\displaystyle \sum_{scen\in SCEN}{\displaystyle \sum_{i\in DEM}{\displaystyle \sum_{t\in T}{\alpha}_{i,t, scen}}}} $$
(19)

Subject to all constraint equations listed in the previous sections (and eventually also in Appendix C) with the additional condition that 0 ≤ α i,t,scen ≤1. Mass balance Eqs. 9 and 10 are now written as:

$$ \begin{array}{cc}\hfill \begin{array}{l}{S_i}_{,t, scen}\left|{}_{i\in SUPPLYN}-{{\mathrm{sr}}_i}_{,t, scen}\right|{}_{i\in EXDO}\hfill \\ {}+{\displaystyle \sum_{j:\left(j,i\right)\in CON}{Q_{j,i}}_{,t, scen}}-{\displaystyle \sum_{j:\left(i,j\right)\in CON}{Q_{i,j}}_{,t, scen}}\hfill \\ {}={\alpha}_{i,t, scen}\Big|{}_{i\in DEM}\times \left(d{i_i}_{,t, scen}\left|{}_{i\in DEM}+ th{r_i}_{,t, scen}\;\right|{}_{i\in DEM}\;\right)\hfill \end{array}\hfill & \hfill \forall i\in I,t\in T, scen\in SCEN\hfill \end{array} $$
(20)
$$ \begin{array}{cc}\hfill \begin{array}{l}{S_i}_{,t, scen}\left|{}_{i\in SUPPLYN}-s{r_i}_{,t, scen}\right|{}_{i\in EXDO}+{\displaystyle \sum_{j:\left(j,i\right)\in CON}{Q_{j,i}}_{,t, scen}}\hfill \\ {}-{\displaystyle \sum_{j:\left(i,j\right)\in CON}{Q_{i,j}}_{,t, scen}}={\alpha}_{i,t, scen}\left|{}_{i\in DEM}\times d{i_i}_{,t, scen}\right|{}_{i\in DEM}\;\hfill \end{array}\hfill & \hfill \forall i\in I,t\in T, scen\in SCEN\hfill \end{array} $$
(21)

Step 2: Fix α i,t,scen values obtained from step 1 and solve the model using Eq. 1 as objective function, Eqs. 20 and 21 for mass balances, all other constraints described in previous paragraphs and constraints in Appendix C if required. In the equations above, α i,t,scen is a continuous positive variable defined for all demand scenarios.

3 Application: Planning South East England’s Water Supply

3.1 Water Planning in England and Wales

Since 1989 England is served by 22 privatised water companies (utilities) which operate as regulated natural monopolies. Their abstraction from the environment is regulated by the Environment Agency (EA) and their consumer prices and investments in water supply solutions are regulated by Ofwat (the water services regulation authority). Water companies must demonstrate to regulators their plans to meet future demand are best value and maintain environmental standards or explain and justify other considerations that move away from these criteria. Plans must follow regulatory guidance (EA et al. 2012a, b) and typically use the ‘Economics of Balancing Supply and Demand’ (EBSD) framework to generate socially efficient least economic cost water resource supply plans (UKWIR 2002a, b).

The water companies operating in South East England have a history of working together as the Water Resources in the South East (WRSE) Group (von Lany et al. 2013); a Group which includes the Environment Agency, Ofwat, the Consumer Council for Water and Defra. South-East England is the driest part of the UK with the largest and fastest growing population.

The group’s aim is to explore new opportunities for water companies to share resources (new bulk supply agreements) and over the past few years has used capacity expansion models following the EBSD framework (EA 2005, 2010; Critchley and Marshallsay 2013). The idea behind the WRSE model is to enable water companies to jointly plan a regional water supply solution that aligns with the mix and schedule of least-cost schemes identified by the regional optimisation modelling (EA 2010). Regional modelling carried out in 1990 led to the implementation, by 2005, of four new bulk transfers between companies. In 2010 the WRSE group identified that a savings of £M 501 could be possible in the South East Region by 2035 if more sharing of existing and new resources was adopted by the six companies, rather than each company adopting its own preferred plan (EA 2010). The WRSE group recently carried out a new phase of regional modelling using the model formulation published here to identify options (new supplies, demand management and bulk transfers) to be included in the next water companies’ WRMPs due for regulatory approval by year 2014 (Critchley and Marshallsay 2013).

The EBSD planning framework outlined below provides the structure within which the proposed model formulation operates. The water supply planning process in England and Wales considers a 25 to 30 year planning period and uses a Water Resource Zone (WRZ) spatial scale, i.e., interconnected water supply zones with equal likelihood of suffering supply shortfalls (UKWIR 2002b). Future demands are estimated on an annual basis.

The firm yield (‘deployable output’ or DO) of existing and potential future sources of supply are evaluated using detailed water resource simulation models that consider hydrology, system operating rules, minimum environmental flows and other relevant hydrological, engineering, and institutional factors. Due to the uncertainty of current and future supply and demand forecasts, the EBSD method uses a safety factor called ‘headroom’ which aggregates all sources of supply–demand uncertainty into an annual estimate (UKWIR 2002c). Headroom is the difference between water available for use (WAFU) and estimated demand. Companies set ‘target headroom’ which they estimate to allow themselves to guarantee service reliability levels to consumers (‘levels of service’).

Analysis of annual supply and demand over the planning period identifies supply–demand imbalances (see Fig. 1) which water company planners must rectify in their plans. To do this they 1) identify all feasible options, 2) determine their costs (or benefits) which split into capital (including initial costs, maintenance and replacement), operating (fixed and variable), environmental and social (fixed and variable) and carbon emission costs (fixed and variable) (EA et al. 2012a), then 3) find the least-cost programme (timing, capacity, choice of schemes). Note that for practical reasons companies are asked to cost and size discrete schemes rather than use continuous cost curves. The model proposed in this publication accomplishes step 3. Inclusion of carbon, social and environmental economic costs means the model can be considered an economic-engineering tool (Lund et al. 2006) rather than an engineering (financial cost minimising) model.

Fig. 1
figure 1

The supply demand balance as considered in England and Wales water planning [adapted from UKWIR 2002b]. Demand plus target headroom must be greater than water available for use (yield of ‘DO’, minus losses, outage and sustainability reductions)

3.2 Case-Study Water Resource System

The proposed capacity expansion optimisation model is applied to a regional system composed of 34 WRZs, managed by the 6 water companies in South East England that serve a 17.6 million population (Critchley and Marshallsay 2013). Input data for the model (demands by WRZ, all costs, capacities of existing and optional schemes, outage, process losses, sustainability reductions and climate change allowances) are provided by the water companies for each WRZ.

Process losses, reductions of deployable output, target headroom and water demand are aggregated at WRZ level. Four demand scenarios are here included: dry year annual average (DYAA), normal year annual average (NYAA), dry year critical period (DYCP) and minimum deployable output (MDO) period. The DYCP scenario has been included only if it drives the need to implement new supply and DM options (EA et al. 2012b). Dry years (DY) are periods of low rainfall without demand restrictions. DYAA is the dry year average annual demand while DYCP is the dry year demand over a period of peak demand (normally defined as a week, see Fig. 2). The MDO period applies when supplies are expected to be at their minimum (see Fig. 2). This normally occurs in autumn when river flows and/or ground water levels are at their lowest and sources operate close to their minimum deployable output (DO).

Fig. 2
figure 2

Definition of annual demand scenarios (NYAA, DYCP, MDO and DYAA) [adapted from South East Water, 2009]. DYAA is the average demand over a dry year with no demand restrictions, DYCP is the average demand over a ‘peak demand period’ (typically a week) during a dry year with no restrictions, MDO is the average demand over the period when supplies available (deployable output) are expected to be at their minimum. DYDI is the dry year daily demand as quantified by distribution input (DI), i.e. what companies input to their networks. NYAA is the average demand over a year with normal weather patterns; NYDI is an example of normal year daily demand

Links between WRZs are either existing or optional. Optional links represent new transfers between WRZs or the prolonging of existing agreements. When optional transfers start from WRZs or areas not in the modelled network they are represented as new supply nodes (TR nodes).

Besides water demand nodes (representing WRZs) and junctions, network nodes are either existing or optional supply sources. Optional nodes are divided into supply-side schemes and demand management (DM) schemes. DM schemes include water efficiency (WEFF), leakage (LEAK) and metering (MET).

Supply-side schemes represented as nodes include reservoirs (RES), groundwater utilization schemes (GW), surface water abstraction (SW), effluent reuse schemes (ER), desalination (DESAL), water treatment works (WTW), aquifer storage and recovery schemes (ASR), and other options (Other) as conjunctive use schemes, network constraints removal or network improvements.

Some major supply schemes have several discrete capacities and cost values included as separate mutually exclusive nodes. For example, reservoir options in the Upper Thames area are proposed in different sizes which can be build in one phase or in two separate phases (e.g. 50 Mm3 in phase 1+ 50 Mm3 in phase 2). In some cases the phase 2 option can only be implemented a pre-defined number of years after the phase 1 option is selected (pre-requisite constraint with lag time).

DM options have water company defined annual water saving profiles which start from their first year of activation. Leakage reduction (LEAK) options include pressure management with new pressure release valves, district metering area data analysis for targeted repairs or network improvements, and reconfiguration as well as other strategies such as new detection technologies. Pro-actively fixing leaks before they are reported is referred to as active leakage control (ALC). ALC options have diminishing returns as their level of implementation increases. To gauge the extent to which ALC should be implemented, companies consider ‘tranches’ (bundles) of implementation. Each tranche is represented in the model with different prerequisite options (e.g. ‘tranch2’ can only be implemented after ‘tranch1’). Successive tranches have diminishing returns: WAFU is the same but capital and operating costs increase.

Metering options (MET) include ‘change of occupier’ instalments (COM), metering on left over domestic or commercial properties (e.g. difficult to fit), targeted compulsory metering, community integrated metering (new meters and upgrade of existing ones), metering of all household within a stressed areas (MET_STR) and achievement of maximum meter penetration by a pre-defined year. MET options also include seasonal tariff and rising block tariff. Tariff is exclusive to COM and MET_STR schemes.

Water efficiency measures (WEFF) include a range of different measures: household and commercial water audit, supply or retrofit of efficient devices (water butts, low flow taps, low use washing machine), rainwater harvesting and grey water reuse schemes.

The proposed model is applied to two different networks: Case 1 and Case 2. In Case 1 only supply-side options are included whereas Case 2 includes both new supplies and DM schemes (water conservation). Both models use the same least economic cost objective function, demand reduction feasibility assurance scheme (see paragraph 2.5.2), mass balance constraints (Eqs. 20 and 21) and scheme interdependence constraints (see Appendix C). The 272 interdependence constraints used in the model (in both cases) include mutual exclusivity (103 constraints), pre-requisites (‘and’ 91 of which 1 with ‘lag time’, ‘or’ 4 – see Appendix C for definitions of the constraint subtypes), mutual dependency (68 constraints) and 6 ‘capacity connectivity’ constraints.

4 Model Results

We solve the model over a 25 year planning period (2015–2039) with a 4.5 % discount rate (Ofwat 2009) for Case 1 (only supply-side options) and Case 2 (supply and DM options) over the four demand scenarios (DYAA, DYCP, MDO and NYAA). The MILP model is implemented in GAMS (Brooke et al. 2010) and solved using the CPLEX 12 solver. Table 1 gives model dimensions for Cases 1 and 2.

Table 1 Model statistics for Case 1 and Case 2 runs

Figure 3 shows the optional schemes and transfers activated by the model over the 25 years for Case 1 and Case 2 respectively.

Fig. 3
figure 3

Selected optional schemes and transfers for Case 1 (top panel - new supplies only) and Case 2 (bottom panel - supply and demand management schemes considered jointly). Dashed boxes contain zoomed out information. Numbers inside the various shapes represent the total number of options activated in each WRZ for a specific option type. Options types include: supply-side schemes (SUPPLY), metering (MET), leakage reduction (LEAK) and water efficiency (WEFF) measures

The model also provides the optimal annual use levels of supply options and transfers (demand management schemes are either on/off) for each demand scenario. Least cost usages (Ml/d) of selected schemes are shown in Fig. 4 for Cases 1 and 2 for the DYCP scenario.

Fig. 4
figure 4

Least-cost quantity of supply-side schemes and imports from external areas (TR) under the DYCP scenario for Case 1 (upper figure) and Case 2 (lower figure)

The Case 1 solution is dominated by large investments in reservoir schemes. Two big reservoirs (RES18 and RES36) are activated. These provide a total supply of 152.4 Ml/s with a total discounted capital cost of £M 124 equivalent to the 85.5 % of total capital investment on RES options (see Table 3).

The Case 2 solution is dominated by smaller options including DM options. DM options activated include 103 LEAK, 138 WEFF and 3 MET schemes. Reservoirs RES18 and RES36 are no longer selected leading to a significant cost reductions (capital costs on RES options are reduced from M£ 145 to M£ 10, see Table 3). Weighted variable costs (WEIGH_VOPEX) are also reduced by 33 % (i.e. from M£ 134 in Case 1 to M£ 90 in Case 2, see Table 2). This is due to a lower use/activation of supply–side schemes compared to Case 1 (e.g. WEIGH_VOPEX drops by 21 % for ER schemes and by 83 % for ASR options, see Table 3) and to negative carbon costs. Negative carbon costs represent savings in greenhouse gas emissions due to the capability of DM options to reduce water demands and the subsequent need for additional supply-side schemes.

Table 2 Total discounted costs over 25-year time horizon for the Case 1 and Case 2 networks for transfers, supply options and different DM types. ‘Links’ refer to both TR nodes and LFT bulk transfers among WRZs. Columns may not add up due to rounding
Table 3 Discounted costs for activated links and supply-side options aggregated by option type for both Case 1 and Case 2 networks. Columns may not add up due to rounding

Table 4 gives a summary of the total optimal capacity provided by selected options for Case 1 and Case 2. LEAK and supply-side options contribute most significantly to the supply–demand balance; LEAK options provide 21 % of total supply, while supply side options utilisation varies from a minimum of 64 % under the NYAA scenario to a maximum of 79 % under the DYCP scenario (see Table 4).

Table 4 Total least-cost quantity in Ml/d for activated supply-side schemes and DM measures for Case 1 and Case 2

Total deficits in Case 2 are lower than Case 1 (see Table 5) because the Case 1 network has fewer options to meet projected demands (it does not include DM schemes) and available schemes are often limited by ‘starting date’ constraints.

Table 5 Total regional deficit (in Ml/d) and level of demand satisfaction in % for the Case 1 and 2 networks

Figure 5 shows the magnitude (in Ml/d) of deficits in WRZs where they occur for Case 1 (upper figure) and Case 2 (lower figure) and under the four demand scenarios (there are no deficits under the NYAA scenario).

Fig. 5
figure 5

Supply–demand deficits appear for some of the WRZs in the network under the DYAA, DYCP and MDO scenario. Deficits under the NYAA scenario are zero

5 Discussion of Model Results and Contributions

Including demand management options reduced total NPV of CAPEX by 16.5 % (from 800 M£ in Case 1 to 668 M£ in Case 2, cf. Table 2), FOPEX by 6 % (80 to 75 M£, cf. Table 2) and VOPEX by 33 % (134 1o 90 M£, cf. Table 2). Total discounted costs for Case 2 decrease by 10 % compared to Case 1 which is equivalent to a saving of 99 M£. The application of the proposed capacity expansion model to South East England argues in favour of joint supply and demand management efforts to meet future water demands. The proposed model is effective at suggesting economically efficient capacity expansion schedules for a large system with 316 costed supply options, 511 demand management options, and 272 interdependency conditions, some quite complex stringing together several proposed schemes. Four contributions to capacity expansion modelling of large inter-connected regional water supply systems are introduced in this paper; their impact on our ability to solve the case-study problem are summarised below.

Demand management (DM) measures are considered in addition to new supply schemes. Special capacity constraints are introduced to allow user-defined annual water capacity (savings) profiles based on empirical evidence gathered by utilities. The introduction of DM options allowed finding a more cost-effective plan and, in our application, avoiding or delaying new infrastructure investments.

The model is conservative in that it ensures the annual supply–demand balance under dry period demands. Because the model considers variable costs (VOPEX) in its investment decisions, it is important that high variable costs associated with drier periods are not applied throughout the year. To address this we embed multiple simultaneous demand scenarios so that VOPEX costs (which depend on the extent to which schemes are used) can be weighted by how frequently a demand scenario is expected to occur. The model selects a single set of schemes that meets the most stringent demand estimates whilst also considering accurate variable cost estimates.

Water supply and demand management options in real systems will tend to have many interdependencies due to the engineering, social, environmental and political realities planners face. We introduce a flexible set of constraint equations that allow modellers to consider these policies and restrictions in the capacity expansion problem (Appendix C). 272 such constraints are used in our application to South East England.

A new two-step model infeasibility prevention scheme is proposed and used in the case study. It was necessary because several water supply zones could not withstand the rigorous supply–demand balance requirements. This model feasibility management scheme was an improvement over an infeasibility flow approach which was deemed unreliable since the magnitude of feasibility flow penalties affected the schemes chosen by the optimisation model.

Finally, a generic software platform was used to separate data management from modelling (Harou et al. 2010). The software allowed easily building the custom database structure and for example generating the network connectivity matrix.

6 Model Limitations and Benefits

The proposed model and its application have limitations stemming from the formulation itself and its application but also from the underlying structure of the problem as formalised by the UK water sector’s EBSD planning framework.

  1. 1.

    The problem of non-convex cost functions in capacity expansion problem is overcome by a significant compromise: fixing capacities and costs into proposed discrete schemes rather than using continuous cost functions. Given each discrete option generates a computationally expensive binary variable, inevitably too few discrete possible capacity and costs values are included, a limitation which necessarily leads to the suboptimality of suggested ‘least-cost’ plans. This limitation is partially mitigated by the fact that water companies and regulators prefer costing discrete schemes as they are unwilling or unable in many instances to generate continuous cost curves, which in practice for large systems are not always easy to generate reliably.

  2. 2.

    EBSD uses yields established under historically stressed conditions. This is accurate for hydrologically disconnected sources like desalination. For sources that are hydrologically inter-connected the yield concept is less suited; non-linear interactions are considered but only under one set of historically stressed conditions. If a future stress state is different from the one that was used to establish yields, the water available from sources (yield) could be substantially different.

  3. 3.

    The model and our application are deterministic, the stochastic part of the problem is dealt with external pre-processors: detailed system simulators which estimate safe yield for each source option. A stochastic EBSD (‘intermediate’) framework (not used in this paper) uses Monte Carlo simulation to explore the effect of uncertainty in yields and demands on the reliability of the programme of measures (schedule of interventions) identified by the optimisation model. Both the deterministic (applied here) and the intermediate framework produce solutions that meet a level of service (supply reliability) agreed by water companies and regulators. This fixed reliability requirement means the EBSD approach does not generate a reliability–cost trade-off curve. Additionally, the EBSD framework and model proposed here do not seek solutions that perform well across a broad range of future conditions or that are resilient (i.e., resume function rapidly after a failure). Neither does the model necessarily produce programs that are flexible and adaptable, i.e., programmes where changing course part way through the time-horizon can be cost-effectively achieved.

  4. 4.

    While real water supply systems are managed according to several criteria such as reliability, environmental performance, energy use, etc., the EBSD framework requires monetising all aspects of system performance and produces one ‘optimal’ least cost solution. System goals that cannot easily be monetised can only be represented in the model as constraints (e.g. reliability, environmental benefits, etc.). Trade-offs amongst monetary and non-monetary performance measures would be of interest to water supply planners but they are not made available using the EBSD method.

Despite several limitations, the benefit of the applied model and EBSD framework are their relative simplicity and applicability to large systems with complex interdependencies. Although water supply planning of large populous regions is complex, this framework parsimoniously boils the problem down to its most essential components to cost-effectively address the regional supply–demand planning problem for real-world systems. Not only does the model find the least-cost mix of schemes, but recommends the least cost implementation schedule to meet projected demands as well. The fact that the model formulated here is being used in a regulator-led effort to optimise investments for South East England with 17.6 million inhabitants is a testament to its ability to help plan real systems.

7 Conclusions

A least economic cost deterministic capacity expansion optimisation model was formulated. The model selects from a set of discretely costed new supply and demand management schemes to ensure the annual supply–demand balance at least economic cost over a T year time horizon. Capital (financial, environmental and social), and both fixed and variable annual carbon, operating, social and environmental costs of proposed schemes are considered. The model includes constraints designed to represent the predicted effect of conservation measures on the supply–demand balance. A series of constraints are formulated to represent complex interdependencies between proposed options which frequently manifest in real systems (there were 272 such constraints in our application). The model is conservative and assumes planners aim to satisfy peak demands with a prescribed reliability. Multiple simultaneous demand scenarios are considered in order to meet peak demands and accurately estimate variable costs. An infeasibility management scheme which reduces demand just enough to allow model feasibility (and warns the analyst) is used so that the model can still run if certain water supply zones are not able to satisfy intended reliability requirements.

The model is applied to a regional system (the whole of South East England) composed of six water utilities serving 17.6 million people. The model was run twice: initially only new transfers and supply-side schemes are considered, next demand management options are added into the dynamic portfolio optimisation problem. The availability of water conservation schemes in the planning problem reduced the total discounted economic costs by 10 % over the 25-year planning horizon. Demand management options are implemented in place of some of the most capital intensive proposed supply schemes such as two reservoirs which had high capital costs. The paper discusses limitations of the applied model and the UK water sector planning approach which led to it.