1 Introduction

Dhaka, currently the ninth largest city in the World and projected to be the fourth largest one by 2030, is the fastest growing and largest urban city in Bangladesh with a population density of 44,500/km2 (UN Habitat 2017; United Nations 2018). As per Dhaka Structure Plan (2016–2035), the projected population of Dhaka city would stand at 22.79 million by 2035.The current evidence shows that one-tenth of the Bangladesh population and a third of its urban population (36%) live in Dhaka (Bird et al. 2018). Therefore, ensuring equitable and affordable access to water to ever-growing population in Dhaka city has become a growing concern and a big challenge for the Dhaka Water Supply & Sewerage Authority (henceforth DWASA). For ensuring efficient water distribution, it is imperative for DWASA to realize the consumption variations of its consumers in different income spheres.

Demand-supply gap, quality and equitable distribution of water have become a growing concern in many urban cities of the developing countries like Bangladesh. Moreover, continued growing population in Dhaka city (i.e., rapid urbanization) adds contribution to the imbalances between water demand and supply. Water management policy response to water demand challenges in Dhaka city has become imperative for improving both distributive as well as allocative perspectives. Therefore, it urges to explore the causes of variations in water consumption among the residential water consumers (Schleich and Hillenbrand 2009) using micro-level data. Water policy makers require evidence-based recommendations with a view to devising cost-effective water delivery options for the growing population in Dhaka city.

The estimation of residential water consumption is straightforward as most of the households located in Dhaka city use the single source of water, namely Dhaka Water Supply & Sewerage Authority (DWASA). Nevertheless, imperfect water price information prevails in the ex post billing in Dhaka city (i.e., fixed pay, household- and holding-wise water meter; combined water usage with enterprises/businesses). However, the basic price theory for water demand from DWASA still holds to some extent. For example, if the water price is fixed (e.g., fixed water bill) and does not depend on the volume of water usages, the marginal price of water usage is zero for all existing consumers. Hence, the law of demand suggests that water use will continue until the marginal utility of water consumption is zero. Hence, preference heterogeneity that persists among the consumers determines the variation in water usage. This also implies weak correlation between fixed pay and water usage. However, this is strongly associated with income. This rational has made us interested in using ‘fixed pay for water’ an instrumental variable for income. Moreover, this might lead to low price elasticity for residential water demand as well as limited effectiveness and desirability of using price signals as a conservation tool (Gaudin 2006). Therefore, efficient and effective management and expansion of water delivery for the growing Dhaka city requires better understanding of the water usages at the household level (Ahmad et al. 2016). This study aims to contribute in this regard.

Inefficient water usage in Dhaka city is now a major concern, and thus, it requires identifying inefficiencies in water usage with a view to prompting policy response and decision making in regard to efficient water usage to urban households located in Dhaka city. This urges the introduction of efficient water pricing as an economic instrument to improve DWASA infrastructure through revenue generation as well as efficiency in water usage. Rapid urbanization of the Dhaka city is posing many challenges and mounting pressure on existing DWASA infrastructure, water deliveries and water usage. Moreover, the emergence of informal, unplanned and underserviced settlements exacerbates this situation. Therefore, it is of great importance for DWASA authority as a decision maker to provide and manage water supply for the growing city of Dhaka. Reconciling the issues like water conservation, efficient usage of water and collecting revenues to cover the costs of water provisioning requires in-depth understanding of the consumer’s response to income as well as other relevant covariates (Griffin 2006).

This paper contributes to the literature by exploring the income elasticities of water consumption in Dhaka city after controlling the effects of other important covariates of water demand. The micro-level water consumption study used in this paper strives to explore the distributional income effects on water consumption through measuring income elasticities across the water consumption distribution. Moreover, many other proxy-wealth variables such as savings, households owing geyser, shower, high commode, washing machine, water bill and DWASA spatial zones have been considered to check the effects of those covariates on water consumption. Moreover, household size as a demographic variable is used as an explanatory variable. Survey data, collected by BRAC Institute of Governance and Development (BIGD) from 459 households located in Dhaka city, have been used in measuring such estimates.

2 Empirical Literature on Water Consumption

Researchers in this field have used and identified different covariates of water consumption including price and non-price measures (i.e., income or wealth, household size and composition, seasonality, precipitation, temperature and locality etc.) using different statistical estimation techniques such as, ordinary least squares (OLS), generalized method of moments (GMM) and quantile regression (Deyà-Tortella et al. 2019; Dikgang et al. 2019; Kumar and Ramachandran 2019; Binet et al. 2014; Cardoso 2013; Wentz and Gober 2007; Worthington and Hoffman 2008). The three most central aspects of empirical works on water consumption are the identification of water demand covariates, nature of water consumption data (i.e., aggregate or household) and the magnitudes of estimated price and income elasticities (Flyr et al. 2019; Oliveira et al. 2017; Sebri 2016; Gardner 2010; Bartczak et al. 2009; Espey et al. 1997; Worthington and Hoffman 2008; Ahmad et al. 2016; Nauges and Thomas 2000; Martínez-Espiñeira 2002; Strand and Walker 2005). Despite different aspects of empirical evidence on water consumption in cities, the common procedure involves identification of most important determinants from a pool of covariates and estimates their separate effects using standard statistical estimation techniques mentioned above.

Among the non-price factors, average income of households appears as a significant determinant of water consumption (Reynaud 2015; Agthe and Billings 1987; Saleth and Dinar 2000; Gaudin et al. 2001; Garcia and Reynaud 2003). This study reviews the empirical evidence in regard to income effects on water consumption, which typically model the divergence between water consumption of the richest and poorest households in respect to their income level differences (Kumar and Ramachandran 2019; Schleich and Hillenbrand 2009; Espey et al. 1997). Empirical evidence shows that though most of the income elasticity estimates are positive, available income data degrades the quality of estimates (Dalhuisen et al. 2003) if income data are collected from public sources (Romano et al. 2016; Bartczak et al. 2009). Most of the recent studies have used the proxy-income variables as a surrogate of real income data for observing their effects on water consumption. This study uses households owing geyser, shower, high commode, washing machine, water bill, and savings as proxy-wealth variables. Incorporation of household size has extreme importance as it influences water consumption. It is evident that the larger household size and the presence of children in the household are positively associated with water consumption (Nauges and Thomas 2000; Cardoso 2013). Population living in spatial zone also affects water consumption and this may be due to the presence of heterogeneous nature of their socio-economic condition. The distributive effects of household income measured in terms of income elasticities have been estimated after controlling the effects of household size, proxy-wealth variables and spatial zones using the cross-sectional data from 459 residential households in Dhaka city. Due to the dearth of facts regarding availability of quality data on water consumption, an empirical analysis of water demand function in developing countries like Bangladesh is less focused. This study aims to contribute in this regard using the most up-to-date computational methods such as instrumental variable linear regression (IVREG) and instrumental variable quantile regression (IVQREG) approaches.

Empirical evidence in this regard is mixed as the studies find different income elasticities of water consumption in the developing countries, and thus, they might have different policy implications for authorities of the water management. The precise and accurate estimation of income elasticities of water consumption plays a great role in devising workable water demand management policies. It would also help contribute to designing efficient water pricing mechanism with a view to reducing the loss of water usage. It also encourages bringing efficiency in production, consumption and sustainability of water availability in the long-run. As multiple factors influence the water consumption, this study analyzes income elasticities of water consumption after controlling the effects of other explanatory variables for different water user groups living in the Dhaka city.

Finally, exploring the diverse water consumer groups requires checking whether elasticity of water consumption varies throughout the entire water consumption distribution. In case of varying income elasticities across the water consumption distribution average income elasticity might fail to provide insightful information on possible effects of income change. As income variable is endogenous, this study applies the instrumental variable quantile regression (IVQREG) to estimate the income elasticities across different quantiles of water consumption distribution in addition to measuring average income elasticity using the instrumental variables regression (IVREG) approach. This would help explain how income elasticities of water consumption differ across the distribution.

Different econometric methods have been applied to explore income elasticities of water consumption at the household level in Dhaka city. There is no published research on estimating income elasticities of water consumption for Dhaka city using the instrumental variables quantile regression (IVQREG) approach. In the quantile regression (QR), pioneered by Koenker and Bassett (1978), each conditional quantile takes linear form representing the relationship between the outcome variable (i.e., water consumption) and the different covariates (e.g., income) at different points of water consumption distribution. As income is endogenous, the IVQREG approach developed by Chernozhukov and Hansen (2006, 2008) has an additive advantage over the IVREG as the IVQREG benefits from being more robust to outliers, more consistent under weaker stochastic assumptions and better picture of the data (Cameron and Trivedi 2005). Introducing IVQREG method into water consumption estimation is of great importance to act upon better characterization of income elasticities and other covariates explaining water consumption in Dhaka city. Findings using the IVQREG estimation would be of great contribution in designing water policies by water management authority of DWASA.

3 Data and Methodology

3.1 Data

Dhaka city comprises a total geographical area of 306.38 km2, which is divided into 129 wards for administrative convenience. Presently one-tenth of the Bangladesh population and one third of its urban population (36%) live in Dhaka (Bird et al. 2018). Dhaka is one of the most densely populated cities in the world with a population density of about 23,234 people/km2. Water consumption data was collected in 2018 from 459 households located in nine different water zones of the DWASA. The dependent variable used in this study is the per capita per month water consumption. The independent variables include per capita per month income, location (formal settlement vs. slum), wealth-proximate variables, household size, water bill and the DWASA’s spatial zones. The descriptive statistics section describes the data in detail.

3.2 Water Consumption Models for Dhaka City

3.2.1 The Instrumental Variable Regression (IVREG) Model

This section starts with the base Ordinary Least Square (OLS) regression for modeling the estimates of income elasticities, followed by instrumental variables regression (IVREG) and instrumental variables quantile regression (IVQREG) approaches, respectively. As the multiple OLS regression approach inadequately solves the problem of the endogenous explanatory variable (i.e., income), the best approaches are to use instrumental variable regression approaches. Therefore, for our analytical purpose, we model the water consumption using the instrumental variable regression (IVREG) approach in order to study the income elasticity of water consumption at the household level. The econometric models for OLS and IVREG estimations can be expressed as:

$$ OLS:\kern2em \ln W={\gamma}_0\ \ln I+\sum \limits_{i=1}^k{\gamma}_i{X}_{ik}+{u}_{ik} $$
(1)

2SLS:

$$ Stage\ 1:\kern1em \ln I={\alpha}_0 FP+\sum \limits_{i=1}^k{\alpha}_i{X}_{ik}+{v}_{ik} $$
(2)
$$ Stage\ 2:\kern1em \ln W={\beta}_0\kern.2em \mathrm{l}\hat{\mathrm{n}}I+\sum \limits_{i=1}^k{\beta}_i\kern.2em {X}_{ik}+{\varepsilon}_{ik} $$
(3)

where W in OLS regression is the vector of K1 quantities of per capita per month water consumption (L); I is the vector of K1 per capita monthly income (the variable of our interest to explore); Xik is the vector of control variables including household savings, location (formal settlement vs. slum), household size, wealth proximate dummies like owning shower, washing machine, geyser, high commode, water bill and DWASA zonal dummies; and uik is the vector of K1 random errors in Eq. (1). See Table 1 for a description of variables. γ0 and γi are the vectors of respective parameters of income and other control variables. The estimated parameters using the OLS obtain the income elasticity of water consumption (Crane 1994; Rietveld et al. 2000; Basani et al. 2008), which can be estimated directly by taking the partial derivative of the water consumption with respect to income. This takes the form, where nI is the income elasticity of water consumption.

$$ {n}_I=\frac{\partial \ln W}{\partial \ln I} $$
(4)
Table 1 Descriptive statistics and definitions of variables

IVs technique, used to explore whether OLS estimates are unbiased and unobservable related to both income and water consumption, is not standing behind the estimated results (Pokropek 2016). More specifically, socio-demographic characteristics and preference could not be included as control variables in Eq. (1) due to the lack of data. Therefore, there is a possibility that water consumption may be determined by preference and other unobserved factors that are outside the model. These factors are very likely to correlate with income as well. Therefore, the OLS income will pick up some of the effects of the unobserved factors and produce bias estimate. Moreover, self-reported income variable used in the study also suffers from underestimation. This provides motivation for using linear instrumental variable estimation technique (IVREG) to provide a causal interpretation to economic water policy.

The error uik embodies all factors other than income and other control variables that determine water consumption. One such factor in uik is fixed pay (FP). Therefore, household or holding under the fixed payment system will induce a correlation between income (I) and error (uik) as household or holding with the fixed pay belongs to the category of low income households. This makes the OLS estimator \( {\hat{\gamma}}_0 \)inconsistent for γ0, as \( {\hat{\gamma}}_0 \) combines the desired direct effect of income on water consumption (γ0) with the indirect effect that household belonging to poor income are likely to have no water meter because of the housing structure and location as well, low uik, and hence, low water consumption.

As income is thought in the study to be endogenous, the inconsistency of \( {\hat{\gamma}}_0\kern0.5em \)is referred to as endogeneity bias, which does not disappear asymptotically. This study uses fixed pay dummy as an instrumental variable, which has the property that changes in payment mode are associated with changes in income but do not lead to changes in water consumption in the current settings of Dhaka city. The IV estimator \( \hat{\beta_{0, IV}} \) will be consistent for β0 provided that the fixed pay (instrument) is uncorrelated with the error uikand correlated with the income. We argue that in this analysis there is no direct path between fixed pay and water consumption. By doing this we are assuming that household under fixed pay mode does not substantially affect water consumption. However, people with low income and poor housing are more likely to be counted under fixed pay mode and vice versa. The models for IVs estimation are shown in Eqs. (2) and (3).

3.2.2 The Instrumental Variables Quantile Regression (IVQREG) Model

Moreover, there is reason to believe that income elasticities using the IVREG might be heterogeneous, that is, income elasticities differ among the different income groups of the households. In addition, income elasticities can vary according to characteristics of the omitted variables such as socio-demographic characteristics and preferences. The IVREG estimator delivers better and reliable estimates in the presence of heterogeneous income elasticities using fixed pay for water as instrumental variable. Before jumping to the IVQREG approach, we briefly describe the quantile regression (QR) first.

Now let us focus on how QR computes several regressions for the different quantiles ranging from 0 to 1 and provides income elasticities after controlling the effects of other control variables on the entire water consumption distribution. This implies an additive advantage over the OLS, which shows how conditional mean value of the dependent variable varies across different covariates. But QR provides more accurate picture of the relationship of the dependent variable, water consumption with the income and other covariates influencing water consumption in Dhaka city. Koenker and Bassett (1978) used the QR in their first seminar paper to explain the usefulness of using QR. However, QR estimates are biased in the presence of endogenous variable, which is income in our case. However, the IVQREG approach can overcome this problem. Now we describe the instrumental variable quantile regression (IVQREG) developed by Chernozhukov and Hansen (2006, 2008).

Conditional on the vector X = x, the scalar potential outcome variable of water consumption, WI is given by the quantile function

$$ {W}_I=q\left(I,x,{U}_I\right) $$
(5)

where q(.) is a conditional τ-quantile function for water consumption; I is a vector of continuous income variable, which is assumed to be endogenous; x denotes the vector of the included exogenous variables such as wealth-proxies, location, household size, water bill and DWASA spatial zones; and U is a non-separable error denoted by U ∣ x, z~uniform(0, 1) with z being a vector of the excluded instrument, here it is fixed payment (FP) dummy for water bill.

The endogenous income variable is given by

$$ I=\gamma \left(X,Z,V\right) $$
(6)

where V the vector of unobserved variables, which depends on U. This paper follows the linear model of the following form to obtain the IVQREG estimator

$$ q\left(i,x,\tau \right)=\overset{\acute{\mkern6mu}}{\imath }\ \gamma \left(\tau \right)+\overset{\acute{\mkern6mu}}{x}\varnothing \left(\tau \right) $$
(7)

We also let \( \theta \left(\tau \right)=\left\{\gamma \left(\tau \right)\acute{\mkern6mu},\kern0.5em \varnothing \Big(\tau \overset{\acute{\mkern6mu}}{\Big)}\right\} \). Here, γ and ∅ show the estimated parameters for income and other observable factors after using instrumental variables quantitle regression estimation technique. Endogeneity arises because of the correlation between I and U. Conventional quantile regression estimate θ(τ) is biased due to endogenous income variable, I (Koenker and Bassett 1978). However, we can overcome this problem using the IV method under certain assumptions (see Chernozhukov and Hansen (2005) for details), which leads to a set of moment conditions as:

$$ P\left[W\le q\left(i,x,\tau \right)\ |\ z,x\right]=\tau $$
(8)

Equation (8) is the main equation for identification as it allows us to estimate θ(τ). Moreover, it is used to construct moment conditions for estimating the conditional quantile function of W given I = i and X = x. Based on ranking invariance assumption, the even {W ≤ q(I, x, τ)} is equivalent to {U ≤ τ} and this provides

$$ \underset{\theta \left(\tau \right)}{\arg\ \min\ E}\left({\rho}_{\tau}\left[w-\overset{\acute{\mkern6mu}}{\imath }\ \gamma \left(\tau \right)-\overset{\acute{\mkern6mu}}{x}\varnothing \left(\tau \right)-\int \left(z,x\right)\right]\right) $$
(9)

where ρτ(u) = u[τ − 1(u < 0)] and ∫(.) refers to the general form of F(z, x). Eq. (9) refers to the state that 0 is the τth quantile of the random variable W − q(i, x, τ) conditional on Z = z and X = x. Solving Eq. (9) would help provide the IVQR estimator for θ(τ). The linearity assumption simplifies Eq. (9) as:

$$ \underset{\theta \left(\tau \right)}{\arg\ \min\ E}\left({\rho}_{\tau}\left[w-\overset{\acute{\mkern6mu}}{\imath}\gamma \left(\tau \right)-\overset{\acute{\mkern6mu}}{x}\varnothing \left(\tau \right)-\overset{\acute{\mkern6mu}}{z}\pi \left(\tau \right)\right]\right) $$
(10)

Instrumental variables quantile regression (IVQREG) uses Eq. (10) to obtain \( \hat{\theta}\left(\tau \right). \)

4 Results

The empirical results are classified into descriptive statistics, correlation analysis, instrumental variables regression (IVREG) and instrumental variables quantile regression (IVQREG) analyses.

4.1 Descriptive Statistics

Table 1 reports the mean and standard deviation of the variables used in the estimation of water consumption model for Dhaka city. Household water consumption is calculated based on water bill paid by the households. The water bill paid by households has been divided by the DWASA tariff rate effective since 1st July 2018 at 11.02 BDT (USD 0.13) per 1000 L, to calculate the total water usage of the households per month. For clarity, we excluded the value added tax (VAT) of 15% from the paid amount for water. We calculated the per capita per month water usage by dividing the household per month water usage by the household members assuming each household member uses equal amounts of water. Per capita per month water consumption stands at 15,103 L, with high standard deviation, implying highly skewed water consumption distribution to the right. Average per capita per day water consumption stands at 503 L, which is very high compared to other studies (BIGD 2019; WASA 2016; Abedin and Rakib 2013). Previous studies find that the estimates of per capita per day water consumption in Dhaka city ranges from 250 to 310 L. The high average of water consumption for this study arises because of the following reasons: (a) water bill in some cases in some areas include sewage charges; (b) households using extra pumping machine to bring water often increase the speed of the meter and so it inflates the bill; (c) water bill reported by the owner of a holding, where a single meter is set for the holding, inflates the water bill and thus, the water consumption, and this is a very common scenario in case of water usage in Dhaka city; (d) household water consumption is adulterated with water usage by enterprises/businesses such as restaurants and groceries attached to the households. Such types of cases also inflate water bill as it includes charges for such enterprises. The distribution of log of per capita per month water consumption seems to be normal (Fig. 1

Fig. 1
figure 1

Normal probability plot of water consumption in Dhaka city

). On average, per capita monthly income stands at BDT 7254 (USD 85.54); 57% of households have savings; 77% live in formal settlement; household size stands at 4.21 persons; 45, 5, 3 and 26% households use shower, washing machine, geyser and high commode, respectively; monthly water bill stands at BDT 742 (USD 8.75); and 49% of households belong to fixed pay water mode.

4.2 Correlation Analysis

Correlation matrix reported Table 2 shows the correlation coefficients between variables. Each cell in the correlation matrix shows the correlation between the two variables. A correlation matrix summarizes data and is also used as a diagnostic for advanced analyses including instrumental variables regression models such as IVREG and IVQREG. This study uses correlation analysis as a precursor to the regression models like IVREG and IVQREG in order to identify the strength of linear association between the two variables. Correlation coefficient, \( p\left(x,y\right)=\frac{Cov\left(x,y\right)}{\sqrt{Var(x) Var(y)}} \), ranges from −1 to 1.

Table 2 Correlation matrix

In addition, it is important to examine the presence of multicollinearity in the dataset before applying the desired regression methods. We used it as an initial check to explore the issue of multicollinearity among the covariates since the correlation matrix explains the bivariate linear relationships between variables. Although the correlation matrix is a good detector of multicollinearity, it requires further examination due to multivariate nature of the multicollinearity. Moreover, the correlation matrix of predictors helps to check at a glance how the covariates are associated with the outcome variable as well.

The correlation coefficients reported in Table 2 shows the linear associations between different pairs of variables used in the study. The correlation matrix shows that water consumption is significantly associated with all covariates except savings. It is also observed that water consumption and household size is significantly negatively associated (−0.21). All other variables, except household size and savings, are positively and significantly associated with water consumption. The correlation coefficients reported in Table 2 shows that no issue of multicollinearity persists in the dataset.

4.3 Instrumental Variable Regression (IVREG) Model for Water Consumption in Dhaka City

This section presents the OLS estimates of the income elasticities of water along with the results obtained with IVs estimation, which is of our interest, from a comparative perspective. The multiple OLS estimation techniques inadequately address the problems arising from endogenous independent variable “income” as it assumes that covariates included in the OLS regression capture all potential selection (and omitted variables) problems. The OLS approach models the per capita per month water consumption as a function of per capita monthly income, savings, household location (formal settlement vs. slum), household size, and household wealth proximate variables like owning shower, washing machine, geyser and high commode, water bill and DWASA spatial zones. The estimated results using the OLS method are reported in columns 1 & 2 of Table 3 with default and robust standard errors respectively. In addition, this study has also applied bootstrapping method for the OLS estimates using 400 replications (column 3 of Table 3).

Table 3 Water consumption estimation using OLS and IVREG

Let us focus on the estimates of the effects of income (here log of income) on water consumption (log of water consumption) to represent the income elasticity of water consumption. Not surprisingly, income elasticity is positive and statistically significant at 5% level. All estimates based on log-log model directly measure the elasticities. For example, the coefficient on income is 0.163, implying that 1% increase in income causes 0.163% increase in the quantity of water consumption. After bootstrapping, the income elasticity remains significant at 5% level with slightly higher standard errors for the income estimate. Similar empirical findings are evident for Ahmad et al. (2016), Stevens et al. (1992), Hoffman et al. (2006), Gaudin (2006), Nauges and Strand (2007), and Schleich and Hillenbrand (2009). Moreover, significantly positive estimate, but small in magnitude, of income elasticity implies inelastic responsiveness to water consumption due to change in income.

By using a two-sided test at level 0.05 for OLS, location, household size, shower and high commode are individually statistically significant as p < 0.10. After applying the bootstrapping method with 400 replications, these estimated coefficients remain statistically significant. In addition, other wealth-proxies like savings, washing machine and geyser have insignificant effect on water consumption and this may be due to small sample for the designated wealth-proxies as well as the inclusion of several wealth-proxies in regression specification that capture well the water consumption effects of these variables. Similarly, this study finds no significant effect of water bill. The dependent variable, water consumption, also displays a spatial (or zonal) distribution with the highest water consumption in Zone 5 of DWASA, which is dominated by large and expensive homes. This indicates the presence of spatial inequality in water consumption in Dhaka city.

As the multiple OLS estimation techniques inadequately address the problems of endogenous independent variable “income” and make the OLS estimates biased, we use IV technique to examine this issue using fixed pay dummy as an instrument for income. Fixed pay variable is expected to be connected with the income and it should not be causally related with water consumption. This analysis argues that there is no direct path between fixed pay and water consumption. By doing so, we are assuming that household or holding with and without fixed pay does not substantially affect water consumption in Dhaka city. It is likely to be true as fixed pay is measured in dummy rather than quantities (it is around 49% households under the fixed pay mode).

The Stage 1 Eq. (2) cleans up the endogenous predictor income, leaving only the exogenous part of the variation that covaries with the instrument, fixed pay in our case. The Stage 2 Eq. (3) is very identical to an OLS Eq. (1) except that predicted income variable \( l\hat{n}I \)is cleaned up by the instrument of fixed pay in Stage 1. This allows to estimate the causal relationship of income with water consumption. Table 4 shows the results of the first stage of 2SLS described by Eq. (2). The first stage regression has satisfactory explanatory power, and the coefficient of fixed pay is negative, as expected, and highly statistically significant. It plays an important role for the study as the instrumental variable “fixed pay” is highly significantly related with the instrumented variable “income”.

Table 4 First-stage regression results of the IVREG and IVQREG

Column 4 of Table 3 reports the results obtained using the second stage of the 2SLS estimation of Eq. (3). Estimates based on the IVs reveal that income elasticity of water consumption is positive, as expected, and statistically significant even at 5% level and income has bigger responsiveness on water consumption than the OLS estimate. For example, the estimated coefficient of income now is 0.78, implying 0.78% increase in water consumption due to 1% increase in income. As IV provides only local average treatment effect (LATE), it is possible that the income elasticity is heterogeneous. It implies that responsiveness of water consumption might vary with the different income groups. The general perception in this regard is that responsiveness would be low with the people in richer income groups as they consume already significantly more compared to the poorer groups. Therefore, we use the instrumental variable quantile regression (IVQREG) approach as it accounts for this heterogeneity while estimating income elasticity across the water consumption distribution (discussed later on the section). However, note that results obtained using the IVREG do not reverse the sign of the effects of the significant covariates reported in Table 3, and thus, interpretation of the results remain the same. However, the coefficient of savings also changes from insignificant to significant. The standard error of the instrumented regressor ‘income’ increases from 0.072 to 0.328, implying the potential loss in efficiency due to using the IV approach.

This study also uses the Wald test to test the equality of the coefficients of interest. The F statistics version of the Wald test has been used after the IVREG regression estimation for water consumption. A common test is one of the equality of coefficients of the covariates. For example, this study tests that the households located in different DWASA spatial zones have the same effects on water consumption. The test of

$$ {H}_0:{\beta}_{zone1}={\beta}_{zone2}{=}_{\dots \dots \dots \dots \dots \dots \dots \dots \dots \dots \dots \dots \dots \dots \dots \dots \dots \dots \dots ..}={\beta}_{zone9}\ against\ {\mathrm{H}}_A:{\beta}_{zone1}\ne {\beta}_{zone2}{\ne}_{\dots \dots \dots \dots \dots \dots \dots \dots \dots \dots ..}\ne {\beta}_{zone9} $$

has been performed to check whether coefficients of the DWASA zones vary significantly or not. Water consumption estimates of the DWASA zones are jointly statistically significant as the overall Chi2 statistic is 31.90 with a p value of 0.000. This implies that significant variations (i.e., spatial inequality) in water consumption persists across the water zones in Dhaka city, implying spatial inequality in water consumption.

The preceding analysis treats the income variable as endogenous. As the results obtained using the IVREG and OLS approaches differ considerably, this is a strong evidence that income is endogenous. Moreover, this study applies the Durbin-Wu-Hausman (DWH) test to examine endogeneity as it produces robust test statistics using the device of augmented regressors (Davidson 2000). The robustified DWH test statistic leads to strong rejection of the null hypothesis that income is exogenous. Therefore, we conclude that income is endogenous. Now our concern is to check whether the instrument ‘fixed payment’ is weak as it provides a poor guide to actual finite sample distributions. This study diagnoses it using pairwise correlation between endogenous regressor (income) and instrument (fixed payment). The estimated correlation coefficient is −0.28, which is not so low to treat this as weak instrument. Moreover, there will be no considerable loss of precision using the IV estimation as the first two statistics, R2 and adjusted R2, are found around 0.59 and 0.57 respectively, which are high. F-test on just fixed pay (instrument) find the value of 21.92, which is considerably larger than the rule of thumb value of 10. Therefore, fixed pay does seem to be a strong instrument.

4.4 Instrumental Variable Quantile Regression (IVQREG) Model for Water Consumption in Dhaka City

The IVQREG estimator delivers a reliable estimate for the endogenous and heterogenous income elasticities using fixed pay dummy as instrument. Since the sample data collection is random in our case, we can use fixed pay as an instrument for the income. Due to fixed payment nature of water consumption, we are fortunate to have low and high correlations of the instrument with water consumption and income, respectively. The IVQREG results using fixed pay dummy as an instrument are reported in Table 5. The output reported in Table 5 shows the estimated coefficients for different quantiles at QR25, QR50, QR75, and QR85 in addition to IVREG estimates.

Table 5 Estimates of income elasticities of water consumption using IVREG and IVQREG

Table 5, columns 2–5, report the coefficient estimates and bootstrap standard errors in the parentheses at quantiles QR0.25, QR0.50, QR0.75 and QR0.85. The output reported in column 3 of Table 5 shows the coefficient estimate for water consumption at the median quantile QR0.50. The income elasticity for the households with median water consumption is 0.88 (i.e., income elasticity at 50th percentile is 0.88). The estimated coefficients of all the independent variables represent marginal effects for the households with median water consumption. We use bootstrap standard errors assuming that the errors vary with the covariates. For example, the variance of income possibly varies with wealth-proximate variables. Income elasticity decreases from 0.92 at QR0.25 to 0.35 at QR0.85. The income elasticities at different quantiles are positive and statistically significant. The IVREG estimate of income (i.e., 0.78) differs considerably from the IVQREG coefficients, even those for median regression (i.e., QR0.50). The income elasticities for the households with median water consumption is 0.88, which is income inelastic. The substantive interpretation is that median household water consumption increases by 0.88% due to 1 % increase in their income. The IVQREG estimates of income elasticities imply systematic evidence across the water consumption distribution in Dhaka city. More specifically, the income elasticity of water consumption decreases with the increase in percentile. Findings also imply that income elasticity estimate converges to other global estimates as we move from lower water consumption quantile to higher water consumption quantile. The true reflection of the global estimates arises for households belonging to the higher quantiles of water consumption in our case. Note that presence of water is significantly higher among the water richest households compared with the water poorest households, and thus, it becomes possible to have globally comparable estimates for the water richest households in Dhaka city.

One reason for income elasticities differing across quantiles is the presence of heteroskedastic errors. It is also possible to conduct hypothesis tests of equality of the income elasticities at different conditional quantiles of water consumption. The test of the equality of the income elasticities from the QR0.25, QR0.50, QR0.75 and QR0.85 has been done using simultaneous IVQREG with a view to obtaining the full covariance matrix of coefficients. This study performs the Wald test on the null hypothesis that income elasticities are the same for the four quantiles. Wald test of F statistic is 6.08, which rejects the null hypothesis that income elasticities are same for the four quantiles. Therefore, income elasticities of water consumption significantly vary across the quantiles.

5 Summary and Conclusions

This paper explores water consumption in Dhaka city for better understanding its usage using the 459 household survey data collected by BRAC Institute of Governance and Development (BIGD). For this purpose, we estimated income elasticities of water consumption after controlling the effects of other explanatory variables including wealth-proxies, location, household size, water bill and spatial zones using the instrumental variable regression (IVREG) and instrumental variable quantile regression (IVQREG) approaches for providing more reliable and accurate picture of water usage in Dhaka city.

As most of the households located in Dhaka city use the single source of water, namely Dhaka Water Supply & Sewerage Authority (DWASA), the estimation of residential water consumption seems to be straightforward. However, imperfect water price information prevailing in the ex post billing in Dhaka city including fixed pay, household-wise water meter, holding-wise water meter, combined water usage with enterprises/businesses etc. make the estimation more complex. Moreover, dearth of facts on water consumption complicate the issue of measuring residential water consumption. However, the basic price theory for water demand from DWASA may still hold to some extent. For example, if the water price is fixed (e.g., fixed water bill) and does not depend on the volume of water usages, the marginal price of water usage is zero for all existing consumers. Hence, the law of demand suggests water use will continue until the marginal utility of water consumption is zero. Hence, preference heterogeneity that persists among the consumers determines the variation in water usage. This rationale is important in our case and it is reflected in our findings that water bill plays insignificant role in determining water consumption at the 25th percentile of water consumption. This is also an indication of weak correlation between fixed pay and water usage. However, this is strongly associated with income. This study uses ‘fixed pay for water’ an instrumental variable for income. Using the fixed pay variable as instrument, findings reveal the strong evidence that income is endogenous. Therefore, estimates using OLS would suffer from biases. This study contributes in this regard.

The results show that income elasticities are heterogeneous and vary significantly across the water quantiles, implying inequality in income responsiveness to water consumption and it is inversely related with water consumption. The distinctive feature of water consumption is that it is non-linearly related with income. However, findings provide the strong systematic evidence as income elasticity of water consumption decreases with the increase in percentile (i.e., less responsiveness of water consumption to income changes at the higher percentiles). Household income elasticity appears to be inelastic across the distribution and the magnitudes of the income elasticities get smaller as we move to the higher quantiles of water usage. The income elasticity estimate converges to other global estimates as we move from the lowest quantile to the highest water quantile. In our case, water consumption quantiles also reflect the income quantiles in the same direction, but not the same extent. Therefore, our estimate for income elasticities is much wider. This implies strong responsiveness of water consumption to income changes among the water poorest households in Dhaka city. The elasticity estimates obtained for the highest water quantile group are very much consistent with the estimates obtained for other countries (income elasticity of water demand greater than 0.50) such as Bugariam Cyprus, Estonia, Finland, Latvia, Lithunia, Portugal and Slovakia (Reynaud 2015). Note that presence of water meter is significantly higher among the water richest households compared with the water poorest households in Dhaka city, and thus, it has made us possible to accurately measure the income elasticity estimates for Dhaka, which are compatible with the estimates for other countries. However, income elasticity estimates without considering instrumental variables would suffer from downward bias.

Moreover, no water consumption gap between formal and slum settlements persists at the 25th percentile. However, the gap becomes significant and widens up to 75th percentile. Then it starts declining. Another important covariate is savings, which is significantly and negatively associated with water usage. In our case, we have used the savings dummy, not the savings amount due to lack of data. Findings using Chi2 statistics show that savings and water consumption are independent, implying no association between them (similar to our OLS findings and correlation matrix). Therefore, our case does not necessarily mean higher water consumption for households with the presence of savings status. The IVQREG model finds the neutral effect of household size on water consumption among the water poorest households (i.e., 25th percentile). However, household size plays a significant role in reducing water consumption, which contrasts with other estimates. This arises because of the heterogeneous household size within a water quantile group. Note that household size is significantly negatively associated with the water consumption and income as well, implying poor income status for a larger household size. Therefore, for a given level of water quantile, household size varies within this water consumption group to reflect the income status. More household members in a family implies the possibility of belonging to a relatively poorer group and thus, the less water consumption.

Significant spatial inequality in water consumption from IVREG approach disappears as we use IVQREG. This also strongly supports the systematic evidence obtained in our case for measuring water consumption responsiveness to income changes. Finally, we find that water bill paid has no role in explaining water usage variation at 25th percentile. However, water bill estimates become significant from the median to upper quantile regressions though the estimates are small in magnitudes, implying the scope to increase the water bill. Moreover, it is evident that water bill has no significant effect on income (Table 4). However, the estimate of water bill is reversed at the 85th percentile of income. Therefore, introducing increasing tariff blocks (IBT) as water consumption might promote water conservation, ensure more access to water by households belonging to the poor blocks and generate more revenue for the DWASA. However, DWASA must ensure smart water meter to the residents before implementing such tariff structure. Analysis of smart meter traces of individual customers supports prompt detection of leakages (Luciani et al. 2019; Cominola et al. 2019) and other types of anomalies in residential and nonresidential accounts (Patabendige et al. 2018). Further work should seek to collect panel data on price and relevant variables using panel water consumption survey to explore water consumption estimates using quantile regression with instrumental variable for price.