Background

With the rapid expansion in China of the aging population, changing morbidity, and evolving lifestyles, the number of elderly people who require long-term care has been growing dramatically [1]. To deal with the “grow old before getting rich” situation [2], in 2016, the government initiated a pilot program of long-term care insurance in 15 cities, one of which is Shanghai.

Studies have demonstrated that a substantial part of long-term care is provided by informal caregivers in China [3, 4]. Informal caregivers refer to non-professionals who provide long-term care services to their relatives, neighbors, friends, etc. [5]. Although unpaid, informal caregiving is not without costs. For example, informal caregivers are reported to devote less time to paid work and leisure [6, 7], have greater morbidity and mortality risks [8, 9], and be confronted with higher levels of stress [10]. In addition, most researchers agree that informal caregiving substitutes and/or complements formal caregiving services, and should be recognized accordingly in long-term care related researches and policies [11, 12].

Consequently, many informal caregiving economic valuation investigations have been carried out in different countries and regions. The results revealed that different valuation methods exert impacts on monetary value. The most utilized evaluation methods are the opportunity cost method, the proxy good method, the contingent valuation method, and the conjoint method [13]. The former two methods are termed revealed preference methods, and use market prices of labor to evaluate the input or output of caregiving, respectively. However, both the opportunity cost method and the proxy method have been criticized because both methods rely on the accuracy of caring time recall, fail to reflect the full impact of caregiving, and cannot capture the preference of the caregivers [5]. A previous study also suggests that the revealed preference method is less applicable in certain contexts, especially in less developed countries, because most caregivers are without paid work [14]. The contingent valuation method (CVM) and the conjoint method constitute stated preference methods. Among stated preference methods, the CVM is one of the most utilized, because it is less cognitively demanding and more sensitive to the heterogeneity of caregiving [15]. CVM is generally conducted by asking the amount of money that the respondent is willing-to-pay (WTP) or willing-to-accept (WTA) for a reduction or increase of a unit of caregiving time. Moreover, associated factors, differences, and non-responsiveness of WTP and WTA were examined. The results differ according to study design, sample characteristics, countries, regions, etc. Generally, WTP and WTA are associated with income and/or wealth, health status, and care burdens on caregivers [16]. Most CVM studies of informal care were conducted in developed countries with well-established long-term care service systems. However, few such researches were performed in developing countries, and none has been found in China.

Due to Asian filial norms, the developing social insurance system, and scarcity of care resources, informal caregiving has always been the most important source of long-term care in China [17, 18]. However, many scholars have pointed out that the rapidly growing elderly population, one-child policy, and increasing female labor participation could jointly affect the traditional way of growing old for Chinese people, and the demand for formal care is predicted to rise markedly [19, 20]. Shanghai has the largest population of elderly people among all cities in China. In 2016, Shanghai was selected as one of the first piloting cities of the social long-term care insurance. Informal caregivers of the insured elderly choose the service type from a list provided by the government, and then the insurance will co-pay for the chosen services provided by designated providers. The out-of-pocket expense is fixed, irrespective what services are utilized. Moreover, the needed hours of care to be co-paid by LTC insurance is determined by a third-party assessment institution, and the assessment result for an elderly individual is usually a few hours per week (from 1 to 7 h) due to the serious shortage in supply of home care [21]. The reassessment of demand is conducted every 2 years, unless the applicant requests for a reassessment. This ‘managed demand’ strategy can avoid possible rapidly-growing costs of home care, alleviate the burdens of informal caregivers as well as emphasize the responsibility of both the family and the government.

Few investigations have explored burdens of informal care providers in China. Studies concerning informal care mostly focused on caring time, depression risk, and the burden of informal caregiving [6, 22]. One study utilized open access data to evaluate the economic value of informal care provided to stroke survivors using the opportunity cost method [23]. However, as previously mentioned, revealed preference methods fail to represent all costs involved in caregiving, especially in developing countries. At present, there is no extant literature on the economic valuation of informal care by CVM in China. However, since the population is aging rapidly and the long-term care service system in China is being overhauled, it is of pivotal importance to elucidate the economic valuation of informal care in China. Therefore, this paper aims to evaluate the economic value of informal caregiving in Shanghai using the contingent valuation method (CVM) by the least-preferred care tasks, and identify the associated factors of WTP and WTA of caregivers.

Methods

Sample and study design

A total of 439 caregivers were interviewed. Caregivers were recruited through 31 randomly-selected communities in Shanghai. All of the informal caregivers of elderly people applying for social long-term insurance from January 1, 2019, to June 1, 2019 at the Neighborhood Committees of the community were invited to participate. The caregivers must be 16-years-old or older, and have spent at least 1 hour per day performing care tasks for the applicant. Written and verbal consent of participants in the economic valuation project were obtained prior to the interview. On average, the interview took 30 min to complete.

Economic valuation methods

Questionnaire-based face-to-face interviews were conducted by five trained interviewers. The questionnaire was developed for this study, and has not been published elsewhere (see Additional file 1). In order to avoid over-pledging bias, which was observed in other CVM studies [24], the caregivers were interviewed independently without the presence of their care recipients.

Contingent values of WTP and WTA were obtained via open-ended questions. The approach of open-ended questions in CVM studies was controversial in valuing environmental improvements and pollutions [25]. However, it has been successfully employed in several studies valuing informal care in developed countries and regions [16, 26, 27], and empirical evidence shows that this approach can also be employed in low-income countries [14, 28]. Before the CVM questions were proposed, the respondents were first asked to identify which is their least-preferred task among all of the care tasks when taking care of the recipient. Many CVM studies have posed contingent questions in general, for example, by asking what is the maximum/minimum amount of money that the caregiver would be willing to pay/accept for a 1 hour increase/decrease of caring, without specifying the care task. However, considering that one of the main aims of this research is to provide solid empirical evidence for informed long-term care related policy decisions, and the least-preferred task is the service that is most likely to be “contracted out” to the services market and co-paid by the long-term care insurance, especially when the demand for care is managed, the research team decided to obtain WTP and WTA according to the least-preferred tasks. In case of non-responsiveness of the caregivers because of discomfort with sacrificing the benefits of the recipients, we informed the interviewees that the government will send qualified professional caregivers to take care of the recipient. This method has been successfully utilized in several researches [5, 14, 16]. Previous investigations also employed a hypothetical scenario, in which the reduction of caregiving is because the care recipient’s health had improved, or there is a hypothetical recipient [29]. In the pilot study conducted with a small sample of 30 care recipients, these two scenarios were included. However, the improvement of recipients’ health scenario is more cognitively demanding, because most elderly people presented with progressive diseases, in which the passing of time generally leads to deterioration. Moreover, caregivers can also refuse to provide an answer if they do not want to reduce/increase 1 hour of caring per day. The scenario of a hypothetical care recipient was also not adopted, because it is difficult for the real caregivers to imagine taking care of a stranger in WTP questions, and this study only aims to value the informal care provided to real recipients. Examples of the WTP and WTA questions posed are as follows, respectively:

Please imagine that the government is to send professional care workers to provide services. What is the maximum amount of money that you are willing to pay for an hour of reduction in caregiving of your least-preferred care task per week?

Please imagine that the government is going to subsidize the care that you are currently providing to your recipient. How much is the minimum sum that you are willing to accept for this extra hour of your least-preferred care task per week?

Data and statistical methods

Socio-demographic, socio-economic, and health-related data were collected for the analysis of factors associated with WTP and WTA. Age, gender, education, income (per month), and residential information (co-residence) of the caregiver and the recipients were collected. In addition, questions about employment, marital status, and relationship to the recipient of the caregivers were also asked during the interviews. Instead of subjective measures of the health status of the recipients, the Charlson Comorbidity Index (CCI) was employed as an indicator of the health status of the recipients. CCI is an index that assigns each morbidity a score ranging from 1 to 6 according to its severity and resource-intensity [30], in which a higher CCI indicates a worse health status.

Since WTP and WTA were also found to be associated with various care-related factors, the objective and subjective burden of caring, and the availability of other source of care (informal or formal) were also analyzed. The objective burden of caring was measured by care duration and care time spent on caring per week. Care duration was assessed by asking how long the respondent has been taking care of the care recipient. Time spent on caring was measured via 17 tasks, including household activities of daily living tasks (HDL), activities of daily living (ADL) tasks, instrumental ADL (IADL), and supervision and companion time, which is in accordance with the extant literature to provide comparable results [5, 16, 31]. The weekly recall method was adopted for recording the time spent on each task. Respondents were asked how much time they spent on each of the 17 tasks in the previous week. Considering the difficulty in discerning exaggeration in care hours by care tasks, if the total care time exceeded 256 h per week (16 h per day), the questionnaire would be considered as invalid. The subjective burden of caring was assessed using the Caregiver Reaction Assessment (CRA) [32]. There are five subscales of the CRA, each of which measures a dimension by a battery of questions. These dimensions include disrupted schedule, self-esteem, lack of family support, loss of physical strength, and financial problems. The scales were from 1 to 5, in which a higher score indicates a higher subjective burden, except for the subscale of self-esteem, which was coded inversely. Since this study is interested in the impact of specific subjective sources of burden on WTP and WTA, overall subjective burden was not assessed. The availability of other sources of caring were obtained by asking the respondents whether any other people were assisting with taking care of the care recipient, either paid or unpaid.

Descriptive data of continuous variables and categorical variables were presented by means and standard deviations (SDs), and number and percentage, respectively. Univariate and multivariate analysis were performed to explore factors associated with higher WTP and WTA of the caregivers. Student’s t-test, one-way analysis of variance (ANOVA), Chi-square test, and Spearman’s rank correlation test were employed according to the nature of the variables in the univariate analysis. Variables significantly associated with WTP and WTA in the univariate analysis were introduced to the multivariate analysis. The stepwise ordinary least square (OLS) regression method was utilized for the multivariate analysis. A p value < 0.200 was considered statistically significant in the univariate analyses to avoid confounding factors [33], and a p value < 0.050 was adopted for the multivariate analysis. Microsoft Office Excel version 2010 and SPSS version 20.0 were used for data entry and analysis, respectively.

Results

Characteristics of informal caregivers

Among the 439 interviewees, 371 were included in our analysis. Sixty-eight (68) were excluded due to incomplete information (N = 6), under 16-years-old (N = 7), and over 16 h of caring per day (N = 55). The characteristics of the informal caregivers are presented in Table 1. Female caregivers constituted 58.2% (N = 218) of the sample. The average age of the caregivers was 56.67 (SD = 11.77). Most of the caregivers were married (N = 305, 91.6%) and had 7-to-15 years of education (N = 213, 64.4%). Employed, retired, and unemployed accounted for 48.2% (N = 179), 28.6% (N = 106), and 23.2% (N = 86), respectively. 9.3% (N = 123) of the primary caregivers had a monthly income of less than 4000 CNY, 44.4% (N = 139) from 4000 to 6000 CNY, and 51 (16.3%) above 6000 CNY. In addition, a majority of the caregivers were children of the recipients (N = 227, 67.0%), and 26.3% (N = 89) of the primary caregivers were spouses. 3 47.7% (N = 165) of the respondents were living with the recipients. Care recipients were mostly women (N = 238, 64.9%), and the average age was 80.73 (SD = 8.76). Most care recipients had 7-to-15 years of education (N = 203, 57.0%), and had a monthly income below 4000 CNY (N = 184, 50.3%). 19.2% (N = 70) of the elderly people had a CCI equal to zero. 67.1% (N = 245) and 13.7% (N = 50) of the recipients’ CCI were 1 to 2 and above 2, respectively. On average, the caregivers had been taking care of the recipients for 7.44 years (SD = 4.82), and the mean time of caring was 42.89 h (SD = 30.15) per week. Specifically, on average, the caregivers spent 13.19 (SD = 10.76), 8.42 (SD = 7.80), 10.04 (SD = 9.76), and 11.08 (SD = 13.05) hours per week on HDL tasks, ADL tasks, IADL tasks, and supervision and companionship, respectively. Mean scores of subjective burden measured by the CRA were 2.77 (SD = 0.53), 2.64 (SD = 0.69), 2.67 (SD = 0.68), 1.14 (SD = 0.46), and 1.36 (SD = 0.71) for disrupted schedule, self-esteem, family support, physical strength, and financial issue, respectively. 37.7% (N = 140) of the recipients received informal care from caregivers other than the primary caregiver, and 55.8% (N = 207) of the recipients were also using formal care.

Table 1 Characteristics of the caregivers and care recipients

WTP and WTA of informal caregivers

The average maximum WTP and minimum WTA of the respondents were 25.31 CNY (SD = 18.10) and 38.66 CNY (SD = 22.95) for 1 hour of decreasing and increasing in least preferred informal care task per week, respectively (see Table 2). Most caregivers found ADL (N = 126, 38.7%) and IADL tasks (N = 104, 31.9%) to be the least-preferred tasks. The average WTP for ADL and IADL tasks were 26.11 CNY (SD = 19.33) and 22.28 CNY (SD = 14.32) for 1 hour of reduction in caring, respectively. The average WTA of ADL and IADL tasks were 36.58 CNY (SD = 23.19) and 32.07 CNY (SD = 15.36) for one additional hour of the least-preferred task, respectively. Although the least frequent to be identified as the least-preferred task (N = 17, 5.2%), the highest WTP and WTA were found in supervision and companionship tasks, which were 31.25 CNY (SD = 20.35) and 50.00 CNY (SD = 36.33), respectively. Moreover, significant differences of WTP and WTA were found in all four categories of care tasks (p < 0.001), which means, on average, the respondents required significantly higher compensation for one extra hour of the least-preferred task than their WTP for 1 hour of decrease of the same task. There were 3 and 1 respondents who provided us with zero values in WTP and WTA, respectively.

Table 2 WTP and WTA of caregivers according to least-preferred care tasks

Factors associated with economic valuation of informal care giving

Univariate analyses and multi-variate analyses were conducted to identify factors associated with economic valuation of the caregivers. A lower WTP was associated with an older age of the caregiver (Spearman’s correlation = − 0.134, p = 0.032) (see Table 3). Significant differences of WTP were also observed in different education (p = 0.036), employment (p = 0.038), income (p = 0.004), relationship to the recipient (p = 0.002), and co-residence groups of the caregivers (p = 0.022). Regarding the characteristics of the recipient, WTP was significantly correlated with age (Spearman’s correlation = − 0.082, p = 0.200), income (p = 0.063), and CCI (p = 0.198). No caregiving-related variables were found to be significantly associated with WTP. Relationship between the caregiver and the recipient (p = 0.168), age (Spearman’s correlation = − 0.178, p = 0.007), income (p = 0.014), and CCI (p = 0.134) of the recipient were determined to be significantly correlated with WTA of the caregivers. WTA was also significantly associated with several caregiving-related variables, including care time in HDL tasks (Spearman’s correlation = 0.113, p = 0.082), least-preferred care tasks (p = 0.001), CRA scores of disrupted schedule (Spearman’s correlation = 0.105, p = 0.106), lack of family support (Spearman’s correlation = − 0.127, p = 0.046), and loss of physical strength (Spearman’s correlation = 0.133, p = 0.036).

Table 3 Univariate analysis of factors associated with WTP and WTA

The results of the multivariate analysis are presented in Table 4. Lower caregiver income – below 4000 CNY (β coefficient = − 6.145, 95% CI β coefficient = − 10.404 – − 1.886, p = 0.005), being the spouse (β coefficient = − 22.980, 95% CI β coefficient = − 33.264 – − 12.696, p < 0.001) or a child (β coefficient = − 16.555, 95% CI β coefficient = − 26.289 – − 6.821, p = 0.001), and older care-recipient’s age (β coefficient = 0.302, 95% CI β coefficient = − 0.530 – − 0.075, p = 0.010) were significantly associated with lower WTP values. Concerning WTA, higher values were correlated with the least-preferred task being HDL (β coefficient = 13.372, 95% CI β coefficient = 7.434–19.310, p < 0.001) and higher CRA score of disrupted schedule (β coefficient = 5.509, 95% CI β coefficient = 1.621–9.398, p = 0.006). Lower values of WTA were associated with the recipient’s age (β coefficient = − 0.619, 95% CI β coefficient = − 0.928 – − 0.309, p < 0.001), lower care-recipient’s income – below 4000 CNY (β coefficient = − 6.109, 95% CI β coefficient = − 11.564 – − 0.653, p = 0.028), and higher subjective burden of lacking family support (β coefficient = − 9.444, 95% CI β coefficient = − 13.502 – − 5.386, p < 0.001).

Table 4 Multivariate analysis of associated factor with WTP and WTA

Non-responsiveness of WTP and WTA

A major drawback of CVM is the cognitive burden on the caregiver to state a price on caregiving, resulting in high non-responsiveness [15, 34]. Indeed, 12.1% (N = 45) of the respondents did not reveal the least-preferred care task. The rates of non-responsiveness of WTP and WTA were 26.1% (N = 97) and 33.2% (N = 123), respectively. 86 (23.2%) of the respondents did not provide both WTP and WTA. Specifically, 13 (3.5%) of the respondents did not state a WTP, and 37 (10.0%) declined to answer the WTA question.

Discussion

On average, informal caregivers in Shanghai provide approximately 42.89 h per week of caring. This result is substantially higher than that found in similar researches conducted in The Netherlands (27.1 h per week) [5], Thailand (23.7 h per week) [35], the U.S. (17.9 h per week) [36], and India (38.6 h per week) [37]. Although these studies were targeted to measure recipients with varied characteristics, for instance, patients with different illnesses, the large disparity calls for attention. Moreover, since informal caring constitutes the major source of long-term care for the elderly in China [3], it is critical to understand the care burden experienced by these caregivers. According to the results of the current study, we strongly recommend researchers and decision-makers to include all costs and effects, both direct and indirect, in cost-benefit analysis in studies and policy evaluation projects when it involves a potential gain or loss of benefits of informal caregivers. In addition, we assert that CVM can be employed as a technique for valuating informal care in China, since it produces results which are sensitive to the characteristics of caregivers and recipients, and care tasks.

The WTP and WTA of a reduction or an increase of care per week were 25.31 CNY and 38.66 CNY, respectively, which are lower than the price of formal home care in Shanghai, which was 65 CNY, according to informal interviews with several social workers in neighborhood committees. This phenomenon was also observed in several previous investigations [5, 16, 38]. This difference could constitute evidence that providing care can also produce positive effects on caregivers’ utility [39, 40]. Our research also differentiated four categories of tasks, and employed CVM for the least-preferred one. CVM has been criticized for its inability to capture heterogeneity of unobserved preferences [38]. Most previous studies employed general questions by asking the WTP and/or WTA of a reduction or an increase in caring per week without specifying care tasks. Our findings indicated that caregivers had significantly different WTP and WTA, according to care tasks. ADL tasks were the most frequently mentioned least-preferred care task (38.7%). Controlling for other variables, WTP values were not sensitive to specific care task, but WTA values were significantly higher for caregivers whose least-preferred task was HDL.

Disparities between WTP and WTA were also investigated in this study. In our results, the disparities between WTP and WTA were significant for all four care tasks. A higher WTA was observed in numerous other studies [14, 16]. This disparity between WTP and WTA has several underlying explanations. First, it is possible that caregivers did not include process-utility when deciding on the WTA [40]. Second, these caregivers were reluctant, or it was impossible, to reduce their caring time [26, 41]. Third, the disparity could be ascribed to the income effect, in which WTA does not induce a loss in wealth, and no perfect substitution exists [38, 42]. Moreover, studies with different emphases adopted WTP or WTA accordingly. For example, one study concluded that WTA is more appropriate because informal caregivers constitute suppliers in this “market” [5]. In contrast, a study carried out in Malawi recommended WTP because the answers are more realistic when a loss, rather than a gain, is considered [14].

Lower WTP was associated with lower income of the caregiver, caregiver being the spouse or child, and older age of the recipient. Income was the most common factor that was found to be correlated with WTP in almost all of the extant literature [14, 16, 26, 29]. However, relationship to the recipient was not usually an associated factor in economic evaluation investigations. An underlying reason for the lower WTP of the spouses and children, as well as of caregivers taking care for older recipients, is the strong sense of obligation that they feel to provide care, and thus they were reluctant to reduce 1 hour of caring [43].

Lower WTA was determined to be correlated with older age and lower income of the recipient. Former researches have demonstrated that WTA of the caregiver decreases with the recipient’s health condition and inferior socio-economic status [5, 16, 27]. Higher WTA was determined to be associated with the least-preferred task being HDL, lower CRA score for disrupted schedule, and higher CRA score for lacking family support. Increasing 1 hour of domestic help was not a frequent factor associated with higher WTA. Since household work is the most common care task to be “contracted out” to the market in China, its inclusion in economic thinking “makes more sense” with the respondents accepting compensation for caring [26]. However, to test this hypothesis, further investigation is required. According to the results of the multivariate analysis, in general, WTP is associated with the characteristics and relationship of the caregiver and recipient, while WTA seemed to be more sensitive to caring-related variables. This finding may indicate that, when considering payment for formal care, recipients’ decisions seemed to depend on the status of the caregivers and the recipients. However, regarding WTA and a payment from a third party, the caregivers were likely to make judgements based primarily on the caring activities and the recipient per se. However, these statements necessitate further validation.

In previous empirical studies, CVM was found to possess certain problems, such as non-responsiveness, protest answers, anchoring effects, etc. [26, 44,45,46]. Compared to other studies using CVM, the non-responsiveness of the WTP and WTA question in the present investigation is relatively high [5]. Moreover, previous researches have shown that informal caregivers were reluctant to reveal their WTA because it is difficult to make monetary estimations for taking care of their loved ones. This explains the significant higher non-responsiveness of WTA than that of WTP that was observed in our interviews. It has also been argued that WTP questions are insensitive in requiring people to state an amount of money they would want to avoid taking care of the recipients. However, since our respondents were caregivers of recipients applying for long-term care insurance, it is reasonable that the responsiveness to WTP questions was greater than that to WTA questions. Indeed, we employed open-ended questions to elicit the WTP and WTA of caregivers to avoid starting-point bias and the anchoring effect, which were observed in other CVM investigations [38]. Researchers have demonstrated that dichotomous questions and bidding games can induce starting-point bias [5, 16, 27]. Considering the explorative nature of this research, and that non-responsiveness is one of the results in which we were interested, no dichotomous, choice, or bid game was followed by CVM questions. Future studies should consider such techniques to avoid high non-responsiveness of CVM studies conducted in China. Considering the very limited number of zero values in this research, we did not use any technique to detect protest answers. However, since protest answers can considerably affect the validity of results, we recommend that future researches employ follow-up questions, which was reported to be useful in a valuation project conducted in Spain [27], to differentiate ‘protest zeros’ and ‘true zeros’.

There are some limitations of this study regarding sampling and measurements of care time. First, our sample was caregivers of recipients who were applying for social long-term care insurance in Shanghai. This may lead to bias because caregivers of non-applicants were not included. However, according to the application policy, all elderly residents in Shanghai who are 60-years-old or older are entitled to apply, irrespective of actual dependency. Second, a considerable percentage of the informal caregivers were retirees or unemployed. Consequently, the use of the CVM could be biased. However, we assumed that the caregivers decided not to join the labor market because the expected wages were at least not higher than the perceived monetary value of the informal care provided. Third, bias could exist in the measure of informal caring time due to employment of the recall method [47]. We attempted to alleviate this impact by a time-specific method. For example, instead of asking “How many hours did you spend on caregiving last week?”, the interviewers asked “From what time to what time did you perform household work for the recipient”, as well as additional questions, such as “How frequently did you take the recipient outside of the house last week, and how long did it take?” to assist the interviewees to recall accurately. Consequently, future studies should consider using the diary method to produce more robust results of care time. Finally, only the monetary values of the least-preferred tasks were estimated in this study, and it should not be interpreted as the average benefits received by informal caregivers by care tasks. To determine this, interested researchers could consider employing the conjoint valuation method or use CVM for each care task to reach this end.

Conclusions

This paper employed CVM to elicit the WTP and WTA of 371 informal caregivers of dependent elderly applicants for social long-term care insurance in Shanghai, China. The results constituted the first empirical evidence of economic valuation by using CVM of informal care in China, and presented comparable results for international informal care economic valuation studies. The average WTP and WTA of the respondents were 25.31 CNY and 38.66 CNY, respectively. The main factors associated with higher WTP and WTA, and non-responsiveness, were explored. According to the results, decision-makers should take economic valuation results of informal care into account when considering long-term care-related policies.