Key Points for Decision Makers

The treatment process characteristics of time given by the physicians and the extent to which they attentively listen to the patient are most important and are equally important to patients using acupuncture, homeopathy or general medicine. These process characteristics should be taken into consideration in the debate about more patient-centered healthcare in Germany.

The willingness-to-pay results differed between the groups. Patients in the general medicine group were more cost sensitive than patients in the complementary and integrative medicine groups. The acupuncture and homeopathy patients may be used to out-of-pocket payments, as not all treatments are covered by statutory health insurances in Germany. However, the results have to be interpreted carefully as different patterns of willingness to pay might have existed within the groups.

1 Introduction

The effects of health interventions are often complex and multifaceted. Many researchers argue that the benefits of healthcare interventions comprise more than pure health outcomes such as changes in clinical parameters [1,2,3,4]. These authors claim that ‘biopsychosocial’ factors or the process of treatment lead to so-called ‘benefits beyond health’, which potentially contribute to the patients’ well-being and the overall treatment effect. Aspects of treatment like caring, respecting patients’ dignity and autonomy, and providing information have value to the patient. “There are processes that patients go through to get to the outcomes. These processes are not always and not necessarily ‘utility neutral’” [1]. Quality-adjusted life-years and other measures of health gains do not assess these process characteristics and their utilities [1]. They do not capture the full range of patients’ benefits [5].

A more comprehensive concept of healthcare is often attributed to complementary and integrative medicine (CIM) [6] but is equally applicable to conventional medicine since all patients go through a treatment process [2]. It is necessary to consider these indirect health benefits in order to conduct comprehensive clinical and economic research for both CIM and conventional medicine, which in the end informs decision makers in healthcare [2].

CIM methods are frequently used in Germany. Acupuncture (34.5%) and homeopathy (27.3%) account for the majority of CIM usage [7]. Acupuncture has been shown to be effective for several indications [8]. Since 2007, it has been part of the catalogue of benefits offered by statutory health insurances for some of these medical indications [9]. Homeopathic treatments are partially reimbursed by selective contracts [10]. Nevertheless, homeopathy is discussed critically in science and society. The effectiveness of homeopathic treatments and the mode of action of highly diluted homeopathic medicines are unclear [11, 12]. For both treatment methods, the extent to which indirect health benefits may contribute to the overall treatment effect is not fully understood.

The goal of this study was to investigate indirect health benefits in CIM and conventional medicine. We examined relevant elements of the treatment process from the patients’ perspective and the value they attach to them in outpatient, primary healthcare settings. A discrete choice experiment (DCE) was performed to assess patients’ preferences and the utility resulting from the treatment process and its attributes. The secondary objective was to estimate the patients’ willingness to pay for these attributes. Patients’ preferences were evaluated for three types of treatment: acupuncture, homeopathy and general medicine (GM).

2 Methods

2.1 Discrete Choice Experiments

The DCE, a non-random, cross-sectional survey method, was used to assess patients’ preferences and the related benefits (utilities). In DCEs, participants are presented with a sample of hypothetical scenarios (choice sets) describing the object of investigation (e.g., process of treatment) by several attributes (characteristics) and levels. Based on statistical design principles, choice sets are selected and arranged into choice tasks. Participants are asked to choose the preferred alternative in each choice task [13]. DCEs are based on Lancaster’s [14] economic theory of value. Individuals’ preferences are revealed by the choices they make and are summarized through their utility function. DCE results are then used to model the preferences within a random utility maximization framework [15].

2.2 Development of the DCE Questionnaire

A systematic literature search was conducted in June 2016 using Medline and Embase to identify any relevant literature on DCEs in CIM. Search terms for acupuncture and/or homeopathy were combined with those for conjoint analysis and willingness to pay. We identified one DCE publication on acupuncture [16] and one on homeopathy [17]. To develop a better understanding of the treatment processes, medical consultations in acupuncture and homeopathy were observed by a member of the research group. Additionally, ten outpatient physicians known by our institution were asked via email to name the characteristics of the treatment process and what they believed to be important for their patients. Six semi-structured interviews were conducted with patients who used acupuncture, homeopathy or GM. Patients were asked what had been most relevant for them regarding the treatment process. The received information regarding the process attributes of the previously listed steps was summarized and arranged for the stakeholder involvement. Participants of the stakeholder involvement (three patients, three physicians and three scientists) were asked to evaluate these process attributes regarding their importance and name others if necessary. Nine process attributes were identified altogether: active listening, time, holistic treatment, information, active participation, experience of the physician, technical equipment of the doctor’s practice, structural elements (e.g., waiting time, rooms), and nonverbal communication (e.g., physical contact). The significance and importance of all process attributes were discussed. The stakeholder involvement was audio recorded and a protocol was written. The attributes shown in Table 1 were found to be essential to the participants of the stakeholder involvement and were therefore chosen to be part of the DCE. Process attributes’ levels were chosen according to the Likert-Type Scale Response Anchors [18]. 
Levels for the cost attribute were based on a selective contract for outpatient homeopathy [19] and the doctors’ fee scale for outpatient services within the statutory health insurance (‘Einheitlicher Bewertungsmaßstab’) for GM and acupuncture [20].

Table 1 Attributes: descriptions and corresponding levels

SAS version 9.4 (SAS Institute, Cary, NC, USA) was used to create an experimental design out of 729 possible choice sets (3^6; six attributes with three levels each) [21]. The choice design consisted of 18 choice sets arranged into six choice tasks, each containing three alternatives (forced choice). Figure 1 shows an example of a choice task. The relative D-efficiency of the experimental design was 100%; i.e., the design can be considered balanced and orthogonal [22].
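The size of the full factorial and the number of choice tasks can be checked by enumeration. The following Python sketch is illustrative only and is not the SAS procedure used in the study; the attribute labels are shorthand assumed here, and the levels are placeholder indices rather than the level labels of Table 1.

```python
from itertools import product

# Shorthand attribute labels (assumed for illustration); three levels each.
ATTRIBUTES = ["listening", "time", "holistic", "information", "participation", "cost"]
LEVELS = (0, 1, 2)  # placeholder level indices, not the actual level labels

# Full factorial: every combination of one level per attribute.
full_factorial = list(product(LEVELS, repeat=len(ATTRIBUTES)))

N_CHOICE_SETS = 18          # profiles retained in the reported design
ALTERNATIVES_PER_TASK = 3   # forced choice among three alternatives
n_tasks = N_CHOICE_SETS // ALTERNATIVES_PER_TASK

print(len(full_factorial))  # 729
print(n_tasks)              # 6
```

Selecting the 18 profiles from the 729 candidates is the job of the design software and is not reproduced here.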

Fig. 1 Example of a choice task

The questionnaire contained one choice task with a dominated alternative. It was used to assess patients’ understanding of the attributes and levels. The dominated alternative was described by logically less preferable levels across all attributes. If it was chosen by the patients, they were defined as ‘irrational’ and were excluded from the analyses.
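The logic of the dominance check can be sketched in Python. This is a minimal illustration, assuming that higher process levels (0 = ‘rarely’, 1 = ‘sometimes’, 2 = ‘often’) and lower costs are always preferred; the attribute labels, the euro amounts and the example task are hypothetical and are not taken from the questionnaire.

```python
# Shorthand labels for the process attributes (assumed for illustration).
PROCESS_ATTRIBUTES = ["listening", "time", "holistic", "information", "participation"]

def is_worse_on_all(candidate, other):
    """True if `candidate` has a lower level on every process attribute and a higher cost."""
    process_worse = all(candidate[a] < other[a] for a in PROCESS_ATTRIBUTES)
    cost_worse = candidate["cost"] > other["cost"]
    return process_worse and cost_worse

def dominated_index(task):
    """Index of the first alternative that is worse on all attributes than another one, else None."""
    for i, cand in enumerate(task):
        if any(is_worse_on_all(cand, other) for j, other in enumerate(task) if j != i):
            return i
    return None

# Hypothetical choice task with three alternatives.
task = [
    {"listening": 2, "time": 1, "holistic": 2, "information": 1, "participation": 2, "cost": 20},
    {"listening": 0, "time": 0, "holistic": 0, "information": 0, "participation": 0, "cost": 60},
    {"listening": 1, "time": 2, "holistic": 1, "information": 2, "participation": 1, "cost": 40},
]

chosen = 1  # hypothetical respondent choice
failed_check = chosen == dominated_index(task)
print(dominated_index(task), failed_check)  # 1 True
```

A respondent for whom `failed_check` is true would be the kind of case the study labels ‘irrational’; whether the assumed level ordering holds for every patient is a separate question.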

To describe our study population, the questionnaire contained additional questions on sociodemographic characteristics (age, sex and education), health complaints and the condition of the disease being treated (acute or chronic). Moreover, patients were asked to state if they have used acupuncture, homeopathy or GM simultaneously in order to assess the overlap in the treatment groups.

The first version of the questionnaire was pretested (n = 25) and adapted accordingly. The sample size was calculated according to Orme [23].

$$ \frac{n \times t \times a}{c} \ge 500 \tag{1} $$

where n is the number of respondents, t is the number of tasks, a is the number of alternatives per task, and c is the number of analysis cells (when considering main effects, c is equal to the largest number of levels for any one attribute). For our choice design, t = 6, a = 3, c = 3, which resulted in a minimum of 84 patients per group.
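As a worked check of this rule of thumb, the inequality can be solved for n: n ≥ 500 × c / (t × a) = 500 × 3 / 18 ≈ 83.3, which rounds up to 84. A short Python sketch of the same arithmetic follows; the function name is ours and is used only for illustration.

```python
import math

def min_respondents(t, a, c, threshold=500):
    """Smallest integer n satisfying n * t * a / c >= threshold."""
    return math.ceil(threshold * c / (t * a))

# Values of the choice design described in the text.
print(min_respondents(t=6, a=3, c=3))  # 84
```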

The study was approved by the ethics review board of the Charité-University Hospital Berlin (EA1/320/16) and was registered in the German Clinical Trial Register (DRKS00013160).

2.3 Data Collection and Study Population

Outpatient physicians practicing in Berlin and Munich, Germany, were contacted by email, letter and telephone through the register of the Regional Association of Social Health Insurance-accredited Physicians (‘Kassenärztliche Vereinigung’) and through a list of physicians known by our institution. Physicians were either practicing GM exclusively or had an additional qualification in acupuncture or homeopathy and treated patients with statutory or private health insurance. All participating physicians were informed about the study aim and were asked to distribute the questionnaires to their patients. DCE questions were equal for all patients. The questionnaire was provided online or as a hardcopy. A brochure containing all relevant information about the study was handed out to the patients. Participation was voluntary and anonymous and could be cancelled at any time. According to the physician’s treatment approach, patients were labeled as acupuncture, homeopathy or GM patients.

The following inclusion criteria applied: the patients must be at least 18 years of age and must be undergoing treatment from a participating acupuncturist, homeopath or GM physician. An exclusion criterion was insufficient German language skills.

2.4 Statistical Model and Data Analysis

A conditional logit regression [15] was performed to analyze patients’ preferences regarding the treatment process. In a conditional logit, the probability of choice among the alternatives is related to the attribute levels characterizing the alternatives [24]. SAS was used to estimate the coefficients (preference weights) and the standard errors (SE) for all attribute levels. All process attributes were analyzed as categorical variables, and effects coding (reference category is coded − 1) was used. The cost attribute was coded and analyzed as a linear continuous variable. The regression model was run separately for each of the three study groups. The robust sandwich estimates of Lin and Wei [25] were used to estimate the variance-covariance matrix.
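The two ingredients of this model, effects coding and the logit choice probability, can be sketched in a few lines of Python. This is an illustration and not the SAS estimation used in the study: the choice of ‘rarely’ as the reference level, the coefficient values and the three example alternatives are all assumptions made here.

```python
import math

LEVELS = ["rarely", "sometimes", "often"]

def effects_code(level, reference="rarely"):
    """Two effects-coded columns for a three-level attribute; the reference level is -1 in both."""
    non_ref = [l for l in LEVELS if l != reference]
    if level == reference:
        return [-1.0] * len(non_ref)
    return [1.0 if level == l else 0.0 for l in non_ref]

def choice_probabilities(utilities):
    """Conditional logit: P(j) = exp(V_j) / sum_k exp(V_k) within one choice task."""
    m = max(utilities)  # subtract the maximum for numerical stability
    exps = [math.exp(v - m) for v in utilities]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical coefficients: one process attribute plus a linear cost term.
beta_process = [0.2, 0.5]   # weights for 'sometimes' and 'often'; 'rarely' is implied as -0.7
beta_cost = -0.01           # utility change per euro

# Hypothetical alternatives: (level of the process attribute, cost in euros).
alternatives = [("often", 40), ("sometimes", 20), ("rarely", 0)]

utilities = [
    sum(x * b for x, b in zip(effects_code(level), beta_process)) + beta_cost * cost
    for level, cost in alternatives
]
probabilities = choice_probabilities(utilities)
```

Under effects coding the weight of the reference level is the negative sum of the other weights, so the weights of an attribute sum to zero and each is read relative to the attribute's mean effect.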

Descriptive analysis of the patient characteristics was performed by PASW statistics version 22.0.0.1 (SPSS Inc, Chicago, IL, USA). Figures were plotted using RStudio version 1.1.383 (RStudio, Inc., Boston, MA, USA) and the package ggplot2 [26].

2.5 Interpretation of Regression Results

There are different ways to interpret the results of the regression. The span of the estimated preference weights from one attribute level to the next level of the same attribute reflects the relative importance of that level change; i.e., the longer the distance is, the more important the change (e.g., moving from ‘rarely’ active listening to ‘sometimes’ active listening) [27]. It shows the gain or loss in utility (U∆) for the patients resulting from the change in attribute levels [28].

Trade-offs are another way to look at the results. The magnitude of differences in preference weights or utility can be compared across attributes. One specific trade-off is calculated by the ratio between preference weights of process and cost attributes (willingness to pay) [24].
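The willingness-to-pay ratio can be written as a short function. The sketch below assumes a linear cost coefficient expressed in utility per euro and uses hypothetical values, not estimates from this study.

```python
def willingness_to_pay(delta_utility, cost_coefficient):
    """Money equivalent of a utility change: -(change in utility) / (utility per euro)."""
    if cost_coefficient >= 0:
        raise ValueError("expected a negative cost coefficient")
    return -delta_utility / cost_coefficient

# Hypothetical values: a utility gain of 1.5 and a cost coefficient of -0.01 per euro.
wtp = willingness_to_pay(delta_utility=1.5, cost_coefficient=-0.01)
print(round(wtp, 2))  # 150.0
```

Because the cost coefficient sits in the denominator, small changes in its estimate translate into large changes in the monetary value.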

The ‘raw’ coefficients cannot be directly compared between the groups. They are only comparable within one model, that is, within one group [24]. The attributes can, however, be compared within one group and across groups by calculating the relative importance of each attribute. To do so, the difference in preference weights between the most and least preferred levels of one attribute is compared with the corresponding differences of the other attributes. The level difference of each attribute is then normalized on a 10-point scale (normalized level difference). Ten points reflect the biggest level difference and the most important attribute [27]. Still, confidence intervals for the normalized level difference of the attributes are only meaningful within one group and are not comparable across groups.
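The normalization described here can be sketched as follows. The preference weights in the example are hypothetical, and the treatment of the linear cost attribute (whose level difference would be the coefficient multiplied by the cost range) is left out for brevity.

```python
def normalized_level_differences(weights, scale=10.0):
    """Rescale each attribute's best-minus-worst weight span so that the largest span equals `scale`."""
    spans = {attr: max(w.values()) - min(w.values()) for attr, w in weights.items()}
    largest = max(spans.values())
    return {attr: scale * span / largest for attr, span in spans.items()}

# Hypothetical effects-coded preference weights for two attributes.
weights = {
    "listening": {"rarely": -1.0, "sometimes": 0.2, "often": 0.8},
    "participation": {"rarely": -0.3, "sometimes": 0.0, "often": 0.3},
}

importance = normalized_level_differences(weights)
for attribute, points in importance.items():
    print(attribute, round(points, 2))
```

The attribute with the widest span receives 10 points by construction; all other attributes are expressed as a share of that span.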

3 Results

3.1 Patients’ Characteristics

Patients were recruited by 34 physicians (14 acupuncturists, 9 homeopaths and 11 GM physicians) between January and September 2017. In total, questionnaires were distributed to 608 patients, of whom 285 participated in the survey. The response rate was 47%. A total of 22 questionnaires had to be excluded from the analyses: four patients did not meet the inclusion criteria, 13 did not complete the DCE, and five were defined as ‘irrational’ responses. The ‘irrational’ responses were given by patients from the GM group. These respondents were mostly male (3 men, 1 woman, and 1 ‘not stated’), with a mean age of 48.8 years (SD 16.8) and a mean duration of education of 11.6 years (SD 1.1). Three of these patients reported having a chronic condition.

Altogether, 263 questionnaires were included in the analyses (acupuncture 91, homeopathy 87, and GM 85). With 18 choice sets per subject, the data contained 4734 observations (acupuncture 1638, homeopathy 1566, and GM 1530).
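The sample flow and the observation counts follow directly from the reported figures; a short Python check of the arithmetic, using only numbers stated in this section:

```python
distributed = 608
returned = 285
excluded = 4 + 13 + 5  # inclusion criteria not met, incomplete DCE, 'irrational' responses
groups = {"acupuncture": 91, "homeopathy": 87, "GM": 85}
CHOICE_SETS_PER_SUBJECT = 18

response_rate = returned / distributed
included = returned - excluded
observations = {group: n * CHOICE_SETS_PER_SUBJECT for group, n in groups.items()}

print(round(100 * response_rate))  # 47
print(included)                    # 263
print(observations)                # {'acupuncture': 1638, 'homeopathy': 1566, 'GM': 1530}
print(sum(observations.values()))  # 4734
```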

Table 2 shows the characteristics of the participating patients. All groups contained more women than men, with the largest portion of women in the CIM groups (acupuncture and homeopathy). Patients in the GM group were older, and mean duration of education was longest in the CIM groups. In the acupuncture group, patients were being treated mainly for musculoskeletal disorders and mental illnesses. Almost one-third of the homeopathy patients visited their physician because of mental illnesses. GM patients mostly reported complaints due to cardiovascular diseases. The category ‘other diseases’ included, among others, atopic dermatitis, and rheumatic and autoimmune diseases. More patients in the CIM groups experienced chronic conditions than in the GM group. Almost half of the acupuncture group visited a GM physician at the same time due to either the stated diseases or other conditions. In contrast, there was hardly any overlap in the GM group, and only 5.9% of the GM patients simultaneously visited an acupuncturist.

Table 2 Patient characteristics

3.2 Preference Weights

Results of the conditional logit regression models are presented in Table 3. Coefficient signs indicate whether the attribute levels lead to an increase (positive) or a decrease (negative) in patients’ utility. p values represent the statistical difference between the preference weight of the attribute levels and the mean effect of the same attribute [24]. Model fit values are displayed at the bottom of the table. Figure 2 illustrates the preference weights with 95% confidence intervals (CIs) for all study groups. To simplify the illustration of the results, we decided to present all preference weights in one graph, although the ‘raw’ coefficients are not comparable between the groups. In the following sections, the regression results shown in Table 3 and Fig. 2 are interpreted for each study group.

Table 3 Results of the conditional logit models

Fig. 2 Results of the conditional logit model

3.2.1 Acupuncture

The most important level changes for the acupuncture group were related to the attribute ‘active listening’ (UListen + 1.24), followed by ‘time’ (UTime + 1.13) and ‘holistic treatment’ (UHolistic + 1.07), moving from ‘rarely’ fulfilled to ‘sometimes’ fulfilled. These level changes resulted in the largest utility gain for the patients. Differences in preference weights for an increase in time taken by the doctor and a more holistic treatment approach were similar (both moving from ‘sometimes’ to ‘often’; UTime + 0.57, UHolistic + 0.56). Regarding willingness to pay, the level change in active listening from ‘rarely’ fulfilled to ‘often’ fulfilled (UListen + 1.94) had a monetary value of €252.51.

3.2.2 Homeopathy

Patients of the homeopathy group valued the level change from ‘rarely’ to ‘sometimes’ regarding the time taken by the doctor as most important (UTime + 1.42). Level changes from ‘sometimes’ to ‘often’ in the attributes of active listening and information were equally important to the homeopathy patients (UListen + 0.69, UInfo + 0.68). Willingness-to-pay estimates for the level change in time from ‘rarely’ fulfilled to ‘often’ fulfilled (UTime + 1.92) resulted in a monetary value of €147.58.

3.2.3 General Medicine

For the GM patients, the level changes from ‘rarely’ to ‘sometimes’ in the attributes ‘active listening’ and ‘time’ were valued as most important and as about equally important (UListen + 0.74, UTime + 0.68). Looking at willingness to pay, the level change in active listening from ‘rarely’ fulfilled to ‘often’ fulfilled (UListen + 1.15) had a monetary value of €96.40 for the patients of the GM group.

3.2.4 Group Comparison

Figure 3 shows the relative importance of each attribute with 95% CIs. Active listening and time were the two most relevant attributes for the patients of all groups, whereas the attribute ‘active participation’ was least important. For the CIM patients, the attribute ‘holistic treatment’ seemed to be slightly more relevant. The information provided by the physician appeared to be more important to the patients of the GM group. Comparing the results of the cost attribute, it was most important to the GM patients and least important to the acupuncture patients. For more details, see Appendix Table 1 in the Electronic Supplementary Material.

Fig. 3 Relative importance of attributes

4 Discussion

4.1 Interpretation of the Discrete Choice Experiment Results

The patients of all study groups valued active listening and time as the two most important attributes of the treatment process. These findings may help in understanding the indirect health benefits of the investigated treatment settings. In Western countries, CIM physicians take on average more time for the consultation and the assessment of medical history, as it is often an inherent part of the therapy [29]; for example, diagnostics according to Chinese medicine in acupuncture or comprehensive anamnesis in homeopathy. Considering our results, this may contribute to the effectiveness of acupuncture and homeopathy, as patients gain more value from the treatment process. In contrast with CIM therapies, GM consultations are often shorter in time, and physicians have a higher workload. In general, they see more patients in the same time period than CIM physicians [29]. Potential differences between the groups were seen in the attributes ‘holistic treatment’, which appeared to be slightly more relevant to acupuncture and homeopathy patients, and ‘information’, which seemed to be more important to the GM patients. These group differences may be explained by a different understanding of health and disease, referred to as salutogenesis [30] in CIM and pathogenesis in conventional medicine, but this is only a hypothesis and should be investigated in future research.

Regarding willingness to pay, patients in the GM group attached greater importance to the cost attribute than patients in the CIM groups. A possible explanation could be that patients in the CIM groups are less cost sensitive because they are used to out-of-pocket payments, as not all acupuncture and homeopathy treatments are covered by the catalogue of benefits from statutory health insurances [9, 10]. Nevertheless, the willingness-to-pay results have to be interpreted carefully. Different patterns of preferences regarding the cost attribute might have existed within the groups. As mentioned above, patients in the acupuncture or homeopathy groups might have used different methods of payment, which can influence their willingness to pay. Unfortunately, we could not investigate these potential patterns, as the number of patients in the groups was too small to conduct further analyses, such as a random parameter logit or a latent class analysis [24]. In addition, potential effects of recoding [31] cannot be ruled out. Recoding is a cognitive strategy used by the patients to simplify the evaluation of the cost attribute. Patients are often not used to direct payments for healthcare because costs are either fully or partially covered by their health insurance. Therefore, levels of the cost attribute may have been recoded into categories (e.g., ‘low’, ‘medium’, and ‘high’), which can lead to distorted willingness-to-pay results [31].

4.2 Strengths and Limitations

As far as we know, our study is the first to investigate process utility from a general perspective by comparing the settings of conventional and complementary medicine and considering willingness to pay at the same time.

Nevertheless, our study has several limitations. Concerning DCEs in general, it is questionable to what extent the study results represent real-world decisions. Patients choose between hypothetical treatment processes. Real-world decisions of respondents may consider other medical, financial or emotional consequences [27, 32]. We tried to minimize these potential differences by using qualitative methods, as described in the Methods section, to identify and select the attributes. Another limitation is that DCEs become more complex as the number of attributes and levels rises. The information patients can process is limited. To prevent the use of heuristics [33], researchers have to limit the number of attributes to meet methodological and cognitive requirements [34, 35]. As a consequence, we excluded several attributes in the development stage of the questionnaire. Nonetheless, these attributes can be meaningful to patients and should be mentioned here again: experience of the physician, technical equipment, structural elements (waiting time, rooms, etc.), and nonverbal communication such as physical contact.

Looking at our study in particular, the greatest limitation is the high level of complexity of our choice design comprising three alternatives and six attributes per choice task. Patients are more likely to make use of heuristics to simplify the decision scenarios as the complexity of the choice design rises [36]. Latent class analysis can help to control for these heuristics but would need a higher number of respondents in each group to be conducted [33, 37]. Additionally, potential interactions between the attributes have not been considered. The estimation of interaction effects needs a higher number of respondents. Future studies should take interaction effects into consideration. In particular, the attributes ‘active listening’ and ‘time’ are potentially correlated. Furthermore, the generalizability of the study results to the overall population of Germany is limited because the recruitment was restricted to urban areas. Heterogeneity between the patients from Berlin and Munich is possible (e.g., due to differences in the social structure of the local population) but likely to be small, as both are urban regions. Another limitation is the overlap in the study groups. Patients in the CIM groups (acupuncture 46% and homeopathy 29%) visited GM physicians at the same time. As acupuncture and homeopathy are often used as complementary therapies, it was not possible to separate the groups neatly. However, most results of the sociodemographic characteristics in the CIM groups matched the latest results from the European Social Survey [38].

Another potential limitation of our study is the removal of data from five patients because they were defined as ‘irrational’. Deleting responses from a DCE is discussed critically in the literature [39,40,41]. There are several reasons why this may not be appropriate [39]. Regarding our study, we made the assumption that all attribute levels of the dominated alternative were worse than the attribute levels of another alternative in the same choice task. However, this is only valid if patients prefer a greater fulfilment of the process attributes and lower values in the cost attribute. All process attributes are qualitative attributes. This means that the ordering of the levels can be unclear; for example, is it always better for the patients to participate actively in the treatment process, or do they maybe prefer a more passive role? A greater fulfilment of the process attributes is not necessarily preferred by all patients. Even higher costs can be attributed to a higher quality of care and hence be preferable [39]. Nevertheless, we decided to exclude the patients’ data from our data set because the results for the GM group changed heavily and the model fit criteria worsened noticeably when including the data in our analyses (for more details, see Appendix Table 2 and Fig. 1 in the Electronic Supplementary Material). Moreover, an unclear ordering of attribute levels would be contrary to the results of the qualitative stage of this study.

Process utility is an intangible concept and therefore can be difficult to describe in a few attributes as well as to convey on an emotional and cognitive level. We suggest that upcoming studies should choose a more sophisticated choice design to avoid complexity of the choice tasks; for example, a blocked design with a larger sample size, more concise labels for the attribute levels, or a more appealing graphical representation of the attribute levels in the questionnaire, and that they should focus on a specific disease.

4.3 Comparison with the Existing Literature

There is a limited number of studies with comparable results. DCEs analyzing patients’ preferences of treatment processes usually focus on a specific disease intervention, sometimes comprising clinical outcomes as well [17, 42,43,44]. In contrast, our study analyzed different settings of therapies (acupuncture, homeopathy and GM) and assessed process utility from a more general perspective. To our knowledge, only one study analyzed patients’ preferences of the treatment process comparing a conventional setting with a complementary one. Ratcliffe [17] investigated the preferences of patients with asthma receiving either an outpatient conventional treatment or a homeopathic one. The results showed that the most important attributes for all patients were the ‘time given by the doctor to listen to what the patient has to say’, ‘symptom relief’ and ‘travel costs for a consultation’. Only the homeopathic patients valued the attribute ‘be treated as a whole person’ as important. To some extent, the results are comparable to ours. Our attributes ‘active listening’ and ‘time’ were represented by a single attribute in the Ratcliffe study [17], which patients also valued as most important. Our attribute ‘holistic treatment’ was important to the GM patients as well but seemed to be more important to the patients of the homeopathy and acupuncture groups. ‘Symptom relief’ and ‘travel costs’ were not included in our attributes, and willingness to pay was not estimated by Ratcliffe [17].

A recent systematic review from Bien et al. [44] investigated patients’ preferences for cancer treatment, classifying attributes into outcome, process and cost attributes. Results showed that outcome attributes were most relevant to patients. Process and cost attributes were less often included in DCE studies but were still of importance to the patients. Bien et al. [44] concluded that clinicians and decision makers should be aware that not only outcome attributes but also process and cost attributes have value to patients. This shows that more research on process and cost attributes is needed to capture the full range of patients’ benefits.

5 Conclusion

The time physicians take for the consultation and the extent to which they attentively listen play a major role for patients using acupuncture, homeopathy or GM. This should be taken into consideration within the discussion on providing more patient-centered care in Germany. Our results would support considering a change in the healthcare system towards a strengthening of medical consultations. We suggest giving physicians the opportunity to spend more time with their patients, which may be achieved by changing the general conditions of remuneration (e.g., an improved reimbursement of medical consultations).

Data Availability Statement

The dataset generated and analyzed during the current study is available in the figshare.com repository: https://figshare.com/s/2413f41ef56ca08cc040.