Introduction

Sleep disorders leading to a disruption of sleep are increasingly recognized as significant contributors to a myriad of health issues, impacting the quality of life, overall well-being and long-term health of individuals [1]. Based on current research it is estimated that up to 40% of adults suffer from poor sleep or a sleep disorders, with snoring, obstructive sleep apnea, insomnia and restless-legs syndrome being the most common diseases [2,3,4]. Diagnosing these disorders commonly involves overnight sleep testing, which is crucial to understand sleep patterns and identify potential abnormalities and pathologic events. The current landscape of sleep testing encompasses a variety of methods, including in-laboratory polysomnography (PSG) and home sleep apnea testing (HSAT), which represent the most common methods of overnight sleep testing [5]. While in-laboratory PSG requires admission to a hospital or sleep center and is conducted with supervision of a trained PSG-technician, HSAT or polygraphy is commonly performed at the patients’ home without any surveillance. Though HSAT can be equivalent to an in-laboratory PSG in certain patient populations, this test does not provide direct measures of sleep and cannot be used for differential diagnosis of sleep disorders and hence, HSAT is limited to diagnosis of sleep apnea [6]. Accelerated by the SARS-CoV-2 pandemic, which limited access to in-laboratory PSG due to infection control measures, and enabled by technological developments, overnight sleep testing is increasingly shifting from the sleep laboratory to the patients’ home [5, 7]. With the introduction of modern sleep testing systems, comprehensive sleep test such as home-based PSG have the potential to extend the diagnostic portfolio of sleep medicine. This development is driven by constantly rising demand for sleep testing, as well as decreasing availability of skilled technical staff and rising hospital costs, which makes it increasingly difficult for patients to access overnight PSG testing.

As the field of sleep medicine continues to evolve, it becomes imperative to consider not only the efficacy of sleep diagnostic procedures but also the patient perspective and preferences associated with these tests. Current overnight sleep testing modalities differ not only in their diagnostic capabilities, but also in their convenience, cost, and overall experience they offer to patients. Understanding the factors that influence patient preferences in this context is paramount for tailoring diagnostic approaches that are not only clinically effective but also patient-centered and well accepted [8]. In absence of any evidence on this topic, this study aims to elicit the relevance of attributes of overnight sleep testing and preferences from the patients’ perspective.

Methods

In general, elicitation of preferences, choice behavior and decision-making can be accomplished using two different approaches: revealed preferences, in which actual choices are observed and components of decision-making analyzed ex-post, after a good or service is obtained; and stated preferences, where decision-making is assessed ex-ante, before the consumption takes place, using theoretical choice tasks [9]. Since healthcare services, such as diagnostic tests or medical interventions, are commonly regulated by market authorizations, coverage policies and other insurance provisions, a revealed preference-approach can only be applied to assess choice behavior for technologies that are available and accessible by the respective patient population [10]. Alternatively, stated preference techniques can be used to elicit preferences for hypothetical technologies or healthcare scenarios, and furthermore, allow to identify attributes relevant to the choice decision before the actual decision is made. This enables decision-makers to design patient-centric healthcare services based on requirements and preferences before they are implemented. Various studies have shown, that advanced stated preferences techniques, such as discrete choice experiments (DCE), have a high external validity and can predict choice behavior well [11, 12]. As such, DCE are considered the standard methodology for elicitation of preferences and choice behavior in healthcare, and are thousands of studies have been conducted over the last decades [13].

Experimental design & survey construction

To design a DCE, identification of relevant attributes of the technology or intervention subject to evaluation is required. In absence of any relevant literature on this topic, a set of attributes and levels that reflect the situation in the German sleep care environment was developed with input from three board certified sleep physicians. The attributes were then validated by a group of patients with various sleep disorders. The final set consisted of the attributes Diagnostic accuracy of sleep testing device, personal Costs per test in the form of a co-payment, Effort to apply device at home, Logistics of receiving and returning the diagnostic device, Sleep quality during test, Waiting time to test, and Waiting time to discuss results with the physician. All presented attributes and levels were considered relevant and appropriate by patients and hence the entire set was included in the experimental design (Table 1). To reduce the burden to participants of the study, a d-optimal fractional factorial design was applied, in line with the methodological recommendations for choice analysis using DCE, which decreases the number of choice tasks presented to each patient [14, 15]. Forced choice was applied in order to maximize information generation from the DCE.

Table 1 Attributes and respective attribute levels of overnight at-home sleep testing included in choice tasks

In addition to the DCE part, the survey included demographic information, self-reported symptoms of sleep disorders, prior experience, and items on Willingness-to-pay (WTP) and preferred sleep study location. The survey was administered in German language, and questions, choice tasks and explanations were optimized for patient understanding.

Data collection

Participants were enrolled from two university hospitals in Germany that provide tertiary care to patients with sleep disorders and offer the entire range of sleep medical diagnostic and therapeutic services. Patients presenting to the outpatient department of the clinics with a suspected or confirmed diagnosis of a sleep disorder were invited to participate in the study. Most were referred to the centers by general practitioners or external specialists, which sent requests for consultations and patients commonly receive the first appointment within two to four weeks, in which the diagnostic pathway is defined. After providing written informed consent, participants received the survey in paper formed and provided their answers during their visit to the outpatient department, before they received any information about the planned diagnostic workup to avoid biasing their decision-making. The local ethics committees of both centers approved the study before patients were enrolled.

Data management and analysis

Survey data was collected by the two centers and aggregated centrally for analysis. SPSS software (IBM, New York/USA) was used for data management as well as descriptive statistical analyses. R software with RStudio (r-project.org) was used for analysis of choice data from the DCE. Depending on research questions, hypotheses formulation and data distribution, a range of statistical tests was used, incl. Student’s t-test, Welch-test, Kruskal-Wallis test and Pearson Chi2 test. For management and analysis of DCE data, methodological recommendations from the International Society for Pharmacoeconomics and Outcome Research (ISPOR) were followed [14, 16]. The Wald test was conducted to evaluate the significance of attributes in the choice decisions made by participants [17]. Alpha levels of < 0.050 were considered statistically significant throughout all statistical analyses.

Results

Participants characteristics

From the two centers, a total of 305 consecutive patients were enrolled into the study. From those 305 participants, 288 questionnaires (94.4%) were completed sufficiently and could be included in the final analysis. The remaining 17 patients had to be excluded due to invalid DCE data. Reasons for invalid DCE data and thus exclusion were: DCE not completed; DCE only partially completed; and multiple choices where only single choices were allowed. No statistically significant differences were identified between the characteristics of the total population and patients with complete data sets (Table 2). Patients were predominantly male (65.3%) and on average 54.4 ± 13.1 years old (median = 55.0 years). Most patients reported symptoms of sleep-disordered breathing such as snoring / apneas (83.3%) or daytime sleepiness (23.6%). Other common symptoms mentioned were nightly awakening (54.2%), difficulties falling asleep (21.5%) or restless legs during sleep (19.4%). The majority of participants reported suffering from one (35.4%) or two symptoms (32.3%). The part of the sample which reported symptom snoring / apneas was quite heterogenous, with 51.4% reporting additional frequent nightly awakenings, 24.1% additional daytime sleepiness and 18.8% restless legs. Prior exposure to overnight sleep testing was common, with only 10.4% of patients not having received a sleep study before participating in the study. Of those with previous sleep study experience, HSAT was the most common overnight sleep test, (42.7%) followed by in-laboratory PSG (31.6%). Past experiences with both HSAT and in-laboratory PSG were reported by 15.3% of participants.

Table 2 Characteristics of total sample vs. included sample with complete data sets (SD = standard deviation, HST = Home Sleep Test, PSG = Polysomnography)
Table 3 Coefficients for presented attributes of sleep testing, derived from conditional parameter logit model (log-likelihood = 182.8, AIC = 4216.39)

Discrete choice experiment

DCE data was analyzed using a conditional logit model, from which coefficients for each attribute level were computed, which all showed the expected directions (Table 3) The Wald test was highly significant (97.92 (df = 14), p = < 0.001), suggesting the parameters used in the model provide significant prediction of the outcomes calculated. All alternatives presented in the choice experiment were statistically significant, except Diagnostic accuracy. This should be interpreted as this attribute not having a relevant effect on the choice decisions made by participants. Coefficients of attribute levels, shown in Table 3, represent the marginal utility of each alternative in the choice decision. Positive coefficients have a positive contribution to the choice decision and hence increase the probability that an alternative was chosen due the higher utility assigned to the attribute (Fig. 1). Contrary, negative coefficients have a negative influence on the decision and decrease the probability that an alternative was preferred by participants due to its lower utility.

Fig. 1
figure 1

Coefficients of attribute levels (positive values indicate higher likelihood to choose alternative, negative values indicate lower likelihood to choose alternative)

The most relevant attributes for choice decisions were Waiting time to discuss results, with a coefficient of 0.893, followed by Sleep quality during test and Waiting time to test with coefficients of 0.549 and 0.519 respectively. Diagnostic accuracy and Effort to apply diagnostic device were of least importance (coefficients 0.245 and 0.183). Figure 2 presents the relative importance of each attribute in relation to the others, rescaled from 0 (least important) to 10 (most important).

Fig. 2
figure 2

Relative importance of attributes of overnight at-home sleep testing, re-scaled from 0 (least important) to 10 (most important), all participants

In the DCE, the most preferred at-home overnight sleep test consisted of the following attributes: Diagnostic accuracy = good; Costs = 0€; Effort to apply the device = With help within 15 min; Logistics = received from clinic, return via mail; Sleep quality = 7/10 report good sleep quality; Waiting time to test = 2 weeks; Waiting time to discuss results = Discussion with physician next morning.

Preferred sleep study location

Among patients enrolled in the study a preference of at-home sleep testing (50.7%) over testing in a sleep laboratory (46.9%) was found (Fig. 3). Preference of sleep testing in a hotel room was low with only 1.4% of participants choosing this option. Stratification by age, gender and prior sleep testing experience revealed significant differences in choices, and younger patients (≤ 55 years = 58.4% vs. ˃ 55 years = 43.1%), as well as male patients (male = 54.8% vs. female = 43.0%) and participants without prior experience (without experience = 71.0% vs. with experience = 48.2%) reported higher preferences for at-home testing. Further analysis revealed that patients with a preference for at-home overnight sleep testing were on average 5.6 years younger than those that preferred testing in a sleep laboratory (p = < 0.001).

Fig. 3
figure 3

Preferred location for overnight sleep testing

Willingness-to-pay

Overall WTP was high with 52.4% of the sample stating to spend 100–200€ out-of-pocket for overnight sleep testing, and 25.0% 200–300€ (Fig. 4). Male participants had a significantly higher WTP than female participants (p = < 0.001). The WTP of patients with a location preference of at-home testing was higher as well, compared to those with a preference for testing in a sleep laboratory (p = < 0.001). No differences were found between patients with and without prior sleep testing experience (p = .625). Additional analysis revealed a statistically significant correlation between WTP and symptoms reported, with lower number of symptoms correlating with lower WTP (r = -.132 (95% CI = -0.247 to -0.014); p = .028). No correlation was found between WTP and age (r = .006 (95% CI = -0.113 to 0.125); p = .921).

Fig. 4
figure 4

Willingness-to-pay out of pocket for overnight sleep testing, male vs. female participants

Subgroup analyses

A relatively high Likelihood ratio test statistic provided evidence for heterogeneity in response behavior, which prompted further analyses (Log Likelihood = 182.8 (df = 14), p = < 0.001). To better understand preferences across different subgroups, stratification of choice behavior by age, gender, prior experience, and preferred location for sleep testing was performed. For all parameters, significant differences of preferences were found (Fig. 5). Level coefficients for all subgroup analyses are presented in Tables S1 to S4 in the electronic supplement.

Fig. 5
figure 5

Subgroup analyses of relative attribute importance, re-scaled from 0 (least important) to 10 (most important) and stratified by age (A), gender (B), prior sleep testing experience (C) and preferred location for sleep testing (D)

Stratification by age showed the largest differences between younger (≤ 55 years old) and older participants (> 55 years old) in the relevance of the attributes Costs (0.689 versus 0.209), Sleep quality (1.486 versus 0.434), and Waiting time to test (1.119 versus 0.677).

Between patients with and without prior experience, the attributes Waiting time to test (0.778 versus 2.364) and Costs (0.652 versus 0.055) showed the largest differences.

Subgroup analysis by gender (male versus female) revealed relevant differences in the importance of the attributes Costs (0.318 versus 1.594), Effort to apply device (0.980 versus 0.407), and Waiting time to test (0.807 versus 1.924).

Finally, substantial variances were found between participants preferring at-home vs. in-lab for their sleep study, with Diagnostic accuracy (1.171 versus 0.711), Sleep quality (0.450 versus 1.336), and Costs (0.422 versus 1.034), showing the largest differences between these groups.

Discussion

This study is the first to report preferences for overnight at-home sleep testing in a large sample of patients suffering from sleep disorders. As diagnostic assessments of sleep disorders are increasingly shifting to the patients’ home, the findings of this study can help clinicians and policy makers to optimize patient pathways. As shown in earlier studies, patient perceptions and knowledge of sleep measurements are important drivers of adherence to sleep testing protocols, and can hence impact the effectiveness of medical interventions for sleep disorders [18].

While a broad range of different care models exists, most healthcare systems require a prescription from a sleep physician for coverage of an overnight sleep study. Though HSAT are theoretically limited to diagnosis of sleep-related breathing disorders, they are commonly used as a first-line test to rule out sleep apnea before a more comprehensive sleep assessment is performed. This is not only related to coverage constraints, but increasingly also due to significant limitations of in-laboratory PSG capacity, which can delay access and definitive diagnosis. From that perspective, the DCE designed for this study required patients to make decisions they could encounter in real-life situations.

All attributes presented in this study were statistically significant, except for Diagnostic accuracy. This was not expected a priori since this attribute is closely related to receiving a precise diagnosis. As visible from the coefficients of this attribute, the differences in the levels presented were probably not designed large enough to create a stronger response. This is likely because the worst level still had an acceptable diagnostic accuracy. This may have participants conclude that a test yielding this accuracy level would be sufficient if used under direction of a physician. This also underlines the relevance of the physicians’ role in the diagnostic assessment of sleep disorders and the need for adequate education of the patient on this important topic. Especially with the emergence of consumer healthcare devices and smartphone applications that claim to assess sleep, the value of a high level of diagnostic accuracy of a sleep test, which is properly validated in clinical trials, should be made clear to the patient. This is not only relevant for the assessment of sleep disorders itself, but furthermore to inform the patient about potential negative long-term consequences on their cardiovascular risk profile. With current scientific advancements that demonstrate the benefits of advanced diagnostic tools with greater prognostic value, the relevance of diagnostic accuracy should not be underestimated.

Among the attributes that reached statistical significance, Waiting time to discuss results had the largest coefficients and hence the greatest relative importance for participants. This finding underlines the medical need from the patient perspective and the utility derived from a direct personal consultation without delay. The results presented here are in line with a study conducted in Australia that assessed preferences of OSA-care pathways, in which patients expressed strong preferences for short waiting times [19]. Sleep quality during test had the second most influence on the choice decisions and was especially important to patients with a preference for overnight assessment in the sleep laboratory. In light of a recent study by Colleli et al., that reported worse sleep for some patients depending on their individual chronotype, clinicians may evaluate both preferences and chronotype to identify the ideal overnight assessment for each patient to ensure that the needs are properly met [20].

Given the complexity of some measurement devices, it is not surprising that patients prefer support over self-application, though Effort to apply device was only of moderate relevance in the choice decisions. Interestingly though, the results were independent of prior experience, a parameter that can influence choice behavior significantly as shown previously on sleep therapeutics [21]. Here again, previous studies have shown that usability of the device plays an important role and influences adherence to prescribed overnight sleep testing [18].

In general, a relatively high heterogeneity was found in response behavior, which led to various subgroup analyses. Previous preference studies in the field of sleep disorders observed comparable differences in choice decisions among patients [22,23,24]. As increasingly recognized, sleep disorders are rarely homogenous and patients with a common diagnosis such as OSA often have different underlying endo- or phenotypes causing the disease [25, 26]. For this study in particular, participants reported a wide range of symptoms and will have received various diagnoses after the sleep study, which may explain the observed heterogeneity in responses and thus preferences. The variances that were found after stratification for age, gender, prior experience and preferred sleep study location, underline the need for an individual approach to diagnosis of sleep disorders. With the increasing prevalence of sleep disorders, balancing the requirements of individual patients on the one hand, and the provision of care on population level on the other hand, represents a major challenge to sleep physicians, policy makers and payers though, which is gaining in importance with innovative and expensive treatments for sleep disorders being developed and marketed.

Even though preferences in this study were heterogeneous, the attributes Waiting time to discuss results and Waiting time to test stood out as highly relevant across all subgroups. Given that sleep medical services are a scarce resource in most countries, it underlines the need for a more efficient provision of care and urges institutions in this field to provide the right incentives to ensure attractiveness of sleep medicine for younger physicians [27]. Considering the rising demand for sleep diagnostic testing, this is paramount to ensure sustainable access for patients suffering from sleep disorders to enable timely diagnosis and treatment without delay.

Given the burden of in-laboratory sleep testing, which often requires admission to a hospital, it was not expected that a bit less than half of all participants preferred this type of assessment over an at-home sleep study. Though the average age of patients preferring in-laboratory PSG was significantly higher, stratification of DCE data by age did not reveal any significant differences in the attributes that addressed practical considerations of the sleep study, such as Effort to apply device. Stratification by preferred sleep study location though revealed some important differences in choice behavior, with strong preferences for short waiting times among patients that preferred in-laboratory test over home. Patients may have expected that the entire diagnostic process is shorter when admitted to a hospital with daily presence of physicians directly in the sleep laboratory.

Overall, the WTP for out-of-pocket spending for sleep testing was high, which underscores the medical need experienced by patients due to symptoms of sleep disorders. Especially considering the relatively good coverage of sleep medical services in Germany, which usually requires no or rather low co-payments, these findings emphasize the value patients assign to sleep testing and consequently good sleep itself. This is also supported by the correlation that was observed between the number of symptoms and WTP, in which patients with more symptoms, and presumably higher need for treatment, reported higher inclination for out-of-pocket payments for sleep testing.

Study limitations

Some limitations should be noted for this study, which are largely inherent to the methodology of choice analysis with DCE. First, the attributes of overnight sleep testing that were presented to participants are not exhaustive, and other attributes might will likely also be important for patients. To not overwhelm patients by creating highly complex choice tasks, the list of attributes had to be limited to those seven presented, which is already on the upper end of what is considered appropriate for a DCE. Furthermore, the results of a DCE should be interpreted in the context of the local healthcare system, values as well as cultural background of the participants. By recruiting patients from two sleep centers, based in two states with different sociodemographic backgrounds, the study tried to optimize potential influence from these factors.

Another aspect is that all participants had previous experiences with healthcare services, the majority also with sleep diagnostic procedures. These experiences will have influenced choice behavior and thus the results from the DCE. It was attempted to mitigate this aspect by gathering data on their diagnostic experiences. Due to the length of the questionnaire, and in order not to overwhelm participants, certain information had to be excluded, such as previous or actual waiting times for healthcare services that patients might have experienced. The same holds true for additional medical or demographic information, such as socio-economic status, type of health insurance or education, which may influence decision making. The influence of these factors shall be subject of future studies on this topic.

A further limitation arises from using a stated-preferences over a revealed-preferences approach, which would have been able to evaluate actual choices of patients in need of overnight sleep testing. As patients cannot choose freely in most regulated healthcare systems, due to constraints of for example reimbursement fees, coverage policies and limited availability due to prescription requirements, this approach is commonly not practical in eliciting preferences. Different studies, also in the context of sleep medicine, have shown though, that the stated-preference approach as used in DCE, has a high external validity and can predict choice behavior in real-world situations [11, 12, 28].

Conclusion

To the authors’ knowledge, this study is the first to report patient preferences for overnight at-home sleep testing, a diagnostic tool increasingly utilized for the assessment of sleep disorders. In this study, patients preferred short waiting times for both, the test itself and the following discussion with the physician, as well as good sleep quality during the measurement. Of lower relevance were the effort to apply the diagnostic device as well as the diagnostic accuracy of the test. This finding underlines the need of patient education by their sleep physician, which need to ensure that appropriate technology with sufficient diagnostic accuracy is selected, which is increasingly important given the emergence of consumer health devices that claim to measure sleep. Overall, substantial heterogeneity in choice behavior was found among study participants, which underlines the non-uniformity of sleep disorders and the importance of personalized care pathways. Health care institutions, payers and providers are urged to investigate and consider preferences when designing sleep medical services, to not only meet the increasing demand for overnight at-home sleep testing, but also ensure patient-centricity in the provision of these services.