Introduction

Sleep disorders have become a significant public health concern, affecting millions of individuals across the world [1,2,3]. Overnight sleep testing is commonly required to determine the type and severity of sleep disorders, and testing modalities are commonly categorized into four levels, according to the range of signals acquired by the measurement device [4] (Table 1). Based on this classification, a polysomnography (PSG) can be recorded in the sleep laboratory (level I test), which provides most diagnostic information with at least twelve channels, or unattended at home or in the laboratory with at least seven channels (level II test). For both test modalities, direct measures of sleep in the form of electroencephalography (EEG), electrooculography (EOG), and electromyography (EMG) are required, and can thus be used to diagnose a broad range of sleep disorders. Level III (home sleep apnea test or polygraphy) and level IV tests (screening), record cardiorespiratory signals and are indicated for identification and diagnosing sleep-related breathing disorders such as obstructive sleep apnea (OSA). In-laboratory PSG is considered the gold standard diagnostic tool for assessing sleep disorders, which allows to diagnose the entire spectrum of sleep conditions across all age groups [5, 6]. However, the limitations of in-lab PSG, including high costs, limited accessibility, and potential disruption of natural sleep patterns, drove the development and adoption of home-based tests as an alternative diagnostic approach. Recently, level II testing with unattended polysomnography at the patients’ home is increasingly used to overcome barriers present with in-lab PSG.

Table 1 Overview of sleep testing modalities, based on CMS National Coverage Analysis CAG-00405N [4]

Home polysomnography (hPSG) involves the use of portable monitoring devices that enable the acquisition of physiological data during sleep in the patient’s own home environment [7]. Over the past decade, technological advancements have led to the miniaturization of sensors and the development of user-friendly devices, allowing for the unobtrusive measurement of key physiological parameters such as electroencephalography (EEG), electrooculography (EOG), electromyography (EMG), electrocardiography (ECG), and respiratory airflow [8•]. These advances have sparked growing interest in utilizing hPSG as a practical and accurate method for diagnosing a broad range of sleep disorders, including obstructive sleep apnea, insomnia, periodic limb movement disorder, and narcolepsy. Until recently, hPSG systems had to be set up by trained technicians either in the hospital or at the patient’s home, which implies certain logistical challenges and limited scalability of this method. With new technological developments and further miniaturization of measurement devices, self-appliable hPSG systems are now available which could increase utilization of this method for diagnostics of a broad range of sleep disorders at scale.

Though this technology has the potential to improve diagnostic pathways for patients with suspected sleep disorders, several questions have not been addressed so far, which include technical feasibility, patient preferences, and logistical challenges. This systematic review seeks to address these questions to support the assessment of this tool and the potential utility and disutility associated with extending its use to broader populations. A better understanding of the benefits and limitations of hPSG may inform clinicians, researchers, and healthcare policymakers about its potential future role in the provision of sleep diagnostic services.

Methods

This systematic review was conducted to identify studies assessing the technical success of hPSG for the diagnosis of sleep disorders in adults. The research followed the preferred reporting items for systematic reviews and meta-analyses (PRISMA) guidelines to ensure comprehensive and transparent reporting of the review process and results [9].

Research Identification, Data Extraction, and Quality Assessment

A comprehensive literature search was performed in electronic databases including PubMed, MEDLINE, and Google Scholar from inception to August 2023, using a combination of relevant keywords and MeSH terms related to the technical feasibility of hPSG. Development of the search strategy and execution of the research was conducted with the assistance of a trained medical librarian. A detailed description of the search strategy and used keywords and MeSH terms is provided in the online supplement (Table S1).

Three independent reviewers screened the retrieved articles based on titles and abstracts for relevance to the research question. Studies that focused on technical aspects of hPSG, including the quality of recorded signals, data acquisition success rates, patient preferences, and equipment usability were eligible for assessment. The inclusion criteria for the quantitative assessment were defined as follows:

  • Research published up to August 2023 that reported original data on hPSG, acquired with standard technology

  • English language

  • Adult populations (age > 18 years)

Reviews, original articles reporting on pediatric use of hPSG, editorials, case reports, and studies not reporting relevant outcomes were excluded. Furthermore, all studies that used level III, level IV, or other types of measurement devices, that do not record direct measures of sleep as well as experimental technologies were excluded during screening at abstract level. A thorough eligibility check at full-text level was conducted to ensure that no overlapping data was included. Data from eligible studies were extracted by two reviewers using a standardized data extraction form. The extracted data included study characteristics (author, publication year, study design), participant information, sample size, type of equipment used, technical success metrics, and other relevant outcomes.

The methodological quality and risk of bias for each included study were assessed using the quality assessment tool for observational cohort and cross-sectional studies from the National Heart, Blood and Lung Institute (NHBLI) [10]. Discrepancies in quality assessment were resolved through discussion between the researchers until a consensus was reached.

Data Analysis

The technical success rate of hPSG was extracted from eligible research, and a pooled failure rate was analyzed using a random-effects nonlinear model to account for variation between studies. Heterogeneity was assessed by applying the I2 statistic, with values above 50% indicating substantial heterogeneity. Subgroup analyses were conducted based on study characteristics and study patient demographics. Statistical analyses were performed using SPSS (SPSS Statistics 29.0.1, IBM New York, USA) and Review Manager software (RevMan 5.4, The Cochrane Collaboration, 2020). A p-value of < 0.05 was considered statistically significant.

Results

Research Identified From Literature Search

From a total of 399 studies identified through the initial literature search, 30 articles were considered eligible for inclusion in this systematic review based on their relevance to the research question and inclusion criteria (Table 2). The process of research selection is illustrated in the PRISMA flow diagram (Fig. 1).

Table 2 Overview of research included in a systematic review
Fig. 1
figure 1

PRISMA flowchart of literature search

Articles identified were published between 1998 and 2022, with increasing numbers more recently, and the majority from researchers in North America (40%) and Europe (33%). The remaining were conducted in East Asia/Pacific (20%), Latin America (3%), and the Middle East (3%). The total sample sizes amount to 14,465 participants, with individual studies ranging from 10 to 6697 subjects (mean 482 ± 1289 participants).

Data quality of eligible research, assessed with the NHBLI quality assessment tool for observational cohort and cross-sectional studies, was acceptable for all articles and none had to be excluded (Supplement Table S2).

Utilization of hPSG for Diagnosis of Sleep Disorders

The location for the set-up of hPSG systems was either the patient’s home (72%) or the hospital (28%), and the set-up was most often conducted by a trained technician (82%). Self-application by patients was used in 18% and emerged only in more recent studies, with the first article on self-applied hPSG published in 2017. The most common deployment model for hPSG was at-home application by a technician (58%), followed by technician in-hospital set-up (31%) and at-home self-application by the patient (11%). Across the studies, a wide range of PSG systems were utilized, which reflects the variety of devices available in the market. The devices used most often were the Nox A1 (Nox Medical, Reykjavik, Iceland) in 13%, and the Embletta X100 (Natus Medical, Pleasant View, USA) in 10% of studies, followed by Medatec Pamela V, Mallinkrodt Minisomno and Compumedics Safiro/S-series (7% each).

Per protocol, the majority of studies recorded one night (60.0%), followed by two nights (6.7%), three nights (3.3%), or a success-dependent approach with up to two nights (23.3%), up to three nights (3.3%) and up to five nights (3.3%). Thirty-three percent of studies (n = 10) allowed repetition of recording in case of failures, which was reported in 4.5% of studies on average (range 0–10%).

In the studies identified for this systematic review, hPSG was used to diagnose a broad range of sleep disorders, ranging from sleep-disordered breathing to general sleep complaints, insomnia, and bruxism. The populations enrolled represented a relatively typical population presenting with sleep conditions. Patients among all studies were on average 51 ± 8 years of age, predominantly male (57 ± 27%), and moderately overweight with a body mass index of 31 ± 5 kg/m2.

Technical Feasibility of hPSG

The pooled technical failure rate of home polysomnography across the eligible studies was estimated to be 7.8% (95% CI 5.5–10.1%), ranging from 0 to 23.4% (Fig. 2). Considerable heterogeneity was observed among the included studies (I2 = 97%), indicating high potential variability in technical success rates. Further analyses revealed no statistically significant correlations of hPSG study success rates with the variables age (r = 0.074, p = 0.713), body mass index (r =  − 0.044, p = 0.848), male gender (r = 0.292, p = 0.157) and sample size (r =  − 0.090, p = 0.635). For the three deployment models the following pooled failure rates were estimated: at-home application by technician = 5.8% (95% CI 3.7–7.9%); in-hospital application by technician = 10.0% (95% CI 3.1–16.8%) and at-home application by patient = 11.1% (95% CI 4.7–17.5%). No statistically significant correlation between the number of nights recorded and the reported technical failure rate was found (r (28) = 0.133; p = 0.483).

Fig. 2
figure 2

Forest plot of individual and pooled technical failure rate with hPSG (failure rate ± SE)

No adverse effects or complications from hPSG were reported by any of the studies included in this review.

Subgroup analyses were conducted for set-up location (home vs. hospital application of hPSG system) and for set-up person (technician vs. patient application of hPSG system). A difference in the technical failure rate was detected between home and hospital set-up (7.1 vs. 9.9%, Fig. 3), which was not statistically significant though (p = 0.171). A non-significant difference in technical failures was found between technician- and patient-applied hPSG (7.2 vs. 10.1%, p = 0.896, Fig. 4).

Fig. 3
figure 3

Pooled technical failure rate with hPSG—home- vs. hospital-applied PSG (p = .171)

Fig. 4
figure 4

Pooled technical failure rate with hPSG—technician- vs. patient-applied PSG (p = .896)

Reasons for Technical Failure

Since only a few studies used common criteria to determine the outcomes of hPSG recording, failure reasons were extracted by estimating the proportion of studies that mention the respective failure mode. Using this methodology, four major sources of hPSG failure could be identified: EEG, SpO2, airflow, and respiratory belts (Fig. 5). Twenty percent of studies did not report failure reason of home sleep studies. Differences in the occurrence and distribution of failure modes across deployment models of hPSG could not be identified.

Fig. 5
figure 5

Reasons for technical failures with hPSG (% of identified research that mentions failure reason)

Patient Preferences for hPSG

Though not a primary outcome in any of the studies included in this review, preferences of participants towards PSG diagnostics were assessed by 11 of 30 articles resulting in a total sample of 874 patients. The mean proportion of study participants preferring hPSG over in-lab PSG in those cohorts was 56 ± 22%, ranging from 28 to 95%. Further differentiation of preferences by subgroups could not be conducted due to limitations in the quality and quantity of available data.

Discussion

Current diagnostic approaches in sleep medicine are either limited in scalability, as in the case of in-lab PSG due to its high costs and resource intensity or offer limited diagnostic information, like with HSAT or consumer sleep trackers, which do not include direct measures of sleep such as EEG, EOG, and EMG signals. With recent developments in sleep research and their potential for improved and individualized approaches to sleep disorders, there is a need for a scalable diagnostic tool that records the signals required for advanced analyses of sleep disorders, such as phenotypization or endotypization, and supports an accurate analysis of direct sleep measures. The aim of this research was to assess the existing evidence of the technical feasibility of hPSG as a method to acquire polysomnographic signal sets outside of the clinic.

While hPSG was first mentioned in the medical literature more than three decades ago and has been used largely in clinical studies, its application in routine sleep medical practice is limited in most healthcare systems, mainly due to logistical and reimbursement-related reasons. With the rising prevalence of sleep disorders and steadily increasing demand for sleep diagnostics, hPSG may offer advantages over in-lab polysomnography as well as over simple home sleep tests [8•]. In light of the current challenges of sleep clinics across the globe to recruit and retain trained technicians for overnight monitoring in the sleep laboratory, shifting PSG towards the home could help increase the capacity of sleep programs and thus ensure access to advanced sleep diagnostics. This is even more important for the increasing populations of patients with non-OSA sleep disorders for which HSAT is not an appropriate substitute of in-lab PSG. With recent advances in the diagnostic assessment of insomnia, for example, the relevance of PSG could eventually further increase, potentially widening access issues [40•].

The results of this systematic review demonstrate that utilization of PSG at home is technically feasible and safe with low failure rates in adult populations, independent of the deployment model and whether the PSG system is applied by a technician or self-applied by the patient. Though certain variabilities of technical failure rates were found, hPSG overall allows reliable and robust signal acquisition. These findings are supported by results from an earlier review by Bruyneel and Ninane [7], in which they demonstrated a high data quality, high diagnostic accuracy, and good agreement between hPSG and in-lab PSG in six randomized cross-over trials. Though data is limited on the technical success rates with HSAT, the available literature suggests comparable outcomes with the more simplistic level III or level IV tests. For peripheral arterial tonometry, a HSAT that is increasingly used, failure rates between 0 and 19% have been reported [41,42,43,44]. The preferences elucidated in some of the studies suggest that conducting PSG at home is not only technically reliable, but also well accepted by patients, especially when the set-up is done at home [28]. In addition, a sleep recording in the comfort of the own home could also lead to a more precise picture of the natural sleep and potentially a more accurate diagnosis [45].

It is important to highlight, that though hPSG may reduce the burden on the sleep clinic and its staff, it is not free of operational challenges that need to be considered. Currently, the most common deployment model requires a technician to drive to the patient’s home to set up the system and collect the device in the morning after the recording. This approach not only has a relevant logistical complexity, but it also increases costs and the ecological footprint of the sleep test. In addition, contrary to attended in-lab PSG, electrode detachments which can happen during sleep, cannot be easily corrected with hPSG. Telemonitored at-home PSG with real-time data transmission to a data center that observes signal recording and intervenes via phone or video call, could be an opportunity to reduce signal losses or incomplete recordings [15, 23]. Recent developments towards patch-based hPSG systems, conceptionally may help reduce signal losses and improve data quality by increasing electrode adhesion and reducing the use of wires to transmit signals [46,47,48]. Those concepts need to be assessed in clinical routine and are subject of ongoing trials.

Given the shortage of trained technicians to support PSG operations in the lab and at home, current developments in the field of self-appliable PSG systems present an interesting opportunity to reduce the burden of sleep clinic staff. Though not all patients needing a sleep study will be able to apply devices themselves, early data support this concept [49]. Further miniaturization of sensors and improvements of device usability could increase the number of eligible populations.

Using hPSG for diagnosing a wide array of sleep disorders outside of the hospital may have also relevant positive economic implications. By enabling patients to conduct PSG within their own homes, this tool has the potential to meaningfully reduce the financial burden associated with clinical-grade sleep diagnostics, which traditionally involve substantial costs related to facility usage, staffing of overnight shifts, and equipment maintenance. Increased utilization of hPSG could alleviate these costs, leading to decreased healthcare expenditures and, moreover, to increased accessibility of sleep diagnostics and thus earlier identification and intervention for sleep disorders [50, 51]. In healthcare systems with limited budgets, lower costs for sleep diagnostics may also allow the allocation of greater financial resources towards treatment, treatment monitoring, and chronic care of patients with sleep disorders, and thus leading to improved overall outcomes.

Limitations

A few limitations are important to mention to the reader to reflect the results of this analysis. First of all, though extensive efforts were undertaken to identify all literature, additional studies with information relevant to the research question could be missed. Given the scope of the literature search and the results of the analysis, the potential impact should be neglectable. Within the studies identified, a variable quality was found, and only a few applied a randomized controlled design, which influences the evidence level that could be generated from the analysis. Furthermore, the technical failure rate calculated as the primary outcome of this research is an aggregated point estimate, which is statistically not precise due to the considerable heterogeneity present in the underlying data.

To estimate the value of hPSG for the diagnosis of sleep disorders comprehensively, the technical success rate and the diagnostic accuracy only reflect the input side. It is essential to dive deeper into the decision-making process to understand how clinicians use the information provided from hPSG in comparison to those derived from in-lab PSG and if downstream treatment outcomes vary, depending on which diagnostic tool was used. The authors were not able to identify any published research on this topic, so this represents an opportunity for future research.

In addition, a relevant heterogeneity in reporting outcomes of hPSG and success criteria was observed across the studies. For example, in the absence of a common definition of technical failure or sleep study success, a variety of metrics was employed in the different studies to assess outcomes, which differ as well depending on the individual study objectives and the clinical context. As such, in studies of populations with sleep-disordered breathing, oximetry signals of less than 4 h might be considered a failure, while this would be of lower relevance in a study on patients suffering from insomnia. On the other hand, a failed EEG or EOG recording might not lead to a failed study in an OSA population, as long as other relevant metrics would allow to estimate respiratory or desaturation indices.

To ensure an accurate assessment, it would be beneficial to agree on a reporting guideline with core metrics that are applied and presented in any research on sleep diagnostic tools. This is particularly important to the outcomes of this study, since a few articles included reported a study as failure only when a recording could not be obtained in the second or third attempt, which can skew the results. Other areas of medicine have adopted this approach already, which supports thorough assessment of healthcare technologies by harmonizing outcome reporting.

Conclusion

With the expected increasing demand for sleep diagnostics and limited resources for in-lab polysomnography, driven by increased awareness for sleep and greater utility of polysomnography, hPSG has the potential to secure and improve access to clinical-grade sleep diagnostics. From the data included in this systematic review, it can be concluded that hPSG has a low rate of technical failures and is safe to use in different care settings. independent of set-up location or set-up person. The most common failure reasons are related to signal acquisition during the night, which could be improved with further optimization of sensor technology. Further research is required to understand the decision-making process of physicians when using this tool in comparison with in-lab polysomnography.