Abstract
Purpose
This study aims to develop sleep apnea screening models with overnight SpO2 data, and to investigate the impact of the SpO2 data granularity on model performance.
Methods
A total of 7,718 SpO2 recordings from the SHHS and MESA datasets were used. Probabilistic ensemble machine learning was employed to predict sleep apnea status at three AHI cutoff points: ≥ 5, ≥ 15, and ≥ 30 events/hour. To investigate the impact of data granularity, SpO2 data were aggregated at 30, 60, and 300 s.
Results
Our models demonstrated good to excellent performance on internal test, with average area under the curve (AUC) values of 0.91, 0.93, and 0.96 for cutoffs ≥ 5, ≥ 15, and ≥ 30 at data granularity of 1 s, respectively. Both sensitivity (0.76, 0.84, 0.89) and specificity (0.87, 0.86, 0.90) ranged from good to excellent across three cutoffs. Positive predictive values (PPV) ranged from excellent to fair (0.97, 0.83, 0.66), and negative predictive values (NPV) ranged from low to excellent (0.43, 0.87, 0.98). Model performance on external test slightly dropped compared to internal test, but still achieved good to excellent AUC above 0.80 across all data granularity and all the three cutoffs. Data granularity of 300 s led to a reduction in performance metrics across all cutoffs.
Conclusion
Our models demonstrated superior performance across all three AHI cutoff thresholds compared to existing large sleep apnea screening models, even when considering varying SpO2 data granularity. However, lower data granularity was associated with decreased screening performance, indicating a need for further research in this area.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
The SHHS and MESA datasets used in this study are available at https://sleepdata.org/datasets with the permission of NSRR.
References
Senaratna CV et al (2017) Prevalence of obstructive sleep apnea in the general population: a systematic review. Sleep Med Rev 34:70–81
Adam VB et al (2019) Estimation of the global prevalence and burden of obstructive sleep apnoea: a literature-based analysis. Lancet Respir Med 7(8):687–698
Pivetta B, Chen L, Nagappa M et al (2021) Use and performance of the STOP-Bang questionnaire for obstructive sleep apnea screening across geographic regions: a systematic review and meta-analysis. JAMA Netw Open 4(3):e211009. https://doi.org/10.1001/jamanetworkopen.2021.1009
Mendonça F, Mostafa SS, Ravelo-García AG, Morgado-Dias F (2018) Penzel T (2018) Devices for home detection of obstructive sleep apnea: a review. Sleep Med Rev 41:149–160
Rodrigues J, Pepin JL, Goeuriot L, Amer-Yahia S (2020) An extensive investigation of machine learning techniques for sleep apnea screening. In: Proceedings of the 29th ACM International Conference on Information and Knowledge Management, Virtual Event Ireland, France
Wang S, Xuan W, Chen D et al (2023) Machine learning assisted wearable wireless device for sleep apnea syndrome diagnosis. Biosensors 13(4):483. https://doi.org/10.3390/bios13040483
Li Z, Li Y, Zhao G, Zhang X et al (2021) A model for obstructive sleep apnea detection using a multi-layer feed-forward neural network based on electrocardiogram, pulse oxygen saturation, and body mass index. Sleep Breath 25(4):2065–2072. https://doi.org/10.1007/s11325-021-02302-6
Wei K, Zou L, Liu G, Wang C (2023) MS-Net: sleep apnea detection in PPG using multi-scale block and shadow module one-dimensional convolutional neural network. Comput Biol Med 155:106469. https://doi.org/10.1016/j.compbiomed.2022.106469
Liang Z (2023) Novel method combining multiscale attention entropy of overnight blood oxygen level and machine learning for easy sleep apnea screening. Digit Health 2023:9. https://doi.org/10.1177/20552076231211550
Xie B, Minn H (2012) Real-time sleep apnea detection by classifier combination. IEEE Trans Inf Technol Biomed 16(3):469–477. https://doi.org/10.1109/TITB.2012.2188299
Lin CY, Wang YW, Setiawan F et al (2021) Sleep apnea classification algorithm development using a machine-learning framework and bag-of-features derived from electrocardiogram spectrograms. J Clin Med 11(1). https://doi.org/10.3390/jcm11010192
Bhattacharjee A, Saha S, Fattah SA et al (2019) Sleep apnea detection based on Rician modeling of feature variation in multiband EEG signal. IEEE J Biomed Health Inform 23(3):1066–1074. https://doi.org/10.1109/JBHI.2018.2845303
Bahrami M, Forouzanfar M (2022) Sleep apnea detection from single-lead ECG: a comprehensive analysis of machine learning and deep learning algorithms. IEEE Trans Instrum Meas 71:1–11. https://doi.org/10.1109/TIM.2022.3151947
Liang Z, Ploderer B (2020) “How does Fitbit measure brainwaves”: a qualitative study into the credibility of sleep-tracking technologies. Proc ACM Interact Mob Wearable Ubiquitous Technol 4(1):1–29. https://doi.org/10.1145/3380994
Liang Z, Chapa-Martell MA (2021) A multi-level classification approach for sleep stage prediction with processed data derived from consumer wearable activity trackers. Front Digit Health 3:665946. https://doi.org/10.3389/fdgth.2021.665946
Liang Z, Chapa-Martell MA (2019) Accuracy of Fitbit wristbands in measuring sleep stage transitions and the effect of user-specific factors. JMIR mhealth uhealth 7(6):e13384. https://doi.org/10.2196/13384
Lin HC, Su CL, Ong JH et al (2020) Pulse oximetry monitor feasible for early screening of obstructive sleep apnea (OSA). J Med Biol Eng 40:62–70. https://doi.org/10.1007/s40846-019-00479-6
Rodrigues Filho JC, Neves DD, Velasque L et al (2020) Diagnostic performance of nocturnal oximetry in the detection of obstructive sleep apnea syndrome: a Brazilian study. Sleep Breath Physiol Disord 24:1487–1494. https://doi.org/10.1007/s11325-019-02000-4
Quan SF, Howard BV, Iber C et al (1997) The Sleep Heart Health Study: design, rationale, and methods. Sleep 20(12):1077–1085
Chen X, Wang R, Zee P, Lutsey PL, Javaheri S, Alcántara C, Jackson CL, Williams MA, Redline S (2015) Racial/ethnic differences in sleep disturbances: the Multi-Ethnic Study of Atherosclerosis (MESA). Sleep 38(6):877–888
Zhang GQ, Cui L, Mueller R et al (2018) The National Sleep Research Resource: towards a sleep data commons. J Am Med Inform Assoc 25(10):1351–1358. https://doi.org/10.1093/jamia/ocy064
Bernardini A, Brunello A, Gigli GL et al (2022) OSASUD: a dataset of stroke unit recordings for the detection of obstructive sleep apnea syndrome. Sci Data 9:177. https://doi.org/10.1038/s41597-022-01272-y
Berry R, Brooks R, Gamaldo C et al (2017) The AASM manual for the scoring of sleep and associated events: rules, terminology and technical specifications, version 2.4 edn. American Academy of Sleep Medicine, Darien
Kwak N, Choi CH (2003) Feature extraction based on ICA for binary classification problems. IEEE Trans Knowl Data Eng 15(6):1374–1388. https://doi.org/10.1109/TKDE.2003.1245279
van Rijn J, Hutter F (2018) Hyperparameter importance across datasets. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '18). pp 2367–2376. https://doi.org/10.1145/3219819.3220058
Brodersen KH, Ong CS, Stephan KE, Buhmann JM (2010) The balanced accuracy and its posterior distribution. In: Proceedings of the 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010
Levy J, Alvarez D, del Campo F, Behar J (2023) Deep learning for obstructive sleep apnea diagnosis based on single channel oximetry. Nat Commun 14(1):4881. https://doi.org/10.1038/s41467-023-40604-3
Holfinger SJ, Lyons MM, Keenan BT et al (2022) Diagnostic performance of machine learning-derived OSA prediction tools in large clinical and community-based samples. Chest 161(3):807–817. https://doi.org/10.1016/j.chest.2021.10.023
Shi Y, Zhang Y, Cao Z et al (2023) Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults. BMC Med Inform Decis Mak 23:1–15. https://doi.org/10.1186/s12911-023-02331-z
Banerjee I, Gensheimer MF, Wood DJ et al (2018) Probabilistic prognostic estimates of survival in metastatic cancer patients (PPES-Met) utilizing free-text clinical narratives. Sci Rep 8(1):10037. https://doi.org/10.1038/s41598-018-27946-5
Pevernagie DA, Gnidovec-Strazisar B, Grote L et al (2020) On the rise and fall of the apnea-hypopnea index: a historical review and critical appraisal. J Sleep Res 29(4):e13066. https://doi.org/10.1111/jsr.13066
Acknowledgements
The author would like to express gratitude to the National Sleep Research Resources (NSRR) for granting access to the SHHS and MESA datasets.
Funding
Japan Society for the Promotion of Science (JSPS) provided financial support in the form of KAKENHI (Grant Number 21K17670) funding. The sponsor had no role in the design or conduct of this research.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Ethical approval
This article does not contain any studies with human participants performed by the author. Access to the SHHS dataset was granted by the NSRR.
Conflict of interest
All authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest (such as honoraria; educational grants; participation in speakers' bureaus; membership, employment, consultancies, stock ownership, or other equity interest; and expert testimony or patent-licensing arrangements), or non-financial interest (such as personal or professional relationships, affiliations, knowledge or beliefs) in the subject matter or materials discussed in this manuscript.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liang, Z. Developing probabilistic ensemble machine learning models for home-based sleep apnea screening using overnight SpO2 data at varying data granularity. Sleep Breath (2024). https://doi.org/10.1007/s11325-024-03141-x
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11325-024-03141-x