VOTE versus ACLTE: Vergleich zweier Schnarchgeräuschklassifikationen mit Methoden des maschinellen Lernens

Janott, C.; Schmitt, M.; Heiser, C.; Hohenhorst, W.; Herzog, M.; Carrasco Llatas, M.; Hemmert, W.; Schuller, B.

doi:10.1007/s00106-019-0696-5

VOTE versus ACLTE: Vergleich zweier Schnarchgeräuschklassifikationen mit Methoden des maschinellen Lernens

VOTE versus ACLTE: comparison of two snoring noise classifications using machine learning methods

Originalien
Published: 12 June 2019

Volume 67, pages 670–678, (2019)
Cite this article

HNO Aims and scope Submit manuscript

C. Janott¹,
M. Schmitt²,
C. Heiser³,
W. Hohenhorst⁴,
M. Herzog⁵,
M. Carrasco Llatas⁶,
W. Hemmert¹ &
…
B. Schuller^2,7

411 Accesses
5 Citations
3 Altmetric
Explore all metrics

Zusammenfassung

Hintergrund

Die akustische Analyse von Schnarchgeräuschen ist eine nichtinvasive Methode für die Diagnose von Entstehungsmechanismen des Schnarchens, die während des natürlichen Schlafs durchgeführt werden kann. Ziel der Arbeit ist die Entwicklung und Bewertung von Klassifikationsschemata für Schnarchgeräusche, die eine möglichst aussagekräftige Diagnoseunterstützung ermöglichen.

Material und Methoden

Basierend auf 2 annotierten Schnarchgeräuschdatenbanken mit unterschiedlicher Klassifikation (s-VOTE – 4 Klassen versus ACLTE – 5 Klassen) wurden identisch aufgebaute maschinelle Klassifikationssysteme trainiert. Der Merkmalsextraktor openSMILE wurde in Kombination mit einer linearen Support-Vektor-Maschine zur Klassifikation eingesetzt.

Ergebnisse

Mit einem ungewichteten Average Recall (UAR) von 55,4 % für das s‑VOTE-Modell und 49,1 % für das ACLTE liegen die Ergebnisse auf ähnlichem Niveau. In beiden Modellen gelingt die beste Differenzierung für Epiglottisschnarchen, während velares und oropharyngeales Schnarchen häufiger verwechselt werden.

Schlussfolgerung

Automatisierte akustische Verfahren können bei der Diagnose von Schlafatmungsstörungen unterstützen. Einschränkungen in der Erkennungsleistung sind u. a. durch die begrenzte Größe der Trainingsdatensätze bedingt.

Abstract

Background

Acoustic snoring sound analysis is a noninvasive method for diagnosis of the mechanical mechanisms causing snoring that can be performed during natural sleep. The objective of this work is development and evaluation of classification schemes for snoring sounds that can provide meaningful diagnostic support.

Materials and methods

Based on two annotated snoring noise databases with different classifications (s-VOTE with four classes versus ACLTE with five classes), identically structured machine classification systems were trained. The feature extractor openSMILE was used in combination with a linear support vector machine for classification.

Results

With an unweighted average recall (UAR) of 55.4% for the s‑VOTE model and 49.1% for the ACLTE, the results are at a similar level. In both models, the best differentiation is achieved for epiglottic snoring, while velar and oropharyngeal snoring are more often confused.

Conclusion

Automated acoustic methods can help diagnose sleep-disordered breathing. A reason for the restricted recognition performance is the limited size of the training datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Literatur

Croft C, Pringle M (1991) Sleep nasendoscopy: A technique of assessment in snoring and obstructive sleep apnoea. Clin Otolaryngol Allied Sci 16(5):504–509
Article CAS PubMed Google Scholar
Kent D, Rogers R, Soose R (2015) Drug-induced sedation endoscopy in the evaluation of OSA patients with incomplete oral appliance therapy response. Otolaryngol Head Neck Surg 153(2):302–307
Article PubMed Google Scholar
De Vito A, Carrasco Llatas M, Ravesloot MJ, Kotecha B, De Vries N, Hamans E et al (2018) European position paper on drug-induced sleep endoscopy: 2017 Update. Clin Otolaryngol 43(6):1541–1552
Article PubMed Google Scholar
Heiser C, Fthenakis P, Hapfelmeier A, Berger S, Hofauer B, Hohenhorst W et al (2017) Drug-induced sleep endoscopy with target-controlled infusion using propofol and monitored depth of sedation to determine treatment strategies in obstructive sleep apnea. Sleep Breath 21(3):737–744
Article PubMed Google Scholar
Janott C, Pirsig W, Heiser C (2014) Akustische Analyse von Schnarchgeräuschen. Somnologie 18(2):87–95
Article Google Scholar
Schmitt M, Janott C, Pandit V, Qian K, Heiser C, Hemmert W et al (2016) A bag-of-audiowords approach for snore sounds’ excitation localisation. Proceedings 14. ITG Conference on Speech Communication, IEEE, S 1–5
Google Scholar
Qian K, Janott C, Zhang Z, Heiser C, Schuller B (2016) Wavelet features for classification of VOTE snore sounds. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2016, S 221–225
Google Scholar
Janott C, Schuller B, Heiser C (2017) Acoustic information in snoring noises. HNO 65:107–116
Article CAS PubMed Google Scholar
Qian K, Janott C, Pandit V, Zhang Z, Heiser C, Hohenhorst W et al (2016) Classification of the Excitation Location of Snore Sounds in the Upper Airway by Acoustic Multi-Feature Analysis. Ieee Trans Biomed Eng. https://doi.org/10.1109/TBME.2016.2619675
Article PubMed Google Scholar
Qian K, Janott C, Deng J, Heiser C, Hohenhorst W, Herzog M et al (2017) Snore sound recognition: On wavelets and classifiers from deep nets to kernels. In 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)
Google Scholar
Kezirian EJ, Hohenhorst W, Vries N (2011) Drug-induced sleep endoscopy: the VOTE classification. Eur Arch Otorhinolaryngol 268:1233–1236
Article PubMed Google Scholar
Schuller B, Steidl S, Batliner A, Bergelson E, Krajewski J, Janott C et al (2017) The INTERSPEECH 2017 Computational Paralinguistics Challenge: Addressee, cold & snoring. Proceedings of INTERSPEECH; 2017; Stockholm, Sweden, S 20–24
Google Scholar
Janott C, Schmitt M, Zhang Y, Qian K, Pandit V, Zhang Z et al (2018) Snoring classified: The Munich-Passau snore sound corpus. Comput Biol Med 94:106–118
Article PubMed Google Scholar
Eyben F, Wöllmer M, Schuller B (2010) Opensmile: The munich versatile and fast open-source audio feature extractor. Proceedings of the 18th ACM International Conference on Multimedia, S 1459–1462
Google Scholar
Eyben F, Weninger F, Groß F, Schuller B (2013) Recent developments in opensmile, the munich open-source multimedia feature extractor. Proceedings of the 21st ACM International Conference on Multimedia, S 835–838
Google Scholar
Schuller B, Steidl S, Batliner A, Vinciarelli A, Scherer K, Ringeval F et al (2013) The INTERSPEECH 2013 Computational Paralinguistics Challenge: Social signals, conflict, emotion, autism. IProceedings of INTERSPEECH; 2013; Lyon, S 148–152
Google Scholar
Schuller B, Steidl S, Batliner A, Epps J, Eyben F, Ringeval F et al (2014) The INTERSPEECH 2014 Computational Paralinguistics Challenge: Cognitive & physical load. Proceedings of INTERSPEECH; 2014; Singapore, S 427–431
Google Scholar
Schuller B, Steidl S, Batliner A, Hantke S, Hönig F, Orozco-Arroyave JR et al (2015) The INTERSPEECH 2015 Computational Paralinguistics Challenge: Nativeness, Parkinson’s & eating condition. IProceedings of INTERSPEECH; 2015; Dresden, Germany, S 478–482
Google Scholar
Fan R, Chang K, Hsieh C, Wang X, Lin C (2008) LIBLINEAR: A library for large linear classification. J Mach Learn Res 9:1871–1874
Google Scholar
Qian K, Janott C, Zhang Z, Deng J, Baird A, Heiser C et al (2018) Teaching machines on snoring: A benchmark on computer audition for snore sound excitation localisation. Arch Acoust 43(3):465–475
Google Scholar
Fant G (1970) Acoustic theory of speech production
Google Scholar
Hessel NS, Vries N (2003) Diagnostic work-up of socially unacceptable snoring. II. Sleep endoscopy. Eur Arch Otorhinolaryngol 259:158–161
Article Google Scholar

Download references

Author information

Authors and Affiliations

Munich School of BioEngineering, Technische Universität München, Boltzmannstraße 11, 85748, Garching, Deutschland
C. Janott & W. Hemmert
ZD.B Lehrstuhl für Embedded Intelligence for Health Care and Wellbeing, Universität Augsburg, Augsburg, Deutschland
M. Schmitt & B. Schuller
Hals-Nasen-Ohrenklinik und Poliklinik, Klinikum rechts der Isar, Technische Universität München, München, Deutschland
C. Heiser
Klinik für HNO-Heilkunde, Kopf- und Hals-Chirurgie, Alfried Krupp Krankenhaus, Essen, Deutschland
W. Hohenhorst
Klinik für HNO-Krankheiten, Kopf- und Halschirurgie, Carl-Thiem-Klinikum Cottbus, Cottbus, Deutschland
M. Herzog
Servicio de Otorrinolaringología, Hospital Universitario Doctor Peset, Valencia, Spanien
M. Carrasco Llatas
Department of Computing, Imperial College London, London, Großbritannien
B. Schuller

Authors

C. Janott
View author publications
You can also search for this author in PubMed Google Scholar
M. Schmitt
View author publications
You can also search for this author in PubMed Google Scholar
C. Heiser
View author publications
You can also search for this author in PubMed Google Scholar
W. Hohenhorst
View author publications
You can also search for this author in PubMed Google Scholar
M. Herzog
View author publications
You can also search for this author in PubMed Google Scholar
M. Carrasco Llatas
View author publications
You can also search for this author in PubMed Google Scholar
W. Hemmert
View author publications
You can also search for this author in PubMed Google Scholar
B. Schuller
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to C. Janott.

Ethics declarations

Interessenkonflikt

C. Janott ist der Erfinder eines patentierten Verfahrens und Systems zur Ermittlung anatomischer Ursachen für die Entstehung von Schnarchgeräuschen (DE102012219128B4). M. Schmitt, C. Heiser, W. Hohenhorst, M. Herzog, M. Carrasco Llatas, W. Hemmert und B. Schuller geben an, dass kein Interessenkonflikt besteht.

Für diesen Beitrag wurden von den Autoren keine Studien an Menschen oder Tieren durchgeführt. Für die aufgeführten Studien gelten die jeweils dort angegebenen ethischen Richtlinien.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Janott, C., Schmitt, M., Heiser, C. et al. VOTE versus ACLTE: Vergleich zweier Schnarchgeräuschklassifikationen mit Methoden des maschinellen Lernens. HNO 67, 670–678 (2019). https://doi.org/10.1007/s00106-019-0696-5

Download citation

Published: 12 June 2019
Issue Date: 01 September 2019
DOI: https://doi.org/10.1007/s00106-019-0696-5

Schlüsselwörter

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

VOTE versus ACLTE: Vergleich zweier Schnarchgeräuschklassifikationen mit Methoden des maschinellen Lernens

Zusammenfassung

Hintergrund

Material und Methoden

Ergebnisse

Schlussfolgerung

Abstract

Background

Materials and methods

Results

Conclusion

Access this article

Literatur

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Interessenkonflikt

Rights and permissions

About this article

Cite this article

Share this article

Schlüsselwörter

Keywords

Search

Navigation