Modelling and Classification of GC/IMS Breath Gas Measurements for Lozenges of Different Flavours
The composition of exhaled breath contains information about diet, oral hygiene and other environmental influences as well as on the state of health and on medications. Therefore, rapid and sensitive breath analysis would be a helpful tool, for example, for medical diagnosis or therapy control. Ion mobility spectrometry coupled with gas-chromatographic pre-separation (GC/IMS) meet those requirements and can be used to distinguish between healthy and diseased persons or to detect drug usage, for example, based on characteristic exhaled metabolites. So far, the detection of peaks in IMS measurements and the assignment of compounds is done manually and an automated procedure is urgently needed. In this article, we analyse breath gas measurements by GC/IMS from a volunteer having consumed lozenges of 12 different flavours. The IMS measurements are modelled along drift time with an additive model of unimodal regressions to describe each peak. The regressions are afterwards combined across all spectra and all datasets to determine typical peak locations and the respective heights of the peaks in each measurement are inferred. The obtained matrix of peak intensities is then used to classify the measurements into the 12 flavour groups using support vector machines. Since the true class labels are known, we can assess the mis-classification rate using cross-validation.
The financial support of the Bundesministerium für Bildung und Forschung and the Ministerium für Innovation, Wissenschaft und Forschung des Landes Nordrhein-Westfalen is gratefully acknowledged. This work has also been supported by Deutsche Forschungsgemeinschaft (DFG) within the Collaborative Research Center SFB 876 ‘Providing Information by Resource-Constrained Analysis’, Project C4.
- Bunkowski, A. (2012). MCC-IMS data analysis using automated spectra processing and explorative visualisation methods. Ph.D. thesis, Bielefeld University.Google Scholar
- Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning: Data mining, inference, and prediction (2nd ed.)., Springer series in statistics. Berlin: Springer.Google Scholar
- Horsch, S., Kopczynski, D., Baumbach, J.I., Rahnenführer, J., & Rahmann, S. (2015). From raw ion mobility measurements to disease classification: A comparison of analysis processes. PeerJ PrePrints 3, e1294v1.Google Scholar
- Köllmann, C. (2016a). Unimodal spline regression and its use in various applications with single or multiple modes. Ph.D. thesis, TU Dortmund. https://doi.org/10.17877/DE290R-17270.
- Köllmann, C. (2016b). uniReg: Unimodal penalized spline regression using B-splines. http://cran.R-project.org/package=uniReg. R package version 1.1
- Lange, L. (2015). Analyse von GC/IMS-Atemluftmessungen unter Berücksichtigung verschiedener Atemerfrischer. Master’s thesis, Faculty of Statistics, TU Dortmund University.Google Scholar
- R Core Team. (2018). R: A language and environment for statistical computing. R foundation for statistical computing, Vienna, Austria. https://www.R-project.org/