Artificial Neural Network Analysis of Volatile Organic Compounds for the Detection of Lung Cancer
Lung cancer is a widespread disease and it is well understood that systematic, non-invasive and early detection of this progressive and life-threatening disorder is of vital importance for patient outcomes. In this work we present a convergence of familiar and less familiar artificial neural network techniques to help address this task. Our preliminary results demonstrate that improved, automated, early diagnosis of lung cancer based on the classification of volatile organic compounds detected in the exhaled gases of patients seems possible. Under strictly controlled conditions, using Selected Ion Flow Tube Mass Spectrometry (SIFT-MS), the naturally occurring concentrations of a range of volatile organic compounds in the exhaled gases of 20 lung cancer patients and 20 healthy individuals provided the dataset that has been analysed. We investigated the performance of several artificial neural network architectures, each with complementary pattern recognition properties, from the domains of supervised, unsupervised and recurrent neural networks. The neural networks were trained on a subset of the data, with their performance evaluated using unseen test data and classification accuracies ranging from 56% to 74% were obtained. In addition, there is promise that the topological ordering properties of the unsupervised networks’ clusters will be able to provide further diagnostic insights, for example into patients who may have been heavy smokers but so far have not presented with any lung cancer. With the collection of data from a larger number of subjects across a long time period there is promise that an automated assistive tool in the diagnosis of lung cancer via breath analysis could soon be possible.
KeywordsLung cancer diagnosis Volatile organic compounds SIFT Artificial neural network analysis
We gratefully acknowledge funding from the Slater & Gordon Health Projects & Research Fund/14/15 Round 1/A34489.
- 1.Lewis, D.R., Chen, H.S., Feurer, E.J., et al.: SEER Cancer Statistics Review, 1975–2008. MD National Cancer Institute, Bethesda (2010)Google Scholar
- 2.Comella, P., Frasci, G., Panza, N., et al.: Randomised trial comparing cisplatin, gemcitabine and vinorelbine with either cisplatin and gemcitabine or cisplatin and vinorelbine in advanced non small cell lung cancer: interim analysis of a Phase III trial of the Southern Italy Cooperative Oncology Group. J. Clin. Oncol. 18, 1451–1457 (2000)CrossRefGoogle Scholar
- 3.Amann, A., Smith, D. (eds.): Breath analysis for clinical diagnosis and therapeutic monitoring. World Scientific Publishing Co., Singapore (2005)Google Scholar
- 8.Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning Internal Representations by Error Propagation Parallel Distributed Processing, Explorations in the Microstructure of Cognition, vol. 1. MIT Press, Cambridge (1986)Google Scholar
- 10.Choudhury, T., Kumar, V., Nigam, D., Mandal, B.: Intelligent classification of lung & oral cancer through diverse data mining algorithms. In: 2016 International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE), pp. 133–138. IEEE (2016)Google Scholar
- 12.Adetiba, E., Adebiyi, M.O., Thakur, S.: Breathogenomics: a computational architecture for screening, early diagnosis and genotyping of lung cancer. In: Rojas, I., Ortuño, F. (eds) Bioinformatics and Biomedical Engineering, IWBBIO 2017. Lecture Notes in Computer Science, vol. 10209. Springer, Cham (2017)Google Scholar
- 13.Jaeger, H.: The “echo state” approach to analysing and training recurrent neural networks-with an erratum note. Bonn, Germany: German National Research Center for Information Technology GMD Technical report, vol. 148(34), p. 13 (2001)Google Scholar
- 15.Verstraeten, D., Schrauwen, B., Stroobandt, D.: Reservoir-based techniques for speech recognition. In: IEEE International Joint Conference on Neural Networks (IJCNN 2006), pp. 1050–1053 (2006)Google Scholar
- 19.Emmerich, C., Reinhart, R.F., Steil, J.J.: Recurrence enhances the spatial encoding of static inputs in reservoir networks. In: Diamantaras, K., Duch, W., Iliadis, L. (eds) Artificial Neural Networks - ICANN 2010, vol. 6353, pp. 148–153. Springer, Heidelberg (2010)Google Scholar