Lung Cancer Screening Based on Type-different Sensor Arrays

Li, Wang; Liu, Hongying; Xie, Dandan; He, Zichun; Pi, Xititan

doi:10.1038/s41598-017-02154-9

Lung Cancer Screening Based on Type-different Sensor Arrays

Article
Open access
Published: 16 May 2017

Volume 7, article number 1969, (2017)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Lung Cancer Screening Based on Type-different Sensor Arrays

Download PDF

Wang Li^1,2,
Hongying Liu^1,3,
Dandan Xie¹,
Zichun He⁴ &
…
Xititan Pi^1,5

5864 Accesses
65 Citations
1 Altmetric
Explore all metrics

Abstract

In recent years, electronic nose (e-nose) systems have become a focus method for diagnosing pulmonary diseases such as lung cancer. However, principles and patterns of sensor responses in traditional e-nose systems are relatively homogeneous. Less study has been focused on type-different sensor arrays. In this paper, we designed a miniature e-nose system using 14 gas sensors of four types and its subsequent analysis of 52 breath samples. To investigate the performance of this system in identifying and distinguishing lung cancer from other respiratory diseases and healthy controls, five feature extraction algorithms and two classifiers were adopted. Lastly, the influence of type-different sensors on the identification ability of e-nose systems was analyzed. Results indicate that when using the LDA fuzzy 5-NN classification method, the sensitivity, specificity and accuracy of discriminating lung cancer patients from healthy controls with e-nose systems are 91.58%, 91.72% and 91.59%, respectively. Our findings also suggest that type-different sensors could significantly increase the diagnostic accuracy of e-nose systems. These results showed e-nose system proposed in this study was potentially practicable in lung cancer screening with a favorable performance. In addition, it is important for type-different sensors to be considered when developing e-nose systems.

Prediction of lung cancer with a sensor array based e-nose system using machine learning methods

Article 18 April 2024

Electronic Nose Technology in Respiratory Diseases

Article 25 February 2017

Differentiating head and neck carcinoma from lung carcinoma with an electronic nose: a proof of concept study

Article Open access 16 April 2016

Introduction

In recent years, breath analysis has become a research focus in the field of respiratory disease diagnosis due to its noninvasiveness, convenience and real-time analysis¹. The principal component in breath is water vapor, and the remaining parts include volatile organic compounds (VOCs) and nonvolatile matters dissolved in water or contained in exhaled aerosol particles². To date, over 3000 different VOCs have been detected in human breath³, with some successfully used in the detection of diseases including lung cancer. VOCs in breath and their applications are shown in Table 1.

Table 1 VOCs in human breath.

Full size table

At present, a Tedlar^® bag is often used in sampling VOCs in breath³⁴. Collected VOCs could be tested by multiple spectrometric techniques such as gas chromatography, ion transfer reaction, ion flow tube, ion mobility and so on^{35, 36}. These analytical techniques are sensitive and accurate, but they also have many restrictions, such as high cost, requiring professional operation and requiring pre-concentration of the breath³⁷. In recent years, inexpensive and portable e-nose systems have been proposed and designed for respiratory disease detection, the measurement reproducibility of the e-nose system was also validated to be acceptable³⁸. The common methodology and application of e-nose systems were described in detail elsewhere^{39, 40}. For lung cancer detection, Mazzone et al. reported an e-nose based on colorimetric sensor arrays could show a good performance^{41, 42}; Haick’s team from the Israel Institute of Technology developed an e-nose system with gold nanoparticle sensors for the detection of lung cancers as well. Simulations revealed the accuracy for lung cancer detection to be over 86%, and subsequent experiments proved that this e-nose system could identify many types of cancers^{20, 43}; Blatt et al. used metal oxide semiconductor sensor arrays for lung cancer diagnosis with accuracy, sensitivity and specificity all of over 90%⁴⁴. However, these electronic noses are often based on sensor arrays with similar response principles. These sensors are very similar in terms of sensitivity and response patterns. Less research has been performed on type-different sensors for the diagnosis of lung cancers. Different feature extractions and classifiers could significantly affect recognition effects of sensor arrays, but little has been reported in this area.

In the current study, we used 14 gas sensors and 2 temperature/humidity sensors to develop a small-sized e-nose system. Samples from 52 volunteers were tested using the system. A software based on C# was also programmed to control detection process and generate the “breath pattern”. Five algorithms for extracting features from the “breath pattern” were compared. Finally, a 10 fold cross validation method was used to investigate the classification performance of the e-nose system with two classifiers. Lastly, whether type-different sensor arrays could improve the recognition ability of e-nose systems was also studied.

Materials and Methods

Selection of type-different sensors

According to the major components and their concentration ranges in human breath especially from lung cancer patients (Table 1), we selected 14 gas sensors which could be classified into 4 types: metal oxide semiconductor (MOS), hot wire gas, catalytic combustion gas, and electrochemical gas sensors.

None of the four classes of sensors are gas specific sensors; they are all cross response sensors, i.e. different gas sensors responding differently to the same gas mixture and the responses of the same sensor to different gas mixtures are also different. According to this response pattern, sensor arrays could form characteristic “breath fingerprints” by exhalation components, and could diagnose diseases through pattern recognition of a “breath fingerprint”. All sensors used in this study are commercial sensors, considering they are robust and stable. These sensors could be easily obtained to carry out repetitive tests. Details of gas sensors used in this study are listed in Table 2.

Table 2 Sensors used in this study.

Full size table

E-nose system design

We designed the miniature e-nose system using the above mentioned sensors. The system’s gas reaction chamber is a rectangular enclosure made from aluminum alloy, with a volume of about 220 ml. All sensor probes are embedded into the gas reaction chamber. Sampling frequency of the whole system was designed to be 10 Hz. When the gas to be tested passes through the gas chamber, the sensor arrays will respond and form voltage (or current) signals. After initial processing (such as filtering, amplification, etc.), signals are sent to the main control chip (STM32F10) for analog digital conversion and temperature/humidity compensation calculations. Finally the signals are sent to the computer through a universal serial bus (USB) for display and storage. The compensation method of temperature/humidity involves taking the means of the two groups of temperature/humidity sensors (Model:HTG3515CH, Humirel Inc., France) placed at the air inlet and air outlet for linear compensation. The electronic nose system designed for this study is shown in Fig. 1.

In addition, in the current study, we programmed upper computer software based on C#. This software controlled detection modalities of the electronic nose system, as well as the display, storage of test data, and sample information management. The database used by this software was mySQL (Oracle^®, CA, USA).

Lastly, all breath samples in this study were collected by the 2 L Tedlar^® bag (E-switch^®, China), and pumped into the designed e-nose system by a gas sampling pump. The detection platform is shown in Fig. 2.

Source of test samples

Breath samples in this study included 24 cases from lung cancer patients, 5 cases from patients with other respiratory diseases (4 with COPD and 1 with Silicosis), 10 cases from healthy smokers and 13 healthy non-smokers. Among them, lung cancer patients were from the in-patient department of respiration at the Second Affiliated Hospital of Chongqing Medical University; patients with other respiratory diseases were from People’s Hospital of Jiangbei District and the Second Affiliated Hospital of Chongqing Medical University; healthy control volunteers (smokers and non-smokers) were recruited from Chongqing University. All volunteers singed informed consent after a detailed introduction of the purpose and plan of this study. Protocols including any relevant details of this study were carried out in accordance with the relevant guidelines and approved by Medical Ethics Committee of Chongqing University. Details of all volunteers for this study are listed in Table S1.

Test process

All volunteers involved in this study fasted overnight prior to sampling. Meanwhile, smokers were asked to stop smoking two hours prior to sampling. All Tedlar^® bags were washed with nitrogen three times prior to sampling. The process for collection VOCs in breath is as follows:

(1)
All volunteers rested for 3–5 minutes and rinsed their mouths with clear water 3–5 times.
(2)
All volunteers put on nose clips and breathed in deeply with their mouths, exhaling into sampling bags via a disposable mouth piece until the bags were filled.
(3)
Sampling bags were then tightened and marked.

All samplings experiments were carried out in a well-ventilated room to avoid interference of other scents. All breath samples were tested within 8 hours after sampling; otherwise samples would be re-collected.

The breath detection process is as follows:

(1)
Opening the gas sampling pump to pump ambient air into the gas chamber at a rate of 6 L/min. Sensor response gradually stabilized near baseline, which normally took 30–80 s. This process can be termed a “preparation phase”.
(2)
Breath samples were collected and pumped into the gas chamber, and the sensors started to respond. This process took 10 s and can be termed the “ventilation phase”.
(3)
Disconnecting the gas sampling pump resulted in sensor responses gradually stabilizing. This process took 50 s and can be called the “response phase”.
(4)
Opening the gas sampling pump to inlet ambient air. Sensor response gradually returned to baseline. This process took 30 s and can be called “deflation phase”.

The effective detection period of each sample in this study includes a “ventilation phase”, a “response phase” and a “deflation phase”, altogether totaling 90 s. Each stage of the detection process was precisely controlled by a timing relay and electromagnetic valve. A high efficiency air filter was used when ambient air was pumped into the chamber during the “preparation phase” to dispose of solid debris that might exist in the air, avoiding sensor damage and ensuring baseline stability. All sample detections were carried out at the same place with good indoor ventilation.

Data analysis

For successful identification of disease with an e-nose system, raw data should be analyzed and processed appropriately. All data was processed using Matlab. The data analysis procedure is as follows:

Data pre-processing

All raw data measured by the electronic nose system should be pre-processed before analysis, which includes baseline processing and standardization.

Baseline processing was performed for drift compensation and contrast enhancement⁴⁵. Assuming a total of N_S samples were detected in this study, each sample contained N_D sensor response data; the length of each sensor detection data was N_T; and the length of the stabilized data at the “preparation phase” was N_B (N_B ≤ N_T); then the response (after baseline processing) to sample S (S = 1, 2, 3, …, N_S) from sensor D (D = 1, 2, 3, …, N_D) at time T (T = 1, 2, 3, …, N_T) would be:

$${R}_{(S,D,T)}^{B}={R}_{(S,D,T)}-\frac{1}{{N}_{B}}\sum _{t=1}^{{N}_{B}}{R}_{(S,D,t)}$$

(1)

In which, R_{(S, D, T)} and R_{(S, D, t)} are the actual responses to sample S from sensor D at time T and time t (t = 1, 2, 3, …, N_B).

Data standardization is necessary in order to compensate for the numerical range and/or unit differences of the results measured by sensors in the gas chamber⁴³. A standard deviation method was adopted in this study for data standardization, i.e. the average response of each sensor was 0 and the standard deviation was 1 after standardization.

Feature extraction

In this study, the sampling frequency of electronic nose system was set at 10 Hz. The data dimension obtained from each sensor in one sampling cycle was 10 × 90 = 900. For each sample, there were actually 16 sensors for detection. Therefore, the feature vector dimension consisting of sensor responses obtained from each sample was 900 × 16 = 14400, far greater than the total number of samples included in this study. There was a great amount of redundancy in these dimension data. Redundant data were not only non-conducive to the internal parameters of the calculation classification method, but also would lead to over-fitting⁴⁶. Therefore, it is necessary to reduce the dimension of the obtained characteristic matrix and extract main features. In the current study, we searched for optimal dimension reduction mapping from five different dimension reduction methods, including principal component analysis (PCA), linear discriminant analysis (LDA), Laplacian Eigenmap (LE), local linear embedding (LLE), and t-Stochastic Neighbor Embedding (tSNE).

When applying the features for classification after PCA dimension reduction, it was necessary to determine the number of principal components-k. The method used in this study was: the eigenvalues of the covariance matrix of the original data set were arrayed from large to small as: λ₁, λ₂, …, λ_n, with the total variance percentage τ of the first k components occupied being:

$$\tau =\frac{{\sum }_{j=1}^{k}{\lambda }_{j}}{{\sum }_{j=1}^{n}{\lambda }_{j}}$$

(2)

We set τ = 99% in this study to calculate the parameter k.

Classifier selection

In this study, we initially selected 2 classifiers, fuzzy k-NN and support vector machine (SVM). Optimal values of near neighbor k and proportionality coefficient m in Fuzzy k-NN were determined using the 10-fold cross validation method. The kernel function used in SVM was a radial basis function. Penalty factor C and kernel function parameter σ were determined via grid search method⁴⁵.

Error estimation

When classifying the breath samples, 5 samples from other respiratory disease were excluded due to small sample size. Meanwhile, healthy smokers and healthy non-smokers were classified as the healthy group. The remaining 47 samples could be classified into the following two groups: lung cancer negative group and lung cancer positive group. The confusion matrix formed by real results and predicted results is shown as Table 3:

Table 3 Confusion matrix obtained from classifier.

Full size table

Then the accuracy rate (Acr), sensitivity (Tpr) and specificity (Tnr) are:

$$Acr=\frac{TP+TN}{TP+FP+TN+FN}$$

(3)

$$Tpr=\frac{TP}{TP+FN}$$

(4)

$$Tnr=\frac{TN}{FP+TN}$$

(5)

Data Availability

All data generated or analyzed during this study are included in this published article (and its Supplementary Information files).

Results and Discussion

Response curves of sensor arrays

The typical response curve of the electronic nose system to the breath sample is shown in Fig. 3. It can be seen that the curve contains the three phases of one detection period, i.e.: ventilation phase, response phase and deflation phase. The response curves of most sensors were similar within the detection period. However, the curves of some sensors appeared to be quite different, such as TGS2444, MR516 and NAP55A. These pattern differences made the “breath fingerprint” more apparent.

Comparison of feature extraction effects

When performing feature extraction, some methods involved parameter selection, such as the near neighbor number k in LE algorithm and the Perplexity parameter in the tSNE algorithm. We compared the LE mapping results of k = 3, 5, 7, 9, 11, finding the best one to be when k = 7. Similarly, after comparison, the Perplexity parameter in tSNE was set as 5. Besides, LDA is a supervised dimension reduction algorithm, such that the mapping results of different sample labels were different. In this study, we examined dimension reduction results via LDA for 4 types of sample labeling (lung cancer group, other respiratory disease group, healthy smoking group and healthy non-smoking group) and 3 types of labeling (healthy smokers and healthy non-smokers were classified together as a healthy group). The 2D mapping results of different feature extraction methods are as follows:

It can be seen in Fig. 4 (a–e) that different feature extraction methods may result in different classifications. However, we could also conclude from the 5 dimension reduction mapping results that:

First, in the above 5 kinds of feature extraction methods, LDA had the best classification performance for 4 types of sample labels in this study. This was indicated by the great distances between the 4 groups and the aggregation of samples in the same group as shown in Fig. 4b. In addition, there was little overlap between different groups. The next best classification results were from LE and PCA.

Second, no matter which dimension reduction algorithm was applied, healthy smokers and lung cancer patients could all be well classified, i.e. the blue dots and red hexagrams are mutually far away and have nearly no overlap in Fig. 4a–e. This also indicates that although many lung cancer patients in this study were also smokers or former smokers, their exhaled breath were significantly different from that of healthy smokers. This result was practically significant. Indeed, the smokers are a high risk population for lung cancer. However, the e-nose system used in this study could optimally distinguish lung cancer patients from healthy smokers in this high risk population. This also suggests that the simple screening of lung cancer patients in a high risk population is practically feasible.

Third, if samples were classified by the 3 classes of labels (healthy smokers and healthy non-smokers were classified as the healthy group), then we should only consider the dots (blue and green) as one group. This way, it can be seen that there is some overlap between the healthy group and lung cancer patients in the 2D mapping results of dimension reduction by PCA, LE, tSNE and LLE. However, Fig. 4f indicates that LDA still had a good classification performance for the 3 classes of sample labels.

Figure 4a–e reveals that no matter which dimension reduction method was adopted, the e-nose system developed in this study was able to distinguish healthy smokers from healthy non-smokers, as shown in Fig. 4; blue and green dots are far away from each other with only slight overlap. This result suggests that the breath components of smokers and non-smokers might be quite different, consistent with the conclusions of related studies⁴⁷.

Lastly, Fig. 4 also showed that samples from other respiratory diseases (light blue diamonds) were all distinguishable from samples of healthy smokers (Figure a-e), from samples of the healthy population (Fig. 4f), and from samples of lung cancer patients (Fig. 4f). However, due to the insufficient sample size of patients with other respiratory diseases, this conclusion should be further validated with more samples.

Estimation error analysis

As mentioned before, to carry out this part of study, we excluded samples from patients with other respiratory disease, and classified the remaining 47 samples into 2 groups: a healthy group and a lung cancer group. Besides, according to the previous dimension reduction comparison, data for classification were from feature extraction results by LDA, LE and PCA. When using the mapping results by PCA, according to the methods introduced before, the principal component number was 21, and these components explained 99.08% of the variance.

When using Fuzzy k-NN for classification, the near neighbor number k and the proportionality coefficient m should be optimized. A 10 fold cross validation method was used for optimization in this study; results are shown in Fig. 5. It can be seen in Fig. 5a that when k = 5, 9, 13, 17, 21, classification accuracy of Fuzzy k-NN algorithm remains unchanged. Therefore, k = 5 was utilized in this study. Figure 5b showed the effects of proportionality coefficient m on the classification accuracy of Fuzzy 5-NN. It can be seen that when m = 2, the classification accuracy of 3 classification method yielded relatively better results. Therefore, m was set as 2 in this study.

As described before, a grid search method was used to optimize the parameters (C, σ) of SVM⁴⁸. Optimized results for PCA-SVM are shown in Fig. 6. When (C, σ) = (0.0625, 0.25), the classification accuracy of PCA-SVM is the best. Similarly, optimal (C, σ) of LDA-SVM and LE-SVM were also determined by this method.

After all parameters were optimized, 47 breath samples were classified using 2 classifiers based on the mapping results of 3 feature extraction methods mentioned above. Here, we also tested the classification performance by the 10-fold cross validation method. Each combined classification method was tested 50 times to obtain the confidence intervals of classification errors (shown in Table 4). It can be seen from the table that the best classification performance among the 6 classification methods were the combined methods based on LDA (LDA-Fuzzy 5-NN and LDA-SVM). This was consistent with results observed in the dimension reduction mapping plots (Fig. 4). Among them, LDA Fuzzy 5-NN showed the best classification results, which produced greater than 90% of sensitivity, specificity and accuracy.

Table 4 Distinguishing results of lung cancer samples and healthy samples by different classification methods.

Full size table

Effects of type-different sensor array on lung cancer recognition

In order to study the effects of type-different sensor arrays of the e-nose on lung cancer recognition performance, we classified the 14 gas sensors into 2 groups: group K and group T (Table 5). Of them, group K included all 4 types of sensors, and group T included only MOS sensors. According to the response results (Fig. 1), the sensitivity, response patterns and principles of sensors in group K had a greater relative difference from each other than that of sensors in group T.

Table 5 Sensors grouping.

Full size table

We still used the 3 dimension reduction algorithms (LDA, PCA and LE) selected earlier to extract principal features of the data obtained by these 2 groups of sensors. For comparison, 52 samples were still classified into 4 groups (lung cancer group, healthy smoking group, healthy non-smoking group and other disease group). 2D plots of the mapping results are shown in Fig. 7. It can be seen that, no matter which group the data came from, the classification performance of all the 3 feature extraction methods decreased when comparing to the mapping results in Fig. 4. However, data from sensor group K gave better classification results (Fig. 7a,c,e) than that from sensor group T (Fig. 7b,d,f) when using the same feature extraction algorithm.

In order to further compare the recognition performance of sensor group K and sensor group T, we used the Fuzzy k-NN, which had better performance in the previous classification to classify breath samples based on mapping results depicted in Fig. 7. Similarly, for comparison, breath samples of other diseases were also excluded, and healthy smokers and non-smokers were classified as one group, i.e. 47 samples were divided into two groups: a healthy group and a lung cancer group. After parameter optimization, accuracy of 3 classification methods (LDA 5-NN, PCA 5-NN and LE 5-NN) were obtained using 10-fold cross validation. We ran the validation 50 times to get 50 accurate readings for each classification method. Then, the recognition accuracies of sensor group K and sensor group T were analyzed using one way-ANOVA; results are shown in Fig. 8.

Similar conclusions could be drawn from results shown in Fig. 8 as in Fig. 7. Firstly, no matter which sensor group was used, the lung cancer recognition performance was not as good as the performance that all 14 gas sensors showed. Secondly, lung cancer detection accuracy was better when using data from sensor group K than that from sensor group T. When utilizing PCA 5-NN (Fig. 8b) and LE 5-NN (Fig. 8c), the recognition accuracy of sensor group K was significantly greater than that of sensor group T (n = 50, p < 0.01). Although the classification accuracy of sensor groups K and T was not significantly different (n = 50, p = 0.21) when using LDA 5-NN, the average and highest recognition accuracy of sensor group K was greater than that of sensor group T after 50 times of cross validation. In summary, type-different sensors are notably helpful for improving the lung cancer recognition ability of the e-nose system designed in this study.

Conclusion

The e-nose system developed using 4 types of commercial sensors in this study could identify relatively specific “breath fingerprints” based on human breath, which could be used to recognize volunteers in different diseased or healthy states. LDA proved to be among the best methods for “breath fingerprint” recognition in this study. When this e-nose system was used in the differentiation of lung cancer patients from healthy volunteers, the classification specificity, sensitivity and accuracy as determined by LDA fuzzy 5-NN were all above 90%, indicating a comparable performance with traditional imaging modalities. In addition, difference in the type of sensor arrays are notably helpful for the improvement of the ability of the e-nose system to detect respiratory diseases.

In a word, the designed e-nose system based on optimized algorithms was low cost, noninvasive and was potential practicable in screening lung cancers from both healthy people and lung cancer high risk populations.

References

Amann, A. et al. Analysis of exhaled breath for disease detection. Annu Rev Anal Chem (Palo Alto Calif) 7, 455–482, doi:10.1146/annurev-anchem-071213-020043 (2014).
Article ADS CAS Google Scholar
Mutlu, G. M., Garey, K. W., Robbins, R. A., Danziger, L. H. & Rubinstein, I. Collection and analysis of exhaled breath condensate in humans. American Journal of Respiratory and Critical Care Medicine 164, 731–737, doi:10.1164/ajrccm.164.5.2101032 (2001).
Article CAS PubMed Google Scholar
Phillips, M. et al. Variation in volatile organic compounds in the breath of normal humans. Journal of Chromatography B 729, 75–88, doi:10.1016/S0378-4347(99)00127-9 (1999).
Article CAS Google Scholar
Ozsoylu, S. End-tidal carbon monoxide and neonatal jaundice. Journal of Pediatrics 129, 485–485, doi:10.1016/S0022-3476(96)70106-3 (1996).
Article CAS PubMed Google Scholar
Eisenmann, A., Amann, A., Said, M., Datta, B. & Ledochowski, M. Implementation and interpretation of hydrogen breath tests. Journal of Breath Research 2, 046002, doi:10.1088/1752-7155/2/4/046002 (2008).
Article ADS PubMed Google Scholar
de Lacy Costello, B. P., Ledochowski, M. & Ratcliffe, N. M. The importance of methane breath testing: a review. Journal of Breath Research 7, 024001, doi:10.1088/1752-7155/7/2/024001 (2013).
Article ADS PubMed Google Scholar
Tomikawa, M. et al. Asthma diagnosis and treatment - 1005. Optimization for the withdrawal of inhaled corticosteroid treatment by monitoring fractional exhaled nitric oxide (feno) and lung functions. World Allergy Organ J 6(Suppl 1), P5, doi:10.1186/1939-4551-6-S1-P5 (2013).
Article PubMed PubMed Central Google Scholar
Ichinose, M., Sugiura, H., Yamagata, S., Koarai, A. & Shirato, K. Increase in reactive nitrogen species production in chronic obstructive pulmonary disease airways. Am J Respir Crit Care Med 162, 701–706, doi:10.1164/ajrccm.162.2.9908132 (2000).
Article CAS PubMed Google Scholar
Nair, S., Cope, K., Terence, R. H. & Diehl, A. M. Obesity and female gender increase breath ethanol concentration: Potential implications for the pathogenesis of nonalcoholic steatohepatitis. American Journal of Gastroenterology 96, 1200–1204, doi:10.1111/j.1572-0241.2001.03702.x (2001).
Article CAS PubMed Google Scholar
Olopade, C. O., Zakkar, M., Swedler, W. I. & Rubinstein, I. Exhaled pentane levels in acute asthma. Chest 111, 862–865, doi:10.1378/chest.111.4.862 (1997).
Article CAS PubMed Google Scholar
Poli, D. et al. Exhaled volatile organic compounds in patients with non-small cell lung cancer: cross sectional and nested short-term follow-up study. Respir Res 6, 71, doi:10.1186/1465-9921-6-71 (2005).
Article PubMed PubMed Central Google Scholar
Humad, S., Zarling, E., Clapper, M. & Skosey, J. L. Breath Pentane Excretion as a Marker of Disease-Activity in Rheumatoid-Arthritis. Free Radical Research Communications 5, 101–106, doi:10.3109/10715768809066917 (1988).
Article CAS PubMed Google Scholar
Hanouneh, I. A. et al. The Breathprints in Patients with Liver Disease Identify Novel Breath Biomarkers in Alcoholic Hepatitis. Hepatology 58, 828a–828a (2013).
Google Scholar
Huang, J. Z. et al. Exhaled breath acetone for therapeutic monitoring in pneumonia using selected ion flow tube mass spectrometry (SIFT-MS). Analytical Methods 5, 3807–3810, doi:10.1039/c3ay40951a (2013).
Article CAS Google Scholar
Musa-Veloso, K., Rarama, E., Comeau, F., Curtis, R. & Cunnane, S. Epilepsy and the ketogenic diet: Assessment of ketosis in children using breath acetone. Pediatric Research 52, 443–448, doi:10.1203/00006450-200209000-00023 (2002).
Article PubMed Google Scholar
Salehi, S., Nikan, E., Khodadadi, A. A. & Mortazavi, Y. Highly sensitive carbon nanotubes-SnO2 nanocomposite sensor for acetone detection in diabetes mellitus breath. Sensors and Actuators B-Chemical 205, 261–267, doi:10.1016/j.snb.2014.08.082 (2014).
Article CAS Google Scholar
Gordon, S. M., Szidon, J. P., Krotoszynski, B. K., Gibbons, R. D. & O’Neill, H. J. Volatile organic compounds in exhaled air from patients with lung cancer. Clinical Chemistry 31, 1278–1282 (1985).
CAS PubMed Google Scholar
O’Neill, H. J., Gordon, S. M., O’Neill, M. H., Gibbons, R. D. & Szidon, J. P. A computerized classification technique for screening for the presence of breath biomarkers in lung cancer. Clin Chem 34, 1613–1618 (1988).
PubMed Google Scholar
Morita, M. & Wang, H.-L. Relationship between sulcular sulfide level and oral malodor in subjects with periodontal disease. Journal of periodontology 72, 79–84, doi:10.1902/jop.2001.72.1.79 (2001).
Article CAS PubMed Google Scholar
Peng, G. et al. Diagnosing lung cancer in exhaled breath using gold nanoparticles. Nature Nanotechnology 4, 669–673, doi:10.1038/nnano.2009.235 (2009).
Article ADS CAS PubMed Google Scholar
Bajtarevic, A. et al. Noninvasive detection of lung cancer by analysis of exhaled breath. Bmc Cancer 9, 1–16, doi:10.1186/1471-2407-9-348 (2009).
Article Google Scholar
Fuchs, P., Loeseken, C., Schubert, J. K. & Miekisch, W. Breath gas aldehydes as biomarkers of lung cancer. International Journal of Cancer 126, 2663–2670, doi:10.1002/ijc.24970 (2010).
CAS PubMed Google Scholar
Song, G. et al. Quantitative breath analysis of volatile organic compounds of lung cancer patients. Lung Cancer 67, 227–231, doi:10.1016/j.lungcan.2009.03.029 (2010).
Article PubMed Google Scholar
Davidson, C. S. Hepatic coma. Disease-a-Month 10, 2–30, doi:10.1016/S0011-5029(64)80009-2 (1964).
Article Google Scholar
Phillips, M. et al. Volatile biomarkers of pulmonary tuberculosis in the breath. Tuberculosis 87, 44–52, doi:10.1016/j.tube.2006.03.004 (2007).
Article CAS PubMed Google Scholar
Alkhouri, N. et al. Isoprene in the Exhaled Breath is a Novel Biomarker for Advanced Fibrosis in Patients with Chronic Liver Disease: A Pilot Study. Clinical and Translational Gastroenterology 6, e112, doi:10.1038/ctg.2015.40 (2015).
Article CAS PubMed PubMed Central Google Scholar
Karl, T. et al. Human breath isoprene and its relation to blood cholesterol levels: new measurements and modeling. Journal of Applied Physiology 91, 762–770 (2001).
CAS PubMed Google Scholar
Sehnert, S. S., Jiang, L., Burdick, J. F. & Risby, T. H. Breath biomarkers for detection of human liver diseases: preliminary study. Biomarkers 7, 174–187, doi:10.1080/13547500110118184 (2002).
Article CAS PubMed Google Scholar
Studer, S. M. et al. Patterns and significance of exhaled-breath biomarkers in lung transplant recipients with acute allograft rejection. Journal of Heart and Lung Transplantation 20, 1158–1166, doi:10.1016/S1053-2498(01)00343-6 (2001).
Article CAS PubMed Google Scholar
Phillips, M., Sabas, M. & Greenberg, J. Increased pentane and carbon disulfide in the breath of patients with schizophrenia. Journal of Clinical Pathology 46, 861–864, doi:10.1136/jcp.46.9.861 (1993).
Article CAS PubMed PubMed Central Google Scholar
Davies, S. J., Spanel, P. & Smith, D. Breath analysis of ammonia, volatile organic compounds and deuterated water vapor in chronic kidney disease and during dialysis. Bioanalysis 6, 843–857, doi:10.4155/bio.14.26 (2014).
Article CAS PubMed Google Scholar
Davies, S., Spanel, P. & Smith, D. Quantitative analysis of ammonia on the breath of patients in end-stage renal failure. Kidney International 52, 223–228, doi:10.1038/ki.1997.324 (1997).
Article CAS PubMed Google Scholar
Shimamoto, C., Hirata, I. & Katsu, K. Breath and blood ammonia in liver cirrhosis. Hepatogastroenterology 47, 443–445 (2000).
CAS PubMed Google Scholar
Mochalski, P., King, J., Unterkofler, K. & Amann, A. Stability of selected volatile breath constituents in Tedlar, Kynar and Flexfilm sampling bags. Analyst 138, 1405–1418, doi:10.1039/c2an36193k (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Mazzone, P. J. Exhaled breath volatile organic compound biomarkers in lung cancer. Journal of Breath Research 6, 027106, doi:10.1088/1752-7155/6/2/027106 (2012).
Article ADS PubMed Google Scholar
van de Kant, K. D. G., van der Sande, L. J. T. M., Jobsis, Q., van Schayck, O. C. P. & Dompeling, E. Clinical use of exhaled volatile organic compounds in pulmonary diseases: a systematic review. Respiratory Research 13, 117, doi:10.1186/1465-9921-13-117 (2012).
Article PubMed PubMed Central Google Scholar
Chow, K. K., Short, M. & Zeng, H. A comparison of spectroscopic techniques for human breath analysis. Biomedical Spectroscopy and Imaging 1, 339–353, doi:10.1118/1.4892381 (2012).
CAS Google Scholar
Bofan, M. et al. Within-day and between-day repeatability of measurements with an electronic nose in patients with COPD. J Breath Res 7, 017103, doi:10.1088/1752-7155/7/1/017103 (2013).
Article ADS PubMed Google Scholar
Montuschi, P., Mores, N., Trove, A., Mondino, C. & Barnes, P. J. The electronic nose in respiratory medicine. Respiration 85, 72–84, doi:10.1159/000340044 (2013).
Article PubMed Google Scholar
Santini, G. et al. Electronic Nose and Exhaled Breath NMR-based Metabolomics Applications in Airways Disease. Curr Top Med Chem 16, 1610–1630, doi:10.2174/1568026616666151223113540 (2016).
Article CAS PubMed Google Scholar
Mazzone, P. J. et al. Diagnosis of lung cancer by the analysis of exhaled breath with a colorimetric sensor array. Thorax 62, 565–568, doi:10.1136/thx.2006.072892 (2007).
Article PubMed PubMed Central Google Scholar
Mazzone, P. J. et al. Exhaled Breath Analysis with a Colorimetric Sensor Array for the Identification and Characterization of Lung Cancer. Journal of Thoracic Oncology 7, 137–142, doi:10.1097/JTO.0b013e318233d80f (2012).
Article PubMed PubMed Central Google Scholar
Peng, G. et al. Detection of lung, breast, colorectal, and prostate cancers from exhaled breath using a single array of nanosensors. British Journal of Cancer 103, 542–551, doi:10.1038/sj.bjc.6605810 (2010).
Article CAS PubMed PubMed Central Google Scholar
Blatt, R. et al. In International Workshop on Fuzzy Logic and Applications. 261–268 (Springer).
Osuna, R., Nagle, H. & Shiffman, S. The how and why of electronic nose. IEEE Spectrum, 22–34 (1998).
Tarca, A. L., Carey, V. J., Chen, X. W., Romero, R. & Draghici, S. Machine learning and its applications to biology. PLoS Comput Biol 3, e116, doi:10.1371/journal.pcbi.0030116 (2007).
Article ADS PubMed PubMed Central Google Scholar
Buszewski, B., Ulanowska, A., Ligor, T., Denderz, N. & Amann, A. Analysis of exhaled breath from smokers, passive smokers and non‐smokers by solid‐phase microextraction gas chromatography/mass spectrometry. Biomedical Chromatography 23, 551–556, doi:10.1002/bmc.v23:5 (2009).
Article CAS PubMed Google Scholar
Chang, C.-C. & Lin, C.-J. LIBSVM: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST) 2, 27 (2011).
Google Scholar

Download references

Acknowledgements

The authors would like to thank Doctor Zichun He from Chongqing Red Cross Hospital for the collection of the samples and all the volunteers for their participation in this study. This work was supported by National Key Technologies R&D Program (No. 2013BAI03B04 and 2015BAI01B14) and the Fundamental Research Funds for the Central Universities (No. 106112016CDJXY230003).

Author information

Authors and Affiliations

Key Laboratory of Biorheology Science and Technology, Ministry of Education, College of Bioengineering, Chongqing University, Chongqing, P.R. China
Wang Li, Hongying Liu, Dandan Xie & Xititan Pi
Artificial Intelligence of Key Laboratory of Sichuan Province, Sichuan University of Science & Engineering, Zigong, Sichuan Province, P.R. China
Wang Li
Chongqing Engineering Research Center of Medical Electronics, Chongqing, P.R. China
Hongying Liu
Chongqing Red Cross Hospital (People’s Hospital of Jiangbei District), Chongqing, P.R. China
Zichun He
Key Laboratories for National Defense Science and Technology of Innovative Micro-Nano Devices and System Technology, Chongqing University, Chongqing, P.R. China
Xititan Pi

Authors

Wang Li
View author publications
You can also search for this author in PubMed Google Scholar
Hongying Liu
View author publications
You can also search for this author in PubMed Google Scholar
Dandan Xie
View author publications
You can also search for this author in PubMed Google Scholar
Zichun He
View author publications
You can also search for this author in PubMed Google Scholar
Xititan Pi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived and designed the experiments: W.L. and H.L. Performed the experiments: W.L. and D.X. Analyzed the data: W.L. Contributed reagents/materials/analysis tools: X.P., Z.H. and H.L. Wrote the paper: W.L. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Hongying Liu or Xititan Pi.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supporting Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, W., Liu, H., Xie, D. et al. Lung Cancer Screening Based on Type-different Sensor Arrays. Sci Rep 7, 1969 (2017). https://doi.org/10.1038/s41598-017-02154-9

Download citation

Received: 07 December 2016
Accepted: 07 April 2017
Published: 16 May 2017
DOI: https://doi.org/10.1038/s41598-017-02154-9
Springer Nature Limited

This article is cited by

Gas Sensors Based on Chemi-Resistive Hybrid Functional Nanomaterials
- Yingying Jian
- Wenwen Hu
- Weiwei Wu
Nano-Micro Letters (2020)

Lung Cancer Screening Based on Type-different Sensor Arrays

Abstract

Similar content being viewed by others

Prediction of lung cancer with a sensor array based e-nose system using machine learning methods

Electronic Nose Technology in Respiratory Diseases

Differentiating head and neck carcinoma from lung carcinoma with an electronic nose: a proof of concept study

Introduction

Materials and Methods

Selection of type-different sensors

E-nose system design

Source of test samples

Test process

Data analysis

Data pre-processing

Feature extraction

Classifier selection

Error estimation

Data Availability

Results and Discussion

Response curves of sensor arrays

Comparison of feature extraction effects

Estimation error analysis

Effects of type-different sensor array on lung cancer recognition

Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Supporting Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Gas Sensors Based on Chemi-Resistive Hybrid Functional Nanomaterials

Search

Navigation