Skip to main content

Advertisement

Log in

Feature Selection and Performance Evaluation of Support Vector Machine (SVM)-Based Classifier for Differentiating Benign and Malignant Pulmonary Nodules by Computed Tomography

  • Published:
Journal of Digital Imaging Aims and scope Submit manuscript

Abstract

There are lots of work being done to develop computer-assisted diagnosis and detection (CAD) technologies and systems to improve the diagnostic quality for pulmonary nodules. Another way to improve accuracy of diagnosis on new images is to recall or find images with similar features from archived historical images which already have confirmed diagnostic results, and the content-based image retrieval (CBIR) technology has been proposed for this purpose. In this paper, we present a method to find and select texture features of solitary pulmonary nodules (SPNs) detected by computed tomography (CT) and evaluate the performance of support vector machine (SVM)-based classifiers in differentiating benign from malignant SPNs. Seventy-seven biopsy-confirmed CT cases of SPNs were included in this study. A total of 67 features were extracted by a feature extraction procedure, and around 25 features were finally selected after 300 genetic generations. We constructed the SVM-based classifier with the selected features and evaluated the performance of the classifier by comparing the classification results of the SVM-based classifier with six senior radiologists′ observations. The evaluation results not only showed that most of the selected features are characteristics frequently considered by radiologists and used in CAD analyses previously reported in classifying SPNs, but also indicated that some newly found features have important contribution in differentiating benign from malignant SPNs in SVM-based feature space. The results of this research can be used to build the highly efficient feature index of a CBIR system for CT images with pulmonary nodules.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig 1.
Fig 2.
Fig 3.
Fig 4.
Fig 5.
Fig 6.

Similar content being viewed by others

References

  1. Matsuki Y, Nakamura K, Watanabe H, Aoki T, Nakata H, Katsuragawa S, Doi K: Usefulness of an artificial neural network for differentiating benign from malignant pulmonary nodules on high-resolution CT: evaluation with receiver operating characteristic analysis. Am J Roentgenol 178(3):657–663, 2002

    Google Scholar 

  2. McNitt-Gray MF, Hart EM, Goldin JG, Yao CW, Aberle DR: A pattern classification approach to characterizing solitary pulmonary nodules imaged on high resolution computed tomography. Proc SPIE 2710:1024–1034, 1996

    Article  Google Scholar 

  3. Nakamura K, Yoshida H, Engelmann R, MacMahon H: Computerized analysis of the likelihood of malignancy in solitary pulmonary nodules with use of artificial neural networks. Radiology 214:823–830, 2000

    CAS  PubMed  Google Scholar 

  4. Shiraishi J, Abe H, Englemann R, Aoyama M: Computer-aided diagnosis to distinguish benign from malignant solitary pulmonary nodules on radiographs: ROC analysis of radiologists′ performance–initial experience. Radiology 227:469–474, 2003

    Article  PubMed  Google Scholar 

  5. Kawata Y, Niki N, Ohmatsu H, Kusumoto M, et al: Hybrid classification approach of malignant and benign pulmonary nodules based on topological and histogram features. In: Proc MICCAI 297–306, 2000

  6. Silva AC, Paiva AC, Oliveira ACM: Comparison of FLDA, MLP and SVM in diagnosis of lung nodule. Lect Notes Comput Sci 3587:285–294, 2005

    Article  Google Scholar 

  7. Shah SK, McNitt-Gray MF, Rogers SR: Computer aided characterization of the solitary pulmonary nodule using volumetric and contrast enhancement features. Acad Radiol 12(10):1310–1319, 2005

    Article  PubMed  Google Scholar 

  8. Yamashita K, Matsunobe S, Tsuda T, Nemoto T: Solitary pulmonary nodule: preliminary study of evaluation with incremental dynamic CT. Radiology 194:399–405, 1995

    CAS  PubMed  Google Scholar 

  9. Siegelman SS, Khouri NF, Leo FR: Solitary pulmonary nodules: CT assessment. Radiology 160:307–312, 1986

    CAS  PubMed  Google Scholar 

  10. Müller H, Michoux N, Bandon D: A review of content-based image retrieval system in medical applications-clinical benefits and future directions. Int J Med Informatics 73(1):1–23, 2004

    Article  Google Scholar 

  11. Fisher B, Deserno T, Ott B, et al: Integration of a research CBIR system with RIS and PACS for radiological routine. Proc SPIE 6919:691914–1–691914-10, 2008

    Article  Google Scholar 

  12. Tan Y, Zhang J, Hua Y, Zhang G: Content-based image retrieval in picture archiving and communication system. Proc SPIE 6145:614515–1–614515-8, 2006

    Article  Google Scholar 

  13. Deserno T, Antani S, Long RL: Ontology of gaps in content-based image retrieval. J Digit Imaging (in press), 2007

  14. Depeusinge A, Lavindrasana J, Hidki A, et al: A classification framework for lung tissue categorization. Proc SPIE 6919:69190C1–69190C12, 2008

    Google Scholar 

  15. Silva AC, Carvalho PCP, Gattass M: Diagnosis of lung nodule using semivariogram and geometric measures in computerized tomography images. Comput Methods Programs Biomed 79:31–38, 2005

    Article  PubMed  Google Scholar 

  16. Haralick RM: Statistical and structural approaches to texture. Proc IEEE 67:786–804, 1979

    Article  Google Scholar 

  17. Clausi DA, Jernigan ME: Designing Gabor filters for optimal texture separability. Pattern Recogn 33:1835–1849, 2000

    Article  Google Scholar 

  18. Manjunath B, Ma W: Texture features for browsing and retrieval of image data. IEEE Trans Pattern Analysis Mach Intell 18(8):837–842, 1996

    Article  Google Scholar 

  19. Unser M: Texture classification and segmentation using wavelet frames. IEEE Trans Image Processing 4:1549–1560, 1995

    Article  CAS  Google Scholar 

  20. Kaplan LM, Murenzi R: Texture segmentation using multiscale Hurst features. IEEE Int Conf Image Process 3:205–208, 1997

    Google Scholar 

  21. Joachims T: Text categorization with support vector machines. In: Proceedings of European Conference on Machine Learning (ECML), 1998

  22. Brown M, Grundy W, Lin D, Cristianini N, Sugnet C, Furey T, Ares M, Haussler D: Knowledge-based analysis of microarray gene expression data using support vector machines. 1999. http://www.cse.ucsc.edu/research/compbio/genex/genex.html. Santa Cruz, University of California, Department of Computer Science and Engineering

  23. Shawe-Taylor J, Cristianini N: Kernel methods for pattern analysis, Cambridge: Cambridge University Press, 2004

    Google Scholar 

  24. Fawcett T: ROC graphs: notes and practical considerations for data mining researchers. Technical report HPL-2003-4 HP Labs, 2003.

  25. Canu S, Grandvalet Y, Guigue V, Rakotomamonjy A: SVM and kernel methods Matlab toolbox, Rouen: Perception Systèmes et Information, INSA de Rouen, 2005

    Google Scholar 

  26. Metz CE: ROCKIT software. http://xray.bsd.uchicago.edu/krl/index.htm, 2006

Download references

Acknowledgements

The project was supported by the grants from the National Nature Science Foundation of China (grant no. 30570512) and Shanghai Science and Technology Committee (grant no. 064119658, 06SN07111). The authors would like to thank Dr. Xiaojun Ge for providing the CT images used in this study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jianguo Zhang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhu, Y., Tan, Y., Hua, Y. et al. Feature Selection and Performance Evaluation of Support Vector Machine (SVM)-Based Classifier for Differentiating Benign and Malignant Pulmonary Nodules by Computed Tomography. J Digit Imaging 23, 51–65 (2010). https://doi.org/10.1007/s10278-009-9185-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10278-009-9185-9

Key words

Navigation