Unsupervised class labeling of diffuse lung diseases using frequent attribute patterns
- 269 Downloads
For realizing computer-aided diagnosis (CAD) of computed tomography (CT) images, many pattern recognition methods have been applied to automatic classification of normal and abnormal opacities; however, for the learning of accurate classifier, a large number of images with correct labels are necessary. It is a very time-consuming and impractical task for radiologists to give correct labels for a large number of CT images. In this paper, to solve the above problem and realize an unsupervised class labeling mechanism without using correct labels, a new clustering algorithm for diffuse lung diseases using frequent attribute patterns is proposed.
A large number of frequently appeared patterns of opacities are extracted by a data mining algorithm named genetic network programming (GNP), and the extracted patterns are automatically distributed to several clusters using genetic algorithm (GA). In this paper, lung CT images are used to make clusters of normal and diffuse lung diseases.
After executing the pattern extraction by GNP, 1,148 frequent attribute patterns were extracted; then, GA was executed to make clusters. This paper deals with making clusters of normal and five kinds of abnormal opacities (i.e., six-class problem), and then, the proposed method without using correct class labels in the training showed 47.7 % clustering accuracy.
It is clarified that the proposed method can make clusters without using correct labels and has the potential to apply to CAD, reducing the time cost for labeling CT images.
KeywordsComputer-aided diagnosis Diffuse lung diseases Unsupervised learning Clustering Data mining Evolutionary computation
Compliance with ethical standards
Conflict of interest
The authors declare that they have no conflict of interest.
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards.
Human and animal rights
This article does not contain any studies with animals performed by any of the authors.
Informed consent was obtained from all individual participants included in the study.
- 1.Agrawal R, Srikant R (1994) Fast algorithms for mining association rules. In: Proceedings of the 20th VLDB conference, Santiago, Chile, pp 487–499Google Scholar
- 4.Goldberg DE (1989) Genetic algorithm in search, optimization and machine learning. Addison-Wesley, BostonGoogle Scholar
- 7.Kuwahara M, Kido S, Shouno H (2009) Classification of patterns for diffuse lung diseases in thoracic ct images by adaboost algorithm. In: Proceedings of SPIE, medical imaging, computer-aided diagnosis. 7260:37–1–8Google Scholar
- 10.Machine Learning Group at the University of Waikato (2015) Waikato environment for knowledge analysis, open source project for machine learning. www.cs.waikato.ac.nz/ml/weka/
- 12.Quinlan JR (1993) C4 5: programs for machine learning, vol 1. Morgan kaufmann, BurlingtonGoogle Scholar
- 13.Rawat J, Singh A, Bhadauria H, Virmani J (2015) Computer aided diagnostic system for detection of leukemia using microscopic images. Procedia Comput Sci 70:748–756. In: Proceedings of the 4th international conference on eco-friendly computing and communication systemsGoogle Scholar
- 14.Rui X, Hirano Y, Tachibana R, Shoji K (2013) A bag-of-features approach to classify six types of pulmonary textures on high-resolution computed tomography. IEICE Trans Inf Syst 96(4):845–855Google Scholar
- 15.Shimada K, Hirasawa K, Hu J (2006) Genetic network programming with acquisition mechanisms of association rules. J Adv Comput Intell Intell Inform 10(1):102–111Google Scholar
- 17.Zhao W, Xu R, Hirano Y, Tachibana R, Kido S (2013) Classification of diffuse lung diseases patterns by a sparse representation based method on hrct images. In: 2013 35th annual international conference of the IEEE engineering in medicine and biology society (EMBC), IEEE, pp 5457–5460Google Scholar