Abstract
The image mining technique deals with the extraction of implicit knowledge and image with data relationship or other patterns not explicitly stored in the images. It is an extension of data mining to image domain. The main objective of this paper is to apply image mining in the domain such as breast mammograms to classify and detect the cancerous tissue. Mammogram image can be classified into normal, benign, and malignant class. Total of 26 features including histogram intensity features and gray-level co-occurrence matrix features are extracted from mammogram images. A hybrid approach of feature selection is proposed, which approximately reduces 75% of the features, and new decision tree is used for classification. The most interesting one is that branch and bound algorithm that is used for feature selection provides the best optimal features and no where it is applied or used for gray-level co-occurrence matrix feature selection from mammogram. Experiments have been taken for a data set of 300 images taken from MIAS of different types with the aim of improving the accuracy by generating minimum number of rules to cover more patterns. The accuracy obtained by this method is approximately 97.7%, which is highly encouraging.
Similar content being viewed by others
References
Majid AS, de Paredes ES, Doherty RD, Sharma N, Salvador X (2003) Missed breast carcinoma: pitfalls and pearls. Radiographics 23:881–895
Zaïane OR, Antonie M-L, Coman A (2002) Mammography classification by association rule based classifier. In: MDM/KDD2002 international workshop on multimedia data mining ACM SIGKDD, pp 62–69
Xuanyang X, Yuchang G, Shouhong W, Xi L (2005) Computer aided detection of SARS based on radiographs data mining. In: Proceedings of the 2005 IEEE engineering in medicine and biology 27th annual conference, Shanghai, China, pp 7459–7462
Chen C, Lee G (1997) Image segmentation using multitiresolution wavelet analysis and expectation maximum (EM) algorithm for mammography. Int J Imaging Syst Technol 8(5):491–504
Wang T, Karayaiannis N (1998) Detection of microcalcification in digital mammograms using wavelets. IEEE Trans Med Imaging 17(4):498–509
Bozek J, Mustra M, Delac K, Grgic M (2009) A survey of image processing algorithms in digital mammography. In: Grgic et al (eds) Recent advances in multimedia signal processing and communications, SCI 231, pp 631–657
Wang S, Zhou M, Geng G (2005) Application of fuzzy cluster analysis for medical image data mining. In: Proceedings of the IEEE international conference on mechatronics and automation, Niagara Falls, Canada, pp 36–41
Jensen R, Qiang S (2004) Semantics preserving dimensionality reduction: rough and fuzzy-rough based approaches. In: IEEE transactions on knowledge and data engineering, pp 1457–1471
Christiyanni I et al (2000) Fast detection of masses in computer aided mammography. In: IEEE signal processing magazine, pp 54–64
Erray W, Hacida H (2006) A new cost sensitive decision tree method application for mammograms classification. In: IJCSNS international journal of computer science and network security, pp 130–138
Liu Y, Zhang D, Guojun L (2008) Regionbased image retrieval with high-level semantics using decision tree learning. Pattern Recogn 41:2554–2570
Polat K, Gunes S (2009) A novel hybrid intelligent method based on C4.5 decision tree classifier and one-against-all approach for multi-class classification problems. Exp Syst Appl 36(2):1587–1592. doi:10.1016/j.eswa.2007.11.051
Pisano ED, Cole EB, Hemminger BM, Yaffe MJ, Aylward SR, Maidment ADA, Eugene Johnston R, Williams MB, Niklason LT, Conant EF, Fajardo LL, Kopans DB, Brown ME, Pizer SM (2000) Image processing algorithms for digital mammography: a pictorial essay. J Radiogr 20(5):400–420
Wu Z, Yuan J, Lv B, Zheng X (2010) Digital mammography image enhancement using improved unsharp masking approach. In: 3rd international congress on image and signal processing (CISP), pp 668–672. doi:10.1109/CISP.2010.5647218
Bick U, Diekmann F (2010) Digital mammography, 1st edn. Springer, Berlin
Saeys Y, Abeel T, Van de Peer Y (2008) Towards robust feature selection techniques. http://www.bioinformatics.psb.ugent
Bontempi G, Haibe-Kains B (2008) Feature selection methods for mining bioinformatics data. http://www.ulb.ac.be/di/mlg
Li J, Wang JZ, Wiederhold G (2000) IRM: integrated region matching for image retrieval. In: Proceedings of the 8th ACM international conference on multimedia, pp 147–156
Wei C-H, Li Y, Chau W-Y, Li C-T (2009) Trademark image retrieval using synthetic features for describing global shape and interior structure. Pattern Recogn 42(3):386–394
Manjunath BS, Ma WY (1996) Texture feature for browsing and retrieval of image data. IEEE Trans PAMI 18(8):837–842
Saha SK, Das AK, Chanda B (2004) CBIR using perception based texture and color measures. In: Proceedings of the 17th international conference on pattern recognition (ICPR ‘04), vol 2
Zhang J, Li G-l, He S-w (2008) Texture-based image retrieval by edge detection matching GLCM. In: Proceedings of the 10th international conference on high performance computing and communications
Jalja K, Bhagvati C, Deekshatulu BL, Pujari AK (2005) Texture element feature characterizations for CBIR. In: Proceedings of geoscience and remote sensing symposium (IGARSS ‘05), vol 2
Haralick RM, Shanmugam K, Dinstein I (1973) Texture features for image classification. IEEE Trans Syst Man Cybern 8(6):610–621
Weszka JS, Dyer CR, Rosenfeld A (1976) A comparative study of texture measures for terrain classification. IEEE Trans Syst Man Cybern 6(4):269–285
Conners RW, Harlow CA (1980) A theoretical comparison texture algorithms. IEEE Trans Pattern Anal Mach Intell 2:204–222
Arivazhagan S, Ganesan L (2003) Texture classification using wavelet transform. Pattern Recogn Lett 24:1513–1521
Hiremath PS, Shivashankar S (2006) Texture classification using wavelet packet decomposition. ICGSTs GVIP J 6(2):77–80
Liu L, Wang J, He K (2010) Breast density classification using histogram moments of multiple resolution mammograms. In: Biomedical engineering and informatics (BMEI), 3rd international conference, IEEE explore, pp 146–149. doi:10.1109/BMEI.2010.5639662
Ke L, Mu N, Kang Y (2010) Mass computer-aided diagnosis method in mammogram based on texture features. In: Biomedical engineering and informatics (BMEI), 3rd international conference, IEEE explore, pp 146–149. doi:10.1109/BMEI.2010.5639662
Khuzi AM, Besar R, Wan Zaki WMD (2008) Texture features selection for masses detection in digital mammogram. In: 4th Kuala Lumpur international conference on biomedical engineering 2008, IFMBE proceedings, 2008, vol 21, part 3, part 8, pp 629–632. doi:10.1007/978-3-540-69139-6_157
Lai S, Li X, Bischof W (1989) On techniques for detecting circumscribed masses in mammograms. IEEE Trans Med Imaging 8(4):377–386
Thangavel K, Karnan M (2005) Computer aided diagnosis in digital mammograms: detection of microcalcifications by metaheuristic algorithms. ICGST-GVIP J 5(7):41–55
Jiang J, Yao B, Wason AM (2007) A genetic algorithm design for micro calcification detection and classification in digital mammograms. Comput Med Imaging Graph 31(1):49–61
Sánchez-Ferrero GV, Arribas JI (2007) A statistical-genetic algorithm to select the most significant features in mammograms, Springer Berlin/Heidelberg, pp 189–196. doi:10.1007/978-3-540-74272-2_24
Bui TN, Moon BR (1996) Genetic algorithm and graph partitioning. IEEE Trans Comput 45(7):841–855
Petre S, Pavel P, Josef K (2004) Fast branch and bound algorithms for optimal feature selection. IEEE Trans Pattern Anal Mach Intell 26(7):900–912
Woods R, Mazziotta J, Cherry S (1993) MRI-PET registration with automated algorithm. J Comput Assist Tomogr 17(4):536–546
Chawla N, Hall L, Bowyer K, Kegelmeyer W (2002) SMOTE: Synthetic Minority Oversampling TEchnique. J Artif Intell Res 16:321–357
Provost F, Domingos P (2003) Tree induction for probability-based ranking. Machine Learning 52(3):199–215
Zadrozny B, Elkan C (2001) Learning and making decisions when costs and probabilities are both unknown. In: Proceedings of KDD’2001. pp 204–213
Cussents (1993) J. Bayes and pseudo-bayes estimates of conditional probabilities and their reliabilities. pp 136–152
Holmes G, Donkin A, Witten IH (1994) WEKA: a machine learning workbench. In: Proceedings of the second Australia and New Zealand conference on intelligent information systems, Brisbane, Australia, pp 357–361
Veldkamp WJH, Karssemeijer N, Otten JDM, Hendriks JHCL (2000) Automated classification of clustered microcalcifications into malignant and benign types. Med Phys 27(11):2600–2608
Wei L, Yang Y, Nishikawa RM, Jiang Y (2005) A study on several machine-learning methods for classification of malignant and benign clustered microcalcifications. IEEE Trans Med Imaging 24(2):371–380
Abu-Amara F, Abdel-Qader I (2009) Hybrid mammogram classification using rough set and fuzzy classifier. Int J Biomed Imaging 2009. doi:10.1155/2009/680508
Chan H-P, Sahiner B, Patrick N, Helvie MA, Lam KL, Adler DD, Goodsitt MM (1997) Computerized classification of malignant and benign microcalcifications on mammograms: texture analysis using an artificial neural network. Phys Med Biol 42(3):549–567
Vibha L, Harshavardhan GM, Pranaw K, Deepa Shenoy P, Venugopal KR, Patnaik LM (2006) Classification of mammograms using decision trees, database engineering and applications symposium, IDEAS ‘06. doi:10.1109/IDEAS.2006.14
Thangavel K, Roselin R (2009) Mammogram mining with genetic optimization of ant-miner parameters. Int J Recent Trends Eng 2(3)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Mohanty, A.K., Senapati, M.R. & Lenka, S.K. A novel image mining technique for classification of mammograms using hybrid feature selection. Neural Comput & Applic 22, 1151–1161 (2013). https://doi.org/10.1007/s00521-012-0881-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-012-0881-x