Skip to main content


Log in

A novel image mining technique for classification of mammograms using hybrid feature selection

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript


The image mining technique deals with the extraction of implicit knowledge and image with data relationship or other patterns not explicitly stored in the images. It is an extension of data mining to image domain. The main objective of this paper is to apply image mining in the domain such as breast mammograms to classify and detect the cancerous tissue. Mammogram image can be classified into normal, benign, and malignant class. Total of 26 features including histogram intensity features and gray-level co-occurrence matrix features are extracted from mammogram images. A hybrid approach of feature selection is proposed, which approximately reduces 75% of the features, and new decision tree is used for classification. The most interesting one is that branch and bound algorithm that is used for feature selection provides the best optimal features and no where it is applied or used for gray-level co-occurrence matrix feature selection from mammogram. Experiments have been taken for a data set of 300 images taken from MIAS of different types with the aim of improving the accuracy by generating minimum number of rules to cover more patterns. The accuracy obtained by this method is approximately 97.7%, which is highly encouraging.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others





  1. Majid AS, de Paredes ES, Doherty RD, Sharma N, Salvador X (2003) Missed breast carcinoma: pitfalls and pearls. Radiographics 23:881–895

    Article  Google Scholar 

  2. Zaïane OR, Antonie M-L, Coman A (2002) Mammography classification by association rule based classifier. In: MDM/KDD2002 international workshop on multimedia data mining ACM SIGKDD, pp 62–69

  3. Xuanyang X, Yuchang G, Shouhong W, Xi L (2005) Computer aided detection of SARS based on radiographs data mining. In: Proceedings of the 2005 IEEE engineering in medicine and biology 27th annual conference, Shanghai, China, pp 7459–7462

  4. Chen C, Lee G (1997) Image segmentation using multitiresolution wavelet analysis and expectation maximum (EM) algorithm for mammography. Int J Imaging Syst Technol 8(5):491–504

    Article  Google Scholar 

  5. Wang T, Karayaiannis N (1998) Detection of microcalcification in digital mammograms using wavelets. IEEE Trans Med Imaging 17(4):498–509

    Article  Google Scholar 

  6. Bozek J, Mustra M, Delac K, Grgic M (2009) A survey of image processing algorithms in digital mammography. In: Grgic et al (eds) Recent advances in multimedia signal processing and communications, SCI 231, pp 631–657

  7. Wang S, Zhou M, Geng G (2005) Application of fuzzy cluster analysis for medical image data mining. In: Proceedings of the IEEE international conference on mechatronics and automation, Niagara Falls, Canada, pp 36–41

  8. Jensen R, Qiang S (2004) Semantics preserving dimensionality reduction: rough and fuzzy-rough based approaches. In: IEEE transactions on knowledge and data engineering, pp 1457–1471

  9. Christiyanni I et al (2000) Fast detection of masses in computer aided mammography. In: IEEE signal processing magazine, pp 54–64

  10. Erray W, Hacida H (2006) A new cost sensitive decision tree method application for mammograms classification. In: IJCSNS international journal of computer science and network security, pp 130–138

  11. Liu Y, Zhang D, Guojun L (2008) Regionbased image retrieval with high-level semantics using decision tree learning. Pattern Recogn 41:2554–2570

    Article  MATH  Google Scholar 

  12. Polat K, Gunes S (2009) A novel hybrid intelligent method based on C4.5 decision tree classifier and one-against-all approach for multi-class classification problems. Exp Syst Appl 36(2):1587–1592. doi:10.1016/j.eswa.2007.11.051

    Article  Google Scholar 

  13. Pisano ED, Cole EB, Hemminger BM, Yaffe MJ, Aylward SR, Maidment ADA, Eugene Johnston R, Williams MB, Niklason LT, Conant EF, Fajardo LL, Kopans DB, Brown ME, Pizer SM (2000) Image processing algorithms for digital mammography: a pictorial essay. J Radiogr 20(5):400–420

    Google Scholar 

  14. Wu Z, Yuan J, Lv B, Zheng X (2010) Digital mammography image enhancement using improved unsharp masking approach. In: 3rd international congress on image and signal processing (CISP), pp 668–672. doi:10.1109/CISP.2010.5647218

  15. Bick U, Diekmann F (2010) Digital mammography, 1st edn. Springer, Berlin

    Book  Google Scholar 

  16. Saeys Y, Abeel T, Van de Peer Y (2008) Towards robust feature selection techniques. http://www.bioinformatics.psb.ugent

  17. Bontempi G, Haibe-Kains B (2008) Feature selection methods for mining bioinformatics data.

  18. Li J, Wang JZ, Wiederhold G (2000) IRM: integrated region matching for image retrieval. In: Proceedings of the 8th ACM international conference on multimedia, pp 147–156

  19. Wei C-H, Li Y, Chau W-Y, Li C-T (2009) Trademark image retrieval using synthetic features for describing global shape and interior structure. Pattern Recogn 42(3):386–394

    Article  MATH  Google Scholar 

  20. Manjunath BS, Ma WY (1996) Texture feature for browsing and retrieval of image data. IEEE Trans PAMI 18(8):837–842

    Article  Google Scholar 

  21. Saha SK, Das AK, Chanda B (2004) CBIR using perception based texture and color measures. In: Proceedings of the 17th international conference on pattern recognition (ICPR ‘04), vol 2

  22. Zhang J, Li G-l, He S-w (2008) Texture-based image retrieval by edge detection matching GLCM. In: Proceedings of the 10th international conference on high performance computing and communications

  23. Jalja K, Bhagvati C, Deekshatulu BL, Pujari AK (2005) Texture element feature characterizations for CBIR. In: Proceedings of geoscience and remote sensing symposium (IGARSS ‘05), vol 2

  24. Haralick RM, Shanmugam K, Dinstein I (1973) Texture features for image classification. IEEE Trans Syst Man Cybern 8(6):610–621

    Article  Google Scholar 

  25. Weszka JS, Dyer CR, Rosenfeld A (1976) A comparative study of texture measures for terrain classification. IEEE Trans Syst Man Cybern 6(4):269–285

    MATH  Google Scholar 

  26. Conners RW, Harlow CA (1980) A theoretical comparison texture algorithms. IEEE Trans Pattern Anal Mach Intell 2:204–222

    Article  MATH  Google Scholar 

  27. Arivazhagan S, Ganesan L (2003) Texture classification using wavelet transform. Pattern Recogn Lett 24:1513–1521

    Article  MATH  Google Scholar 

  28. Hiremath PS, Shivashankar S (2006) Texture classification using wavelet packet decomposition. ICGSTs GVIP J 6(2):77–80

    Google Scholar 

  29. Liu L, Wang J, He K (2010) Breast density classification using histogram moments of multiple resolution mammograms. In: Biomedical engineering and informatics (BMEI), 3rd international conference, IEEE explore, pp 146–149. doi:10.1109/BMEI.2010.5639662

  30. Ke L, Mu N, Kang Y (2010) Mass computer-aided diagnosis method in mammogram based on texture features. In: Biomedical engineering and informatics (BMEI), 3rd international conference, IEEE explore, pp 146–149. doi:10.1109/BMEI.2010.5639662

  31. Khuzi AM, Besar R, Wan Zaki WMD (2008) Texture features selection for masses detection in digital mammogram. In: 4th Kuala Lumpur international conference on biomedical engineering 2008, IFMBE proceedings, 2008, vol 21, part 3, part 8, pp 629–632. doi:10.1007/978-3-540-69139-6_157

  32. Lai S, Li X, Bischof W (1989) On techniques for detecting circumscribed masses in mammograms. IEEE Trans Med Imaging 8(4):377–386

    Article  Google Scholar 

  33. Thangavel K, Karnan M (2005) Computer aided diagnosis in digital mammograms: detection of microcalcifications by metaheuristic algorithms. ICGST-GVIP J 5(7):41–55

    Google Scholar 

  34. Jiang J, Yao B, Wason AM (2007) A genetic algorithm design for micro calcification detection and classification in digital mammograms. Comput Med Imaging Graph 31(1):49–61

    Article  Google Scholar 

  35. Sánchez-Ferrero GV, Arribas JI (2007) A statistical-genetic algorithm to select the most significant features in mammograms, Springer Berlin/Heidelberg, pp 189–196. doi:10.1007/978-3-540-74272-2_24

  36. Bui TN, Moon BR (1996) Genetic algorithm and graph partitioning. IEEE Trans Comput 45(7):841–855

    Article  MathSciNet  MATH  Google Scholar 

  37. Petre S, Pavel P, Josef K (2004) Fast branch and bound algorithms for optimal feature selection. IEEE Trans Pattern Anal Mach Intell 26(7):900–912

    Article  Google Scholar 

  38. Woods R, Mazziotta J, Cherry S (1993) MRI-PET registration with automated algorithm. J Comput Assist Tomogr 17(4):536–546

    Article  Google Scholar 

  39. Chawla N, Hall L, Bowyer K, Kegelmeyer W (2002) SMOTE: Synthetic Minority Oversampling TEchnique. J Artif Intell Res 16:321–357

    Google Scholar 

  40. Provost F, Domingos P (2003) Tree induction for probability-based ranking. Machine Learning 52(3):199–215

    Google Scholar 

  41. Zadrozny B, Elkan C (2001) Learning and making decisions when costs and probabilities are both unknown. In: Proceedings of KDD’2001. pp 204–213

  42. Cussents (1993) J. Bayes and pseudo-bayes estimates of conditional probabilities and their reliabilities. pp 136–152

  43. Holmes G, Donkin A, Witten IH (1994) WEKA: a machine learning workbench. In: Proceedings of the second Australia and New Zealand conference on intelligent information systems, Brisbane, Australia, pp 357–361

  44. Veldkamp WJH, Karssemeijer N, Otten JDM, Hendriks JHCL (2000) Automated classification of clustered microcalcifications into malignant and benign types. Med Phys 27(11):2600–2608

    Article  Google Scholar 

  45. Wei L, Yang Y, Nishikawa RM, Jiang Y (2005) A study on several machine-learning methods for classification of malignant and benign clustered microcalcifications. IEEE Trans Med Imaging 24(2):371–380

    Google Scholar 

  46. Abu-Amara F, Abdel-Qader I (2009) Hybrid mammogram classification using rough set and fuzzy classifier. Int J Biomed Imaging 2009. doi:10.1155/2009/680508

  47. Chan H-P, Sahiner B, Patrick N, Helvie MA, Lam KL, Adler DD, Goodsitt MM (1997) Computerized classification of malignant and benign microcalcifications on mammograms: texture analysis using an artificial neural network. Phys Med Biol 42(3):549–567

    Article  Google Scholar 

  48. Vibha L, Harshavardhan GM, Pranaw K, Deepa Shenoy P, Venugopal KR, Patnaik LM (2006) Classification of mammograms using decision trees, database engineering and applications symposium, IDEAS ‘06. doi:10.1109/IDEAS.2006.14

  49. Thangavel K, Roselin R (2009) Mammogram mining with genetic optimization of ant-miner parameters. Int J Recent Trends Eng 2(3)

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Aswini Kumar Mohanty.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Mohanty, A.K., Senapati, M.R. & Lenka, S.K. A novel image mining technique for classification of mammograms using hybrid feature selection. Neural Comput & Applic 22, 1151–1161 (2013).

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: