Distance Similarity as a CBR Technique for Early Detection of Breast Cancer: An Egyptian Case Study

  • Heba Ayeldeen
  • Olfat Shaker
  • Osman Hegazy
  • Aboul Ella Hassanien
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 340)


Case-based reasoning as a concept covers almost a lot of technologies and techniques including knowledge management, artificial intelligence, machine learning techniques as well as database technology. The usage of all these technologies can easily aid in early detection of breast cancer as well as help other decision makers take the right decision on time and all the times. Of the main hot topics nowadays concerning executive managers and decision makers is measuring the similarity between objects. For better performance most organizations are in need on semantic similarity and similarity measures. This article presents mathematically different distance metrics used for measuring the binary similarity between quantitative data within cases. The case study represents a quantitative data of breast cancer patients within Faculty of medicine Cairo University. The experimental results show that the squared chord distance yields better with a 96.76 % without normalization that correlate more closely with human assessments compared to other distance measures used in this study.


Knowledge management Classification Similarity measure Association Categorization Protein analysis 


  1. 1.
    Pal, S.K., Shiu, S.C.K.: Foundations of soft case-based reasoning. In: Wiley series on intelligent systems, New York, pp. 3–25 (2004)Google Scholar
  2. 2.
    Park, M.-K., Lee, I., Shon, K.-M.: Using case based reasoning for problem solving in a complex production process. Expert Syst. Appl. 15, 69–75 (1998)CrossRefGoogle Scholar
  3. 3.
    Grupe, F.H., Urwiler, R., Ramarapu, N.K., Owrang, M.: The application of case-based reasoning to the software development process. Inf. Softw. Technol. 40, 493–499 (1998)CrossRefGoogle Scholar
  4. 4.
    Watson, I.: Case-based reasoning is a methodology not a technology. J. Knowl. Based Syst. 12, 303–308 (1999)CrossRefGoogle Scholar
  5. 5.
    Singh, S. K.: Database Systems: Concepts, Designs and Applications. Pearson Education India, New Delhi (2006)Google Scholar
  6. 6.
    Zawbaa, H.M., El-Bendary, N., Hassanien, A.E., Kim, T.-H.: Machine learning-based soccer video summarization system. In: Kim, T.-H., Gelogo, Y. (eds.) MulGraB 2011, Part II. CCIS, vol. 263, pp. 19–28. Springer, Heidelberg (2011)Google Scholar
  7. 7.
    Kolodner, J.L.: An introduction to case-based reasoning. Artif. Intell. Rev. 6, 3–34Google Scholar
  8. 8.
    Ghany, K.K.A., Hassanien, A.E., Gerald, S.: Similarity measures for fingerprint matching. In: International Conference on Image Processing, Computer Vision, and Pattern Recognition, USA, pp. 21–24 (2014)Google Scholar
  9. 9.
    Schank, R.C., Abelson, R.P.: Scripts, Plans, Goals, and Understanding: An Inquiry into Human Knowledge Structures. Lawrence Erlbaum, Hillsdale (1977)MATHGoogle Scholar
  10. 10.
    Fayyad, U., Piatetsky-Shapiro, G., Smyth, P.: Knowledge discovery and data mining: towards a unifying framework. In: Simoudis, E., Han, J., Fayyad, U. (eds.) Proceedings of KDD’96, Second International Conference on Knowledge Discovery and Data Mining, pp. 82–88 (1996)Google Scholar
  11. 11.
    Rezvana, M.T., Zeinal Hamadania, A., Shalbafzadehb, A.: Case-based reasoning for classification in the mixed data sets employing the compound distance methods. Eng. Appl. Artif. Intell. 26(9), 2001–2009 (2013)Google Scholar
  12. 12.
    Choi, S., Cha, S., Tappert, C.: A survey of binary similarity and distance measures. J. Syst. Cybern. Inf. 8(1), 43-48 (Key: citeulike:7358808) (2010)Google Scholar
  13. 13.
    Huang, A.: Similarity measures for text document clustering. In: New Zealand Computer Science Research Student Conference 2008, pp. 49–56Google Scholar
  14. 14.
    Mc Gill, M.J., Salton, G.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)Google Scholar
  15. 15.
    Leydesdorff, L.: Similarity measures, author cocitation analysis, and information theory. J. Am. Soc. Inform. Sci. Technol. 56, 769–772 (2005)CrossRefGoogle Scholar
  16. 16.
    Bandyopadhyay, S, Saha, S.: Unsupervised Classification: Similarity Measures, Classical and Metaheuristic Approaches, and Applications. Springer, Berlin (2013)Google Scholar
  17. 17.
    Sandhya, N., Lalitha, Y.S., Govardhan, A., Anuradha, K.: Analysis of similarity measures for text clustering. In: International Conference on Information Systems Design and Intelligent Applications, Vishakhapatnam, India, p. 976 (2012)Google Scholar
  18. 18.
    Torres, G.J., Basnet, R.B., Sung, A.H., Mukkamala, S, Ribeiro, B.M.: A similarity measure for clustering and its applications. Int. J. Electr. Comput. Syst. Eng. 3, 164 (2009)Google Scholar
  19. 19.
    De Baets, B., Janssens, S., De Meyer, H.: On the transitivity of a parametric family of cardinality-based similarity measures. Int. J. Approximate Reasoning 50, 104–116 (2009)MathSciNetCrossRefMATHGoogle Scholar
  20. 20.
    Jiang, D., Tang, C., Zhang, A.: Cluster analysis for gene expression data: a survey. IEEE Trans. Knowl. Data Eng. 16, 1370–1386 (2004)Google Scholar
  21. 21.
    Goshtasby, A.A.: Similarity and dissimilarity measures. In: Image Registration: Principles, Tools and Methods, p. 463. Springer, Berlin (2012)Google Scholar
  22. 22.
    Mathur, S., Dinakarpandian, D.: Finding disease similarity based on implicit semantic similarity. J. Biomed. Inf. 45, 363-371Google Scholar
  23. 23.
    Zwick, R., Caristein, E., Budescu, D.V.: Measures of similarity among fuzzy concepts: a comparative analysis. Int. J. Approximate Reasoning 1, 221–242Google Scholar
  24. 24.
    Chang, K.-C., Wen, C., Yeh, M.-F., Lee, R.-G.: A comparision of similarity measures for clustering of QRS complexes. Biomed. Eng. Appl. Basis Commun. 17, 324–331 (2005)Google Scholar
  25. 25.
    Sanchez, D., Batet, M.: Semantic similarity estimation in the biomedical domain: an ontology-based information-theoretic perspective. J. Biomed. Inf. 44, 749–759 (2011)Google Scholar
  26. 26.
    Pesquita, C., Faria, D., Falcao, A.O., Lord, P., Couto, F.M.: Semantic similarity in biomedical ontologies. PLOS: Comput. Biol. 5, e1000443 (2009)Google Scholar
  27. 27.
    Ahlgren, P., Jarneving, B., Rousseau, R.: Requirement for a cocitation similarity measure, with special reference to Pearsons correlation coefficient. J. Am. Soc. Inf. Sci. Technol. 54, 550–560 (2003)Google Scholar
  28. 28.
    Shaker, O., Taha, F., Salah, M., Elmarzoky, M.: LAPTM4B gene expression and polymorphism as diagnostic markers of breast cancer in Egyptian patients. J. Med. Biochem. (2014) (In press)Google Scholar
  29. 29.
    Yin, M., Lou, C., Zhang, W., Meng, F., Zhang, H., Ning, X., Zhou, R., Dong, X., Lou, G.: LAPTM4B overexpression is a novel independent prognostic marker for metastatic ovarian tumors. Int. J. Gynecol. Cancer 22(1), 54–62 (2012)CrossRefGoogle Scholar
  30. 30.
    Fan, M., Liu, Y., Zhou, R., Zhang, Q.: Association of LAPTM4B gene polymorphism with breast cancer susceptibility. J. Cancer Epidemiol. 36(4), 364–368 (2012)CrossRefGoogle Scholar

Copyright information

© Springer India 2015

Authors and Affiliations

  • Heba Ayeldeen
    • 1
  • Olfat Shaker
    • 3
  • Osman Hegazy
    • 1
  • Aboul Ella Hassanien
    • 1
    • 2
  1. 1.Scientific Research Group in Egypt (SRGE)CairoEgypt
  2. 2.Faculty of Computers and InformationCairo UniversityCairoEgypt
  3. 3.Department of Molecular Biology, Faculty of MedicineCairo UniversityCairoEgypt

Personalised recommendations