Skip to main content

Supporting Breast Cancer Diagnosis with Multi-objective Genetic Algorithm for Outlier Detection

  • Conference paper
  • First Online:
Advanced Solutions in Diagnostics and Fault Tolerant Control (DPS 2017)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 635))

Included in the following conference series:

Abstract

Outlier detection in medical data covers a broad spectrum of medical research. In this paper, the authors propose a new approach to outlier detection based on the multi-objective genetic algorithm. In medical data, an outlier may be considered as a deviation which indicates the existence of cancerous cells in the breast. The paper presents the results of tests which were conducted on the set of medical data from the repository. The results of the study indicate that our method can be successfully applied to the medical problem in question.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Alma, Ö.G., Serdar, K., Aybars, U.: Genetic algorithm based outlier detection using bayesian information criterion in multiple regression models having multicollinearity problems. Gazi Univ. J. Sci. 22(3), 141–148 (2009)

    Google Scholar 

  2. Andrews, D.F., Pregibon, D.: Finding the outliers that matter. J. R. Stat. Soc. Ser. B (Methodol.) 40, 85–93 (1978)

    MATH  Google Scholar 

  3. Barnett, V., Lewis, T.: Outliers in Statistical Data (Probability and Mathematical Statistics). Wiley, Chichester (1994)

    MATH  Google Scholar 

  4. Chauhan, R., Kaur, H., Alam, M.A.: Data clustering method for discovering clusters in spatial cancer databases. Int. J. Comput. Appl. (0975–8887) 10, 24–28 (2010)

    Google Scholar 

  5. Cheang, M.C., van de Rijn, M., Nielsen, T.O.: Gene expression profiling of breast cancer. Annu. Rev. Pathmechdis. Mech. Dis. 3, 67–97 (2008)

    Article  Google Scholar 

  6. Cook, R.D.: Detection of influential observation in linear regression. Technometrics 19(1), 15–18 (1977)

    MathSciNet  MATH  Google Scholar 

  7. Corne, D.W., Jerram, N.R., Knowles, J.D., Oates, M.J.: Pesa-ii: Region-based selection in evolutionary multiobjective optimization. In: Proceedings of the 3rd Annual Conference on Genetic and Evolutionary Computation, pp. 283–290. Morgan Kaufmann Publishers Inc. (2001)

    Google Scholar 

  8. Crawford, K.D., Wainwright, R.L.: Applying genetic algorithms to outlier detection. In: ICGA, pp. 546–550 (1995)

    Google Scholar 

  9. Cucina, D., di Salvatore, A., Protopapas, M.K.: Outliers detection in multivariate time series using genetic algorithms. Chemometr. Intell. Lab. Syst. 132, 103–110 (2014)

    Article  Google Scholar 

  10. Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.: A fast and elitist multiobjective genetic algorithm: NSGA-ii. IEEE Trans. Evol. Comput. 6(2), 182–197 (2002)

    Article  Google Scholar 

  11. Delen, D., Walker, G., Kadam, A.: Predicting breast cancer survivability: a comparison of three data mining methods. Artif. Intell. Med. 34(2), 113–127 (2005)

    Article  Google Scholar 

  12. Duraj, A., Krawczyk, A.: Finding outliers for large medical datasets. Przeglad Elektrotechniczny 86, 188–191 (2010)

    Google Scholar 

  13. Duraj, A., Szczepaniak, P.S.: Information outliers and their detection. In: Burgin, M., Hofkirchner, W. (eds.) Information Studies and the Quest for Transdisciplinarity, vol. 9, Chap. 15, pp. 413–437. World Scientific Publishing Company (2017)

    Google Scholar 

  14. Duraj, A., Szczepaniak, P.S., Ochelska-Mierzejewska, J.: Detection of outlier information using linguistic summarization. In: Andreasen, T., et al. (eds.) Flexible Query Answering Systems 2015; Advances in Intelligent Systems and Computing 400, Proceedings of the 11th International Conference FQAS 2015, Cracow, Poland, pp. 101–113. Springer (2015)

    Google Scholar 

  15. Durillo, J.J., Nebro, A.J.: jMetal: a java framework for multi-objective optimization. Adv. Eng. Softw. 42(10), 760–771 (2011)

    Article  Google Scholar 

  16. Durillo, J.J., Nebro, A.J., Alba, E.: The jMetal framework for multi-objective optimization: design and architecture. In: 2010 IEEE Congress on Evolutionary Computation (CEC), pp. 1–8. IEEE (2010)

    Google Scholar 

  17. Elyasaf, A., Sipper, M.: Software review: the heuristiclab framework. Genetic Prog. Evol. Mach. 15(2), 215–218 (2014). doi:10.1007/s10710-014-9214-4

    Article  Google Scholar 

  18. Hadka, D.: MOEA framework: a free and open source java framework for multiobjective optimization (2012)

    Google Scholar 

  19. Hawkins, D.: Identification of Outliers. Monographs on Applied Probability and Statistics. Chapman and Hall, London (1980)

    Book  MATH  Google Scholar 

  20. Konak, A., Coit, D.W., Smith, A.E.: Multi-objective optimization using genetic algorithms: a tutorial. Reliab. Eng. Syst. Saf. 91(9), 992–1007 (2006)

    Article  Google Scholar 

  21. Lichman, M.: UCI machine learning repository (2013). http://archive.ics.uci.edu/ml

  22. Lilford, R., Mohammed, M.A., Spiegelhalter, D., Thomson, R.: Use and misuse of process and outcome data in managing performance of acute medical care: avoiding institutional stigma. Lancet 363(9415), 1147–1154 (2004)

    Article  Google Scholar 

  23. Nicolau, M., Levine, A.J., Carlsson, G.: Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival. Proc. Nat. Acad. Sci. 108(17), 7265–7270 (2011)

    Article  Google Scholar 

  24. Pincus, S.M., Gladstone, I.M., Ehrenkranz, R.A.: A regularity statistic for medical data analysis. J. Clin. Monit. Comput. 7(4), 335–345 (1991)

    Article  Google Scholar 

  25. Schwarz, G., et al.: Estimating the dimension of a model. Ann. Stat. 6(2), 461–464 (1978)

    Article  MathSciNet  MATH  Google Scholar 

  26. Shaari, F., Bakar, A.A., Hamdan, A.R.: A predictive analysis on medical data based on outlier detection method using non-reduct computation. In: International Conference on Advanced Data Mining and Applications, pp. 603–610. Springer (2009)

    Google Scholar 

  27. Sotiriou, C., Wirapati, P., Loi, S., Harris, A., Fox, S., Smeds, J., Nordgren, H., Farmer, P., Praz, V., Haibe-Kains, B., et al.: Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis. J. Nat. Cancer Inst. 98(4), 262–272 (2006)

    Article  Google Scholar 

  28. Stasiak, B., Tarasiuk, P., Michalska, I., Tomczyk, A., Szczepaniak, P.: Localization of demyelinating plaques in MRI using convolutional neural networks. In: In Proceedings of the 10th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2017), vol. 2, pp. 55–64 (2017)

    Google Scholar 

  29. Street, W.N., Wolberg, W.H., Mangasarian, O.L.: Nuclear feature extraction for breast tumor diagnosis. In: IS&T/SPIE’s Symposium on Electronic Imaging: Science and Technology, pp. 861–870. International Society for Optics and Photonics (1993)

    Google Scholar 

  30. Taloba, A.I., Marghny, M., El-Aziz, R.M.A.: Outlier detection using improved genetic k-means. Int. J. Comput. Appl. 28(11), 33–36 (2014)

    Google Scholar 

  31. Thongkam, J., Xu, G., Zhang, Y., Huang, F.: Support vector machine for outlier detection in breast cancer survivability prediction. In: Asia-Pacific Web Conference, pp. 99–109. Springer (2008)

    Google Scholar 

  32. Tolvi, J.: Genetic algorithms for outlier detection and variable selection in linear regression models. Soft Comput. 8(8), 527–533 (2004)

    Article  MATH  Google Scholar 

  33. Tomczyk, A.: Application of active contours with expert knowledge to heart ventricle segmentation. J. Appl. Comput. Sci. 21(2), 181–194 (2013). http://it.p.lodz.pl/file.php/12/2013-2/jacs-2-2013-Tomczyk.pdf

    Google Scholar 

  34. Tomczyk, A., Szczepaniak, P.S.: On the relationship between active contours and contextual classification. In: Computer Recognition Systems, pp. 303–310. Springer (2005)

    Google Scholar 

  35. Wolberg, W.H., Street, W.N., Mangasarian, O.: Machine learning techniques to diagnose breast cancer from image-processed nuclear features of fine needle aspirates. Cancer Lett. 77(2–3), 163–171 (1994)

    Article  Google Scholar 

  36. Wu, B.: Cancer outlier differential gene expression detection. Biostatistics 8(3), 566–575 (2007)

    Article  MathSciNet  MATH  Google Scholar 

  37. Xiong, X., Kim, Y., Baek, Y., Rhee, D.W., Kim, S.H.: Analysis of breast cancer using data mining & statistical techniques. In: Sixth International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, 2005 and First ACIS International Workshop on Self-assembling Wireless Networks, SNPD/SAWN 2005, pp. 82–87. IEEE (2005)

    Google Scholar 

  38. Yi, W., Fuyong, W.: Breast cancer diagnosis via support vector machines. In: Control Conference, CCC 2006, Chinese, pp. 1853–1856. IEEE (2006)

    Google Scholar 

  39. Zitzler, E., Laumanns, M., Thiele, L., et al.: SPEA2: Improving the strength pareto evolutionary algorithm (2001)

    Google Scholar 

Download references

Acknowledgment

The database used in experiments was taken from the UCI Machine Learning Repository [21].

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lukasz Chomatek .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Cite this paper

Duraj, A., Chomatek, L. (2018). Supporting Breast Cancer Diagnosis with Multi-objective Genetic Algorithm for Outlier Detection. In: Kościelny, J., Syfert, M., Sztyber, A. (eds) Advanced Solutions in Diagnostics and Fault Tolerant Control. DPS 2017. Advances in Intelligent Systems and Computing, vol 635. Springer, Cham. https://doi.org/10.1007/978-3-319-64474-5_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-64474-5_25

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-64473-8

  • Online ISBN: 978-3-319-64474-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics