Abstract
Outlier detection in medical data covers a broad spectrum of medical research. In this paper, the authors propose a new approach to outlier detection based on the multi-objective genetic algorithm. In medical data, an outlier may be considered as a deviation which indicates the existence of cancerous cells in the breast. The paper presents the results of tests which were conducted on the set of medical data from the repository. The results of the study indicate that our method can be successfully applied to the medical problem in question.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Alma, Ö.G., Serdar, K., Aybars, U.: Genetic algorithm based outlier detection using bayesian information criterion in multiple regression models having multicollinearity problems. Gazi Univ. J. Sci. 22(3), 141–148 (2009)
Andrews, D.F., Pregibon, D.: Finding the outliers that matter. J. R. Stat. Soc. Ser. B (Methodol.) 40, 85–93 (1978)
Barnett, V., Lewis, T.: Outliers in Statistical Data (Probability and Mathematical Statistics). Wiley, Chichester (1994)
Chauhan, R., Kaur, H., Alam, M.A.: Data clustering method for discovering clusters in spatial cancer databases. Int. J. Comput. Appl. (0975–8887) 10, 24–28 (2010)
Cheang, M.C., van de Rijn, M., Nielsen, T.O.: Gene expression profiling of breast cancer. Annu. Rev. Pathmechdis. Mech. Dis. 3, 67–97 (2008)
Cook, R.D.: Detection of influential observation in linear regression. Technometrics 19(1), 15–18 (1977)
Corne, D.W., Jerram, N.R., Knowles, J.D., Oates, M.J.: Pesa-ii: Region-based selection in evolutionary multiobjective optimization. In: Proceedings of the 3rd Annual Conference on Genetic and Evolutionary Computation, pp. 283–290. Morgan Kaufmann Publishers Inc. (2001)
Crawford, K.D., Wainwright, R.L.: Applying genetic algorithms to outlier detection. In: ICGA, pp. 546–550 (1995)
Cucina, D., di Salvatore, A., Protopapas, M.K.: Outliers detection in multivariate time series using genetic algorithms. Chemometr. Intell. Lab. Syst. 132, 103–110 (2014)
Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.: A fast and elitist multiobjective genetic algorithm: NSGA-ii. IEEE Trans. Evol. Comput. 6(2), 182–197 (2002)
Delen, D., Walker, G., Kadam, A.: Predicting breast cancer survivability: a comparison of three data mining methods. Artif. Intell. Med. 34(2), 113–127 (2005)
Duraj, A., Krawczyk, A.: Finding outliers for large medical datasets. Przeglad Elektrotechniczny 86, 188–191 (2010)
Duraj, A., Szczepaniak, P.S.: Information outliers and their detection. In: Burgin, M., Hofkirchner, W. (eds.) Information Studies and the Quest for Transdisciplinarity, vol. 9, Chap. 15, pp. 413–437. World Scientific Publishing Company (2017)
Duraj, A., Szczepaniak, P.S., Ochelska-Mierzejewska, J.: Detection of outlier information using linguistic summarization. In: Andreasen, T., et al. (eds.) Flexible Query Answering Systems 2015; Advances in Intelligent Systems and Computing 400, Proceedings of the 11th International Conference FQAS 2015, Cracow, Poland, pp. 101–113. Springer (2015)
Durillo, J.J., Nebro, A.J.: jMetal: a java framework for multi-objective optimization. Adv. Eng. Softw. 42(10), 760–771 (2011)
Durillo, J.J., Nebro, A.J., Alba, E.: The jMetal framework for multi-objective optimization: design and architecture. In: 2010 IEEE Congress on Evolutionary Computation (CEC), pp. 1–8. IEEE (2010)
Elyasaf, A., Sipper, M.: Software review: the heuristiclab framework. Genetic Prog. Evol. Mach. 15(2), 215–218 (2014). doi:10.1007/s10710-014-9214-4
Hadka, D.: MOEA framework: a free and open source java framework for multiobjective optimization (2012)
Hawkins, D.: Identification of Outliers. Monographs on Applied Probability and Statistics. Chapman and Hall, London (1980)
Konak, A., Coit, D.W., Smith, A.E.: Multi-objective optimization using genetic algorithms: a tutorial. Reliab. Eng. Syst. Saf. 91(9), 992–1007 (2006)
Lichman, M.: UCI machine learning repository (2013). http://archive.ics.uci.edu/ml
Lilford, R., Mohammed, M.A., Spiegelhalter, D., Thomson, R.: Use and misuse of process and outcome data in managing performance of acute medical care: avoiding institutional stigma. Lancet 363(9415), 1147–1154 (2004)
Nicolau, M., Levine, A.J., Carlsson, G.: Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival. Proc. Nat. Acad. Sci. 108(17), 7265–7270 (2011)
Pincus, S.M., Gladstone, I.M., Ehrenkranz, R.A.: A regularity statistic for medical data analysis. J. Clin. Monit. Comput. 7(4), 335–345 (1991)
Schwarz, G., et al.: Estimating the dimension of a model. Ann. Stat. 6(2), 461–464 (1978)
Shaari, F., Bakar, A.A., Hamdan, A.R.: A predictive analysis on medical data based on outlier detection method using non-reduct computation. In: International Conference on Advanced Data Mining and Applications, pp. 603–610. Springer (2009)
Sotiriou, C., Wirapati, P., Loi, S., Harris, A., Fox, S., Smeds, J., Nordgren, H., Farmer, P., Praz, V., Haibe-Kains, B., et al.: Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis. J. Nat. Cancer Inst. 98(4), 262–272 (2006)
Stasiak, B., Tarasiuk, P., Michalska, I., Tomczyk, A., Szczepaniak, P.: Localization of demyelinating plaques in MRI using convolutional neural networks. In: In Proceedings of the 10th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2017), vol. 2, pp. 55–64 (2017)
Street, W.N., Wolberg, W.H., Mangasarian, O.L.: Nuclear feature extraction for breast tumor diagnosis. In: IS&T/SPIE’s Symposium on Electronic Imaging: Science and Technology, pp. 861–870. International Society for Optics and Photonics (1993)
Taloba, A.I., Marghny, M., El-Aziz, R.M.A.: Outlier detection using improved genetic k-means. Int. J. Comput. Appl. 28(11), 33–36 (2014)
Thongkam, J., Xu, G., Zhang, Y., Huang, F.: Support vector machine for outlier detection in breast cancer survivability prediction. In: Asia-Pacific Web Conference, pp. 99–109. Springer (2008)
Tolvi, J.: Genetic algorithms for outlier detection and variable selection in linear regression models. Soft Comput. 8(8), 527–533 (2004)
Tomczyk, A.: Application of active contours with expert knowledge to heart ventricle segmentation. J. Appl. Comput. Sci. 21(2), 181–194 (2013). http://it.p.lodz.pl/file.php/12/2013-2/jacs-2-2013-Tomczyk.pdf
Tomczyk, A., Szczepaniak, P.S.: On the relationship between active contours and contextual classification. In: Computer Recognition Systems, pp. 303–310. Springer (2005)
Wolberg, W.H., Street, W.N., Mangasarian, O.: Machine learning techniques to diagnose breast cancer from image-processed nuclear features of fine needle aspirates. Cancer Lett. 77(2–3), 163–171 (1994)
Wu, B.: Cancer outlier differential gene expression detection. Biostatistics 8(3), 566–575 (2007)
Xiong, X., Kim, Y., Baek, Y., Rhee, D.W., Kim, S.H.: Analysis of breast cancer using data mining & statistical techniques. In: Sixth International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing, 2005 and First ACIS International Workshop on Self-assembling Wireless Networks, SNPD/SAWN 2005, pp. 82–87. IEEE (2005)
Yi, W., Fuyong, W.: Breast cancer diagnosis via support vector machines. In: Control Conference, CCC 2006, Chinese, pp. 1853–1856. IEEE (2006)
Zitzler, E., Laumanns, M., Thiele, L., et al.: SPEA2: Improving the strength pareto evolutionary algorithm (2001)
Acknowledgment
The database used in experiments was taken from the UCI Machine Learning Repository [21].
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Duraj, A., Chomatek, L. (2018). Supporting Breast Cancer Diagnosis with Multi-objective Genetic Algorithm for Outlier Detection. In: Kościelny, J., Syfert, M., Sztyber, A. (eds) Advanced Solutions in Diagnostics and Fault Tolerant Control. DPS 2017. Advances in Intelligent Systems and Computing, vol 635. Springer, Cham. https://doi.org/10.1007/978-3-319-64474-5_25
Download citation
DOI: https://doi.org/10.1007/978-3-319-64474-5_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-64473-8
Online ISBN: 978-3-319-64474-5
eBook Packages: EngineeringEngineering (R0)