1 Introduction

As the twenty-first century progresses, globalization, industrialization, and changes in people's lifestyles are reshaping the pattern of diseases [1]. Until recently, contagious diseases were considered the major health problem in third world countries, but the growing contribution of non-contagious diseases to mortality, especially in developing countries, is now a serious threat. Diabetes is one of the most important diseases in this group [2]. Diabetes is a chronic endocrine disorder characterized by a malfunction in glucose metabolism due to problems with the production or utilization of the hormone insulin. Its long-term risks are extremely serious: premature death, blindness, loss of limbs if gangrene is not controlled, and impotence. Patients who require insulin treatment and whose disease began in childhood, adolescence, or early adulthood are particularly at risk for such problems [1].

Self-care behavior, a key concept in health promotion, refers to the decisions and activities a person can use to adapt to a health problem or improve their health. Self-care behaviors prevent early and late complications of the disease and help ensure a long life for the patient. In diabetes, self-care is one of the most important factors for controlling the disease. Empowerment and acceptance status are personality factors that affect patients' condition and increase their ability to deal with problems such as illness. According to existing studies, the most important predictor of mortality in diabetic patients is lack of self-care [3].

Nowadays, medical centers collect large volumes of data on various diseases for a variety of goals. Mining these data to obtain useful results and models for diseases is one of those goals. However, the sheer volume of data, and the confusion resulting from it, prevents us from achieving acceptable results by manual analysis. Data mining is therefore used to overcome this problem and find useful relationships between risk factors in diseases [1].

The intensity of competition in the scientific, social, economic, political, and military fields has increased the importance of fast access to information. There is therefore a clear need for systems that can quickly discover information of interest to users with minimal human intervention, and for analysis methods that scale with the volume of bulk data. At present, data mining is the most important technology for the efficient, accurate, and rapid processing of bulk data, and its importance is increasing. Data mining bridges statistics, computer science (CS), artificial intelligence (AI), pattern recognition (PR), and machine learning. It is a complex process for identifying correct, new, and potentially useful patterns and models in a large amount of data, such that these patterns and models are understandable to humans [4].

Data mining is not a product that can be purchased but a scientific process that must be implemented as a project. Data are often bulky and cannot be used directly; it is the knowledge hidden in the data that is valuable. Therefore, using data mining to identify patterns and models, as well as the relationships between different elements in a database, to discover the knowledge behind the data and ultimately convert data into information, becomes ever more essential. Data mining usually refers to the discovery of useful patterns among data, where a useful pattern is a model that describes the relationship between a subset of the data and is valid, simple, understandable, and new [4].

In the information age, data are one of the most important assets of any organization. However, data become a valuable resource only when used correctly. To transform the potential value of data into usable information and knowledge, many organizations have adopted data mining, through which it is possible to discover the relationships, trends, and patterns hidden in data and to gain new knowledge about explicit and latent organizational challenges [5].

In this paper, we build a data mining system that first preprocesses data collected from laboratory tests of 1573 patients in the endocrinology department of Mazandaran University of Medical Sciences. Second, we use a one-versus-all SVM classifier to predict, from each patient's medical data, seven different diabetic complications: eye problems, high blood pressure, dialysis history, heart attack, stroke, diabetic foot ulcer, and diabetic coma. Third, we improve the accuracy of the SVM by feeding it features selected with an improved grey wolf optimizer (GWO). The improved GWO applies a weighted adaptive middle filter (WAMF) at each step of the algorithm to filter outlier solutions (wolves far from the target) through a dynamic window. The GWO algorithm [6] belongs to the family of swarm intelligence algorithms [7], which are widely used in many practical applications [8–10]. In this paper, we show how the GWO algorithm can be used in the medical domain.

In brief, the structure of the paper is organized as follows: In Sect. 2, related work is presented. The proposed method is fully described in Sect. 3. The simulation results of the proposed algorithm and conclusion are summarized in Sects. 4 and 5, respectively.

2 Related work

Until now, many classification methods have been proposed for diabetes diagnosis problems; they can be broadly grouped into three major categories.

Artificial neural network (ANN)-based classification methods are the most frequently reported in the literature. In 2007, Anbananthen et al. [11] used an ANN and a DT built with the C4.5 algorithm to diagnose diabetes in individuals based on features such as age and blood pressure. In 2008, Chan et al. [12] studied the microvascular complications of diabetes by comparing the C5.0 algorithm and the multilayer perceptron neural network (MLP NN); different factors were identified for each complication, and their effect on each complication was studied. Patil and Durga [13] used the Apriori algorithm to generate association rules for finding hidden relationships between variables. In 2009, Fang [14] used various data mining techniques to cluster patients with diabetes; the important features considered were age, family history, and weight, and the accuracy of the clustering-based model was 80%. In 2014, Ganapathy et al. [15] proposed a pattern classification system combining temporal features with a fuzzy min–max (TFMM) neural-network-based classifier for effective decision support in medical diagnosis. In that work, a particle swarm optimization (PSO)-based rule extractor was proposed to improve detection accuracy, and the accuracy of the TFMM-PSO method was compared with other methods [16–20] using the University of California Irvine (UCI) Machine Learning Repository dataset [21]. Most of the reviewed methods fail to select a proper number of features, which makes the classifiers slow.

Decision-tree-based algorithms form the second batch of methods used for diabetes prediction. Breault et al. [22] performed classification and regression analysis using the classification and regression tree (CART) system in 2002 and deduced dependencies between a series of features; the classification accuracy was 59.9%. Miyaki et al. [23] also used the CART method to judge the factors influencing the incidence of diabetes in 2002. Rohlfing et al. [24] used linear regression analysis to examine the relationship between type 1 diabetes and HbA1c in 2002. Silverstein et al. [25] performed experiments on three medical databases, produced rules, and then compared these rules with predetermined ones.

Trautvetter et al. [26] used association rules and a decision tree (DT) to extract knowledge from a medical database. Juan et al. [27] developed a type 2 diabetes data processing system (DDPS) using a combination of the C4.5 and EM (expectation maximization) algorithms in 2007. Jarullah [28] used a DT, generated with the J48 decision tree classification algorithm (DTCA) in the Weka software, to diagnose type 2 diabetes. Aljumah et al. [29] used regression to analyze the prediction of diabetes treatment in young and old age groups based on drug treatment and side effects. Antonelli et al. [30] proposed a multi-level clustering-based analysis framework for identifying treatment pathways and examining patients for specific diseases; the method worked well in identifying groups of patients with similar disease histories and increasing severity of complications. All decision-tree-based algorithms need prior knowledge about the different classes, which requires many expert-annotated samples to design the tree.

SVM-based algorithms are the third type of method found in our literature review. In 2007, Huang et al. [31] studied the major factors affecting diabetes control by using feature selection in a patient management system. In 2008, Han et al. [32] predicted diabetes in a patient database using the RapidMiner software and the ID3 decision tree algorithm (DTA). In 2007, Cho et al. [33] predicted the presence of neuropathy in diabetic patients using SVM classification, feature selection, and visualization. In [34], the authors attempted to diagnose diabetes using data mining algorithms, which are very important in diagnosis and prediction; SVM, k-nearest neighbor, Bayes network (BN), ID3, C4.5, C5.0, and CART were used for diabetes detection. That study used 768 diabetic patients from the PID dataset with 8 important features, 80% of which were used as training data and 20% as test data; the results show that the SVM model is more accurate than the other algorithms, with an accuracy of 81.77%. Han et al. [35] developed a batch system for the diagnosis of diabetes; they used SVM specifically to screen for diabetes while adding a group learning module to make the black box of SVM decisions more comprehensible and transparent, and their scheme is also a useful method for handling the class imbalance problem. Radha and Srinivasan [36] used three classification methods, C4.5, SVM, and k-nearest neighbor, to predict diabetes, comparing the results of supervised data mining algorithms on the UCI disease dataset based on accuracy, computation time, and bootstrap accuracy. In [37, 38], the authors used hybrid methods for feature selection and SVM for classification. Existing databases contain indistinct and redundant features that strongly affect the success of the classification tool and the system processing time, so the systems developed in those studies attempted to increase speed and success by eliminating redundant features; the feature selection algorithm based on the Bee Colony Optimization Algorithm (BCOA) developed there was the first use of BCOA for feature selection. We also choose SVM to classify diabetic complications. However, SVM alone is not very accurate, so we improve it by selecting relevant features with an improved GWO.

3 The proposed data mining system

In this section, we describe the preprocessing method, the improved GWO, and the feature selection part of the SVM classifier, which together form a complete data mining system.

3.1 Data aggregation

Required data are collected from the endocrinology department of Mazandaran University of Medical Sciences. The file information is from the second half of the year 2015. There are 1573 initial patient records, 53 of which lack complete information. The average age of the patients is 53 years; 30% are male and the rest are female, and 70% have a family history of diabetes. The laboratory features of the patients are evaluated and identified at this stage. For each patient, 23 features are registered: name, family, file number, address, height, weight, age, body mass index, gender, heredity, maximum blood pressure, minimum blood pressure, education, fasting blood sugar, 2-h blood sugar, cholesterol, harmful fat, useful fat, triglyceride, blood urea, creatinine, activity rate, and tobacco use, together with 8 complications: high blood lipids, eye complications, high blood pressure, dialysis history, cardiac problems, stroke, diabetic foot ulcer, and diabetic coma.

3.2 Preprocessing

Preprocessing in data mining usually involves data cleaning (DC), data integration (DI), data reduction (DR), and data transformation (DT). In the real world, data are rarely perfect, and this is especially true of medical information. Therefore, if the quality of the data is not good enough, preprocessing steps should be performed to improve data quality and deliver high-quality data to the data mining algorithm, minimizing the impact of data weaknesses. Data preprocessing and preparation usually consumes more than 70% of the time required for data mining, and 75–90% of the success of data mining projects depends on it. In this study, the KNIME software (https://www.knime.com/) is used for data preprocessing and preparation. Table 1 describes the details of the dataset.

Table 1 Field description of the Mazandaran University of Medical Sciences dataset

In this work, we employ the DC and DT preprocessing steps, which are discussed in the following subsections.

3.2.1 Data cleaning (DC)

DC, sometimes called data cleansing, is the process of detecting and deleting or correcting corrupt or inaccurate records in a database. It focuses on quantifying or removing null attributes, smoothing noisy values, and detecting and deleting out-of-bounds values.

In this paper, information such as name, family, file number, and address is removed from the file. Next, we excluded the records of patients with incomplete test information, such as cases with zero values for blood pressure, fasting blood glucose, blood glucose 2 h after a meal, or triglyceride, because of the impact of these features on the final result. Incomplete test results occur in two cases: either the patient does not cooperate and does not complete the test process, or the patient is not recognized as diabetic at the first visit. Chen and Astebro [39] showed that rational deletion, together with imputation techniques such as mean substitution, random assignment, regression imputation, and Bayesian models, is an efficient way of handling missing values of important features. Samples with several missing features are deleted, and the remaining missing values are initialized using common and probable values. Some features, such as blood urea and creatinine, are not important on their own: a urea-to-creatinine ratio between 10 and 20 mg/dL (milligrams per deciliter) indicates a normal condition, while more than 20 mg/dL suggests gastrointestinal bleeding or urinary tract obstruction, so the ratio of these features indicates the likelihood of a kidney complication. Similarly, height and weight are not important in isolation, but the derived body mass index is effective. As a result, these raw features have been removed and the related indicators used instead.
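To make these cleaning steps concrete, here is a minimal pandas sketch; the column names (`sys_bp`, `fbs`, `bs_2h`, `triglyceride`, `urea`, `creatinine`, `weight_kg`, `height_m`) are hypothetical placeholders for the fields described above, not the dataset's actual schema.

```python
import pandas as pd

def clean_records(df: pd.DataFrame) -> pd.DataFrame:
    # Drop identifying fields that carry no predictive value.
    df = df.drop(columns=["name", "family", "file_number", "address"], errors="ignore")

    # Exclude records with zero values in the tests that drive the final result.
    key_tests = ["sys_bp", "fbs", "bs_2h", "triglyceride"]
    df = df[(df[key_tests] != 0).all(axis=1)]

    # Urea and creatinine matter only through their ratio (kidney-complication indicator);
    # height and weight matter only through the body mass index (Eq. 1).
    df = df.assign(urea_creatinine_ratio=df["urea"] / df["creatinine"],
                   bmi=df["weight_kg"] / df["height_m"] ** 2)

    # The raw components are dropped in favour of the derived indicators.
    return df.drop(columns=["urea", "creatinine", "weight_kg", "height_m"])
```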

Classification (binning) is a data preprocessing technique that minimizes the impact of minor errors that occur when data are recorded, and it is used here to handle noise in the data. Data can be categorized in various ways, and the data of each category can then be represented in a more general form. Based on reputable scientific and medical resources and with the approval of a specialist physician, features such as body mass index, systolic blood pressure, diastolic blood pressure, fasting blood sugar, 2-h postprandial blood sugar, cholesterol, high-density lipoprotein (HDL), low-density lipoprotein (LDL), and triglyceride are classified as follows.

\(\bullet\) Body mass index classification

Body mass index is a statistical measure for comparing a person's weight and height. It does not measure obesity directly but is a useful tool for estimating a healthy weight for a given height. The index was developed between 1830 and 1850. It is very simple to calculate and is widely used to determine overweight and underweight. Body mass index is obtained by dividing a person's weight in kilograms by the square of their height in meters, as shown in Eq. 1. Table 2 shows the body mass index classification.

$$\begin{aligned} \mathrm{{Body}}\;\mathrm{{mass}}\;\mathrm{{index}}= \frac{\mathrm{{weight}}\;\mathrm{{in}}\; \mathrm{{kilograms}}}{\left( \mathrm{{height}}\;\mathrm{{in}}\;\mathrm{{meters}}\right) ^2} \end{aligned}$$
(1)
Table 2 Body mass index classification
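As an illustration of Eq. 1 together with a binning of the kind shown in Table 2, here is a small sketch; the cut-offs follow the common WHO convention and are an assumption, since Table 2 defines the exact classes used in this paper.

```python
def bmi_category(weight_kg: float, height_m: float) -> str:
    bmi = weight_kg / height_m ** 2          # Eq. 1
    if bmi < 18.5:                           # assumed WHO-style cut-offs
        return "underweight"
    if bmi < 25.0:
        return "normal"
    if bmi < 30.0:
        return "overweight"
    return "obese"

# Example: 80 kg at 1.75 m -> BMI of about 26.1, classified as "overweight".
print(bmi_category(80, 1.75))
```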

\(\bullet\) High-blood-pressure classification

High blood pressure (hypertension) is a chronic condition in which the blood pressure in the arteries is elevated. Following this increase in pressure, the heart must work harder than normal to maintain circulation through the blood vessels. Blood pressure has two scales, systolic and diastolic, corresponding to the contraction (systole) and relaxation (diastole) of the heart muscle between beats. Nearly 50% of patients with high blood pressure are unaware of their disease, and many learn of it only accidentally. Table 3 shows the classification of systolic and diastolic blood pressure; the units are millimeters of mercury (mmHg).

Table 3 The classification of systolic and diastolic blood pressure

\(\bullet\) Blood sugar classification

High blood glucose (sugar) is one of the risk factors that increase the risk of complications of diabetes. This dataset includes two types of blood sugar test (fasting and 2 h after a meal). Table 4 shows the blood sugar classification; the values in this table are measured in milligrams per deciliter (mg/dL).

Table 4 Blood sugar classification

\(\bullet\) Cholesterol classification

Cholesterol is a fatty, wax-like substance that is made in the liver and other cells. Cholesterol moving through the blood attaches to proteins, forming a package called a lipoprotein. Lipoproteins are divided into high-density and low-density groups. Tables 5, 6, and 7 show the classification of cholesterol, HDL, and LDL.

Table 5 Cholesterol classification
Table 6 HDL classification
Table 7 LDL classification

\(\bullet\) Triglyceride classification

Triglyceride is a type of fat in the body that acts as an energy source. When a lot of energy is needed, the body breaks down these fats and converts them into energy for the cells to use. However, increased levels of triglycerides in the blood can block arteries and damage the pancreas. Table 8 shows the triglyceride classification.

Table 8 Triglyceride classification

In statistics, outliers are data points that lie far from the rest of the data. Different methods, such as regression and clustering, are used to deal with outliers and smooth them. In this work, the box plot in the KNIME software is used to handle the outlier problem.
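A box plot flags values beyond the whiskers as outliers. The following numpy sketch applies the usual interquartile-range whisker rule and clips the offending values; whether the KNIME node clips, removes, or otherwise smooths them is an assumption here.

```python
import numpy as np

def smooth_outliers(x: np.ndarray) -> np.ndarray:
    q1, q3 = np.percentile(x, [25, 75])
    iqr = q3 - q1
    lo, hi = q1 - 1.5 * iqr, q3 + 1.5 * iqr   # box-plot whisker limits
    # Clip out-of-bounds values to the whiskers instead of deleting the records.
    return np.clip(x, lo, hi)
```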

3.2.2 Data transformation (DT)

Data transformation, also called data conversion, converts and consolidates data into a form suitable for data mining. There are several transformation methods, such as minimum–maximum normalization. Normalization puts data into a similar domain: a data miner may encounter features whose values lie in very different ranges, and large-valued features may have a much greater impact on the cost function than small-valued ones. Normalizing the features so that their values share the same domain solves this problem, allows more accurate comparison of different datasets, and reduces the impact of sharp differences between feature values. Before model training begins, each value is divided by the largest corresponding value so that it is normalized to lie between zero and one. This minimizes the effect of the actual scale, places all entries in the same domain, and makes it possible to compare data measured with different criteria.

Equation 2 shows details of the min–max normalization used in our data conversion phase:

$$\begin{aligned} X'=\frac{X-X_{\mathrm{{min}}}}{X_{\mathrm{{max}}}-X_{\mathrm{{min}}}} \end{aligned}$$
(2)
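A vectorized, per-feature version of Eq. 2 (a sketch; the small epsilon guarding against constant-valued features is our addition, not part of the paper):

```python
import numpy as np

def min_max_normalize(X: np.ndarray) -> np.ndarray:
    x_min, x_max = X.min(axis=0), X.max(axis=0)
    return (X - x_min) / (x_max - x_min + 1e-12)   # Eq. 2, applied column-wise
```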

3.3 Proposed classification method

There are many data mining methods for modeling. In this paper, SVM classification is used to find the optimal model and pattern, with modeling carried out in the KNIME software. The main approach used here is predictive data mining. Ten-fold cross-validation, a common technique for estimating classifier efficiency, is used to split the training and test data and to evaluate the performance of the proposed method.

In short, training is the process of providing feedback to the algorithm to tune its predictive power, and testing is the process of determining the true accuracy of the resulting classifier; during testing, data that never participated in training are classified. Usually, validation is performed after each training step: the validation step provides no feedback for adjusting the classifier but only indicates when the training algorithm should terminate. The error and mean error are then calculated at each stage. To determine the category label (the type of complication), after consulting diabetes specialists it was concluded that, for greater accuracy, each complication should be studied separately rather than splitting the complications into microvascular and macrovascular groups. Accordingly, the category labels (types of complication) in the created model are shown in Table 9.

Table 9 Model category label
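The protocol described above corresponds to ten-fold cross-validation around a one-versus-all SVM. A scikit-learn sketch of that setup follows; the arrays `X` and `y` are random placeholders for the preprocessed patient features and the complication labels of Table 9.

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.multiclass import OneVsRestClassifier
from sklearn.svm import SVC

X = np.random.rand(200, 18)            # placeholder: preprocessed patient features
y = np.random.randint(0, 8, 200)       # placeholder: complication labels (Table 9)

clf = OneVsRestClassifier(SVC(kernel="rbf"))   # one-versus-all SVM
scores = cross_val_score(clf, X, y, cv=10)     # ten-fold cross-validation
print(f"mean accuracy {scores.mean():.3f}, error {1 - scores.mean():.3f}")
```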

3.3.1 Grey wolf optimizer (GWO)

GWO is one of the more recent optimization methods, designed and implemented based on the social behavior and hunting strategy of grey wolves. For some problems, this algorithm provides better results than other algorithms, such as the PSO algorithm and the multi-objective decomposition-based evolutionary algorithm [40].

Grey wolves are considered apex predators, since they have no natural hunters. They usually live in groups of 5–20 wolves. The leaders, known as alphas (the best solution), decide on hunting. The second level of the hierarchy is the beta class (second-best solutions); beta wolves help the alphas in decision making and other group activities. Wolves that are not alpha, beta, or omega belong to the delta category (third-best solutions); delta wolves follow the alphas and betas but dominate the omegas. The lowest level in the hierarchy is the omega wolves (the rest of the candidate solutions), which play the role of scapegoat and must submit to the higher classes.

In brief, the common steps of the GWO algorithm are as follows:

  • Generate initial population of wolves based on a set of random solutions,

  • Calculate the corresponding objective value for each wolf,

  • Choose the three best wolves and save them as alpha, beta, and delta,

  • Update the position of the rest of the population (omega wolves) using the equations given in [40],

  • Update parameters a, A, and C,

  • Go to the second step if the criterion is not satisfied,

  • The position and score of the alpha solution are returned as the best solution.

3.3.2 Improved GWO using weighted adaptive middle filter

The most important factor controlling the performance and accuracy of an optimization algorithm is the trade-off between exploration and exploitation. Exploration is the ability of the search algorithm to explore different areas of the search space to locate a promising optimum; exploitation is the ability to focus the search within a promising range to refine the solution. A good optimization algorithm balances these two contradictory goals, and any algorithm, or improved version of it, aims to raise performance by controlling them. Experience shows that exploration should dominate the early iterations, with exploitation becoming more pronounced over time: in the initial iterations the algorithm searches the space broadly, and in the final iterations it searches the discovered regions more precisely.

To increase the efficiency and accuracy of the GWO in reaching optimal values, the results of each step of the GWO are filtered using the WAMF; in other words, the value of the search criteria is adjusted more precisely to increase optimization accuracy. At each step of the algorithm, outlier solutions (wolves far from the target) are filtered through a WAMF with a dynamic window. Algorithm 1 shows the pseudo-code of the improved GWO.

3.3.3 Applying filter at each step of the GWO implementation

As shown in Algorithm 1, a temperature parameter is defined at the beginning of the algorithm, with an initial value of zero and a final value of 1000. The number of wolves, or agents, is set to \(n=25\). The GWO starts by creating a random population of grey wolves (candidate solutions). After assigning random values to the parameters C, a, and A, the fitness of each individual is calculated based on non-dominated sorting, as in the NSGA-II [41], one of the most popular such approaches; the calculated fitness then places each agent into the \(\alpha\), \(\beta\), \(\delta\), or \(\omega\) category based on its value.

Once the set of agents has been specified, the position of each agent is updated at each iteration using Eqs. 3–5 [6].

$$\begin{aligned} \overrightarrow{D_{\alpha }}&= |\overrightarrow{C_{1}}\cdot \overrightarrow{X_{\alpha }}-\overrightarrow{X}|,\overrightarrow{D_{\beta }}=|\overrightarrow{C_{2}}\cdot \overrightarrow{X_{\beta }}-\overrightarrow{X}|,\overrightarrow{D_{\delta }}=|\overrightarrow{C_{3}}\cdot \overrightarrow{X_{\delta }}-\overrightarrow{X}| \end{aligned}$$
(3)
$$\begin{aligned} \overrightarrow{X_{1}}&= \overrightarrow{X_{\alpha }}-\overrightarrow{A_{1}}\cdot \overrightarrow{D_{\alpha }},\overrightarrow{X_{2}}=\overrightarrow{X_{\beta }}-\overrightarrow{A_{2}}\cdot \overrightarrow{D_{\beta }},\overrightarrow{X_{3}}=\overrightarrow{X_{\delta }}-\overrightarrow{A_{3}}\cdot \overrightarrow{D_{\delta }} \end{aligned}$$
(4)
$$\begin{aligned} \overrightarrow{X}\left( t+1\right)&= \frac{\overrightarrow{X_{1}}+\overrightarrow{X_{2}}+\overrightarrow{X_{3}}}{3} \end{aligned}$$
(5)
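For reference, here is a compact numpy sketch of the standard GWO loop with the position update of Eqs. 3–5 written out; the sphere function stands in for the real objective, which in this paper is the SVM error rate.

```python
import numpy as np

def gwo(fitness, dim=10, n_wolves=25, max_iter=500, lb=-10.0, ub=10.0):
    X = np.random.uniform(lb, ub, (n_wolves, dim))             # random initial pack
    for t in range(max_iter):
        idx = np.argsort([fitness(x) for x in X])
        alpha, beta, delta = (X[idx[k]].copy() for k in range(3))  # three best wolves
        a = 2 - 2 * t / max_iter                               # "a" decays linearly 2 -> 0
        for i in range(n_wolves):
            x_new = np.zeros(dim)
            for leader in (alpha, beta, delta):
                r1, r2 = np.random.rand(dim), np.random.rand(dim)
                A, C = 2 * a * r1 - a, 2 * r2                  # coefficient vectors
                D = np.abs(C * leader - X[i])                  # Eq. 3
                x_new += leader - A * D                        # Eq. 4
            X[i] = np.clip(x_new / 3, lb, ub)                  # Eq. 5: mean of X1, X2, X3
    best = min(X, key=fitness)
    return best, fitness(best)

# Example: minimize the sphere function as a stand-in objective.
best_pos, best_val = gwo(lambda x: float(np.sum(x ** 2)))
```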

As seen in the pseudo-code (Algorithm 1), the algorithm enters the filtering stage before updating the parameters C, a, and A. In this step, we first define the parameter temp (the current temperature divided by the final value), the variable Rand (a random number between zero and one), and the variable K (the size of the filter window). In the filtering phase, a probability of being filtered is assigned to each wolf depending on its category: wolves farther from the target are more likely to be chosen. The selection probabilities are as follows:

  • \(P=0.1\) when agent(i) is \(X_{\alpha }\)

  • \(P=0.2\) when agent(i) is \(X_{\beta }\)

  • \(P=0.3\) when agent(i) is \(X_{\delta }\)

  • \(P=0.4\) when agent(i) is \(X_{\omega }\)

If \(P\cdot Rand\le temp\), the selected wolf is eligible for filtering and enters the final step of applying the filter; otherwise, the next wolf is considered. In the final step, a window containing the K nearest neighbors of the selected wolf is formed, with an initial window size of 3. Each neighbor is assigned a weight depending on its category; the weights of each category of wolves, in order of priority, are as follows:

  • \(weight(j) = 4\) when window(j) is \(X_{\alpha }\)

  • \(weight(j) = 3\) when window(j) is \(X_{\beta }\)

  • \(weight(j) = 2\) when window(j) is \(X_{\delta }\)

  • \(weight(j) = 1\) when window(j) is \(X_{\omega }\)

The weights decrease with the rank of the category: alpha wolves receive the highest weight because they are the best expected solutions. The wolves in the window are then weighted and sorted, and Med, the median of the positions of the K nearest neighbors of the selected wolf, is computed.

In the final stage, the fitness of the median agent is calculated. If this value is less than the fitness of the selected wolf, the new position of the selected wolf is computed using Eq. 6, as the average of its old position and Med.

$$\begin{aligned} \mathrm{{New}}\;\mathrm{{position}}=\frac{\left( \mathrm{{Med}}+\mathrm{{Old}}\; \mathrm{{position}}\;\mathrm{{of}}\;\mathrm{{the}}\;\mathrm{{current}} \;\mathrm{{search}}\;\mathrm{{agent}}\right) }{2} \end{aligned}$$
(6)

Then the parameters C, a, and A are updated, the agents' fitness is recalculated and assigned to the \(\alpha\), \(\beta\), and \(\delta\) categories, and the algorithm starts the next iteration. Otherwise, the filter window size is increased by one unit, and the weighting of the neighboring wolves, median selection, and fitness calculation are repeated; if the fitness of the median agent is still not better than that of the selected wolf, \(K=K+1\) again. This operation continues for each selected wolf until \(K=10\).
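Below is a sketch of this filtering step for one selected wolf, under the category probabilities and window weights listed above; the names (`positions`, `ranks`, `fitness`) and the Euclidean-distance choice of nearest neighbors are illustrative assumptions, not the paper's exact implementation.

```python
import numpy as np

P = {"alpha": 0.1, "beta": 0.2, "delta": 0.3, "omega": 0.4}   # filter-selection probabilities
W = {"alpha": 4, "beta": 3, "delta": 2, "omega": 1}           # window weights by category

def wamf_step(i, positions, ranks, fitness, temp, k=3, k_max=10):
    # Wolves farther from the target (higher P) are more likely to be filtered;
    # the temperature controls the overall filtering pressure.
    if P[ranks[i]] * np.random.rand() > temp:
        return positions[i]                                    # wolf not selected this round
    while k <= k_max:
        d = np.linalg.norm(positions - positions[i], axis=1)
        window = np.argsort(d)[1:k + 1]                        # k nearest neighbours (self excluded)
        # Weighted median: repeat each neighbour according to its category weight.
        repeats = [W[ranks[j]] for j in window]
        med = np.median(np.repeat(positions[window], repeats, axis=0), axis=0)
        if fitness(med) < fitness(positions[i]):
            return (med + positions[i]) / 2                    # Eq. 6
        k += 1                                                 # otherwise enlarge the window
    return positions[i]
```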

3.3.4 Preventing the improved GWO from getting stuck in local optima

The filtering operation is controlled by the temperature parameter. Initially, the temperature is 0, which corresponds to very low filtering pressure. As the algorithm runs, the temperature increases, and with it the filtering pressure. In this way, different amounts of filter pressure are applied while the algorithm is running: the algorithm starts filtering at very low pressure (almost zero) and increases the pressure over time. To prevent the algorithm from getting stuck in a local optimum at the beginning of the run, we increase exploration by preserving diversity and allowing wolves to change category. The improved GWO is used to adjust the SVM parameters C and \(\alpha\): the search range for the penalty parameter C is 0.01–3500, and for the parameter \(\alpha\) it is 0.01–32 [42]. The objective function of the improved GWO is defined as follows:

$$\begin{aligned} \mathrm{{Objective}}\;\mathrm{{function}}=\mathrm{{Minimize}} \left( \mathrm{{Error}}\;\mathrm{{rate}}\right) \end{aligned}$$
(7)
Algorithm 1 Pseudo-code of the improved GWO
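To show how Eq. 7 can be wired to the SVM, here is a sketch of a fitness function in which each wolf encodes a candidate \((C, \alpha)\) pair; treating \(\alpha\) as the RBF kernel parameter `gamma` is an assumption on our part.

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

def svm_error_rate(wolf, X, y):
    # wolf = [C, alpha], searched over [0.01, 3500] x [0.01, 32] as in the text.
    C = float(np.clip(wolf[0], 0.01, 3500))
    alpha = float(np.clip(wolf[1], 0.01, 32))
    accuracy = cross_val_score(SVC(C=C, gamma=alpha), X, y, cv=10).mean()
    return 1.0 - accuracy                    # Eq. 7: minimize the error rate
```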

3.3.5 Features selection

The purpose of feature selection techniques is to remove irrelevant and ineffective features from the data, since unrelated features provide no useful information to the classifier. Feature selection techniques are a subset of feature extraction methods: in feature extraction, new features are created as functions of all the original features, while in feature selection a subset of the original features is chosen. Feature selection reduces training and computation time and increases the generalization capability of the classifier. A feature selection algorithm uses a search method to select a subset of features and an evaluation criterion to rank that subset. In the simplest algorithm, all possible feature subsets are examined and the subset with the lowest classification error rate is selected, but such a full search of the feature space has a high computational cost; the GWO is therefore used for feature selection to address this problem. Each feature matters differently in the diagnosis and prediction of diabetes complications; in other words, not all features are of equal value. For example, in the diagnosis of diabetes, body mass index and family history have different importance. Determining the value of each feature and the role it plays in diagnosing the disease is an important issue. In this paper, the value and role of each feature in identifying the various complications is carefully determined by weighting each feature. The feature selection process in the proposed method involves the following steps:

\(\bullet\) A. Producing function

This function generates the candidate sets in the initial population of the GWO for selecting and weighting the features.

\(\bullet\) B. Fitness function

This function evaluates the set of candidate solutions for feature selection and weighting at each stage of the GWO and returns the prediction accuracy as the fitness of each agent.

\(\bullet\) C. Update agent position

Based on the GWO, the position of the agents is updated at each stage.

\(\bullet\) D. Using adaptive middle filter

In order to improve the efficiency of GWO in achieving optimal accuracy in prediction, the results of each step are filtered by WAMF. In other words, the value of the exploration criteria is adjusted more precisely to increase optimization accuracy.

\(\bullet\) E. Termination condition

  1. Reaching an optimal accuracy in predicting diabetes complications.

  2. Completing the predetermined number of iterations of the GWO.

In the proposed method, to determine the value and role of each feature in the diagnosis and prediction of diabetes complications, a random number between 0 and 1 is assigned to each feature, indicating its degree of importance, and is then optimized by the GWO. The weighted feature values are given to the SVM as input, and features are selected based on the final weight of each feature. In the validation section, the error percentage of the proposed method is first compared with that of GWO-SVM, GA-SVM, and PSO-SVM over 500 iterations. Then, the results of the proposed method are compared with those of machine learning algorithms such as DT, SB, and the multilayer perceptron neural network (MLP NN).
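A sketch of the feature-weighting fitness used in this loop: a wolf is a weight vector with one entry in [0, 1] per feature, the features are scaled by these weights before the SVM is trained, and the prediction accuracy is returned as fitness. The final selection threshold below is illustrative, not a value given in the paper.

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

def feature_weight_fitness(weights, X, y):
    # Scale each feature by its candidate importance before classification.
    Xw = X * np.clip(weights, 0.0, 1.0)
    return cross_val_score(SVC(), Xw, y, cv=10).mean()   # prediction accuracy as fitness

def select_features(best_weights, threshold=0.5):
    # Keep features whose optimized weight exceeds the (illustrative) threshold.
    return np.where(np.asarray(best_weights) >= threshold)[0]
```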

4 Experimental results and discussion

Data of 1573 patients are collected from the endocrinology department of Mazandaran University of Medical Sciences. After preprocessing, the data are described, simulated, and analyzed in MATLAB 2016 on an Intel Core i7 processor with a 2.60 GHz CPU and 16 GB of RAM, running Microsoft Windows 8.1. In the proposed method, to determine the value and role of each feature in predicting complications, a weight in the interval [0, 1] indicating its degree of importance is assigned to each feature by the proposed optimization method after 500 iterations. Table 10 shows an example of weighted features for all 8 diabetic complications.

Table 10 An example of weighted features for all 8 diabetic complications

The weight of each feature given as input to the SVM is optimized over 500 iterations. For all listed complications, the data samples were divided into ten subsets: nine were used for training and the remaining one for testing, and this procedure was repeated ten times until each of the ten subsets had been evaluated. Since GWO is a metaheuristic method, it must be run multiple times to obtain the best result; hence, we repeated the whole process of choosing test and training data ten times (each with randomized data sequences). After optimization for each complication, the error percentage obtained at each stage of the proposed algorithm is compared with that of GWO-SVM, GA-SVM, and PSO-SVM. Later, we compare the accuracy of the proposed method with other machine learning algorithms using two different datasets and present the results in the following subsections.

4.1 Prediction of health complications

4.1.1 Increased blood lipids complication

Based on the proposed objective function, the proposed method, GWO, PSO, and GA have been used to improve the performance of the SVM algorithm in predicting the complication of increased blood lipids (hyperlipidemia). In Fig. 1, the vertical axis shows the error percentage in predicting increased blood lipids, and the horizontal axis represents the number of iterations. In the first iterations, since the initial population is random, the error reduction is noticeable; in subsequent iterations the rate of error reduction decreases, and eventually the proposed method achieves the greatest error reduction by the end of the simulation.

Fig. 1
figure 1

Error percentage in predicting increased blood lipids

4.1.2 Eye problem complication

Figure 2 shows the error percentage of the proposed method, GWO, PSO, and GA in improving the performance of the SVM for predicting eye problems. In this figure, the vertical axis represents the error percentage of predicting eye problems and the horizontal axis represents the number of iterations. As can be seen, in the first iterations the proposed method has a higher error percentage than the other methods; after the filtering phase starts, its error percentage drops below that of the PSO and GA, and at iteration 140 it improves over the GWO, a trend that continues until iteration 500. As a result, the proposed method reaches the highest accuracy in predicting eye problems.

Fig. 2
figure 2

Error percentage in predicting eye problem

4.1.3 High blood pressure complication

As can be seen in Fig. 3, the error percentage of predicting high blood pressure in the proposed method is compared to that of the GWO, PSO, and GA. In this figure, the vertical axis represents the error percentage of predicting the high blood pressure complication and the horizontal axis represents the number of iterations. In the early iterations, the error percentage of the PSO and GA is higher than that of the proposed method; the error percentage of the proposed algorithm is then approximately equal to that of the GWO, and this trend continues until the end of the simulation. At the end of the simulation, the proposed method has a lower error than the others, indicating an improved error percentage.

Fig. 3
figure 3

Error percentage of predicting high blood pressure complication

4.1.4 Dialysis history complication

The error percentage for predicting the dialysis complication with the proposed algorithm is compared to that of the GWO, GA, and PSO in Fig. 4. The vertical axis represents the error percentage in predicting dialysis history, and the horizontal axis indicates the number of iterations. In the early iterations, the error percentage of the GWO is higher than that of the proposed method, while that of the GA is lower; the error percentage of the proposed method then becomes approximately equal to that of the GWO and lower than that of the GA. Until iteration 100, the PSO has a lower error percentage than the proposed method; as the iterations increase, its error becomes higher than that of the proposed algorithm but remains lower than the GA's, and this trend persists until the end of the simulation. At the end of the simulation, the proposed method has less error than the others, indicating an improved error percentage.

Fig. 4
figure 4

Error percentage in predicting dialysis history complication

4.1.5 Heart attack complications

Figure 5 shows the error percentage in predicting heart problems using the proposed algorithm compared with the GWO, GA, and PSO. The vertical axis represents the error percentage in predicting heart problems, and the horizontal axis indicates the number of iterations. As can be seen, at the beginning of the simulation the proposed method has the highest error percentage of all the algorithms, but as the iterations increase and the filtering phase executes, its error percentage falls. At the end of the simulation, the proposed method has a lower error than the others, indicating an improved error percentage.

Fig. 5
figure 5

Error percentage in predicting heart attack problems

4.1.6 Stroke complications

As can be seen in Fig. 6, the error percentage in predicting stroke complications in the proposed method is compared to that of the GWO, PSO, and GA. In this figure, the vertical axis represents the error percentage of predicting stroke complications and the horizontal axis represents the number of iterations. In the first iterations, the error percentage of the proposed algorithm is higher than that of the PSO and GA; it then becomes approximately equal to that of the GWO, and this trend continues until the 70th iteration, after which the error percentage of the proposed algorithm decreases. At the end of the simulation, the proposed method has less error than the other methods, indicating an improved error percentage.

Fig. 6
figure 6

Error percentage for predicting stroke complication

4.1.7 Diabetes foot ulcer complication

As can be seen in Fig. 7, the error percentage of predicting the diabetic foot ulcer complication in the proposed method is compared to that of the GWO, PSO, and GA. In this figure, the vertical axis represents the error percentage of predicting diabetic foot ulcer and the horizontal axis represents the number of iterations. In the first iterations, the error percentage of the PSO and GA is higher than that of the proposed algorithm, while until iteration 270 the GWO has a lower error percentage than the proposed algorithm. From then until the last iteration, the proposed algorithm has the lowest error percentage.

Fig. 7
figure 7

Error percentage for predicting diabetic foot ulcer complication

4.1.8 Diabetes coma complication

Figure 8 shows the error percentage of predicting the diabetic coma complication in the proposed method, GWO, GA, and PSO. At the beginning of the simulation, the proposed method has a lower error percentage than the GWO and a higher one than the GA and PSO. Once filtering takes effect, at iteration 280 the error percentage of the proposed algorithm becomes lower than that of the others, and this trend continues until the last iteration of the algorithm.

Fig. 8
figure 8

Error percentage for predicting diabetic coma complication

4.2 Evaluation and comparison of proposed method based on machine learning algorithms

In this section, the proposed model is compared with three machine learning algorithms: DT, SB, and MLP NN. The relationship between the actual classes and the predicted classes can be calculated using the confusion matrix, whose required parameters are described below. According to Eqs. 8–13, the criteria accuracy, sensitivity, specificity, precision, recall, and F-measure are used to compare the proposed model with the others.

$$\begin{aligned} \mathrm{{Accuracy}}&= \frac{\left( \mathrm{{TP}}+\mathrm{{TN}}\right) }{\mathrm{{All}}} \end{aligned}$$
(8)
$$\begin{aligned} \mathrm{{Sensitivity}}&= \frac{\mathrm{{TP}}}{\left( \mathrm{{TP}}+\mathrm{{FN}}\right) } \end{aligned}$$
(9)
$$\begin{aligned} \mathrm{{Specificity}}&= \frac{\mathrm{{TN}}}{\left( \mathrm{{FP}}+\mathrm{{TN}}\right) } \end{aligned}$$
(10)
$$\begin{aligned} \mathrm{{Precision}}&= \frac{\mathrm{{TP}}}{\left( \mathrm{{TP}}+\mathrm{{FP}}\right) } \end{aligned}$$
(11)
$$\begin{aligned} \mathrm{{Recall}}&= \frac{\mathrm{{TP}}}{\left( \mathrm{{TP}}+\mathrm{{FN}}\right) } \end{aligned}$$
(12)
$$\begin{aligned} \mathrm{{F-Measure}}&= \frac{2\cdot \mathrm{{Precision}}\cdot \mathrm{{Recall}}}{\mathrm{{Precision}}+\mathrm{{Recall}}} \end{aligned}$$
(13)
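For completeness, the six criteria computed directly from the binary confusion-matrix counts (a direct transcription of Eqs. 8–13, with the F-measure in its corrected harmonic-mean form):

```python
def metrics(tp, tn, fp, fn):
    accuracy    = (tp + tn) / (tp + tn + fp + fn)        # Eq. 8
    sensitivity = tp / (tp + fn)                          # Eq. 9 (identical to recall, Eq. 12)
    specificity = tn / (fp + tn)                          # Eq. 10
    precision   = tp / (tp + fp)                          # Eq. 11
    f_measure   = 2 * precision * sensitivity / (precision + sensitivity)  # Eq. 13
    return accuracy, sensitivity, specificity, precision, f_measure
```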

Table 11 shows the results of predicting diabetes complications with the DT, SB, and MLP NN techniques in terms of accuracy. The complications of diabetes include increased blood lipids, eye problems, high blood pressure, dialysis history, heart problems, stroke, diabetic foot ulcer, and diabetic coma. For all the mentioned complications, the accuracy of the proposed method is higher than that of the DT, SB, and MLP NN (Fig. 9).

Table 11 Predicting diabetes complications by accuracy criterion
Fig. 9
figure 9

Comparison of predicting diabetes complications with accuracy criterion

4.2.1 Increased blood lipids complication

Table 12 and Fig. 10 compare the proposed method, MLP NN, SB, and DT in predicting the increased blood lipids complication. Evaluation based on the sensitivity, specificity, precision, recall, and F-measure criteria illustrates the superiority of classification with weighted features over classification with equally weighted features.

Table 12 Comparison of predicting increased blood lipids complication
Fig. 10
figure 10

Comparison of predicting increased blood lipids complication

4.2.2 Eye problem complication

Table 13 and Fig. 11 compare the proposed method, MLP NN, SB, and DT in predicting eye problem complications. Comparison of the results based on the F-measure, sensitivity, specificity, precision, and recall criteria indicates the superiority of the proposed algorithm over the compared ones.

Table 13 Comparison of predicting eye problem complication
Fig. 11
figure 11

Comparison of predicting eye problem complication

4.2.3 High blood pressure complication

A comparison of predicting this complication with the proposed algorithm, DT, SB, and MLP NN based on the sensitivity, specificity, precision, recall, and F-measure criteria is shown in Table 14 and Fig. 12. The results show the improvement of classification with weighted features over classification with equally weighted features.

Table 14 Comparison of predicting high blood pressure complication
Fig. 12
figure 12

Comparison of predicting high-blood-pressure complication

4.2.4 Dialysis history complication

The performance of weighted-features-based classification compared to modeling with equally weighted features in predicting the dialysis history complication is evaluated in Table 15 and Fig. 13. The comparison confirms the superiority of the proposed method over DT, SB, and MLP NN in terms of the sensitivity, specificity, precision, recall, and F-measure criteria.

Table 15 Comparison of predicting dialysis history complication
Fig. 13
figure 13

Comparison of predicting dialysis history complication

4.2.5 Heart attack complications

The performance of weighted-features-based classification compared to modeling with equally weighted features in predicting heart attack complications, based on the sensitivity, specificity, precision, recall, and F-measure criteria, is evaluated in Table 16 and Fig. 14. The proposed method yields better results in the diagnosis of heart attack complications than the DT, SB, and MLP NN methods.

Table 16 Comparison of predicting heart attack complication
Fig. 14
figure 14

Comparison of predicting heart attack complications

4.2.6 Stroke complications

The performance of weighted-features-based classification compared to modeling with equally weighted features in predicting stroke complications is evaluated in Table 17 and Fig. 15. The proposed method yields better results in the diagnosis of stroke complications than the DT, SB, and MLP NN methods based on the sensitivity, specificity, precision, recall, and F-measure criteria.

Table 17 Comparison of predicting stroke complication
Fig. 15
figure 15

Comparison of predicting stroke complication

4.2.7 Diabetes foot ulcer complication

The sensitivity, specificity, precision, recall, and F-measure criteria for evaluating the prediction of diabetic foot ulcer complications indicate the superiority of the proposed classification method over MLP NN, SB, and DT, as shown in Table 18 and Fig. 16.

Table 18 Comparison of predicting diabetes foot ulcer complication
Fig. 16
figure 16

Comparison of predicting diabetic foot ulcer complication

4.2.8 Diabetes coma complication

As shown in Table 19 and Fig. 17, the sensitivity, specificity, precision, recall, and F-measure criteria have been evaluated to compare the prediction of the diabetes coma complication. The results indicate the superiority of the proposed classification method over the MLP NN, SB, and DT methods.

Table 19 Comparison of predicting diabetes coma complication
Fig. 17
figure 17

Comparison of predicting diabetes coma complication

Predicting and correctly diagnosing diabetes complications using AI and machine learning increases the chances of successful treatment. In this study, a middle filter is used to improve the GWO algorithm, introducing a new model for predicting and diagnosing diabetes complications. The simulation results show that the proposed model is more accurate than SB, DT, and MLP NN; this high accuracy in diagnosing the complications of diabetes indicates the superiority of the proposed method. Complexity and time-consuming execution are its weaknesses.

4.3 Experimental evaluation on UCI dataset

To compare the proposed method with related methods in this domain, we use the UCI Machine Learning Repository dataset [21]. The diabetes files in this dataset consist of four fields per record: date, time, code, and value. As shown in Table 20, the code field is an integer describing the status of a patient. The UCI dataset contains 70 text files, each holding one patient's disease history. The patients are insulin deficient; this disease is manifested by many so-called metabolic effects, the main one being high blood glucose, which can be detected by measurements.

Table 20 Description of the code field in UCI dataset [21]

In this experiment, we compare the accuracy of previous works with that of our proposed method on the UCI Machine Learning Repository dataset. Previous works [15, 16, 21] used fuzzy methods to classify diabetic patients; [15] also combined a fuzzy method with PSO and reported results on the UCI dataset. In all listed methods, the data samples were divided into ten subsets, where nine were used for training and the remaining one for testing; this procedure was repeated ten times until each of the ten subsets had been evaluated. Since PSO and GWO are metaheuristic methods, they need to be run multiple times to obtain the best result. Hence, we repeated the whole process of choosing test and training data ten times (each with randomized data sequences) and list the averaged results in Table 21.

Table 21 Comparison of classification accuracy of different methods on UCI dataset [21]

The results clearly show a boost in the accuracy of the classification system, since we use SVM combined with GWO, which yields better optimization results than PSO.

5 Conclusion

Diabetes is one of the most common chronic diseases and a major public health problem worldwide. It is a rapidly growing and serious chronic disease, and its prevalence has been increasing in Asian countries. The increasing prevalence of diabetes mellitus (DM), the emergence of its complications as a cause of death and early disability, and the burden on healthcare systems have made it a health priority. Diabetes profoundly affects quality of life physically, socially, and mentally; studies have shown that it can have negative effects on public health and on the sense of well-being, in other words on quality of life. Diabetes cannot be cured, but it can be controlled, and controlling diabetes means preventing and delaying its complications. Poor control leads to chronically elevated blood sugar levels, which are strongly linked to late complications such as retinopathy, nephropathy, and cardiovascular disease; these complications are associated with high healthcare costs and reduced quality of life. In this paper, the grey wolf optimizer (GWO) is used to address the diabetes diagnosis problem. The key point of the proposed algorithm is its accuracy: the improved GWO used together with the SVM raises the accuracy to an acceptable level compared with other classification algorithms. The proposed method is superior to DT, SB, and MLP NN in predicting increased blood lipids with an accuracy of 0.96, eye problems with 0.94, high blood pressure with 0.92, dialysis history with 0.97, heart problems with 0.95, stroke with 0.96, diabetic foot ulcer with 0.96, and diabetic coma with 0.97. The high accuracy in diagnosing diabetes complications indicates the superiority of the proposed method in improving the results of the GWO. We also compared the proposed method with other classifiers on the UCI dataset, where it likewise shows an advantage over fuzzy-based classifiers. Complexity and time-consuming execution are the weaknesses of this method; in future research, we aim to reduce the time complexity by making changes to the GWO method.