Development and validation of predictive model based on deep learning method for classification of dyslipidemia in Chinese medicine

Liu, Jinlei; Dan, Wenchao; Liu, Xudong; Zhong, Xiaoxue; Chen, Cheng; He, Qingyong; Wang, Jie

doi:10.1007/s13755-023-00215-0

Development and validation of predictive model based on deep learning method for classification of dyslipidemia in Chinese medicine

Research
Open access
Published: 06 April 2023

Volume 11, article number 21, (2023)
Cite this article

Download PDF

You have full access to this open access article

Health Information Science and Systems Aims and scope Submit manuscript

Development and validation of predictive model based on deep learning method for classification of dyslipidemia in Chinese medicine

Download PDF

Jinlei Liu¹,
Wenchao Dan^2,3,
Xudong Liu³,
Xiaoxue Zhong³,
Cheng Chen⁴,
Qingyong He¹ &
…
Jie Wang¹

3049 Accesses
6 Citations
Explore all metrics

Abstract

Backgrounds

Dyslipidemia is a prominent risk factor for cardiovascular diseases and one of the primary independent modifiable factors of diabetes and stroke. Statins can significantly improve the prognosis of dyslipidemia, but its side effects cannot be ignored. Traditional Chinese Medicine (TCM) has been used in clinical practice for more than 2000 years in China and has certain traits in treating dyslipidemia with little side effect. Previous research has shown that Mutual Obstruction of Phlegm and Stasis (MOPS) is the most common dyslipidemia type classified in TCM. However, how to compose diagnostic factors in TCM into diagnostic rules relies heavily on the doctor's experience, falling short in standardization and objectiveness. This is a limit for TCM to play its advantages of treating dyslipidemia with MOPS.

Methods

In this study, the syndrome diagnosis in TCM was transformed into the prediction and classification problem in artificial intelligence The deep learning method was employed to build the classification prediction models for dyslipidemia. The models were built and trained with a large amount of multi-centered clinical data on MOPS. The optimal model was screened out by evaluating the performance of prediction models through loss, accuracy, precision, recall, confusion matrix, PR and ROC curve (including AUC).

Results

A total of 20 models were constructed through the deep learning method. All of them performed well in the prediction of dyslipidemia with MOPS. The model-11 is the optimal model. The evaluation indicators of model-11 are as follows: The true positive (TP), false positive (FP), true negative (TN) and false negative (FN) are 51, 15, 129, and 9, respectively. The loss is 0.3241, accuracy is 0.8672, precision is 0.7138, recall is 0.8286, and the AUC is 0.9268. After screening through 89 diagnostic factors of TCM, we identified 36 significant diagnosis factors for dyslipidemia with MOPS. The most outstanding diagnostic factors from the importance were dark purple tongue, slippery pulse and slimy fur, etc.

Conclusions

This study successfully developed a well-performing classification prediction model for dyslipidemia with MOPS, transforming the syndrome diagnosis problem in TCM into a prediction and classification problem in artificial intelligence. Patients with dyslipidemia of MOPS can be accurately recognized through limited information from patients. We also screened out significant diagnostic factors for composing diagnostic rules of dyslipidemia with MOPS. The study is an avant-garde attempt at introducing the deep-learning method into the research of TCM, which provides a useful reference for the extension of deep learning method to other diseases and the construction of disease diagnosis model in TCM, contributing to the standardization and objectiveness of TCM diagnosis.

AI-based prediction for the risk of coronary heart disease among patients with type 2 diabetes mellitus

Article Open access 02 September 2020

A machine intelligence technique for predicting cardiovascular disease (CVD) using Radiology Dataset

Article 06 June 2022

Application of Deep Learning Methods in the Diagnosis of Coronary Heart Disease Based on Electronic Health Record

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Dyslipidemia is a medical condition that refers to an abnormal level of lipid metabolism, including high levels of total cholesterol (TC), Triglyceride (TG), and low-density lipoprotein cholesterol (LDL-C) and low levels of high-density lipoprotein cholesterol (HDL-C) [1, 2]. Dyslipidemia is recognized as a prominent risk factor for cardiovascular diseases (CVD) [3,4,5,6] and one of the primary independent modifiable factors of diabetes [7] and stroke [8]. Clinical observations and epidemiological studies have shown that dyslipidemia increases the risk of cardiovascular disease events [9,10,11]. Therefore, the effective prevention and treatment of dyslipidemia are of great importance [12].

Statins can significantly improve the prognosis of dyslipidemia. However, their side effects, including liver dysfunction [13], statin-induced myopathy [14], high creatinine level, high creatine kinase level, cannot be ignored. Hence, effective treatment for dyslipidemia with little side effect has become a major field of interest [15].

Traditional Chinese medicine (TCM) has been used in clinical practice for more than 2000 years in China. Clinical and lab research showed that TCM has certain traits and some strengths in treating dyslipidemia [16,17,18]. In diagnosing, a TCM doctor would first determine the syndrome (That's the classification or pattern of diseases in TCM), of the patient based on factors for diagnosis such as symptoms, tongue and pulse manifestations, etc., and then treat according to the syndrome (classification). The essence of TCM diagnosis is a classification problem. Previous research [19, 20] has shown that mutual obstruction of phlegm and stasis (MOPS) is the most common dyslipidemia classification. However, Which factors for diagnosis should be used as diagnostic rules and how to quantify their importance all depend on the doctor's personal experience, lacking of unified standards and verification on large sample data. Thus, we hope to develop a new type of prediction tool based on the factors that can diagnose objectively without relying on personal experience.

In recent years, great progress has been achieved in applying deep-learning in medical research. As an approach to deep learning, the artificial neural network (ANN) is a highly parameterized, non-linear model [21]. It can approximate observed outcomes with minimal error [22], or approximate any continuous function [23], and support non-linear complex classification problems. ANN has been widely used in medical research, such as providing decision support in cancer [24], predicting of clinical deterioration in adult patients with hematologic malignancies [25], and tumor biology [26, 27], etc.

As of now, there is no similar research on the classification and prediction model for dyslipidemia in TCM. In this study, we try to use the deep learning frameworks TensorFlow [28, 29] and Keras [30, 31] to train and construct an effective predictive model for dyslipidemia with MOPS. The TCM diagnosis process was converted into a process of multi-factor classification. A large amount of clinical data was used to develop the models. Research Flow for data processing and construction of the predictive model is illustrated in Fig. 1. Standardized and objective diagnostic patterns buried in data were then revealed, paving the way for more efficient TCM treatment of dyslipidemia (See Fig. 2)

Methods and materials

Source of data

This study was based on a cross-sectional data collection. The data came from 1,019 cases of confirmed dyslipidemia collected from 2013 to 2016 by three hospitals, namely Guang'anmen Hospital of Chinese Academy of Traditional Chinese Medicine, Beijing Traditional Chinese Medicine Hospital Affiliated to Capital Medical University and Dongzhimen Hospital of Beijing University of Chinese Medicine. All patients signed informed consent.

Diagnostic criteria

The diagnostic criteria of dyslipidemia in this study are based on the Guidelines for the Prevention and Treatment of Dyslipidemia in Chinese Adults (2016 Revision) [32], that is, TC ≥ 6.2 mmol /L and / or TG ≥ 2.3 mmol /L and / or LDL-C ≥ 4.1 mmol /L and / or HDL-C < 1.0 mmol /L.

Diagnostic criteria of TCM in this study are based on the differentiation standard of dyslipidemia with MOPS in the Clinical Diagnosis and Treatment Terminology of Traditional Chinese Medicine—Syndrome Part [33]. The study referred to the Terms of Traditional Chinese Medicine [34] for standardization for syndromes that are unclear in the above documents.

Inclusion and exclusion criteria

Inclusion criteria: (1) The patient meets the diagnostic criteria of the modern clinic and the syndrome differentiation criteria of TCM; (2) The patient is physically and mentally stable; (3) The patient is between 20 and 90 years old; (4) The patient agrees to sign informed consent.

Exclusion criteria: (1) Patients with secondary hyperlipidemia; (2) Patients with chronic consumptive diseases such as malignant tumors and tuberculosis; (3) Patients with serious heart, liver, kidney, hematopoietic system and other primary diseases; (4) Patients with recent surgery or trauma; (5) Patients with mental diseases; (6) Patients who have recently taken hypolipidemic drugs; (7) Patients with other metabolic abnormalities; (8) Patients whose observation data are incomplete and may affect the result evaluation.

Syndrome type distribution

Two cardiovascular experts with senior professional titles were responsible for determining the syndrome of the patients based on the disease and syndrome differentiation standard of TCM adopted in this study. A total of 1019 cases of dyslipidemia were included, of which 255 cases were identified as the syndrome of MOPS, accounting for 25.02%. The remaining 764 cases were non syndrome of MOPS, representing 74.98 percent of all cases. Among them were 96 cases of Yin blood deficiency syndrome, 90 cases of Qi deficiency and blood stasis syndrome, 76 cases of phlegm stagnation syndrome, 69 cases of spleen deficiency and Qi stagnation syndrome.

Data preprocessing

We used the EpiData 3.1 software to create a database after preliminarily sorting out the basic information of the patients' clinical medical records, data on diagnostic factors in TCM and data on the syndrome differentiation and classification of diseases. Two doctors input the data independently on two computers to reduce data error. TCM symptom terms of the study were standardized according to the Study on Standardization Status of TCM Terms [35]. Value 1 was assigned to any reported symptom and value 0 is assigned to symptoms that did not appear. Invalid data were cleared out and cases with incomplete records were removed so that a total of 89 diagnostic factors in TCM without missing items were identified, including chest tightness, wheezing, dark purple tongue, etc. Then we used the R package "stats" (version 3.6.3)[36, 37] to run a multiple linear regression analysis on these 89 diagnostic factors to rank them for feature importance and screen out 36 important factors. These factors were then used as input parameters for the following prediction models.

The architecture of the models

The fully connected ANN for all models was built based on TensorFlow and Keras, which are popular deep learning frameworks. Considering that a large number of parameter inputs may cause the model diagram automatically drawn by the package of R not to be fully displayed, schematic diagrams were used to describe the architecture of the models in this paper.

Parameters setting of the models

The activation function of the hidden layer is “ReLU” [38] and the activation function of the output layer is “sigmoid” [39]. The weights of the neural network were determined by minimizing the loss value through gradient descent in a process called standard back propagation [40]. The gradient descent algorithm employed was "Adam" [41], and the learning ratio was set to 0.001. Binary cross_entropy (BCE) [42] was set as the loss function (Formula 1), which is often used for binary classification problems [43].

$$Loss = - \frac{1}{output\_size}\mathop \sum \nolimits_{i = 1}^{output\_size} y_{i} \cdot {\text{log}}\hat{y}_{i} + \left( {1 - y_{i} } \right) \cdot {\text{log}}(1 - \hat{y}_{i} )$$

(1)

The “sklearn” [44], a random number generator in python, was used to randomly divide the data set of 1019 patients. 60% of the data (611 cases) were distributed to the training set, 20% of the data (204 cases) were distributed to the validation set, and 20% of the data (204 cases) were distributed to the test set. The training set optimization was used to tune model parameters. The validation set was used to test the model during training. The test set was used to test the model after training to evaluate the accuracy of the model.

The model.fit function (https://www.tensorflow.org/guide/keras/customizing_what_happens_in_fit) in TensorFlow was used to train models. The parameters were set as follows.

The class weighting scheme [45] is introduced in the training process for handling the imbalanced data. Tensorflow offers a parameter called class_weight in model.fit function that allows to specify the weights for each of the target classes to ensure the predictive performance for imbalanced data. The batch_size in training was set to 128, and the training Epoch was set to 50. To prevent overfitting, the "early stop" method was used, i.e., the training would stop when the accuracy of the verification set did not increase for ten consecutive trainings.

Data statistics of the models

The performance of the constructed model was evaluated by calculating the performance of the models on the test set. The statistical indicators were the number of true positive samples (TP), false positive samples (FP), true negative samples (TN), false negative samples (FN), loss, accuracy, precision and recall. Based on the above data, the confusion matrixes, PR and ROC curves were drawn, and the value of PRC (area under PR curve) and AUC (area under ROC curve) were calculated.

Calculating the factor importance of the diagnostic factors

The significance of the 36 factors that had been previously filtered out was calculated and visualized analysis was conducted using the Permutation feature importance (PFI) method[46, 47]. The R packages "pheatmap" [48] and "RColorBrewer" [49] were used to normalize the values and plot heat and cluster diagram for factor significance.

Results

Screening of the diagnostic factors in TCM

As mentioned before, we ran a multiple linear regression analysis on these 89 diagnostic factors in TCM with the R package “stats”, screening out 36 significant factors. These 36 factors were used to train and develop the models.

Baseline situation of base variables and screened diagnostic factors

The mean age of participants at baseline was 64.4 years, and 525 (51.52%) were males. There were 525 males (51.52%) and 494 females (48.48%). The male to female ratio is 1.06:1. 403 (39.55%) had a smoking history, 272 (26.93%) had a drinking history, 640 (63.94%) had a family history, 680(66.80%) had coronary heart disease and 795(78.10%) had hypertension. 445(43.71%) were complicated with diabetes, 392(38.55%) were complicated with cerebral infarction. See Table 1 for details (Table 1: Baseline situation of the 1019 cases with dyslipidemia). (In Table 1, the variable column for smoking has values ranging from 0 to 6. 0 represents smoking within 6 months, 1 represents smoking within 6 months to 5 years, 2 represents smoking within 5–10 years, 3 represents smoking within 10 to 15 years, 4 represents smoking within 15–20 years, 5 represents smoking for over 20 years, and 6 represents an unspecified time period.)

Table 1 Baseline situation

Full size table

Model construction for the prediction of dyslipidemia with MOPS

The number of neurons of each model can be seen on Table 2. We calculated performance indicators of the models such as accuracy, precision, recall, AUC, PRC and time cost for ten training sessions (timeCost10). Each model was trained ten times with data set randomly divided. The above indicators of each model were averaged to evaluate the impact of the number of hidden layers and neurons on the models’ performance, as shown in Fig. 3.

Table 2 Structure of the predictive models. The hidden layers of the models were set to 0, 1, 2, 3, and 4, respectively, and the number of neurons in each hidden layer was set to 0, 8, 16, 32, 64, 128, et al.

Full size table

As is shown in Fig. 3, from model-1 to model-5, accuracy, precision, recall, auc, and prc kept increasing while loss kept decreasing. And from model-5 to model-20, these indicators stayed steady. Time cost for ten trainings mostly remained steady across models. From model-1 to model-5, the number of hidden layers of the model increases from 0 to 1, and the number of neurons in the hidden layer also increases, which indicates that the model with 0 hidden layers does not predict as well as the model with 1 hidden layer, and within a certain range of the number of neurons in the hidden layer, the increase in the number of neurons helps to improve the prediction of the model. However, as the number of hidden layers and hidden layer neurons continues to increase, the performance of the model does not continue to improve but remains at a high level and does not increase. In general, for model training using TensorFlow, the training sample size needs to be adapted to the number of hidden layers and neurons, and it is not advisable to use complex neural network structures for training small samples, and similarly, it is not advisable to use simple neural network structures for training large samples. Figure 3 shows that models 5–20 are more appropriate with the sample size of 1019 cases used in this study. So we will evaluate and select the optimal model from model-5 to model-20.

Performance analysis of the prediction models for dyslipidemia with MOPS

The models were divided into five groups based on their number of hidden layers (0, 1, 2, 3, 4). Then we selected the model with the highest precision in each group: model-1, model-8, model-11, model-16 and model-18. Using the training data with optimal loss value, we evaluated the performance of the five models in the test set in 50 Epochs of training.

We calculated the confusion matrix of each model, as shown in Fig. 4. The sum of TN and TP samples meant the number of correct predictions by the model. Model-1, with 0 hidden layers, had 143 TN and TP samples (Fig. 4a). Model-8, with 1 hidden layer, had 180 TN and TP samples (Fig. 4b). Model-11, with 2 hidden layers, had 180 TN and TP samples (Fig. 4c). Model-16, with 3 hidden layers, also had 180 TN and TP samples (Fig. 4d). Model-18, with 4 hidden layers had 178 TN and TP samples (Fig. 4e). These results indicate that models with at least one hidden layer have better prediction performance than those with 0 hidden layer.

We plotted PR curves and ROC curves to evaluate the prediction performance of the models. As shown in Fig. 5a, b, the closer the PR curve is to the right, the better the model's performance; the closer the ROC curve is to the left, the better the model's performance [50]. Furthermore, the discrimination ability of the models was evaluated by the area under the curve of the PR and ROC. PRC is the area under the PR curve. AUC is the area under the ROC curve. When the values of PRC and AUC are greater than 0.5, it means the model performs well. The closer these values are to 1, the better the model's performance [50, 51]. The PRC and AUC values show that all the five models performed well, while Model-11 performed the best. In summary, all the five models had good prediction performance. Among them, those with multiple hidden layers performed better than those with 0 hidden layer. And Model-11 was optimal in performance.

The significance analysis of diagnostic factors for dyslipidemia with MOPS

In this study, the above five models were used to evaluate the significance of diagnostic factors for dyslipidemia with MOPS. We calculated the significance values of the diagnostic factors with PFI. The results are shown in Fig. 6. From the heat diagram, it can be seen that diagnostic factors, such as dark purple tongue, slippery pulse, slimy fur, expectoration, petechiae of the tongue, were of high significance. From the hierarchical cluster diagram, it can be seen that dark purple tongue, slippery pulse, slimy fur, expectoration and so on were on the higher hierarchy, indicating greater importance. According to this, we believe that standardized and objective diagnostic rules for dyslipidemia with MOPS can be constructed based on dark purple tongue, slippery pulse, slimy fur, expectoration, petechiae of the tongue.

It is worth mentioning that in the clinical diagnosis scale of MOPS compiled by Fang Ge et al. [52], dark purple tongue, slippery pulse and slimy fur are also in the high-frequency vocabulary of this syndrome. It means that from the perspective of clinical observation, dark purple tongue, slippery pulse and slimy fur are also of great significance in the differentiation and classification of dyslipidemia with MOPS, which is highly consistent with the prediction results of the models.

Discussion

Dyslipidemia has drawn extensive attention due to its important clinical significance. There is no disease term for dyslipidemia in TCM, but it is currently considered to be close to the concepts of “phlegm turbidity” and “cream fat” in TCM. Although dyslipidemia has a variety of classification in TCM, its basic pathological characteristics are phlegm and blood stasis as manifestation, deficiency as the root cause, and mutual obstruction of phlegm and blood stasis as the problem [19]. This description is also consistent with this study's finding that MOPS is the most frequent among all the classifications.

The process of diagnosis in TCM is essentially a classification problem. However, TCM diagnosis relies heavily on the doctors' experience, which is highly subjective. Standardized diagnostic rules cannot be formed, which makes it difficult to standardize and promote the characteristics of TCM diagnosis and treatment.

This study used deep learning to train and construct a prediction model of dyslipidemia based on the clinical data in TCM, so as to confirm the feasibility of inputting the diagnostic factors in TCM (such as dark purple tongue, chest tightness, etc.) into the model to predict whether the patient has dyslipidemia with MOPS, simulating the process of clinical syndrome classification of dyslipidemia in TCM.

One advantage of this model is that it can help solve the problem of the lack of objectivity of TCM in clinical diagnosis. Another significant advantage of this model is the high accuracy, which will reduce the workload caused by clinical misdiagnosis. In addition, the model efficiently uncovers and utilizes the hidden rules and patterns buried in a large amount of clinical data so that standardized and objective diagnostic rules for dyslipidemia with MOPS can be constructed.

To sum up, in this study, we constructed prediction models for dyslipidemia with MOPS through deep learning method with a large amount of multi-centered clinical data. We further evaluated the performance of the models. Results of the study show that the models performed well in predicting dyslipidemia with MOPS, and the model-11 is the optimal model. In the meantime, diagnostic factors in TCM, such as dark purple tongue, slippery pulse and slimy fur, were screened out as significant factors and diagnostic rules for the diagnosis of MOPS. The study is an avant-garde attempt at introducing the deep-learning method into the research of TCM, contributing to the standardization and objectiveness of TCM diagnosis for dyslipidemia.

Strengths and limitations

As far as we know, this study is the first to use a diagnostic model based on deep learning to predict whether patients have dyslipidemia with MOPS, so as to guide the corresponding treatment in TCM. Although this is just a small step ahead, it reveals the great potential in applying the deep learning method to clinical data mining and diagnosis, which will help clinicians reduce subjectivity and improve stability in clinical diagnosis.

Unlike traditional linear models, the prediction model based on deep learning is a nonlinear “black box”. Although the “black box” can provide more accurate prediction results, its opaqueness and lack of clinical interpretability may lead to some restrictions to applying the deep learning method. In addition, due to personnel and funds limitations, we cannot collect more data for external verification. In the next research, we plan to carry out clinical data collection of larger samples to verify further, improve and modify our prediction model for even stronger prediction performance.

Conclusions

This study proved the feasibility of constructing a diagnostic prediction model based on the deep learning method to predict whether patients have dyslipidemia with MOPS to guide the corresponding TCM treatment. The model-11 is the optimal model, with a high level of accuracy, and it provides clinicians with a more objective and stable guide for diagnosing and treating dyslipidemia with MOPS in TCM.

Data availability

The data and materials of this study will be made available by the corresponding author upon a reasonable request.

References

Pirillo A, Casula M, Olmastroni E, et al. Global epidemiology of dyslipidaemias. Nat Rev Cardiol. 2021;18:689–700.
Google Scholar
Zhu J, Z.Y., Wu Y, et al. Obesity and dyslipidemia in Chinese adults: a cross-sectional study in Shanghai China. Nutrients. 2022;14(11):2321.
Google Scholar
Nagasawa S-y, Okamura T, Iso H, Tamakoshi A, Yamada M, Watanabe M, Murakami Y, Miura K, Ueshima H. Evidence for cardiovascular prevention from observational cohorts in Japan research group relation between serum total cholesterol level and cardiovascular disease stratified by sex and age group: a pooled analysis of 65 594 individuals from 10 cohort studies in Japan. J Am Heart Assoc. 2012;1:e001974.
Google Scholar
Di Angelantonio E, Pennells L, Kaptoge S, Caslake M, Thompson A, Butterworth AS, Sarwar N, Wormser D, Saleheen D, et al. Lipid-related markers and cardiovascular disease prediction. JAMA-J Am Med Assoc. 2012;307:2499–506.
Google Scholar
Turgeon RD, A.T.J., Gregoire J., Pearson G.J. Guidelines for the management of dyslipidemia and the prevention of cardiovascular disease in adults by pharmacists. Can Pharm J. 2016;2017(150):243–50.
Google Scholar
Benjamin EJ, et al. Executive summary: heart disease and stroke statistics–2016 update: a report from the american heart association. Circulation. 2016;133(4):447–54.
Google Scholar
Katulanda P, Dissanayake HA, Neomal De Silva SD, Katulanda GW, Liyanage IK, Constantine GR, Sheriff R, Matthews DR. Prevalence, patterns, and associations of dyslipidemia among Sri Lankan adults-Sri Lanka diabetes and cardiovascular study in 2005–2006. J Clin Lipidol. 2018;2018(12):447–54.
Google Scholar
Lee JS, Chang P-Y, Zhang Y, Kizer JR, Best LG, Howard BV. Triglyceride and HDL-C dyslipidemia and risks of coronary heart disease and ischemic stroke by glycemic dysregulation status: the strong heart study. Diabetes Care. 2017;2017(40):529–37.
Google Scholar
Anderson KM, Levy D. Cholesterol and mortality 30 years of follow-up from the Framingham study. JAMA. 1987;257(16):2176–80.
Google Scholar
Nicholls S. The emerging role of lipoproteins in atherogenesis: beyond LDL cholesterol. Semin Vasc Med. 2004;4(2):187–95.
MathSciNet Google Scholar
Kopin L. Dyslipidemia. Ann Intern Med. 2017;167(11):ITC81–96.
Google Scholar
Lu Y, Lu J, Ding Q, Li X, Wang X, Sun D, Tan L, Mu L, Liu J, et al. Prevalence of dyslipidemia and availability of lipid-lowering medications among primary health care settings in China. JAMA Netw Open. 2021;4:e2127573.
Google Scholar
Cai T, Langford O, et al. Associations between statins and adverse events in primary prevention of cardiovascular disease: systematic review with pairwise, network, and dose-response meta-analyses. BMJ. 2021;374:n1537.
Google Scholar
Mollazadeh H, Fanni G, et al. Effects of statins on mitochondrial pathways. J Cachexia Sarcopenia Muscle. 2021;12(2):237–51.
Google Scholar
Guo M, Gao ZY, Shi DZ. Chinese herbal medicine on dyslipidemia: progress and perspective. Evid Based Complement Alternat Med. 2014;2014:163036.
Google Scholar
Chen J, Yang Z, Wang T, Xu B, Li P, Zhang S, Xue X. Study on the effect of macrophages on vascular endothelium in mice with different tcm syndromes of dyslipidemia and its biological basis based on RNA-Seq Technology. Front Pharmacol. 2021;12:665635.
Google Scholar
JG J. The clinical study progress of Chinese herbal medicine for hyperlipidemia. Journal of Practical Traditional Chinese Medicine. 2008;24(9):614–5 ((In Chinese)).
Google Scholar
Shi HX, Q L. Research progress of traditional Chinese medicine treatment of hyperlipidemia. J Med Forum. 2007;28(10):123–4.
Google Scholar
Gao J-L. Study on syndrome elements and targets of dyslipidemia based on 19877 cases of literature[J]. China J Tradit Chin Med Pharm. 2018;02:605–7 ((In Chinese)).
Google Scholar
Hai-fang Wu, Liu X-D, et al. Regularity study of traditional Chinese medicine syndrome distribution in 1 019 cases patients with dyslipidemia[J]. China J Tradit Chin Med Pharm. 2018;06:2672–5 ((In Chinese)).
Google Scholar
Stefaniak B, Tarkowska A. Algorithms of artificial neural networks - practical application in medical science. Pol Merkur Lekarski. 2005;19:819–22.
Google Scholar
Hornik K. Approximation capabilities of multilayer feedforward networks. Neural Netw. 1991;4:251–7.
Google Scholar
Zhang Z, Beck MW, Winkler DA, Huang B, Sibanda W. Hemant goyal opening the black box of neural networks: methods for interpreting neural network models in clinical applications. Ann Transl Med. 2018;6(11):216.
Google Scholar
Lisboa PJ, Taktak AFG. The use of artificial neural networks in decision support in cancer: a systematic review. Neural Netw. 2006;19(4):408–15.
Google Scholar
Hu SB, Wong DJL, Correa A, Li N, Deng JC. Prediction of clinical deterioration in hospitalized adult patients with hematologic malignancies using a neural network model. PLoS ONE. 2016;11(8):e0161401.
Google Scholar
Motta S, Pappalardo F. Mathematical modeling of biological systems. Brief Bioinform. 2013;14(4):411–22.
Google Scholar
Pappalardo F, Palladini A, Pennisi M, Castiglione F, Motta S. Mathematical and computational models in tumor immunology. Math Model Nat Phenom. 2012;7(3):186–203.
MathSciNet Google Scholar
Rampasek L, Goldenberg A. TensorFlow: Biology’s Gateway to Deep Learning? Cell Syst. 2016;2(1):12–4.
Google Scholar
Developers, T. TensorFlow (v2.8.2). 2022; Available from: https://github.com/tensorflow/tensorflow.
Chollet, F., Keras. 2015, GitHub: GitHub repository.
Saleh H, Alyami H, Alosaimi W. Predicting breast cancer based on optimized deep learning approach. Comput Intell Neurosci. 2022;2022:1820777.
Google Scholar
Zhu J-R, Shui-ping Zhao R-lG, et al. Guidelines for the prevention and treatment of dyslipidemia in Chinese Adults (Revised 2016) [J]. Chinese Circ J. 2016;10:937–53 ((In Chinese)).
Google Scholar
GB/T16751.2–1997., Clinical Terminology in Chinese Medicine - syndrome part. Beijing: National Standards Press, 1997: p. (In Chinese).
Committee, C.m.t.e.a.A., Terms of traditional Chinese Medicine. Beijing: Science Press, 2004, 56–124.
Zhu J. Study on Standardization Status of Traditional Chinese Medicine Terms[M]. Beijing: Ancient Chinese Medical Book Press; 2016.
Google Scholar
Team, R.C. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing 2020; Available from: https://www.R-project.org/.
Chambers, J.M., Hastie, T. J. Statistical Models in S. Wadsworth & Brooks/Cole 1992.
Agarap, A.F., Deep learning using rectified linear units (relu). arXiv preprint arXiv:1803.08375, 2018. 31.
Han J, Moraga C. The influence of the sigmoid function parameters on the speed of backpropagation learning in From Natural to Artificial Neural Computation. Berlin: Springer; 1995.
Google Scholar
Penny W, Frost D. Neural networks in clinical medicine. Med Decis Making. 1996;16(4):386–98.
Google Scholar
Kingma DP, B.J., Adam: A Method for Stochastic Optimization. 3rd International Conference on Learning Representations, ICLR 2015. , 2015.
Foucault C. Gated recurrence enables simple and accurate sequence prediction in stochastic, changing, and structured environments. Elife. 2021;10:e71801.
Google Scholar
Usha Ruby PT, Jacob J, Vamsidhar. Binary cross entropy with deep learning technique for Image classification. Int J Adv Trends Comput Sci Eng. 2020;9(4):5393–7.
Google Scholar
P F, et al. Scikit-learn: Machine learning in Python. J Mach Learn Res. 2021;12(2):2825–30.
MathSciNet Google Scholar
Jung G, Lee J, Kim I. Tracklet pair proposal and context reasoning for video scene graph generation. Sensors. 2021;21(9):3164.
Google Scholar
Altmann A, Toloşi L, Sander O, Lengauer T, et al. Permutation importance: a corrected feature importance measure. Bioinformatics. 2010;26(10):1340–7.
Google Scholar
Breiman L. Random forests. Mach Learn. 2001;45:5–32.
Google Scholar
Kolde, R. pheatmap: Pretty Heatmaps. R package version 1.0.12 2019; Available from: https://CRAN.R-project.org/package=pheatmap.
Neuwirth, E. RColorBrewer: ColorBrewer Palettes. R package version 1.1–3 2022; Available from: https://CRAN.R-project.org/package=RColorBrewer.
T., F. An introduction to ROC analysis. Pattern Recognit Lett. 2005;27(8):861–74.
Google Scholar
Yang Li, Haibin Wu, Jin X, Zheng P, Shiyun Hu, Xiaoling Xu, Wei Yu, Yan J. Study of cardiovascular disease prediction model based on random forest in eastern China. Sci Rep. 2020;10(1):5245.
Google Scholar
Ge Fang BW, Zhou X, et al. Item selection of diagnostic scale for syndrome of intermingled phlegm and blood stasis of angina pectoris and coronary heart disease[J]. J Tradition Chinese Med. 2019;60(22):1911–5 ((In Chinese)).
Google Scholar

Download references

Funding

The present study was supported by the State Key Program of National Natural Science Foundation of China [82230124], the General Program of the National Natural Science Foundation of China [81974556], the Youth Science Fund project [81202803], from National Nature Science Foundation of China, and Traditional Chinese Medicine Inheritance and innovation “Ten million” talent project - Qihuang Project Chief Scientist Project [0201000401] from National Administration of Traditional Chinese Medicine.

Author information

Authors and Affiliations

Department of Cardiology, Guang’anmen Hospital, China Academy of Chinese Medical Sciences, Beijing, 10053, China
Jinlei Liu, Qingyong He & Jie Wang
Dermatological Department, Beijing Hospital of Traditional Chinese Medicine, Capital Medical University, Beijing, 100010, China
Wenchao Dan
Beijing University of Chinese Medicine, Beijing, 100029, China
Wenchao Dan, Xudong Liu & Xiaoxue Zhong
Xi’an Jiaotong University, Xi’an, 710049, China
Cheng Chen

Authors

Jinlei Liu
View author publications
You can also search for this author in PubMed Google Scholar
Wenchao Dan
View author publications
You can also search for this author in PubMed Google Scholar
Xudong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoxue Zhong
View author publications
You can also search for this author in PubMed Google Scholar
Cheng Chen
View author publications
You can also search for this author in PubMed Google Scholar
Qingyong He
View author publications
You can also search for this author in PubMed Google Scholar
Jie Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

JL, WD, QH and JW: conceived this study. XL and XZ: collected the data. JL and WD: performed the analysis. JL and CC: completed the model training and visualization. JL: wrote the manuscript. The listed authors have reviewed and revised the manuscript.

Corresponding author

Correspondence to Jie Wang.

Ethics declarations

Conflict of interest

All authors declare that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, J., Dan, W., Liu, X. et al. Development and validation of predictive model based on deep learning method for classification of dyslipidemia in Chinese medicine. Health Inf Sci Syst 11, 21 (2023). https://doi.org/10.1007/s13755-023-00215-0

Download citation

Received: 31 January 2023
Accepted: 20 February 2023
Published: 06 April 2023
DOI: https://doi.org/10.1007/s13755-023-00215-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Development and validation of predictive model based on deep learning method for classification of dyslipidemia in Chinese medicine

Abstract

Backgrounds

Methods

Results

Conclusions

Similar content being viewed by others

AI-based prediction for the risk of coronary heart disease among patients with type 2 diabetes mellitus

A machine intelligence technique for predicting cardiovascular disease (CVD) using Radiology Dataset

Application of Deep Learning Methods in the Diagnosis of Coronary Heart Disease Based on Electronic Health Record

Introduction

Methods and materials

Source of data

Diagnostic criteria

Inclusion and exclusion criteria

Syndrome type distribution

Data preprocessing

The architecture of the models

Parameters setting of the models

Data statistics of the models

Calculating the factor importance of the diagnostic factors

Results

Screening of the diagnostic factors in TCM

Baseline situation of base variables and screened diagnostic factors

Model construction for the prediction of dyslipidemia with MOPS

Performance analysis of the prediction models for dyslipidemia with MOPS

The significance analysis of diagnostic factors for dyslipidemia with MOPS

Discussion

Strengths and limitations

Conclusions

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation