1 Introduction

Two of the most common forms of arterial disease are stenosis, narrowing of an arterial vessel, and aneurysm, an increase in the area of a vessel. They are estimated to affect between 1 and 20% of the population (Fowkes et al. 2013; Shadman et al. 2004; Mathiesen et al. 2001; Li et al. 2013), and ruptured abdominal aortic aneurysms alone are estimated to cause 6000–8000 deaths per year in the United Kingdom (Darwood et al. 2012). Current methods for the detection of arterial disease are primarily based on direct imaging of the vessels, which can be expensive and hence prohibitive for large-scale screening. If arterial disease can be detected by easily acquirable pressure and flow-rate measurements at select locations within the arterial network, then large-scale screening may be facilitated.

It is likely that the indicative biomarkers of arterial disease in the pressure and flow-rate profiles consist of micro inter- and intra-measurement details. In the past, detection of arterial disease has been proposed through the analysis of waveforms in combination with mathematical models of pulse wave propagation, see for example Sazonov et al. (2017), Stergiopulos et al. (1992). This, however, requires specification or identification of patient-specific network parameters, which is not easy to perform, especially at large scales.

This study explores the use of machine learning (ML) methods for the detection of stenoses and aneurysms in order to facilitate large-scale, low-cost screening and diagnosis. A data-driven ML approach is adopted, which does not require specification of patient-specific parameters. Instead, such algorithms learn patterns and biomarkers from a labelled data set. ML has a history of use in medical applications (Kononenko 2001). Classification algorithms have been shown to predict the presence of irregularities in heart valves (Çomak et al. 2007), arrhythmia (Song et al. 2005), and sleep apnea (Khandoker et al. 2009) from recorded time domain data. Recently, a study reported the successful use of ML methods to estimate pulse wave velocity from radial pressure wave measurements (Jin et al. 2020). Automatic detection, segmentation, and classification of abdominal aortic aneurysms (AAAs) in CT images are presented in Hong and Sheikh (2016), while the growth in severity of AAAs is predicted from CT images in Jiang et al. (2020). A previous study (Chakshu et al. 2020) applied deep-learning methods to AAA classification, using a synthetic data set created by varying seven parameters; accuracies of \(\approx 99.9\%\) are reported for binary classification of AAA based on three pressure measurements. Furthermore, Wang et al. (2021) achieved a sensitivity of 86.8% and a specificity of 86.3% for early detection of AAA from photoplethysmogram pulse waves, using a synthetic data set created by finding the mean and standard deviation of six cardiovascular properties for subjects of each age decade from 55 to 75 years, and then varying each property in combination with every other by ± 1 standard deviation from its age-specific mean value. These studies motivate the application of ML to detect arterial disease, both stenoses and aneurysms, using only pressure and flow-rate measurements at select locations in the arterial network. A previous proof-of-concept study (Jones et al.
2021c) showed promising results that ML classifiers can detect stenosis in a simple three vessel arterial network using only measurements of pressures and flow-rates. Here, these ideas are extended to a significantly larger, physiologically realistic, network of the human arterial system. All the ML methods are trained and tested on the virtual healthy subject database proposed in Jones et al. (2021a), which is augmented to introduce disease into the virtual subjects.

This study is organised as follows. It begins by briefly explaining the healthy VPD proposed in Jones et al. (2021a). Modifications to this database to create four different forms of arterial disease are presented next, along with the parameterisation of disease forms. This is followed by presentation of the ML methodology and metrics used for quantification of classification accuracies. Finally, these accuracies are assessed when using different combinations of pressure and flow-rate measurements, along with the analysis of patterns and behaviours observed in the ML classifiers.

2 Methodology

The ML algorithms are trained and tested on a data set containing both healthy subjects and diseased patients.

2.1 Healthy subjects

A physiologically realistic VPD containing healthy subjects is created in Jones et al. (2021a) and forms the starting point of this study. This database is available in Jones et al. (2021b). The arterial network contains 71 vessel segments and is shown in Fig. 1, along with the locations where disease occurs with high prevalence, and where measurements of pressure and flow-rate can potentially be acquired (Jones et al. 2021a). The healthy patient database of Jones et al. (2021a) contains 28,868 VPs and is referred to as \(\text {VPD}_{\text {H}}\). Disease is introduced into these healthy arterial networks as described next.

Fig. 1
figure 1

The connectivity of the arterial network, taken from Jones et al. (2021a). The locations of the four forms of disease (see Sect. 2.2.1) and of the six pressure and flow-rate measurements (see Sect. 2.3) are highlighted

2.2 Creation of unhealthy VPDs

2.2.1 Disease forms

The four most common forms of arterial disease are carotid artery stenosis (CAS), subclavian artery stenosis (SAS), peripheral arterial disease (PAD, a form of stenosis), and abdominal aortic aneurysm (AAA) (Jones et al. 2021a; Dyken et al. 1974; Kullo and Rooke 2016; Aboyans et al. 2010; Chen et al. 2013; Li et al. 2013). Their prevalence is restricted to the following vessels and shown in Fig. 1:

  • CAS is assumed to only affect the common carotid arteries. For simplification and consistency of notation, these vessels are referred to as the carotid artery chains (\(\hbox {CA}_{{\mathbf {x}}}\)).

  • SAS is assumed to affect the first and second subclavian segments. These two chains of vessels (one on the right and left side) are referred to as the subclavian artery chains (\(\hbox {SA}_{{\mathbf {x}}}\)).

  • PAD is assumed to affect the common iliacs; external iliacs; first and second femoral segments; and the first popliteal segments. These chains are referred to as the peripheral artery chains (\(\hbox {PA}_{{\mathbf {x}}}\)).

  • AAA is assumed to affect the first to fourth abdominal aortic segments. This chain of vessels is referred to as the abdominal aortic chain (\(\hbox {AA}_{{\mathbf {x}}}\)).

It is assumed that each diseased VP has only one of the four forms of arterial disease. Four complementary databases corresponding to \(\text {VPD}_{\text {H}}\) are constructed, each pertaining to one form of arterial disease. To create the diseased VPD corresponding to CAS, referred to as \(\text {VPD}_{\text {CAS}}\), for every subject in \(\text {VPD}_{\text {H}}\), disease is introduced in \(\hbox {CA}_{\mathrm {x}}\) (i.e. the left or right carotid artery). This is achieved by taking the arterial network of a subject from \(\hbox {VPD}_{\text {H}}\), artificially introducing a stenosis in \(\hbox {CA}_{\mathrm {x}}\), and then using a one-dimensional pulse-wave propagation model—which has previously been widely employed, tested, and validated (Boileau et al. 2015; Formaggia et al. 2003; Alastruey et al. 2012; Olufsen et al. 2000; Reymond et al. 2009; Matthys et al. 2007)—to compute the pressure and flow-rate waveforms. Note that this model has also been used to study haemodynamics in both stenosis (Boileau et al. 2018; Carson et al. 2019; Jin and Alastruey 2021) and aneurysms (Sazonov et al. 2017; Chakshu et al. 2020; Jin and Alastruey 2021). The numerical implementation of the pulse-wave propagation model employed here is outlined in Jones et al. (2021a) and validated against a discontinuous Galerkin (DCG) scheme (Alastruey et al. 2012), which in turn has been successfully validated against a 3D model of blood-flow through stenosed arterial vessels (Boileau et al. 2018).

Thus, \(\text {VPD}_{\text {CAS}}\) contains 28,868 VPs with CAS. Similarly, the databases corresponding to SAS, PAD, and AAA are created, and referred to as \(\text {VPD}_{\text {SAS}}\), \(\text {VPD}_{\text {PAD}}\), and \(\text {VPD}_{\text {AAA}}\), respectively. The disease severities, locations, and shapes are varied randomly across these databases as described next.

2.2.2 Parameterisation of diseased vessels

The severity of stenoses (percentage reduction in area) is varied between 50 and 95%. The lower limit of 50% is set for the stenoses to be haemodynamically significant (Aboyans et al. 2010; Subramanian et al. 2005), and the upper limit of 95% reflects near total occlusion. For aneurysms, based on Ernst (1993) and Davis et al. (2013), an allowable range of AAA severities of 4 cm–6 cm diameter is chosen. This corresponds to a cross-sectional area range of \(12.56\text { cm}^2\)–\(28.27\text { cm}^2\). With the abdominal aortic area in the reference network (Jones et al. 2021a) between 1.09 and \(1.76\text { cm}^2\), the corresponding AAA severities are set to vary between 713% (12.56/1.76) and 2593% (28.27/1.09). With the above ranges, the parameterisation of area increase/reduction proposed in Jones et al. (2021c) is adopted, see Fig. 2. For a chain of diseased vessels (\(\hbox {CA}_{\text {x}}\), \(\hbox {SA}_{\text {x}}\), \(\hbox {PA}_{\text {x}}\), or \(\hbox {AA}_{\text {x}}\)), the normalised area \(A_n\) as a function of the normalised x-coordinate, \(x_n\), is represented as:

$$\begin{aligned} A_{n}= {\left\{ \begin{array}{ll} \bigg (1 \mp \dfrac{{\mathcal {S}}}{2} \bigg ) \pm \dfrac{{\mathcal {S}}}{2} \cos \left( \dfrac{2 (x_n-b) \pi }{e-b}\right) &{} \text {for } b\le x_n \le e \\ 1 &{} \text {otherwise} \end{array}\right. } \end{aligned}$$
(1)

where \({\mathcal {S}}\) represents the severity, b the normalised starting location of the disease in the vessel chain, and e the normalised end location. \(A_n\) is normalised with respect to the healthy version of the vessel in \(\hbox {VPD}_{\text {H}}\), and the upper and lower signs produce a stenosis and an aneurysm, respectively. In \(\hbox {CA}_{\text {x}}\), \(\hbox {SA}_{\text {x}}\), and \(\hbox {PA}_{\text {x}}\), the left and right side vessels are chosen with equal probability.
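The parameterisation of Eq. (1) can be sketched as follows (a minimal illustration, not the authors' implementation; the function name and the `aneurysm` flag used to select between the two signs are our assumptions):

```python
import numpy as np

def normalised_area(x_n, severity, b, e, aneurysm=False):
    """Normalised area profile of Eq. (1) along a diseased vessel chain.

    x_n      : normalised x-coordinate(s) in [0, 1]
    severity : disease severity S
    b, e     : normalised start and end locations of the disease
    aneurysm : False -> stenosis (area dips to 1 - S at the centre),
               True  -> aneurysm (area rises to 1 + S at the centre)
    """
    x_n = np.asarray(x_n, dtype=float)
    sign = 1.0 if aneurysm else -1.0
    bump = (1.0 + sign * severity / 2.0) \
        - sign * (severity / 2.0) * np.cos(2.0 * np.pi * (x_n - b) / (e - b))
    # Outside the diseased region [b, e] the area equals the healthy area.
    return np.where((x_n >= b) & (x_n <= e), bump, 1.0)
```

At the centre of the diseased region the cosine equals \(-1\), so the area reaches \(1-{\mathcal {S}}\) for a stenosis and \(1+{\mathcal {S}}\) for an aneurysm, while at both ends it smoothly matches the healthy area.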

Fig. 2
figure 2

Examples of a stenosis of severity 0.6 and an aneurysm of severity 8.0 are shown. Both disease profiles are created with a start location of 0.2 and an end location of 0.8

The disease severity \({\mathcal {S}}\), start location b, and end location e are assigned uniform distributions based on physical considerations. To sample values for these parameters, a fourth parameter, the reference location of the disease (represented by r) is introduced. This is included to impose a minimum length of 10% of the chain length on the disease profiles. Thus, the parameters for disease are sampled sequentially from uniform distributions within the following bounds:

$$\begin{aligned} \text {Bounds:} {\left\{ \begin{array}{ll} 0.2 \le r \le 0.8,\\ 0.1 \le b \le r-0.05, \\ r+0.05 \le e \le 0.9,\\ {\left\{ \begin{array}{ll} 0.5 \le {\mathcal {S}} \le 0.95 &\quad\text {for stenoses,}\\ 7.13 \le {\mathcal {S}} \le 25.93 &\quad \text {for aneurysms.}\\ \end{array}\right. } \end{array}\right. } \end{aligned}$$
(2)
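The sequential sampling within the bounds of Eq. (2) might be implemented as sketched below (an illustration; the function name and the use of NumPy's random generator are our assumptions):

```python
import numpy as np

def sample_disease_parameters(rng, stenosis=True):
    """Sequentially sample r, b, e, and S from the uniform bounds of Eq. (2).

    Sampling b below r - 0.05 and e above r + 0.05 guarantees a minimum
    disease length of 10% of the chain length (e - b >= 0.1).
    """
    r = rng.uniform(0.2, 0.8)            # reference location
    b = rng.uniform(0.1, r - 0.05)       # start location
    e = rng.uniform(r + 0.05, 0.9)       # end location
    if stenosis:
        severity = rng.uniform(0.5, 0.95)
    else:
        severity = rng.uniform(7.13, 25.93)
    return severity, b, e
```

Note that the reference location r is discarded after sampling; its only role is to couple the start and end locations so that the disease never degenerates to zero length.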

Based on the above parameterisation, examples of healthy and diseased \(\hbox {SA}_{\text {x}}\), \(\hbox {PA}_{\text {x}}\), and \(\hbox {AA}_{\text {x}}\) area profiles are shown in the left and right columns of Fig. 3, respectively.

Fig. 3
figure 3

Examples of healthy and diseased \(\hbox {SA}_{\text {x}}\), \(\hbox {PA}_{\text {x}}\), and \(\hbox {AA}_{\text {x}}\) area profiles. The geometrical boundaries between vessel segments that form the chains are indicated by red dashed lines

2.3 Measurements

A review of potential measurements that can be acquired in the network is presented in Jones et al. (2021a). Based on this, the locations at which time-varying pressure and flow-rate measurements can be acquired are shown in Fig. 1 and described below.

  • Pressure in the carotid and radial arteries, measured using applanation tonometry (Adji et al. 2006; O’rourke 2015). To simplify annotation and description, the right and left carotid artery pressures are referred to as \(P_1^{\text {(R)}}\) and \(P_1^{\text {(L)}}\), respectively. Similarly, the right and left radial artery pressures are referred to as \(P_3^{\text {(R)}}\) and \(P_3^{\text {(L)}}\), respectively.

  • Pressure in the brachial arteries estimated through reconstruction of finger arterial pressure (Guelen et al. 2008). The right and left brachial artery pressures are referred to as \(P_2^{\text {(R)}}\) and \(P_2^{\text {(L)}}\) , respectively.

  • Flow-rate in the carotid, brachial, and femoral arteries measured using Doppler ultrasound (Byström et al. 1998; Oglat et al. 2018; Radegran 1997). The right and left carotid artery, brachial, and femoral flow-rates are referred to as \(Q_1^{\text {(R)}}\), \(Q_1^{\text {(L)}}\); \(Q_2^{\text {(R)}}\), \(Q_2^{\text {(L)}}\); and \(Q_3^{\text {(R)}}\), \(Q_3^{\text {(L)}}\), respectively.

2.3.1 Provision of measurements to ML classifiers

Unless specified otherwise, the measurements provided to the ML classifiers are bilateral, i.e. when \(Q_1\) is specified, it is implied that both the right and left carotid flow-rates are used:

$$\begin{aligned} Q_1 = \{Q_1^{\text {(R)}}, Q_1^{\text {(L)}}\}. \end{aligned}$$
(3)

There are, therefore, a total of six bilateral measurements available: three pressures and three flow-rates. To reduce the dimensionality required to describe each pressure or flow-rate measurement, the periodic profiles are described through a Fourier series (FS) representation:

$$\begin{aligned} u(t)=\sum _{n=0}^N a_n \sin (n \omega t) + b_n \cos (n \omega t), \end{aligned}$$
(4)

where u represents any pressure or flow-rate profile; \(a_n\) and \(b_n\) represent the \(n{\text {th}}\) sine and cosine FS coefficients, respectively; N represents the truncation order; and \(\omega ={2 \pi }/{T}\), with T as the time period of the cardiac cycle. It is found in Jones et al. (2021c) that haemodynamic profiles can be described by a FS truncated at \(N=5\). Thus, each individual measurement is described by 11 FS coefficients, and each bilateral measurement by 22 FS coefficients.
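The FS coefficients of Eq. (4) can be computed from a uniformly sampled waveform as sketched below (an illustration, not the authors' implementation; it assumes the signal spans exactly one cardiac cycle and normalises time so that T = 1):

```python
import numpy as np

def fourier_coefficients(u, N=5):
    """Sine and cosine FS coefficients a_n, b_n of Eq. (4) for a periodic
    signal u sampled uniformly over one cardiac cycle.

    Since a_0 multiplies sin(0) = 0, only 11 of the 2*(N+1) = 12
    coefficients are informative for N = 5.
    """
    u = np.asarray(u, dtype=float)
    M = len(u)
    t = np.arange(M) / M                  # normalised time, T = 1
    a = np.zeros(N + 1)
    b = np.zeros(N + 1)
    b[0] = u.mean()                       # n = 0 cosine term (the mean)
    for n in range(1, N + 1):
        a[n] = 2.0 * np.mean(u * np.sin(2 * np.pi * n * t))
        b[n] = 2.0 * np.mean(u * np.cos(2 * np.pi * n * t))
    return a, b

def reconstruct(a, b, t, T=1.0):
    """Evaluate the truncated series of Eq. (4) at times t."""
    w = 2 * np.pi / T
    n = np.arange(len(a))[:, None]
    return (a[:, None] * np.sin(n * w * t)
            + b[:, None] * np.cos(n * w * t)).sum(axis=0)
```

A waveform containing only harmonics up to order N is reproduced exactly by this truncated representation; higher harmonics in measured profiles are discarded.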

2.4 Machine learning classifiers

A model mapping a vector of input measurements, \(\varvec{x}\), to a discrete output classification, y, can be described as:

$$\begin{aligned} y = m(\varvec{x}) \quad y \in \{{\mathcal {C}}^{(1)}, {\mathcal {C}}^{(2)}\}, \end{aligned}$$
(5)

where \({\mathcal {C}}^{(j)}\) represents the \(j{\text {th}}\) possible classification. In the context of this study, the measured inputs, \(\varvec{x}\), represent the FS coefficients of a user-defined combination of the haemodynamic measurements \(\{Q_1\), \(Q_2\), \(Q_3\), \(P_1\), \(P_2\), \(P_3\}\) (see Sect. 2.3.1) taken from VPs, and the output classification represents the corresponding health of those VPs: \({\mathcal {C}}^{(1)}\) = ‘healthy’ and \({\mathcal {C}}^{(2)}\) = ‘diseased’. To account for large differences in the magnitudes of the components of \(\varvec{x}\), they are individually transformed with the Z-score standardisation method (Mohamad and Usman 2013) to have zero mean and unit variance.
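The Z-score standardisation step can be sketched as follows (a minimal illustration; in practice the mean and standard deviation would be computed on the training set and reused unchanged for the test set):

```python
import numpy as np

def zscore_fit(X):
    """Per-feature mean and standard deviation (Z-score parameters).

    X is a matrix whose rows are VPs and whose columns are FS coefficients.
    """
    return X.mean(axis=0), X.std(axis=0)

def zscore_apply(X, mean, std):
    """Standardise each feature to zero mean and unit variance."""
    return (X - mean) / std
```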

As previously stated, it is assumed that disease in a patient is limited to only one of the four forms. As a first exploratory study, the ML classifiers are created for each form independently. All classifiers are therefore binary (see Jones et al. 2021c), i.e. four independent classifiers are trained to answer the question: “Does a VP belong to \(\text {VPD}_{\text {H}}\) or \(\text {VPD}_x\)?”, where x can be CAS, SAS, PAD, or AAA.

2.4.1 Training and test sets

Each VP in \(\text {VPD}_{\text {CAS}}\), \(\text {VPD}_{\text {SAS}}\), \(\text {VPD}_{\text {PAD}}\), and \(\text {VPD}_{\text {AAA}}\) shares an identical underlying arterial network, apart from the diseased chain, with the corresponding healthy subject in \(\hbox {VPD}_{\text {H}}\). It is, therefore, important to ensure that the same subset of VPs is not included in both the healthy and diseased data sets used for the ML classifiers. As the four forms of disease are mutually exclusive, four independent training and test sets, each corresponding to one form of disease, are constructed in the following three stages:

  • Step 1: Half of the available VPs are randomly selected from \(\text {VPD}_{\text {H}}\) for inclusion within the ML data set; this subset is referred to as \(\text {VPD}_{\text {H-ML}}\). The unhealthy VPs corresponding to the remaining unused half are taken from the appropriate unhealthy VPD (\(\text {VPD}_{\text {CAS}}\), \(\text {VPD}_{\text {SAS}}\), \(\text {VPD}_{\text {PAD}}\), or \(\text {VPD}_{\text {AAA}}\)) and incorporated into the ML data set. The resulting data sets are referred to as \(\text {VPD}_{\text {CAS-ML}}\), \(\text {VPD}_{\text {SAS-ML}}\), \(\text {VPD}_{\text {PAD-ML}}\), and \(\text {VPD}_{\text {AAA-ML}}\), respectively.

  • Step 2: The data sets of Step 1 are combined to create four complete data sets, each containing 50% healthy and 50% unhealthy VPs:

    1. \(\text {VPD}_{\text {H-ML}}\cup \text {VPD}_{\text {CAS-ML}}\)

    2. \(\text {VPD}_{\text {H-ML}}\cup \text {VPD}_{\text {SAS-ML}}\)

    3. \(\text {VPD}_{\text {H-ML}}\cup \text {VPD}_{\text {PAD-ML}}\)

    4. \(\text {VPD}_{\text {H-ML}}\cup \text {VPD}_{\text {AAA-ML}}\)

  • Step 3: The four data sets of Step 2 are randomly split into a training set containing 2/3 of all the VPs in the data set, and a test set containing 1/3 of all the VPs.

The performance of all ML classifiers is evaluated using fivefold validation: for each fold, the same data set from Step 2 is used, but a different random split is sampled in Step 3 for training and testing.
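The three steps above, together with the requirement that a VP's underlying arterial network never appears in both classes, can be sketched with index arrays (an illustration with hypothetical function and variable names):

```python
import numpy as np

def split_vp_indices(n_vps, seed=0):
    """Steps 1-3 of Sect. 2.4.1 sketched with VP indices.

    Half of the healthy VPs enter the ML data set as healthy; the
    diseased counterparts of the *other* half enter as unhealthy, so no
    underlying arterial network appears in both classes.  The combined
    set is then randomly split 2/3 training and 1/3 test.
    """
    rng = np.random.default_rng(seed)
    perm = rng.permutation(n_vps)
    healthy_ids = perm[: n_vps // 2]          # Step 1: VPD_H-ML
    diseased_ids = perm[n_vps // 2:]          # Step 1: e.g. VPD_CAS-ML

    # Step 2: combined data set (label 0 = healthy, 1 = diseased).
    ids = np.concatenate([healthy_ids, diseased_ids])
    labels = np.concatenate([np.zeros(len(healthy_ids), int),
                             np.ones(len(diseased_ids), int)])

    # Step 3: random 2/3 - 1/3 train/test split.
    order = rng.permutation(len(ids))
    n_train = (2 * len(ids)) // 3
    train, test = order[:n_train], order[n_train:]
    return ids[train], labels[train], ids[test], labels[test]
```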

2.4.2 ML methods

The purpose of this study is to perform an initial exploratory investigation into the possibility of using ML classifiers to detect different forms of arterial disease. The focus is, therefore, on uncovering patterns and behaviours (such as which haemodynamic measurements are particularly informative) rather than on optimisation to achieve ever higher accuracies. Given this objective, it is not feasible to perform extensive optimisation and analysis on every ML classifier trained and tested. The ML methods are, therefore, chosen for their robustness, i.e. minimal sensitivity to the hyper-parameters and minimal susceptibility to problems such as overfitting, relative to more complex deep learning methods. Five ML methods are employed: random forest, gradient boosting, naive Bayes, support vector machine, and logistic regression. These methods encompass a range of probabilistic and non-probabilistic applications of different modelling approaches (see Table 1) while fulfilling the aforementioned characteristics. Alongside these five ML methods, one deep learning method, the multi-layer perceptron, is employed for comparison. It is expected a priori that multi-layer perceptron classifiers will not perform to their full potential in this study, as they rely more heavily on complex hyper-parameter optimisation and monitoring for overfitting than the five ML methods. Their use will, however, provide some, albeit limited, comparison between ML and deep learning methods. Since standard versions and implementations of these methods are employed without modification, their methodological details are not presented in this study. Instead, the reader is referred to the following references:

  1. Random Forest (RF) (Liaw and Wiener 2002; Breiman 2001)

  2. Gradient Boosting (GB) (Friedman 2001; Elith et al. 2008)

  3. Naive Bayes (NB) (Rish et al. 2001b, a)

  4. Support Vector Machine (SVM) (Kecman 2005)

  5. Logistic Regression (LR) (Sperandei 2014; Hilbe 2009; Jones et al. 2021c)

  6. Multi-layer Perceptron (MLP) (Murtagh 1991)

The implementations of all the above algorithms in the Python package Scikit-learn (Pedregosa et al. 2011) are used. Some of these methods require optimisation of hyper-parameters, which is described after the performance quantification metrics are presented in the next section.
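In Scikit-learn, the six methods can be instantiated as sketched below (a minimal illustration; only the SVM and LR settings stated in Sect. 2.5.1 are shown, with all other hyper-parameters left at their defaults rather than the grid-searched values reported later):

```python
from sklearn.ensemble import RandomForestClassifier, GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB
from sklearn.neural_network import MLPClassifier
from sklearn.svm import SVC

# One binary classifier per method; each would be trained independently
# on one of the four combined healthy/diseased data sets.
classifiers = {
    "RF": RandomForestClassifier(),
    "GB": GradientBoostingClassifier(),
    "NB": GaussianNB(),                              # normal (Gaussian) NB
    "SVM": SVC(kernel="rbf", gamma="scale"),         # RBF kernel
    "LR": LogisticRegression(solver="liblinear"),    # LIBLINEAR solver
    "MLP": MLPClassifier(activation="logistic"),     # logistic activation
}
```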

Table 1 The four different modelling approaches and how each classification method aligns with these approaches

2.4.3 Quantification of results

Classifier performance is assessed by two metrics: sensitivity and specificity in combination, and the \(F_1\) score. Figure 4 shows the definitions of sensitivity, specificity, and the \(F_1\) score, along with the related concepts of precision and recall commonly used in the assessment of classifiers. It is desirable for both sensitivity and specificity to be high; similarly, a higher \(F_1\) score is desirable. Since the \(F_1\) score is a single scalar metric that balances both precision and recall, it is a good metric for comparing classifiers when tuning the hyper-parameters of ML algorithms. For a discussion of these metrics and their relevance, please refer to Jones et al. (2021c).

Fig. 4
figure 4

The relationship between sensitivity, specificity, recall, and precision. TP: True Positive, representing VPs belonging to a classification correctly identified; FN: False Negative, representing VPs belonging to a classification incorrectly identified; FP: False Positive, representing VPs not belonging to a classification incorrectly identified; and TN: True Negative, representing VPs not belonging to a classification correctly identified
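From the confusion-matrix entries defined above, the metrics can be computed as follows (a sketch using the standard definitions: recall equals sensitivity, precision is TP/(TP+FP), and the \(F_1\) score is the harmonic mean of precision and recall):

```python
def classification_metrics(tp, fn, fp, tn):
    """Sensitivity, specificity, and F1 score from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)            # recall
    specificity = tn / (tn + fp)
    precision = tp / (tp + fp)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return sensitivity, specificity, f1
```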

2.5 Hyper-parameter optimisation

The architecture of LR, NB, and SVM classifiers can all be considered to be problem independent. While these three algorithms are able to undergo varying levels of problem specific optimisation, the underlying structure of the classifier usually does not change. The architectures of RF, MLP, and GB classifiers, however, are dependent on the specific problem. The architecture choices for the classifiers and associated hyper-parameter optimisation are described next. For all six methods, all hyper-parameters that are neither optimised nor specified in the text are set to their default values within Scikit-learn (Pedregosa et al. 2011).

2.5.1 LR, SVM, and NB

For LR, the ‘LIBLINEAR’ solver offered by the Scikit-learn (Pedregosa et al. 2011) package is chosen. In the case of SVM, a kernel is typically chosen to map the input measurements to a higher order feature space (Jakkula 2006). All SVM classifiers use a radial basis function kernel (Scholkopf et al. 1997), with the Scikit-learn hyper-parameter ‘gamma’ set to ‘scale’. In the case of NB, the distribution of input measurements across the data set is chosen to be normal (Murphy et al. 2006).

2.5.2 Random Forest

In the case of RF, the number of trees in the ensemble and the maximum depth of each tree are optimised. Other hyper-parameters that can be tuned include the minimum number of data points allowed in a leaf node and the maximum number of different features considered for splitting each node; their effect, however, is not investigated here. To optimise the two chosen hyper-parameters, a grid search is carried out. A grid is constructed by discretising the possible number of trees within the ensemble between 10 and 400 at intervals of 10, and the possible depth of each tree between 20 and 200 at intervals of 10. RF classifiers are trained for every combination with all six pressure and flow-rate measurements (see Sect. 2.3.1) across all four forms of arterial disease. The hyper-parameters producing the highest \(F_1\) score are found for each form of disease, and this combination of hyper-parameters is then used for all subsequent classifiers. The optimal hyper-parameters for each of the four forms of disease are shown in Table 2, along with the \(F_1\) score achieved by each.
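The grid search can be sketched as below (an illustration using Scikit-learn; the function name and fixed random seed are our assumptions, and in practice the \(F_1\) score would be averaged across folds rather than taken from a single split):

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import f1_score

def rf_grid_search(X_tr, y_tr, X_te, y_te,
                   n_trees_grid=range(10, 401, 10),
                   depth_grid=range(20, 201, 10)):
    """Exhaustive search over ensemble size and tree depth, keeping the
    architecture that produces the highest F1 score on the test data."""
    best = (None, None, -1.0)
    for n_trees in n_trees_grid:
        for depth in depth_grid:
            clf = RandomForestClassifier(n_estimators=n_trees,
                                         max_depth=depth,
                                         random_state=0)
            clf.fit(X_tr, y_tr)
            f1 = f1_score(y_te, clf.predict(X_te))
            if f1 > best[2]:
                best = (n_trees, depth, f1)
    return best  # (n_trees, depth, f1)
```

The same loop structure applies to the GB and MLP grid searches of the following sections, with the estimator and grids swapped accordingly.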

Table 2 The hyper-parameters describing the architecture of the RF classifiers that produce the highest \(F_1\) scores, when using all six pressure and flow-rate measurements

It is unlikely that a single architecture will consistently produce the best results when varying the combination of input measurements. In this study, re-optimisation of the hyper-parameters when varying the input measurement combination is not performed, to minimise computational cost. It is found that when using all six pressure and flow-rate measurements, the \(F_1\) score produced is relatively insensitive to the hyper-parameters used. Thus, it is likely that a reasonable representation of the maximum achievable accuracy can be obtained for various input measurement combinations by a single architecture. It should be noted, however, that further improvements in classification accuracy may be possible with such re-optimisation.

2.5.3 Gradient Boosting

Similar to the RF architecture, the GB architecture is optimised by varying the number of trees within the ensemble and the maximum depth of each tree. Other hyper-parameters that may be varied, but are not considered here, include the minimum number of data points allowed in a leaf node, the maximum number of different features considered for splitting each node, and the impact of each tree on the final outcome (i.e. the learning rate). A grid search is carried out to find the combination producing the highest \(F_1\) score when using all six input measurements. It is common for GB classifiers to use weaker, shallower decision trees (relative to RF classifiers) to deliberately create high bias and low variance (Hastie et al. 2009). The possible depth of each tree is, therefore, discretised between 2 and 20 at intervals of 1. As a high number of trees is not required to compensate for overfitting, contrary to the RF method, the possible number of trees within the ensemble is discretised between 10 and 100 at intervals of 10. The optimal hyper-parameters for each of the four forms of disease are shown in Table 3.

Table 3 The hyper-parameters describing the architecture of the GB classifiers that produce the highest \(F_1\) scores, when using all six pressure and flow-rate measurements

2.5.4 Multi-layer perceptron

As is common with deep learning methods, there are significantly more hyper-parameters to optimise for the MLP classifiers than for the five ML methods. Examples of hyper-parameters that significantly affect the performance of an MLP classifier include the batch size, learning rate, activation functions, drop-out, and number of units per hidden layer. In keeping with the exploratory stance of this study, only the number of neurons within each hidden layer and the number of hidden layers are optimised. For simplification, all hidden layers are assumed to contain an identical number of neurons, and a logistic activation function is used for all hidden layers. It is likely that this simplistic hyper-parameter optimisation will limit the classification accuracy achieved by the MLP classifiers.

Similar to the RF and GB methodology, the hyper-parameters that produce the highest \(F_1\) score are found through a grid search. The number of neurons within each layer is discretised between 10 and 200 at intervals of 10, and the number of hidden layers between 1 and 6 at intervals of 1. The optimal hyper-parameters found for each of the four forms of disease are shown in Table 4. Relative to RF and GB, there is less consistency in the maximum \(F_1\) scores achieved by MLP: it classifies AAA and CAS to high levels of accuracy, but performs relatively poorly for SAS and PAD.

Table 4 The hyper-parameters describing the architecture of the MLP classifiers that produce the highest \(F_1\) scores, when using all six pressure and flow-rate measurements

2.6 Input measurement combination search

There are 63 possible combinations of input measurements that can be provided to an ML classifier from the six bilateral pressure and flow-rate measurements (see Sect. 2.3.1). A combination search is performed for each of the four forms of disease: for every combination of input measurements, all six ML classification methods are trained and subsequently tested to quantify their performance. The average \(F_1\) score, sensitivity, and specificity for each case across the five folds are recorded. Combinations of interest are then further analysed.
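Enumerating the 63 combinations is straightforward (a sketch; the measurement labels are ours):

```python
from itertools import combinations

# The six bilateral measurements of Sect. 2.3.1.
measurements = ["Q1", "Q2", "Q3", "P1", "P2", "P3"]

# Every non-empty subset: 2**6 - 1 = 63 combinations.
all_combinations = [combo
                    for r in range(1, len(measurements) + 1)
                    for combo in combinations(measurements, r)]
```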

2.7 Overfitting and early stopping criterion

To assess any overfitting by the ML and deep-learning methods, the log loss costs across the training and test sets are recorded at each sequential iteration of the training process (up to the 200\({\text {th}}\) iteration). At a low number of training iterations, both the training and test costs are expected to be high, as the classifiers can neither fit the training data nor generalise to the test data. As the training process progresses, the training and test costs are both expected to decay before converging to stable values in the absence of overfitting. In the case of overfitting, however, the training costs continue to decrease while the test costs, after reaching a minimum, successively increase. In such cases, an early stopping criterion (Prechelt 1998; Yao et al. 2007) is adopted to avoid overfitting. A third partition of the available data (the validation set) is introduced. The combined healthy and unhealthy data sets described in Sect. 2.4.1 are split so that the training set contains 50%, the validation set 25%, and the test set 25% of the available data. Classifiers are trained on the training set; however, the stopping criterion is based on the log loss cost across the validation set, computed at each sequential iteration of the training process. If more than 75 iterations have been performed, and the improvement in the validation log loss cost between two consecutive iterations is less than \(1\times 10^{-3}\), training is stopped. The final classifier accuracy is assessed on the test set.
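The stopping rule can be sketched generically (an illustration; the `step` callback, which performs one training iteration and returns the current validation log loss, is a hypothetical interface):

```python
def train_with_early_stopping(step, max_iter=200, patience_start=75, tol=1e-3):
    """Iterative training with the early stopping rule of Sect. 2.7.

    Training stops once more than `patience_start` iterations have run
    and the improvement in the validation log loss between two
    consecutive iterations falls below `tol`.
    """
    prev_cost = None
    for it in range(1, max_iter + 1):
        cost = step()
        if (prev_cost is not None and it > patience_start
                and prev_cost - cost < tol):
            break
        prev_cost = cost
    return it, cost
```

With a validation cost that keeps decreasing but ever more slowly, training stops shortly after the 75-iteration threshold, once per-iteration improvements drop below the tolerance.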

3 Results and discussion

The full tables of results for CAS, SAS, PAD, and AAA classification are shown in Appendices A, B, C, and D, respectively. The \(F_1\) scores achieved by each ML method and combination of input measurements are shown for CAS, SAS, PAD, and AAA classification in Figs. 5, 6, 7, and 8, respectively. They show that for all forms of arterial disease, NB and LR classifiers consistently produce low accuracy. It has previously been shown in the PoC (Jones et al. 2021c) that the partition between the pressure and flow-rate profiles of healthy and stenosed patients is likely to be nonlinear. The consistently low accuracy of LR supports this finding, as LR is the only linear classification method used. The low accuracy of the NB classifiers is also consistent with the results of the PoC (Jones et al. 2021c), which found the NB method to be poorly suited to the problem of distinguishing between haemodynamic profiles. On the contrary, across all four forms of disease, the tree-based methods (RF and GB) consistently produce high accuracy results. This contradicts the finding of the PoC (Jones et al. 2021c) and is likely due to inadequate architecture optimisation in, or the unsuitability of RF for, the smaller network used in the PoC (Jones et al. 2021c). The fact that both RF and GB classifiers produce high accuracy classification in this study suggests that tree-based methods are well suited to distinguishing between haemodynamic profiles, and emphasises the importance of adequate architecture optimisation.

There is less consistency in the results achieved by the SVM and MLP classifiers across the different forms of disease. SVM classifiers produce accuracies comparable with RF and GB classifiers in the case of AAA detection, but low-accuracy results for the three other forms of disease. MLP classifiers produce accuracies comparable with RF and GB classifiers in the case of CAS and AAA detection, but relatively low-accuracy results for SAS and PAD classification. Overall, it is found that the tree-based methods of RF and GB perform best, with GB performance slightly superior to that of RF. It is important to remember, however, that the results presented here do not necessarily capture the full potential of each method, and instead only reflect the accuracies achieved within the limitations of the simplistic hyper-parameter optimisation, a consideration particularly important for MLP.

Fig. 5
figure 5

The \(F_1\) scores achieved for CAS using each combination of bilateral input measurements are shown. Measurements included within each combination are highlighted with a black square

Fig. 6
figure 6

The \(F_1\) scores achieved for SAS using each combination of bilateral input measurements are shown. Measurements included within each combination are highlighted with a black square

Fig. 7
figure 7

The \(F_1\) scores achieved for PAD using each combination of bilateral input measurements are shown. Measurements included within each combination are highlighted with a black square

Fig. 8
figure 8

The \(F_1\) scores achieved for AAA using each combination of bilateral input measurements are shown. Measurements included within each combination are highlighted with a black square

3.1 Measurement combinations

To investigate the importance of both the number of input measurements provided to the ML algorithms and the specific combination of measurements, the average \(F_1\) scores achieved by all classifiers when providing one, two, three, four, five, or six input measurements are computed. In each case, the specific combinations that achieve the maximum and minimum \(F_1\) scores are also recorded. These results for the different forms of disease are presented next.
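A minimal sketch of this combination search, assuming a hypothetical helper `evaluate_f1` that trains and scores a classifier on a given subset of the six bilateral measurements (the measurement names below follow the paper's \(P_i\)/\(Q_i\) notation):

```python
from itertools import combinations

MEASUREMENTS = ["P1", "P2", "P3", "Q1", "Q2", "Q3"]

def combination_summary(evaluate_f1):
    """For each subset size k = 1..6, return the average, maximum, and
    minimum F1 score over all k-measurement combinations, plus the
    best-scoring combination itself."""
    summary = {}
    for k in range(1, len(MEASUREMENTS) + 1):
        scores = {c: evaluate_f1(c) for c in combinations(MEASUREMENTS, k)}
        best = max(scores, key=scores.get)
        vals = list(scores.values())
        summary[k] = {"avg": sum(vals) / len(vals),
                      "max": scores[best],
                      "min": min(vals),
                      "best": best}
    return summary
```

With six measurements this is \(\sum_{k=1}^{6}\binom{6}{k} = 63\) classifier trainings per method and disease form, which is consistent with exhaustively tabulated results in the appendices.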

3.1.1 CAS classification

The average, maximum, and minimum \(F_1\) scores achieved when providing different numbers of input measurements for CAS classification are shown in Fig. 9.

Fig. 9
figure 9

The average, maximum, and minimum \(F_1\) score achieved by all classifiers trained using different numbers of input measurements are shown for carotid artery stenosis classification. The central markers represent the average score achieved, while the error bars indicate the upper and lower limits

It shows that NB and LR classifiers consistently produce an \(F_1\) score of approximately 0.5, which is comparable to naive classification, i.e. randomly assigning the health of VPs with an equal probability for each outcome. SVM performs slightly better, with \(F_1\) scores averaging 0.5–0.6. The other three classification methods (RF, MLP, and GB) perform significantly better, with \(F_1\) scores generally averaging between 0.7 and 0.95 and showing a clear increase in the average \(F_1\) score as the number of input measurements increases. While the average and minimum \(F_1\) scores achieved by RF and GB classifiers continuously increase, the maximum \(F_1\) score quickly reaches a plateau (at one input measurement for RF and three input measurements for GB). For a fixed number of measurements, the wide range of \(F_1\) scores in Fig. 9 across all classifiers suggests that specific combinations of measurements may be more important than others for optimal classification. To explore this further, the combinations of input measurements that produce the highest \(F_1\) scores, and the corresponding accuracies, when employing the RF and GB methods are shown in Table 5. Two observations are made from this table. First, for a fixed number of measurements, the best combinations are not identical for the two methods. For example, when two measurements are used, the best combination for RF is (\(Q_2\), \(Q_1\)), while the best combination for GB is (\(P_2\), \(P_1\)). This suggests that the best combination of measurements is likely dependent on the particular ML method chosen. Second, some patterns stand out with respect to which measurements may be more informative than others. For example, across Table 5, \(Q_1\) appears in 11 out of 12 combinations, and \(P_1\) appears in 8 out of 12 combinations. This suggests that \(Q_1\) is most informative for identifying the presence of CAS, followed by \(P_1\).
Physiologically, this is not surprising, as \(Q_1\) and \(P_1\) are the flow-rates and pressures in the carotid arteries, and the disease under consideration is carotid artery stenosis. It is encouraging that the ML methods are indeed placing more importance on the relevant physiological measurements. In fact, it is remarkable that RF and GB both achieve \(F_1\) scores above 0.85, and sensitivities and specificities larger than 85%, with only one measurement. Also notable is that these accuracies can be taken beyond 93% (see the GB row for three measurements in Table 5) when two more measurements are added, as long as these additional measurements are carefully chosen.

Table 5 The combinations of input measurements that produce the maximum \(F_1\) scores when providing one to six input measurements and employing the RF and GB methods to detect CAS

An interesting pattern to note is that while the average and minimum \(F_1\) scores achieved by MLP classifiers continuously increase in Fig. 9, the maximum \(F_1\) score decreases beyond three input measurements. The maximum \(F_1\) scores achieved by MLP classifiers, and the corresponding sensitivities and specificities, when using three to six input measurements are shown in Table 6. It shows that the decrease in \(F_1\) scores is accompanied by a decrease in both the sensitivities and specificities, as opposed to a trade-off between them (an increase in sensitivity with a decrease in specificity, or vice versa). This behaviour is unusual, as intuitively more input measurements should generally provide more information. This finding may suggest that MLP classifiers are able to extract maximum information from the haemodynamic profiles using as few as three input measurements, and may be susceptible to overfitting when using more than three measurements, thereby leading to poorer generalisation and consequently decreased accuracies.

Table 6 The combinations of input measurements that produce the maximum \(F_1\) scores when providing three to six input measurements and employing the MLP method to detect CAS

To investigate any overfitting, the log loss costs for the training and test sets during the training process are shown in Fig. 10 for the best measurement combinations identified for the MLP, GB, and RF classifiers (Tables 5 and 6). The RF and GB methods show no signs of overfitting. For the MLP, while the three-measurement case also shows no overfitting, the cases with four, five, and six measurements show an increase in test costs beyond 50–100 training iterations, implying overfitting, the extent of which worsens as the number of measurements increases. Such behaviour for the MLP is also observed for SAS and PAD, and thus for the MLP method an early stopping criterion is adopted (see Sect. 2.7), the results of which are presented in Sect. 3.6.
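Per-iteration cost curves of this kind can be obtained for gradient boosting via scikit-learn's staged prediction interface. The sketch below uses synthetic data in place of the virtual patient databases, which are not reproduced here:

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import log_loss

# Synthetic stand-in data: label depends deterministically on two features
rng = np.random.default_rng(0)
X = rng.normal(size=(400, 5))
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)
X_tr, X_te, y_tr, y_te = X[:300], X[300:], y[:300], y[300:]

gb = GradientBoostingClassifier(n_estimators=200, random_state=0)
gb.fit(X_tr, y_tr)

# Log loss at every boosting iteration, on the training and test sets;
# a rising test curve while the training curve falls indicates overfitting
train_cost = [log_loss(y_tr, p, labels=[0, 1])
              for p in gb.staged_predict_proba(X_tr)]
test_cost = [log_loss(y_te, p, labels=[0, 1])
             for p in gb.staged_predict_proba(X_te)]
```

Plotting `train_cost` and `test_cost` against iteration number reproduces the style of diagnostic shown in Fig. 10.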

Fig. 10
figure 10

The average log loss cost across the training and test sets during the training process when using the combination of three to six input measurements that achieve highest accuracies for RF, GB, and MLP methods (Tables 5 and 6)

3.1.2 SAS classification

The results of the analysis for SAS classification are shown in Fig. 11. As in the case of CAS classification, Fig. 11 shows that NB, LR, and SVM classifiers consistently produce accuracies comparable to naive classification, irrespective of the number of input measurements used. A clear difference between Figs. 9 and 11 is the accuracy achieved by MLP classifiers. Compared to the CAS case, the MLP performance is further degraded for SAS, though it remains marginally better than NB, LR, and SVM. It is important to consider, however, that the MLP classifiers are experiencing overfitting, as highlighted in Sect. 3.1.1. Results with overfitting avoided by adopting an early stopping criterion are presented in Sect. 3.6.

Fig. 11
figure 11

The average, maximum, and minimum \(F_1\) score achieved by all classifiers trained using different numbers of input measurements are shown for SAS classification. The central markers represent the average score achieved, while the error bars indicate the upper and lower limits

A high degree of similarity can be seen between the behaviours of RF and GB classifiers for CAS and SAS. Figure 11 shows that the average and minimum \(F_1\) scores achieved by RF and GB classifiers continuously increase as the number of input measurements increases. The maximum \(F_1\) score is seen to quickly reach an asymptotic limit (at three input measurements for both RF and GB classifiers). A peak \(F_1\) score of approximately 0.85 is achieved by GB, along with sensitivities and specificities higher than 85%.

The combinations of input measurements that produce the highest \(F_1\) scores, and the corresponding accuracies, are shown in Table 7. It shows a higher degree of consistency between the best combinations for the two methods relative to the CAS case, i.e. the best combinations are generally identical (or with minimal differences) between RF and GB. It also shows that \(Q_1\) is particularly informative, appearing in all of the best combinations. Physiologically, this may be due to its proximity to the disease location.

Table 7 The combinations of input measurements that produce the maximum \(F_1\) scores when providing one to six input measurements and employing the RF and GB methods to detect SAS

3.1.3 PAD classification

The results for PAD classification are shown in Fig. 12. Comparing Figs. 11 and 12, a high degree of similarity is seen between the behaviours of SAS and PAD classification. As previously seen for SAS classification, Fig. 12 shows that the NB, LR, and SVM methods all consistently produce accuracies comparable to naive classification. While the MLP method performs slightly better than naive classification, its accuracy still remains relatively low. High accuracy can be seen in Fig. 12 for the two tree-based methods of RF and GB. As previously seen for CAS and SAS, while the average and minimum \(F_1\) scores achieved by the RF and GB methods increase as the number of input measurements increases, the maximum \(F_1\) score quickly reaches an asymptotic limit (at three input measurements for both the RF and GB methods).

The combinations of input measurements that produce the highest \(F_1\) scores for PAD classification when employing the RF and GB methods are shown in Table 8. It shows not only good consistency between the best combinations for the two ML methods, but also good agreement with the combinations presented in Table 7. Very similar combinations of input measurements (with some minor differences) produce the highest \(F_1\) score for all numbers of input measurements. As previously observed in Tables 5 and 7, the input measurement \(Q_1\) appears to be most informative, appearing in all the best-scoring combinations. Since the location of \(Q_1\) is far from the location of the disease, it is not obvious why this measurement is particularly informative of PAD.

Fig. 12
figure 12

The average, maximum, and minimum \(F_1\) score achieved by all classifiers trained using different numbers of input measurements are shown for PAD classification. The central markers represent the average score achieved, while the error bars indicate the upper and lower limits

Table 8 The combinations of input measurements that produce the maximum \(F_1\) scores when providing one to six input measurements and employing the RF and GB methods to detect PAD

3.1.4 AAA classification

The results for AAA classification are shown in Fig. 13. As previously seen for all three other forms of disease, the NB and LR classifiers consistently produce accuracies comparable to naive classification, irrespective of the number of input measurements used. The consistency of this finding (as seen in Figs. 9, 11, and 12), irrespective of the form of disease being classified, highlights both the importance of nonlinear partitions between healthy and unhealthy VPs and the unsuitability of the NB method for distinguishing between haemodynamic profiles.

Fig. 13
figure 13

The average, maximum, and minimum \(F_1\) score achieved by all classifiers trained using different numbers of input measurements are shown for AAA classification. The central markers represent the average score achieved, while the error bars indicate the upper and lower limits

In the case of AAA classification, the SVM, RF, MLP, and GB methods consistently produce good accuracies. Figure 13 shows that these methods produce high accuracies even with a single input measurement. While there is some increase in the average \(F_1\) score as the number of input measurements increases, due to the very high initial average \(F_1\) score achieved (when using a single input measurement) this increase is limited (as the \(F_1\) score cannot exceed 1). Two possible reasons for the higher accuracies in aneurysm classification relative to stenosis classification are:

  • Aneurysms, owing to an increase in area, as opposed to the decrease in area for stenoses, may produce more significant or consistent biomarkers in the pressure and flow-rate profiles. This hypothesis is supported by Low et al. (2012), which found that even low-severity AAAs have a global impact on the pressure and flow-rate profiles.

  • While the severities of aneurysms cannot be directly compared to the severities of stenoses, it may be that the severity of aneurysms in \(\text {VPD}_{\text {AAA}}\) is disproportionately large relative to the severities of stenoses. The significance of any indicative biomarkers introduced into pressure and flow-rate profiles is likely to be proportional to the severity of the change in area. This implies that the increase in vessel area of 712–2,593% in \(\text {VPD}_{\text {AAA}}\) is perhaps on the extreme end of aneurysm severity, thereby making the classifications relatively easier. This is further explored in Sect. 3.4.

The combinations of input measurements that produce the highest \(F_1\) scores when providing one to six input measurements and employing the RF and GB methods are shown for AAA classification in Table 9. It shows that \(F_1\) scores range from 0.97 to 0.997, and sensitivities and specificities range from 96.5% to 99.8%. Due to the high accuracies across all numbers of measurements, the analysis of specific combinations is not very meaningful. However, the measurement \(Q_1\) again appears in all the best combinations. It should also be noted that the high accuracies for AAA classification are consistent with those reported in Chakshu et al. (2020), where deep-learning methods applied to a VPD created by varying seven network parameters yielded classification accuracies of \(\approx 99.9\%\), and in Wang et al. (2021), where machine learning methods applied to a VPD yielded sensitivities and specificities of \(\approx 86\%\).

Overall, the results show that the physiological changes to the waveforms induced by both stenosis and aneurysms (Stergiopulos et al. 1992; Low et al. 2012) are well captured by the data-driven machine learning methods.

Table 9 The combinations of input measurements that produce the maximum \(F_1\) scores when providing one to six input measurements and employing the RF and GB methods to detect AAA

3.2 Importance of carotid artery flow-rate

Appendices A–D, along with the above analysis, show that classifiers trained using flow-rates in the common carotid arteries (\(Q_1\)) consistently produce the highest accuracy. To analyse this further, the \(F_1\) scores of classifiers with combinations that include and exclude \(Q_1\) are separated and compared for CAS, SAS, PAD, and AAA in Figs. 14, 15, 16, and 17, respectively. These figures show histograms of the \(F_1\) scores, i.e. the number of classifiers (measurement combinations) including and excluding \(Q_1\) within each \(F_1\) score bucket. For each disease form, results are only shown for the classification methods that consistently produce good results for that disease form. The figures show a clear positive shift in the histograms when \(Q_1\) is included, pointing to the particularly informative nature of \(Q_1\). Other behaviours observed from these figures are:

  • While there is generally an increase in \(F_1\) score when including \(Q_1\), it is also observed that the maximum accuracies are relatively less sensitive to the inclusion of \(Q_1\).

  • The greatest distinction between \(F_1\) scores when including or excluding \(Q_1\) is seen for CAS classification when using the RF method. There is no overlap between the two RF histograms in Fig. 14.

  • Observing the lower plots in Figs. 15 and 16, a clear subgroup of low-accuracy classifiers can be seen when excluding \(Q_1\) for SAS and PAD, which does not exist when including \(Q_1\).

Fig. 14
figure 14

The histograms of the \(F_1\) scores achieved for CAS classification are shown for all input measurement combinations that include \(Q_1\) in the upper plot and exclude \(Q_1\) in the lower plot

Fig. 15
figure 15

The histograms of the \(F_1\) scores achieved for SAS classification are shown for all input measurement combinations that include \(Q_1\) in the upper plot and exclude \(Q_1\) in the lower plot

Fig. 16
figure 16

The histograms of the \(F_1\) scores achieved for PAD classification are shown for all input measurement combinations that include \(Q_1\) in the upper plot and exclude \(Q_1\) in the lower plot

Fig. 17
figure 17

The histograms of the \(F_1\) scores achieved for AAA classification are shown for all input measurement combinations that include \(Q_1\) in the upper plot and exclude \(Q_1\) in the lower plot

3.3 Feature importance

An important aspect of the GB method is that the measurement importance, which quantifies the influence that individual measurements have on classification, can be computed. The split-improvement feature importance (Zhou and Hooker 2020) of a feature can be thought of as the contribution of that feature to the total information gain achieved in a decision tree, averaged across all the trees in the ensemble. A high feature importance suggests that the given feature contributes heavily to the classification accuracies achieved. As the features provided to the GB classifiers are the FS coefficients describing the haemodynamic profiles, the total importance of each bilateral pressure or flow-rate measurement is found by summing the feature importances of the associated 22 FS coefficients. The total importance of each input measurement for each disease form is shown in Table 10.
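A sketch of this grouping, assuming the feature vector (e.g. a fitted model's `feature_importances_` array) is ordered measurement-by-measurement with 22 FS coefficients per measurement, as described above:

```python
import numpy as np

def measurement_importance(feature_importances, names, n_coeffs=22):
    """Sum the per-feature split-improvement importances over the FS
    coefficients describing each measurement, and normalise so the
    measurement-level importances sum to one."""
    imp = np.asarray(feature_importances, dtype=float)
    imp = imp.reshape(len(names), n_coeffs)  # one row per measurement
    totals = imp.sum(axis=1)
    return dict(zip(names, totals / totals.sum()))
```

In scikit-learn, for instance, the array passed in would be `gb.feature_importances_` from a fitted `GradientBoostingClassifier`; the ordering of `names` must match how the FS coefficients were concatenated when building the feature matrix.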

Table 10 The total importance of each input measurement, based on the GB classifiers provided with all six measurements

Three important observations from this table are:

  • The input measurement \(Q_1\) consistently produces the highest importance for all forms of disease. This finding supports the findings of Sect. 3.2.

  • The importance of each input measurement changes between disease forms based on the spatial proximity to the disease location. Generally, the measurements in close proximity to the disease location have higher importance. For example, the importance of \(Q_3\) (flow-rate in the femoral arteries) is highest for PAD classification (see Fig. 1 for locations of disease and measurements). Similarly, \(P_1\) (pressure in carotid arteries) has highest importance for CAS and SAS.

  • The feature importances, when viewed collectively, also shed some light on why \(Q_1\) is important for SAS and PAD even though the measurement location is far from the disease location. For SAS, the two most informative measurements are \(Q_1\) and \(Q_2\), and for PAD, these are \(Q_1\) and \(Q_3\). From Fig. 1, it is clear that these combinations form pairs of flow-rates before and after/at the disease location. Thus, the measurement locations bound the disease location to provide more information on the presence of disease.

3.4 Lower severity aneurysms

In Sect. 3.1.4, it is found that AAAs can be classified to a very high level of accuracy with only one input measurement. Whether these accuracies are affected when lower severity aneurysms are considered is assessed here. For this assessment, a new lower severity AAA VPD, referred to as \(\text {VPD}_{\text {AAA-L}}\), is created in an identical manner to the other diseased databases (see Sect. 2.2), with the following two differences:

  • The severity of aneurysms introduced into the virtual subjects (see Sect. 2.2.2) is sampled from a uniform distribution bounded as follows: \(3.0 \le {\mathcal {S}}_{\text {aneurysm}} \le 7.0\).

  • To reduce the computational expense associated with the creation of virtual patients, the size of \(\text {VPD}_{\text {AAA-L}}\) is restricted to 5,000 VPs.

A combination search is carried out with only the GB method as it is the best overall method. The \(F_1\) scores, sensitivities, and specificities achieved by all the measurement combinations are presented in Appendix E. For comparison, the GB \(F_1\) scores for all forms of disease (including AAA-L) are shown in Appendix F. The ratios of the GB \(F_1\) scores achieved for AAA-L classification relative to AAA classification are shown in Fig. 18.

The observations from this figure are:

  • The \(F_1\) scores for AAA-L classification are consistently lower (by 1% to 10%) than those for AAA classification. This finding supports the physiological expectation that the significance of biomarkers in the pressure and flow-rate profiles is proportional to the severity.

  • The ratios of \(F_1\) scores are lowest for combinations of inputs that predominantly rely on pressure measurements. This suggests that pressure measurements are, in general, less informative about disease severity, consistent with the generally lower feature importances of pressure measurements in Table 10.

  • The \(F_1\) score ratios are highest for input combinations that include \(Q_1\). This finding further suggests that \(Q_1\) contains consistent biomarkers.

  • The ratios range between 0.9 and 0.99, implying a maximum degradation of only 10% relative to the high-severity classification accuracies. Thus, even for low-severity aneurysms, many measurement combinations achieve \(F_1\) scores higher than 0.95 and corresponding sensitivities and specificities larger than 95%.

Fig. 18
figure 18

The ratios of the \(F_1\) scores for AAA-L classification relative to AAA classification, when providing each combination of input measurements are shown. Measurements included within each combination are highlighted with a black square

3.5 Unilateral aneurysm measurement tests

Hitherto, all ML classifiers used bilateral measurements, i.e. both the right and left instances of each measurement were simultaneously provided. Here, the ability of unilateral measurements, i.e. only the right or left instance of a measurement, to detect AAAs is assessed. This analysis is restricted to the GB method as it consistently outperforms other methods.

GB classifiers are trained and tested to detect AAAs using four different unilateral measurements:

  • Flow-rate in the right carotid artery, shown in Fig. 1 as \(Q_1^{\text {(R)}}\).

  • Flow-rate in the left carotid artery, shown in Fig. 1 as \(Q_1^{\text {(L)}}\).

  • Pressure in the right radial artery, shown in Fig. 1 as \(P_3^{\text {(R)}}\).

  • Pressure in the left radial artery, shown in Fig. 1 as \(P_3^{\text {(L)}}\).

Carotid artery flow-rate is chosen as it has been shown to be the best measurement for disease classification. Radial artery pressure is chosen due to the location of the radial artery at the human wrist. Recent advancements have resulted in wearable devices capable of measuring continuous radial pressure profiles, such as the TLT Sapphire monitor (Tarilian Laser Technologies, Welwyn Garden City, U.K.) (Lobo et al. 2019); thus, if AAAs can be detected with satisfactory accuracy using these measurements, future home monitoring of abdominal aortic health through such wearables may be possible. The sensitivities and specificities achieved by the four unilateral GB classifiers are shown in Table 11. It shows that, relative to the bilateral case, while there is a decrease in the classification accuracies, the magnitude of the decrease is less than 10%. This finding suggests that there may be sufficient biomarkers of AAA presence captured within the intra-measurement details of a single pressure or flow-rate profile. The fact that similar accuracies are achieved with either the right or left instances of any measurement is likely due to physiological symmetry. While there are some minor asymmetries between the right and left upper extremities, due to the topology of the arterial network (as shown in Fig. 1), changes to the cross-sectional area of the abdominal aorta are expected to produce relatively consistent changes in both the right and left sides of the body.

Table 11 The sensitivities and specificities achieved when using the measurements of flow-rate in the right, left, and both CAs and pressure in the right, left, and both radial arteries

3.6 MLP early stopping to avoid overfitting

It is shown in Sect. 3.1.1 that the accuracy of MLP classifiers is hindered by overfitting. Thus, the early stopping criterion outlined in Sect. 2.7 is adopted for the combinations of three to six measurements that hitherto produced the best results without early stopping. Here, the hyper-parameters describing the MLP architecture, i.e. the number of neurons per layer and the number of layers (depth), are also individually re-optimised for each such case on the validation data set with the early stopping criterion enabled. Thus, for each combination in the grid search, the best validation-set \(F_1\) score is computed with early stopping enabled during training, and the architecture producing the maximum \(F_1\) score is selected. Subsequently, for this optimal architecture, the test scores are computed on the test data set. This analysis is performed for CAS and AAA, as the behaviour of SAS and PAD is very similar to that of CAS.
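The re-optimisation loop can be sketched as follows; the grid values and the helper `evaluate_val_f1` (which would train an early-stopped MLP of the given shape and return its validation-set F1 score) are hypothetical stand-ins, not the study's actual grid:

```python
from itertools import product

def optimise_architecture(evaluate_val_f1,
                          depths=(1, 2, 3, 4), widths=(20, 40, 60, 80)):
    """Grid search over (number of layers, neurons per layer), keeping the
    architecture with the highest validation-set F1 score."""
    best, best_f1 = None, float("-inf")
    for depth, width in product(depths, widths):
        score = evaluate_val_f1(depth, width)
        if score > best_f1:
            best, best_f1 = (depth, width), score
    return best, best_f1
```

The selected architecture is then retrained and assessed once on the held-out test set, as described above.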

Fig. 19
figure 19

MLP: the log loss cost profiles across the training and validation sets when using the best performing combination containing three to six input measurements for CAS classification and employing early stopping

3.6.1 CAS: early stopping

The hyper-parameters describing the optimum architectures with the early stopping criterion for the best combinations are shown in Table 12. It shows a remarkable degree of consistency between the optimum hyper-parameters for varying numbers of input measurements: for four measurements and above, the optimal architecture is identical. This finding supports the previous simplification of using a single architecture for all the MLP classifiers. It is interesting to note, however, that there is less consistency with the previous optimum hyper-parameters presented in Table 4, which found that four layers containing 60 neurons produced the highest \(F_1\) score when providing six input measurements.

The cost profiles for the optimal architectures with early stopping are shown in Fig. 19. It shows that the early stopping criterion generally fulfils its purpose of stopping the training process near the minimum validation cost point, thus minimising overfitting. It is observed that for all numbers of input measurements, training is stopped as soon as the minimum of 75 iterations has been completed. While this early stopping criterion greatly reduces overfitting in all the cases, the minimum number of training iterations (75) is too high for the six-measurement case (the validation cost has already started to rise significantly), suggesting that further refinement may reduce the validation and test costs even further.

A comparison between the \(F_1\) scores achieved with and without early stopping is shown in Table 13. While early stopping has reduced the log loss cost across the validation and test sets, this does not necessarily translate to improvements in the \(F_1\) score. The log loss cost will decrease without increasing the \(F_1\) score if easy-to-classify patients are predicted with a higher degree of certainty (for example, a predicted probability of 95% rather than 75%), even if no additional patients are correctly classified. For the six-measurement case, however, some increase in \(F_1\) score is clearly observed as a benefit of early stopping.
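This decoupling of log loss and \(F_1\) score is easy to demonstrate with a toy example: sharpening the predicted probabilities of already correctly classified patients lowers the log loss while leaving the \(F_1\) score unchanged.

```python
import numpy as np

def log_loss(y, p, eps=1e-12):
    # Average binary cross-entropy over the predictions
    p = np.clip(p, eps, 1 - eps)
    return -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))

def f1(y, p, thr=0.5):
    # F1 score of the thresholded predictions
    pred = (p >= thr).astype(int)
    tp = np.sum((pred == 1) & (y == 1))
    fp = np.sum((pred == 1) & (y == 0))
    fn = np.sum((pred == 0) & (y == 1))
    return 2 * tp / (2 * tp + fp + fn)

y = np.array([1, 1, 0, 0])
p_hesitant = np.array([0.75, 0.75, 0.25, 0.25])   # correct but uncertain
p_confident = np.array([0.95, 0.95, 0.05, 0.05])  # same labels, more certain
```

Both probability vectors classify every patient correctly, so the \(F_1\) score is identical, yet the confident predictions have a markedly lower log loss.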

Table 12 The hyper-parameters describing the architecture of the MLP classifiers that produce the highest \(F_1\) scores on the validation set with early stopping criterion for CAS classification, when using the best performing combinations of three to six input measurements
Table 13 MLP: \(F_1\) scores on the test dataset when using the best three to six input measurement combinations found to produce the highest accuracies for CAS with (Sect. 3.6.1) and without early stopping (Sect. 3.1.1)

3.6.2 AAA: early stopping

The hyper-parameters describing the optimum architectures with the early stopping criterion for the best combinations are shown in Table 14. The best architectures for AAA are less consistent across the numbers of input measurements than those for CAS. It is again observed that the new hyper-parameters are inconsistent with the old ones (Table 4). Initially, this finding may seem to undermine early stopping and individual architecture optimisation for varying numbers of input measurements. However, while the optimum hyper-parameters are inconsistent, the \(F_1\) scores achieved are very similar: 0.9785 in Table 4 and 0.9870 in Table 14. This similarity in \(F_1\) scores may suggest an insensitivity to the architecture used, i.e. the \(F_1\) score surface in the two-dimensional grid-search space is relatively flat for this problem. This again supports the earlier simplification of using a single architecture for all the classifiers.

The cost profiles for the optimal architectures with early stopping are shown in Fig. 20. They show no major signs of overfitting when using MLP classifiers to detect AAA. As a result, the employment of an early stopping criterion has little effect on the final log loss cost achieved across the training and validation data sets. Thus, when comparing the test scores with and without early stopping in Table 15, no significant differences in the \(F_1\) scores are observed for AAA classification.

The aforementioned findings with early stopping enabled for both CAS and AAA classification suggest that to substantially improve the accuracy of MLP classifiers, a more extensive hyper-parameter optimisation strategy, which tunes many other hyper-parameters, is required and should be adopted in future studies.

Table 14 The hyper-parameters describing the architecture of the MLP classifiers that produce the highest \(F_1\) scores on the validation set with early stopping criterion for AAA classification, when using the best performing combinations of three to six input measurements
Table 15 MLP: \(F_1\) scores on the test dataset when using the best three to six input measurement combinations found to produce the highest accuracies for AAA with (Sect. 3.6.2) and without early stopping (Sect. 3.1.4)
Fig. 20

MLP: the log loss cost profiles across the training and validation sets when using the best performing combination containing three to six input measurements for AAA classification and employing early stopping

4 Conclusions

The main conclusion of this study is that machine learning methods have the potential to detect arterial disease, both stenoses and aneurysms, from peripheral measurements of pressure and flow-rate across the network. Amongst the ML methods considered, the tree-based methods of Random Forest and Gradient Boosting perform best for this application (within the limitations of the classifier-specific optimisation performed). Across the different forms of disease, the Gradient Boosting method outperforms Random Forest, Support Vector Machine, Naive Bayes, Logistic Regression, and even the deep learning method of the Multi-layer Perceptron in the setting adopted. It should be noted, however, that the Multi-layer Perceptron results could be improved by problem-specific optimisation of the architecture and fine-tuning of further hyper-parameters. This, however, would come at added complexity and computational cost relative to the easier-to-train methods of Random Forest and Gradient Boosting.

It is demonstrated that maximum \(F_1\) scores larger than 0.9 are achievable for CAS and PAD, larger than 0.85 for SAS, and larger than 0.98 for both low- and high-severity AAAs. The corresponding sensitivities and specificities are both larger than 90% for CAS and PAD, larger than 85% for SAS, and larger than 98% for both low- and high-severity AAAs. While these maximum scores are obtained when all six measurements are used, it is also shown that the performance degradation is less than 5% when using only three measurements and less than 10% when using only two, as long as these measurements are carefully chosen in specific combinations. For the case of AAA, it is further demonstrated that when only a single measurement (on either the left or right side) is used, \(F_1\) scores larger than 0.85 and corresponding sensitivities and specificities larger than 85% are achievable. This encourages AAA monitoring and/or screening through a wearable device, such as the TLT Sapphire monitor (Tarilian Laser Technologies, Welwyn Garden City, U.K.) (Lobo et al. 2019). Confidence in this is further strengthened by the similarly high accuracies reported for AAA classification by Chakshu et al. (2020) (\(\approx 99.9\%\)) and Wang et al. (2021) (sensitivities and specificities of \(\approx 86\%\)). However, the accuracies of multi-class classifiers, as opposed to the binary classifiers assessed here, remain unknown and should be considered to fully assess the ability of machine and deep learning methods for arterial disease detection.

Finally, it is shown through the analysis of several classifiers and their feature importances that, among the measurements, the carotid artery flow-rate is particularly informative for detecting the presence of all four forms of disease considered.
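The kind of feature-importance analysis referred to here can be sketched as follows, assuming scikit-learn; the six measurement names and the synthetic data are illustrative stand-ins, not this study's actual inputs or importance values:

```python
# Sketch: ranking input measurements by impurity-based feature importance
# from a fitted Gradient Boosting classifier. Data and names are placeholders.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier

X, y = make_classification(n_samples=400, n_features=6,
                           n_informative=3, random_state=0)
names = ["carotid_flow", "radial_pressure", "femoral_pressure",
         "brachial_pressure", "femoral_flow", "radial_flow"]

clf = GradientBoostingClassifier(random_state=0).fit(X, y)
# Importances are normalised to sum to one; sort most informative first.
for name, imp in sorted(zip(names, clf.feature_importances_),
                        key=lambda t: -t[1]):
    print(f"{name}: {imp:.3f}")
```

Permutation importance is an alternative that is less biased towards high-cardinality features, and could corroborate such rankings in future work.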

5 Limitations & future work

While high-accuracy classification has been achieved, all classifiers are binary (i.e. the diseases are treated as mutually exclusive). A logical next step, to build on the results presented here, is to relax the assumption of mutually exclusive disease. Thus, classifiers should be built to detect not only the presence of disease, but also to identify its type (potentially concomitant disease in multiple locations), its location, and its severity. This further analysis can be completed in two stages:

  1. The previously created unhealthy VPDs (each containing only one form of disease) can be used to create mixed-disease data sets, i.e. each VP has only one form of disease, but the data sets contain multiple forms of disease. Binary ML classifiers can then be created to predict whether a VP is subject to a particular form of disease, and multiclass classifiers to determine which form of disease a VP has.

  2. New VPDs can be created, in which each VP may contain more than one form of disease. In this case, binary classifiers can be created to predict the presence of each individual form of disease within a VP, and multiclass classifiers to predict the combination of disease forms present.
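The second stage amounts to a multi-label problem, which can be assembled from per-disease binary classifiers. A minimal sketch, assuming scikit-learn and entirely synthetic placeholder data (random measurements and labels, so no meaningful accuracy is implied):

```python
# Sketch: multi-label classification for VPs that may carry several disease
# forms at once, built from one binary classifier per disease form.
# Data and labels are random placeholders for illustration only.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.multioutput import MultiOutputClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 6))            # six measurements per VP
Y = rng.integers(0, 2, size=(300, 4))    # presence of e.g. CAS, SAS, PAD, AAA

clf = MultiOutputClassifier(RandomForestClassifier(random_state=0)).fit(X, Y)
pred = clf.predict(X[:5])
print(pred.shape)   # one binary prediction per VP per disease form
```

A multiclass formulation over disease combinations would instead encode each combination as a single class, at the cost of exponentially many classes.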

While the results are encouraging, they are produced on a virtual cohort of subjects. Even though the database is physiologically realistic and carefully constructed, real patient behaviour may differ from that in the VPD. Therefore, a future step is to apply the classifiers trained here directly to a small cohort of real-patient measurements. The effect of measurement errors and biases is also ignored in this study and should be considered in future work. Further improvements can also be made, aiming for higher accuracies with fewer, potentially noise- and bias-corrupted, measurements, by:

  • Further optimising the architectures of the machine and deep learning methods (particularly MLP classifiers).

  • Further monitoring individual classifiers for signs of overfitting, and minimising this when needed.