Coronary X-ray angiography segmentation using Artificial Intelligence: a multicentric validation study of a deep learning model

Nobre Menezes, Miguel; Silva, João Lourenço; Silva, Beatriz; Rodrigues, Tiago; Guerreiro, Cláudio; Guedes, João Pedro; Santos, Manuel Oliveira; Oliveira, Arlindo L.; Pinto, Fausto J.

doi:10.1007/s10554-023-02839-5

Coronary X-ray angiography segmentation using Artificial Intelligence: a multicentric validation study of a deep learning model

Original Paper
Open access
Published: 07 April 2023

Volume 39, pages 1385–1396, (2023)
Cite this article

Download PDF

You have full access to this open access article

The International Journal of Cardiovascular Imaging Aims and scope Submit manuscript

Coronary X-ray angiography segmentation using Artificial Intelligence: a multicentric validation study of a deep learning model

Download PDF

Miguel Nobre Menezes^1,2,
João Lourenço Silva³,
Beatriz Silva^1,2,
Tiago Rodrigues^1,2,
Cláudio Guerreiro⁴,
João Pedro Guedes⁵,
Manuel Oliveira Santos^6,7,
Arlindo L. Oliveira³ &
…
Fausto J. Pinto^1,2

2969 Accesses
8 Citations
15 Altmetric
Explore all metrics

Abstract

Introduction

We previously developed an artificial intelligence (AI) model for automatic coronary angiography (CAG) segmentation, using deep learning. To validate this approach, the model was applied to a new dataset and results are reported.

Methods

Retrospective selection of patients undergoing CAG and percutaneous coronary intervention or invasive physiology assessment over a one month period from four centers. A single frame was selected from images containing a lesion with a 50–99% stenosis (visual estimation). Automatic Quantitative Coronary Analysis (QCA) was performed with a validated software. Images were then segmented by the AI model. Lesion diameters, area overlap [based on true positive (TP) and true negative (TN) pixels] and a global segmentation score (GSS – 0 -100 points) - previously developed and published - were measured.

Results

123 regions of interest from 117 images across 90 patients were included. There were no significant differences between lesion diameter, percentage diameter stenosis and distal border diameter between the original/segmented images. There was a statistically significant albeit minor difference [0,19 mm (0,09–0,28)] regarding proximal border diameter. Overlap accuracy ((TP + TN)/(TP + TN + FP + FN)), sensitivity (TP / (TP + FN)) and Dice Score (2TP / (2TP + FN + FP)) between original/segmented images was 99,9%, 95,1% and 94,8%, respectively. The GSS was 92 (87–96), similar to the previously obtained value in the training dataset.

Conclusion

the AI model was capable of accurate CAG segmentation across multiple performance metrics, when applied to a multicentric validation dataset. This paves the way for future research on its clinical uses.

A comparative analysis of deep learning-based location-adaptive threshold method software against other commercially available software

Article Open access 18 April 2024

Deep learning segmentation of major vessels in X-ray coronary angiography

Article Open access 15 November 2019

Automatic coronary artery segmentation and diagnosis of stenosis by deep learning based on computed tomographic coronary angiography

Article 08 April 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The application of artificial intelligence (AI) to coronary angiography (CAG) has only been ascertained in very few medical/biology publications [1,2,3,4]. While the possibilities of such an approach are vast, the first step is arguably to produce accurate segmentation of CAGs, i.e., clearly identifying the coronary tree while excluding other structures.

We have previously published the first results of deep learning models capable of good quality CAG segmentation [5]. In this paper, we aim to validate the results, by applying the model to a new, previously unseen, dataset of coronary angiographies from multiple centers. A well-known validated software was used as reference for segments with non-occlusive lesions, where detailed measurements were undertaken, while also applying the previously described Global Segmentation Score for broad assessment of segmentation quality [5].

Methods

Participating centers and equipment

Four centers from across Portugal participated in this study. Images were acquired in Siemens Axiom Artis and Philips Azureon equipment.

Inclusion criteria

Retrospective selection of consecutive patients who had undergone CAG and percutaneous coronary intervention (PCI) and/or underwent invasive physiology assessment (Fractional Flow Reserve and/or other indexes), within a 1-month period of 2022, regardless of clinical context (i.e. both acute and chronic coronary syndrome). This ensures the model was tested in a real-world context where revascularization was either being considered or performed, thereby excluding a population with normal or near-normal coronary arteries.

Exclusion criteria

We excluded cases where any of the following applied:

1)
Patients with previous cardiac surgery, cardiac devices or other sources of potential artifact.
2)
Absence of coronary lesions 50–99% stenosis by visual estimation (i.e. single-vessel ST-elevation myocardial infarction – STEMI - or chronic total occlusions – CTO alone).
3)
Poor image quality.
4)
Unclear individualization of lesion outline with no overlapping vessels.
5)
Unsuccessful automatic measurements with validated software (details below).
6)
Unsuccessful software extraction and superimposition of lesion markers on segmented image (details below).

Image selection

For each selected lesion, a single end-diastolic frame with clear outline definition of the vessel and target lesion was selected. More than one segment per patient and/or image could be used. With an original training dataset of 416 images as previously published [5], we aimed to have a validation dataset of at least 100 images.

Brief description of previous work and AI model

In our previous work [5] we trained AI models for CAG segmentation using 416 images from patients undergoing physiology or PCI in a single center. The images were manually annotated by a small group (two Cardiology Fellows and an Interventional Cardiologist, who both annotated and supervised the process) and continuously reviewed and corrected, in order to minimize heterogeneity and errors.

We then performed segmentation using an encoder-decoder fully convolutional neural networks based on the U-Net [6], commonly used in medical image segmentation. These are composed of an encoder for extracting image features and a decoder to process those features and produce segmentation masks. To derive the best approach for this task, we conducted a comparative study of encoder and decoder architectures, which resulted in the proposal of the EfficientUNet++, a computationally efficient and high-performing decoder architecture [7], which obtained the best results when combined with an EfficientNet-B5 encoder [8].

To ensure fair evaluation and minimize any bias induced by the input data, each model was tested on data it had not seen during training. The dataset was thus split at the patient level, into 13 subsets of approximately 32 angiograms each. Each subset’s segmentation was performed using a neural network trained exclusively on the remaining data. This enabled the assessment of the segmentation results for the entire cohort, as the usual splitting into a training and testing dataset would have yielded a much smaller group of images for result assessment. The training hyperparameters, namely the number of training epochs and the learning rate decay schedule, were set on the first train-test split, using 1 of the 12 training data subsets for validation. The selected values were then used on every other train-test split, and to train the model on the whole training set of the first split. We also considered cross-validation, but it would be very compute-heavy.

This process resulted in an early AI model, which was then further improved by a second round of manual annotation, where the annotators corrected the resulting imperfections, thereby producing a final training dataset. An “enhanced” model was then trained once again using the same process with the new improved annotated dataset, yielding superior results to the early model, with a final Generalized Dice Score of 93,48/ +/- 2,84%. While we continue to work on improving our model, because the aim of this study is to validate the aforementioned “enhanced” model as previously published [5], no additional training was performed.

Original images analysis and segmentation

A well-established and validated software (CAAS Workstation 8.5.1) capable of semi-automatic segmentation and Quantitative Coronary Angiography (QCA) was used to generate a reference dataset for comparison. Because it is especially important for a model to correctly segment diseased segments, QCA analysis was performed in selected segments with a stenosis severity of 50–99% by visual estimation. For QCA measurements, calibration was performed either automatically (based on the DICOM information) or by measuring the catheter (5 or 6 Fr), provided it was clearly visible and measurable. The region of interest was then selected and automatic QCA measurements were undertaken.

For each region of interest where successful automatic QCA measurements were undertaken, the lesion diameter, reference diameter, diameter at proximal obstruction border and diameter at distal obstruction border were recorded. The diameter stenosis percentage was calculated as follows: ((reference diameter – lesion diameter) / reference diameter) x 100 [9]. No manual adjustments were accepted, in order to exclude human bias or human-induced imperfection. If the automated outline and measurements were not clearly accurate by visual inspection, the case was excluded (supplementary Fig. 1).

The original images (i.e., without the measurement annotations generated by the CAAS software) were then segmented using our best AI model to date [5], which segments the coronary tree in white and the catheter in red. This process is fully automatic and the only required human input is the image itself. These images were used for testing only, not training.

Performance assessment

Diameters and percentage diameter stenosis

A dedicated python script was written to extract the CAAS markers and superimpose them on the segmentation obtained by the model. The lesion diameter, diameter at proximal obstruction border and diameter at distal obstruction border were then measured using a dedicated python script as well, by verifying the superimposition of the markers with the coronary tree. Because the reference diameter does not exist in the segmented image (which only contains the coronary artery tree and catheter), the CAAS-generated value was used. Percentage stenosis was then calculated using the same equation. Finally, we also compared the measured catheter diameter on the original image versus the segmented image with another adaption of the same script, by measuring the distance between the two parallel lines generated in the original image from the CAAS software. The resulting measurements obtained in the original and the segmented images were then compared.

Overlap between original and segmented images

A dedicated python script was also used for assessing the overlap between the original and the segmented images in the region of interest, using the CAAS output as reference. Pixels were then classified as follows:

True positive (TP): a pixel marked as coronary in both the segmented and original image.
False positive (FP): a pixel marked as coronary only in the segmented image.
True negative (TN): a pixel marked as non-coronary in both the segmented and original image.
False negative (FN): a pixel marked as non-coronary only in the original image.

Using this classification, the following parameters were calculated:

Accuracy: ([TP + TN]/[TP + TN + FP + FN])
Sensitivity: TP / (TP + FN).
Specificity: TN/(TN + FP),
Positive Predictive Value: TP / (TP + FP).
Negative predictive value TN/(TN + FN).
Intersection over Union (IoU): TP / (TP + FN + FP).
Dice Score: 2TP / (2TP + FN + FP).

Global segmentation score

While the above-mentioned criteria offer a detailed account of the model’s accuracy, they do not provide a broad overview of the quality of segmentation as assessed by experts in CAG interpretation (i.e. Cardiologists). As a result, we have previously developed the Global Segmentation Score (GSS), which we have previously applied on the original CAG dataset used to train the AI model (details on its application on supplementary data file) [5]. The GSS was scored by consensus by four Interventional Cardiologists (one from each contributing center).

Figure 1 summarizes the above-mentioned steps for assessing coronary segmentation.

Statistical analysis

Descriptive variables are shown in absolute and relative (percentage) numbers. Quantitative variables are shown in average ± standard deviation (if normally distributed) or median (interquartile range) if non-normally distributed. If distribution was normal, we used the paired samples T-test to assess for differences in related samples quantitative variables. If distribution was not normal, we used the Mann-Whitney test (two independent groups) or the Kruskal Wallis test (multiple independent groups) to assess for differences in quantitative variables. A p-value < 0,05 was used for statistical significance. SPSS 27 was used for analysis.

Ethical issues

This study complies with the Declaration of Helsinki and was approved by the local Ethics’ Institutional Review Board.

Results

Baseline characteristics

We included 123 measurements from 117 images, from a total of 90 patients (flowchart in Fig. 2; clinical data on Table 1). The left anterior descending artery (LAD) was the most common target vessel (three measurements were taken on diagonals, two emerging proximally and one emerging in the middle segment of the LAD; all were taken on the proximal segment of the collateral), with measurements taking place more frequently in the middle and proximal segments. As measured by QCA, most lesions had a 50–69% diameter stenosis, with a minority of ≥ 70% lesions (Table 2).

Table 1 Clinical characteristics of included patients

Full size table

Table 2 Distribution of target vessel and lesion severity. LAD: Left Anterior Descending Artery; RCA: Right Coronary Artery; CX: Left Circunflex Artery

Full size table

Performance

Diameters and percentage diameter stenosis

Detailed metrics of images (Fig. 3) are depicted in Tables 3 and 4. There were no significant differences for all parameters except for diameter at proximal obstruction border, where the median difference between groups was 0,19 mm. All difference parameters (Table 3) had a non-normal distribution, with the interquartile range demonstrating that there is a clear predominant difference towards the lower-end values, as the 25th quartile is either 0 or very close to 0.

Table 3 Detailed measurements between the original and the segmented images. Values shown as mean ± standard deviation. AI – artificial intelligence. *Paired samples T-test;

Full size table

Table 4 Median differences between the original and segmented images. Values shown as median (IQ 25th – 75th)

Full size table

There were no significant differences across stenosis severity (supplementary Table 1) or target vessel (supplementary Table 2). There were also no significant differences considering across centers (supplementary Tables 3 and 4).

With regards to the catheter diameters (Fig. 4), results are shown on supplementary Table 5. A significant number of cases (26/117 − 22%) had to be excluded, either because of collimation (rendering the catheter not visible – 8 cases) or segmentation gaps leading to inaccurate border definition (18 cases). The latter occur because the model focuses especially on segmenting the distal part of the catheter for correctly identifying the transition between catheter and coronary, whereas in the original images calibration occurred predominantly in less distal portions. Because the presence of two groups (5 and 6 Fr) of catheters renders the overall distribution of the sample non-normal, the two groups were analysed separately. There were no significant differences between the original and segmented images. Again, the difference parameter had a non-normal distribution, with the interquartile range demonstrating that there is a clear predominant difference towards the lower-end values.

Overlap between original and segmented images

Results are detailed on Table 5. The model scored ≥ 90% in all metrics (Fig. 5). There were some significant differences between target vessel (supplementary Table 6) and stenosis severity (supplementary Table 7) which, in absolute terms, were between 1 and 3%. There were no differences between centers (supplementary Table 8).

Table 5 Overlap metrics. Values shown as median (IQ 25th – 75th)

Full size table

Global segmentation score

Results are shown on supplementary Table 9. The model scored well above or close to 90% in most criteria. Catheter gaps were common, usually due to contrast backflow impeding proper visualization of such portions. Catheter artifacts were common and mild gaps in distal parts of small collaterals were quite common as well.

N is lower than overall measurements due to assessment of more than one lesion per image and 8 cases of collimation where the catheter could not be scored, thereby excluding those cases from assessment.

Discussion

Main findings

A deep learning AI segmentation model was capable of fully automatic accurate CAG segmentation, as checked by a reference segmentation obtained with validated software and also when assessed by a broad assessment score we previously developed [5].

Diameters at both healthy segments (proximal and distal lesion borders) and diseased segments (diameter at maximum obstruction zone) were similar between the two groups, with statistically significant differences only at the proximal obstruction border. However, in absolute terms, the difference was very small (0,19 mm, a < 10% difference considering the proximal diameter in either group) and we therefore believe it is unlikely to be of clinical significance. The stenosis severity as assessed by percentage stenosis only differed by < 5% in absolute terms, a difference not meaningful either statistically or clinically. The latter is perhaps the single most important finding, as percentage diameter stenosis is the fundamental criteria assessed in clinical practice for proceeding with either revascularization or functional testing, as recommended in current guidelines [10]. Importantly, there were no significant differences in performance regarding target vessel, stenosis severity or centers.

When considering the overlap between the segmented image and the original image, accuracy, specificity and negative predictive value scored close to 100%. This was expected, because most of the image is composed by background rather than artery. As a result, we believe metrics that do not take into account true negatives provide a more faithful indication of the actual model performance. In that regard, sensitivity and positive predictive value still scored quite high, at approximately 95%. The metric that more directly assesses the true overlap between the original and segmented images in the region of interest (correctly identifying all of the vessel while avoiding non-artery pixels) is the intersection over union criteria, which fell just short of 90%. Lastly, the Dice Score puts greater emphasis on the fundamental task of segmentation – correctly identifying the target structure i.e. true positives – in this case, the coronary tree. With an average score of approximately 95%, while also considering all the remaining metrics, we believe our model can be described as accurate. Importantly, the Dice Score in our previous study was 93%, thus very similar to what we now found [5]. There were statistically significant differences in the IoU and Dice Scores between target vessel stenosis severity. Notwithstanding, the absolute differences were very minor (around 1–2%) and therefore of little or no clinical relevance.

With regards to the GSS, our model achieved a high score with a median of 92/100 points, exceedingly similar to what we had previously described in the dataset used to train and develop the model. The model scored very high in almost all tasks, while maintaining minor imperfections with regards to mild gaps in collateral branches, which were very frequent. Catheter segmentation was not as good as coronary segmentation, as usually small catheter artifacts or gaps in the vicinity of the coronary tree origin were common. This was due not only to contrast backflow, but also because of how AI models are trained and function. Indeed, performance is very dependent on class frequency. Because the catheter is a less frequent class (i.e. corresponds to much fewer pixels), the models receive less penalty for errors regarding its segmentation when compared to the coronary tree. This is partly mitigated by the use of an appropriate loss function, but the imbalance nevertheless persists to some extent. Once again, this was very similar to what we saw in the training dataset [5]. With regards to precise catheter measurements, the differences between original and segmented images (for both 5 and 6 Fr catheters) were not statistically significant, suggesting the catheter’s segmentation, from a calliper precision point of view, is accurate. However, due to the above-mentioned limitations and to a small number of images where only a small portion (or none at all) of the catheter was discernible, our sample was somewhat reduced, thereby limiting this assessment.

Other studies in the field

There are very few studies published in medical/biology journals to date where a comparison with our results can be made. With regards to the GSS in particular, no similar application has ever been undertaken, to our knowledge.

The largest published study [3] included a dataset of 1050 images distributed across all incidences and vessels for performance evaluation. An average 98% accuracy was obtained. While specificity and negative predictive values scored very highly, sensitivity and positive predictive value came closer to 80%. The performance was slightly inferior in more distal vessels. Intersection over union or Dice Score were not reported. Importantly, however, that study’s evaluation used the baseline human annotation as reference, rather than an external validated software, thereby not enabling the identification of bias or imperfections which might have become embedded in their AI model. In our previous study, we demonstrated that even with a small group of annotators and continuous review of the annotations, there is always some degree of imperfection in human annotation [5], hence the relevance of comparing against an automated and validated external software. Additionally, the reported accuracy focuses on the overlap across the entire coronary tree rather than the percentage stenosis of diseased segments. This is advantageous in the sense that a globally accurate performance can be tested. Notwithstanding, we believe testing only for diseased segments actually renders the comparison more demanding. This is because the segmentation of stenotic segments is harder from a technical point of view and also due to the fact that the number of true positive pixels is necessarily smaller in such segments – leading to a lower likelihood of true positives. Whichever interpretation is made, it is clear that an exact comparison with Du et al. [3] is not possible. However, broadly speaking, the accuracy of both models seems quite high and our model seems at least as accurate, if not more.

Su Yang et al. [4] also produced AI models for CAG segmentation. Their validation dataset was somewhat larger (181 images), but their performance seems slightly lower, with all overlap metrics generally scoring just short of 90% and a Dice Score of 89%. Importantly, they also only segmented diseased segments, with a minimum lesion of 30% and used the same reference software as we did. Thus, their results are more directly comparable to ours and our model seems to have superior performance. Two other works [1, 2], from the same baseline dataset, also went on to develop AI-based CAG segmentation, this time with a validation dataset of 550 images. While the model performed well, with an accuracy of 98% and a sensitivity of 87%, they also based their validation dataset on human annotation of the coronary tree without using external software. Thus the above-mentioned considerations for Du et al. [3] also apply.

Recently, Gao et al. [11] published the results of a CAG segmentation model trained on only 130 images. Their methodology, however, is somewhat different, since they combined features from deep learning segmentation models’ features and non-AI image filters to perform pixel-wise classification using gradient-boosting decision trees [12] and deep forests [13]. Their results also show good performance, with a Dice Score of 87,4%, sensitivity of 90,2% and specificity of 99,2%. This highlights that merging deep learning with traditional computer vision methods can yield good results, when working with relatively small datasets. However, no external validation software was used and the whole coronary tree was evaluated. As a result, once more, the previous considerations for Du et al. [3] apply.

Other works in the application of AI to coronary segmentation are primarily technical and featured in engineering publications. A detailed review of these falls outside the scope of this paper and can be consulted in our previous technical publication [7]. However, some considerations regarding these provide further contextualization of our findings.

Xian et al. [14] used a very large dataset of 3200 manually annotated images and experimented with the U-Net architecture as well, with a sensitivity of 90,1%, positive predictive value of 89,8% and Dice Score 90%. However, the annotations were undertaken with a specific software for the purpose of coarsely signaling the vessel route, and focused only on the main vessels. Since we achieved higher performance metrics, it seems a smaller but higher quality dataset, with very precise and cumbersome manual annotations, may be a better approach.

Yang et al [15] have obtained a sensitivity, positive predictive value and Dice Score of 91,3%, 92,5% and 91,9%, respectively, by using popular image classification backbones pre-trained on ImageNet instead of the U-Net’s encoder, while also using a modified generalized dice loss function. Their findings were influential in our training method, as we used a combination of their proposed loss function and the focal loss [16]. Other authors have explored the use of dense connections, improving on the performance of the standard U-Net [17]. This approach is also present in the U-Net ++ [18], which we used in our approach.

In all of the above studies, metrics regarding vessel diameters were not performed. Thus, a direct comparison with this study regarding those is not possible. M’hiri et al. addressed the issue of CAG diameter measurements, when dealing with the issue of diameter variation during the cardiac cycle due to vessel distensibility. They focused mainly in measuring specific segments of the coronary tree, as we did. However, they used a graph-based segmentation method, then tracked the changes across the cardiac cycle using a spatio-temporal segmentation method. They obtained a Dice Score of 98%, with a very small diameter mean error (0,18 mm) [19]. However, they did not focus on diseased regions. While this study is not focused on AI methods, it highlights that other methods may be of use for accurate CAG segmentation, potentially in combination with AI tools [11].

In light of all these studies, the performance of our model seems at least as good, if not better, than previously proposed AI models. We believe this is related to its neural network architecture, which was carefully chosen over a series of experiments [7], taking into consideration the invaluable contributions of previously mentioned studies. In addition to that, we also believe that our manual annotations methodology was essential, as it allowed us to obtain a highly reliable training dataset: a small number of annotators (to reduce heterogeneity) well trained in the interpretation of coronary angiograms; very careful review of annotations with recurrent iterations of quality checks and improvements; and further manual improvement of the already accurate segmentation images produced by an earlier AI model, thus combining the best of AI and human annotations into a final training dataset, as mentioned in the methods section and previous publication [5].

Limitations

Our study is not without limitations. Despite the multicentric approach, our dataset is relatively small when compared to previously published studies. We also only tested the model performance against validated software in diseased locations, rather than on the whole coronary tree. Therefore, we cannot affirm that the performance would be identical in the remaining areas. However, as previously explained, segmenting zones with lesions is actually more challenging for the model than segmenting broad, mostly healthy segments. In addition to that, we did not find differences regarding target vessel or lesion severity. Plus, considering the results of the GSS, the overall performance regarding CAG segmentation was quite appropriate. Thus, we believe that it is unlikely that performance would be significantly different had we tested for the whole coronary tree. Importantly, if we had chosen to segment whole vessels, it would be very likely that some manual corrections had to be undertaken, which might induce bias or imperfections in the reference images. Hence, the decision to proceed as described was deliberate. The assessment of catheter segmentation was also more limited than that of the coronary tree, as described above.

The exclusion of potential sources of artifacts from devices or previous cardiac surgery means our model is not yet applicable to such patients. Notwithstanding, we didn’t exclude cases with previous implantation of stents, but we did not perform detailed measurements on such segments.

The total number of patients/images who fully met exclusion criteria was somewhat high, thereby limiting the final amount of available images for analysis, which may raise questions as to whether this sample is representative of everyday CAGs and an therefore constitutes an adequate validation dataset. This was the result of somewhat stringent criteria, which we felt were nonetheless necessary due to basic feasibility (such as excluding single-vessel complete occlusion cases where QCA is not applicable, or excluding imaging artifacts for which the models are not yet trained), reduction of bias (such as not allowing for manual QCA correction), or excluding patients with normal/near-normal arteries (where testing would be much less challenging or useful in future clinical application). Notwithstanding, we included patients consecutively rather than selectively and the clinical characteristics of included patients are in agreement with everyday clinical practice. We therefore believe our sample to be reasonably representative of real-world practice. Furthermore, we exceeded the minimum validation target of 100 images, yielding relative rates of training vs. validation cases in agreement with other AI studies [2,3,4].

The imbalance in sample size limits the comparison between centers.

It has long been established that operators significantly differ in their interpretation of lesion severity and have a tendency to overestimate the importance of a stenosis [20,21,22,23,24], as we also saw in this study. Indeed, while visually all lesions were interpreted as > 50% stenosis, a significant amount of the sample actually had a < 50% stenosis, which further reflects the real-world nature of the dataset.

Lastly, the distance between the 2D centerline and the distance to the closest edge would have also been a good metric for assessing model performance in this setting. We did not perform such testing.

In light of all of the above, concerns may be raised regarding generalization from this dataset. However, we believe that the absence of statistically significant differences across all subgroups at least partially attenuates this concern.

Future directions

We are currently working in automatic anatomical interpretation, lesion severity based on auto-QCA and integration with physiology. We believe without effective segmentation models, none of these will be possible. Much like for human interpretation of CAG, separating the coronary arteries from everything else in the image is an essential first step. Our ultimate goal is to produce an intelligence augmentation tool that helps physicians perform a more objective and streamlined interpretation of CAG, hopefully contributing for better patient outcomes. As we continuously improve its performance, while also adding new capabilities, clinical application will potentially be possible in the near future, opening a new perspective and potentially more accurate method to assess coronary artery disease.

We are also continuously working to expand and improve the model, as segmentation alone is not a final goal in itself, but rather a fundamental step. We hope to release a public version in the near future, which other researchers may use for whichever application they may deem useful. Importantly, comparing or even merging with future models from other groups may also be very relevant. Since it uses an inherently data-hungry deep learning model, our coronary artery segmentation system would surely benefit from training on a larger volume of data. Manual annotation of coronary angiography images, however, is very cumbersome and time-consuming, and therefore it is difficult to obtain much larger labeled datasets. Hence, significant improvements to the model could probably be achieved, for example, by using self-supervised learning on existing very large volumes of unlabeled data. These possibilities are described in detail in our previous technical publication [7].

Data Availability

Detailed full-scale study data cannot currently be made publicly available due to limitations imposed by national data protection regulations, as this is a retrospective study and no informed consent was obtainable regarding this particular analysis. Both our research team and others in the national scientific community are working to develop a framework where such would be possible. However, independent replication of our analysis is possible, given that the detailed description of our experimentations and relevant code is publicly available [7].

Conclusions

Our AI model was capable of accurate CAG segmentation when applied to a multicentric validation dataset, with no differences between target vessels or stenosis severity. This paves the way for future research and implementation for its clinical uses.

Abbreviations

GSS:: Global Segmentation Score
AI:: Artificial Intelligence
CAG:: Coronary Angiography
STEMI:: ST-elevation myocardial infarction
CTO:: Chronic Total Occlusion

References

Wang L, Liang D, Yin X et al (2020) Coronary artery segmentation in angiographic videos utilizing spatial-temporal information. BMC Med Imaging 2020 201 20:1–10. https://doi.org/10.1186/S12880-020-00509-9
Article Google Scholar
Liang D, Qiu J, Wang L et al (2020) Coronary angiography video segmentation method for assisting cardiovascular disease interventional treatment. BMC Med Imaging 2020 201 20:1–8. https://doi.org/10.1186/S12880-020-00460-9
Article Google Scholar
Du T, Xie L, Zhang H et al (2021) Training and validation of a deep learning architecture for the automatic analysis of coronary angiography. EuroIntervention 17:32–40. https://doi.org/10.4244/EIJ-D-20-00570
Article CAS PubMed PubMed Central Google Scholar
Yang S, Kweon J, Roh J-H et al (2019) Deep learning segmentation of major vessels in X-ray coronary angiography. Sci Rep 2019 91 9:1–11. https://doi.org/10.1038/s41598-019-53254-7
Article CAS Google Scholar
Nobre Menezes M, Lourenço-Silva J, Silva B et al (2022) Development of deep learning segmentation models for coronary X-ray angiography: Quality assessment by a new global segmentation score and comparison with human performance. Rev Port Cardiol. https://doi.org/10.1016/J.REPC.2022.04.001
Article PubMed Google Scholar
Ronneberger O, Fischer P, Brox T (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. Lect Notes Comput Sci (including Subser Lect Notes Artif Intell Lect Notes Bioinformatics) 9351:234–241
Google Scholar
Silva JL, Menezes MN, Rodrigues T et al (2021) Encoder-decoder architectures for clinically relevant Coronary artery segmentation. arXiv:2106.11447 [eess.IV]
Tan M, Le QV (2019) EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. 36th Int Conf Mach Learn ICML 2019 2019-June:10691–10700
Suzuki N, Asano T, Nakazawa G et al (2020) Clinical expert consensus document on quantitative coronary angiography from the Japanese Association of Cardiovascular intervention and therapeutics. Cardiovasc Interv Ther 35:105. https://doi.org/10.1007/S12928-020-00653-7
Article CAS PubMed PubMed Central Google Scholar
Neumann FJ, Sousa-Uva M, Ahlsson A et al (2019) 2018 ESC/EACTS guidelines on myocardial revascularization. Eur Heart J 40:87–165. https://doi.org/10.1093/EURHEARTJ/EHY394
Article PubMed Google Scholar
Gao Z, Wang L, Soroushmehr R et al (2022) Vessel segmentation for X-ray coronary angiography using ensemble methods with deep learning and filter-based features. BMC Med Imaging 22:1–17. https://doi.org/10.1186/S12880-022-00734-4/TABLES/5
Article CAS Google Scholar
Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 29:1189–1232. https://doi.org/10.1214/AOS/1013203451
Article Google Scholar
Zhou ZH, Feng J (2017) Deep Forest. Natl Sci Rev 6:74–86. https://doi.org/10.48550/arxiv.1702.08835
Article Google Scholar
Xian Z, Wang X, Yan S et al (2020) Main coronary vessel segmentation using deep learning in Smart Medical. Math Probl Eng. https://doi.org/10.1155/2020/8858344
Article Google Scholar
Yang S, Kweon J, Kim Y-H (2022) Major vessel segmentation on X-ray coronary angiography using deep networks with a Novel Penalty loss function.Proc Mach Learn Res Rev1–5
Lin TY, Goyal P, Girshick R et al (2017) Focal loss for dense object detection. Proc IEEE Int Conf Comput Vis 2017-October 2999–3007. https://doi.org/10.1109/ICCV.2017.324
Jun TJ, Kweon J, Kim YH, Kim D (2020) T-Net: nested encoder–decoder architecture for the main vessel segmentation in coronary angiography. Neural Netw 128:216–233. https://doi.org/10.1016/J.NEUNET.2020.05.002
Article PubMed Google Scholar
Zhou Z, Rahman Siddiquee MM, Tajbakhsh N, Liang J (2018) Unet++: A nested u-net architecture for medical image segmentation. Lect Notes Comput Sci (including Subser Lect Notes Artif Intell Lect Notes Bioinformatics) 11045 LNCS:3–11. doi: https://doi.org/10.1007/978-3-030-00889-5_1/COVER
M’hiri F, Duong L, Desrosiers C et al (2017) Automatic evaluation of vessel diameter variation from 2D X-ray angiography. Int J Comput Assist Radiol Surg 12:1867–1876. https://doi.org/10.1007/S11548-017-1639-9/FIGURES/9
Article PubMed Google Scholar
Fischer JJ, Samady H, McPherson JA et al (2002) Comparison between visual assessment and quantitative angiography versus fractional flow reserve for native coronary narrowings of moderate severity. Am J Cardiol 90:210–215. https://doi.org/10.1016/S0002-9149(02)02456-6
Article PubMed Google Scholar
Adjedj J, Xaplanteris P, Toth G et al (2017) Visual and quantitative Assessment of Coronary Stenoses at Angiography Versus Fractional Flow Reserve: the impact of risk factors. Circ Cardiovasc Imaging. https://doi.org/10.1161/CIRCIMAGING.117.006243
Article PubMed Google Scholar
Nallamothu BK, Spertus JA, Lansky AJ et al (2013) Comparison of clinical interpretation with visual assessment and quantitative coronary angiography in patients undergoing percutaneous coronary intervention in contemporary practice: the assessing angiography (A2) project. Circulation 127:1793–1800. https://doi.org/10.1161/CIRCULATIONAHA.113.001952
Article PubMed PubMed Central Google Scholar
Zhang H, Mu L, Hu S et al (2018) Comparison of Physician Visual Assessment with quantitative coronary angiography in Assessment of Stenosis Severity in China. JAMA Intern Med 178:239–247. https://doi.org/10.1001/JAMAINTERNMED.2017.7821
Article PubMed PubMed Central Google Scholar
Shah R, Yow E, Jones WS et al (2017) Comparison of visual assessment of coronary stenosis with independent quantitative coronary angiography: findings from the PROMISE trial. Am Heart J 184:1. https://doi.org/10.1016/J.AHJ.2016.10.014
Article PubMed Google Scholar

Download references

Funding

Open access funding provided by FCT|FCCN (b-on). Cardiovascular Center of the University of Lisbon, INESC-ID / Instituto Superior Técnico, University of Lisbon.

Author information

Authors and Affiliations

Structural and Coronary Heart Disease Unit, Faculdade de Medicina, Cardiovascular Center of the University of Lisbon, Universidade de Lisboa (CCUL@RISE), Av Prof. Egas Moniz, Lisboa, 1649-028, Portugal
Miguel Nobre Menezes, Beatriz Silva, Tiago Rodrigues & Fausto J. Pinto
Serviço de Cardiologia, Departamento de Coração e Vasos, CHULN Hospital de Santa Maria, Av Prof. Egas Moniz, Lisboa, 1649-028, Portugal
Miguel Nobre Menezes, Beatriz Silva, Tiago Rodrigues & Fausto J. Pinto
INESC-ID / Instituto Superior Técnico, University of Lisbon, Lisbon, Portugal
João Lourenço Silva & Arlindo L. Oliveira
Centro Hospitalar de Vila Nova de Gaia, Porto, Portugal
Cláudio Guerreiro
Unidade de Hemodinâmica e Cardiologia de Intervenção, Serviço de Cardiologia, Centro Hospitalar Universitário do Algarve, Hospital de Faro, Faro, Portugal
João Pedro Guedes
Unidade de Intervenção Cardiovascular, Serviço de Cardiologia do Centro Hospitalar e Universitário de Coimbra, Praceta Professor Mota Pinto, Coimbra, 3004-561, Portugal
Manuel Oliveira Santos
Faculdade de Medicina da Universidade de Coimbra, R. Larga 2, Coimbra, 3000-370, Portugal
Manuel Oliveira Santos

Authors

Miguel Nobre Menezes
View author publications
You can also search for this author in PubMed Google Scholar
João Lourenço Silva
View author publications
You can also search for this author in PubMed Google Scholar
Beatriz Silva
View author publications
You can also search for this author in PubMed Google Scholar
Tiago Rodrigues
View author publications
You can also search for this author in PubMed Google Scholar
Cláudio Guerreiro
View author publications
You can also search for this author in PubMed Google Scholar
João Pedro Guedes
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Oliveira Santos
View author publications
You can also search for this author in PubMed Google Scholar
Arlindo L. Oliveira
View author publications
You can also search for this author in PubMed Google Scholar
Fausto J. Pinto
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

MNM was responsible for conceptualization, data/image gathering, processing and analysis and paper drafting. JLS was responsible for technical and AI tasks, data and image processing, model implementation and training. CR, JPG and MSO were responsible for data gathering and image analysis. TR and BS were responsible for data gathering, processing and analysis in the development of the original dataset. ALO was responsible for supervising the work of JLS, having directly taken part in the same tasks. FJP was responsible for supervising the work of MNM, having directly taken part in the same tasks.

All authors revised the paper critically for important intellectual content, gave final approval for its publication and agree to be accountable for all respects of its accuracy and integrity.

Corresponding author

Correspondence to Miguel Nobre Menezes.

Ethics declarations

Conflict of Interest

Not applicable.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

10554_2023_2839_MOESM3_ESM.png

Supplementary Fig. 1: Two examples of failed auto-QCA analysis. In the right coronary artery, a subocclusive lesion is visible (upper left image). The software fails to track the lesion accurately (upper right image). In the left anterior descending artery, the software tracks a collateral rather than the main vessel on the left border (original image - bottom left, failed tracking - the bottom right).

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Nobre Menezes, M., Silva, J.L., Silva, B. et al. Coronary X-ray angiography segmentation using Artificial Intelligence: a multicentric validation study of a deep learning model. Int J Cardiovasc Imaging 39, 1385–1396 (2023). https://doi.org/10.1007/s10554-023-02839-5

Download citation

Received: 20 November 2022
Accepted: 18 March 2023
Published: 07 April 2023
Issue Date: July 2023
DOI: https://doi.org/10.1007/s10554-023-02839-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Coronary X-ray angiography segmentation using Artificial Intelligence: a multicentric validation study of a deep learning model

Abstract

Introduction

Methods

Results

Conclusion

Similar content being viewed by others

A comparative analysis of deep learning-based location-adaptive threshold method software against other commercially available software

Deep learning segmentation of major vessels in X-ray coronary angiography

Automatic coronary artery segmentation and diagnosis of stenosis by deep learning based on computed tomographic coronary angiography

Introduction

Methods

Participating centers and equipment

Inclusion criteria

Exclusion criteria

Image selection

Brief description of previous work and AI model

Original images analysis and segmentation

Performance assessment

Diameters and percentage diameter stenosis

Overlap between original and segmented images

Global segmentation score

Statistical analysis

Ethical issues

Results

Baseline characteristics

Performance

Diameters and percentage diameter stenosis

Overlap between original and segmented images

Global segmentation score

Discussion

Main findings

Other studies in the field

Limitations

Future directions

Data Availability

Conclusions

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of Interest

Additional information

Publisher’s Note

Electronic supplementary material

Supplementary Material 1

Supplementary Material 2

10554_2023_2839_MOESM3_ESM.png

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation