Machine learning approach using 18F-FDG-PET-radiomic features and the visibility of right ventricle 18F-FDG uptake for predicting clinical events in patients with cardiac sarcoidosis

Nakajo, Masatoyo; Hirahara, Daisuke; Jinguji, Megumi; Ojima, Satoko; Hirahara, Mitsuho; Tani, Atsushi; Takumi, Koji; Kamimura, Kiyohisa; Ohishi, Mitsuru; Yoshiura, Takashi

doi:10.1007/s11604-024-01546-y

Machine learning approach using ¹⁸F-FDG-PET-radiomic features and the visibility of right ventricle ¹⁸F-FDG uptake for predicting clinical events in patients with cardiac sarcoidosis

Original Article
Open access
Published: 16 March 2024

Volume 42, pages 744–752, (2024)
Cite this article

Download PDF

You have full access to this open access article

Japanese Journal of Radiology Aims and scope Submit manuscript

Machine learning approach using ¹⁸F-FDG-PET-radiomic features and the visibility of right ventricle ¹⁸F-FDG uptake for predicting clinical events in patients with cardiac sarcoidosis

Download PDF

Masatoyo Nakajo¹,
Daisuke Hirahara²,
Megumi Jinguji¹,
Satoko Ojima³,
Mitsuho Hirahara¹,
Atsushi Tani¹,
Koji Takumi¹,
Kiyohisa Kamimura¹,
Mitsuru Ohishi³ &
…
Takashi Yoshiura¹

767 Accesses
2 Citations
Explore all metrics

Abstract

Objectives

To investigate the usefulness of machine learning (ML) models using pretreatment ¹⁸F-FDG-PET-based radiomic features for predicting adverse clinical events (ACEs) in patients with cardiac sarcoidosis (CS).

Materials and methods

This retrospective study included 47 patients with CS who underwent ¹⁸F-FDG-PET/CT scan before treatment. The lesions were assigned to the training (n = 38) and testing (n = 9) cohorts. In total, 49 ¹⁸F-FDG-PET-based radiomic features and the visibility of right ventricle ¹⁸F-FDG uptake were used to predict ACEs using seven different ML algorithms (namely, decision tree, random forest [RF], neural network, k-nearest neighbors, Naïve Bayes, logistic regression, and support vector machine [SVM]) with tenfold cross-validation and the synthetic minority over-sampling technique. The ML models were constructed using the top four features ranked by the decrease in Gini impurity. The AUCs and accuracies were used to compare predictive performances.

Results

Patients who developed ACEs presented with a significantly higher surface area and gray level run length matrix run length non-uniformity (GLRLM_RLNU), and lower neighborhood gray-tone difference matrix_coarseness and sphericity than those without ACEs (each, p < 0.05). In the training cohort, all seven ML algorithms had a good classification performance with AUC values of > 0.80 (range: 0.841–0.944). In the testing cohort, the RF algorithm had the highest AUC and accuracy (88.9% [8/9]) with a similar classification performance between training and testing cohorts (AUC: 0.945 vs 0.889). GLRLM_RLNU was the most important feature of the modeling process of this RF algorithm.

Conclusion

ML analyses using ¹⁸F-FDG-PET-based radiomic features may be useful for predicting ACEs in patients with CS.

Predicting T-Cell Lymphoma in Children From 18F-FDG PET-CT Imaging With Multiple Machine Learning Models

Article Open access 06 February 2024

Use of radiomics based on 18F-FDG PET/CT and machine learning methods to aid clinical decision-making in the classification of solitary pulmonary lesions: an innovative approach

Article 05 February 2021

Application of Machine Learning Analyses Using Clinical and [18F]-FDG-PET/CT Radiomic Characteristics to Predict Recurrence in Patients with Breast Cancer

Article 16 May 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Sarcoidosis is a systemic granulomatous inflammatory disease of unknown etiology. Cardiac involvement is clinically rare, and it only occurs in 5% of patients with sarcoidosis [1, 2]. However, cardiac sarcoidosis (CS) is an important predictor of poor prognosis in patients with sarcoidosis due to complications such as atrioventricular block (AVB), ventricular tachycardia (VT), and congestive heart failure [3,4,5]. Thus, it is extremely important to make early diagnosis and evaluate disease activity for managing patients with CS [6, 7].

Glucose metabolic activity can be evaluated by measuring ¹⁸F-FDG uptake during PET/computed tomography (CT) scan for not only oncological but also inflammatory disorders [8, 9]. However, only a few studies have used ¹⁸F-FDG-PET-based radiomic features for diagnosing or predicting treatment response in CS [10, 11]. Recently, the potential applications of machine learning (ML) analysis have been reported in the field of nuclear cardiology [12,13,14]. However, to the best of our knowledge, no study has examined the efficacy of the ML approach using ¹⁸F-FDG-PET-/CT-based radiomics on predicting adverse clinical events (ACEs) in patients with CS.

The current study aimed to investigate the usefulness of ML models using pretreatment ¹⁸F-FDG-PET-based radiomic features for predicting the risk of ACEs in patients with CS.

Materials and methods

Patients

This retrospective study was approved by the institutional review board, and the need for informed consent was waived. In total, 70 consecutive patients with known or suspected CS underwent pretreatment ¹⁸F-FDG-PET/CT scan from April 2012 to December 2022. Their clinical records were reviewed to identify patients who should be evaluated.

In a previous study [15], the usefulness of Patlak Ki images extracted from dynamic ¹⁸F-FDG-PET/CT scan for evaluating the risk of clinical events in CS was examined. The previous study enrolled 21 patients with CS who underwent 30 ¹⁸F-FDG-PET/CT scan, which included pretreatment, undertreatment, and follow-up scans, between April 2019 and January 2020. However, analyses using ML approaches for predicting the risk of ACEs in patients with CS using pretreatment ¹⁸F-FDG-PET-based radiomic features were not performed. Thus, among 21 patients, 8 with pretreatment ¹⁸F-FDG-PET/CT scans were included in the current study. The inclusion criteria were as follows: (1) patients diagnosed with CS according to the Japanese Society of Sarcoidosis and Other Granulomatous Disorders guidelines [16], (2) those without a history of steroid treatment, and (3) those with visible cardiac ¹⁸F-FDG uptake on PET/CT scan. The exclusion criteria were patients with a history or coexistence of other cardiac disorders.

Of 70 patients, 12 without cardiac ¹⁸F-FDG uptake were excluded. Among the remaining 58 patients, 11 were further excluded because of hypertrophic cardiomyopathy (n = 2), dilated cardiomyopathy (n = 2), ventricular aneurysm (n = 1), and lack of CS evidence (n = 6).

Finally, 47 patients (38 women and 9 men; mean age: 61 ± 10 [age: 39–81] years) were eligible for the analyses. Immunosuppressive treatment was adopted for these patients after the pretreatment ¹⁸F-FDG-PET/CT scan according to the recommendations of the Japanese Society of Sarcoidosis and Other Granulomatous Disorders guidelines [16]. The loading dose of prednisolone was 30 mg/day, which was tapered to a maintenance dose and administrated to all patients during the follow-up period.

Imaging protocols

All patients were instructed to follow a high-fat and low-carbohydrate diet for 1 day, and followed by a fast of at least 18 h before ¹⁸F-FDG-PET/CT scan, which resulted in a mean plasma glucose level of 102 (range: 71–154) mg/dL immediately before intravenous ¹⁸F-FDG administration.

All ¹⁸F-FDG-PET/CT scan procedures were performed using two whole-body PET/CT scanners. The Discovery 600M PET/CT scanner (GE Healthcare, Milwaukee, WI, the USA) was used from April 2012 to January 2018 and the Discovery MI scanner (GE Healthcare) from February 2018 to December 2022. The emission scan was performed 1 h after the administration of ¹⁸F-FDG (mean: 223 ± 30 [155–277] MBq) after CT data acquisition (slice thickness: 3.75 mm, pitch: 1.375 mm, 120 keV, auto mA: 40–100 mA, based on body mass, and reconstructed matrix size: 512 × 512). The acquisition time was 2.5 min per bed position (total: 7–11). Attenuation-corrected data were acquired. Using the Discovery 600M scanner, images were reconstructed with a three-dimensional ordered subset expectation–maximization algorithm (image matrix size: 192 × 192, 16 subsets, two iterations, voxel size: 3.125 × 3.125 × 3.27 mm³, and VUE Point Plus). Using the Discovery MI scanner, a Bayesian penalized likelihood reconstruction algorithm was used (image matrix size: 192 × 192, voxel size: 2.60 × 2.60 × 2.78 mm³, penalization factor: 700, and Q. Clear) with the point spread function. Each scanner used a consistent reconstruction setting and matrix.

Image and radiomic feature analyses

Two radiologists (with 12 and 20 years of ¹⁸F-FDG-PET/CT scan experience) who were knowledgeable about the study purpose but were blinded to the clinical information read the ¹⁸F-FDG PET/CT scan images. The radiologists visually assessed each ¹⁸F-FDG-PET/CT scan image as negative (myocardial visibility lower than or similar to that of the liver) or positive (myocardial visibility higher than that of the liver) ¹⁸F-FDG uptake [17] in the left ventricle (LV) and right ventricle (RV) myocardium. In case of a disagreement, they reached a consensus.

A third radiologist (18 years of ¹⁸F-FDG-PET/CT experience) performed quantitative analyses of the visible myocardial lesions. The third radiologist generated the volume of interest (VOI) by manually placing a region of interest on a suitable reference-fused axial image, and defined the craniocaudal and mediolateral extents encompassing the whole positive myocardial lesion, excluding any avid extracardiac structures. Next, the maximum standardized uptake value (SUV_max) threshold was set at 40%, which was commonly used in previous studies [18], to automatically delineate a VOI equal to or greater than the 40% threshold of SUVmax. The LIFEx package (version 6.00) [19] was used to extract 49 radiomic features from PET images (Supplemental Table 1). The LIFEx package is used to calculate textural features only for VOIs of at least 64 voxels. These 49 radiomic features were included in five categories (shape and first-order characteristics, gray level co-occurrence matrix, neighborhood gray-tone difference matrix [NGTDM], gray level run length matrix [GLRLM], and gray level zone length matrix). The VOI and SUV were resampled into discrete bins using absolute resampling to minimize the correlation between textural features and reduce the impact of noise and matrix size [20]. Sixty-four bins were used for the PET component with the minimum and maximum bounds of the resampling interval set to SUVs of 0 and 20, respectively. Moreover, the voxel size was resampled to 3.0 × 3.0 × 3.0 mm³. Therefore, a bin size with an SUV of 0.3 was used to analyze the PET component. Voxels with an SUV of > 20 were grouped in the highest bin [20].

As we used two different PET scanners, post-reconstruction harmonization was performed for all PET parameters using the ComBat harmonization method for R software (https://github.com/Jfortin1/ComBatHarmonization) [21], which is effective in PET scans [22].

Confirmation of ACEs

Echocardiography was performed within 2 months of ¹⁸F-FDG-PET/CT scan (mean ± standard deviation: 13 days ± 14 [range: − 50 to + 58 days]). The echocardiography report was used as the reference standard for cardiac function. Cardiac dysfunction was defined as a LV ejection fraction (LVEF) of < 50% [23]. Further, twelve-lead or Holter echocardiography was performed within 2 months of ¹⁸F-FDG-PET/CT scan (mean ± standard deviation: 17 days ± 15 [range: − 50 to + 58 days]). Moreover, patients were assessed to determine the presence of arrhythmic events, including sustained VT and AVB. AVB was characterized as either second- or third-degree AVB or trifascicular block [23, 24].

Medical records were used to obtain information on patient prognosis. The last follow-up was conducted in December 2023. ACE was defined as the reduction in LVEF with cardiac dysfunction (LVEF of < 50%), hospitalization due to cardiac arrhythmia such as recurrence or onset of sustained VT and AVB or heart failure, and death [25, 26]. Change in LVEF was determined by comparing the findings between echocardiography studies performed nearest to the pretreatment PET study and the last echocardiography studies of the follow-up period. Decrease in LVEF was defined as a negative change in LVEF.

ML approach

We adopted 49 radiomic features and the visibility of RV ¹⁸F-FDG uptake to predict ACEs using the ML approaches. Data were stratified according to event and were randomly assigned into the training (80%) and testing (20%) cohorts. Based on the ML analysis for predicting ACEs, decision tree, random forest (RF), neural network, k-nearest neighbors (kNN), Naïve Bayes, logistic regression (LR), and support vector machine (SVM), which are popular ML algorithms, were used for binary classification [27, 28].

The parameter selection for each ML method in this study was carefully made based on the specific clinical challenges and the characteristics of our dataset. For the decision tree, we limited node levels and split thresholds to prevent overfitting, and consequently we selected an induce binary tree with two minimum number of instances in leaves, a split greater than 5, with maximum 100 node levels for depth of classification tree and stop splitting the nodes after majority reach 95%. In the RF, a moderate number of trees were chosen to balance the model’s generalizability and computational efficiency, and consequently we selected 10 trees and did not split subsets smaller than 5. The neural network settings were optimized with rectified linear unit (ReLU) activation function and Adam optimization for efficient learning and good convergence, and consequently we selected 1000 neurons, alpha = 0.00001 and maximum iterations 1000. For kNN, setting the number of neighbors to 5 with metric Euclidean and weight uniform ensured suitable accuracy for our dataset size. The parameters for LR and SVM were chosen to optimize the tradeoff between model complexity and the risk of overfitting. Consequently, we selected a ridge with a coefficient score of 1 for LR. For SVM, we selected the Kernel radial basis function with cost 1 and regression loss epsilon 0.10, and the two optimization parameters, tolerance and iteration limit were set to 0.0010 and 500, respectively. In the case of Naïve Bayes, its simplicity and effective learning ability based on the distribution of data were valued. These parameter choices enabled us to construct robust and reliable predictive models aligned with the objectives of our study.

To overcome imbalanced data, the synthetic minority over-sampling technique was used in the training cohorts [29]. In this study, the sample size was small, and the set of features was reduced to prevent the influence of overfitting. The ranking-based method was only applied on the training cohort to reduce set features based on the decrease in Gini impurity. As a rule of thumb, it is necessary to use < 10% of the sample size as the number of features for classification problem [30]. The final sample size of this study was n = 47; thus, we selected the 4 top ranking features for constructing each ML model. Moreover, the use of a resampling technique referred to as k-fold cross-validation is one of the solutions of overfitting [31, 32]. Tenfolds are a common choice for k-fold cross-validation, particularly if the dataset is not extremely large or small [32]. In this study, a tenfold cross-validation was used to minimize the negative influence of overfitting on the training cohort.

Receiver operating characteristic curve (ROC) analysis was performed to compare the predictive performances of the models, and the area under the ROC curve (AUC) was calculated. The computed performance measures were AUC, accuracy, F1 score, precision (positive predictive value), and recall (sensitivity) for average over classes. The F1 score (F score or F measure) is the harmonic average between precision and recall [33]. Each ML algorithm was used to calculate each probability score (range: 0–1) of ACEs. The predictive performance of each machine model was independently estimated in the testing set by quantifying the AUC, accuracy, F1 score, precision, and recall.

The diagnostic indices including sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of the testing cohort were also calculated. The importance of features in the ML modeling process was calculated using the decrease in AUC [34]. A higher decrease in AUC for a feature indicates that such a variable has a higher importance [34].

The ML analysis was performed using Orange version 3.24.1 (Bioinformatics Laboratory, University of Ljubljana, Ljubljana, Slovenia), an open-source data-mining and visualization package [35].

Statistical analysis

The Mann–Whitney U test or the Chi-square test was used to appropriately assess differences between two quantitative variables or compare categorical data. The DeLong method was used to analyze the statistical significance of differences between AUCs [36]. The diagnostic indices including sensitivity, specificity, PPV, NPV, and accuracy were compared using the McNemar’s test or Chi-square test.

Data were presented as medians and interquartile ranges (IQRs). A p value of < 0.05 was considered statistically significant, and all p values were two-tailed. The MedCalc statistical software (MedCalc Software Ltd., Acacialaan 22, 8400 Ostend, Belgium) was used for statistical analyses.

Results

Characteristics of the patients

Of 47 patients, the median LVEF was 50.0% (IQR: 38.3%–63.8% [range: 20.8–81.0%]), and cardiac dysfunction was observed in 22 patients and arrhythmic events in 16 patients before treatment. There were seven patients with positive RV ¹⁸F-FDG uptake on the pretreatment ¹⁸F-FDG-PET/CT scans. The mean follow-up duration was 48.6 (range: 7–139) months. Of 47 patients, 17 presented with ACEs during follow-up: 11 patients were hospitalized because of cardiac arrhythmia (n = 6) and heart failure (n = 5), five patients had worsening of systolic LV function and one patient died. The complication rate of ACEs was significantly higher in patients with positive RV ¹⁸F-FDG uptake than in patients with negative RV ¹⁸F-FDG uptake (85.7% [6/7] vs. 27.5% [11/40], p = 0.006).

Table 1 shows the clinical characteristics of the participants in the training and testing cohorts. Of 38 patients in the training cohort, the median LVEF was 49.0% (IQR: 36.5–63.9% [range: 20.8–81.0%]) and cardiac dysfunction was observed in 19 patients and arrhythmic events in 13 patients before treatment. There were seven patients with positive RV ¹⁸F-FDG uptake on the pretreatment ¹⁸F-FDG-PET/CT scans. Fourteen patients developed ACEs during follow-up: eight patients were hospitalized because of cardiac arrhythmia (n = 4) or heart failure (n = 4), five patients had worsening of systolic LV function, and one patient died.

Table 1 Characteristics of patients with cardiac sarcoidosis (n = 47)

Full size table

Of nine patients in the testing cohort, the median LVEF was 54.4% (IQR: 43.9–64.0% [range: 30.3–74.1%]), and cardiac dysfunction was observed in three patients and arrhythmic events in three patients before treatment. Three patients developed ACEs during follow-up: all three patients were hospitalized because of cardiac arrhythmia (n = 2) or heart failure (n = 1).

No significant differences were observed in terms of LVEF, cardiac dysfunction, arrhythmic events, RV ¹⁸F-FDG uptake, and ACEs between the training and testing cohorts (each, p > 0.05).

ML models for predicting ACEs

Radiomic features were ranked based on the decrease in Gini impurity (Supplemental Table 2). The top four features for predicting ACEs were surface area, GLRLM_RLNU, coarseness from the NGTDM (NGTDM_Coarseness), and sphericity. Patients who experienced ACEs had a significantly higher surface area (p < 0.001), GLRLM_RLNU (p < 0.001) and a lower NGTDM_Coarseness (p = 0.002) and sphericity (p = 0.010) than those without ACEs (Table 2).

Table 2 Comparison of the top four radiomic predictive features between patients with cardiac sarcoidosis who developed adverse clinical events and those who did not

Full size table

The ML model was constructed using these top four features to prevent overfitting. Table 3 presents the diagnostic performance of each ML algorithm in the training and testing cohorts to predict ACEs.

Table 3 Diagnostic performance of each machine learning model using the top four radiomic features for predicting adverse clinical events in the training and testing cohorts

Full size table

In the training cohort, all ML algorithms achieved AUC values of > 0.80 for predicting ACEs (range: 0.841–0.944). Moreover, 5 of 7 ML algorithms (decision tree, RF, neural network, LR, and SVM) achieved F1 scores (range: 0.812–0.875), precision (range: 0.817–0.886), recall (range: 0.813–0.875), and accuracy (range: 0.813–0.875) of > 0.80 for predicting ACEs.

In the testing cohort, RF and neural network algorithms had an AUC of > 0.80 for predicting ACEs. The classification performance of RF (AUC—training cohort: 0.935, testing cohort: 0.889) and neural network (AUC—training cohort: 0.944, testing cohort: 0.889) in the testing cohort was similar to that of the training cohort. Meanwhile, the performance of the remaining five ML algorithms was poorer in the testing cohort (AUCs: 0.667–0.778) than in the training cohort.

The diagnostic indices including sensitivity, specificity, PPV, NPV, accuracy, and AUC did not significantly differ among these seven ML algorithms (each, p > 0.05) (Supplemental Table 3). However, among the seven ML algorithms, RF had the highest diagnostic index (average over classes—AUC: 0.889, F1 score: 0.882, precision: 0.905, recall: 0.899, sensitivity: 66.7% [2/3], specificity: 100% [6/6], PPV: 100% [2/2], NPV: 85.7% [6/7], and accuracy: 88.9% [8/9]). Supplemental Fig. 1 shows the important features of RF calculated using the decrease in AUC. GLRLM_RLNU was the most important feature with the highest mean value (0.150) and had a higher contribution in the modeling process.

Figures 1 and 2 show the representative ¹⁸F-FDG-PET/CT images of patients with and without ACEs, respectively.

Discussion

The current study evaluated the usefulness of the ML approach using pretreatment ¹⁸F-FDG-PET-based radiomic features and the visibility of RV ¹⁸F-FDG uptake for predicting ACEs in patients with CS. RF had the best performance for predicting ACEs, with the highest AUC and accuracy among all ML algorithms. GLRLM_RLNU had the highest contribution in the modeling process of RF. Therefore, ML analyses using ¹⁸F-FDG-PET-based radiomic features may be useful for predicting the risk of ACEs in patients with CS.

Previous studies have examined the characteristics of ¹⁸F-FDG-PET/CT radiomic features in CS. Manabe et al. [10] evaluated the diagnostic value of ¹⁸F-FDG-PET/CT texture analysis in patients with CS. Results showed that GLRLM long-run emphasis and GLRLM short-run low gray level emphasis were significant independent predictors of CS diagnosis. Moreover, their group examined the efficacy of ¹⁸F-FDG-PET/CT texture analysis on providing prognostic information on patients with CS. Moreover, they reported that GLRLM high gray level run emphasis was significantly associated with ACEs [11].

In our study, patients with CS who developed ACEs had a significantly higher surface area, GLRLM_RLNU, and a lower NGTDM_Coarseness and sphericity than those who did not. GLRLM_RLNU is one of the higher order texture features, and it measures differences between the lengths of runs. The high GLRLM_RLNU values are indicative of heterogeneous images [37]. Coarseness, which is one of the NGTDMs, is associated with granularity within an image and is related to the level of special rate of change in intensity. The heterogeneous images had a high rate of change in the gray level within a neighborhood, which results in a low coarseness value [38, 39]. Surface area represents the area of the surface encompassing the VOI and has a direct relationship with spiculatedness [40]. The sphericity represents the degree to which the VOI is similar to a sphere (formula of calculation of sphericity was presented in the Supplemental Material), and sphericity increases as the shape of VOI more closely resembles that of a sphere [41]. Thus, ACEs may occur in patients with CS as evidenced by a more heterogeneous and larger myocardial ¹⁸F-FDG uptake, and higher asphericity.

Recently, the potential applications of ML analysis have been reported in the field of nuclear cardiology [12,13,14]. Hu et al. [12] examined the usefulness of ML models for predicting early coronary revascularization after single-photon emission computed tomography (SPECT) myocardial perfusion imaging (MPI). Results showed that the ML model outperformed the expert interpretation of MPI by nuclear cardiologists for predicting early revascularization performance. Rios et al. [13] showed that the ML models using automatically extracted variables had a better prognostic accuracy for major cardiac ACEs compared with standard interpretation in patients undergoing SPECT MPI. However, to the best of our knowledge, no study has previously investigated the efficacy of ¹⁸F-FDG-PET-based radiomics and the visibility of RV ¹⁸F-FDG uptake using the ML approach for predicting ACEs in patients with CS.

In our study, to prevent the influence of overfitting, the ML models were constructed using the top four features ranked by the decrease in Gini impurity to predict ACEs. In the training cohort, all seven ML algorithms had a good classification performance with AUC values of > 0.80. However, in the testing cohort, only two algorithms with RF and neural network algorithm achieved AUC values of > 0.80. Meanwhile, the performance of the remaining five ML algorithms (decision tree, kNN, Naïve Bayes, LR, and SVM) was poorer in the testing cohort (AUCs of 0.667–0.778) than in the training cohort probably due to overfitting. Although neither the AUC nor accuracy significantly differed among the seven ML algorithms, RF was the best performing classifier as it had the highest diagnostic accuracy (88.9% [8/9]). Moreover, it exhibited a similar classification performance between the training and testing cohorts (AUC: 0.935 vs 0.889). GLRLM_RLNU was the most important feature for the ML modeling process of RF. Hence, the ML model with RF algorithm using ¹⁸F-FDG-PET-based radiomic features and the visibility of RV ¹⁸F-FDG uptake can potentially predict ACEs in patients with CS.

It has been reported that ¹⁸F-FDG accumulation in the RV is associated with the ACEs [25, 42]. In our study, the complication rate of ACEs was significantly higher in patients with positive RV ¹⁸F-FDG uptake than that of patients with negative RV ¹⁸F-FDG uptake (85.7% [6/7] vs. 27.5% [11/40], p = 0.006). Thus, this finding was compatible with the previous reports [25, 42]. However, the visibility of RV ¹⁸F-FDG uptake was not ranked within top four features, and the constructed each ML model was not influenced by the visibility of RV ¹⁸F-FDG uptake.

This study had several limitations. First, it was retrospective in nature, and it had a relatively small study cohort with conducting only in a single institution. Thus, it is necessary to perform a multicenter prospective study with a significantly larger population to validate and confirm our findings. Second, using different PET/CT scanners might have affected the results of ¹⁸F-FDG-PET-based radiomic analyses. However, the post-reconstruction harmonization using ComBat was conducted during analyses to overcome this issue. Third, only 49 radiomic features extracted from the LIFEx software were used in ML analyses. However, the LIFEx software has been widely used for radiomic analyses in the field of PET/CT scan studies [43, 44]. Fourth, only seven ML algorithms (specifically decision tree, RF, neural network, kNN, Naïve Bayes, logistic regression, and SVM) were applied in the ML analyses. Nevertheless, we only used the ML algorithms implemented in the Orange software, which is a popular open-source tool that provides a visual approach to ML for an interactive data analysis, thereby facilitating the easy construction and configuration of workflows for ML studies [35]. Finally, although training and testing validation had a good classification performance, a training–test scheme with a larger population might be preferred for model validation.

In conclusion, ML analyses using ¹⁸F-FDG-PET-based radiomic features can be useful for predicting ACEs in patients with CS.

References

Hulten E, Aslam S, Osborne M, et al. Cardiac sarcoidosis: state of the art review. Cardiovasc Diagn Ther. 2016;6:50–63.
PubMed PubMed Central Google Scholar
Doughan AR, Williams BR. Cardiac sarcoidosis. Heart. 2006;92:282–8.
PubMed PubMed Central Google Scholar
Banba K, Kusano KF, Nakamura K, et al. Relationship between arrhythmogenesis and disease activity in cardiac sarcoidosis. Heart Rhythm. 2007;4:1292–9.
PubMed Google Scholar
Roberts WC, McAllister Jr HA, Ferrans VJ. Sarcoidosis of the heart. A clinicopathologic study of 35 necropsy patients (group 1) and review of 78 previously described necropsy patients (group 11). Am J Med. 1977;63:86–108.
CAS PubMed Google Scholar
Yazaki Y, Isobe M, Hiroe M, et al. Prognostic determinants of long-term survival in Japanese patients with cardiac sarcoidosis treated with prednisone. Am J Cardiol. 2001;88:1006–10.
CAS PubMed Google Scholar
Mehta D, Lubitz SA, Frankel Z, et al. Cardiac involvement in patients with sarcoidosis: diagnostic and prognostic value of outpatient testing. Chest. 2008;133:1426–35.
PubMed Google Scholar
Ishida Y, Yoshinaga K, Miyagawa M, et al. Recommendations for (18)F-fluorodeoxyglucose positron emission tomography imaging for cardiac sarcoidosis: Japanese Society of Nuclear Cardiology recommendations. Ann Nucl Med. 2014;28:393–403.
PubMed Google Scholar
von Schulthess GK, Steinert HC, Hany TF. Integrated PET/CT: current applications and future directions. Radiology. 2006;238:405–22.
Google Scholar
Vaidyanathan S, Patel CN, Scarsbrook AF, et al. FDG PET/CT in infection and inflammation—current and emerging clinical applications. Clin Radiol. 2015;70:787–800.
CAS PubMed Google Scholar
Manabe O, Ohira H, Hirata K, et al. Use of ¹⁸F-FDG PET/CT texture analysis to diagnose cardiac sarcoidosis. Eur J Nucl Med Mol Imaging. 2019;46:1240–7.
CAS PubMed Google Scholar
Manabe O, Koyanagawa K, Hirata K, et al. Prognostic value of ¹⁸F-FDG PET using texture analysis in cardiac sarcoidosis. JACC Cardiovasc Imaging. 2020;13:1096–7.
PubMed Google Scholar
Hu LH, Betancur J, Sharir T, et al. Machine learning predicts per-vessel early coronary revascularization after fast myocardial perfusion SPECT: results from multicentre REFINE SPECT registry. Eur Heart J Cardiovasc Imaging. 2020;21:549–59.
PubMed Google Scholar
Rios R, Miller RJH, Hu LH, et al. Determining a minimum set of variables for machine learning cardiovascular event prediction: results from REFINE SPECT registry. Cardiovasc Res. 2022;118:2152–64.
CAS PubMed Google Scholar
Otaki Y, Miller RJH, Slomka PJ. The application of artificial intelligence in nuclear cardiology. Ann Nucl Med. 2022;36:111–22.
PubMed Google Scholar
Nakajo M, Ojima S, Kawakami H, et al. Value of Patlak Ki images from ¹⁸F-FDG-PET/CT for evaluation of the relationships between disease activity and clinical events in cardiac sarcoidosis. Sci Rep. 2021;11:2729.
CAS PubMed PubMed Central Google Scholar
Terasaki F, Azuma A, Anzai T, et al. JCS 2016 guideline on diagnosis and treatment of cardiac sarcoidosis—digest version. Circ J. 2019;83:2329–88.
PubMed Google Scholar
Morooka M, Moroi M, Uno K, et al. Long fasting is effective in inhibiting physiological myocardial 18F-FDG uptake and for evaluating active lesions of cardiac sarcoidosis. EJNMMI Res. 2014;4:1.
PubMed PubMed Central Google Scholar
Muser D, Santangeli P, Castro SA, et al. Prognostic role of serial quantitative evaluation of ¹⁸F-fluorodeoxyglucose uptake by PET/CT in patients with cardiac sarcoidosis presenting with ventricular tachycardia. Eur J Nucl Med Mol Imaging. 2018;45:1394–404.
CAS PubMed Google Scholar
Nioche C, Orlhac F, Boughdad S, et al. LIFEx: a freeware for radiomic feature calculation in multimodality imaging to accelerate advances in the characterization of tumor heterogeneity. Cancer Res. 2018;78:4786–9.
CAS PubMed Google Scholar
Brown PJ, Zhong J, Frood R, et al. Prediction of outcome in anal squamous cell carcinoma using radiomic feature analysis of pre-treatment FDG PET-CT. Eur J Nucl Med Mol Imaging. 2019;46:2790–9.
CAS PubMed PubMed Central Google Scholar
Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007;8:118–27.
PubMed Google Scholar
Orlhac F, Boughdad S, Philippe C, et al. A postreconstruction harmonization method for multicenter radiomic studies in PET. J Nucl Med. 2018;59:1321–8.
CAS PubMed Google Scholar
Kandolin R, Lehtonen J, Airaksinen J, et al. Cardiac sarcoidosis: epidemiology, characteristics, and outcome over 25 years in a nationwide study. Circulation. 2015;131:624–32.
PubMed Google Scholar
Sinagra G, Anzini M, Pereira NL, et al. Myocarditis in clinical practice. Mayo Clin Proc. 2016;91:1256–66.
PubMed Google Scholar
Tuominen H, Haarala A, Tikkakoski A, Kähönen M, Nikus K, Sipilä K. FDG-PET in possible cardiac sarcoidosis: right ventricular uptake and high total cardiac metabolic activity predict cardiovascular events. J Nucl Cardiol. 2021;28:199–205.
PubMed Google Scholar
Kaneko K, Nagao M, Yamamoto A, Sakai A, Sakai S. FDG uptake patterns in isolated and systemic cardiac sarcoidosis. J Nucl Cardiol. 2023;30:1065–74.
PubMed Google Scholar
Choudhury P, Allen RT, Endres MG. Machine learning for pattern discovery in management research. Strateg Manag J. 2021;42:30–57.
Google Scholar
El-Sappagh S, Saleh H, Sahal R, et al. Alzheimer’s disease progression detection model based on an early fusion of cost-effective multimodal data. Future Gener Comput Syst. 2021;115:680–99.
Google Scholar
Chawla NV, Bowyer KW, Hall LO, et al. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. 2002;16:321–57.
Google Scholar
Chicco D, Shiradkar R. Ten quick tips for computational analysis of medical images. PLOS Comput Biol. 2023;19: e1010778.
CAS PubMed PubMed Central Google Scholar
Cook JA, Ranstam J. Overfitting. Br J Surg. 2016;103:1814.
CAS PubMed Google Scholar
Krizmaric M, Verlic M, Stiglic G, et al. Intelligent analysis in predicting outcome of out-of-hospital cardiac arrest. Comput Methods Programs Biomed. 2009;95:S22-32.
PubMed Google Scholar
Hyun SH, Ahn MS, Koh YW, et al. A machine-learning approach using PET-based radiomics to predict the histological subtypes of lung cancer. Clin Nucl Med. 2019;44:956–60.
PubMed Google Scholar
Mosavi A, Hosseini FS, Choubin B, et al. Susceptibility prediction of groundwater hardness using ensemble machine learning models. Water. 2020;12:2770.
CAS Google Scholar
Demsar J, Curk T, Erjavec A, et al. Orange: data mining toolbox in Python. J Mach Learn Res. 2013;14:2349–53.
Google Scholar
DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44:837–45.
CAS PubMed Google Scholar
Suzuki K, Yisong C. Artificial intelligence in decision support systems for diagnosis in medical imaging. Cham: Springer International Publishing; 2018.
Google Scholar
Xu R, Kido S, Suga K, et al. Texture analysis on ¹⁸F-FDG PET/CT images to differentiate malignant and benign bone and soft-tissue lesions. Ann Nucl Med. 2014;28:926–35.
CAS PubMed Google Scholar
Cheng L, Zhang J, Wang Y, et al. Textural features of ¹⁸F-FDG PET after two cycles of neoadjuvant chemotherapy can predict pCR in patients with locally advanced breast cancer. Ann Nucl Med. 2017;31:544–52.
CAS PubMed Google Scholar
Limkin EJ, Reuze S, Carre A, et al. The complexity of tumor shape, spiculatedness, correlates with tumor radiomic shape features. Sci Rep. 2019;9:4329.
PubMed PubMed Central Google Scholar
Su Y, Choi CE. Effects of particle shape on the cushioning mechanics of rock-filled gabions. Acta Geotech. 2021;16:1043–52.
Google Scholar
Blankstein R, Osborne M, Naya M, et al. Cardiac positron emission tomography enhances prognostic assessments of patients with suspected cardiac sarcoidosis. J Am Cardiol. 2014;63:329–36.
Google Scholar
Nakajo M, Jinguji M, Tani A, et al. Application of a machine learning approach for the analysis of clinical and radiomic features of pretreatment [¹⁸F]-FDG PET/CT to predict prognosis of patients with endometrial cancer. Mol Imaging Biol. 2021;23:756–65.
CAS PubMed Google Scholar
Li Y, Zhang Y, Fang Q, et al. Radiomics analysis of [¹⁸F]FDG PET/CT for microvascular invasion and prognosis prediction in very-early- and early-stage hepatocellular carcinoma. Eur J Nucl Med Mol Imaging. 2021;48:2599–614.
CAS PubMed Google Scholar

Download references

Funding

No funding.

Author information

Authors and Affiliations

Department of Radiology, Graduate School of Medical and Dental Sciences, Kagoshima University, 8-35-1 Sakuragaoka, Kagoshima, 890-8544, Japan
Masatoyo Nakajo, Megumi Jinguji, Mitsuho Hirahara, Atsushi Tani, Koji Takumi, Kiyohisa Kamimura & Takashi Yoshiura
Department of Management Planning Division, Harada Academy, 2-54-4 Higashitaniyama, Kagoshima, 890-0113, Japan
Daisuke Hirahara
Department of Cardiovascular Medicine and Hypertension, Graduate School of Medical and Dental Sciences, Kagoshima University, 8-35-1 Sakuragaoka, Kagoshima, 890-8544, Japan
Satoko Ojima & Mitsuru Ohishi

Authors

Masatoyo Nakajo
View author publications
You can also search for this author in PubMed Google Scholar
Daisuke Hirahara
View author publications
You can also search for this author in PubMed Google Scholar
Megumi Jinguji
View author publications
You can also search for this author in PubMed Google Scholar
Satoko Ojima
View author publications
You can also search for this author in PubMed Google Scholar
Mitsuho Hirahara
View author publications
You can also search for this author in PubMed Google Scholar
Atsushi Tani
View author publications
You can also search for this author in PubMed Google Scholar
Koji Takumi
View author publications
You can also search for this author in PubMed Google Scholar
Kiyohisa Kamimura
View author publications
You can also search for this author in PubMed Google Scholar
Mitsuru Ohishi
View author publications
You can also search for this author in PubMed Google Scholar
Takashi Yoshiura
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Masatoyo Nakajo.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict interest.

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. This article does not contain any studies with animals performed by any of the authors.

Informed consent

Informed consent was waived by the institutional review board for this retrospective study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 116 KB)

Rights and permissions

This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.

About this article

Cite this article

Nakajo, M., Hirahara, D., Jinguji, M. et al. Machine learning approach using ¹⁸F-FDG-PET-radiomic features and the visibility of right ventricle ¹⁸F-FDG uptake for predicting clinical events in patients with cardiac sarcoidosis. Jpn J Radiol 42, 744–752 (2024). https://doi.org/10.1007/s11604-024-01546-y

Download citation

Received: 20 December 2023
Accepted: 05 February 2024
Published: 16 March 2024
Issue Date: July 2024
DOI: https://doi.org/10.1007/s11604-024-01546-y

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Machine learning approach using ¹⁸F-FDG-PET-radiomic features and the visibility of right ventricle ¹⁸F-FDG uptake for predicting clinical events in patients with cardiac sarcoidosis