Radiomics with 3-dimensional magnetic resonance fingerprinting: influence of dictionary design on repeatability and reproducibility of radiomic features

Fujita, Shohei; Hagiwara, Akifumi; Yasaka, Koichiro; Akai, Hiroyuki; Kunimatsu, Akira; Kiryu, Shigeru; Fukunaga, Issei; Kato, Shimpei; Akashi, Toshiaki; Kamagata, Koji; Wada, Akihiko; Abe, Osamu; Aoki, Shigeki

doi:10.1007/s00330-022-08555-3

Radiomics with 3-dimensional magnetic resonance fingerprinting: influence of dictionary design on repeatability and reproducibility of radiomic features

Imaging Informatics and Artificial Intelligence
Open access
Published: 18 March 2022

Volume 32, pages 4791–4800, (2022)
Cite this article

Download PDF

You have full access to this open access article

European Radiology Aims and scope Submit manuscript

Radiomics with 3-dimensional magnetic resonance fingerprinting: influence of dictionary design on repeatability and reproducibility of radiomic features

Download PDF

Shohei Fujita ORCID: orcid.org/0000-0003-4276-6226^1,2,
Akifumi Hagiwara¹,
Koichiro Yasaka³,
Hiroyuki Akai³,
Akira Kunimatsu³,
Shigeru Kiryu⁴,
Issei Fukunaga¹,
Shimpei Kato^1,2,
Toshiaki Akashi¹,
Koji Kamagata¹,
Akihiko Wada¹,
Osamu Abe² &
…
Shigeki Aoki¹

2104 Accesses
5 Citations
1 Altmetric
Explore all metrics

Abstract

Objectives

We aimed to investigate the influence of magnetic resonance fingerprinting (MRF) dictionary design on radiomic features using in vivo human brain scans.

Methods

Scan-rescans of three-dimensional MRF and conventional T1-weighted imaging were performed on 21 healthy volunteers (9 males and 12 females; mean age, 41.3 ± 14.6 years; age range, 22–72 years). Five patients with multiple sclerosis (3 males and 2 females; mean age, 41.2 ± 7.3 years; age range, 32–53 years) were also included. MRF data were reconstructed using various dictionaries with different step sizes. First- and second-order radiomic features were extracted from each dataset. Intra-dictionary repeatability and inter-dictionary reproducibility were evaluated using intraclass correlation coefficients (ICCs). Features with ICCs > 0.90 were considered acceptable. Relative changes were calculated to assess inter-dictionary biases.

Results

The overall scan-rescan ICCs of MRF-based radiomics ranged from 0.86 to 0.95, depending on dictionary step size. No significant differences were observed in the overall scan-rescan repeatability of MRF-based radiomic features and conventional T1-weighted imaging (p = 1.00). Intra-dictionary repeatability was insensitive to dictionary step size differences. MRF-based radiomic features varied among dictionaries (overall ICC for inter-dictionary reproducibility, 0.62–0.99), especially when step sizes were large. First-order and gray level co-occurrence matrix features were the most reproducible feature classes among different step size dictionaries. T1 map-derived radiomic features provided higher repeatability and reproducibility among dictionaries than those obtained with T2 maps.

Conclusion

MRF-based radiomic features are highly repeatable in various dictionary step sizes. Caution is warranted when performing MRF-based radiomics using datasets containing maps generated from different dictionaries.

Key Points

• MRF-based radiomic features are highly repeatable in various dictionary step sizes.

• Use of different MRF dictionaries may result in variable radiomic features, even when the same MRF acquisition data are used.

• Caution is needed when performing radiomic analysis using data reconstructed from different dictionaries.

Accuracy, repeatability, and reproducibility of T1 and T2 relaxation times measurement by 3D magnetic resonance fingerprinting with different dictionary resolutions

Article Open access 24 November 2022

Magnetic Resonance Fingerprinting - a promising new approach to obtain standardized imaging biomarkers from MRI

Article Open access 24 March 2015

Time efficient whole-brain coverage with MR Fingerprinting using slice-interleaved echo-planar-imaging

Article Open access 27 April 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Radiomics involves high-throughput computer extraction of potentially innumerable numbers of quantitative imaging metrics, or “radiomic features,” which are collectively used for prediction of disease diagnosis, treatment response, and prognosis [1,2,3]. In contrast to focal biopsy, which is an invasive procedure that only evaluates a small portion of tissue, radiomics allows the assessment of total pathology, including surrounding tissue and tracking of changes over time via repetitive non-invasive imaging. The implementation of radiomics into clinical practice has been challenging due to its sensitivity to various factors, such as image acquisition, imaging platform vendors, and feature extraction software, which affect the repeatability and reproducibility of radiomic features [4,5,6,7,8]. Integration of MRI-based radiomics into clinical workflow is challenging, particularly because the acquired signal intensity of MRI does not directly reflect the local physical properties and may differ substantially across imaging platforms [7, 9, 10]. Due to the qualitative nature of MRI, MRI-based radiomic features currently used in practice predominantly comprise morphometry (e.g., size, shape, and volume) of the structure, rather than histogram measurements or texture [5].

Magnetic resonance fingerprinting (MRF) is an image generation framework that can be employed to acquire quantitative maps of multiple tissue properties simultaneously [11]. In MRF, repetition times and flip angles are concurrently varied in a pseudorandom fashion to create through-time signals that characterize the various relaxation processes unique to each tissue type. These through-time signals are pattern-matched to separately simulated dictionary entries to restore measurable tissue properties. While the signal intensity of conventional MR images (such as T1- and T2-weighted images) depends on manifold acquisition parameters and MR scanner variations, MRF can generate highly repeatable and reproducible quantitative maps that have absolute scales [12,13,14]. Indeed, MRF is projected to emerge as the key technology for reproducible radiomic analyses and has been adopted for various sites, including the heart [15], breast [16, 17], prostate [18], liver [19], and brain [20,21,22], with promising results. MRF-based radiomics has been reported to improve the differentiation of common adult brain tumors by enabling the characterization of tumor heterogeneity and facilitating the prediction of outcomes in patients with glioblastoma [23].

Pattern-matching and dictionary design are active research areas due to the dependence of reconstructed MRF maps and reconstruction time on these processes [24]. Currently, there is substantial heterogeneity in dictionaries used for MRF, as various dictionaries with different step sizes are employed at different institutions [11, 13, 25,26,27,28]. To fully harness the quantitative maps generated by MRF across scanners and sites that are highly repeatable and reproducible, an analysis of maps reconstructed with different dictionaries is warranted.

Despite the potential of MRF-based radiomics, the influence of dictionary design on MRF-based radiomic features has not been investigated extensively. Herein, we investigated the influence of dictionary step size on the repeatability of MRF radiomic features and evaluated the stability of each feature using dictionaries with different step sizes.

Materials and methods

MRF acquisition and dictionary-matching

This study was conducted in compliance with the Image Biomarker Standardization Initiative guidelines [5, 29,30,31]. The methodology used for radiomics analysis is reported accordingly. The study was approved by the local institutional review board. Written informed consent was obtained from all participants prior to the scan. Twenty-one participants (9 males and 12 females; mean age, 41.3 ± 14.6 years; age range, 22–72 years) with no history of neurological or psychological disorders were enrolled. Five patients with multiple sclerosis (3 males and 2 females; mean age, 41.2 ± 7.3 years; age range, 32–53 years, median Expanded Disability Status Scale, 1 [range, 0–7.5]; mean disease duration, 11.2 ± 4.4 years [range, 4–18 years]) were also included. Only inter-dictionary reproducibility was evaluated in patients because undergoing scan-rescan was not feasible. All subjects underwent non-contrast-enhanced brain scans using a 3-T scanner (Discovery 750 w, GE Healthcare) with a standard 32-channel head coil. No motion correction techniques were applied.

Scan-rescan was performed using an identical protocol consisting of whole-brain 3D MRF and conventional 3D fast spoiled gradient echo (FSPGR) imaging. After the first imaging set (consisting of an MRF scan and FSPGR scan) was acquired, participants exited the room and were repositioned before the rescan. The scanner was calibrated before each set of scans. The 3D MRF sequence was based on steady-state free precession with spiral projection k-space trajectory [14, 32]. The acquisition parameters of MRF were as follows: field of view, 200 × 200 × 200 mm; matrix size, 200 × 200 × 200; spatial resolution, 1.0 × 1.0 × 1.0 mm; and acquisition time, 9 min 51 s. The acquisition parameters of conventional 3D T1-weighted structural images were as follows: acquisition orientation, sagittal acquisition; TR/TE/inversion time, 7.7/3.1/400 ms; field of view, 256 × 256 mm; matrix size, 256 × 256; section thickness, 1.0 mm; spatial resolution, 1.0 × 1.0 × 1.0 mm; flip angle, 11°; receiver bandwidth, 244.1 Hz/pixel; number of excitations, 1; and acquisition time, 5 min 45 s. The spatial resolution of 3D MRF and 3D FSPGR was matched to gapless isotropic 1.0 mm.

Reconstructions and dictionary-matching were performed using an in-house program in MATLAB (R2019a, MathWorks). Dictionaries were generated using the extended phase graph formalism [11]. To generate a set of dictionaries with different step sizes, various step sizes were prepared as shown in Table 1. The dictionary T1 range and T2 range were kept the same across dictionaries, ranging from 10 to 3000 ms and 10 to 1000 ms, respectively. In all dictionaries, we used smaller step sizes for T2 than for T1 because T2 is smaller than T1 in biological tissues. MRF T1 and T2 maps were obtained using a maximum inner product search [11].

Table 1 Step sizes and total number of entries for each dictionary. All dictionaries had the same range of 10 to 3000 ms for T1 and 10 to 1000 ms for T2

Full size table

Data post-processing and radiomic feature extraction

An overview of data post-processing is illustrated in Fig. 1. Skull-stripping was performed using the bet function implemented in FMRIB Software Library (version 6.0.4; FMRIB Analysis Group). Spherical volumes of interest (VOIs) with a diameter of 20 mm were randomly and manually placed inside the skull on the first FSPGR image (Fig. 2) using the “Segment Editor” module in 3D Slicer (version 4.10.2, https://www.slicer.org/) [33]. Additionally, ellipsoid (axes, 20 × 12 mm) and cubic (12 mm each side) VOIs were also prepared to investigate the effect of dictionary design on the reproducibility of radiomic features under various VOI shapes. We did not align the MRF images and rescan FSPGR to initial FSPGR images to avoid intensity interpolation, which may have altered the signal differences originally contained within the data. Instead, VOIs set on initial FSPGR images were rigidly translated to the rescan FSPGR space, first-scan MRF space, and rescan MRF space using the flirt function implemented in FMRIB Software Library. Since all MRF datasets for each subject were inherently aligned (highly dense, dense, moderate, sparse, and highly sparse datasets were reconstructed from the same acquisition data), each VOI was copied and pasted across MRF datasets of the same subject. In total, 12,600 VOIs from healthy volunteers (21 subjects, 300 VOIs per subject, scan-rescan dataset) and 1250 VOIs from patients with multiple sclerosis (5 patients, 250 VOIs per subject, scan dataset) were used in subsequent analyses.

PyRadiomics (version 3.0) [34], an open-source radiomics software package that is compliant with the IBSI benchmarks [31], was used to extract first- and second-order features from the VOIs for each dataset as defined in default by PyRadiomics. Briefly, first-order features describe the distribution of voxel intensities within a VOI, whereas second-order features express combinations of voxel intensities of neighboring pixels distributed within a VOI. Second-order features included symmetrical gray level co-occurrence matrix (GLCM), gray level run length matrix (GLRLM), gray level size zone matrix (GLSZM), and neighboring gray tone difference matrix (NGTDM). These features complied with feature definitions as described by the IBSI, which are available in a separate reference manual by Zwanenburg et al [7]. Bin counts of 64 were used because gray levels of 32 to 64 typically enable radiomic analysis without losing important features in medical imaging [3, 10, 23, 35]. No image resampling, image intensity normalization, or image filtering was performed.

Statistical analysis

To evaluate intra-dictionary repeatability, two-way mixed-effects models of the intraclass correlation coefficients (ICCs, unit: single rater/measurement, type: absolute agreement) and their 95% confidence intervals (CIs) were calculated for each feature value extracted from scans and rescans [36]. To evaluate reproducibility, two-way random-effects models of the ICCs (unit: single rater/measurement, type: agreement, consistency) and their 95% CIs were calculated against the reference value for each feature as a measure of the agreement between the highly dense dictionary and others. This analysis was performed to evaluate the stability of features against differences in dictionary step size. The radiomic features obtained from the densest dictionary were used as reference values, as they were considered to contain the greatest amount of information. Only the first scan was used to calculate reproducibility because only one scan is available (rescan is not performed) in clinical settings. Negative ICC estimates were truncated at zero. To evaluate the effects of different step sizes on radiomic features, percent relative changes were calculated with respect to corresponding references. ICCs exceeding 0.90 were categorized as high performance and acceptable, in accordance with thresholds reported in the literature [37,38,39].

All statistical analyses were performed using R statistical software (version 3.5.1; R Foundation for Statistical Computing) with packages “psych” (version 1.8.10) and “tidyverse” (version 1.2.1). The Wilcoxon rank sum test was used to compare individual radiomic features across dictionaries. Results were considered significant if p values were below the significance threshold (0.05 divided by the total number of combinations of the dictionaries) after applying the Bonferroni method for multiple-comparison correction. The significance threshold for adjusted p values was set at 0.05.

Results

Scan-rescan repeatability of MRF-based radiomic features using dictionaries with different step sizes

Intra-dictionary scan-rescan ICCs of radiomic features derived from MRF using different step sizes and conventional imaging are presented in Fig. 3. Repeatability of individual radiomic features is presented in Fig. 4 (first-order and symmetric GLCM), and see Electronic Supplementary Material Figure 1 (GLRLM, GLSZM, and NGTDM). ICCs were computed based on the entire study population. Intra-dictionary repeatability was generally insensitive to dictionary step sizes. Overall scan-rescan ICCs for conventional 3D T1-weighted imaging and highly dense, dense, moderate, sparse, and highly sparse dictionaries for MRF T1-derived radiomic features were 0.95 ± 0.06, 0.94 ± 0.09, 0.94 ± 0.09, 0.94 ± 0.9, 0.93 ± 0.14, and 0.92 ± 0.16, respectively (mean ± standard deviation). No significant differences were observed among different MRF dictionaries (p = 1.0–1.0). T1 map-derived GLCM features tended to have poorer repeatability in sparser dictionaries. Among dictionaries, the highly dense dictionary provided the highest number of highly repeatable (i.e., ICC > 0.90) radiomic features (n = 68/79, 86%), which was noninferior to conventional imaging (n = 65/79, 83%) (p = 0.82). T2 maps generally exhibited poorer intra-dictionary repeatability compared to T1 maps. Overall scan-rescan ICCs for highly dense, dense, moderate, sparse, and highly sparse dictionaries for T2-derived radiomic features were 0.87 ± 0.11, 0.87 ± 0.11, 0.86 ± 0.12, 0.87 ± 0.15, and 0.86 ± 0.15, respectively (mean ± standard deviation). No significant differences were noted in the repeatability of radiomic features derived from T2 maps among different MRF dictionaries (p = 1.0–1.0). The percentage of highly repeatable T2 map-derived radiomic features provided by the highly dense dictionary was 46% (n = 36/79).

The percentages of high repeatability (ICC > 0.90) of each feature class among T1 and T2 maps among dictionaries were 69% (n = 124/180), 85% (n = 205/240), 63% (n = 100/160), 44% (n = 71/160), and 58% (n = 29/50) for first-order, GLCM, GLRLM, GLSZM, and NGTDM features, respectively. Poor repeatability was observed in the range, minimum, and maximum for both T1 and T2 maps, and kurtosis of T2 maps. Features related to lower gray level values (e.g., run low gray level emphasis) exhibited poorer repeatability with increased dictionary step size using spherical VOIs.

The results using ellipsoid and cubic VOIs were similar to those obtained with spherical VOIs (Electronic Supplementary Material Figure 2), showing high intra-dictionary repeatability and general insensitivity to dictionary step sizes.

Inter-dictionary reproducibility of MRF-based radiomic features

The inter-dictionary reproducibility of radiomic features calculated with MRF using different step size dictionaries is presented in Fig. 5. The percent relative differences and ICCs of first-order and symmetric GLCM features are presented in Electronic Supplementary Material Figure 3 and Figure 6, respectively (see Electronic Supplementary Material Figures 4–6, which illustrate the inter-dictionary reproducibility of GLRLM, GLSZM, and NGTDM features). ICCs were computed based on the entire study population. Across all feature classes, features calculated using the dense dictionary generally exhibited greater agreement with those calculated with the reference (i.e., highly dense dictionary). Radiomic features calculated using the dense dictionary exhibited high reproducibility (ICC > 0.90) with those calculated using the highly dense dictionary (T1, n = 79/79, 100%; T2, n = 73/79, 92%). Reproducibility decreased with an increase in dictionary step size. The reproducibility ICCs (mean ± standard deviation) for dense, moderate, sparse, and highly sparse dictionaries in T1 maps were 0.99 ± 0.01, 0.96 ± 0.06, 0.70 ± 0.31, and 0.62 ± 0.36, respectively. For T2 maps, the reproducibility ICCs (mean ± standard deviation) for dense, moderate, sparse, and highly sparse dictionaries were 0.97 ± 0.06, 0.88 ± 0.16, 0.69 ± 0.33, and 0.66 ± 0.34, respectively. Significant differences were observed in all dictionary combinations for both T1 and T2 maps (p < 0.001), except for highly sparse and sparse dictionaries (p = 0.22 and 1.00 for T1 and T2, respectively).

First-order features were generally more reproducible than second-order features. Reproducibility of GLRLM, GLSZM, and NGTDM features was sensitive to an increase in dictionary step size. Among first-order features, entropy and uniformity were the most sensitive to changes in dictionary step size and exhibited a relative difference of > 25% compared to reference values (Electronic Supplementary Material Figures 3). Results for sparser dictionaries were generally more non-reproducible for second-order features than for first-order features.

Radiomic features calculated using T1 maps exhibited smaller percent relative differences among dictionaries than those calculated using T2 maps (Figs. 5 and 6; see Electronic Supplementary Material Figures 3–6). Overall, T2-derived radiomic features exhibited greater variability compared to T1-derived radiomic features. In contrast, T1-derived radiomic features exhibited less variability but larger bias (i.e., deviation from the highly dense dictionary, indicated by the magnitude of percent relative differences). This tendency was most evident in NGTDM (see Electronic Supplementary Material Figures 4–6).

The inter-dictionary reproducibility using ellipsoid and cubic VOIs was similar to those obtained with spherical VOIs (Electronic Supplementary Material Figure 2). Across all feature classes, features calculated using the dense dictionary generally exhibited greater agreement with those calculated with the reference (i.e., highly dense dictionary), and reproducibility decreased with an increased dictionary step size.

The inter-dictionary reproducibility of radiomic features in patients with multiple sclerosis is summarized in Electronic Supplementary Material Figure 7. The results were similar to those for healthy subjects. Across all feature classes, features calculated using the dense dictionary generally exhibited greater agreement with those calculated with the reference (i.e., highly dense dictionary). The reproducibility ICCs (mean ± standard deviation) for dense, moderate, sparse, and highly sparse dictionaries in T1 maps were 0.99 ± 0.02, 0.96 ± 0.07, 0.71 ± 0.31, and 0.64 ± 0.35, respectively. For T2 maps, the reproducibility ICCs for dense, moderate, sparse, and highly sparse dictionaries were 0.98 ± 0.05, 0.89 ± 0.16, 0.73 ± 0.30, and 0.68 ± 0.32, respectively.

Discussion

Due to its highly repeatable and reproducible quantitative maps, MRF has emerged as a key technology for reliable radiomic analyses. Maps derived using MRF may be used in multi-center radiomic studies by pooling the data without the need for normalization or harmonization [40] that could scale away important information originally contained within the images. Nevertheless, the influence of dictionary design, a crucial component of MRF, on radiomic features has not been comprehensively investigated. This study evaluated the influence of dictionary design on intra-dictionary repeatability and inter-dictionary reproducibility of extracted radiomic features. Our results demonstrated that (i) repeatability of MRF-based radiomic features is unaffected by the dictionary step size, and (ii) the use of different MRF dictionaries may result in variable radiomic features, even when the same MRF acquisition data are used, and (iii) these results were consistent even when using different VOI shapes for volunteers and patients. Therefore, maps obtained with different dictionaries may produce erroneous radiomic analyses when used simultaneously.

The repeatability of MRF-derived radiomic features using the highly dense dictionary was comparable to conventional imaging in our study (89%, 87%, and 82% for MRF T1 maps, MRF T2 maps, and conventional imaging, respectively). The repeatability observed in this study was generally higher than values reported in the literature based on conventional MRI. Baessler et al [9] reported that the percentage of robust scan-rescan features obtained from conventional T1-weighted images was 54% (n = 25/45), whereas MRF using the highly dense dictionary generated repeatability of 89%. Although a direct comparison is challenging since we used a different radiomics platform and Baessler et al used vegetables and fruits rather than the human brain for imaging, our results demonstrate that MRF with a highly dense dictionary provides repeatable radiomic features.

Although our results indicate that merging of MRF-derived radiomic features calculated using different step size dictionaries should be avoided, several features exhibited high reproducibility across different dictionaries. First-order features and GLCM were generally more reproducible compared to other second-order features, in accordance with previous literatures [30, 41]. We identified several features with high reproducibility (ICC > 0.90) across dictionaries. This indicates the possibility of merging MRF-derived radiomic features calculated using different step size dictionaries. For example, Badve et al reported that mean T1 and T2 values and T2 skewness exhibited significant differences between solid tumor regions in glioblastoma and low-grade gliomas [22]. These features were highly reproducible across different dictionaries (ICC > 0.90 for both T1 and T2), highlighting the feasibility of pooling data from different dictionaries for this purpose. MRF-based radiomic analyses of adult brain tumors by Dastmalchian et al. revealed that inverse differences were normalized and homogeneity (equivalent of inverse difference) values of peritumoral white matter provided the best discrimination of low-grade gliomas, glioblastomas, and metastases [23]. Our results suggest that inverse differences have high reproducibility among highly dense, dense, and moderate dictionaries, and normalized inverse differences are highly reproducible (ICC > 0.90 among all dictionaries) and repeatable (ICC > 0.90, except for T2 of the highly sparse dictionary).

Several limitations of our study should be acknowledged. First, although a wide age range of healthy volunteers was included, the sample size of patients was small. Thus, we only evaluated the influence of dictionary design on radiomic features but not on downstream analysis in clinically relevant predictive models. Radiomic features are conveyed to machine-learning models for use in certain tasks, such as predicting diagnosis, treatment response, and prognosis. Performing MRF-based radiomics on real patient data to evaluate the impact of dictionary design on resulting predictive models is warranted in the future. This may enable accurate radiomic analysis of MRF data from different dictionaries used simultaneously. Second, we used a single sequence with fixed acquisition parameters, which may overlook the flexibility and robustness of the MRF framework. It would be interesting to investigate these effects on radiomic features in a future study. Third, we employed a single scanner study design and were thus unable to evaluate the inter-scanner reproducibility of MRF-based radiomic features in this study. Due to the high reproducibility of MRF T1 and T2 maps across scanners, the inter-scanner reproducibility of MRF-based radiomic features may achieve greater reproducibility than conventional qualitative imaging. This would be of substantial interest in clinical settings and warrants further investigation. Nevertheless, given the current paucity of investigations on repeatability and dictionary dependence of MRF-based radiomic features, our findings serve as a baseline and provide fundamental information which will facilitate clinical integration of MRF-based radiomics.

Our findings indicate that MRF-based radiomic features are highly repeatable across various dictionary step sizes. The repeatability of MRF-based radiomics is insensitive to dictionary step size. Based on our results, except for a small subset of radiomic features, we recommend against performing radiomic analysis using data reconstructed from different dictionaries.

Abbreviations

CI:: Confidence interval
FSPGR:: Fast spoiled gradient echo
GLCM:: Gray level co-occurrence matrix
GLRLM:: Gray level run length matrix
GLSZM:: Gray level size zone matrix
ICCs:: Intraclass correlation coefficients
MRF:: Magnetic resonance fingerprinting
NGTDM:: Neighboring gray tone difference matrix
VOI:: Volume of interest

References

Gillies RJ, Kinahan PE, Hricak H (2016) Radiomics: images are more than pictures, they are data. Radiology 278:563–577
Article Google Scholar
Aerts HJ (2016) The potential of radiomic-based phenotyping in precision medicine: a review. JAMA Oncol 2:1636–1642
Article Google Scholar
Kassner A, Thornhill RE (2010) Texture analysis: a review of neurologic MR imaging applications. AJNR Am J Neuroradiol 31:809–816
Hagiwara A, Fujita S, Ohno Y, Aoki S (2020) Variability and standardization of quantitative imaging: monoparametric to multiparametric quantification, radiomics, and artificial intelligence. Invest Radiol 55:601–616
Article Google Scholar
Lambin P, Leijenaar RTH, Deist TM et al (2017) Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol 14:749–762
Article Google Scholar
Berenguer R, Pastor-Juan MDR, Canales-Vazquez J et al (2018) Radiomics of CT features may be nonreproducible and redundant: influence of CT acquisition parameters. Radiology 288:407–415
Article Google Scholar
Zwanenburg A, Vallieres M, Abdalah MA et al (2020) The Image Biomarker Standardization Initiative: standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology 295:328–338
Article Google Scholar
Meyer M, Ronald J, Vernuccio F et al (2019) Reproducibility of CT radiomic features within the same patient: influence of radiation dose and CT reconstruction settings. Radiology 293:583–591
Article Google Scholar
Baessler B, Weiss K, Pinto Dos Santos D (2019) Robustness and reproducibility of radiomics in magnetic resonance imaging: a phantom study. Invest Radiol 54:221–228
Article Google Scholar
Fornacon-Wood I, Mistry H, Ackermann CJ et al (2020) Reliability and prognostic value of radiomic features are highly dependent on choice of feature extraction platform. Eur Radiol 30:6241–6250
Article Google Scholar
Ma D, Gulani V, Seiberlich N et al (2013) Magnetic resonance fingerprinting. Nature 495:187–192
Article CAS Google Scholar
Korzdorfer G, Kirsch R, Liu K et al (2019) Reproducibility and repeatability of MR fingerprinting relaxometry in the human brain. Radiology 292:429–437
Article Google Scholar
Buonincontri G, Biagi L, Retico A et al (2019) Multi-site repeatability and reproducibility of MR fingerprinting of the healthy brain at 1.5 and 3.0T. Neuroimage 195:362–372
Article Google Scholar
Buonincontri G, Kurzawski JW, Kaggie JD et al (2020) Three dimensional MRF obtains highly repeatable and reproducible multi-parametric estimations in the healthy human brain at 1.5T and 3T. Neuroimage 226:117573
Article Google Scholar
Hamilton JI, Jiang Y, Chen Y et al (2017) MR fingerprinting for rapid quantification of myocardial T1, T2 , and proton spin density. Magn Reson Med 77:1446–1458
Article Google Scholar
Panda A, Chen Y, Ropella-Panagis K et al (2019) Repeatability and reproducibility of 3D MR fingerprinting relaxometry measurements in normal breast tissue. J Magn Reson Imaging 50:1133–1143
Article Google Scholar
Chen Y, Panda A, Pahwa S et al (2019) Three-dimensional MR fingerprinting for quantitative breast imaging. Radiology 290:33–40
Article Google Scholar
Panda A, Obmann VC, Lo WC et al (2019) MR fingerprinting and ADC mapping for characterization of lesions in the transition zone of the prostate gland. Radiology 292:685–694
Article Google Scholar
Chen Y, Jiang Y, Pahwa S et al (2016) MR fingerprinting for rapid quantitative abdominal imaging. Radiology 279:278–286
Article Google Scholar
Ma D, Jones SE, Deshmane A et al (2019) Development of high-resolution 3D MR fingerprinting for detection and characterization of epileptic lesions. J Magn Reson Imaging 49:1333–1346
Article Google Scholar
Ma D, Jiang Y, Chen Y et al (2018) Fast 3D magnetic resonance fingerprinting for a whole-brain coverage. Magn Reson Med 79:2190–2197
Article Google Scholar
Badve C, Yu A, Dastmalchian S et al (2017) MR fingerprinting of adult brain tumors: initial experience. AJNR Am J Neuroradiol 38:492–499
Article CAS Google Scholar
Dastmalchian S, Kilinc O, Onyewadume L et al (2020) Radiomic analysis of magnetic resonance fingerprinting in adult brain tumors. Eur J Nucl Med Mol Imaging. https://doi.org/10.1007/s00259-020-05037-w
Bipin Mehta B, Coppo S, Frances McGivney D et al (2019) Magnetic resonance fingerprinting: a technical review. Magn Reson Med 81:25–46
Article Google Scholar
Jiang Y, Ma D, Keenan KE, Stupic KF, Gulani V, Griswold MA (2017) Repeatability of magnetic resonance fingerprinting T1 and T2 estimates assessed using the ISMRM/NIST MRI system phantom. Magn Reson Med 78:1452–1457
Article CAS Google Scholar
Kato Y, Ichikawa K, Okudaira K et al (2019) Comprehensive evaluation of B1(+)-corrected FISP-based magnetic resonance fingerprinting: accuracy, repeatability and reproducibility of T1 and T2 relaxation times for ISMRM/NIST system phantom and volunteers. Magn Reson Med Sci. https://doi.org/10.2463/mrms.mp.2019-0016
Naganawa S, Nakane T, Kawai H et al (2019) Detection of IV-gadolinium leakage from the cortical veins into the CSF using MR fingerprinting. Magn Reson Med Sci. https://doi.org/10.2463/mrms.mp.2019-0048
Fujita S, Buonincontri G, Cencini M et al (2020) Repeatability and reproducibility of human brain morphometry using three-dimensional magnetic resonance fingerprinting. Hum Brain Mapp. https://doi.org/10.1002/hbm.25232
Sanduleanu S, Woodruff HC, de Jong EEC et al (2018) Tracking tumor biology with radiomics: a systematic review utilizing a radiomics quality score. Radiother Oncol 127:349–360
Article Google Scholar
Traverso A, Wee L, Dekker A, Gillies R (2018) Repeatability and reproducibility of radiomic features: a systematic review. Int J Radiat Oncol Biol Phys 102:1143–1158
Article Google Scholar
Hatt M, Vallieres M, Visvikis D, Zwanenburg A (2018) IBSI: an international community radiomics standardization initiative. J Nucl Med 59:287–287
Google Scholar
Cao X, Ye H, Liao C, Li Q, He H, Zhong J (2019) Fast 3D brain MR fingerprinting based on multi-axis spiral projection trajectory. Magn Reson Med 82:289–301
Article CAS Google Scholar
Fedorov A, Beichel R, Kalpathy-Cramer J et al (2012) 3D Slicer as an image computing platform for the Quantitative Imaging Network. Magn Reson Imaging 30:1323–1341
Article Google Scholar
van Griethuysen JJM, Fedorov A, Parmar C et al (2017) Computational radiomics system to decode the radiographic phenotype. Cancer Res 77:e104–e107
Article Google Scholar
Mahmoud-Ghoneim D, Alkaabi MK, de Certaines JD, Goettsche FM (2008) The impact of image dynamic range on texture classification of brain white matter. BMC Med Imaging 8:18
Article Google Scholar
Weir JP (2005) Quantifying test-retest reliability using the intraclass correlation coefficient and the SEM. J Strength Cond Res 19:231–240
PubMed Google Scholar
Coroller TP, Agrawal V, Narayan V et al (2016) Radiomic phenotype features predict pathological response in non-small cell lung cancer. Radiother Oncol 119:480–486
Article Google Scholar
van Velden FH, Kramer GM, Frings V et al (2016) Repeatability of radiomic features in non-small-cell lung cancer [(18)F]FDG-PET/CT studies: impact of reconstruction and delineation. Mol Imaging Biol 18:788–795
Article Google Scholar
Bogowicz M, Riesterer O, Bundschuh RA et al (2016) Stability of radiomic features in CT perfusion maps. Phys Med Biol 61:8736–8749
Article CAS Google Scholar
Scalco E, Belfatto A, Mastropietro A et al (2020) T2w-MRI signal normalization affects radiomics features reproducibility. Med Phys 47:1680–1691
Jang J, Ngo LH, Mancio J et al (2020) Reproducibility of segmentation-based myocardial radiomic features with cardiac MRI. Radiol Cardiothorac Imaging 2:e190216
Article Google Scholar

Download references

Acknowledgements

We acknowledge Masahiro Abe for assistance with data-handling.

Funding

This study has received funding from Japan Agency for Medical Research and Development (AMED) under Grant Number JP19lk1010025h9902; JSPS KAKENHI (grant number 19K17150, 19K17177, 18H02772, and 18K07692); Health, Labor and Welfare Policy Research Grants for Research on Region Medical; a Grant-in-Aid for Special Research in Subsidies for ordinary expenses of private schools from The Promotion and Mutual Aid Corporation for Private Schools of Japan; and Brain/MINDS beyond program from AMED (Grant Number JP19dm0307024 and JP19dm0307101).

Author information

Authors and Affiliations

Department of Radiology, Juntendo University School of Medicine, 1-2-1, Hongo, Bunkyo, Tokyo, 113-8421, Japan
Shohei Fujita, Akifumi Hagiwara, Issei Fukunaga, Shimpei Kato, Toshiaki Akashi, Koji Kamagata, Akihiko Wada & Shigeki Aoki
Department of Radiology, Graduate School of Medicine, The University of Tokyo, 7-3-1, Hongo, Bunkyo, Tokyo, 113-8654, Japan
Shohei Fujita, Shimpei Kato & Osamu Abe
Department of Radiology, The Institute of Medical Science, The University of Tokyo, 4-6-1, Shiroganedai, Minato, Tokyo, 108-8639, Japan
Koichiro Yasaka, Hiroyuki Akai & Akira Kunimatsu
Department of Radiology, International University of Health and Welfare Narita Hospital, 852, Hatakeda, Narita, Chiba, 286-8520, Japan
Shigeru Kiryu

Authors

Shohei Fujita
View author publications
You can also search for this author in PubMed Google Scholar
Akifumi Hagiwara
View author publications
You can also search for this author in PubMed Google Scholar
Koichiro Yasaka
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyuki Akai
View author publications
You can also search for this author in PubMed Google Scholar
Akira Kunimatsu
View author publications
You can also search for this author in PubMed Google Scholar
Shigeru Kiryu
View author publications
You can also search for this author in PubMed Google Scholar
Issei Fukunaga
View author publications
You can also search for this author in PubMed Google Scholar
Shimpei Kato
View author publications
You can also search for this author in PubMed Google Scholar
Toshiaki Akashi
View author publications
You can also search for this author in PubMed Google Scholar
Koji Kamagata
View author publications
You can also search for this author in PubMed Google Scholar
Akihiko Wada
View author publications
You can also search for this author in PubMed Google Scholar
Osamu Abe
View author publications
You can also search for this author in PubMed Google Scholar
Shigeki Aoki
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shohei Fujita.

Ethics declarations

Guarantor

The scientific guarantor of this publication is Shigeki Aoki.

Conflict of interest

The authors of this manuscript declare no relationships with any companies whose products or services may be related to the subject matter of the article.

Statistics and biometry

One of the authors has significant statistical expertise.

Informed consent

Written informed consent was obtained from all subjects (patients) in this study.

Ethical approval

Institutional review board approval was obtained.

Study subjects or cohorts overlap

Some study subjects or cohorts have been previously reported in Fujita et al Hum Brain Mapp 2021.

Methodology

• prospective

• experimental

• performed at one institution

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

ESM 1

(DOCX 3768 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fujita, S., Hagiwara, A., Yasaka, K. et al. Radiomics with 3-dimensional magnetic resonance fingerprinting: influence of dictionary design on repeatability and reproducibility of radiomic features. Eur Radiol 32, 4791–4800 (2022). https://doi.org/10.1007/s00330-022-08555-3

Download citation

Received: 05 July 2021
Revised: 23 November 2021
Accepted: 23 December 2021
Published: 18 March 2022
Issue Date: July 2022
DOI: https://doi.org/10.1007/s00330-022-08555-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Radiomics with 3-dimensional magnetic resonance fingerprinting: influence of dictionary design on repeatability and reproducibility of radiomic features

Abstract

Objectives

Methods

Results

Conclusion

Key Points

Similar content being viewed by others

Accuracy, repeatability, and reproducibility of T1 and T2 relaxation times measurement by 3D magnetic resonance fingerprinting with different dictionary resolutions

Magnetic Resonance Fingerprinting - a promising new approach to obtain standardized imaging biomarkers from MRI

Time efficient whole-brain coverage with MR Fingerprinting using slice-interleaved echo-planar-imaging

Introduction

Materials and methods

MRF acquisition and dictionary-matching

Data post-processing and radiomic feature extraction

Statistical analysis

Results

Scan-rescan repeatability of MRF-based radiomic features using dictionaries with different step sizes

Inter-dictionary reproducibility of MRF-based radiomic features

Discussion

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Guarantor

Conflict of interest

Statistics and biometry

Informed consent

Ethical approval

Study subjects or cohorts overlap

Methodology

Additional information

Publisher’s note

Supplementary Information

ESM 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation