Introduction

Skeletal muscles play a critical role in healthy aging. Beyond enabling daily activity and exercise, they hold several essential metabolic functions, including serving as the principal tissue for insulin-stimulated glucose disposal, anti-inflammatory myokine production, protein regulation, and maintenance of resting energy expenditure [1, 2]. Consequently, the loss of skeletal muscle mass, commonly termed myopenia, can have significant functional and metabolic consequences [3]. These include exacerbated cardiometabolic disease, exercise intolerance, disability, diminished quality of life, hospitalizations, and even death [4].

Given skeletal muscles’ diverse functions, myopenia is a critical component of the diagnostic criteria for sarcopenia and malnutrition [5, 6]. Like sarcopenia, which is defined as the loss of skeletal muscle mass (myopenia) and strength (dynapenia), myopenia can be primary (age-related) or secondary to factors other than aging, including chronic diseases such as heart failure (HF) [3]. Current literature reveals a significant discrepancy between myopenia and dynapenia, mainly attributable to reliance on dual-energy X-ray absorptiometry (DXA) and bioelectrical impedance analysis (BIA) as proxies for lean mass without directly evaluating muscle quantity or quality [7, 8].

Single-slice skeletal muscle area (SMA) has been shown to correlate with total body muscle mass and muscle strength [8, 9]. It also predicts poor outcomes in various clinical settings, with computed tomography (CT) and magnetic resonance imaging (MRI) considered gold-standard methods [10]. With much of the knowledge gained in oncology and hepatology, the current best practice is total muscle or psoas-only muscle quantification at the third or fourth lumbar vertebral levels [11]. However, this practice is of limited utility in heart or lung diseases, where imaging is less frequently obtained in the abdomen/pelvis or lumbar spine relative to the chest. Unfortunately, studies comparing thoracic-level to lumbar-level muscle measurements have been rare in cardiology and non-existent in HF.

Identifying an optimal thoracic landmark for skeletal muscle assessment could make possible opportunistic screening in patients with HF, who both tend towards functional decline and associated morbidity and commonly undergo thoracic CT evaluation. To that end, we sought to evaluate correlations between skeletal muscle quantity and quality measurements at various thoracic landmarks compared to the widely-used muscle landmarks at the L3 level. We aimed to determine the best surrogate with sex-stratified cutoff values in patients with HF.

Materials and methods

The Cleveland Clinic Institutional Review Board approved this study. Given the retrospective design, the need for written informed consent was waived.

Patients admitted to the Cleveland Clinic between January 2017 and December 2018 for a primary diagnosis of acute decompensated HF (ADHF) were retrospectively identified. ADHF was defined as an admission lasting > 24 h with signs and symptoms of congestion requiring intravenous diuretics. Patients with a history of HF or de novo HF were eligible for the current study irrespective of their admission left ventricular ejection fraction (EF). The main inclusion criterion was the presence of both CT imaging of the chest and abdomen/pelvis with or without contrast one month before the discharge date. Both contrast and non-contrast CTs were eligible as prior research has shown that SMA is minimally affected by contrast enhancement [12]. Exclusion criteria were primarily driven by issues with image extraction, quality, or muscles being cut off the image border or poorly differentiated from arm muscles (Fig. 1). Baseline features, including patient characteristics, comorbidities, medications, and laboratory tests, were collected retrospectively, as previously reported [13]. HF was classified based on the most current transthoracic echocardiogram into reduced (≤ 40%), mildly reduced (41–49%), and preserved (≥ 50%) EF; previous measurements were not collected for identification of improved EF.

Fig. 1
figure 1

Patient selection. CT, computed tomography; CIED, cardiac implantable electronic device

Body composition measurements were made on opportunistic CT axial images using the commercially available software Slice-O-Matic (Version 5.0, Tomovision, Quebec, Canada) and Automatic Body composition Analyzer using Computed tomography image Segmentation plus (ABACS+) module (Voronoi Health Analytics, Vancouver, British Columbia). ABACS + is a semi-automated segmentation tool that tags skeletal muscles using its knowledge of muscle shapes at the specific level and previously validated Hounsfield unit (HU) range of -29 to 150; this is followed by tagging the intermuscular, subcutaneous, and visceral adipose tissues using corresponding predefined HU ranges [14].

Two observers (S.M. and I.P.) made measurements within the institution network blinded to patient history and outcomes. Before initiation, both observers were instructed by a board-certified radiologist (P.H.C.) on identifying anatomic landmarks and regions of interest. Approximately five hours of training with Slice-O-Matic and ABACS + was done remotely with software developers. The observers each made measurements on 100 patients, with confirmation of a board-certified radiologist on accuracy (initial 20 measurements). Intra-observer variability was assessed using the intraclass correlation coefficient (ICC) on ten randomly selected patients, equating to 50 individual measurements. On a scale of 0 to 1, ICC scores were generated to assess observer agreement; a score greater than 0.90 was considered excellent reliability, 0.75 to 0.9 good, 0.5 to 0.75 moderate, and less than 0.5 poor. Intra-observer agreements for SMA and intermuscular adipose tissue area (IMAT) were excellent at all levels besides pectoralis unilateral AbvAoAr IMAT, which was moderate and nonsignificant (Table S1). Interobserver variability was not performed given the semi-automatic nature of the measurements with ABACS+.

For segmentation, the CT image files were extracted and uploaded into Slice-O-Matic, where sagittal views were reproduced from axial slices. The thoracic vertebrae were manually identified by locating the most cranial vertebra with protruding ribs attached anteriorly to the sternum, identified as the first thoracic vertebra, and counting down. The lumbar vertebrae were manually identified by locating the sacrum and counting up to the last thoracic vertebra with protruding ribs, with the highest lumbar-like vertebra considered the first lumbar vertebra. Following the identification of vertebral levels on sagittal view, automated measurements of total SMA and IMAT were made on axial view immediately above the aortic arch (AbvAoAr; Fig. 2a) and the mid-vertebral body of the eighth thoracic vertebra (T8; Fig. 2c), T12 (Fig. 2d), and L3 (Fig. 2e). In addition, manual SMA and IMAT measurements were made of the pectoralis major and minor unilaterally at AbvAoAr using the ranges of -29 to + 150 HU and − 190 to -30 HU, respectively (Fig. 2b).

Fig. 2
figure 2

Tagged computed tomography axial images above the aortic arch (AbvAoAr; (A), unilateral total; (B), unilateral pectoralis) and at the eighth thoracic (T8; C), twelfth thoracic (T12; D), and third lumbar (L3; E) vertebrae. Measurements AbvAoAr were made unilaterally on the right unless a right-sided device was present (F). The tagged tissues were skeletal muscle (red), intermuscular adipose tissue (green), subcutaneous adipose tissue (teal), and visceral adipose tissue (yellow)

Measurements AbvAoAr (both automatic and manual) were unilateral right-sided, given the prevalence of cardiovascular implantable electronic devices (CIEDs) in patients with HF on the left chest, but the left side was used when issues were encountered on the right side (Fig. 2f). Frequent reasons for left-sided measurements included right chest devices and hardware, poor chest and arm muscle differentiation, muscle of interest being cut out of the right image border, and high right intravenous contract load. Measurements at all other vertebral levels were bilateral. A subgroup analysis was performed comparing the correlation of total unilateral AbvAoAr to L3 measurements in patients with and without CIEDs to assess the effect of device presence.

Following the measurement of raw values, skeletal muscle index (SMI; cm2/m2) was calculated to normalize for body size by dividing the SMA (cm2) by the square of the patient’s height (m2) [15, 16]. Two definitions of low muscle mass were used: (1) the lowest sex-stratified L3 SMI tertile, and (2) L3 SMI cutoff values of ≤ 52.4 cm2/m2 in men and ≤ 38.5 cm2/m2 in women based on frequently used cancer cutoffs [17]. The degree of myosteatosis (skeletal muscle fat infiltration) was assessed via quantification of adipose tissue within muscle fascia, also known as IMAT, which was then used to calculate IMAT% with the formula: IMAT (cm2) / (SMA (cm2) + IMAT (cm2)) × 100 [18].

For statistical analysis, continuous variables were expressed as mean ± standard deviation if normally distributed or median (25-75th percentile) if non-normally distributed. Categorical values were reported as numbers and percentages. The student t-test and analysis of variance were used for the comparison of continuous normally distributed variables, and the Kruskal-Wallis and Mann-Whitney U tests for non-normally distributed variables. The Chi-squared test was used for categorical variables. Distributional histograms were used to assess the normality of distribution. The relationships between the different vertebral level body composition measurements were assessed using the Pearson correlation coefficient (r). Fisher Z-transformation was used to determine differences in correlation strength. Cutoffs for low muscle mass were obtained using receiver operating characteristic (ROC) curve analysis to maximize true-positive and minimize false-negative diagnoses from the lowest sex-stratified tertile and the frequently used cancer L3 SMI cutoffs. A P-value equal to or less than 0.05 indicated a statistically significant difference. All statistical analyses were performed using SPSS (Version 25, SPSS Inc., Chicago, IL, USA).

Results

Of the 316 patients who met inclusion criteria, 116 were excluded (Fig. 1). A total of 200 patients were included, 89 (44.5%) female. Overall, the characteristics of the population are reflected in Table 1. The average age was 71 ± 14, with most patients being Caucasian (75.5%). The majority had HF with preserved EF of ≥ 50% (49.0%), followed by reduced ≤ 40% (39.5%) and mildly reduced 41–49% (11.5%).

Table 1 Baseline characteristics of the total analytic cohort

The strongest correlation of thoracic SMI at the different thoracic levels with the widely used L3 SMI was at the T12 level (Table 2). This was followed, in descending order, by total unilateral AbvAoAr, T8, and pectoralis unilateral AbvAoAr. The T12 level persisted as the strongest correlation after sex-stratification (Fig. 3), but T8 was better than total unilateral AbvAoAr in men (r = 0.787 vs. r = 0.771), and pectoralis unilateral AbvAoAr was better than T8 in women (r = 0.609 vs. r = 0.567). A trend similar to SMI applied to IMAT%. The strongest correlation was at T12, followed by total unilateral AbvAoAr and T8; the correlation was weak at pectoralis unilateral AbvAoAr (Table 2).

Fig. 3
figure 3

Scatterplots demonstrating the correlation between computed tomography L3 SMI measurements and those AbvAoAr (unilateral total and unilateral pectoralis), at T8, and at T12 in men (left) and women (right). Horizontal and vertical lines represent the lowest sex-stratified tertile L3 SMI cutoffs (solid lines) and frequently used cancer L3 SMI cutoffs (dotted lines). SMI, skeletal muscle index; AbvAoAr, above the aortic arch; L3, third lumbar vertebra; Pec, pectoralis; T8, eighth thoracic vertebra; T12, twelfth thoracic vertebra

Table 2 Correlations between computed tomography-derived skeletal muscle quantity and quality measurements at various thoracic vertebral levels compared to the third lumbar vertebral level (L3), with group comparisons based on sex-specific tertiles of L3 skeletal muscle index

Next, a subgroup analysis was performed to assess the difference in correlation of L3 measurements to total unilateral AbvAoAr in those with (n = 27) and without (n = 173) a CIED present. The correlation of SMI and IMAT%, respectively, were r = 0.833 (p < 0.001) and r = 0.809 (p < 0.001) in those with a CIED compared to r = 0.725 (p < 0.001) and r = 0.716 (p < 0.001) in those without a CIED. These correlations were compared using Fisher Z-transformation without a significant difference for SMI (z = 1.28, p = 0.200) or IMAT% (z = 1.03, p = 0.303).

Finally, patients were grouped based on sex-stratified tertile cutoffs of SMI at L3 into lowest, middle, and highest tertiles. Detailed data for the groups are presented in Table S2. A total of 65 patients (32.5%) were identified as being in the lowest tertile. These patients were older with lower weight, BSA, and BMI than the other groups. There was no difference in sex, race, or HF classification. N-terminal prohormone of brain natriuretic peptide (NT-proBNP) was higher, and albumin was lower among patients in the lowest tertile. The only difference in comorbidities and medications was higher statin use among patients in the highest and middle tertiles. The average SMI differed at all levels between the groups, but the average IMAT% only differed at L3 (Table 2). Area under the ROC curve analysis was used to obtain low thoracic vertebral SMI cutoffs based on the sample’s lowest sex-stratified tertile L3 SMI cutoffs (65/200) and the frequently used cancer L3 SMI cutoffs (142/200), which are presented in Table 3 along with their sensitivities and specificities.

Table 3 Thoracic computed tomography cutoffs for low skeletal muscle index based on sex-stratified tertile and frequently used cancer cutoffs at the third lumbar vertebra

Discussion

Our study comparing single-slice skeletal muscle evaluation at the widely used L3 level on abdominopelvic CT with immediately above the aortic arch, T8, and T12 on chest CT demonstrates that SMI (muscle quantity) at T12 has the strongest correlation with L3. This was also true for IMAT% (muscle quality); however, the average IMAT% among the L3 SMI tertiles was only significantly different at L3, making it potentially less valuable at the other levels. Thus, based on these results, T12 may be a good target for opportunistic muscle evaluation in patients with HF without CT of the abdomen/pelvis to predict outcomes or diagnose sarcopenia or malnutrition. The cutoffs based on the lowest sex-stratified L3 SMI tertile were 26.3 cm2/m2 for females and 31.1 cm2/m2 for males at the T12 level, while the cutoffs based on the frequently used cancer L3 SMI cutoffs were 27.5 cm2/m2 for females and 41.0 cm2/m2 for males at the T12 level.

Early sarcopenia identification is critical for intervention and mitigation of poor outcomes, but a formal diagnosis can be complex, given the need for muscle strength testing [11]. Opportunistic evaluation of skeletal muscles on imaging alone to identify myopenia is useful for outcomes prediction and early recognition of muscle wasting [19]. However, the widely used abdominopelvic muscle measurements at L3 [11] are of low opportunistic use in cardiology, where most imaging is done of the chest.

Unfortunately, there is significant heterogeneity in muscle measurements of the chest used in the literature, with most data derived from oncology and pulmonology populations [20]. A notable 2017 study compared SMI at L3 to T12 and T7 on preoperative CT of the aorta in patients undergoing transcatheter aortic valve replacement (TAVR) [21]. They found a higher correlation at T12 (r = 0.709, p < 0.001), which agrees with our study, indicating T12 as the most likely candidate for opportunistic use in cardiology patients. The close correlation of muscle area at the third or fourth lumbar vertebral levels with whole-body muscle measurements has been postulated to be due to the psoas, paraspinal, and abdominal muscles at these levels being minimally influenced by activity, unlike the appendicular muscles [22]. Thus, the close correlation seen at T12 may be due to its proximity to the lumbar vertebrae with the presence of abdominal muscles at the level; although T12 does not contain psoas, muscles seen at this landmark include rectus abdominis, diaphragm, external oblique, intercostals, latissimus dorse, and erector spinae.

A unique aspect of thoracic imaging in cardiology, particularly HF, is the presence of CIEDs, which may impede the assessment of adjacent structures due to metal artifacts in CT studies. Prior studies have addressed this limitation by making unilateral measurements opposite the device implantation site [23], a technique we also utilized in our study. Despite this, 26 patients were excluded from the original sample due to significant artifacts extending to both sides. To ensure that the presence of devices did not alter the correlation of upper thoracic measurements with L3, we performed a subgroup analysis of patients with versus without CIEDs. Both groups showed good correlation without a significant difference between them, indicating that the contralateral artifact, not extending beyond the midline subjectively, did not alter HUs, and thus measurements, in the upper chest.

Given the widespread use of L3 SMI, various cutoff values have been proposed. The first and most used cutoffs were established in 2008 in an obese Canadian population with respiratory or gastrointestinal cancers [17]. Sex-specific cutoffs of 52.4 cm²/m² in men and 38.5 cm²/m² in women were found based on association with mortality. Despite this, most studies derive their own cutoff values from morbidity and mortality or sex-specific lowest tertile, quartile, or fifth percentile of subjects, particularly outside of oncology [22]. Given the lack of cutoffs in cardiology populations, the latter definitions are likely more appropriate.

The previously mentioned 2017 study in patients undergoing TAVR utilized the frequently used L3 SMI cancer cutoff established in 2008 and proposed T12 cutoffs of 42.6 cm2/m2 in men and 30.6 cm2/m2 in women [21]. We focused on using the lowest sex-stratified L3 SMI tertile for low muscle mass evaluation with T12 SMI cutoffs of 31.1 cm²/m² in men and 26.3 cm²/m² in women. In addition, we also provided cutoffs at T12 of 41.0 cm²/m² in men and 27.5 cm²/m² in women based on those L3 SMI cancer cutoffs. The cancer cutoffs identified a much higher number of patients as having low muscle mass than sex-stratified tertiles (71.0% vs. 32.5%). However, an outcomes-based evaluation of these cutoffs in larger samples is required to compare their utility in cardiology patients.

Among the grouped participants based on sex-stratified L3 SMI tertiles, a significant difference was seen in NT-proBNP and albumin levels. The median NT-proBNP among patients in the lowest tertile was more than double that of the other groups. This agrees with prior studies showing a strong inverse relationship between SMA and NT-proBNP [24]. Although this difference could also be attributed to the higher BMI in patients without low muscle mass, multiple studies have demonstrated lean rather than fat mass to be responsible for the association between higher BMI and lower NT-proBNP [24,25,26]. Although the mechanism is unclear, the influence of sex steroid hormones has been postulated [27]. The association between low muscle mass and hypoalbuminemia is also not surprising, given the close relationship between myopenia and malnutrition, both contributing to frailty and worse outcomes [28].

The association between low muscle mass and lower statin use in our study is less clear. Statins have well-known benefits for primary and secondary prevention of cardiovascular disease, but they can induce adverse effects on the muscles. Although statin use post-endovascular aortic repair has shown lower long-term mortality without predisposition to muscle wasting based on successive SMA measurements [29], statin-induced myopathy shares some proposed mechanisms with myopenia and sarcopenia in HF, such as mitochondrial dysfunction [30] and ubiquitin-proteasome system upregulation [31, 32]. This leads us to believe that the lower statin use in patients with myopenia may be due to intolerance from worsened myopathy instead of statin use being protective against muscle wasting.

There are several limitations to our study. Although the study sample was more than most myopenia or sarcopenia studies, it was still small without power or sample size calculations, making type II error possible. Several patients also had to be excluded, primarily because of issues with opportunistic CT windows; this may be addressed by incorporating standardized institutional scanning protocols on routine imaging to ensure adequate windows containing all tissues. Measurements were done using the update to ABACS, ABACS+, for which there is limited validation, particularly on vertebral levels other than L3. Measurements AbvAoAr also had to be made unilaterally, given the presence of unilateral artifacts in most patients. The sample was obtained from patients hospitalized for ADHF, which may alter study variables; however, data such as weight and creatinine were obtained from the last values before discharge to ensure euvolemia and homeostasis. Finally, the diagnosis of sarcopenia was not made formally, given the lack of muscle strength testing.

In conclusion, frequently obtained imaging studies, such as computed tomography, are of opportunistic use for body composition analysis in chronic diseases like heart failure. However, such evaluation has been difficult in cardiac patients given the current consensus of using third or fourth lumbar vertebrae on abdominopelvic imaging while most imaging in cardiology is of the chest. This study adds to the limited literature supporting using the twelfth thoracic vertebra as a landmark for skeletal muscle evaluation. This allows us to further expand our knowledge on myopenia and sarcopenia in patients with heart failure and work towards a consensus for skeletal muscle evaluation in this population.