Introduction

Sarcopenia is a prevalent geriatric syndrome that is characterized by the gradual loss of skeletal muscle mass and strength. It is associated with higher healthcare expenditure and adverse health consequences such as impaired quality of life and increased risk of morbidity and mortality1. However, despite the wide impact of this syndrome, there is currently no universal diagnostic criterion for the diagnosis of sarcopenia. The most commonly used diagnostic classification was developed by the European Working Group on Sarcopenia in Older People (EWGSOP) and the Asian Working Group for Sarcopenia (AWGS)2. Both EWGSOP and AWGS used the same measurements, i.e. muscle mass, muscle strength, and gait speed, but with different cut-off scores. It is because there are differences in ethnicities, body size, lifestyles, and cultural background between European and Asian population3. Depending on the diagnostic classification used, the latest systematic review and meta-analyses reported that the global prevalence of sarcopenia varied between 10 and 27%2.

The impact of sarcopenia in Asia is considered to be stronger than in other continents because it is the most populated and fastest aging continents in the world3. Therefore, AWGS introduced “possible sarcopenia” terminology in their latest guideline. It refers to a condition of reduced muscle strength with or without reduction of physical performance4. This new terminology is intended for early identification of people with or at risk for sarcopenia in the primary healthcare or community preventive service settings, where advanced diagnostic equipment is not available. With early identification, immediate interventions such as lifestyle interventions can be given while the patients await for confirmatory diagnosis in the secondary or tertiary healthcare centers4. For initial screening of possible sarcopenia, 3 modalities, i.e. the measurement of calf circumference (CC), strength, assistance with walking, rising from a chair, climbing stairs, and falls (SARC-F) questionnaire, and a combination of both (SARC-CalF) are recommended. When the screening evaluation gives an indication for possible sarcopenia, muscle strength and physical performance assessment are done to confirm the possible sarcopenia diagnosis4.

While the purpose for simple initial screening in primary healthcare settings are deemed beneficial, the cut-off scores determination for the screening modalities are based on the research evidences from only 7 Asian countries, i.e. China, Hong Kong, Japan, Singapore, South Korea, Taiwan, and Thailand4. Within the Asia continent, there is a wide range of ethnicities, cultural, lifestyle, and socioeconomic variations3, and this could possibly influence the applicability of the screening modalities. However, to the best of our knowledge, no validation study using data from other Asian countries has been done to confirm the diagnostic performance of the recommended screening modalities. Therefore, there is an urgent need for a validation study.

For a validation study, the use of data from Indonesian population might be of importance. There are 3 main reasons for this. First, Indonesia has the highest number of elderly population among other countries in Southeast Asia region (account for 30.86% of all elderly population in South East Asia region), and the fourth highest in Asia continent after China, India, and Japan5. Second, using the same diagnostic classification, the prevalence of sarcopenia in Indonesia was 50.25%6, higher than the average prevalence in Asian population that was reported to be 14%3. Third, Indonesian population have significantly smaller body size than Singaporean population7; however, the cut-off scores for the screening modalities were formulated based on the data from Singaporean and not from the Indonesian population4. Therefore, this study aims to evaluate the diagnostic performance of 3 recommended modalities, i.e. CC, SARC-F, and SARC-CalF, for possible sarcopenia screening using data from Indonesian population.

Methods

Design and study population

We used data from our previously published cross-sectional study8. This study was conducted in Surabaya city, the capital city of East Java province, Indonesia, between December 2017 and March 2018. Surabaya city had 63 primary healthcare centers across 5 administrative regions. We randomly selected 1 primary healthcare center from each administrative regions (5 primary healthcare centers in total) for the study site. The study population were subjects aged ≥ 60 years old who visited one of the selected primary healthcare centers during the study period8. Total sampling method was used in this study. Inclusion criteria were subjects who are willing to participate in the study and signed the inform consent. Exclusion criteria were subjects with severe cardiovascular or respiratory diseases, history of pacemaker implantation, and impaired cognitive function. However, we used the AWGS 2014 guideline for the previous study3, whereas this study used the AWGS 2019 guideline4. The study was approved by the Ethical Committee of the Faculty of Medicine Universitas Airlangga prior to the data collection (Approval number: 273/EC/KEPK/FKUA/2017) and conducted in accordance with the Declaration of Helsinki.

The minimum required sample size was calculated using EpiInfo9. According to the data from the Bureau of Statistics, the total number of elderly in Surabaya city was 246,069 in 201810. Using the prevalence of sarcopenia from our previous finding (41.8%)8, with a confidence level of 95%, and precision of 7%, a minimum of 191 subjects were needed to obtain sufficient statistical power.

Study instruments

This study used primary data collection. Sociodemographic and clinical characteristics were collected using a questionnaire. Cognitive function was evaluated using abbreviated mental test (AMT), where score below 8 indicates moderate cognitive impairment and were subjected for exclusion11. Nutritional state was assessed using the Mini Nutritional Assessment (MNA).

For possible sarcopenia screening, all 3 modalities, CC, SARC-F, and SARC-CalF, were evaluated. CC is measured with the subjects in a sitting condition using a non-elastic tape to find the maximum circumference of both calves. The SARC-F questionnaire evaluates 5 components: strength, walking assistance, rising from a chair, stair-climbing, and falls. Each component is given a score of 0–2. SARC-CalF is a combination between SARC-F and CC. The scoring of SARC-CalF is similar to SARC-F; however, additional 10 points is given to male subjects having CC < 34 cm or female subjects having CC < 33 cm. The cut-off scores for the possible sarcopenia based on the AWGS 2019 guideline are as follows: CC < 34 cm for male subjects or CC < 33 cm for female subjects, SARC-F score ≥ 4, or SARC-CalF score ≥ 114.

To confirm the diagnosis of possible sarcopenia, 2 indicators were used based on AWGS 2019 guideline, i.e. muscle strength and physical performance. The muscle strength was measured by TKK 5001 Grip A dynamometer, while physical performance was measured by the repeated chair stand test. The diagnosis of possible sarcopenia was confirmed if one of the indicators was fulfilled: (1) handgrip strength of < 28 kg for male subjects and < 18 kg for female subjects; (2) repeated chair stand test ≥ 12 s4.

Data analyses

Data were analyzed using SPSS statistics version 25.0 for Windows (IBM Corp., Armonk, NY, USA). Continuous variables were presented as median [interquartile range], and categorical variables were presented as frequency (percentages). Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and likelihood ratio (LR) were calculated and receiver operating characteristic (ROC) curve analysis was performed to evaluate the diagnostic performance of each screening modalities in predicting possible sarcopenia. In all analyses, p-value < 0.05 was considered statistically significant.

Results

There were 320 subjects from five primary healthcare centers in Surabaya screened for the study. Of them, 27 subjects were excluded because of cognitive impairment and 27 other subjects were excluded because they did not finished the relevant tests needed for this study, leaving 266 subjects for the analyses (Fig. 1). The majority of subjects (71.8%) were female, with median age of 65 years. More than half of the study subjects undergone routine exercise and were in good nutritional state. Detailed clinical characteristics of the study subjects are presented in Table 1.

Figure 1
figure 1

Flow chart of the study population selection.

Table 1 Study subjects’ clinical characteristics.

From 266 subjects, 186 (69.9%) of them were confirmed for possible sarcopenia from either having low handgrip strength or prolonged repeated chair stand test. When CC measurement was used as the screening modality, 127 (47.7%) subjects were suggestive of possible sarcopenia. In comparison, only 16 (6%) subjects were suggestive of possible sarcopenia when using SARC-F and 40 (15%) subjects when using SARC-CalF. CC had the highest sensitivity, whereas SARC-F had the highest specificity. The area under the curve (AUC) was 0.511 for CC, 0.543 for SARC-F, and 0.572 for SARC-CalF. The detailed diagnostic performance are presented in Table 2. The visualization of the ROC curve is shown in Fig. 2.

Table 2 Diagnostic performance of CC, SARC-F and SARC-CalF for possible sarcopenia screening.
Figure 2
figure 2

Receiver operating characteristic curve of CC, SARC-F, and SARC-CalF using the recommended cut-off from the AWGS 2019 guideline for possible sarcopenia screening. CC calf-circumference measurement, SARC-F strength, assistance with walking, rising from a chair, climbing stairs, and falls questionnaire, SARC-CalF combination of SARC-F and CC.

Discussion

In this study, we evaluated the diagnostic performance of 3 modalities for possible sarcopenia screening that is recommended by the AWGS 2019 guideline, including the cut-off value. Our study showed that the diagnostic performance of the recommended modalities were poor, as indicated by the low sensitivity and low AUC.

In theory, diagnostic performance of a screening modality is expected to have high sensitivity, specificity, PPV, and PPV, and an AUC nearing 1.0. However, the chance of finding such screening modality is very small. Therefore, depending on the expected performance, screening modality in practice do not necessarily need to be highly sensitive and specific at the same time. If the intended use is to screen disease with the low prevalence, high specificity and PPV are more favorable. In contrast, greater emphasis should be given towards sensitivity and NPV if it is intended to screen disease with high prevalence12. Since sarcopenia is common, screening modality should have high sensitivity and NPV.

Using the cut-off from AWGS 20194, we found that the CC had the highest sensitivity rate but lowest specificity rate among the screening indicators, although both are relatively low. This study’s finding is similar to the Korean study by Kim and Won13 which found that CC as the screening indicator had higher sensitivity (62.9–76.3%) but lower specificity (57.5–59.7%) in detecting sarcopenia compared to SARC-F and SARC-CalF13. Chinese study by Mo et al.14 found higher sensitivity and specificity of CC compared to our study and the Korean study (81.4% sensitivity and 77.0% specificity)14. Considering that our study, Chinese study, and Korean study used the same AWGS 2019 cut-off for CC with variable results of accuracy, it is suggested that there is a variability of the accuracy of CC as the screening indicator for possible sarcopenia, possibly influenced by factors such as anthropometry, nutritional, or others.

Both SARC-F and SARC-CalF in this study had comparable specificity; however, the sensitivity was higher in SARC-CalF. Similar finding has been shown by Ito et al.15, where the sensitivity of SARC-CalF was higher than SARC-F15. The low sensitivity of both indicators deems them unsuitable to be used for screening, but their high level of specificity makes them favorable to be used in diagnosing possible sarcopenia.

In this study, we found that CC had the lowest AUC and SARC-CalF had the highest AUC. Our findings were in contrast with previously published studies from Korea13 and China14, where both of the studies found that the CC has a relatively higher accuracy based on the AUC value compared to either SARC-F or SARC-CalF. The establishment of CC as a screening indicator of sarcopenia stems from the understanding that CC reflects the muscle mass component of sarcopenia pathophysiology16. However, there are functional aspects of sarcopenia that cannot be reflected by CC17. Considering this issue, and another finding by Ito et al.15 that establishes no significant change of accuracy of either CC, SARC-F, or SARC-CalF before and after change in AWGS cut-off15, we postulate that CC is specifically less accurate in detecting condition defined by AWGS 2019 guideline as “possible sarcopenia”, in which physical characteristic of sarcopenia does not always profound in the patients.

Other than that, some factors possibly influence the CC measurement. For instance, edema in the calf possibly overstates the muscular volume of the calf, influencing the accuracy of CC as a screening indicator of sarcopenia18. There is also a possible influence of the difference in CC measurement method in some studies, specifically in what posture the CC measurement is conducted. Ito et al.15, for example, conducted the CC measurement while the subject was in a sitting position, similar to our study15. Meanwhile, both Kusaka et al.19 and Kawakami et al.20 measured the CC while the subject was in standing and supine, respectively19,20. Those 3 studies provided different accuracies of CC, with sensitivity ranging from 73.6 to 88.0% and specificity ranging from 62.8 to 91.0%. With some studies finding the significant influence of posture on CC measurement21,22, standardization is needed to ensure the reliability of CC as the screening indicator for possible sarcopenia. For instance, since some studies found that CC measurement in a standing position has higher accuracy and lower overestimation21,23, we need to consider standardizing the CC measurement in a standing position.

There are several limitations in this study. This study only includes subjects from one urban city; thus, generalization of this study’s findings should be done cautiously. The majority of the subjects were women, whereas it is known that gender plays an important role in sarcopenia24,25. Next to that, we did not record any subjects with pretibial edema conditions. Nonetheless, this is the first study that validate the diagnostic performance of 3 screening indicators proposed by the AWGS 2019 guideline to detect possible sarcopenia, a definition that was also introduced by the AWGS in the same year4.

Conclusion

Our findings showed that none of the screening modalities that were recommended by the AWGS in their latest guideline were able to identify elderly subjects for possible sarcopenia in our study population. This suggest that the current cut-off scores for initial screening for possible sarcopenia may not be applicable in Indonesia. Multicenter studies with larger study population from other urban areas and also rural areas in Indonesia should be done to confirm these findings.