Background

Frailty is a clinical syndrome characterized by reduced physiological body reserve that was originally described in older persons [1, 2], but is increasingly identified in younger age groups [3,4,5]. This vulnerable state results in decreased responses to internal and external stressors that lead to negative clinical consequences [6]. Frailty has been associated with a higher number of comorbidities, increased incidence of falls, and more disability in older people [7,8,9]. Increased rates of hospital admission and mortality have been observed in studies of frailty [10, 11].

Frailty screening reliably classifies high-risk older patients in several medical contexts, including surgical care [12,13,14]. Frail patients are more likely to experience negative postoperative consequences [15, 16]. One systematic review reported a strong relationship between frailty and major adverse cardiac events after cardiac surgery [17]. In cancer patients, frailty status predicts outcomes for many modalities of treatments, including patients undergoing surgical intervention [18].

Improved methods to identify frailty in older, surgical patients could lead to better perioperative care. Preoperative preparation including prehabilitation, correcting malnutrition, psychological intervention, and prevention of postoperative delirium have been recommended to improve the quality of care. Shared decision-making is also suggested to better prepare frail patients for surgery [19, 20]. Intraoperative management of altered pharmacodynamic and pharmacokinetics [21], and postoperative multidisciplinary care in frail patients are also beneficial [22]. Therefore, frailty is included in the list for optimal preoperative assessment of older surgical patients guideline recommended by the American College of Surgeons National Surgical Quality Improvement Program and the American Geriatrics Society [23].

Several approaches have been proposed to identify frailty [2, 24,25,26]. Although the Fried criteria and Frailty Index have been widely cited in the literature, these tools are technically complex and time consuming. The FRAIL scale or Simple Frailty Questionnaire (SFQ), comprised of a 5-item, self-reported questionnaire, may be a more practical screening tool [5]. The FRAIL scale is increasingly accepted worldwide and many countries have validated it to their cultural context [5, 27,28,29].

Few studies have been conducted to assess the validity of frailty diagnosis in Thailand. The Thai Frailty Index (TFI) is constructed according to the accumulated deficits approach and is associated with mortality risk in older people [30]. However, administering the TFI is time consuming. The Simple Frailty Questionnaire could be preferable in busy clinical practices, including preoperative evaluation for elective surgery. Therefore, we performed a cross-cultural adaptation of the questionnaire, assessed its psychometric properties, and explored the optimal cutoff point for frailty in older patients at the surgical preoperative clinic.

Methods

Design, setting, and participants

The study aimed to perform a cross-cultural adaptation of the simple frailty questionnaire and assess its validity and reliability in comparison to the Thai Frailty Index in a preoperative setting. This study was conducted in accordance with the Declaration of Helsinki and was approved by Institutional Review Board, Faculty of Medicine Siriraj Hospital, Bangkok, Thailand (COA no. Si 397/2018). This cross sectional-study included 3 hundred patients from the Surgery outpatient clinic at Siriraj Hospital, enrolled from April 2019 to November 2020. Eligible participants were aged 60 years or over and were scheduled for elective surgery. The participants were excluded if they were unable to fluently communicate in Thai. After informed consent was obtained, baseline demographic data were recorded. The TFI and T-FRAIL were collected by two trained research assistants, separately.

The Thai frailty index (TFI)

The TFI was developed using accumulation deficit model and validated against mortality risk among community dwelling older people [30]. It was created following the suggested standard procedure [31] comprising 30 variables that included medical comorbidities, physical function, cognition and emotional health. The cutoff point of more than 0.25 was used for identifying frailty [30].

The FRAIL scale and T-FRAIL questionnaires

The FRAIL scale is, 5-item questionnaire for frailty screening. The questionnaire aims to “fatigue”, “resistance”, “ambulation”, “illnesses” and “loss of weight”. Each item required self-report response, which was rated as 1 and 0 points depending on the presence of characteristic in each criterion. The score range from 0 to 5 with higher score demonstrated more frailty characteristics. According to the original FRAIL scale, cutoff point of 3 or above were used to identify frail patient, while score of 2 indicated prefrail status [5]. The T-FRAIL was the Thai version translation of the FRAIL scale which was developed according to the following steps.

Translation process

Copyright permission for the Simple Frailty Questionnaire translation was granted by the owner, Professor John E. Morley, Division of Geriatric Medicine, Saint Louis University School of Medicine. The guideline for translation is adapted from ‘guidelines for the process of cross-cultural adaptation of self-report measures’ [32].

The translation process included six steps.

Step 1 initial translation

The Simple Frailty Questionnaire (FRAIL) was translated from the original English questionnaire by two Thai bilingual independent translators. The first translator was a geriatrician from Siriraj Hospital who was invited to translate the questionnaire from a clinical perspective. The second one was a non-medical translator who translated the questionnaire by language aspect only.

Step 2 synthesis of the translations

The research team held the discussion at Siriraj Hospital to review the two Thai versions of the questionnaire in comparison with the original English version. Thai words were carefully chosen from both translations to construct a single version of the questionnaire.

Step 3 Back-translation

The Thai version of the questionnaire was sent to two independent language institutions for back-translation. The first translator was invited from the Translation and Interpretation Center, Faculty of Liberal Arts, Mahidol University. The second translator was a Canadian-Thai translator from a private sector organization. These two translators were native English speakers and had no medical background.

Step 4 expert committee

An expert committee from Siriraj Hospital was formed to evaluate semantic equivalence, idiom equivalence, experiential equivalence, and conceptual equivalence. The original translation, the Thai questionnaire, and two back-translation versions were included in the discussion.

Step 5 test of the prefinal version

A preliminary assessment with 10 elderly participants aged more than 65 years old who attended a geriatric clinic, was done to assess questionnaire comprehension. Most words were easily comprehended by the elders, while some were too difficult. Drawbacks of the current questionnaire were recorded.

Step 6 submission of documentation to the developers

The research team adapted and finalized the questionnaire according to the pilot study results. All papers including two back-translations and the finalized Thai questionnaire were sent to the owner of the original version.

Validity and reliability assessment

Content validity

The Thai version of Simple Frailty Questionnaire (T-FRAIL) was evaluated by five independent geriatricians who were not involved in the translation process. Each item of the translated questionnaire was rated in a 4-point scale of ‘1 = not relevant’, ‘2 = somewhat relevant’, ‘3 = quite relevant’ and ‘4 = highly relevant’.

Test-retest reliability

The test-retest reliability was conducted in 30 elderly patients on two occasions 7 days apart with the questionnaire items having been rearranged. Later scores were compared with former scores to identify the Intraclass correlation coefficient.

Concurrent validity and diagnostic test assessment

The concurrent validity including sensitivity, specificity, likelihood ratio for positive and negative results, and the positive and negative predictive value of the T-FRAIL were analyzed in comparison with the TFI as a reference standard. Spearman’s correlation coefficient was estimated to address the correlation between the two tests.

The T-FRAIL_M1 and the T-FRAIL_M2 questionnaires

The original “illness” criterion in FRAIL scale required 5 or more underlying diseases to count as one score. Patients with several comorbidities were at higher risk of developing complications [33,34,35,36] and more likely to receive conservative treatment rather than undergoing surgeries [35,36,37,38]. The authors, therefore, anticipated that most patients recruited from preoperative setting for elective surgeries would have number of comorbidities less than 5 and might affect sensitivity of the test. Hence, we proposed T-FRAIL_M1 as a modification of T-FRAIL score by reducing illnesses criterion to 4 or more underlying diseases to determine whether this modification would improve test properties or not, in comparison with the original one. Another anticipated common underreported item would be the loss of weight as it was not a routine measurement among older people in our context. The modification of the questionnaire without this item was initially planned. Moreover, we believe that minimizing the items with reasonable diagnostic properties might facilitate the implementation of the test in busy clinical context. We, therefore, developed T-FRAIL_M2 by exploring the combination of 3 items in T-FRAIL and explore the diagnostic properties of those brief versions.

Statistical analysis

The content validity index for items (I-CVI) was chosen as the quantification method. Each item’s I-CVI is calculated by the proportion of experts who rated ‘3’ or ‘4’ of all. Excellent content validity required an I-CVI of 1.0 for at least three experts’ analysis [39]. The two-way mixed-effects model with the absolute agreement was used to calculate the intraclass correlation coefficient (ICC) of test-retest reliability. ICC values less than 0.5 were interpreted as poor, between 0.5 and 0.75 as moderate, between 0.75 and 0.9 as good, and greater than 0.90 as excellent [40]. Sensitivity, specificity, likelihood ratio for a positive and negative result, positive predictive value, and negative predictive value were calculated using 2 × 2 tables. Correlation between T-FRAIL and TFI score was demonstrated by Spearman’s correlation coefficient. Each item of the T-FRAIL was evaluated for frailty correlation by the phi correlation coefficient.

Results

The mean age of the study group (n = 300) was 70.4 years (SD 7.3 years), and144 subjects were male (48.0%). Frailty was diagnosed with the TFI in 120 patients (40.0%). Baseline characteristics of the participants are shown in Table 1. The content validity index of items (I-CVI) was 1.0 for all individual items in the T-FRAIL, indicating excellent content validity [39]. Test-retest reliability assessed in 30 patients yielded an intraclass correlation coefficient of 0.880 (95% confidence interval 0.76–0.94) which demonstrates good test-retest reliability [40].

Table 1 Characteristics of study population

T-FRAIL yielded an excellent accuracy against TFI, with an area under the receiving operating characteristic curve of 0.882. A strong correlation between T-FRAIL and TFI was represented by Spearman’s correlation coefficient of 0.708 (p = < 0.001). The cutoff-point for T- FRAIL of two points showed the highest Youden index of 63.1% compared to 43.3% of the original cutoff score of three. (Table 2) The cutoff of two points had a sensitivity of 77.5% (95% CI 69.0–84.6) and a specificity of 86.1% (95%CI 79.6–90.3), which was a better sensitivity than at three points. The positive predictive value of the two-point cutoff was 78.2% (95%CI 71.2–83.8), while the negative predictive value was 85.1% (95% CI 80.3–88.9).

Table 2 Frailty cut-point for original T-FRAIL with TFI as the reference standard

Elderly individuals with high comorbidities might be excluded from elective surgery due to the risks of the procedure. To explore its diagnostic properties, we modified the T-FRAIL by reducing the threshold of the “illnesses” criterion. A modification that reduced comorbid diseases from five to four (T-FRAIL_M1) and applied a cutoff point score of two improved test parameters (Table 3). Sensitivity was 83.3% (95% CI 75.4–89.5) and specificity was 82.8 (95% CI 76.5–88.0), which is much better than the original T-FRAIL at a cutoff point of three.

Table 3 Frailty cut-point for T-FRAIL_M1 with TFI as the reference standard

“Resistance” and “ambulation” were the best T-FRAIL items to identify frailty. The “resistance” yielded a sensitivity of 72.5% and specificity of 87.8%, while the “ambulation” yielded a sensitivity of 68.3% and specificity of 88.9% (Table 4). Another proposed modification of the T-FRAIL, each questionnaire item was combined into sets of three components (T-FRAIL_M2). A positive response from at least one item out of the three was considered as frail. The “fatigue”, “resistance”, and “illnesses” combination showed the highest Youden’s index of 66.4% with a sensitivity of 85.8% and a specificity of 80.6%. All modified questionnaires with three items were analyzed for sensitivity and specificity in comparison with the TFI. (Table 5).

Table 4 Sensitivity, specificity, and Youden’s index of each original T-FRAIL item
Table 5 Sensitivity, specificity and Youden’s index in T-FRAIL_M2

Discussion

The T-FRAIL was translated with a standardized protocol and achieved satisfying psychometric properties, including good or excellent content validity and test-retest reliability. The questionnaire was tested against the TFI and achieved excellent accuracy. Previous validity studies of the simple frailty questionnaire in other contexts have yielded varying results due to different methodologies and different populations.

The original FRAIL scale, using positive at three out of five item criteria, showed a significant correlation with impaired function in a cross sectional cohort [5], but the study did not investigate the correlation with the reference standard or using a different cutoff point. Nevertheless, several studies using various cutoff points were identified among later cross-culturally adapted versions of the FRAIL scale [27, 28, 41]. For example, using three points to diagnose frailty, the Korean (K-FRAIL) was tested against a comprehensive geriatric assessment-based frailty index as a reference standard and demonstrated excellent (90%) sensitivity but low (33%) specificity [27]. Meanwhile, the FRAIL-AR, using similar 3-point cutoff score, was compared with Fried Frailty Index with sensitivity and specificity of 72 and 67%, respectively [41]. Using a three-point cutoff score, we observed a significantly lower sensitivity with very high specificity, which may not be optimal for a screening tool.

Nevertheless, some studies in Chinese and Japanese populations (FRAIL-J) using a 2-point cutoff have yielded comparable results. In the Chinese study, A 2-point cutoff Chinese FRAIL scale represented a sensitivity of 90% and a specificity of 86% compared to the Fried Frailty Phenotype [28]. Similarly, the FRAIL-J was compared to Fried Frailty phenotype with a sensitivity of 77.8% and specificity of 88.9% [42]. Our modification using the original T-FRAIL at a cutoff score of two showed similar properties to the Japanese version. However, further modification by reducing the number of medical illnesses (T-FRAIL_M1), improves sensitivity from 77 to 83% with a better Youden’s index at 66%. Moreover, further modification of the cutoff point to be as low as one point for both the original T-FRAIL and T-FRAIL_M1 would yield a higher sensitivity of 93 and 94%, respectively, with a specificity of around 60%. This provides better properties compared to the 3-point score of K-FRAIL [27].

Another proposed method to modify the T-FRAIL modification is to lower the criterion of the questionnaire. According to the Study of Osteoporotic Fractures index (the SOF index) for frailty diagnosis, weight loss, inability to rise from a chair, and poor energy are sufficient to classify pre-frail and frail patients in comparison with the Fried Frailty phenotype. The SOF index also proved the relationship to clinical outcomes including falls, disability, fracture, and mortality in two large studies [43, 44]. We also modified the T-FRAIL instrument to improve its psychometric properties using a combination of three components (T-FRAIL_M2), resulting in a sensitivity and specificity of around 80%.

Although the T-FRAIL_M1 and the T-FRAIL_M2 improved sensitivity of the test, overall test properties were similar to the T-FRAIL. The AuROC of T-FRAIL_M1 and T-FRAIL_M2 were around 83%, while the original one was 81%. Despite its little difference, the modification leads to more sensitivity which serves author’s propose to use the questionnaire for screening test rather than confirmation test. Moreover, the T-FRAIL_M2, due to its simplicity, would be useful for primary care setting in rural area, where many citizens had lower education level. According to the Table 5, despite omitting the “loss of weight” criteria, the T-FRAIL_M2 (“fatigue”, “resistance” and “illnesses”) still had high performance for frailty diagnosing. This fact emphasized the T-FRAIL_M2 advantages in Thai context, whose most elders could not recall their previous bodyweight.

An optimal pre-operative screening tool should have high sensitivity and an acceptable specificity. The cutoff score of one point from five items from the original version of T-FRAIL and T-FRAIL_M1 would serve this purpose. Nevertheless, in resource limited settings, an alternative option could be the T-FRAIL_M1 with a 2-point cutoff score or T-FRAIL_M2 that provides reasonable sensitivity and better specificity.

The strengths of our study were the use of reference standard in addition to the standard translational procedure. The reference standard in this study was derived from large scale population in our country and has been studied for the validity in the local setting. Nevertheless, additional studies of predictive validity in stratifying risk in the elderly would be valuable. Further study to identify the elderly who might experience adverse outcomes or benefit from specific interventions using this test are needed. Limitation of the study was in relation to the studying in one setting in single center. All of our subjects were selected from elective surgery, the generalizability of the findings to other settings might be an issue. Additional studies among older people in similar setting at other center and in other medical context would ascertain the usability of the test.

Conclusion

The T-FRAIL demonstrated good validity and reliability for diagnosing frailty in older people preparing for elective surgery. A modified T-FRAIL using a lower cutoff point yielded higher sensitivity than the original cutoff point but gave lower specificity. Another modification was a reduction of the illnesses criterion or a combination with sets of three components increased sensitivity, while preserved excellent specificity of the test, making it suitable for use as a screening tool in practical use. A prospective study is needed to determine if the T-FRAIL and its modified versions may be useful to predict mortality, disability, or other clinical outcomes in different settings.