Introduction

Congruent with the global trend of population aging, the number of older people worldwide is projected to increase exponentially to over 1.5 billion in 2050 (1). It is therefore important for older adults to maintain their rights to health and enjoy good quality of life as they age. Different conceptual models have been put forward to encapsulate the notion of aging well, such as healthy ageing, active aging or successful aging (2); however, many of these models have hitherto focused on the absence of clinical diseases or the accumulation of deficits (3). Notably, there is a recent shift in focus from the perspective of mere presence or absence of disease to a function-based approach of healthy aging which is aimed at building and maintaining the functional ability of older people.

In 2015, the World Health Organization (WHO) published a world report on ageing and health and introduced the concept of intrinsic capacity (IC) to help define healthy aging (4). IC is the composite of all the physical and mental capacities of the individual and its interactions with the relevant environmental characteristics that determine the functional ability of that person, which is central for healthy ageing (5). The IC concept is aimed at measuring the capacities (as opposed to deficits) of multiple human biological systems based on body functions which are most relevant to healthy ageing (6, 7). Being a dynamic construct, its trajectory over time may inform clinical and public health actions, provided monitoring is contextualized at the individual or population level, respectively (8). Thus, assessment of biological age through constructs such as intrinsic capacity can enhance understanding of the functional trajectories and vulnerabilities of the individual and guide development of personalised preventive and therapeutic interventions that are tailored to the person’s age, comorbidities, and preferences. Studies have also quantified the concept of IC, demonstrated it to be a powerful predictor of subsequent care dependence, and suggested a possible structure (9).

To further operationalize intrinsic capacity in a clinical context, a clinical consortium on healthy ageing was held in 2017 by the WHO to identify core components of intrinsic capacity. Five key domains were proposed: locomotion, vitality, cognitive, psychological, and sensory. Experts in the consortium proposed components and suggestions of some tools that could be used as measures for each domain (8). Since then, studies have shown that it is feasible to measure intrinsic capacity using commonly used measures to predict an individual’s future functioning (1017). These studies used a variety of measurements for assessing intrinsic capacity in their respective domains in different settings and populations. Although early studies support the validity of the WHO Healthy Ageing framework built around the concept of intrinsic capacity, measures of different domains are not yet standardised and vary across studies (9). There is a lack of consensus how scores from the different domains should be computed to yield a representative composite index of intrinsic capacity. Moreover, it is unclear what are the factors which are associated with low intrinsic capacity in older adults.

Therefore, this study aims to address the knowledge gap and summarise the measurement for intrinsic capacity for the five WHO domains across different populations. It specifically aims to identify measurement tools, methods used for computation of a composite intrinsic capacity index and factors associated with intrinsic capacity among older adults. A rapid review was preferred over a systematic review or a scoping review to provide timely and relevant evidence in this emerging topic that has limited literature. Our findings would identify gaps that could help inform future research agenda on the topic.

Methods

Our method is guided by the rapid review guidance document from the Agency form Healthcare Research and Quality, which draws from mainstream systematic review methodology (18). The review question was formulated and refined after discussion with stakeholders comprised of senior clinicians who are involved in the care of older adults.

Search strategy

Our search strategy was crafted with the help of a librarian using the PICOTS framework (population, interest, comparison, outcomes, timeframe, settings), where applicable. The search terms and key databases or resources were identified through consultation with geriatricians who were considered experts in the field of frailty and related areas. Table 1 presents our search strategy in Medline. The following grey literature sources were searched using simplified terms such as ‘intrinsic capacity’, ‘healthy aging’, and ‘successful aging’: Google Scholar, bioRxiv, medRxiv. In addition, we searched conference abstracts from 2015 to 2020 for the following “high-yield” conferences: International Conference of Frailty and Sarcopenia Research, British Geriatrics Society, American Geriatrics Society, and European Geriatrics Society. Lastly, the reference lists of retrieved key papers were also examined for articles of relevance. The above searches were performed in the month of July 2020.

Table 1 Search strategy in Medline

Study Selection and Data Extraction

Three reviewers (PPG, OSP, PL) screened citations using Covidence, a web-based tool. Each citation was independently screened using title and abstract by two reviewers. The relevant citations then underwent a second stage full text screening by two reviewers. A third reviewer was involved in cases of uncertainty or disagreement. Selection was based on eligibility criteria presented in table 2 and in the PRISMA flow chart, Figure 1 (19). A data extraction sheet was created to extract information on publication information, study aims, measurements performed for each domain, information on composite score and its measurement. Reviewers (PPG, OSP, PL) tested the extraction sheet using two articles to calibrate understanding of the extraction fields, prior to independent extraction of the study data. Data extraction was performed by all three reviewers and crossed checked for accuracy.

Table 2 Eligibility criteria
Figure 1
figure 1

Prisma 2009 Flow Diagram

Data synthesis

Study characteristics and population demographics are presented in the form of frequencies and percentages. Information on the domain measurements and composite score were summarised and presented. Due to the significant heterogeneity in the sample populations, methods of intrinsic capacity assessment and statistical pooling of findings were not feasible.

Results

The literature search yielded 163 articles after removal of duplicates. Following the first round of title and abstract screening, 39 articles were eligible for the next full-text screening. Seven studies were found to fulfil the criteria for inclusion in this rapid review, Figure 1.

Study characteristics

The seven studies that fulfilled our selection criteria were published in 2019 and 2020, Table 3. Most of the studies were retrospective in nature (n=6, 86%); only one aimed to develop and validate an IC scale (14), while the rest were not designed to measure intrinsic capacity a priori. Of the seven studies, three were conducted in Europe and three in Central and South America. Six studies examined intrinsic capacity in older adults aged ≥ 60 years. All the studies involved participants who were healthy or with co-existing medical conditions. Five studies were conducted in the community setting, whereas one study each was conducted in nursing homes and in primary care. Four studies had a sample sizes between 1000 to 10000 participants, while two studies had less than 1000 participants. Only one study had a sample size of more than 10000 participants.

Table 3 Study characteristics and population demographics

Measurement of intrinsic capacity by domains

Locomotion

Performance based measures

All the seven studies employed performance-based measures for locomotion, and we have identified six measurement methods: Chair-stand, gait speed, standing balance, pick pencil, grip strength, and others, Table 4. Most of the studies adopted at least three components as measures (5 out of 8, 62.5%).

Table 4 Measurement tools and methods used for IC domains

All seven studies adopted all (10, 13, 16) or parts (12, 14, 15) of the Short Physical Performance Battery (SPPB) (20) that consisted of chair-stand, gait speed, and standing balance. Notably, both chair-stand and gait-speed were conducted with variation across studies. For example, while time taken to sit and rise five times was commonly measured in the chair stand test (17), chair raise speed was calculated by Gutiérrez-Robledo et al (12) using the number of times an individual rose from a chair divided by total time taken.

For the measure of gait speed, the distance adopted among the five studies varied between 2.4 to 5 meters (10, 12, 13, 15, 16) while one did not indicate the distance (12). Measurement was scored by the quicker time (12, 15) or mean time of two repeated tries (9) with varied definition on what was defined as having capacity (15), or graded capacity such as from worst to best (13, 16, 20, 21). For standing balance, all three studies were consistent in their methods by using hierarchical standing balance that consists of the side-by-side, semi-tandem, and full-tandem component for 10 seconds each (10, 13, 16). Consistent with scoring protocol for SPPB, performance was rated from 0 to 4, with 4 being the highest score.

In addition, other methods were also adopted in conjunction with SPPB or its components. Robledo et al (12) used a pick-pencil test, alongside chair-rise and gait-speed, for the measure of locomotion. In another study by Gutiérrez-Robledo et al (14), handgrip strength was used together with gait speed to measure locomotion.

Self-reported assessment

While Ramírez-Vélez et al (16) adopted the three tests in SPPB, they also included other measures such as sarcopenia, prevalence of falls, impairment in activities of daily living, and mobility/disability.

Vitality

Four studies used 1 method to assess vitality and only one study used three assessment methods for this domain.

Performance based measures

Three studies assessed anthropometric measurements using body mass index (12, 13), abdominal circumference (13) and mid upper arm circumference (15), or body composition using phase angle derived from bioimpedance measurement (14). Two studies examined lung function using the forced expiratory volume (10) and peak flow test (12). Charles and colleagues also included the measurement of handgrip strength as an indicator of vitality (13).

Self-reported assessments

Participants were asked about any unintended weight loss and appetite in three studies (15, 16). Mini Nutritional Assessment (13, 15) as an indication of nutritional status was also employed in the assessment of vitality.

Blood biomarkers

Only one study measured biomarkers (dehydroepiandrosterone and insulin-like growth factor (10) as an assessment of vitality).

Cognition

Performance based measures

All of the studies adopted performance based measures with majority of the studies (4 of the 7; 63%) using MMSE (12), modified MMSE (12, 16) and sub-components of MMSE (13) to assess cognition, Table 4. For example, Gutiérrez-Robledo et al (12) used a modified MMSE to assess cognition. Scores were standardized and stratified by education to define 3 cut-off points (severe deterioration [>2.5 SD], slight deterioration [≥1.5 to ≤2.5], optimal [<1.5SD]).

Three studies used other methods to assess cognition. One study used questions on orientation in time, space, and recall (17), one used using neuropsychological tests such as delayed recall, animal category fluency and letter cancellation (10), and Community Screening Instrument for Dementia (CSI-D) (14). In the latter study, scores in the cognitive tests were used as measures of the respective cognitive function, namely memory, executive function, and processing speed, respectively. Lastly, one study used a cut-off of ≥29.5 on the CSI-D to assess cognitive capacity, with scores below that threshold indicating ‘probable dementia’ (15).

Psychological

Self-reported assessment

All the studies used self-reported assessment scales or questions to assess depressive symptoms. These studies used a variety of tools to measure depression, with some using tools tailored specifically for older adults. The most commonly used scales were the Geriatric Depression Scale (GDS) (12) and the Center for Epidemiological Studies Depression scale (CES-D)(10, 14). Other studies utilized the EURO-D depression scale (15), a single item from EuroQOL-5D on «anxiety or depression» to identify presence and severity of anxiety or depression (13), and two questions to assess presence of depressed or hopeless feelings, and the experience of no interest or pleasure (17).

In addition to depressive symptoms, other symptoms in the psychological domain that were assessed include fatigue by using two questions from CES-D (13), sleep disturbance (10), and mental health problems (16). Gutiérrez-Robledo et al (12) included the most components for the psychological domain, measuring life satisfaction, locus of control, and social participation, as well as symptoms of depression.

Sensory

Performance based measures

Performance tests to assess hearing included the whisper test where the researcher whispers a two-syllabus word into the participants’ ears (16); audiometry test (16); or automated app-based digits-in-noise test (17). With regards to performance tests to assess vision, the Snellen eye test was adopted in only one study (14).

Self-reported assessments

All studies assessed vision and hearing for the sensory domain using self-reported questionnaires to assess either vision and/or hearing, Table 4. Only two studies (10, 13) used self-reported questions (such as the Strawbridge questionnaire) to assess deficits in hearing or vision. Some of the questions related to vision included the ability to see far, to read (10, 12, 17) and poor eyesight that interfered with daily activities (15). Similarly, participants were asked about their hearing in general (12) and if they have hearing problems or deafness that interfered with their daily activities (15).

Methods used for estimating composite intrinsic capacity score

Three studies derived a composite intrinsic capacity score (10, 12, 14). In two of these studies (12, 14), each domain of the 5 components of IC was further divided into three categories, assigning a numerical score as follows: optimal=0, mild impairment=1 and severe impairment=2. Scores of each domain was then summed to yield a complete intrinsic capacity index, ranging from zero (worst IC possible) to ten (optimal intrinsic capacity possible). The intrinsic capacity composite score for one study showed low internal consistency (Cronbach alpha=0.49), whereas factor analysis demonstrated substantial clustering of each domain to a single factor (14).

Beard et al (10) used the following tests to measure intrinsic capacity for the five domains: gait speed, chair-stand and standing balance tests of the SPPB (Locomotion); dehydroepiandrosterone (DHEA) and insulin-like growth factor (IGF-1), and forced expiratory volume (FEV) (Vitality); verbal fluency, delayed verbal memory, and letter cancellation tasks (Cognition); depression using the CES-D scale and sleep disturbances (Psychological); and self-reported hearing and visual impairment (Sensory). A bi-factor model that represented a pure measure of the underlying latent trait of interest, after controlling for all five specific subfactors, was used to estimate the individual factor score. The summary score of intrinsic capacity and specific subfactors showed good construct validity and displayed significant differences between sex, age, disease, and functional status.

Factors and outcomes associated with intrinsic capacity

Studies reported a decline in intrinsic capacity with increasing age (12, 14, 15). In women, the odds of having a lower intrinsic capacity score are 72% higher than among men (12). Socioeconomic condition (SEC) level and intrinsic capacity are positively associated, such that the higher the SEC level, the higher the intrinsic capacity score (12). In the domains of continence, hearing, and vision, declines in capacity were more pronounced after the age of 80 (15).

A unit increase in balance performance and in nutrition score decreased the probability of death by 12% and 4% respectively (13). The risk of falling decreases, when there is a one-unit increase in balance performance and in nutrition score. Low scores in nutrition are associated with a higher probability of autonomy decline (13). Declines in cognition and continence are strongly associated with incident dependence, and declines in psychological capacity, locomotion, and nutrition are moderately associated, while sensory capacities (vision and hearing) are not associated with incident dependence (15). Participants with optimal handgrip strength had better intrinsic capacity, while gender-stratified analyses revealed that men with optimal handgrip strength had a lower risk of hospitalization (16).

Multimorbidity has an independent direct relationship with incident loss of basic activities of daily living (ADLs), but not instrumental activities of daily living (IADLs). However, a greater proportion of the indirect effect on incident loss of ADLs and IADLs was mediated by intrinsic capacity than by multimorbidity (10). When compared with those without co-morbidity, the odds of having a significant decrease in intrinsic capacity score rose significantly (51%) among those with three or more conditions (12).

Discussion

Consistent with the ongoing shift in conceptual focus of healthy aging from a disease or deficit-centric to a function-centric model, the concept of intrinsic capacity was conceived to define a composite of all physical and mental capacities of an individual, and which interacts with the environment to determine functional ability (4). Our rapid review aimed at providing timely understanding of how measurements of the domains were operationalized to date, and the factors and outcomes associated with intrinsic capacity. Timely review of this information would facilitate ongoing pilot implementation of ICOPE guidelines worldwide, and facilitate better understanding of individual functional trajectories and vulnerabilities during a catastrophic event such as the current Covid-19 pandemic (22). Although a few studies have documented the measurement of intrinsic capacity and its association with health outcomes, not much is known about the collective body of evidence. We focused on studies that assessed all five intrinsic capacity domains, unlike a recent critical literature review by Gonzales and colleagues (9) which included studies with any of the five domains. Both reviews examined the assessment of intrinsic capacity, and the tools used to measure the domains. However, our review goes beyond Gonzalez et al (9) to document the association between intrinsic capacity with age, gender, socio-economic status, nutrition, mortality, falls risk, hospitalisation, ADLs, IADLs and multimorbidity.

Among the included studies, the majority were retrospective studies using data collected in the community (fig 3) and hence, were not primarily aimed to assess intrinsic capacity. Most commonly used tools for assessing intrinsic capacity were gait speed test and chair stand test (locomotion), grip-strength and mini-nutritional assessment (Vitality), mini-mental state examination (Cognition), Geriatric Depression Scale (GDS) or Centre for Epidemiological Studies Depression scale (CES-D), and self-reported vision and health questionnaires (Sensory). However, we found heterogeneity and low concordance in operationalization of some of the domain measurements, particularly the vitality and psychological domains, which would make cross-study comparisons difficult. In addition, even within the same tool, there were variations among the studies. For example, in the measure of gait speed in the Short Physical Performance Battery test, the distance executed in the studies and calculation of the speed were varied (10, 12, 13, 15, 16). Reasons for the heterogeneity and low concordance could be in part due to the fact that most studies utilized retrospective data that were designed to measure other aims, Table 3. Hence, they were not primarily aimed to assess intrinsic capacity per se. Only one on-going study, the INSPIRE-ICOPE-CARE (16), set out to measure the five domains of intrinsic capacity a priori.

Of all the domains, vitality is the most heterogeneous in its operationalization. It was assessed using different measures such as body composition, lung function, nutritional status, physical strength, specific molecular biomarkers and use of health questions related to weight loss and appetite. A similar observation in the vitality domain was also reported in a systematic review (9). Vitality has been proposed as the key domain in the construct of intrinsic capacity that drives the other domains (23). In this iteration, cellular level characteristics as well as the contribution of higher physiologic systems are included within the vitality construct; when the accumulation of accumulated deficits in these systems reach a certain point, they become manifest in the overt losses of capacity that are commonly associated with ageing (24). Using this conceptual framework, a total measure of vitality may thus capture an individual’s ‘biological age’ (10). Therefore, it is important to have a clearer illumination of the concepts pertaining to vitality and to achieve consensus on its appropriate measurement and weightage.

Validity and reliability are two key attributes to consider when selecting instruments for measuring any health outcome (25). In our review, we found that most of the instruments were validated. Performance-based tests and blood molecular biomarkers were used for locomotion and vitality domain, respectively. Self-reported measures were mostly used for psychological and sensory domains, which may increase social desirability bias and recall bias (26, 27) as compared to the other domains. On the other hand, none of the studies employed self-reported measures in the cognitive domain; the complementary use of validated scales or questions that evaluate subjective memory complaints such as the AD8 may be less susceptible to ceiling effects or educational bias in bedside mental status tests such as the MMSE (28, 29). Validity and reliability of intrinsic capacity composite score can also be further improved by using measurement tools less prone to bias (30) and using appropriate weightage for the different intrinsic capacity domains.

Of the seven studies identified in our review, only three operationalised a global intrinsic capacity score. Scores from the different domains were combined to create a global intrinsic capacity index using two approaches. Beard and colleagues (10) used an integrated approach to estimate factor score for each study participant, while the others (12, 14) used a summation of individual domain score to compute the intrinsic capacity index. Though the three studies provided information on the measurement of intrinsic capacity for the five WHO domains, information on summation methods and weightage were not adequately presented in the studies. A composite intrinsic capacity which is a summative measure of individual domain scores for the five WHO domains can be used, akin to the growth charts used to chart development in children, so that if population gender-specific norms are established, any significant deviation could be detected early with the early institution of treatment (31). The individual domain scores would provide information on one’s affected intrinsic capacity domains, which could guide interventions.

One the other hand, the remaining studies utilized information from each individual domain for feedback (13, 1517). Among this group of studies, variations on the measure of decline were observed. Three studies measured the domains by either using one or at least two measures, of which the average results in the measures were used to determine capacity or decline in that domain (1517). The fourth study in a nursing home population used between two to four measures for each domain and identified balance performance and nutrition (as measured by the MNA) to predict adverse health outcomes (13). Three of these studies were retrospective in nature, with goals to identify associations of the domains with adverse outcomes and dependence (13, 15, 16). The only prospective study was aimed at flagging decline, which would alert the need for further assessments and interventions (17). The overall goals in the studies were to measure and identify ways to use the five domains to detect one’s intrinsic capacity, which might help inform actions and interventions in clinical practice and healthcare systems. However, our current review highlighted that the current adopted methods to measure the domains were heterogenous and varied, which would present challenges in validating the use of IC across different populations.

Limitations

The study had few inherent limitations of a rapid review. It was conducted to provide evidence to stakeholders within a short time frame. In view of the time constraint, trade-offs in methodological choices were made, such as restricting the research to Medline and English publications. This may have resulted in omission of relevant data and left the review susceptible to selection and language bias. Despite these limitations, the review provided a timely synthesis of the evidence on the measurement for intrinsic capacity for the five WHO domains across different populations.

Conclusion

Intrinsic capacity in the context of healthy ageing has been measured using self-reported questionnaire, performance-based tests, and laboratory investigations. There is variation in measurement process or tools used for assessing intrinsic capacity domains, particularly in the vitality and psychological domains. A standard intrinsic capacity score for clinical or community-based has not been operationalised yet. Further validation of the intrinsic capacity concept and computation of composite intrinsic capacity score using validated scales is needed.