Background

In 2020, one billion of the world’s population was 60 years or older, with an increase of 400 million expected between 2021 and 2030 (13). As this demographic shift continues, exploring innovative mechanisms to promote healthy aging is an important global health and economic policy agenda. Advocacy for improved health across the lifespan to increase the likelihood of older people being functionally able and capable of doing what they value in older age is increasing.

The World Health Organisation (WHO) redefined Healthy Aging in 2015, taking a life-course approach in preparation for the predicted demographic shift globally. It was redefined as the life-long process of developing and maintaining functional ability (1), determined by intrinsic capacity (IC), the environment, and the interaction between these two factors (1). Intrinsic capacity refers to the composite of an individual’s physical and mental capacities across the five domains: locomotor, vitality, cognitive, psychological, and sensory (4). Higher IC levels are associated with decreased disability risk and better overall quality of life (57).

In the last two years, four scoping reviews relating to IC have been published (811). These reviews have focused on the sensitivity and specificity of WHO’s Integrated Care for Older People (ICOPE) step 1 tool in detecting loss of IC (11), demonstrated that IC predicts physical function, frailty, falls and quality of life over time (10), highlighted that there was a lack of consistency in terms of the domains and metrics used across studies (9), and queried if IC was as an underlying latent trait of all capacities rather than an aggregate summary measure of the sub-domain capacities (8). The scoping reviews thus far are yet to address IC’s biological (genetic) underpinnings. Intrinsic capacity is influenced by the person’s underlying genetic as well as the interaction between the person’s genetic make-up and their environment (including lifestyle). Also, research on IC is rapidly increasing, providing a basis for more recent reviews.

Our research group is researching to understand IC genetics better, leveraging existing cohorts where genetic data were collected. Within that context, the primary aim of this scoping review was to explore the existing literature to identify factors (especially genetics) relating to IC and to provide a current overview of knowledge regarding the measurement of IC, along with its predictive and construct validities.

Methods

Scoping Review Framework

We used the Joanna Brigs Institute’s (JBI’s) Preferred Reporting Items for Systematic Reviews and Meta-Analysis extension for Scoping Reviews (PRISMA-ScR) and Arksey and O’Malley methodological framework (1214). The process involved five stages, including defining the purpose, the research question, and the search terms (Stage 1); identifying relevant studies (stage 2); selecting studies that met the predetermined inclusion criteria (stage 3); mapping and charting the data obtained from the selected studies (stage 4); and collating, summarizing, and reporting of the review findings (stage 5).

Stage 1: Defining the research questions and search terms

This initial phase involved refining the scope and direction of the review based on a preliminary search conducted on Google Scholar. Through this step, we had background information on studies and search terms related to IC measurement, IC measures’ validity, health and health-related functional outcomes associated with IC, and its underlying biologic(genetic) basis.

“Generally, the research questions for this review were:

  1. 1.

    What are the IC measurement tools in literature? What are the approaches to computing composite IC scores and assessing the validity of the scores? This question aims to summarize research findings on IC domains used/found, their indicators, approach to developing composite scores, and the validation of indexes.

  2. 2.

    What are the different sociodemographic, health, and health-related factors associated with intrinsic capacity?

  3. 3.

    Does IC have a biological/genetic basis? What are the biomarkers associated with IC?

Inclusion and exclusion criteria

The Participant, Concept, Context (PCC) approach was employed to develop the eligibility criteria for study inclusion. The inclusion criteria were; studies conducted on human subjects of all ages (populations), focusing on the measurement of IC, its validation, association with socio-economic and health outcomes (concept), and in any setting – whether the studies were conducted in the community or institutional setting (context), published in the English language and published between 01/01/2015 – 20/10/2023. However, abstracts, conference proceedings, commentary, editorials, reviews, and personal opinions were excluded. No research records were available until WHO experts released the initial article operationalizing IC measurement (15).

Stage 2: Identifying relevant studies (Search strategy)

The literature search was conducted across Six Databases: PubMed, CINAHL, Web of Science, Scopus, AgeLine, and PsycINFO using the mesh terms and keywords “intrinsic capacity”, “intrinsic capacity decline”, “intrinsic capacity domains”, “intrinsic capacity impairment”, “intrinsic capacity index”, “intrinsic capacity model”, and “intrinsic capacity score” in the context of Aging. Each database’s detailed search string is presented in Supplementary Table 1 (Supplementary Table- 1).

Stage 3: Selection of relevant articles

Identified articles were imported into Clarivate Analytics EndNote 20 after the completion of the search, and duplicates were removed. Following this, two researchers (MB and AT) independently evaluated the titles and abstracts of the articles against the inclusion criteria. The two assessors thoroughly reviewed the full text of the selected articles. Articles that did not meet the inclusion criteria were excluded, and the reasons for their exclusion were documented and reported. Disagreements that arose during the selection process were resolved through discussion. The search outcomes and procedure for selecting or excluding studies can be observed in the PRISMA-ScR flowchart (Figure 1).

Figure 1
figure 1

PRISMA flow chart showing the steps of the literature search

Stage 4: Data extraction and synthesis

Using a data extraction tool, MB and AT collected various information from selected articles, including name of authors, publication year, characteristics of study participants, design and setting of the study, domains of IC measured, method used to calculate composite IC scores, validation approaches, and other relevant information. Supplementary Table 2 provides the data extraction form and the summary of information collected from the articles (Supplementary Table 2). The results collected from these selected articles were presented primarily using narrative descriptions and tables.

Results

Descriptive Summary

Our search strategy yielded 1498 articles, of which 398 were identified as duplicates and, thus, removed. After screening titles and abstracts (Figure 1), 986 publications were excluded, and full-text screening of 114 articles was conducted, resulting in 72 publications for full-text review. With a targeted citation search strategy, we found 12 additional articles relevant to our topic, and the final list for this scoping review was 84 articles.

Of the total 84 articles reviewed, the majority, 77 articles (92%), were published in the last two years, between 2021 and 2023.

The majority (51 articles, 61%) of the publications were carried out using samples sourced from Asia, of which 33 were from China. The remaining studies utilized study participants distributed across other geographical regions, including 20 (24%) studies in Europe (France (n=9), UK (n=4), Belgium (n=3), Spain (n=2), and Netherlands (n=1), and Norway (n=1)), North America (n=3), South America (n=5), New Zealand (n=1) (Supplementary Table-2). Four articles involving study samples from multiple continents. Out of the 84 reviewed articles, 33 were cross-sectional studies, 43 had a longitudinal approach (involving cohort, case-control, or longitudinal designs), and 8 were randomized control trials (Supplementary Table- 2).

Most (64 studies) have explored the validity of IC measurement in different ways. Some studies assessed the predictive validity (1620) by assessing if IC predicts future health outcomes, whereas others assessed construct validity through the cross-sectional association of IC with socio-demographic variables (2123), health and health-related functional outcomes (7, 20, 24, 25), mortality (2629), and quality of life (7).

Some of the studies have inquired into the structural validity of IC (30, 31), sensitivity and specificity analyses (23, 3234), tested internal consistency using Cronbach alpha (20, 35), performed ROC curve analysis (19), assessed criterion validity through logistic regression analysis (28), and conducted validation analysis by dividing the population into two as 70% for training and 30% validation cohort (26). The WHO has also published an expert consensus article on the measurement and validation of IC, providing a comprehensive working definition of vitality capacity (36).

The association of biological and environmental factors with IC

Biological markers with IC

Eight studies explored the association of IC with aging-related biomarkers. While specific studies estimating the heritability of IC are currently lacking, evidence based on the five IC domains suggests a heritability estimate of 20–85% (3754). Thus far, only one candidate gene study has been conducted, and this study showed a significant association of IC with ApoE carriage (26).

Research conducted by Lu WH, et al. (2023) showed an increased level of inflammatory markers such as Plasma C-reactive protein (CRP), interleukin-6 (IL-6), tumor necrosis factor receptor-1 (TNFR-1), monocyte chemoattractant protein-1 (MCP-1) and growth differentiation factor-15 (GDF-15) in individuals with lower IC (55). Another study by Lee WJ, et al. (2023) found that high serum levels of IL-6, CRP, hyperglycemia, and low dehydroepiandrosterone sulfate (DHEA-S) were associated with low IC (26, 56). Lower levels of serum albumin and folate (26), high homocysteine (55), Tumor Necrosis Factor Receptor 1 level (TNFR1) (57), Plasma N-Terminal Pro-B-Type Natriuretic Peptide level (58), or E-selectin (26) and increased allostatic load (26) were significantly associated with low IC. Recent studies have also reported associations between IC and plasma biomarkers reflecting inflammation (such as CRP, IL-6, TNFR-1, and MCP-1) and mitochondrial impairment (such as GDF-15, IF1) such that elevated levels of plasma interleukin-6 (IL-6), tumor necrosis factor receptor-1 (TNFR-1), CRP, growth differentiation factor-15 (GDF-15) and IF1 were associated with lower IC or faster decline in IC (5961). Details of all factors associated with intrinsic capacity are presented in Table 1 (Table 1).

Table 1 Association of intrinsic capacity with health and health-related outcomes

Lifestyle and socio-economic factors with IC

Exercise and lifestyle choices play a crucial role in IC, with studies revealing significant associations. Smoking is linked to lower IC (23, 64, 68, 7073), as is alcohol consumption (23, 71), and reduced meat intake (64). Interventional and cohort studies, respectively, underscore the positive impact of healthy eating (77) and fruits and vegetables and protein-rich diets on IC (96). A multidomain intervention has also been shown to enhance IC (102).

The exploration of socio-economic factors demonstrates noteworthy associations with IC. Age is inversely correlated, with lower IC found in older age (7, 2023, 32, 57, 6269), and women tend to exhibit lower IC (7, 20–23, 62, 63, 66, 67). Lower educational status (7, 20, 21, 23, 32, 64, 6668), low economic status (2022, 62, 64, 65, 67, 91), unmarried status (21, 23, 32, 64, 68), and urban residence are associated with lower IC.; however, being white race was associated with a higher IC (23, 64), and higher IC was observed in Chinese individuals compared to non- Chinese in Singapore (23, 97).

Social factors also play a role in IC. Lower social engagement, lower subjective social status, fewer social activities, and lower housing index are linked to lower IC (7, 32, 66).

Mortality and morbidity with IC

IC is a significant predictor of various health outcomes. It predicts multimorbidity (20, 65), mortality (10, 27, 28, 35, 56, 65, 74, 81, 9395), quality of life (5, 6, 10, 76), risk of dementia (87), cardiovascular diseases mortality (17), respiratory disease mortality (16), hospitalization (27), and complications related to hospitalization (95). Conversely, multimorbidity predicts declines in IC (68). In addition to predictive relationships, IC index has shown cross-sectional associations with quality of life (7), medication adherence (89), sleep health (21), nursing home-acquired pneumonia (90), polypharmacy (82), hypertension status (32), hospitalization (90), presence of chronic neurological illness (69), low self-rated health (82) and various health conditions, such as insomnia, memory loss, constipation, slowness, chronic obstructive pulmonary disease, and osteoarthritis (64).

Functional ability with IC

IC predicted functional difficulty parameters, including the future incidence of ADL (20, 24, 27, 35, 7481), IADL (20, 24, 27, 31, 66, 74, 76, 78, 82, 83), Frailty (24, 8488), disability/functional decline/dependence (6, 28, 65, 75), and have shown cross-sectional associations with ADL (7, 23, 25, 69), frailty (7, 25, 63) and IADL (7, 23, 25, 63). Moreover, IC was associated with fragility fracture (99), nursing home stay (35), life-space mobility (30), falls (24), incontinence (64, 82), and sarcopenia (18).

Measurement and validation of IC and its domains

Various approaches have been utilized in the process of constructing IC, each involving a different number and type of domain. Although most studies (n=38; 45%) utilized the five domains methodology (6, 7, 24, 2628, 30, 31, 5557, 62, 64, 71, 74, 79, 81, 84, 85, 94, 96, 98, 103107), seven studies utilized the bifactor structure (with one general domain (IC) and five sub-domains) (20, 22, 23, 35, 66, 67, 86). Five of these studies (20, 23, 35, 66, 67) comparing the goodness of fit when the bifactor approach was applied instead of the correlated factors and hierarchical model options found that the bifactor structure has better model fit statistics. Eight other studies adopted the six domains construct by dividing the sensory domain into two components, namely hearing and vision (21, 33, 75, 83, 87, 89, 91, 108), whereas eleven studies considered only four domains by excluding the sensory domain (34, 5961, 70, 73, 77, 88, 90, 93, 102) and another two studies used four domains excluding the cognition domain (16, 17). Three studies utilized other methods, including seven domains (65), eight domains (78), and no domain but summing up indicators directly (5).

Studies comparing the traditional (correlated factors) method, the bifactor method, and the hierarchical method consistently demonstrate that the bifactor model provides a superior fit compared to both the hierarchical model and the correlated factors models (20, 23, 35, 66, 67). In measuring IC, various methods have been used, ranging from simple approaches involving a single variable to complex composite assessments. Details of all methods used to operationalize each domain under the reviewed articles are shown in Table 2 (Table 2).

Table 2 Variables used to define/measure intrinsic capacity and/or domains

Computation of IC scores

Sixty-seven articles have created IC composite scores. Standard methods for constructing IC composite scores involve using factor analysis or principal component analysis (7, 25, 73), in line with original research relating to IC (20). Out of the nine studies utilizing factor analysis methods, two employed the traditional correlated five-factors approach (30, 98), while the remaining seven utilized the bifactor method, incorporating five specific factors and one general factor (20, 22, 23, 35, 66, 67, 86).

However, the majority 32 (47.8%) papers calculated the IC score by summing individual IC domains scores without weighting, using either a two-point scale (0-impaired/bad and 1-unimpaired/good) or a three-point scale (0-impaired, 1-slightly impaired, or 2-unimpaired) (18, 19, 21, 24, 32, 33, 5558, 6164, 68, 69, 71, 74, 75, 80, 8285, 94, 95, 97, 100, 106, 109, 113, 114). Other ways employed to compute IC were averaging the z-scores of the domains (70, 72, 81, 90, 93, 96, 102), direct summation of the values of indicators (5, 16, 17, 88, 101, 115), the latent growth modeling (LGM) method (31), weighted linear combination of indicators with loading greater than 0.3 (78), direct summation of indicators associated with domains in the regression model (26, 65, 79), averaging the domains’ average values (28, 59, 60), and a 2-parameter domains item response theory (IRT) which refers to direct categorization of IC as “0” for impairment in any domain and “1” for no impairment in any of the domains (6, 27).

Discussion

Some eight years since the WHO redefined Healthy Aging, there has been an exponential growth in research relating to IC, with almost all publications in the last 2 years and the majority (61%) leveraging data from Asia. Whilst studies have explored the association and predictive ability of IC with inflammatory, lifestyle, health, and socio-economic factors, research relating to genes and IC is rare. Many methods have been used to assess and score IC, making it vital that a consensus is reached globally.

While specific studies estimating the heritability of IC are currently lacking, evidence based on the five IC domains suggests a heritability range of 20–85%; specifically, genetic factors contribute to the variability in cognitive 50–70% (3739), sensory 20–30% (4043), locomotor 30–85% (4447), vitality 25– 65% (4850) and psychological 35–70% (5154) domains. There has been one attempt, through a candidate gene study approach, to identify genes associated with the broader IC domain. This particular study showed a significant association of IC with ApoE carriage (26). Understanding the interaction between genes and environment (including lifestyle factors) and IC is important. Before this, it is essential to identify genetic markers associated with IC. Such research may confirm the benefits of lifestyle or behavioral changes in helping people age well, including allowing the personalization of this intervention to the individual.

As outlined in the results section, our scoping review introduces new elements beyond those explored in the initial review addressing adverse health outcomes associated with IC. In addition to investigating biological and inflammatory biomarkers which are new, this review also identifies additional factors previously unexplored, which are related to socioeconomic (such as educational status, economic standing, marital status, ethnicity, residence, housing index, and social engagement); functional ability (such as fragility fracture, life-space mobility, nursing home stay, incontinence, and sarcopenia are also assessed); morbidity and mortality (such as multimorbidity, medication adherence, polypharmacy, hospitalization and its associated complications, sleep health, cardiovascular diseases mortality, and respiratory diseases mortality which were not addressed in the previous review) and behavioral and lifestyle-related factors (such as exercise, alcohol consumption, smoking, eating habits, and dietary patterns).

In the context of developing IC indices, no consistent method has been used thus far. New indicators for IC domains continue to be used, often without goodness of fit testing. Goodness of fit tests must be done while new indicators are used to determine whether those indicators are appropriate (118). Similarly, some studies used quick and direct tools, and others employed more complex and time-consuming composite measures. Balancing the precision of variable measurement with tool simplicity is crucial, and in the realm of measurement science, it is recommended to utilize straightforward yet robust instruments to assess variables like IC (119, 120). Studies also reveal that the bifactor method correlates with the five factors method and performs better than the correlated factors and hierarchical methods (20, 23, 35, 66, 67). There is also evidence of better explanatory power of the bifactor model when compared to the other two methods (121123). Whilst evidence supports that the bifactor method results have better conceptual clarity and fit than the other structures, the lack of standardization may introduce bias (128).

Furthermore, there needs to be an effort to reach a consensus in defining indicators for individual domains. Consensus about the indicators and methods is necessary to inform the planning of future cohort studies. Planned cohort studies, as opposed to leveraging already collected data, may have the advantage of enabling the collection of appropriate indicators to measure IC. Two primary approaches are being applied to compute the IC composite score: the CFA (20, 22, 23, 30, 66, 67, 86, 96) and the arithmetic sum/average of the values of the domains (19, 24, 32, 5558, 6264, 71, 74, 75, 82, 84, 85, 94, 106, 114). Both have demonstrated good construct and predictive validity. Concerning the approaches for measurement, the reflective versus formative nature of the IC measurement (i.e. whether IC should be considered as an underlying latent trait of all capacities or an aggregate summary measure of the subdomain capacities) shall also be considered in future studies (124).

Limitations and Strengths

The major strength of this review is that we followed a rigorous and stepwise screening process with the involvement of independent assessors. The findings were reported following the JBI’s PRISMA guideline extension for scoping review (PRISMA-ScR). The scope of the literature search was limited to peer-reviewed articles, and as a result, unpublished studies and organizational reports were not included. Due to logistic limitations, only articles published in the English language were reviewed, although studies published in other languages may have provided information on the external validity of IC tools in a multicultural setting.

Conclusion and recommendations

Since the introduction of IC for defining healthy aging and functional capacity, extensive research has been conducted to measure these indices and validate them, as well as to assess their relevance in medical, social, and behavioral sciences. This review identifies that there is a knowledge gap (no evidence) when it comes to our understanding of the genetics of IC. Such understanding can be improved by leveraging existing longitudinal studies but there is also a need for planned cohort studies where IC measurements are collected prospectively, and genetic data is also available. This review highlights the current difficulties in comparing existing studies given the lack of standardization in defining indicators that ought to be collected for each IC domain and how best to assess and score IC. Reaching such consensus is imperative because it will not only define the approach for the use of already collected data but will also support the planning and conduct of new longitudinal studies focused on IC and functional ability. The pooling of data globally to advance our understanding collectively would also be more likely through a standardized approach.