Background

The general approach to studying older people's health has been to look at relationships among measures of chronic conditions, cognition, frailty and steps along the disablement pathway [14]; the goals have been to identify determinants and rates of decline and recovery [514], and predict mortality [15], institutionalization [16, 13], and service utilization [1721]. Although these measures serve as valid outcomes predictors, they relate differently to various dimensions of health. Trends in any one of them are not evidence of health trends overall [22, 23]. Evidence suggests that population subgroups have qualitatively and quantitatively different patterns of change and probabilities of adverse outcomes [24, 9, 6, 26, 27, 14, 25, 8]. In fact, elderly populations are highly heterogeneous in their health status owing to the variability and interdependency in how health dimensions manifest themselves overtime. Thus, it becomes increasingly clear that health changes cannot be fully described by any one dimension. Instead, it takes an approach that allows for multiple measures of health to embrace this complexity.

Many valid approaches exist to study the relationship between multiple health indicators and the unobservable (or latent) construct of health [28]. The choice depends, among other considerations, on the distribution of health indicators. Because health indicators commonly used in geriatrics are rarely continuous or normally distributed, factor analysis is unwieldy. Instead, we chose a latent class analytical framework [29], also called finite mixture models [30]. Latent class analysis (LCA) is a "person-centered" approach designed to divide the population under study into latent subpopulations (i.e. classes) that share a distinct interpretation pattern of relationships among indicators [31]. In populations of older people with complex health needs, observed health indicators can serve to identify groups of individuals – unobserved a priori – who present with similar constellations of health problems. The goal is to find the smallest number of health profiles that can describe the association among the set of observed health indicators.

Latent class models present several advantages over classical statistical models (e.g. cluster analysis) [32, 33]: the classification is model-based and statistical diagnostic tools exist to assess the quality of the classification, variables may take several forms, there is no need to make parametric assumptions about the relationship between observations, and covariates can be included during model estimation to describe the groups. For the prediction of a dependant variable, LCA offer a parsimonious alternative to models with an unmanageably high number of variables and interaction terms.

Whereas LCA studies class membership using cross-sectional data, latent transition analysis (LTA) studies change in class membership using longitudinal data [31]. LTA combines the cross-sectional measurement of health state profiles and the description of transition over time from one health profile to another [34]. It thus supports the analysis of qualitative change in health status in contrast to continuous models were quantitative change is observed.

Several applications of LCA exist in the social, medical and economic literature, where heterogeneous populations constitute the typical object of analysis. In geriatrics, LCA was used to test criterion validity of physical frailty [35], measure mobility disability [36] and study behavioral syndromes in Alzheimer' patients [37]. We found no example of its use to model heterogeneity in older individuals' health status. Grade of membership (GoM), another latent classification technique [38], was successful in identifying clinically meaningful health profiles in community-living elderly [3942]. GoM assumes that individuals can be partial members of more than one class of a continuous distribution of latent variables [43]. In contrast, LCA assumes that individuals are full members of one of the class for a discrete latent variable. Practically, LCA classifies individuals in health profiles whereas GoM estimates their degree of proximity to each profile. To our knowledge, these studies did not study transitions in profile membership; though Portrait et al. [41] studied mortality. Conversely, Markov models were used to describe transitions along the disablement pathway [14, 9, 8, 44, 45]. Although similar in spirit to LTA [34], these models focused on observable disability indicators as opposed to directly unobservable health status profiles.

The objective of this paper is to identify profiles of health status and study their evolution in a population of frail elderly individuals living in the community. We used LCA to group individuals into homogenous categories of health status based on observed indicators of prevalent health problems. We then applied LTA to study transitions in profile membership over 2 consecutive periods of 12- and 10-months, respectively. The question we address is whether there exists a latent classification that makes substantive sense to study the dynamic of health status in older populations presenting with complex care needs. Many research and policy questions call for a person-centered as opposed to a variable-centered approach [31]. Practical examples include planning of long-term care resources and identification of appropriate sub-groups for prevention studies. We identified and validated a classification that captures the multidimensional and dynamic nature of health, and show how a person-centered approach could prove useful to address these important questions.

Methods

Data sources

This research uses data collected for the randomized trial of the SIPA program (French acronym for System of Integrated Care for Older Persons), carried out in Montreal, Canada (1999–2001). SIPA aimed at improving continuity of community- and institution-based care for older people with complex health and social needs; its distinguishing features and the results of the experimentation are described elsewhere [46, 47]. Eligible people were older than 64 years, competent in French or English (either the individual or the caregiver), had a score of -10 or less on the Functional Autonomy Measurement System (SMAF; -87 represents the worst health state) [48], and no plans for institutionalization within 3 months. Participants were recruited from two public community organizations responsible for home care (with a small proportion from other sources) and randomly assigned to the SIPA program or usual care. Health and sociodemographic data were collected via structured home interviews at baseline (T0), 12-months (T1), and 22-months (T2). Living status (deceased or not) and information on nursing home use come from administrative databases. Given the short intervention period, SIPA had no effect on change in health status or mortality [46]. For our purpose, we thus combined the two groups and included those who had a baseline questionnaire (n = 1164). The research ethics committee of the Montreal Jewish General Hospital granted approval for these secondary analyses.

Health indicators

LCA uses observed response patterns on a set of health indicators to model heterogeneity and reveal health profiles. We selected 17 indicators based on their high prevalence in older populations. Chronic conditions (no/yes) include self-reported hypertension, stroke, diabetes, cancer as well as circulatory, respiratory, joint and arthritis, stomach and bladder problems. To increase validity, individuals were asked whether a physician had confirmed the diagnosis. Sensory limitations (no/yes) refer to self-declared problems with speech, audition and/or vision. Cognition is measured with the Short Portable Mental Health Questionnaire (scores < 4 indicate no cognitive impairment; scores ≥4 indicate cognitive impairment) [49], and depression with the Geriatric Depression Scale (GDS scores < 3.5 indicate no depression; scores between 3.5–8.5 indicate moderate depression; scores ≥8.5 indicate severe depression) [50]. Functional limitations (no/yes) are defined as difficulty performing upper limbs (≥ 1; raising arms, picking/handling small objects, lifting 5 kg) and lower limbs (≥ 1; pulling/pushing large objects, bending/kneeling; using stairs) movements [51]. Disability is defined as requiring help with mobility activities of daily living (ADL) (getting up from bed/chair; using the toilet; taking a bath/shower; moving around the home, going up/down stairs, walking one block); personal care ADL (eating, drinking, dressing upper and lower body, personal grooming, washing) [52]; and instrumental ADL (IADL; using the phone, transportation, shopping, meal preparation, light housework, taking medication, managing money [53]. Disability measures are coded as three-levels categorical variables. The sociodemographic covariates used in our analyses are gender, age (64–75; 75–84; 85+ years) and living arrangements (whether people live alone or not).

Analysis Strategy

LTA requires a step-wise approach to be accurate in its account of the change process. Using Mplus [54], we first explored alternative LCA models at (T0) to reveal the latent class variable that best captures the health status of our sample (i.e. number and characteristics of health profiles). Then, we fitted LCA models at (T1) and (T2) to examine the profiles' stability over time. Finally, we fitted LTA models to study transitions in health status.

Baseline health state profiles

Under latent class theory [55], individuals are assumed to belong to one of a number of unobserved categories (i.e. latent classes). Here, these categories represent health status profiles. It is assumed that a sufficient number of profiles result in conditional independence among observed health indicators [29]. In Mplus [54], LCA relies on maximum-likelihood methods to estimate the posterior probability p [c/Y] of membership in health state profile c for an individual with an observed health indicator response pattern Y. These posterior probabilities determine the relative prevalence and serve to characterize the profiles [28].

Two types of parameters serve to estimate class membership: health indicator and health profile probabilities. Health indicator probabilities are profile-specific and consist of the probability that a response is associated with the profile (e.g. probability of cognitive problems given membership in health profile 1). Within classes, individuals have comparable health indicator probabilities. Health profile probabilities represent individuals' probability of belonging to each profile; assignment to one profile proceeds on the basis of their highest health profile probability.

Using data for our 17 health indicators, we fitted LCA models successively for 1 through 5 classes. We used multiple start values to avoid convergence on local maxima [34] and assumed a missing at random mechanism (MAR), i.e. missingness depends on the observed components of the complete data and not on missing data [56]. For most health indicators, MAR is justified given the low proportion of missing data. For depression, however, the depression score is missing for all individuals with cognitive impairment (n = 174) and for those who are missing on the cognitive test (n = 101). To test the effect of the MAR hypothesis on the classification, LCA models were fitted with and without the depression indicator for 1) the complete sample (n = 1164), 2) a sub-sample with complete data for depression (n = 890) and 3) a sub-sample that excludes individuals with missing data for the cognition and depression scores (n = 1063). These sensitivity analyses showed that although depression significantly contributes to the classification, assuming a MAR mechanism for missing depression scores did not significantly change how individuals are grouped.

We decided on the best model based on the lowest values observed for the Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC) and adjusted BIC (aBIC), which combine goodness of fit and parsimony [34]. The adjusted Lo-Mendell-Rubin likelihood ratio test (aLMR-LRT) served to decide on the number of classes [57]; it compares improvement in fit (p < 0.001) between sequential class models through an approximation of the LRT distribution. We used an entropy measure to assess how well the model predicts class membership; values range from 0 to 1 and high values are preferred [54]. Bivariate residual statistics served to confirm local independence [34].

Including covariates in LCA models can serve three purposes: describe the formation of health profiles, characterize and validate them. This is accomplished by the concurrent identification of the latent health profiles variable and its multinomial regression on covariates of interest [29, 36]. This approach avoids the limits of post-hoc regressions (i.e. performed on a priori classified individuals), which assume that the latent classification is an observed variable measured without error [29]. We found that including gender, age and/or living arrangements as covariates did not significantly influence the formation of the profiles. Therefore, the LCA model without covariates is our basecase. LCA regression models are used for characterization and validation.

To further validate the profiles, we fitted 2 LCA models to estimate the association between profile membership and distal outcomes: mortality and use of nursing home services at 22-months. These associations were estimated by allowing the proportion for each outcome to vary across profiles [54]. Age and gender are included to control for confounding. Differences between classes are reported as odd ratios. Finally, to evaluate construct validity, we measured the relationship between class membership and disability measures (ADL-Personal care, ADL-Mobility, IADL), cognitive status and comorbidity (# chronic conditions) with chi-square and contingency coefficient statistics.

T1 and T2 health state profiles

Using available data, we proceeded the same way to identify health status profiles at subsequent time points. To assess whether the health status profiles identified at T1 and T2 have the same substantive meaning as those identified at baseline, we first compared patterns of health indicators probabilities over time. Then, we compared the classifications to those obtained by LCA models constrained to have baseline class-specific health indicator probabilities. The concordance (%) in how individuals are grouped is used to confirm the classification' stability over time. This also allows testing of the classification' robustness to death and lost to follow-up (LTF) attrition.

Latent transition analysis

Building from LCA, LTA studies change in class membership using longitudinal data [31, 58]. Health indicators are measured repeatedly over time to identify profile membership at each occasion. The transition probability matrices are estimated by a logistic regression for nominal response (e.g. probability of health profile membership at T1, conditional on baseline health profile membership) [33, 54]. When covariates are included in LTA, transition probabilities are no longer conditioned only on the previous time(s) membership but also on covariate values [59].

We ran two separate LTA: one to assess transitions in health status between T0 and T1 (LTA-T0T1); one to assess transitions between T1 and T2 (LTA-T1T2). Age and gender are included in our models – a provision that allows us to compare transition probabilities across these covariates. Death and LTF are modeled as absorbing states. Thus, each model adjusts the probability of health status transition for the competing risks of death and being LTF. Finally, we imposed measurement invariance (MI; constraining conditional health indicator probabilities to be the same across time points) to ensure health profiles have the same meaning at each occasion. The plausibility of MI was confirmed by comparing the classifications with and without MI constraints: the differences in classification were not significant (χ2; p < 0.001) and information criterion favored the constrained model.

Results

Sample description

The baseline sample consists of 1164 people between 64 and 104 years old, with 70.9% female and considerable socioeconomic variability (Table 1). Comorbidity ranges from none to seven chronic conditions. Comparisons of health indicators proportions reveal great heterogeneity in health status (Table 2). Given the low proportions of missing values, attribution under the MAR hypothesis had no effect on health indicator proportions, except for depression, cognition and bladder problems. For these indicators, Mplus modeled proportions appear in parentheses.

Table 1 Sample characteristics at T0; % of deceased and LTF individuals at T1 and T2 *
Table 2 Health indicator proportions (%) at baseline, 12 months and 22 months*

At 12-months (T1), 11.5% (n = 134) of the sample had died, 7.6% (n = 88) did not complete the interview and 12.4% (n = 144) were lost-to-follow-up (LTF). At 22-months (T2), 20.8% (n = 242) of the sample had died, 38.2% (n = 446) were LTF. Individuals who died were more likely to be male, older; they had worst functional scores and were less likely to live alone. In bivariate analyses, LTF individuals, either at T1 or T2, did not significantly differ from the rest of the sample on sociodemographic characteristics or baseline health status (i.e. IADL, ADL-personal care, cognition, depression, sensory deficits, functional limitations) except for ADL-mobility at T1 (χ2, p < 0.05) and sensory deficits at T2 (χ2, p < 0.05). In multivariable models, only ADL-mobility remained significantly associated with LTF status at T1.

Health state profiles

LCA models estimated for each time point suggest that a 4-class solution provides the best overall fit and explanation of the observed health indicators frequencies. The classification identified at baseline was reproduced in the T1 and T2 LCA as well as in LTA. Accordingly, we describe the four health state profiles identified at baseline. We then present results for subsequent steps with reference to that basecase.

Table 3 presents model fit statistics for LCA models fitted at baseline, T1 and T2. At baseline, increasing the number of classes improved the classification up to the fifth class (i.e. the LMR-LRT are no longer significant). Information criterion statistics suggest the four-class model best fits the data. The quality of the classification for that model is high (entropy: 0.805), with no identification problem (condition number: 0.233) and no major violation of conditional independence. LCA performed at T1 and T2 also point to 4 health status profiles.

Table 3 Model fit statistics for latent class analysis at baseline, T1, T2

Table 4 presents profile-specific health indicators probabilities for the four latent classes (λ; presented as %). These probabilities express how individuals within a profile differ from those in other profiles at each time point. For example, at T0, the first two profiles are characterized by high probabilities of cognitive problems (λ = 0.687 and λ = 0.858, respectively), whereas the other two are not (λ = 0.078 and λ = 0.139). Profiles are assigned a label to substantiate these differences.

Table 4 Health indicators distribution and conditional probabilities per health profile *

All selected health indicators significantly contribute to the classification (p < 0.001). However, the pattern of relationships along the cognitive and physical dimensions best describes the profiles' distinguishing features. Severe cognitive and physical impairments characterize the first health profile. Individuals have high probabilities of cognitive disorders, chronic conditions, stroke, sensory problems, and functional limitations. Their high probabilities of disability in IADL, personal care and mobility ADL capture the severity and combined consequences of these problems. This group is labeled "Cognitively & physically impaired" (Cog&Physic-Imp) and represents 23% of the sample at baseline. The second health profile is predominantly "COGNITIVELY IMPAIRED" (Cog-Imp), with minimal physical impairments. These individuals (11.4%) report relatively low probabilities for chronic conditions and functional limitations. The likelihood for them to present with ADL disability is comparatively low but high for severe IADL disability.

Individuals in the third health status profile have the highest probabilities for chronic conditions, but no cognitive problem. They are very likely to report depression, functional limitations and mobility disability, but unlikely to require help for personal care. We labeled this group "PHYSICALLY IMPAIRED" (Physic-Imp). Finally, we found a "RELATIVELY HEALTHY" (R-Healthy) profile. It comprises older people who report comparatively less chronic conditions (circulatory problems, respiratory diseases, arthritis, depression; p < 0.01) and who manifest low probabilities of disability, functional limitations and cognitive disorders. The later two profiles represent, respectively, 35.6% and 29.9% of the sample at baseline.

Comparison across time points of profile-specific health indicator probabilities (Table 4) shows that the profiles revealed at T1 and T2 correspond to constellations of health problems equivalent to those observed at baseline. When we compared T1 and T2 profiles with those obtained with T1 and T2 models constrained to have baseline conditional health indicator probabilities, the concordance in how individuals are grouped reaches 90% at T1, and 88% at T2. At T1, the concordance is 96% for the Cog&Physic-Imp, 92% for the Cog-Imp, 83% (p < 0.01) for the Physic-Imp and 86% (p < 0.05) for the R-Healthy. At T2, the concordance is 83% (p < 0.05) for the Cog&Physic-Imp (with 13% classified as Physic-Imp), 100% for the Cog-Imp, 69% (p < 0.001) for the Physic-Imp (with 27.4% classified as R-Healthy), and 95.6% for the R-Healthy. These differences in classification reflect the net progression of the sample as a whole towards a more compromised health state (as seen in tables 2 and 4). Although this results in an upward shift in the "severity" of health profiles at T1 and T2, each maintained its substantive meaning. Combined, these results confirm the stability of our classification, despite mortality and LTF.

The health indicator probabilities predicted by LCA are consistent with the proportion of people who have contributed each response patterns in the actual data. Furthermore, chi-square and contingency coefficients (Table 5) for health state profiles versus disability measures, comorbidity and cognitive problems confirm both the qualitative differences between health profiles and their coherence with key measures of health status. The correlation with the comorbidity measure is much lower.

Table 5 Health indicators' prevalence by health status profile at T0 (%)

The relationships between profile membership, covariates, and distal outcomes also support the validity of the classes. Results of LCA regression models indicate that Cog&Physic-Imp individuals are significantly older compared to those classified in the Physic-Imp and R-Healthy profiles. Relative to their younger peers (64–74 years), individuals in the age groups 75–84 and 85+ are 1.6 and 2.5 times more likely to be highly disabled as opposed to being "only" physically impaired (p < 0.01). These odds increase to 2 and 4, respectively, when compared to being relatively healthy (p < 0.001). Whereas women tend to be classified in the profiles with disability or be relatively healthy, men are significantly more likely to be classified in the Physic-Imp profile (OR range: 2.18–3.2; p < 0.01), where the probability of any type of disability is comparatively low despite high probability of chronic conditions. In both profiles characterized by cognitive problems, individuals are more likely not to live alone (OR range: 3.84–9.28; p < 0.001).

Controlling for age and gender, we found that Cog&Physic-Imp individuals are significantly more likely to die within 22-months compared to Cog-Imp (OR 2.84; p < 0.002), Physic-Imp (OR 3.27; p < 0.001) and R-Healthy individuals (OR 4.75; p < 0.001). In turn, Cog-Imp individuals are more likely to die compared to the Physic-Imp (OR 1.15; p < 0.003) and the R-Healthy (OR 1.67; p < 0.003); and Physic-Imp individuals more likely to die than the R-Healthy (OR 1.45; p < 0.001). Finally, we find that Cog&Physic-Imp individuals are less likely to use nursing home services within 22-months compared to Cog-Imp individuals (OR 0.72; p < 0.001). In turn, individuals in the later two profiles, i.e. characterized by cognitive impairments, are more likely to use nursing home services compared to Physic-Imp individuals (OR 2.87 and 4.37, respectively; p < 0.001) and R-Healthy individuals (OR 3.18 and 4.38, respectively; p < 0.001). The likelihood of using nursing home services for Physic-Imp relative to the R-healthy is not as marked (OR 1.11; p < 0.004). These results all point in the expected direction.

Latent transition analyses

Table 6 presents age-controlled latent transition probabilities from baseline to T1 and from T1 to T2. Among Cog&Physic-Imp individuals, 51.4% are predicted to remain in that state and 24.9% are predicted to die within a year. Around 5% are predicted to move towards less disabled health states: 3.0% towards a Cog-Imp state and 2.3% towards a Physic-Imp state. The main gender difference lies in the increased probability of death for men and LTF for women. Among Cog-Imp individuals, women are more likely to transition towards the more disabled state (24.1%) or improve (4%), whereas men are comparatively more likely to stay in their same state (47.8% vs. 24.3%) or die (11.2% vs. 9%). The competing risk for women to be LTF is again higher. Physic-Imp individuals are characterized by more stability (63.8%) with no striking gender difference. Overall, their probability of transitioning towards more disabled states is 4.3%, and somewhat higher for men; their transition probability towards the R-healthy state is 3.4%, and somewhat higher for women. Men are twice as likely to die. Although the R-Healthy also tend to be characterized by stability (66.2%), this applies most importantly to women. Over a year, men' transition probabilities towards more compromised health states is higher despite their higher competing risk for dying or being LTF.

Table 6 Transition probabilities *

The patterns of transitions between T1 to T2 are similar, with quantitative differences. Overall, transition probabilities towards more disabled states are higher, and improvements less likely. A noteworthy gender difference is the increased probability of R-Healthy women to transition to the Physic-Imp or Cog-Physic-Imp states, as opposed to stability. Also, Cog-Imp women have a 35% probability of becoming highly disabled whereas men in that state appear more stable. The probability of dying across health profiles is lower, except for Physic-Imp men. These differences must be considered in light of the increased competing risk of being LTF and the 10-month interval.

Discussion

Our aim was to identify a meaningful latent classification that encompasses multiple dimensions of health and captures their synergistic effect on older people's health status. We identified four homogeneous health state profiles that are stable over time and sensitive to change.

The uncovered classification has face validity. It clearly distinguishes the physical and cognitive dimensions of health. And within each of these dimensions, a qualitative distinction along the disability dimension captures the consequences of diseases and impairments [1]. These findings generally agree with classifications obtained by other methods [39, 41, 40, 42]. In elderly populations comparable to ours [39, 40], published classifications revealed more nuanced groups (i.e. 5–7 profiles) but with meanings anchored in the same dimensions as those characterizing our profiles. In samples representative of community-living older people [41, 42], an additional "Healthy" profile typically emerges. Given our target population's compromised health, we did not find nor expected a healthy profile. In our sample, four latent classes were sufficient to capture health status heterogeneity while maintaining interpretability and stability over time. There is no longitudinal evidence to determine whether the additional profiles of previously published classifications possess the later qualities.

The profiles' stability over and above observed changes in the sample's overall health substantiates the validity of our classification. The profiles are robust not only in being comparable across time points but also in holding despite high mortality and LTF. The differentiated and gender-specific patterns of transition probabilities demonstrate the profiles' sensitivity to change in health states. Although most individuals tended to remain in their health state or died, we found higher probabilities of unfavorable transitions for individuals in the more compromised health states, and lower probabilities of improvements. These observations concord with studies of change in disability [7, 26, 25], functional limitations [6, 11, 10] and frailty [60]. The consistent finding, across outcome measures, is a decreased probability of recovery and an increased probability of decline or death when more deficits are reported at baseline. For the Cog-Imp profile, our findings appear consistent with the course of disability progression as cognitive problems worsen: IADL are affected first, followed by basic activities of personal care [4, 6]. Yet, a shortcoming of traditional functional measures is their inadequate ability to detect cognitive impairments, particularly when scaled with items influenced by physical ability [61]. Our results show that classification into homogenous health categories unmasked the differential relationship of physical and cognitive domains with progression in disability; gender specific analyses provide further insights.

Overall, the classification has good construct validity. We observed higher mortality and older individuals in the more vulnerable groups; an increased likelihood of nursing home use for cognitively impaired profiles; gender differences in transition probabilities, as well as high coherence between class membership and individual health indicators except comorbidity. The latter is consistent with previous work showing that assessment of disease alone is a weak marker of health status in older individuals, even when indicators of disease severity are considered [62].

In considering the generalizability of our findings, it is important to keep in mind that our reference population was selected to demonstrate the value of integrated services for older people with complex care needs [46, 47]. The characteristics of our sample thus closely match those of the sub-population of community-living elderly targeted by such programs [63], not those of the general population; our classification reflects their compromised health status. Evidence shows that particular groups, namely frail elderly, may be more likely than others to benefit from better integration of care [46] but identifying them remains a challenge. Application of LCA may prove useful for doing so.

Three other issues deserve discussion. The first relates to missing depression and cognitive scores. Firstly, depression scores are mainly missing by design for the cognitively impaired. Assuming a MAR mechanism for depression did not significantly change how individuals are grouped yet we cannot exclude misclassification for individuals who also have a missing cognition score. Secondly, despite the known association between depressive symptoms and cognitive impairments [64], we could not assess the effect of our MAR assumption on subsequent cognitive decline.

The second issue pertains to attrition. LTF individuals did not differ on sociodemographic characteristics and most health indicators at baseline but differed on mobility disability. Combined with our inability to control for unobserved individuals effects (e.g. lifestyle, social support), this means that we cannot exclude a selection bias. Yet, there is no significant difference between profiles in the proportions of LTF individuals at T1 (χ2:6.317; p = 0.097) or T2 (χ2:2.544; p = 0.467). Moreover, the high concordance in how individuals are grouped confirmed the classification's stability overtime, which also points to its robustness to the competing risk of being LTF. For the transition analyses, we captured LTF individuals through an absorbing state. This provision does not inform us on the effect of change in health status on attrition – or vice versa. Nevertheless, it deals with the potential attrition biases introduced in transition studies when these individuals are excluded.

Conversely, a mortality bias is unlikely: we recorded death using administrative databases and captured these transitions through an absorbing state. Excluding deceased individuals would have yielded a healthier sample, overestimated stability and recovery, and underestimated progression relative to a more representative sample. Our modeling approach avoids this bias without having to modify the indicators, run separate analyses or use imputation techniques – all common shortcomings in geriatric studies [65, 66].

Thirdly, we performed the transition analyses in two steps to avoid convergence problems due to the large number of missing data patterns. This approach is not as powerful as performing one LTA on 3 time points but it allowed us to account for the competing risks of death and being LTF in the same analyses. Moreover, because we constrained health profiles to have the same meaning across time points, this two-step strategy should yield valid transition patterns. To be sure, concurrent information on transition, death and LTF probabilities by health status provides critical information for the planning of longitudinal studies of aging.

Despite those limits, our work tackles a core challenge of gerontology research by making the multidimensional and dynamic nature of older people's health status computationally tractable. LCA capture multiple dimensions of health; reveal the smallest number of health profiles that can explain away the associations among observed health dimensions; and makes no assumption about the distribution of health indicators or their relationships other than that of local independence [34]. On these methodological grounds, LCA supersedes classical statistical models by eliminating part of the endogeneity bias [67] introduced in multivariable modeling when indicators of diseases, cognition and disability enter in the model simultaneously. Dealing with this problem is even more pressing when measuring the dynamic of health status, which implies concomitant and interrelated changes in various factors over time [68]. LTA provides a useful empirical heuristic for studying this complex process because the measurement model is specifically developed for dynamic variables as an outgrowth of substantive theory [33, 69].

Conclusion

In his seminal paper on the compression of morbidity, Fries argues that the means for affecting positive change in an aging population are to be found in the variability of the population, as well as in the average values [70]. Our study presents some means to identify and quantify inter-individual variability in health status. Notably, the important weight of the cognitive dimension in explaining this variability and transitions along the disability dimension underscores the importance of moving beyond "simple" functional measures if we are to comprehend the dynamic of elderly people health and social needs. The combination of chronic conditions, cognition and disability items for our LCA finds a parallel in the approaches used to develop the Frailty Index [71] and Clinical Frailty Scale [72]. Compared to our profiles, the former continuous measures of health status provide finer gradations likely to be pertinent to clinical practice and aging research. Conversely, our approach may be unwieldy for clinical use but finds its application at organization and policy levels where many issues call for classification of individuals into pragmatically meaningful groups. Econometric modeling has already demonstrated the sensitivity of such classifications to differences, and changes, in available patterns of health and social services in specific milieu [40, 73]. Applications of LCA and LTA to larger, more representative samples are needed to confirm our findings and expand the methodological underpinnings of these approaches to study health status in older populations.