
1 Introduction

Chronic progressive diseases are a major drain on social and economic resources. Many of these diseases have no treatments and no cure. In particular, age-related chronic diseases such as neurodegenerative diseases of the brain are a global healthcare pandemic-in-waiting as most of the world’s population is living ever longer. A key example is Alzheimer’s disease—the leading cause of dementia—but there are numerous other conditions that cause abnormal deterioration of brain tissue, leading to loss of cognitive performance, bodily function, independence, and ultimately death. Despite the increasing socioeconomic burden, neurodegenerative disease research has made impressive progress in the past decade, driven largely by the availability of large observational datasets and the computational analyses they enable.

Understanding neurodegenerative diseases is vital if they are to be managed, or even cured, but our understanding remains poor despite impressive progress in recent years. This poor understanding can be attributed to the many challenges posed by neurodegenerative diseases: there is no well-defined time axis, owing in part to heterogeneity in onset, speed, and presentation, and data suffer from censoring and attrition, especially in later stages as patients deteriorate. These challenges, coupled with intense debate in the neurology community (hypothetical models [1, 2]) and the increasing availability of data, piqued the interest of computational researchers aiming to provide quantitative answers to the mysteries of neurodegenerative diseases. Approaches have ranged from vanilla off-the-shelf machine learning through to more holistic statistical modeling, the most advanced of which is data-driven disease progression modeling (D3PM).

D3PMs are defined by two key features: (1) they simultaneously reconstruct the disease timeline and estimate the quantitative disease signature/trajectory along this timeline; and (2) they are directly informed by observed data. D3PMs strike a balance between pure unsupervised learning, which requires truly big data, and traditional longitudinal modeling, which relies on a well-defined temporal axis—neither of which is available in neurodegenerative diseases. For a review of the history and development of D3PM, see ref. 3.

The goal of this chapter is to highlight selected key D3PMs in a practical manner. The focus is on model capabilities and data requirements, aiming to inform the reader’s D3PM analysis strategy based on the desired disease insight(s) and the data available. Figure 1 places selected D3PMs on a capability×data quadrant matrix: single timeline estimation vs subtyping, and cross-sectional vs longitudinal data availability. Table A.1 lists more methodological papers relevant to D3PM, with model innovations grouped by the original paper for that method.

Fig. 1
[Figure: 2 × 2 quadrant matrix of capability (single timeline vs subtypes) against data requirement (cross-sectional vs longitudinal), with EBM, DEBM, KDE-EBM, DPS, LTJMM, time-warping, GPPM, course maps, SuStaIn, SubLign, and course maps+ marked inside.]

Quadrant matrix. D3PMs all estimate a disease timeline, with some capable of estimating multiple subtype timelines, using either cross-sectional data (pseudo-timeline) or longitudinal data (time-shift). Abbreviations: EBM, event-based model; DEBM, discriminative EBM; KDE-EBM, kernel density estimation EBM; DPS, disease progression score; LTJMM, latent-time joint mixed model; GPPM, Gaussian process progression model; SuStaIn, subtype and stage inference; SubLign, subtyping alignment

Table A.1 A taxonomy and pedigree of D3PM papers. *Asterisks denote models for cross-sectional data

The chapter is organized as follows. It starts with a brief discussion of data preprocessing considerations in Subheading 2—an important step in medical data analysis. The treatment of D3PMs is separated into models for cross-sectional data (Subheading 3) and models for longitudinal data (Subheading 4), each split into approaches that estimate a single timeline of disease progression and those capable of estimating multiple timelines within a dataset (subtyping). Subheading 5 concludes.

For a detailed timeline of D3PM development including taxonomy and pedigree of key models, see Appendix.

2 Data Preprocessing

This section briefly touches on two common preprocessing steps performed before fitting a D3PM to data from a progressive condition such as an irreversible chronic disease: controlling for confounding variables and handling missing data. We refer to input features as biomarkers and use “covariate” and “confounder” interchangeably. Missing data can refer either to irregular/variable visits across individuals or to missing biomarker values when one or more measurements were not performed at a given visit. This section deals with the latter, since longitudinal models can typically handle irregular visits.

Controlling for confounding variables is an important element of any D3PM analysis. It helps to prevent the D3PM from learning non-disease-related patterns driven by confounding covariates. Confounders can be included as covariates in certain models, to account for that source of variation alongside other variables of interest. Another approach, often used for continuous-valued confounders, is to “regress out” this source of variation prior to fitting a model, removing non-disease-related signal from the data. This involves training regression models on data from control participants (who are not expected to develop the disease being studied) and then removing the corresponding trends from all data. The method can also be applied to categorical risk factors (discrete variables). The canonical example of a potentially confounding variable in neurodegenerative diseases of the brain is age, a key risk factor in many chronic diseases. Removing the normal aging signal is often phrased as “adjusting for” or “controlling for” age.
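For concreteness, the “regress out” approach can be sketched as follows. This is a minimal illustration, assuming a linear confounder trend; variable names are illustrative and not taken from any particular package.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

def regress_out(X, confounder, is_control):
    """Remove a confounder's trend, learned from control participants only.

    X: (n_subjects, n_biomarkers) biomarker array
    confounder: (n_subjects,) confounder values, e.g., age
    is_control: (n_subjects,) boolean mask of control participants
    """
    c = np.asarray(confounder, dtype=float).reshape(-1, 1)
    X_adj = X.astype(float).copy()
    for j in range(X.shape[1]):
        # Fit the normal (control) trend for biomarker j...
        model = LinearRegression().fit(c[is_control], X[is_control, j])
        # ...and remove it from everyone, leaving disease-related residuals
        X_adj[:, j] = X[:, j] - model.predict(c)
    return X_adj
```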

Handling missing data is an active area of research with a considerable body of literature. Broadly speaking, there are two strategies. The easier is to exclude participants having any missing biomarker (or covariate) data, but this can considerably reduce the sample size available for D3PM analysis. The second is to impute the missing data, e.g., using group mean values. Imputation can be explicit or implicit. An example of implicit imputation arises in Bayesian models that map data to probabilities and then deal with missing data probabilistically, as in the event-based model [4], where P(event|x) = 0.5 represents maximal uncertainty, e.g., when a measurement x is missing.
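A minimal sketch of explicit group-mean imputation follows (illustrative only; real analyses may prefer multiple imputation or model-based handling):

```python
import numpy as np

def impute_group_mean(X, group):
    """Explicit imputation: replace missing values (NaNs) with the mean of
    the participant's group (e.g., diagnostic group). A simple sketch; in
    probabilistic models such as the EBM, a missing measurement x can
    instead be handled implicitly via P(event|x) = 0.5."""
    X = X.astype(float).copy()
    for g in np.unique(group):
        rows = np.where(group == g)[0]
        mu = np.nanmean(X[rows], axis=0)        # per-biomarker group means
        nan_r, nan_c = np.where(np.isnan(X[rows]))
        X[rows[nan_r], nan_c] = mu[nan_c]       # fill NaNs with group mean
    return X
```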

3 Models for Cross-Sectional Data

Box 1: Models for Cross-Sectional Data

  • Pro: Data-economical.

    Require cross-sectional data only.

  • Con: Limited forecasting utility.

    Forecasting requires augmentation with longitudinal data.

  • Key application(s): assessing disease severity from a single visit, e.g., economical stratification for clinical research/trials.

3.1 Single Timeline Estimation Using Cross-Sectional Data

There is only one framework for estimating disease timelines from cross-sectional data: event-based modeling.

3.1.1 Event-Based Model

The event-based model (EBM) emerged in 2011 [5, 6]. The concept is simple: in a progressive disease, biomarker measurements only ever get worse, i.e., become increasingly and irreversibly abnormal. Thus, among a cohort of individuals at different stages of a single progressive disease, the cumulative sequence of biomarker abnormality events can be inferred from only a single visit per individual. This requires making a few assumptions: measurements from individuals are independent and represent samples from a single sequence of cumulative abnormality, i.e., a single timeline of disease progression. Such assumptions are commonplace in many statistical analyses of disease progression and are reasonable approximations to make when analyzing data from research studies that typically have strict inclusion and exclusion criteria to focus on a single condition of interest. Unsurprisingly, the event-based model has proven to be extremely powerful, producing insight into many neurodegenerative diseases: sporadic Alzheimer’s disease [7,8,9,10], familial Alzheimer’s disease [6, 11], Huntington’s disease [6, 12], Parkinson’s disease [13], and others [14, 15].

3.1.1.1 EBM Fitting

The first step in fitting an event-based model maps biomarker values to abnormality values, similar to the hypothetical curves of biomarker abnormality proposed in 2010 [1, 2]. The EBM does this probabilistically, using two-component mixture modeling in which individuals can be labeled either as pre-event/normal or post-event/abnormal. This allows for later events that are yet to occur in patients and, conversely, for earlier events that may already have occurred in asymptomatic individuals. Various distributions have been proposed for this mixture modeling: combinations of uniform [5, 6], Gaussian [5,6,7], and kernel density estimate (KDE) distributions [9]. This is visualized in Fig. 2.

Fig. 2
[Figure: three histograms showing components one and two of a KDE mixture model at mixing proportions 0.50, 0.66, and 0.75.]

Event-based models fit a mixture model to map biomarker values to abnormality probabilities. Left to right shows the convergence of a kernel density estimate (KDE) mixture model. From Firth et al. [9] (CC BY 4.0)
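To make the mapping concrete, here is a minimal sketch using scikit-learn’s unconstrained two-component Gaussian mixture as a stand-in for the constrained Gaussian/KDE mixtures used in the published models:

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def event_probabilities(x):
    """Map one biomarker's values to P(event | x) via a 2-component mixture.

    Assumes abnormality increases the measurement, so the component with
    the larger mean is taken as the post-event (abnormal) distribution.
    """
    x = np.asarray(x, dtype=float).reshape(-1, 1)
    gmm = GaussianMixture(n_components=2, n_init=5).fit(x)
    posteriors = gmm.predict_proba(x)          # (n, 2) component posteriors
    abnormal = int(np.argmax(gmm.means_))      # index of larger-mean component
    return posteriors[:, abnormal]
```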

The second step in fitting an EBM over N events is to search the space of N! possible sequences S to reveal the most likely sequence (see refs. 6, 7, 9 for mathematical details). For small N ≲ 10, it can be computationally feasible to perform an exhaustive search over all N! possible sequences to find the maximum likelihood/a posteriori solution. More generally, the EBM uses multiply-initialized gradient ascent followed by MCMC sampling to estimate uncertainty in the sequence. The result is a model posterior consisting of samples from the posterior probability density for each biomarker as a function of sequence position. This is presented as a positional variance diagram [6], such as in Fig. 3.

Fig. 3
[Figure: positional variance diagram (heat map) of biomarkers, including smell, RBDSQ, MoCA, DTI ROIs 1–6, regional MRI (parietal, temporal, cingulate), and category/letter fluency, versus sequence position.]

The event-based model posterior is a positional variance diagram showing uncertainty (left-to-right) in the maximum likelihood sequence (top-to-bottom). Parkinson’s disease model from Oxtoby et al. [13] (CC BY 4.0)
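For small N, the exhaustive search can be written out directly. Below is a sketch of the standard EBM likelihood (see refs. 6, 7, 9) with a uniform prior over the N + 1 stages, assuming p_E and p_notE are precomputed arrays of per-biomarker likelihoods P(x|event) and P(x|no event) from the mixture modeling step:

```python
import itertools
import numpy as np

def log_likelihood(S, p_E, p_notE):
    """EBM log-likelihood of sequence S (a permutation of range(N)).

    p_E, p_notE: (n_subjects, N) likelihoods of each measurement given the
    event has / has not occurred. Each subject sits at one of N + 1 unknown
    stages, marginalized here with a uniform prior.
    """
    n, N = p_E.shape
    pe, pn = p_E[:, S], p_notE[:, S]    # reorder biomarkers by sequence S
    stage_lik = np.ones((n, N + 1))
    stage_lik[:, 0] = np.prod(pn, axis=1)
    for k in range(1, N + 1):
        # at stage k, events S[0..k-1] have occurred, the rest have not
        stage_lik[:, k] = np.prod(pe[:, :k], axis=1) * np.prod(pn[:, k:], axis=1)
    return np.sum(np.log(np.mean(stage_lik, axis=1)))

def exhaustive_search(p_E, p_notE):
    """Brute-force maximum likelihood sequence; feasible only for small N."""
    N = p_E.shape[1]
    return max(itertools.permutations(range(N)),
               key=lambda S: log_likelihood(np.array(S), p_E, p_notE))
```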

For further information and hands-on EBM tutorials, the reader is directed to the open-source kde_ebm package (github.com/ucl-pond/kde_ebm) and to disease-progression-modelling.github.io.

3.1.2 Discriminative Event-Based Model

The discriminative event-based model (DEBM) was proposed in 2017 by Venkatraghavan et al. [16]. Whereas the EBM treats data from individuals as observations of a single group-level disease cascade (sequence), the DEBM estimates individual-level sequences and combines them into a group-level description of disease progression. This is done using a Mallows model, which is the ranking/sequencing equivalent of a univariate Gaussian distribution, including estimation of a mean sequence and of the variance about this mean. Both EBM and DEBM estimate group-level biomarker abnormality using mixture modeling, and both approaches directly estimate uncertainty in the sequence.

Additionally, Venkatraghavan et al. [16, 17] also introduced a pseudo-temporal “disease time” that converts the DEBM posterior into a continuous measure of disease severity.

3.1.2.1 DEBM Fitting

As with the EBM, DEBM model fitting starts with mixture modeling (see Subheading 3.1.1). Next, a sequence is estimated for each individual by ranking the abnormality probabilities in descending order. A group-level mean sequence (with variance) is then estimated by fitting the individual sequences to a Mallows model. For details, see refs. 16, 17 and subsequent innovations to the DEBM. Notably, DEBM is often quicker to fit than EBM, which makes it appealing for high-dimensional extensions, e.g., aiming to estimate voxel-wise atrophy signatures from cross-sectional brain imaging data.
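Schematically, the individual-ranking step looks as follows, with a simple mean-rank (Borda-style) aggregation standing in for the full Mallows model estimation implemented in pyebm:

```python
import numpy as np

def debm_central_ordering(P):
    """P: (n_subjects, N) matrix of P(event | x) from mixture modeling.

    Each subject's sequence ranks biomarkers by descending abnormality
    probability; the central ordering sorts biomarkers by mean rank.
    This is a Borda-style approximation, not a full Mallows model fit.
    """
    ranks = np.argsort(np.argsort(-P, axis=1), axis=1)  # rank 0 = most abnormal
    return np.argsort(ranks.mean(axis=0))               # group-level sequence
```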

For further information and to try it out, the reader is directed to the open-source pyebm package (https://github.com/88vikram/pyebm).

3.2 Subtyping Using Cross-Sectional Data

Box 2: Subtyping Models

  • Pro: Uncovering heterogeneity without conflating severity with subtype.

    Evidence suggests that disease subtypes exist.

  • Con: Overly simplistic.

    Current models ignore comorbidity.

Augmenting the event-based model concept with unsupervised machine learning, subtype and stage inference (SuStaIn), was introduced by Young et al. [18]. This marriage of clustering to disease progression modeling has proven very powerful and popular, with high-impact results appearing in prominent journals for multiple brain diseases [19,20,21], chronic lung disease [22], and knee osteoarthritis [23]. SuStaIn’s popularity is perhaps unsurprising given that it was the first method capable of disentangling spatiotemporal heterogeneity (pathological severity across an organ) from phenotypic heterogeneity (disease subtypes) in progressive conditions using only cross-sectional data.

Figure 4 (adapted from [18]) shows the concept behind SuStaIn. SuStaIn iteratively solves the clustering problem from 1 to \( {N}_{\mathrm{S}}^{\mathrm{max}} \) subtypes. The \( {N}_{\mathrm{S}} \)-subtype model is fitted by splitting each of the \( {N}_{\mathrm{S}}-1 \) subtypes of the previous model into two clusters and then solving the \( {N}_{\mathrm{S}} \)-cluster problem, which produces \( {N}_{\mathrm{S}}-1 \) candidate \( {N}_{\mathrm{S}} \)-cluster models, from which the maximum likelihood model is chosen; the algorithm then continues to \( {N}_{\mathrm{S}}+1 \) subtypes, and so on.

Fig. 4
[Figure: schematics of SuStaIn: (a) underlying model of two subtypes evolving over time; (b) input data; (c) reconstructed disease subtypes and stages; (d) application, with per-individual probability bar graphs by subtype and stage.]

The concept of subtype and stage inference (SuStaIn). Reproduced from Young et al. [18] (CC BY 4.0)
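In outline, the model-selection loop described above can be sketched as follows; fit_one, split_and_refit, and log_lik are hypothetical placeholders for pySuStaIn’s actual machinery (sequence/z-score model fitting plus MCMC):

```python
def fit_sustain(data, n_max, fit_one, split_and_refit, log_lik):
    """Schematic of SuStaIn's iterative subtype splitting (Young et al. [18]).

    fit_one(data) fits a single-timeline model; split_and_refit(model, c, data)
    splits subtype c in two and refits all clusters jointly; log_lik scores a
    candidate model. All three are placeholders, not a real API.
    """
    model = fit_one(data)                  # 1-subtype (single timeline) model
    for n_s in range(2, n_max + 1):
        # try splitting each of the n_s - 1 current subtypes into two...
        candidates = [split_and_refit(model, c, data) for c in range(n_s - 1)]
        # ...and keep the maximum likelihood n_s-subtype model
        model = max(candidates, key=log_lik)
    return model
```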

Young et al. [18] also introduced the z-score event progression model that breaks down individual biomarker events into piecewise linear transitions between z-scores of interest. This removes the need for mixture modeling (such as in event-based modeling) and enables inference to be performed at subthreshold biomarker values.
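A minimal sketch of such a piecewise linear trajectory, assuming a biomarker passes through its z-score thresholds at known stages:

```python
import numpy as np

def zscore_trajectory(stage, event_stages, z_vals):
    """Piecewise linear biomarker trajectory for the z-score event model.

    event_stages: stages at which the biomarker reaches each z-score in
    z_vals (both ascending). Linear interpolation between (stage, z) knots;
    flat beyond the final knot (the z_max plateau).
    """
    knots_t = np.concatenate([[0.0], np.asarray(event_stages, float)])
    knots_z = np.concatenate([[0.0], np.asarray(z_vals, float)])
    return np.interp(stage, knots_t, knots_z)

# e.g., a biomarker reaching z = 1, 2, 3 at stages 2, 5, and 9:
# zscore_trajectory(np.linspace(0, 10, 6), [2, 5, 9], [1, 2, 3])
```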

3.2.1 SuStaIn Fitting

For the user, a SuStaIn analysis is very similar to an event-based model analysis. For further information, the reader is directed to the open-source pySuStaIn package [24] (https://github.com/ucl-pond/pySuStaIn), which includes tutorials. As well as the z-score progression model, pySuStaIn includes the various event-based models (see Subheading 3.1) and the more recent scored-events model for ordinal data [25], such as visual ratings of medical images.

4 Models for Longitudinal Data

Box 3: Models for Longitudinal Data

  • Pro: Good forecasting utility.

    High temporal precision allows individualized forecasting.

  • Con: Data-heavy.

    Require longitudinal data (multiple visits, years). Can be slow to fit.

  • Key application(s): assessing speed of disease progression and assessing individual variability.

The availability of longitudinal data has fueled development of more sophisticated D3PMs, inspired by mixed models. Mixed (effect) modeling is the workhorse of longitudinal statistical analysis against a known timeline, e.g., age. Mixed models provide a hierarchical description of individual-level variation (random effects) about group-level trends (fixed effects), hence the common parlance “mixed-effects” models. Many of the D3PMs for longitudinal data discussed below are in fact mixed models with an additional latent-time parameter that characterizes the disease timeline. Similar approaches in various fields are known as “self-modeling regression” or “latent-time” models. We focus on parametric models, but also mention nonparametric models, and an emerging hybrid discrete-continuous model.
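Schematically, such a model augments a standard mixed model with a per-individual latent time shift. In its simplest univariate form (notation ours; details vary between the models below):

\[
y_{ij} = f\!\left(t_{ij} + \delta_i\right) + b_i + \varepsilon_{ij},
\qquad b_i \sim \mathcal{N}\!\left(0, \sigma_b^2\right),
\quad \varepsilon_{ij} \sim \mathcal{N}\!\left(0, \sigma^2\right),
\]

where \(y_{ij}\) is a biomarker measured at visit \(j\) of individual \(i\), \(f\) is the group-level trajectory (fixed effects), \(b_i\) is a random effect, and the latent time shift \(\delta_i\) aligns individual \(i\) to the common disease timeline.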

4.1 Single Timeline Estimation Using Longitudinal Data

There are both parametric and nonparametric approaches to estimating disease timelines from longitudinal data. The common goal is to “stitch together” a full disease timeline (decades long) out of relatively short samples from individuals (a few years each) covering a range of severity in symptoms and biomarker abnormality. Some of the earliest work emerged from the medical image registration community, where “warping” images to a common template is one of the first steps in group analyses [26].

Broadly speaking, there are two categories of D3PMs for longitudinal data: time-shifting models and differential equation models. Time-shifting models translate/deform the individual data, metaphorically stitching them together into a quantitative template of disease progression. Differential equation models estimate a statistical model of biomarker dynamics in phase-plane space (position vs velocity), which is subsequently inverted to produce biomarker trajectories.

4.1.1 Explicit Models for Longitudinal Data: Latent-Time Models

Jedynak et al. [27] introduced the disease progression score (DPS) model in 2012, which aligns biomarker data from individuals to a group template model using a linear transformation of age into a disease progression score \( s_i = \alpha_i \, \mathrm{age} + \beta_i \). Each individual has their own rate of progression \( \alpha_i \) (assumed constant over the short observation period) and disease onset \( \beta_i \). Group-level biomarker dynamics are modeled as sigmoid (“S”) curves. A Bayesian extension of the DPS approach (BPS) appeared in 2019 [28]. Code for both the DPS and BPS was released publicly: https://www.nitrc.org/projects/progscore; https://hub.docker.com/r/bilgelm/bayesian-ps-adni/.
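To illustrate the idea (not the published fitting algorithm), one can alternate between fitting the group-level sigmoid and the per-individual time parameters; a minimal single-biomarker sketch:

```python
import numpy as np
from scipy.optimize import curve_fit, minimize

def sigmoid(s, lo, hi, mid, slope):
    """Group-level sigmoid trajectory as a function of progression score s."""
    return lo + (hi - lo) / (1.0 + np.exp(-slope * (s - mid)))

def fit_dps(ages, values, subject_ids, n_iter=20):
    """Alternate between the group sigmoid and per-individual (alpha_i, beta_i)
    in s_i = alpha_i * age + beta_i. Illustrative coordinate-wise fit only."""
    subjects = sorted(set(subject_ids))
    a = {i: 1.0 for i in subjects}   # per-individual progression rates
    b = {i: 0.0 for i in subjects}   # per-individual onsets/offsets
    theta = [values.min(), values.max(), np.median(ages), 0.1]
    for _ in range(n_iter):
        # update group-level sigmoid given current progression scores
        s = np.array([a[i] * t + b[i] for i, t in zip(subject_ids, ages)])
        theta, _ = curve_fit(sigmoid, s, values, p0=theta, maxfev=10000)
        # update each individual's (alpha_i, beta_i) given the sigmoid
        for i in subjects:
            m = np.array([sid == i for sid in subject_ids])
            obj = lambda p, m=m: np.sum(
                (values[m] - sigmoid(p[0] * ages[m] + p[1], *theta)) ** 2)
            a[i], b[i] = minimize(obj, x0=[a[i], b[i]]).x
    return a, b, theta
```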

In 2014, Donohue et al. [29] introduced a self-modeling regression approach similar to the DPS model. It was later generalized into the more flexible latent-time joint mixed (effects) model (LTJMM) [30], which can include covariates as fixed effects and is a flexible Bayesian framework for inference. The LTJMM software was released publicly: https://bitbucket.org/mdonohue/ltjmm.

A nonparametric latent-time mixed model appeared in 2017: the Gaussian process progression model (GPPM) of Lorenzi et al. [31]. This is a flexible Bayesian approach akin to (parametric) self-modeling regression that does not impose a parametric form on biomarker trajectories. More recent work supplemented the GPPM with a dynamical systems model of molecular pathology spread through the brain [32], which can regularize the GPPM fit to produce a more accurate disease timeline reconstruction while also providing insight into neurodegenerative disease mechanisms (a topic that could fill a standalone chapter of this book). The GPPM and GPPM-DS model source code was released publicly via gitlab.inria.fr/epione, and tutorials are available at disease-progression-modelling.github.io.

In 2015, Schiratti et al. [33,34,35] introduced a general framework for estimating spatiotemporal trajectories for any type of manifold-valued data. The framework is based on Riemannian geometry and a mixed-effects model with time reparametrization. It was subsequently extended by Koval et al. [36] to form the disease course mapping approach (available in the leaspy software package). Disease course mapping combines time warping (of age) and inter-biomarker spacing translation. Time warping changes disease progression dynamics—time shift/onset and acceleration/progression speed—but not the trajectory. Inter-biomarker spacings shift an individual’s trajectory to account for individual differences in the timing and ordering of biomarker trajectories.
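Schematically, the time warp at the heart of this framework is an affine reparametrization of age (notation simplified from refs. 33–36):

\[
\psi_i(t) = \alpha_i \left(t - t_0 - \tau_i\right) + t_0,
\qquad \alpha_i = e^{\xi_i},
\]

where \(t_0\) is a reference time, the time shift \(\tau_i\) captures individual \(i\)'s earlier or later onset, and the acceleration factor \(\alpha_i > 0\) captures faster or slower progression. Individual trajectories are the group-average trajectory evaluated at \(\psi_i(t)\), with additional inter-biomarker spacing parameters shifting its components.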

Figures 5 and 6 show example outputs of these models when trained on data from older people at risk of Alzheimer’s disease, including those with diagnosed mild cognitive impairment and dementia due to probable Alzheimer’s disease.

Fig. 5
[Figure: (a) normalized biomarker dynamics versus Alzheimer’s disease progression score (ADPS), with a companion plot of the timing of biomarker dynamics; (b) nine scatter plots of biomarker severity versus progression time.]

Two examples of D3PMs fit to longitudinal data: disease progression score [27] and Gaussian process progression model [31]. (a) Alzheimer’s disease progression score (2012) [27]. Reprinted from NeuroImage, Vol 63, Jedynak et al., A computational neurodegenerative disease progression score: Method and results with the Alzheimer’s Disease Neuroimaging Initiative cohort, 1478–1486, © (2012), with permission from Elsevier. (b) Gaussian process progression model (2017) [31]. Reprinted from NeuroImage, Vol 190, Lorenzi et al., Probabilistic disease progression modeling to characterize diagnostic uncertainty: Application to staging and prediction in Alzheimer’s disease, 56–68, © (2019), with permission from Elsevier

Fig. 6
[Figure: (a) population-level predicted severity versus age; (b) Alzheimer’s disease course map of neuropsychological assessments, cortical thickness, hippocampus volume, and metabolism at ages 62, 70, 78, and 86.]

Two additional examples of D3PMs fit to longitudinal data: latent-time joint mixed model [30] and disease course mapping [36]. (a) Latent-time joint mixed model (2017) [30]. From [37] (CC BY 4.0). (b) Alzheimer’s disease course map (2021) [36] (CC BY 4.0)

4.1.1.1 Fitting Longitudinal Latent-Time Models

Fitting D3PMs to longitudinal data is more complex than to cross-sectional data, and the software packages discussed above each expect the data in slightly different formats. One thing they have in common is that renormalization (e.g., min-max or z-score) and reorientation (e.g., so that all biomarkers increase with disease progression) are required to put biomarkers on a common scale and direction. In some cases, such preprocessing is necessary to ensure or accelerate model convergence. For example, the LTJMM used a quantile transformation followed by the inverse Gaussian quantile function to put all biomarkers on a Gaussian scale. For further detailed discussion, including model identifiability, we refer the reader to the original publications cited above and the didactic resources at disease-progression-modelling.github.io.
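For instance, a quantile-to-Gaussian transformation in the spirit of the LTJMM preprocessing can be sketched as follows (the original implementation details may differ):

```python
import numpy as np
from scipy.stats import norm, rankdata

def to_gaussian_scale(x):
    """Rank-based quantile transform followed by the inverse Gaussian CDF,
    putting a biomarker on an approximately standard normal scale.
    Multiply x by -1 first if the biomarker decreases with disease."""
    x = np.asarray(x, dtype=float)
    q = rankdata(x) / (len(x) + 1.0)   # empirical quantiles strictly in (0, 1)
    return norm.ppf(q)
```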

4.1.2 Implicit Models for Longitudinal Data: Differential Equation Models

Parametric differential equation D3PMs emerged between 2011 and 2014 [38,39,40,41], receiving a more formal treatment in 2017 [42]. In a hat-tip to physics, these have also been dubbed “phase-plane” models, which aids their understanding as models of velocity (biomarker progression rate) as a function of position (biomarker value). Fitting proceeds by estimating a phase-plane model from the observed data and then integrating it to recover the long-time biomarker trajectory.

A nonparametric differential equation D3PM using Gaussian processes (GP-DEM) was introduced in 2018 [11]. This added flexibility to the preceding parametric approaches and produced state-of-the-art results in predicting symptom onset in familial Alzheimer’s disease.

4.1.2.1 Fitting Differential Equation Models

The concept is shown in Fig. 7: differential equation model fitting is a three-step process. First, estimate a single “position” and “velocity” value per individual; for example, linear regression of an individual’s observations yields a position (e.g., the intercept or mean value) and a velocity (the gradient). Second, fit a group-level differential equation model of velocity y as a function of position x. Third, integrate/invert this model to produce a biomarker trajectory x(t). Differential equation models can be univariate or multivariate and can include covariates explicitly.

Fig. 7
[Figure: differential equation model fitting: (a) data; (b) phase-plane fit of y versus x; (c) estimated trajectory x(t); (d) stochastic model.]

Differential equation models, or phase-plane models, for biomarker dynamics involve a three-step process: estimate individual-level position and velocity; fit a group-level model of velocity y vs position x; and integrate to produce a trajectory x(t). Reprinted by permission from Springer Nature: Oxtoby, N.P. et al., Learning Imaging Biomarker Trajectories from Noisy Alzheimer’s Disease Data Using a Bayesian Multilevel Model. In: Cardoso, M.J., Simpson, I., Arbel, T., Precup, D., Ribbens, A. (eds) Bayesian and grAphical Models for Biomedical Imaging. Lecture Notes in Computer Science, vol 8677, pp. 85–94 © (2014) [41]. (a) Data. (b) Differential fit. (c) Est. trajectory. (d) Stochastic model
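A minimal sketch of the three steps for a single biomarker, using a quadratic phase-plane model for illustration:

```python
import numpy as np
from scipy.integrate import solve_ivp

def fit_phase_plane(t_by_subject, x_by_subject):
    """Steps 1-2: per-subject linear regression gives one (position, velocity)
    pair each; then fit velocity as a quadratic function of position."""
    pos, vel = [], []
    for t, x in zip(t_by_subject, x_by_subject):
        slope, intercept = np.polyfit(t, x, 1)
        pos.append(np.mean(x))      # position: mean biomarker value
        vel.append(slope)           # velocity: rate of change
    coeffs = np.polyfit(pos, vel, 2)        # dx/dt = f(x), quadratic f
    return np.poly1d(coeffs)

def integrate_trajectory(f, x0, t_span=(0.0, 20.0)):
    """Step 3: integrate dx/dt = f(x) to recover the long-time trajectory."""
    return solve_ivp(lambda t, x: f(x), t_span, [x0], dense_output=True)

# usage sketch: traj = integrate_trajectory(fit_phase_plane(ts, xs), x0=0.1)
```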

4.1.3 Hybrid Discrete-Continuous Models

Recent work introduced the temporal EBM (TEBM) [43, 44], which augments event-based modeling with hidden Markov modeling to produce a hybrid discrete-continuous D3PM. This is a halfway house between discrete models (great for medical decision making) and continuous models (great for detailed understanding of disease progression). Trained on data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI), the TEBM revealed the full timeline of the pathophysiological cascade of Alzheimer’s disease, as shown in Fig. 8.

Fig. 8
[Figure: timeline of disease time in years marking events for ABETA, ventricles, TAU, PTAU, ADAS13, RAVLT, MMSE, hippocampus, entorhinal, fusiform, mid-temporal, and whole brain, with two staged example patients: patient Y (AD, age 88.5) and patient X (MCI, age 69.5).]

Alzheimer’s disease sequence and timeline estimated by a hybrid discrete-continuous D3PM: the temporal event-based model [43, 44]. Permission to reuse was kindly granted by the authors of [43]

4.2 Subtyping Using Longitudinal Data

Clustering longitudinal data without a well-defined time axis can be extremely difficult. Jointly estimating latent time for multiple trajectories is an identifiability challenge, i.e., multiple parameter combinations can explain the same data. This is particularly challenging when observations span a relatively small fraction of the full disease timeline, as in age-related neurodegenerative diseases.

Chen et al. [45] introduced SubLign for subtyping and aligning longitudinal disease data. The authors frame the challenge eloquently as having misaligned, interval-censored data: left censoring from patients being observed only after disease onset and right censoring from patient dropout in more severe disease. SubLign combines a deep generative model (based on a recurrent neural network [46]) for learning individual latent time-shifts and parametric biomarker trajectories using a variational approach, followed by k-means clustering. It was applied to data from a Parkinson’s disease cohort to recover some known clinical phenotypes in new detail.

Poulet and Durrleman [47] recently added mixture-model clustering to the nonlinear mixed model approach of disease course mapping [36]. The framework jointly estimates model parameters and subtypes using a modification of the expectation-maximization algorithm. In simulated data experiments, their approach outperforms a naive baseline. Experiments on real data in Alzheimer’s disease distinguished rapid from slow clinical progression, with minimal differences in biomarker trajectories.

5 Conclusion

Twenty-first century medicine faces many challenges due to aging populations worldwide, including increasing socioeconomic burden from age-related brain disorders like Alzheimer’s disease. Many failed clinical trials fueled intense debate in neurology in the first decade of this century, culminating in the prominent hypothesis of Alzheimer’s disease progression as a pathophysiological cascade of dynamic biomarker events. This inspired the emergence of data-driven disease progression modeling (D3PM) from the computer science community during the second decade of the twenty-first century—an explosion of quantitative models for neurodegenerative disease progression enabling numerous high-impact insights across multiple brain disorders. The community continues to build and share open-source code (see Box 4) and run machine learning challenges [48,49,50]. What will the third decade of the twenty-first century bring for this exciting subset of machine learning for brain disorders?