
1 Introduction

There is growing clinical evidence that structural and functional brain development and aging take heterogeneous paths within different subsets of the human population [1,2,3]. This heterogeneity has been largely overlooked in case-control study analyses, yielding a limited understanding of the diversity of underlying biological processes that might give rise to similar clinical phenotypes. The advent of high-throughput neuroimaging technologies and concentrated efforts to collect large-scale datasets [4, 5] provide a unique opportunity to dissect the structural and functional heterogeneity of brain disorders in finer detail and in an unbiased, data-driven manner. A developing body of work that leverages machine learning (ML) and neuroimaging seeks disease subtypes of neuropsychiatric and neurodegenerative disorders, including Alzheimer’s disease (AD) [6,7,8,9,10,11], schizophrenia [12, 13], and late-life depression [14].

Subtyping brain diseases is a clustering problem: the goal is to partition the set of patients into distinct and relatively homogeneous subgroups (i.e., subtypes). While clustering has been actively investigated in the computer science community, subtyping neuroimaging data faces a unique set of obstacles, such as the “curse of dimensionality” and confounding nuisance effects, including global demographics and scanner differences. Furthermore, brain development and pathologies often progress along a continuum, e.g., from a healthy state to preclinical stages to full-fledged disease [15], so that modeling directly in the patient domain may lead to a biased clustering solution. To tackle these problems, recent efforts have focused on developing semi-supervised [6, 8, 9, 16] and unsupervised clustering methods [10, 11]. Early studies mainly relied on unsupervised clustering methods, such as K-means [17] or hierarchical clustering [18], to derive data-driven subtypes from imaging data. However, such approaches directly partition the patients based on similarities/dissimilarities and are potentially biased by confounding factors, such as demographics, or by heterogeneity caused by unrelated pathological processes. More recently, semi-supervised clustering methods [6, 8, 9, 16] have been proposed to tackle this problem from a novel angle. To seek a pathology-oriented clustering solution, semi-supervised approaches dissect disease heterogeneity via the “1-to-k” mapping between a reference group (i.e., healthy controls (CN)) and the subgroups of the patient group (i.e., the k subtypes). This approach presumably zooms into the heterogeneity of pathological processes rather than unwanted heterogeneity in general. Furthermore, confounding variations, such as demographics, are often explicitly ruled out in these approaches.

Aiming to provide the reader in the imaging and machine learning community with a broad guideline in terms of methodology and clinical applications, we organize the remainder of this chapter as follows. In Subheading 2, we provide a brief overview of clustering methods, including unsupervised and semi-supervised approaches. Subheading 3 discusses their applications in various neurological and neuropsychiatric disorders and diseases. Subheading 4 concludes the paper by discussing our main observations, methodological limitations, and future directions.

2 Methodological Development Using Machine Learning and Neuroimaging

Machine learning and neuroimaging have brought unprecedented opportunities to elucidate disease heterogeneity in various brain disorders and diseases [19]. Several trailblazing methodological papers have recently been published [9,10,11], challenging the conventional approach to patient stratification that puts all patients into the same bucket. Among these, unsupervised [10, 11] and semi-supervised clustering methods [9] sought to derive biologically meaningful, data-driven disease subtypes, but they anchor the modeling from distinct perspectives. For conciseness, let us note that our imaging dataset contains q healthy control (CN) samples \( {\boldsymbol{X}}_r=\left[{\boldsymbol{x}}_1,\dots, {\boldsymbol{x}}_q\right],{\boldsymbol{X}}_r\in {\mathbb{R}}^{p\times q} \), representing our reference group, and m patient (PT) samples \( {\boldsymbol{X}}_t=\left[{\boldsymbol{x}}_1,\dots, {\boldsymbol{x}}_m\right],{\boldsymbol{X}}_t\in {\mathbb{R}}^{p\times m} \), representing the target subtype population. We denote the whole population as a matrix X organized by arranging each image as a vector per column, \( \boldsymbol{X}=\left[{\boldsymbol{x}}_1,\dots, {\boldsymbol{x}}_{q+m}\right],\boldsymbol{X}\in {\mathbb{R}}^{p\times \left(q+m\right)} \), where p is the number of features per image. We use binary labels to distinguish the patient and control groups, where 1 represents PT and −1 represents CN. Disease subtyping seeks to identify k clusters in the patient group that are neuroanatomically distinct and clinically relevant.

2.1 Unsupervised Clustering

Many recent efforts to discover the heterogeneous nature of brain diseases have investigated different unsupervised clustering algorithms [10, 11, 20,21,22,23,24,25,26,27,28,29,30,31,32]. Among these approaches, the key clustering methods are often K-means, hierarchical clustering, and nonnegative matrix factorization (NMF) (Fig. 1). In this subsection, we first briefly go through these methods. Subsequently, we focus on two representative models building on these unsupervised methods, i.e., Sustain [10] and latent Dirichlet allocation [11].

Fig. 1
Schematic diagram of representative unsupervised clustering methods: (1) K-means, with clusters and their centroids; (2) NMF, factorizing X into the component matrix C and the loading matrix L; and (3) hierarchical clustering

2.1.1 K-Means Clustering

K-means clustering aims to directly partition the m patients into k clusters. Each patient belongs to the cluster with the nearest mean (i.e., cluster centroid), as quantified by a distance metric of choice (e.g., Euclidean distance). Since finding the global minimum of the clustering objective is computationally difficult (NP-hard), the K-means algorithm searches for a local minimum via iterative refinement. Given an initial set of k centroids, this usually involves two steps: (i) the assignment step, which assigns each data point to the cluster whose centroid has the smallest squared Euclidean distance, and (ii) the update step, which recalculates the mean (centroid) of the data points assigned to each cluster. The two steps alternate until convergence, i.e., until the assignments no longer change. More details regarding the K-means algorithm are provided in Chap. 2, Subheading 12.1. Please refer to [33,34,35] for representative studies using K-means for disease subtyping.
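
To make the two-step procedure concrete, the following minimal sketch clusters a toy patient feature matrix with scikit-learn; the variable names (X_patients, k) and the synthetic data are purely illustrative and not tied to any specific study.

import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
X_patients = rng.normal(size=(200, 50))   # toy data: m=200 patients, p=50 imaging features

k = 3                                     # hypothesized number of subtypes
kmeans = KMeans(n_clusters=k, n_init=20, random_state=0)
labels = kmeans.fit_predict(X_patients)   # assignment step + update step, iterated to convergence

centroids = kmeans.cluster_centers_       # one centroid (mean imaging pattern) per subtype
print(np.bincount(labels))                # subtype sizes

In practice, the number of clusters k is usually selected via internal criteria such as cluster stability or silhouette scores, since no ground-truth subtype labels are available.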

2.1.2 NMF Clustering

Nonnegative matrix factorization (NMF) is a method that implicitly performs clustering by taking advantage of the fact that complex patterns can be construed as a sum of simple parts. In essence, the input data Xt is factorized into two nonnegative matrices \( \boldsymbol{C}\in {\mathbb{R}}^{p\times k} \) and \( \boldsymbol{L}\in {\mathbb{R}}^{k\times m} \), which we refer to as the component matrix and the loading coefficient matrix, respectively. This method has been widely used as an effective dimensionality reduction technique in signal processing and image analysis [36]. By its nature, the L matrix can be directly used for clustering purposes, which is analogous to K-means if we impose an orthogonality constraint on the L matrix. Specifically, if Lkj > Lij for all i ≠ k, the data point xj is assigned to the k-th cluster. The column vectors of the C matrix indicate the cluster centroids. Please refer to [32] for a representative study using NMF for disease subtyping.
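
As an illustration, the sketch below (assuming scikit-learn and a toy nonnegative feature matrix; all variable names are ours) factorizes Xt into C and L following the chapter’s convention of one subject per column and assigns each patient to the component with the largest loading. Note that scikit-learn’s generic NMF is used here as a stand-in, not the specific NMF variants cited above.

import numpy as np
from sklearn.decomposition import NMF

rng = np.random.default_rng(0)
X_t = rng.random((100, 200))        # toy data: p=100 nonnegative features x m=200 patients

k = 3
nmf = NMF(n_components=k, init="nndsvda", max_iter=500, random_state=0)
C = nmf.fit_transform(X_t)          # component matrix, shape (p, k): cluster "centroid" patterns
L = nmf.components_                 # loading coefficient matrix, shape (k, m)

labels = L.argmax(axis=0)           # assign patient j to the component with the largest loading
print(np.bincount(labels))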

2.1.3 Hierarchical Clustering

Hierarchical clustering aims to build a hierarchy of clusters and includes two types of approaches: agglomerative and divisive [18]. In general, the merges and splits are determined greedily and presented in a dendrogram. In both cases, a measure of dissimilarity between sets of observations is required. Most commonly, this is achieved by using an appropriate metric (e.g., Euclidean distance) together with a linkage criterion that specifies the dissimilarity of sets as a function of the pairwise distances of observations. Please refer to [24, 25, 30, 37, 38] for representative studies using hierarchical clustering for disease subtyping.
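
A minimal agglomerative example using SciPy is sketched below; the Ward linkage and the cut into three clusters are arbitrary illustrative choices, and the toy data are ours.

import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(0)
X_patients = rng.normal(size=(200, 50))                      # toy patient feature matrix

Z = linkage(X_patients, method="ward", metric="euclidean")   # greedy agglomerative merges (dendrogram encoding)
labels = fcluster(Z, t=3, criterion="maxclust")              # cut the tree into 3 clusters
print(np.bincount(labels))                                   # note: fcluster labels are 1-based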

2.1.4 Representative Unsupervised Clustering Methods

Sustain [10] is an unsupervised clustering method for subtype and stage inference. Specifically, Sustain defines a subtype as a group of subjects with a particular biomarker progression pattern. The biomarker evolution of each subtype is modeled as a linear z-score model, a continuous generalization of the original event-based model [39], in which each biomarker follows a piecewise linear trajectory over a common timeframe. The key advantage of this model is that it can work with purely cross-sectional data and derive imaging signatures of subtype and stage simultaneously.

A Bayesian latent Dirichlet allocation model [11] was proposed to extract latent AD-related atrophy factors. This probabilistic approach hypothesizes that each patient expresses one or more latent factors, and that each factor is associated with distinct but possibly overlapping atrophy patterns. However, due to the nature of latent Dirichlet allocation, the input images have to be discretized. Moreover, this method exclusively models brain atrophy and ignores brain enlargement; for example, larger basal ganglia volumes have been associated with one subtype of schizophrenia [12].

2.2 Semi-supervised Clustering

Semi-supervised clustering methods dissect the subtle heterogeneity of interest under the principle of deriving data-driven yet neurobiologically plausible subtypes (Fig. 2). In essence, these methods seek the “1-to-k” mapping between the reference CN group and the PT group, thereby teasing out clusters that are likely driven by distinct pathological trajectories rather than by global similarity/dissimilarity in the data, which is the driving principle of conventional unsupervised clustering methods.

Fig. 2
Schematic diagram of semi-supervised clustering methods, illustrating the “1-to-k” mapping from the reference group to subtypes 1, 2, and 3. Figure adapted from [14]

In the following subsections, we briefly discuss four semi-supervised clustering methods. These methods employ different techniques to seek this “1-to-k” mapping. In particular, CHIMERA [16] and Smile-GAN [9] utilize generative models to achieve this mapping, while HYDRA [6] and MAGIC [8] are built on top of discriminative models.

Box 1: Representative Semi-supervised Clustering Methods

The central principle of semi-supervised clustering methods is to seek the “1-to-k” mapping from the reference domain to the patient domain.

  • CHIMERA: a generative approach that leverages the coherent point drift algorithm to map the data distribution of the CN group to that of the PT group, thereby enabling subtyping via the k distinct regularized transformations.

  • Smile-GAN: a generative approach based on GANs that learns multiple distinct mappings by generating PT data from CN data. Simultaneously, a clustering model is trained interactively with the mapping functions to assign PT samples to the corresponding subtype memberships.

  • HYDRA: a discriminative approach that leverages multiple linear support vector machines to construct a polytope that clusters the patients according to the patterns of differences between the CN group and the PT group.

  • MAGIC: a generalization of HYDRA that aims to dissect disease heterogeneity at multiple imaging scales for a scale-consistent solution.

2.2.1 CHIMERA

CHIMERA employs a generative probabilistic approach, considers all samples as points in the imaging space, and infers the clusters from the transformations between the CN and PT distributions. It hypothesizes that the PT distribution can be generated from the CN distribution under k sets of transformations, each reflecting a distinct disease process.

Mathematically, the transformation T is a convex combination of k linear transformations that map a CN subject from the reference space to the target space: \( {\boldsymbol{x}}_i^r\in {\mathbb{R}}^p\to {\boldsymbol{x}}_i^t=\boldsymbol{T}\left({\boldsymbol{x}}_i^r\right)={\sum}_{j=1}^k{\xi}_j{\boldsymbol{T}}_j\left({\boldsymbol{x}}_i^r\right) \), where ξj is the probability that a patient belongs to the j-th subtype. Ideally, if the disease subtypes were perfectly distinct, ξj would take the value 1 for the transformation corresponding to the patient’s subtype and 0 otherwise. At its core, the coherent point drift algorithm [40], a generative probabilistic approach, is used to estimate the transformation T. Specifically, each CN sample is mapped into the PT domain and regarded as the centroid of a spherical Gaussian cluster, whereas the patient points are treated as independent and identically distributed data generated by a Gaussian mixture model (GMM) with equal weights for each cluster. The goal is to maximize the data likelihood during distribution matching while also accounting for covariate confounds (e.g., age and gender). The expectation-maximization algorithm is adopted to optimize the resulting energy objective. Once the optimized transformations Tj are obtained, clustering inference is straightforward: a patient is assigned the subtype membership corresponding to the largest likelihood.
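
To make the inference step concrete, the following toy sketch assigns patients to subtypes given k already-estimated linear transformations (the EM estimation itself is not shown). The function name assign_subtypes, the spherical unit covariance, and the synthetic data are our own illustrative assumptions, not part of the CHIMERA implementation.

import numpy as np

def assign_subtypes(X_cn, X_pt, transforms, sigma2=1.0):
    # X_cn: (q, p) controls; X_pt: (m, p) patients; transforms: list of k (A, b) linear maps.
    m, k = X_pt.shape[0], len(transforms)
    mix_lik = np.zeros((m, k))
    for j, (A, b) in enumerate(transforms):
        centers = X_cn @ A.T + b                                       # CN points mapped by T_j into the PT domain
        d2 = ((X_pt[:, None, :] - centers[None, :, :]) ** 2).sum(-1)   # (m, q) squared distances
        mix_lik[:, j] = np.exp(-d2 / (2.0 * sigma2)).mean(axis=1)      # spherical-Gaussian mixture (up to a constant)
    return mix_lik.argmax(axis=1)                                      # subtype with the highest likelihood

# Toy usage with two "subtypes" generated by shifting the controls in different directions.
rng = np.random.default_rng(0)
p, q = 10, 50
X_cn = rng.normal(size=(q, p))
transforms = [(np.eye(p), 2.0 * np.eye(p)[0]), (np.eye(p), 2.0 * np.eye(p)[1])]  # pretend these were estimated by EM
X_pt = np.vstack([X_cn[:25] + transforms[0][1], X_cn[25:] + transforms[1][1]])
print(np.bincount(assign_subtypes(X_cn, X_pt, transforms)))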

2.2.2 Smile-GAN

Smile-GAN is a novel generative deep learning approach based on generative adversarial networks (GANs). The reader may refer to Chap. 5 for generic information about GANs. Smile-GAN aims to learn a mapping function f from the joint CN domain \( \mathcal{X} \) and subtype domain \( \mathcal{Z} \) to the PT domain \( \mathcal{Y} \) by transforming CN data x into different synthesized PT data y′ = f(x, z) that are indistinguishable from real PT data y by the discriminator D. The mapping function f is regularized for inverse consistency, with a clustering function \( g:\mathcal{Y}\to \mathcal{Z} \) trained interactively to reconstruct z from the synthesized PT data y′. After training, the clustering function g can be directly used to cluster both training and unseen test data.

More specifically, the three data distributions are denoted as x ∼ pCN (for controls), y ∼ pPT (for patients), and z ∼ pSub (for subtypes), where z is sampled from a discrete uniform distribution and encoded as a one-hot vector of dimension K (the number of clusters). The mapping function \( f:\mathcal{X}\times \mathcal{Z}\to \mathcal{Y} \) and the clustering function \( g:\mathcal{Y}\to \mathcal{Z} \) are learned through the following training procedure (lc denotes the cross-entropy loss):

$$ f,g=\arg \underset{f,g}{\min}\underset{D}{\max }{L}_{\mathrm{GAN}}\left(D,f\right)+\mu {L}_{\mathrm{change}}(f)+\lambda {L}_{\mathrm{cluster}}\left(f,g\right) $$
(1)

where

$$ {\displaystyle \begin{array}{rl}{L}_{\mathrm{GAN}}\left(D,f\right)& ={\mathbbm{E}}_{\mathbf{y}\sim {p}_{\mathrm{PT}}}\left[\log \left(D\left(\boldsymbol{y}\right)\right)\right]\\ {}& \kern1em +{\mathbbm{E}}_{\mathbf{z}\sim {p}_{\mathrm{Sub}},\mathbf{x}\sim {p}_{\mathrm{CN}}}\left[\log \left(1-D\left(f\left(\boldsymbol{x},\boldsymbol{z}\right)\right)\right)\right]\end{array}} $$
(2)
$$ {L}_{\mathrm{change}}(f)\kern0.5em ={\mathbbm{E}}_{\mathbf{x}\sim {p}_{\mathrm{CN}},\mathbf{z}\sim {p}_{\mathrm{Sub}}}\left[{\left\Vert f\Big(\boldsymbol{x},\boldsymbol{z}\Big)-\boldsymbol{x}\right\Vert}_1\right]\kern0.5em $$
(3)
$$ {L}_{\mathrm{cluster}}\left(f,g\right)\kern0.5em ={\mathbbm{E}}_{\mathbf{x}\sim {p}_{\mathrm{CN}},\mathbf{z}\sim {p}_{\mathrm{Sub}}}\left[{l}_c\left(\boldsymbol{z},g\left(f\left(\boldsymbol{x},\boldsymbol{z}\right)\right)\right)\right]\kern0.5em $$
(4)

The objective consists of the adversarial loss LGAN and two regularization terms, Lchange and Lcluster. The adversarial loss forces the synthesized PT data to follow a distribution similar to that of the real PT data. The discriminator D, which tries to distinguish synthesized from real PT data, attempts to maximize this loss, while the mapping function f attempts to minimize it. The two regularization terms constrain the function class from which the mapping function f is drawn, so that f remains meaningful while matching the distributions. Minimizing Lchange encourages sparsity of the regions changed by f, under the assumption that only some regions are affected by the disease. Optimizing Lcluster ensures that the subtype variable z can be reconstructed from the synthesized PT data y′, so that the mutual information between z and y′ is maximized and distinct imaging patterns are synthesized when z takes different values. Further regularization is imposed by forcing the mapping function f and the clustering function g to be Lipschitz continuous. More importantly, thanks to the inverse consistency induced by Lcluster, the function g can directly output cluster probabilities and cluster labels for unseen test PT data.
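
The sketch below illustrates how the three loss terms of Eq. 1 could be computed for a single toy batch in PyTorch; the tiny multilayer perceptrons, hyperparameter values, and variable names are our own simplifications and do not reproduce the authors’ architecture or training schedule (e.g., the Lipschitz regularization is omitted).

import torch
import torch.nn as nn
import torch.nn.functional as F

p, K = 50, 3                                      # toy feature dimension and number of subtypes
f = nn.Sequential(nn.Linear(p + K, 64), nn.ReLU(), nn.Linear(64, p))            # mapping f(x, z) -> synthesized PT
g = nn.Sequential(nn.Linear(p, 64), nn.ReLU(), nn.Linear(64, K))                # clustering g(y) -> logits over z
D = nn.Sequential(nn.Linear(p, 64), nn.ReLU(), nn.Linear(64, 1), nn.Sigmoid())  # discriminator

x = torch.randn(32, p)                            # a batch of CN data (toy)
y = torch.randn(32, p) + 1.0                      # a batch of real PT data (toy)
z_idx = torch.randint(0, K, (32,))                # z ~ discrete uniform
z = F.one_hot(z_idx, K).float()                   # one-hot subtype variable

y_fake = f(torch.cat([x, z], dim=1))              # y' = f(x, z)

L_gan = torch.log(D(y)).mean() + torch.log(1 - D(y_fake)).mean()  # adversarial term (Eq. 2)
L_change = (y_fake - x).abs().sum(dim=1).mean()                   # L1 change penalty (Eq. 3)
L_cluster = F.cross_entropy(g(y_fake), z_idx)                     # reconstruct z from y' (Eq. 4)

mu, lam = 5.0, 1.0                                  # illustrative weights
gen_loss = L_gan + mu * L_change + lam * L_cluster  # minimized w.r.t. f and g (D held fixed)
disc_loss = -L_gan                                  # D maximizes L_gan, i.e., minimizes its negative
# In practice, separate optimizers alternate between (f, g) and D; after training,
# applying g to real PT data yields subtype probabilities for unseen patients.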

2.2.3 HYDRA

In contrast to the generative approaches used in CHIMERA and Smile-GAN, HYDRA leverages a widely used discriminative method, the support vector machine (SVM), to seek the “1-to-k” mapping. The novelty is that HYDRA extends multiple linear SVMs to the nonlinear case in a piecewise fashion, thereby simultaneously serving classification and clustering. Specifically, it constructs a convex polytope by combining the hyperplanes of k linear SVMs, separating the CN group from the k subpopulations of the PT group. Intuitively, each face of the convex polytope can be regarded as encoding one subtype, capturing a distinct disease effect.

The convex polytope is estimated by sequentially solving each linear SVM as a subproblem under the principle of sample-weighted SVM [41]. The optimization stops when the sample weights become stable, i.e., when the polytope is stably established. The objective of maximizing the polytope’s margin can be summarized as

$$ {\displaystyle \begin{array}{rl}\underset{{\left\{{\boldsymbol{w}}_j,{b}_j\right\}}_{j=1}^k}{\min }& \sum \limits_{j=1}^k\frac{{\left\Vert {\boldsymbol{w}}_j\right\Vert}_2^2}{2}+\mu \sum \limits_{i\mid {y}_i=+1}\sum \limits_{j=1}^k{s}_{i,j}\max \left\{0,1-{\boldsymbol{w}}_j^T{\boldsymbol{x}}_i-{b}_j\right\}\\ {}& \kern1em +\mu \sum \limits_{i\mid {y}_i=-1}\sum \limits_{j=1}^k\frac{1}{k}\max \left\{0,1+{\boldsymbol{w}}_j^T{\boldsymbol{x}}_i+{b}_j\right\}\end{array}} $$
(5)

where wj and bj are the weight and bias of each hyperplane, respectively, μ is a penalty parameter on the training error, and S is the subtype membership matrix of dimension m × k, indicating whether patient sample i belongs to subtype j. The cluster membership is inferred as follows:

$$ {\boldsymbol{S}}_{i,j}=\left\{{\displaystyle \begin{array}{ll}1,\kern1em & j=\arg \kern0.2em {\max}_j\left({\boldsymbol{w}}_j^T{\boldsymbol{x}}_i+{b}_j\right)\\ {}0,\kern1em & \mathrm{otherwise}\end{array}}\right. $$
(6)
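
The alternating optimization can be sketched as follows using sample-weighted linear SVMs from scikit-learn. This is a simplified illustration of Eqs. 5 and 6 under our own assumptions (the function name hydra_like, the weight floor, and the stopping rule are ours), not the reference HYDRA implementation.

import numpy as np
from sklearn.svm import LinearSVC

def hydra_like(X_cn, X_pt, k=3, C=1.0, n_iter=20, seed=0):
    """Simplified sketch of HYDRA-style alternating optimization (illustrative only)."""
    rng = np.random.default_rng(seed)
    X = np.vstack([X_cn, X_pt])
    y = np.hstack([-np.ones(len(X_cn)), np.ones(len(X_pt))])                # CN = -1, PT = +1
    S = rng.multinomial(1, np.ones(k) / k, size=len(X_pt)).astype(float)    # random hard initialization of memberships

    for _ in range(n_iter):
        scores = np.zeros((len(X_pt), k))
        for j in range(k):
            w = np.hstack([np.full(len(X_cn), 1.0 / k),          # each control counts 1/k for every face
                           np.maximum(S[:, j], 1e-3)])           # patients weighted by membership in face j
            svm = LinearSVC(C=C, max_iter=5000).fit(X, y, sample_weight=w)
            scores[:, j] = svm.decision_function(X_pt)           # w_j^T x + b_j for each patient
        S_new = np.eye(k)[scores.argmax(axis=1)]                 # Eq. 6: assign to the face with the largest score
        if np.array_equal(S_new, S):                             # stop when memberships are stable
            break
        S = S_new
    return S.argmax(axis=1)

# Toy usage on synthetic data with two patient subgroups.
rng = np.random.default_rng(1)
X_cn = rng.normal(size=(100, 20))
X_pt = np.vstack([rng.normal(size=(50, 20)) + 2, rng.normal(size=(50, 20)) - 2])
print(np.bincount(hydra_like(X_cn, X_pt, k=2)))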

2.2.4 MAGIC

MAGIC was proposed to overcome one of the main limitations that HYDRA faces: a single-scale set of features (e.g., atlas-based regions of interest) may not be sufficient to capture the subtle differences, relative to global demographic effects, that underlie disease heterogeneity, since ample evidence has shown that the brain is fundamentally composed of multi-scale structural and functional entities. To this end, MAGIC extracts multi-scale features in a coarse-to-fine granular fashion via stochastic orthogonal projective nonnegative matrix factorization (opNMF) [42], an effective, unbiased, and data-driven method for extracting biologically interpretable and reproducible feature representations. Together with these multi-scale features, HYDRA is embedded into a double-cyclic optimization procedure to yield robust and scale-consistent clustering solutions.

MAGIC encapsulates the two previously proposed methods (i.e., opNMF and HYDRA) and optimizes the clustering objective for each single-scale feature set as a sub-optimization problem. To fuse the multi-scale clustering information and enforce scale-consistent clusters, it adopts a double-cyclic procedure that transfers and fine-tunes the clustering polytope. First, in the inner cyclic procedure, recall that HYDRA determines the clusters through the subtype membership matrix S: MAGIC initializes the S matrix with a specific single-scale feature set Li, and the S matrix is then transferred to the next feature set Li+1 until a predefined stopping criterion is reached (i.e., the clustering solution is stable across scales). Second, in the outer cyclic procedure, the inner cyclic procedure is repeated, initializing with each single-scale feature set in turn. Finally, to determine the final subtype assignment, a consensus clustering is performed by computing a co-occurrence matrix from all the clustering results and then applying spectral clustering [43].
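
As an illustration of the final consensus step only (the double-cyclic transfer of S across scales is not shown), the sketch below builds a co-occurrence matrix from several clustering results and applies spectral clustering; the function name and the toy label sets are ours.

import numpy as np
from sklearn.cluster import SpectralClustering

def consensus_labels(label_sets, k):
    """Fuse clustering results from multiple scales via a co-occurrence matrix (illustrative sketch)."""
    label_sets = np.asarray(label_sets)              # shape (n_scales, m): one label vector per scale
    m = label_sets.shape[1]
    co = np.zeros((m, m))
    for labels in label_sets:
        co += (labels[:, None] == labels[None, :])   # +1 each time two patients land in the same cluster
    co /= len(label_sets)                            # fraction of scales on which they co-cluster
    sc = SpectralClustering(n_clusters=k, affinity="precomputed", random_state=0)
    return sc.fit_predict(co)

# Toy usage: three "scales" that largely agree on two subtypes of eight patients.
runs = [[0, 0, 0, 0, 1, 1, 1, 1],
        [0, 0, 0, 1, 1, 1, 1, 1],
        [1, 1, 1, 1, 0, 0, 0, 0]]                    # label permutations across scales are handled naturally
print(consensus_labels(runs, k=2))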

3 Application to Brain Disorders

Brain disorders and diseases affect the human brain across a wide age range. Neurodevelopmental disorders, such as autism spectrum disorders (ASD), are usually present from early childhood and affect daily functioning [44]. Psychotic disorders, such as schizophrenia, involve psychosis that is typically diagnosed for the first time in late adolescence or early adulthood [45]. Dementia and mild cognitive impairment (MCI) prevail both in late mid-life for early-onset AD (usually 30–60 years of age) and most frequently in late-life for late-onset AD (usually over 65 years of age) [46]. Brain cancers in children and adults are heterogeneous and encompass over 100 different histological types of tumors, based on cells of origin and other histopathological features, and have substantial morbidity and mortality [47]. Ample clinical evidence encourages the stratification of the patients in these brain disorders and cancers, potentially paving the road toward individualized precision medicine.

This section collectively overviews previous work aiming to unravel imaging-derived heterogeneity in ASD, psychosis, major depressive disorders (MDD), MCI and AD, and brain cancer.

3.1 Autism Spectrum Disorder

ASD encompasses a broad spectrum of social deficits and atypical behaviors [48]. Heterogeneity of its clinical presentation has sparked massive research efforts to find subtypes to better delineate its diagnosis [49, 50]. Recent initiatives to aggregate neuroimaging data of ASD, such as the ABIDE [51] and the EU-AIMS [52], also have motivated large-scale subtyping projects using imaging signatures [53].

Different clustering methods have been applied to reveal structural brain-based subtypes, but these were primarily traditional techniques such as K-means [54] or hierarchical clustering [37]. Besides structural MRI, functional MRI [55] and EEG [56] have also been popular modalities. For the reasons discussed earlier, normative clustering and dimensional analyses are better suited to parse a patient population that is highly heterogeneous [57]. However, efforts in this direction remain preliminary, with only a few recent publications using cortical thickness [58]. Taken together, although more validation and replication efforts are necessary to define reliable neuroanatomical subtypes of ASD, some convergence in findings has been noted [53]. First, most sets of ASD neuroimaging subtypes indicate a combination of both increases and decreases in imaging features compared to the CN group, instead of pointing in a uniform direction. Second, most subtypes are characterized by spatially distributed imaging patterns rather than isolated or focal patterns. Both findings emphasize the significant heterogeneity of ASD brains and the need for better stratification.

The search for subtypes in the ASD population has unique challenges. First, the early onset of ASD implies that it is heavily influenced by neurodevelopmental processes; depending on the selected age range, the results may differ significantly. Second, ASD is more prevalent in males, with three to four male cases for every female case [59], which adds a layer of potential bias. Third, individuals with ASD often have psychiatric comorbidities, such as ADHD, anxiety disorders, and obsessive-compulsive disorder, among many others [60], which, if not screened carefully, can dilute or alter the true signal.

3.2 Psychosis

Psychosis is a medical syndrome characterized by unusual beliefs called delusions and sometimes hallucinations of visions, sounds, smells, or body sensations that are not present in reality. Symptoms, functioning, and outcomes are highly heterogeneous across individuals, leading to long-standing hypotheses of underlying brain subgroups. However, objective brain biomarkers have largely not been discovered for any psychosis diagnosis, stage, or clinically defined subgroup [61, 62]. Neuroimaging studies are also affected by brain heterogeneity [63, 64]. Recent research has thus focused on finding structural brain subtypes using unbiased statistical techniques [12, 13, 65].

Psychosis studies have mainly focused on determining subtypes by clustering brain structural data within the chronic schizophrenia population that has had the illness for years, with results demonstrating two [12, 13], three [26], and six [31] subgroups. Various clustering techniques have been used to achieve these outcomes, including conventional approaches, such as k-means, in addition to more advanced machine learning methods, such as semi-supervised learning. A limitation of the work so far has been the lack of internal or external validation. Still, in studies with robust internal validation methods using metrics that choose the optimal cluster number based on the stability of the solution (e.g., consensus clustering), subtypes cluster along the lines of the severity of brain differences.

In a recent study with the largest sample to date (n = 671), individuals with chronic schizophrenia were clustered using HYDRA, and multiple internal validation procedures were applied (i.e., cross-validation resampling, split-half reproducibility, and leave-site-out validation) [12]. A two-subtype solution was found, with one subtype demonstrating widespread reductions and the other showing a localized larger volume of the striatum that was not associated with antipsychotic use. Interestingly, there were limited associations with current psychosis symptoms in this work, but there were indications of associations with education and illness duration in specific subtypes.

Functional imaging has also been used to define psychosis subgroups, using functional connectivity at rest [66] and effective connectivity during task performance [67]. This research commonly involves relatively small sample sizes and little internal or external validation. Still, preliminary results from these works demonstrate that clusters can follow diagnostic divisions between individuals with psychosis [67] and that specific networks (e.g., the frontoparietal network) are associated with specific psychotic symptoms [66, 67]. A recent advanced deep learning approach has also revealed clinical separations along the lines of symptom severity [68]. Taken together with the brain structural results, it is possible that functional imaging maps onto symptom states rather than the underlying illness traits captured by structural imaging. Further internal and external validation work is required to investigate this hypothesis by characterizing, comparing, and ultimately combining clustering solutions. A critical future direction will also be to conduct longitudinal studies that track individuals over time. Such research could lead the way toward clinical translation.

3.3 Major Depressive Disorder

MDD is a common, severe, and recurrent disorder, with over 300 million people affected worldwide, and is characterized by low mood, apathy, and social withdrawal, with symptoms spanning multiple domains [69]. Its vast heterogeneity is exemplified by the fact that according to DSM-5 criteria, at least 227 and up to 16,400 unique symptom presentations exist [70, 71]. The potential causes for this heterogeneity vary from divergent clinical symptom profiles to genetic etiologies and individual differences in treatment outcomes.

Despite neurobiological findings in MDD spanning cortical thickness, gray matter volume (GMV), and fractional anisotropy (FA) measures, objective brain biomarkers that can be used to diagnose and predict disease course and outcome remain elusive [71,72,73]. Recently, there have been efforts to identify neurobiologically based subtypes of depression using a bottom-up approach, mainly using data from resting-state fMRI [71]. Several studies [33,34,35] employed k-means clustering or group iterative multiple model estimation to identify two functional connectivity subtypes, while Tokuda et al. [74] and Drysdale et al. [75] identified three and four subtypes, respectively, using nonparametric Bayesian mixture models and hierarchical clustering. These subtypes are characterized by reduced connectivity in different networks, including the default mode network (DMN) and the ventral attention network, as well as by frontostriatal and limbic dysfunction. Regarding structural neuroimaging, one study used k-means clustering on FA data to identify two depression subtypes: the first was characterized by decreased FA in the right temporal lobe and right middle frontal areas and was associated with an older age at onset, whereas the second was characterized by increased FA in the left occipital lobe and was associated with a younger age at onset [76].

Current research on the identification of brain subtypes in MDD has produced promising results, but these are confounded by methodological and design limitations. While some studies have shown clinical promise, such as predicting higher depressive symptomatology and lower sustenance of positive mood [34, 35], depression duration [33], and TMS therapy response [75], they are limited by relatively small sample sizes; nuisance variance due to age, gender, and common ancestry; lack of external validation; and lack of statistical significance testing of the identified clusters. Furthermore, there has been a lack of ambition in the use of novel clustering techniques. Clustering based on structural neuroimaging is limited compared to other disease entities and is an avenue that future research should consider. Future studies should also aim to perform longitudinal clustering to elucidate the stability of identified brain subtypes over time and to examine their utility in predicting disease outcomes.

3.4 MCI and AD

AD, along with its prodromal stage, MCI, is the most common neurodegenerative disease, affecting millions across the globe. Although a plethora of imaging studies have derived AD-related imaging signatures, most have ignored the heterogeneity within AD. Recently, there has been a developing body of effort to derive imaging signatures of AD that are heterogeneity-aware (i.e., subtypes) [7,8,9,10,11].

Most previous studies leveraged unsupervised clustering methods such as Sustain [10], NMF [32], latent Dirichlet allocation [11], and hierarchical clustering [24, 25, 30, 38]. Other papers [6, 9, 20, 77, 78] utilized semi-supervised clustering methods. Due to variability in the choice of databases and methodologies, and the lack of ground truth in the context of clustering, the reported numbers of clusters and the subtypes’ neuroanatomical patterns differ and cannot be directly compared. The targeted heterogeneous population also varies across papers. For instance, [6] focused on dissecting the neuroanatomical heterogeneity of AD patients, while [77] included AD plus MCI and [20] studied MCI only. Nevertheless, some common subtypes were found across studies. First, a subtype showing a typical diffuse atrophy pattern over the entire brain was observed in several studies [6, 8,9,10, 22, 27, 29, 30, 32, 38, 77]. Another subtype demonstrating nearly normal brain anatomy was robustly identified [8, 9, 16, 20, 22, 24, 25, 29, 30]. Moreover, several studies [8, 9, 29, 30, 77] also reported a subtype showing an atypical AD pattern (i.e., with the hippocampus or medial temporal lobe relatively spared from atrophy).

Though these methods have enabled a better understanding of heterogeneity in AD, limitations and challenges remain. First, due to demographic variations and the existence of comorbidities, it is not guaranteed that models cluster the data based on variations in the pathology of interest. Semi-supervised methods may tackle this problem to some extent, but more careful sample selection and further study with longitudinal data may be needed to ensure disease specificity. Second, spatial differences and temporal changes may simultaneously contribute to the subtypes derived through clustering methods. Third, subtypes captured from neuroimaging data alone bring limited insight into disease treatment; therefore, a joint study of neuroimaging and genetic heterogeneity may provide greater clinical value [14, 79].

3.5 Brain Cancer

Brain tumors, such as glioblastoma (GBM), exhibit extensive inter- and intra-tumor heterogeneity, diffuse infiltration, and invasiveness of various immune and stromal cell populations, which pose diagnostic and prognostic challenges and render standard therapies futile [80]. Deciphering the underlying heterogeneity of brain tumors, which arises from their genomic instability, plays a key role in understanding and predicting the course of tumor progression and its response to standard therapies, and thereby in designing effective therapies targeted at aberrant genetic alterations [81, 82]. Medical imaging noninvasively portrays, on a macroscopic scale, the phenotypic differences of brain tumors and their microenvironment caused by the molecular activities of the tumors [83, 84]. It has the potential to provide readily accessible surrogate biomarkers of particular genomic alterations, predict response to therapy, avoid the risks of tumor biopsy or inaccurate diagnosis due to sampling errors, and ultimately support the development of personalized therapies to improve patient outcomes. An imaging subtype of brain tumors may provide a wealth of information about the tumor, including distinct molecular pathways [85, 86].

Recent studies on radiomic analysis of multiparametric MRI (mpMRI) scans provide evidence of distinct phenotypic presentations of brain tumors associated with specific molecular characteristics. These studies propose that quantification of tumor morphology, texture, regional microvasculature, cellular density, or microstructural properties can map to different imaging subtypes. In particular, one study [87] discovered three distinct imaging subtypes of GBM through unsupervised clustering of these features, with significant differences in survival probabilities and associations with specific molecular signaling pathways. These imaging subtypes, namely solid, irregular, and rim-enhancing, were significantly linked to different clinical outcomes and molecular characteristics, including isocitrate dehydrogenase-1, O6-methylguanine-DNA methyltransferase, epidermal growth factor receptor variant III, and transcriptomic molecular subtype composition.

These studies have offered new insights into the characterization of tumor heterogeneity at both the microscopic (i.e., histological and molecular) and macroscopic (i.e., imaging) levels, consequently providing a more comprehensive understanding of tumor aggressiveness and patient prognosis and, ultimately, supporting the development of personalized treatments.

4 Conclusion

Taken together, these novel clustering algorithms, tailored for high-resolution yet highly variable neuroimaging datasets, have demonstrated broad utility in disease subtyping across many neurological and psychiatric conditions. At the same time, caution is needed in order not to overclaim the biological importance of subtypes, since all clustering methods find patterns in data, even if such patterns do not have a meaningful underlying biological correlate [88]. External validation is necessary. For instance, evidence from post hoc evaluations, e.g., differences in clinical variables or genetic architectures, can support the biological relevance of identified neuroimaging-based subtypes [14]. Moreover, good practices such as split-sample analysis, permutation tests [12], and comparisons against semi-simulated experiments [8] help discern the robustness of the subtypes. As dataset sizes and imaging resolution improve over time, unique computational challenges are expected to appear, along with unique opportunities to further refine our methodologies for deciphering the diversity of brain diseases.