Unsupervised multimodal modeling of cognitive and brain health trajectories for early dementia prediction

Unsupervised multimodal modeling of cognitive and brain health trajectories for early dementia prediction

Play all audios:

Loading...

ABSTRACT Predicting the course of neurodegenerative disorders early has potential to greatly improve clinical management and patient outcomes. A key challenge for early prediction in


real-world clinical settings is the lack of labeled data (i.e., clinical diagnosis). In contrast to supervised classification approaches that require labeled data, we propose an unsupervised


multimodal trajectory modeling (MTM) approach based on a mixture of state space models that captures changes in longitudinal data (i.e., trajectories) and stratifies individuals without


using clinical diagnosis for model training. MTM learns the relationship between states comprising expensive, invasive biomarkers (β-amyloid, grey matter density) and readily obtainable


cognitive observations. MTM training on trajectories stratifies individuals into clinically meaningful clusters more reliably than MTM training on baseline data alone and is robust to


missing data (i.e., cognitive data alone or single assessments). Extracting an individualized cognitive health index (i.e., MTM-derived cluster membership index) allows us to predict


progression to AD more precisely than standard clinical assessments (i.e., cognitive tests or MRI scans alone). Importantly, MTM generalizes successfully from research cohort to real-world


clinical data from memory clinic patients with missing data, enhancing the clinical utility of our approach. Thus, our multimodal trajectory modeling approach provides a cost-effective and


non-invasive tool for early dementia prediction without labeled data (i.e., clinical diagnosis) with strong potential for translation to clinical practice. SIMILAR CONTENT BEING VIEWED BY


OTHERS A MULTIDIMENSIONAL ODE-BASED MODEL OF ALZHEIMER’S DISEASE PROGRESSION Article Open access 23 February 2023 COGNITIVE AND MRI TRAJECTORIES FOR PREDICTION OF ALZHEIMER’S DISEASE Article


Open access 22 January 2021 CHARACTERIZING HETEROGENEITY IN ALZHEIMER’S DISEASE PROGRESSION: A SEMIPARAMETRIC MODEL Article Open access 05 March 2025 INTRODUCTION Dementia due to


Alzheimer’s disease (AD) involves a cascade of pathophysiological processes from normal cognition to Mild Cognitive Impairment (MCI) to dementia, with different markers of progression across


disease stages1,2. Despite decades of research and development, clinical trials of potential disease-modifying treatments for dementia have remained largely unsuccessful. However, recent


developments in drug discovery (e.g.3,4) call for interventions earlier in the progression of disease5,6 to enhance patient outcomes and aid future clinical trials. Identifying individuals


at-risk and predicting dementia early (i.e., at early disease stages or before the onset of symptoms) have strong potential to impact clinical management, drug discovery, and treatment


outcomes. Yet, early dementia prediction remains challenging in the following key respects. First, neuroimaging-derived biomarkers (i.e., MRI, PET) have been shown to be important for


detecting neurodegeneration. For Alzheimer’s Disease (AD) in particular, there is a strong link between biomarkers (i.e., β-amyloid accumulation, tau accumulation, neurodegeneration) and


symptoms (i.e., cognitive decline)1,2,7,8. However, PET scans that use radioactive contrast agents to extract biomarkers (β-amyloid, tau) are invasive and expensive (e.g., over $3000 for an


Amyloid PET scan in the USA9) for large-scale use in the general population, resulting in health inequalities related to their availability across healthcare settings. Addressing this


challenge raises the need for early prediction from low-cost and non-invasive measures (e.g., cognitive tests alone). Second, predicting dementia at early or pre-symptomatic stages of the


disease means that individuals have not yet been assigned a clinical diagnosis. Recent work on machine learning and mathematical modeling for dementia prediction predominantly focusses on


supervised models (e.g., SVMs, neural networks) using cross-sectional labelled data from patients that have clinical diagnoses (see, e.g.,10,11,12, for recent reviews). In contrast, making


predictions based on unlabeled data requires unsupervised modeling approaches. Third, the current diagnostic framework suffers from misdiagnosis (estimated sensitivity 70.9–87.3%;


specificity: 44.3–70.8%)13,14 in the early stages of Mild Cognitive Impairment (MCI), potentially due to age-related comorbidities (e.g., geriatric depression, stroke) that result in


cognitive decline15,16. Modeling approaches that make predictions based on binary clinical labels risk incorporating this misdiagnosis into their predictions. Thus, novel trajectory modeling


approaches are needed to reduce misdiagnosis at early stages by capturing changes in biomarkers and cognition based on longitudinal data rather than classifying patients based on clinical


diagnosis. To tackle these key challenges for early dementia prediction, we propose an unsupervised multimodal trajectory modeling (MTM) approach based on a mixture of state space models


that trains on longitudinal data (states, observations) to learn relationships between the progression of neuroimaging-derived biomarkers and cognitive data and stratifies individual


trajectories into clusters of cognitive health (Fig. 1). States correspond to neuroimaging-based biomarkers (β-amyloid, grey matter density): these features are available during model


training but may be unavailable or missing when the model is tested (a scenario known in machine learning as Learning with Privileged Information17). Observations correspond to cognitive


test scores that can be collected inexpensively and non-invasively at scale and have been shown to relate to biomarker levels18,19,20. Our modeling assumptions allow cluster formation to be


driven by the biomarker dynamics that characterize the progression of AD, with cognitive measurements tracking biomarker values. We train the MTM on longitudinal data from the Alzheimer’s


Disease Neuroimaging Initiative (ADNI;21) and we derive clusters that stratify individuals (from independent test data) based on cognitive decline and clinical outcomes (i.e., clinical


diagnosis) that have not been used in model training. Further, we demonstrate that the trained multimodal trajectory model stratifies individuals into discriminable clusters when tested with


cognitive data alone (i.e., observations without access to biomarkers) or a single assessment (rather than longitudinal data). Importantly, MTM allows us to derive an individualized


cognitive health index based on cluster membership that predicts conversion to AD more precisely than standard clinical data (grey matter atrophy, β-amyloid, cognitive scores). Finally, we


provide out-of-sample validation by testing the trained MTM (i.e., training on research cohort data: ADNI)-on real-world patient data from memory clinics (Memory Ageing & Cognition


Centre at the National University of Singapore: MACC). MTM generalizes successfully, despite missing data, stratifying MACC patients into distinct clusters, as profiled by clinical diagnoses


and clinical scales (i.e., mini-mental state examination, MMSE22). These findings provide evidence for a robust unsupervised trajectory modeling approach that delivers early prediction on


unlabeled multimodal data, handles missing data, and generalizes from research to real-world patient data, enhancing the clinical utility of our modeling approach and its potential for


translation to clinical practice. METHODS ADNI DATA Data for this study comprised 571 trajectories from the Alzheimer’s Disease Neuroimaging Initiative (ADNI; Table S1) database comprising


2–4 assessments (trajectory length \(= 2\), \(n= 337\); trajectory length \(= 3\), \(n= 194\); trajectory length \(= 4\), \(n= 40\)). Trajectories used for model training comprised


longitudinal measurements of neuroimaging-based biomarkers (grey matter density from medial temporal cortex20 and Florbetapir-based amyloid score23 temporally aligned (within 6 months) with


cognitive test scores (ADNI-Mem for memory24, ADNI-EF for executive functioning25, the Montreal Cognitive Assessment (MoCA)26, and the Alzheimer’s Disease Assessment Scale (ADAS-13))27. To


profile the clusters, we used clinical diagnosis and cognitive decline as measured with longitudinal scores from the mini-mental state examination (MMSE22) that is commonly used in clinical


practice. Clinical diagnoses based on individuals’ final assessments were as follows: cognitively normal (CN: 234), stable MCI (sMCI: 224), progressive MCI (pMCI: 19), and Alzheimer’s


disease (AD: 94). Patients were identified as pMCI if they progressed from MCI to AD in a 3-year period while sMCI patients remained diagnosed as MCI for the same period. MACC DATA We tested


MTM on independent memory clinic cohort data from the Memory Ageing & Cognition Centre at the National University of Singapore (MACC; Table S1). The MACC dataset comprises 158


trajectories (trajectory length = 2, \(n = 21\); trajectory length = 3, \(n = 137\)) containing a single β-amyloid PET measurement and at least one measurement of grey matter (GM) density


score derived from structural MRI scans (1 GM score, \(n=5\); 2 GM scores, \(n=52\); 3 GM scores, \(n=101\)). MoCA and MMSE scores were available for most patients. However, ADNI-Mem,


ADNI-EF, and ADAS-13 that were used for training the MTM on ADNI data were not available in the MACC dataset. Clinical outcomes based on individuals’ final assessments were as follows:


cognitively normal (CN, \(n=36\)), mild MCI (\(n=50\)), moderate MCI (\(n=18\)), and Alzheimer’s disease (AD, \(n= 54\)). MODELING APPROACH _Unsupervised generative model_: as


neurodegeneration progresses heterogeneously, we learn a probabilistic mixture of trajectory models on longitudinal data. Each prototypical mixture component takes the form of a state space


model where biomarkers are states and cognitive test scores are observations. Explicitly, we model the sequences \(z^i_{1:T} = (z_1^i,\dotsc ,z_T^i)\) of biomarkers and \(x^i_{1:T} =


(x_1^i,\dotsc ,x_T^i)\) cognitive test scores for the _i_th participant (\(1\le i \le n_d\)) as an i.i.d. sample from $$\begin{aligned} p(z^i_{1:T}, x^i_{1:T}) = \textstyle \sum _{c=1}^{n_c}


\pi _{c} \delta _{\{c=c^i\}} \cdot p(x^i_{1:T}; z^i_{1:T}|c^i) \end{aligned}$$ (1) where \(c^i \in \{1,\dotsc ,n_c\}\) denotes the assigned cluster, \(\delta _{\{c=c^i\}}\) is 1 when


\(c=c^i\) and 0 otherwise, and \(p(c)=\pi _c\) is a categorical distribution on the \(n_c\) clusters. For each cluster \(1\le c\le n_c\), we specify a linear, Gaussian model


$$\begin{aligned} p(x^i_{1:T}; z^i_{1:T}|c) = \textstyle \eta _d(z_1; m_c, S_c) \prod _{t=2}^T \eta _d(z_t;z_{t-1}A_c,\Gamma _c) \prod _{t=1}^T \eta _\ell (x_t;z_tH_c,\Lambda _c),


\end{aligned}$$ (2) where \(m_c\) and \(S_c\) correspond to the mean and covariance of the first biomarker measurement, \(A_c\) and \(\Gamma _c\) govern the dynamics of the biomarkers, and


\(H_c\) and \(\Lambda _c\) describe the relationship between the biomarkers and cognitive scores. In an abuse of notation, we allow _T_ to depend on _i_; our framework allows trajectories to


have differing lengths. Figure 1a contains our full modeling framework in plate notation. In contrast to the pioneering work of Chiappa and Barber who fit models using variational inference


with probabilistic assignment28, we fit models using expectation–maximization (EM)29 with hard latent indicator variable assignment in the E-step. This affords us the possibility of readily


extending our framework to allow nonlinear relationships between biomarkers and cognitive scores. To avoid local optima, we train models from multiple different random initializations and


select the one with the highest objective on training data. Each mixture component is characterized by the underlying dynamics of the biomarkers and the relationship between biomarkers and


cognitive scores (Fig. 1a). In the proposed framework, the first-order Markovian sequence of biomarker levels drives our process of interest and the cognitive scores at each assessment can


be viewed as read-outs of a patient’s biomarker levels at a given time/disease stage. That is, as the underlying biological dynamics closely captured by changes in biological markers (i.e.


grey matter density, β-amyloid) drive cognitive decline, the current state of cognitive decline can be then non-invasively “read-out” from the system in terms of cognitive scores. In this


context, cognitive scores could be interpreted as causally driven by changes in biological markers. Training the multimodal trajectory model requires access to trajectories that include both


biomarkers and cognitive scores, in contrast to previous work that inferred underlying states30,31. Giving MTM access to states during training leads to a simpler and more straightforward


process for inferring model parameters. Following training, marginalizing over missing variables allows us to stratify new trajectories that have missing biomarkers or incomplete cognitive


scores (Supplementary Information). _Model selection:_ To choose the optimal number of components for the MTM, we calculate the Bayesian information criterion (BIC)32 versus the number of


clusters. _Cross-validation:_ To assess stability of cluster assignment to variations in training data, we performed 10-fold cross validation on the ADNI dataset as follows. For each fold,


we trained the MTM using the data from the 9 other folds (\(n\approx 514\) trajectories) and tested on the remaining data (\(n\approx 57\) trajectories). This gives us 10 models, but only


one model is used to make a prediction for any given datapoint (i.e., the unique model that did not include the datapoint in its training set). The prediction from this model is the unique


cross-validated prediction. Performing cross-validation with unsupervised models entails an additional challenge of harmonizing the labels across the ten separate mixture models learned


during training. To harmonize across folds, we assigned labels to clusters alphabetically (i.e. cluster A–D) with A corresponding to the cluster with the lowest percentage of individuals


with an AD diagnosis in the training set, B to the cluster with the second lowest percentage, C to the cluster with the third lowest percentage, and D to the cluster with the fourth lowest


(i.e. highest) percentage of individuals with an AD diagnosis in the training set. Table S2 shows concordance in the percentage of individuals with an AD diagnosis per cluster across the 10


cross-validation folds. Further, for each trajectory assigned to a given cluster during testing, we determine the number of training runs that assign the trajectory to that cluster. Note


that for 10-fold cross-validation, each trajectory is in a test set once and in a training set 9 times, resulting in values ranging from 0 to 9. Fig. S1 shows that the histograms concentrate


on higher values (ranging from 7 to 9), indicating that most individual trajectories remain in the same cluster across cross-validation runs. _Model-derived index:_ Given a fitted model, we


define the cognitive health index (up to an additive constant) as the logarithm of the likelihood under the cluster A model. This is given explicitly by: $$\begin{aligned} M(z^i_{1:T},


x^i_{1:T}) = \log (p(x^i_{1:T}; z^i_{1:T}|c^i = A)). \end{aligned}$$ We used cross-validation to evaluate the model-derived index, in the same manner as we used to make predictions. This


prevents information leakage, as the index is always calculated on data not used for model training. RESULTS MTM STRATIFIES INDIVIDUALS BASED ON HEALTH TRAJECTORIES Our multimodal trajectory


model (MTM) effectively stratifies individuals to clusters of cognitive health based on the relationship between longitudinal biomarker (β-amyloid, grey matter density) and cognitive data


(ADNI-Mem, ADNI-EF, MoCA, ADAS-13). Informed by both BIC (Bayesian information criterion) and an elbow plot (Fig. S2), we present results for a 4-cluster MTM trained and tested on ADNI data


in an unsupervised manner (i.e. without labelled data). Figure 2A, C shows 4 trajectory clusters based on the relationship of state variables (i.e. biomarkers: grey matter density vs.


β-amyloid) derived from the model following 10-fold cross validation. Figure 2B, D shows 4 trajectory clusters based on the relationship of state (grey matter density) and cognitive


(ADNI-Mem) data. In particular, individuals in different clusters vary in disease progression based on the combination of different markers (Fig. S3, Table S3 for nonlinear dynamics).


Individuals in cluster A show the highest grey matter density, lowest β-amyloid burden and highest cognitive scores, while individuals in cluster D show the highest β-amyloid burden, lowest


grey matter density and lowest cognitive scores with clusters B and C falling between A and D (Fig. 2C, D). That is, individuals in cluster C have similar β-amyloid accumulation but higher


grey matter atrophy compared to individuals in cluster B, suggesting a higher degree of neurodegeneration. Next, we demonstrate that MTM-derived clusters of cognitive health differ in


biomarker dynamics (Fig. 2E) and the relationship between biomarker dynamics and cognitive decline (Fig. 2F) over time. In particular, we used a linear mixed effects model (LME) to test


whether β-amyloid burden predicts change in grey matter density over time. This LME analysis showed a significant relationship between biomarkers (main effect of cluster \(F_{3, 585}=12.0,


p<0.001\), interaction \(F_{4, 625}=11.4, p<0.001\)). The estimated cluster-specific trends (slopes for a linear approximation of grey matter density change versus β-amyloid burden)


decreased monotonically by cluster and differed significantly from zero for all clusters (B: \(t_{592}=-3.62, p<0.001\); C: \(t_{574}=-3.83, p<0.001\); D: \(t_{667}=-4.18,


p<0.001\)) except cluster A (\(t_{681}=0.62, p=0.534\)). Similarly, testing (linear mixed effects model) whether changes in grey matter density predict changes in cognition (ADNI-Mem)


showed a significant main effect of cluster (\(F_{3, 837}=6.1, p<0.001\)) and interaction (\(F_{4, 837}=8.1, p<0.001\)). The estimated cluster-specific trends were significantly


different from zero for clusters C & D (C: \(t_{837}=4.18, p<0.001\), D: \(t_{837}=3.53, p<0.001\)) but not A or B (A: \(t_{837}=-0.07, p=0.94\), B: \(t_{837}=1.59, p=0.11\)). To


enhance MTM interpretability, we profiled MTM-derived clusters using clinical diagnoses (Fig. 3A & S4A, Table S4). 98% of individuals in cluster A and 93% in cluster B have a cognitively


normal (CN) or sMCI diagnosis, whereas 21% of individuals in cluster C and 81% in cluster D have an AD or pMCI diagnosis. Conversely, 78% of individuals diagnosed as cognitively normal are


assigned to clusters A or B, while 89% of individuals diagnosed as AD are assigned to clusters C or D. These clusters differed significantly in their observed frequencies of clinical


outcomes (\(\chi ^2(9)=308.01, p<0.001\)). Employing a 2-cluster model results in less precise stratification than 4 clusters, showing a coarser partition of our sample; that is, 94% of


individuals in the first cluster are diagnosed as CN or sMCI (2% pMCI, 3% AD), while 58% of individuals in the second cluster are diagnosed as AD (8% CN, 27% sMCI, 7% pMCI). We next profiled


clusters by rate of cognitive decline, as indicated by age-adjusted MMSE, a scale typically used in clinical practice (Fig. 3B; note MMSE was not included in model training). A linear mixed


effects model used to predict age-adjusted MMSE showed significant effects for cluster, time, and a significant interaction between cluster and time (main effect of cluster \(F_{3, 874}


=61.8, p<0.001\); main effect of time \(F_{1, 932}=157.1; p<0.001\); interaction: cluster x time \(F_{3, 929}=64.1, p<0.001\)). The estimated trends for clusters A and B were not


significantly different from zero (\(A: t_{922}=0.810, p=0.42; B: t_{923}=-1.74, p=0.082\)), while trends for clusters C and D were significantly lower than zero (\(C: t_{922}=-3.80,


p<0.001; D: t_{938}=-13.2, p<0.001\)). The trend for cluster D was significantly lower than the trends for clusters A, B, C (post-hoc comparisons, A vs. D, \(t_{944}=13.2,


p<0.001\); B vs. D, \(t_{946}=12.2, p<0.001\); C vs. D, \(t_{945}=10.4, p<0.001\)). These results suggest that individuals in clusters A and B may show a slower cognitive decline


consistent with normal aging, in contrast to individuals in cluster D that decline at a rate higher than 1 point of MMSE per year (i.e., 95% confidence interval for trend of D: \([-1.35,


-1.05]\)). MTM-DERIVED COGNITIVE HEALTH INDEX PREDICTS CONVERSION TO AD We next tested whether assignment of individuals to MTM-derived clusters allows us to make predictions about


progression to AD. To quantify individual membership to MTM-derived clusters, we derived a cognitive health index as the logarithm of the likelihood under the cluster A model. We showed that


this MTM-derived index relates to future cognitive decline as indicated by age adjusted rate of MMSE change/year (Pearson’s, A = 0.396, p < 0.001, Fig. 4). Further, using logistic


regression, we showed that this MTM-derived index predicts conversion to AD from baseline data. ROC analysis showed higher AUC for the MTM-derived index (0.878) than a) univariate markers:


MMSE (0.812), β-amyloid (0.804), GM density (0.736), b) combinations of markers: GM density and β-amyloid (AUC = 0.850), GM density and MMSE (AUC = 0.852). We corroborated these results by


training Cox proportional hazard models to predict conversion to AD from baseline data. Concordance as measured with cross-validation was higher for the MTM-derived index (0.836) than


β-amyloid (0.807), GM density (0.703) or cognitive scores (e.g. ADNI-mem: 0.829, MOCA: 0.801). These results suggest that the MTM-derived cognitive health index predicts conversion to AD


more precisely than commonly-used clinical assessments. MTM TRAINING ON TRAJECTORIES VS. BASELINE DATA ALONE We asked whether training MTM on longitudinal data (i.e. trajectories) provides


an advantage over training on baseline data alone. We compared the 4-component MTM model trained on longitudinal data to a 4-cluster Gaussian mixture model trained on baseline data alone


(GMM-baseline) using the same cross-validation approach as for MTM. We found only 52.9% agreement between the cluster labels assigned by GMM-baseline vs. MTM. Further, comparing


distributions of clinical diagnoses across clusters showed that clusters derived by GMM-baseline (\(\chi ^2(9) = 226.4, p< 0.001\)) have a more uniform distribution of clinical diagnoses


than MTM-derived clusters (\(\chi ^2(9) = 308.0, p < 0.001\)). Comparing between models (Table S5) showed significantly higher deviation of the joint distribution of cluster assignment


and clinical diagnosis from the product of the marginals for MTM than GMM-baseline (Sharma-Song test for second-order differentials in contingency tables33, \(\chi ^2 (9) = 48.3, p<


0.001\)). These results suggest that MTM training on longitudinal rather than baseline data alone stratifies individuals more reliably to clusters of cognitive health based on future disease


progression. MTM STRATIFICATION BASED ON A SINGLE ASSESSMENT Collecting longitudinal data often proves challenging and costly in clinical practice, while multiple visits lengthen waiting


time to diagnosis for patients. Here, we test whether the MTM trained on longitudinal data stratifies individuals reliably when tested using a single assessment alone. In particular, we


tested the MTM on the first (i.e., baseline) or final assessment available per individual in an independent dataset. We found that MTM maintains similar stratification (as profiled by


clinical diagnosis) when tested on a single assessment. 81% of individuals remained in the same cluster when using the initial assessment data compared to trajectory data; this increased to


85% for the final assessment (columns III–VI in Table S4). These results strengthen the clinical utility of our modeling approach, suggesting that MTM stratifies individuals reliably based


on single patient assessments (e.g., first baseline assessment) with potential impact in expediting diagnosis and improving clinical management pathways. MTM STRATIFICATION BASED ON


COGNITIVE DATA ALONE We next asked whether MTM stratifies individuals reliably when tested with low-cost non-invasive data (i.e., cognitive observations) in the absence of state variables


comprising biomarkers. We tested the MTM—trained on both biomarker and cognitive data—on held-out test data (during the cross-validation process) comprising cognitive scores alone (i.e.,


ADNI-Mem, ADNI-EF, MoCA, and ADAS-13). We show that the model clusters individuals to discriminable trajectories of cognitive health and partly maintains the stratification we observed when


testing with both biomarker and cognitive data (Table S4: columns II, IV, & VI). That is, 58% of individuals remained in the same MTM-derived cluster when MTM was tested with cognitive


data alone, while 26% individuals were stratified to a cluster with lower cognitive decline compared to MTM stratification based on both biomarker and cognitive data. These results suggest


that MTM handles missing biomarker data and stratifies individuals based on non-invasive data (e.g., cognitive assessments), enhancing the clinical utility of our modeling approach. OUT OF


SAMPLE VALIDATION AND MODEL TRANSFER FROM RESEARCH TO CLINICAL DATA To test the generalizability and validate the clinical utility of our approach, we tested the trained MTM (i.e. training


on ADNI data), on independent memory clinic cohort data from the Memory Ageing & Cognition Centre at the National University of Singapore (MACC). This is a challenging task that requires


transfer from a research to a clinical dataset with missing data; that is, trajectories (\(n=158\)) in the MACC dataset comprise single β-amyloid PET measurements, some longitudinal


structural MRI scans, and limited cognitive data (MoCA and MMSE for most participants but no ADNI-Mem, ADNI-EF, and ADAS-13 data that were used for training the MTM). MTM stratifies patients


to clinically meaningful clusters when (I) using trajectory data, (II) cognitive scores alone (i.e., MoCA), or (III) data from the final assessment alone (Table S6). A 3-cluster MTM (as


determined by BIC; Fig. S2) showed that 94% of individuals assigned to cluster A have CN or MCI diagnosis, whereas 47% of individuals assigned to cluster C have AD diagnosis (Fig. 5A).


Conversely, 74.4% of individuals diagnosed as cognitively normal (CN) are assigned in clusters A or B and 73.6% of individuals diagnosed as AD are assigned to cluster C (Fig. S4B). These


MTM-derived clusters differ significantly in their observed frequencies of clinical outcomes (\(\chi ^2(6) = 39.5, p <0.001\)). Profiling the clusters for cognitive decline showed


significant differences in rate of cognitive decline (i.e. rate of MMSE change/year) between clusters A and C (Fig. 5B; linear mixed effects model of MMSE over time including age). A mixed


effects model for age-adjusted MMSE showed a significant effect for cluster (\(F_{2, 195.11}=10.8, p<0.001\)) and a significant cluster x time interaction (\(F_{3, 294.97}=9.23,


p<0.001\)). We found that the estimated trends for clusters A and B were not significantly different from zero (A: \(t_{295}=0.527, p=0.60\); B: \(t_{296}=-0.433, p=0.66\)), while for


cluster C the trend was significantly below zero (\(t_{294}=-5.22, p <0.001\)). The trend for cluster C was significantly higher than for cluster A (\(t_{295}=2.93, p=0.010\)), suggesting


that individuals in cluster C are predicted to have higher rate of cognitive decline. Finally, stratification in discrete clusters was maintained when testing the MTM using limited data


(i.e. first assessment or cognitive data alone). That is, 79% of individuals remained in the same cluster when MTM was tested with data from the first assessment. When MTM was tested with


limited cognitive data (MoCA alone) without biological (i.e. GM density) data, 48% of individuals retained their cluster assignment. 44% of individuals were assigned to a healthier cluster,


suggesting that when biomarker data is missing, the model assigns more individuals to cluster A, corresponding to a healthier trajectory. This is consistent with MTM stratification on ADNI


data, suggesting that biomarker data may provide additional information for disease progression. DISCUSSION We develop and validate an unsupervised multimodal trajectory modeling approach


(MTM) that stratifies individuals early and precisely based on their brain and cognitive health trajectories. MTM learns class-specific relationships over time between biomarkers-that are


acquired through costly and invasive measurements (e.g., PET scans)-and cognitive observations that are acquired through less costly and non-invasive testing. To tackle the challenge of


early dementia prediction with unlabeled data (i.e. at early or pre-symptomatic disease stages before cognitively normal individuals have a clinical diagnosis), we adopt an unsupervised


training approach. In contrast to most machine learning models for dementia prediction that adopt supervised learning to classify patients based on clinical diagnosis, our unsupervised MTM


approach stratifies individuals based on their brain and cognitive health trajectories without using clinical labels for model training. Working on a generatively specified framework allows


us to obtain interpretable models of prototypical biomarker and cognitive progression, handle trajectories of variable lengths, short observation sequences (i.e. single assessments), and


missing data (i.e. cognitive data alone). Our modeling approach is inspired by recent work on unsupervised trajectory clustering for AD prediction including mixtures of generalized mixed


effects models34, hierarchical mixture models of longitudinal Siamese neural networks35, and mixtures of Gaussian processes inductively biased towards monotonic decline36. Ramamoorthy36


found that the majority of the largest AD-related trajectory components were well-approximated as linear, providing justification for our choice of linear state model. However, cluster


interpretability in some of these previous approaches34,35 may be limited by small sample sizes (\(n < 100\)). Further, given the paucity of available longitudinal data in research and


clinical practice, it may be difficult to learn the underlying dynamics with modeling frameworks that may allow more degrees of freedom (i.e. kernel- or neural network-based approaches).


Specifically, MTM aims to map the variability structure in health trajectories, using state space models to model trajectories. We explore whether the trajectory variability in the data is


captured through a limited number of “prototypical” state space models. We perform probabilistic clustering of trajectories using probabilistic mixture modeling where the individual mixture


components correspond to those prototypical state space models. This is inherently an unsupervised learning process. However, as health trajectories contain information about the dynamics of


disease progression, it is expected that clusters of trajectories governed by the corresponding group-level state space models relate to different disease progression stages. In comparison


to previous trajectory modeling approaches (SuStaIn;37,38,39) that focus on stratifying disease subtypes based on cross-sectional data and require data from individuals who have progressed


to later disease stages, MTM focuses on health trajectories and disease progression, targeting stratification of healthy or at-risk individuals. Further, MTM takes into account rate of


decline (i.e. time between assessments) rather than simply event order37, allowing us to robustly stratify individuals at early stages based on disease progression compared to modeling


approaches that require data from individuals at advanced disease stages to differentiate between dementia subtypes. Thus, MTM allows us to make inferences about an individual’s future


cognitive health given their probabilistic assignment to a cluster based on their multimodal health trajectory. We demonstrate that model training on trajectories offers an advantage over


training on baseline data alone; that is, MTM stratifies individuals more reliably into clusters when trained on longitudinal data. To formalize this, we derive an individualized index of


cluster assignment and show that this index predicts conversion to AD more precisely than standard clinical data (grey matter atrophy, β-amyloid, cognitive scores). Our findings provide the


following main advances with strong potential for clinical translation. First, MTM reliably stratifies individuals to clinically meaningful clusters and allows us to derive an individualized


index of cognitive health that predicts conversion to AD more reliably than standard clinical assessments (i.e. cognitive tests or MRI scans alone) with strong potential for reducing risk


of misdiagnosis. Further, investigating individual variability in cognitive health and resilience remains a key challenge for understanding the underlying mechanisms of neurodegenerative


disorders40,41. That is, why do some individuals show increased β-amyloid burden without experiencing grey matter loss42,43 or clinical cognitive symptoms44,45,46, or others experience grey


matter loss without exhibiting memory symptoms47,48,49? MTM allows us to determine how individuals in different clusters vary in disease progression based on the combination of different


markers. For example, for individuals in cluster A, β-amyloid burden does not significantly impact grey matter degeneration or memory decline. Individuals in cluster C have similar β-amyloid


accumulation but higher grey matter atrophy compared to individuals in cluster B, suggesting that individuals in cluster C may have progressed to later neurodegeneration stages than


individuals in cluster B. Thus, MTM offers a framework for precise individual patient stratification and individualized predictions of cognitive health early (i.e. at early or


pre-symptomatic disease stages) with strong translational potential for clinical management and personalized interventions for improved patient outcomes. Second, MTM harnesses the power of


longitudinal multimodal data during training (i.e., learning relationships between biomarker and cognitive data over time) and generalizes to independent test datasets with missing data. In


particular, MTM reliably stratifies individuals when tested with data from single assessments or cognitive trajectories alone. This has the potential to expedite clinical diagnosis from


first assessment, reducing time to diagnosis-that currently varies between 6 and 18 months-and enhancing clinical management efficiency. Further, unsupervised modeling of non-invasive and


low-cost data (e.g., cognitive tests) has strong potential to support early dementia prediction at scale from non-invasive testing, reducing patient burden, costs and health inequalities


(e.g., due to limited access to MRI/PET scanners). Third, MTM generalizes from research to clinical cohort data; that is, we validate our model by testing not only on research (ADNI) but


also an independent clinical cohorts (MACC) despite missing data. Research cohort data may be subject to biases (e.g., selective volunteer demographics) while patient data tends to be more


diverse and representative of the population at large. Validating MTM with data from patients across different settings (i.e., different MRI scanners, cognitive screening tools) and


countries (USA, Singapore) provides evidence for interoperability with strong potential for reducing bias and enhancing the clinical utility of our approach. Finally, our approach has strong


potential to generalize to other non-invasive and cost- effective data (e.g., digital markers from wearable technologies) and dementia subtypes (extending beyond AD), enhancing the


translational impact of MTM for early and precise stratification at pre-symptomatic stages. In the future, MTM-as an unsupervised AI-guided tool-could be optimized to make early dementia


predictions from digital data and implemented in brain health checks (e.g. digital assessments at home) to select individuals at-risk for more extensive follow-up, making early dementia


prediction scalable and cost-effective with potential impact for prevention and population health. DATA AVAILABILITY Data and code used for the figures in this manuscript are publicly


available at the University of Cambridge data repository: https://doi.org/10.17863/CAM.102058 REFERENCES * Jack, C. R. _et al._ A/T/N: An unbiased descriptive classification scheme for


Alzheimer disease biomarkers. _Neurology_ 87(5), 539–547 (2016). Article  CAS  PubMed  PubMed Central  Google Scholar  * Jack, C. R. _et al._ NIA-AA research framework: Toward a biological


definition of Alzheimer’s disease. _Alzheimers Dement._ 14(4), 535–562 (2018). Article  PubMed  Google Scholar  * Mintun, M. A. _et al._ Donanemab in early Alzheimer’s disease. _N. Engl. J.


Med._ 384(18), 1691–1704 (2021). Article  CAS  PubMed  Google Scholar  * van Dyck, C. H. _et al._ Lecanemab in early Alzheimer’s disease. _N. Engl. J. Med._ 388(1), 9–21 (2022). Article 


PubMed  Google Scholar  * Abbott, A. Could drugs prevent Alzheimer’s? These trials aim to find out. _Nature_ 603, 216–219 (2022). Article  ADS  CAS  PubMed  Google Scholar  * Reardon, S.


Alzheimer’s drug donanemab helps most when taken at earliest disease stage, study finds. _Nature_ 619, 682–683 (2023). Article  ADS  CAS  PubMed  Google Scholar  * Jack, C. R. _et al._


Tracking pathophysiological processes in Alzheimer’s disease: An updated hypothetical model of dynamic biomarkers. _Lancet Neurol._ 12(2), 207–216 (2013). Article  CAS  PubMed  PubMed


Central  Google Scholar  * Jack, C. R. _et al._ Hypothetical model of dynamic biomarkers of the Alzheimer’s pathological cascade. _Lancet Neurol._ 9(1), 119–128 (2010). Article  CAS  PubMed


  PubMed Central  Google Scholar  * Dolgin, E. A tough spot. _Nature_ 559, S10–S12 (2018). Article  ADS  CAS  PubMed  Google Scholar  * Borchert, R., Azevedo, T., Badhwar, A., Bernal, J.,


Betts, M., Bruffaerts, R., _et al_. Artificial intelligence for diagnostic and prognostic neuroimaging in dementia: A systematic review. _Alzheimers Dement._ 19(12), 5885–5904 (2023).


Article  PubMed  Google Scholar  * Myszczynska, M. A. _et al._ Applications of machine learning to diagnosis and treatment of neurodegenerative diseases. _Nat. Rev. Neurol._ 16(8), 440–456


(2020). Article  PubMed  Google Scholar  * Pellegrini, E. _et al._ Machine learning of neuroimaging for assisted diagnosis of cognitive impairment and dementia: A systematic review.


_Alzheimers Dement. Diagn. Assess. Dis. Monit._ 10(1), 519–535 (2018). Google Scholar  * Beach, T., Monsell, S., Phillips, L. & Kukull, W. Accuracy of the clinical diagnosis of Alzheimer


disease at national institute on aging Alzheimer disease centers. _J. Neuropathol. Exp. Neurol._ 71(4), 266–273 (2012). Article  PubMed  Google Scholar  * Ranginwala, N. A., Hynan, L. S.,


Weiner, M. F. & White, C. L. Clinical criteria for the diagnosis of Alzheimer disease: Still good after all these years. _Am. J. Geriatr. Psychiatry_ 16(5), 384–388 (2008). Article 


PubMed  Google Scholar  * Edmonds, E., Delano-Wood, L., Galasko, D., Salmon, D. & Bondi, M. Subjective cognitive complaints contribute to misdiagnosis of mild cognitive impairment. _J.


Int. Neuropsychol. Soc._ 20(8), 836–847 (2014). Article  PubMed  PubMed Central  Google Scholar  * Edmonds, E. C. _et al._ Early versus late MCI: Improved MCI staging using a


neuropsychological approach. _Alzheimers Dement._ 15(5), 699–708 (2019). Article  PubMed  Google Scholar  * Vapnik, V. & Izmailov, R. Learning using privileged information: Similarity


control and knowledge transfer. _J. Mach. Learn. Res._ 16(61), 2023–2049 (2015). MathSciNet  Google Scholar  * Belleville, S., Fouquet, C., Hudon, C., Zomahoun, H. T. V. & Croteau, J.


Neuropsychological measures that predict progression from mild cognitive impairment to Alzheimer’s type dementia in older adults: A systematic review and meta-analysis. _Neuropsychol. Rev._


27, 328–353 (2017). Article  PubMed  PubMed Central  Google Scholar  * Bilgel, M. _et al._ Effects of amyloid pathology and neurodegeneration on cognitive change in cognitively normal


adults. _Brain_ 141(8), 2475–2485 (2018). Article  PubMed  PubMed Central  Google Scholar  * Giorgio, J., Landau, S. M., Jagust, W. J., Tino, P. & Kourtzi, Z. Modelling prognostic


trajectories of cognitive decline due to Alzheimer’s disease. _NeuroImage Clin._ 26, 102199 (2020). Article  PubMed  PubMed Central  Google Scholar  * Weiner, M. W. _et al._ The Alzheimer’s


disease neuroimaging initiative 3: Continued innovation for clinical trial improvement. _Alzheimers Dement._ 13(5), 561–571 (2017). Article  PubMed  Google Scholar  * Folstein, M., Folstein,


S. & McHugh, P. Mini-mental state: A practical method for grading the cognitive state of patients for the clinician. _J. Psychiatr. Res._ 12(3), 189–198 (1975). Article  CAS  PubMed 


Google Scholar  * Landau, S. M. _et al._ Measurement of longitudinal β-amyloid change with 18F-florbetapir pet and standardized uptake value ratios. _J. Nucl. Med._ 56(4), 567–574 (2015).


Article  CAS  PubMed  Google Scholar  * Crane, P. K. _et al._ Development and assessment of a composite score for memory in the Alzheimer’s disease neuroimaging initiative (ADNI). _Brain


Imaging Behav._ 6(4), 502–516 (2012). Article  PubMed  PubMed Central  Google Scholar  * Gibbons, L. E. _et al._ A composite score for executive functioning, validated in Alzheimer’s disease


neuroimaging initiative (ADNI) participants with baseline mild cognitive impairment. _Brain Imaging Behav._ 6(4), 517–527 (2012). Article  PubMed  PubMed Central  Google Scholar  *


Nasreddine, Z. S. _et al._ The Montreal cognitive assessment, MoCA: A brief screening tool for mild cognitive impairment. _J. Am. Geriatr. Soc._ 53(4), 695–699 (2005). Article  PubMed 


Google Scholar  * Mohs, R. C. _et al._ Development of cognitive instruments for use in clinical trials of antidementia drugs: Additions to the Alzheimer’s disease assessment scale that


broaden its scope. _Alzheimer Dis. Assoc. Disord._ 11, 13–21 (1997). Article  Google Scholar  * Chiappa, S., & Barber, D. Dirichlet mixtures of Bayesian linear Gaussian state-space


models: A variational approach. Max Planck Institute for Biological Cybernetics, Tech. Rep. 161 (2007). * Dempster, A., Laird, N. & Rubin, D. B. Maximum likelihood from incomplete data


via the EM algorithm. _J. R. Stat. Soc. Ser. B Stat. Methodol._ 39(1), 1–38 (1977). Article  MathSciNet  Google Scholar  * Chiappa, S. & Barber, D. Bayesian factorial linear Gaussian


state-space models for biosignal decomposition. _IEEE Signal Process. Lett._ 14(4), 267–270 (2007). Article  ADS  Google Scholar  * Umatani, R., Imai, T., Kawamoto, K. & Kunimasa, S.


Time series clustering with an EM algorithm for mixtures of linear gaussian state space models. _Pattern Recognit._ 138, 109375 (2023). Article  Google Scholar  * Wit, E., van den Heuvel, E.


& Romeijn, J.-W. “All models are wrong...’’: An introduction to model uncertainty. _Stat. Neerl._ 66(3), 217–236 (2012). Article  MathSciNet  Google Scholar  * Sharma, R., Kumar, S.


& Song, M. Fundamental gene network rewiring at the second order within and across mammalian systems. _Bioinformatics_ 37(19), 3293–3301 (2021). Article  CAS  PubMed  Google Scholar  *


Poulakis, K. _et al._ Fully Bayesian longitudinal unsupervised learning for the assessment and visualization of AD heterogeneity and progression. _Aging_ 12(13), 12622–12647 (2020). Article


  CAS  PubMed  PubMed Central  Google Scholar  * Bhagwat, N., Viviano, J. D., Voineskos, A. N. & Chakravarty, M. M. Modeling and prediction of clinical symptom trajectories in


Alzheimer’s disease using longitudinal data. _PLoS Comput. Biol._ 14(9), e1006376 (2018). Article  ADS  PubMed  PubMed Central  Google Scholar  * Ramamoorthy, D. _et al._ Identifying


patterns in amyotrophic lateral sclerosis progression from sparse longitudinal data. _Nat. Comput. Sci._ 2(9), 605–616 (2022). Article  PubMed  PubMed Central  Google Scholar  * Young, A. L.


_et al._ Uncovering the heterogeneity and temporal complexity of neurodegenerative diseases with subtype and stage inference. _Nat. Commun._ 9(1), 4273 (2018). Article  ADS  PubMed  PubMed


Central  Google Scholar  * Archetti, D. _et al._ Inter-cohort validation of sustain model for Alzheimer’s disease. _Front. Big Data_ (2021). * Vogel, J. W. _et al._ Four distinct


trajectories of tau deposition identified in Alzheimer’s disease. _Nat. Med._ 27(5), 871–881 (2021). Article  CAS  PubMed  PubMed Central  Google Scholar  * Bowles, E. J. A. _et al._


Cognitive resilience to Alzheimer’s disease pathology in the human brain. _J. Alzheimers Dis._ 68(3), 1071–1083 (2019). Article  PubMed Central  Google Scholar  * Gómez-Isla, T. &


Frosch, M. P. Lesions without symptoms: Understanding resilience to Alzheimer disease neuropathological changes. _Nat. Rev. Neurol._ 18(6), 323–332 (2022). Article  PubMed  PubMed Central 


Google Scholar  * Dickson, D. W. _et al._ Identification of normal and pathological aging in prospectively studied nondemented elderly humans. _Neurobiol. Aging_ 13(1), 179–189 (1992).


Article  CAS  PubMed  Google Scholar  * Jagust, W. Is amyloid-β harmful to the brain? Insights from human imaging studies. _Brain_ 139(1), 23–30 (2015). Article  PubMed  PubMed Central 


Google Scholar  * Katzman, R. _et al._ Clinical, pathological, and neurochemical changes in dementia: A subgroup with preserved mental status and numerous neocortical plaques. _Ann. Neurol._


23(2), 138–144 (1988). Article  CAS  PubMed  Google Scholar  * Crystal, H. _et al._ Nondemented subjects with pathologically confirmed Alzheimer’s disease. _J. Neurol._ 38(11), 1682–1682


(1988). CAS  Google Scholar  * Aizenstein, H. J. _et al._ Frequent amyloid deposition without significant cognitive impairment among the elderly. _Arch. Neurol._ 65(11), 1509–1517 (2008).


Article  PubMed  PubMed Central  Google Scholar  * Iacono, D. _et al._ Neuronal hypertrophy in asymptomatic Alzheimer disease. _J. Neuropathol. Exp. Neurol._ 67(6), 578–589 (2008). Article 


PubMed  Google Scholar  * O’Brien, R. J. _et al._ Neuropathologic studies of the Baltimore longitudinal study of aging (BLSA). _J. Alzheimers Dis._ 18(3), 665–675 (2009). Article  PubMed 


PubMed Central  Google Scholar  * Driscoll, I. & Troncoso, J. Asymptomatic Alzheimers disease: A prodrome or a state of resilience?. _Curr. Alzheimer Res._ 8(4), 330–335 (2011). Article


  CAS  PubMed  PubMed Central  Google Scholar  Download references ACKNOWLEDGEMENTS We would like to thank William Jagust, Joseph Giorgio, and the Alzheimer’s Disease Neuroimaging Initiative


for access to the ADNI dataset and the Memory, Aging, and Cognition Center (MACC) at the National University of Singapore for access to the memory clinic research cohort. We would like to


thank Jenny Venton, Spencer Thomas, Nadia Smith, Tameem Adel, Padmini Krishnadas, Arron Lacey, Richard Everson, and Carola Schönlieb for helpful discussions. We thank Timothy Rittman and


Robin Borchert for clinical input, and Anthony Fernandes and Avraam Papadopoulos for computing support. Data collection and sharing for this project was funded by ADNI (National Institutes


of Health Grant U01 AG024904) and DOD ADNI (Department of Defence award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical


Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.;


Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company


Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson, Pharmaceutical Research & Development


LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging;


Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector


contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and


Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro


Imaging at the University of Southern California. Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database


(adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of


this report. A complete listing of ADNI investigators can be found at: https://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf FUNDING This work was supported


by grants to Z.K. from the Alzheimer’s Research UK (Early Detection of Neurodegenerative diseases (EDoN) research initiative), the Alzheimer’s Drug Discovery Foundation Diagnostics


Accelerator, EPSRC & Alan Turing Institute (TU/B/000095), Wellcome Trust (205067/Z/16/Z, 221633/Z/20/Z), Royal Society (INF\(\backslash\)R2\(\backslash\)202107). For the purpose of open


access, the authors have applied for a CC BY public copyright license to any Author Accepted Manuscript version arising from this submission. AUTHOR INFORMATION AUTHORS AND AFFILIATIONS *


Department of Psychology, University of Cambridge, Cambridge, CB2 3EB, UK Michael C. Burkhart, Liz Y. Lee, Delshad Vaghari & Zoe Kourtzi * Department of Pharmacology, Memory, Aging, and


Cognition Center, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore An Qi Toh, Eddie Chong & Christopher Chen * School of Computer Science,


University of Birmingham, Birmingham, B15 2TT, UK Peter Tiňo Authors * Michael C. Burkhart View author publications You can also search for this author inPubMed Google Scholar * Liz Y. Lee


View author publications You can also search for this author inPubMed Google Scholar * Delshad Vaghari View author publications You can also search for this author inPubMed Google Scholar *


An Qi Toh View author publications You can also search for this author inPubMed Google Scholar * Eddie Chong View author publications You can also search for this author inPubMed Google


Scholar * Christopher Chen View author publications You can also search for this author inPubMed Google Scholar * Peter Tiňo View author publications You can also search for this author


inPubMed Google Scholar * Zoe Kourtzi View author publications You can also search for this author inPubMed Google Scholar CONTRIBUTIONS M.B.: designed the project, developed methods and


analysed data, wrote the manuscript. L.L.: developed methods and analysed data, wrote the manuscript. D.V.: developed methods and analysed data, wrote the manuscript T.Q.: curated data.


E.C.: curated data. C.C.: curated data, wrote the manuscript. P.T.: designed the project, developed methods, wrote the manuscript. Z.K.: designed and co-ordinated the project, wrote the


manuscript All authors reviewed the manuscript. CORRESPONDING AUTHOR Correspondence to Zoe Kourtzi. ETHICS DECLARATIONS COMPETING INTERESTS The authors declare no competing interests.


ADDITIONAL INFORMATION PUBLISHER'S NOTE Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. SUPPLEMENTARY INFORMATION


SUPPLEMENTARY INFORMATION. RIGHTS AND PERMISSIONS OPEN ACCESS This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation,


distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and


indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to


the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will


need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. Reprints and permissions ABOUT THIS ARTICLE


CITE THIS ARTICLE Burkhart, M.C., Lee, L.Y., Vaghari, D. _et al._ Unsupervised multimodal modeling of cognitive and brain health trajectories for early dementia prediction. _Sci Rep_ 14,


10755 (2024). https://doi.org/10.1038/s41598-024-60914-w Download citation * Received: 13 December 2023 * Accepted: 29 April 2024 * Published: 10 May 2024 * DOI:


https://doi.org/10.1038/s41598-024-60914-w SHARE THIS ARTICLE Anyone you share the following link with will be able to read this content: Get shareable link Sorry, a shareable link is not


currently available for this article. Copy to clipboard Provided by the Springer Nature SharedIt content-sharing initiative