Predicting rehospitalization within 2 years of initial patient admission for a major depressive episode: a multimodal machine learning approach

Play all audios:

ABSTRACT Machine learning methods show promise to translate univariate biomarker findings into clinically useful multivariate decision support systems. At current, works in major depressive

disorder have predominantly focused on neuroimaging and clinical predictor modalities, with genetic, blood-biomarker, and cardiovascular modalities lacking. In addition, the prediction of

rehospitalization after an initial inpatient major depressive episode is yet to be explored, despite its clinical importance. To address this gap in the literature, we have used baseline

clinical, structural imaging, blood-biomarker, genetic (polygenic risk scores), bioelectrical impedance and electrocardiography predictors to predict rehospitalization within 2 years of an

initial inpatient episode of major depression. Three hundred and eighty patients from the ongoing 12-year Bidirect study were included in the analysis (rehospitalized: yes = 102, no = 278).

Inclusion criteria was age ≥35 and <66 years, a current or recent hospitalisation for a major depressive episode and complete structural imaging and genetic data. Optimal performance was

achieved with a multimodal panel containing structural imaging, blood-biomarker, clinical, medication type, and sleep quality predictors, attaining a test AUC of 67.74 (_p_ = 9.99−05). This

multimodal solution outperformed models based on clinical variables alone, combined biomarkers, and individual data modality prognostication for rehospitalization prediction. This finding

points to the potential of predictive models that combine multimodal clinical and biomarker data in the development of clinical decision support systems. SIMILAR CONTENT BEING VIEWED BY

OTHERS MAGNETIC RESONANCE IMAGING FOR INDIVIDUAL PREDICTION OF TREATMENT RESPONSE IN MAJOR DEPRESSIVE DISORDER: A SYSTEMATIC REVIEW AND META-ANALYSIS Article Open access 15 March 2021

INDIVIDUALIZED PREDICTION OF PSYCHIATRIC READMISSIONS FOR PATIENTS WITH MAJOR DEPRESSIVE DISORDER: A 10-YEAR RETROSPECTIVE COHORT STUDY Article Open access 23 April 2022 PROBING THE CLINICAL

AND BRAIN STRUCTURAL BOUNDARIES OF BIPOLAR AND MAJOR DEPRESSIVE DISORDER Article Open access 14 January 2021 INTRODUCTION Relapse rates in specialized mental healthcare settings are high in

patients with major depressive disorder (MDD) (60% after 5 years, 67% after 10, and 85% after 15)1. To better predict relapse, previous research has predominantly focused on the presence

and magnitude of clinical symptoms, including residual depressive symptoms2, illness severity3, number of prior episodes4, age of onset4, and comorbid personality disorders5. Other studies

have explored group level associations between biomarkers and relapse, observing smaller hippocampal volumes6, higher levels of post-treatment glucocorticoids7, high cortisol response on the

combined dexamethasone-CRH test8, as well as catecholamine and tryptophan depletion9. Such findings have been beneficial in constructing modality specific aetiological hypotheses as well as

sign posts for relapse in clinical practice. However, the elucidation of clinically meaningful predictors of relapse is contingent on the construct validity of the prediction outcome and

the size of the analysed sample. Some of these studies identifying biomarkers as predictors of relapse have used self-reported relapse into a new depressive episode and samples of less than

50 patients7,10. Whether patients have truly relapsed into a new depressive episode or merely have a treatment refractory illness is unknown. These observations may help to explain the lack

of empirically validated relapse prediction tools and subsequent intervention strategies for relapse prevention10. To overcome this blind spot in patient care, studies of increased size that

focus on a well-defined outcome such as rehospitalization are needed to identify clinically robust predictors of this illness trajectory. To facilitate this, a longitudinal, multimodal

sample of clinically diagnosed inpatients is required. The BiDirect depression cohort provides such a sample11. In addition, multimodal, multivariate modelling techniques that prognosticate

individually, rather than at the group level are needed given the heterogeneous nature of MDD and its illness trajectories12,13. Recent work has shown that machine learning (ML) models are

well suited to problems of this nature, demonstrating their efficacy in drug response and functional outcome prediction in MDD14,15, while other studies using neuroimaging modalities have

shown similar success for ML-based MDD diagnostics16. Interestingly, no studies have applied ML models to relapse prediction; thus, the utility of differing data modalities for relapse

prediction remains poorly understood. Regarding illness trajectory modelling, one previous work from Schmaal et al.17 combined neuroimaging and clinical data to predict MDD remission

trajectories with moderate success (accuracy = 69–73%). Further, Koutsouleris et al.15 used both neuroimaging and clinical data to predict functional outcomes in recent onset MDD (balanced

accuracy = 70.3%). However, studies employing a combination of neuroimaging markers with biomarkers from different modalities as well as clinical data are lacking up until now. Given the

apparent gap in multimodal studies of outcome prediction in MDD, we have combined a range of clinical and biomarker predictors that have shown significant associations to MDD in previous

works but are yet to be used for ML-based rehospitalization trajectory modelling. Modalities used included clinical18, blood biomarker19,20,21, structural imaging16, electrocardiography22,

genetic23, cognitive24, nutritional25, sleep26, and exercise27. Using these modalities, we predicted rehospitalization in a cohort of patients within 2 years of initial hospitalization for

an acute episode of MDD. MATERIALS AND METHODS DATASET DESCRIPTION The BiDirect study is an ongoing study of (a) patients, hospitalized for an acute episode of major depression at time of

recruitment, (b) population controls randomly drawn from the register of the city of Münster11, and (c) patients 3 months after an acute coronary event or myocardial infarction. Examination

of all participants included a computer-assisted face-to-face interview of socio-demographic characteristics and medical history as well as an extensive psychiatric assessment (Supplementary

Information 1.1). Only patients in the depression cohort were used in this analysis. At baseline, 999 MDD patients were recruited and 684 completed their 2-year follow-up and provided their

rehospitalization status. Specifically, patients were asked, have you had at least one or more re-admissions to hospital for an acute depressive episode since the initial examination?

Patient response was recorded by study assistants. As our aim was to assess both multi and unimodal prediction models of reported rehospitalization, a requirement for inclusion was complete

imaging and genetic data. Twenty-nine patients had incomplete genetic data while 294 had incomplete imaging data. In addition, 14 participants were excluded due to poor MRI quality, leaving

a final sample of 380 participants (rehospitalized: yes = 102, no = 278). See Table 1 for sociodemographic characteristics. In this final sample, 87.9% (334/380) of patients were taking some

form of antidepressant medication at their baseline assessment, 40.3% (153/380) were taking an antipsychotic, while 92.6% (352/380) were taking some form of psychotropic medication (see

Table 2). PREDICTOR MODALITIES CLINICAL Detailed information regarding socio-demographic and socio-economic status, lifetime medical diagnoses, current medication use, healthcare

utilization, insurance status, lifestyle and risk behaviour (e.g., diet, physical activity, alcohol consumption, smoking status), and perceived health state was collected via a

computer-assisted interview. A combination of individual items as well as total scores were included from the Hamilton Depression Rating Scale (HAM-D), the Hamilton Anxiety Rating Scale

(HAM-A), the Center for Epidemiologic Studies Depression Scale (CES-D), the Inventory of Depressive Symptomatology (IDS), the International Physical Activity questionnaire (IPAQ), and the

Food Frequency Questionnaire (FFQ) (Supplementary Information 1.1 and 1.2). In addition, we included measures from a cognitive functioning module and several self-report measures

(Supplementary Information 1.3). In total, 208 clinical and demographic predictors were included. STRUCTURAL IMAGING We included imaging data derived from structural magnetic imaging

sequences (3D-T1). To reduce the size of the predictor space, we a priori selected 15 regions that have been shown to be significantly associated with MDD in previous ENIGMA

meta-analyses6,28. Selected regions included right and left mean hippocampal volume, cortical thickness of the bilateral medial orbitofrontal cortex (OFC), fusiform gyrus, insula, rostral

anterior and posterior cingulate cortex and unilaterally in the left middle temporal gyrus, right inferior temporal gyrus and the right caudal anterior cingulate cortex (Supplementary

Information 1.4). SERUM AND GENETIC MARKERS Our serum biomarker panel consisted of 10 measures of high sensitive C-reactive protein19, free triiodothyronine, thyroxine, thyroid-stimulating

hormone29,30,31, 17 beta-estradiol32,33, sex hormone-binding globulin, testosterone, the free androgen index34,35,36, total cholesterol37, and high-density lipoprotein cholesterol38. Due to

the correlations between genetic variants shared across psychiatric traits and common comorbidities between psychiatric disorders39,40, we also included seven polygenic risk scores (PGRS)

with a _p_ value threshold of 0.5, for MDD, anxiety, Alzheimer’s, anorexia, autism spectrum disorder, bipolar, and schizophrenia (Supplementary Information 1.5–1.6 and Supplementary

Information 2.1–2.6). In addition, we also assessed PGRS with _p_ value thresholds of 0.05 and 0.01 (Supplementary Information 3.2). CARDIOVASCULAR We determined cardiovascular and general

health status through the assessment of different cardiovascular markers. First, we measured weight (without shoes and heavier clothes), height, and waist circumference. Following, we used

bioelectrical impedance measurements (Body Impedance Analyzer BIA 2000-S, Data Input GmbH) including the determination of body fat and water, extracellular mass, body cell mass, and basic

metabolic rate to assess general markers of body composition. In addition, a measurement of a standard three-channel electrocardiogram (ECG) was performed (Supplementary Information 1.7).

From these assessments we included the following seven predictors: heart rate (beats per minute), body mass index (BMI), extracellular mass to body cell mass (ECM/BCM ratio), basic metabolic

rate, corrected body fat (kg), total body water (kg), and lean body mass (kg). In total, 247 predictors were included in the analysis (see Supplementary Table 1 for all predictors). ML

PIPELINE For our first set of models we entered all predictor modalities into our pipeline for consideration. To ensure the unbiased approximation of the model’s generalizability to new

patients, we trained and tested all models using repeated nested cross-validation in a pipeline to prevent information leaking between patients used for training and validation. In the inner

cross validation loop, we conducted imputation, standardization, feature selection, hyperparameter optimization, and the fitting of a linear support vector machine (SVM). To begin, we

imputed predictors using multivariate imputation of chained equations with the 10 nearest predictors used in the imputation process. No predictors entered into the pipeline had more than 20%

missing data. Following, all variables were scaled to have a mean of zero and a standard deviation of one. Next, we used the elastic net, a form of penalized logistic regression to select a

final subset of variables for prediction. This approach shrinks the coefficients of highly correlated predictors towards each other while removing irrelevant predictors from the model41,42.

This process was completed simultaneously with an exhaustive grid search to tune model hyperparameters. For the elastic net, the parameters alpha (the amount of penalization) and the l1

ratio (mixing parameter between the l1 and l2 norms) were tuned for predictor selection. For the l1 ratio parameter we searched the values _λ_∈{0.1, 0.5, 0.7, 0.9, 0.95, 0.99}, with values

closer to 1 representing the l1 norm. For alpha, we searched _a_∈{0.1, 0.2…,1.0}. For the SVM, we tuned four values of the regularization parameter _C_∈{0.001, 0.01, 0.1, 1.0}. To

accommodate for the class imbalance in the outcome, each _C_ value was weighted by the inverse percentage proportion of each class label (rehospitalized: yes = 102, no = 278). This approach

increases the penalty for misclassifying the minority class (rehospitalized) given its relative scarcity. Finally, we used Platt scaling to calibrate the probability estimates for the SVMs

binary predictions43,44. The set of predictors and hyperparameters that maximized area under the curve on the receiver operator characteristic (AUC) were selected in the pipeline. See

Supplementary Information 1.8–1.8.7 for further details on the pipeline and Supplementary Information 1.8.5 for the selected hyperparameter values for the elastic net and SVM classifier. All

steps were completed in an inner cross-validation loop with five repeats of 10-fold cross-validation. This method divides the sample into 10 separate subsets, uses nine for training, and

then makes predictions on the final set. To avoid favourable splits in the data, this process is repeated five times, initializing splits uniquely for each repeat. For the testing of our

final models, we used 10-fold cross validation in the outer cross validation loop, averaging model performance metrics across test folds. To assess the statistical significance of our final

best performing model, we used a permutation test _(m_ _=_ 10,000) (see Fig. 1). To assess the predictive capacity of each modality, we trained, clinical, sMRI, cardiovascular, and PGRS only

models using the same pipeline as above, as well as a biomarker only model that aggregated together all biomarker modalities. To assess for a significant omnibus effect between models, we

used a Kruskal–Wallis _H_-test. Following, we used Mann–Whitney rank tests to assess for post hoc differences between our multi and unimodal models. All _p_ values were FDR corrected using

the Benjamini and Hochberg method. All ML models were developed using Scikit-learn in Python 3.6.5. All code for analysis is available on request. OPEN SOURCING OF MODEL To provide

transparency, the use of our model to other research groups, and encourage further external validation of our multimodal model, we uploaded our trained model to the Photon AI online model

repository (https://www.photon-ai.com/repo). This repository allows our multimodal model to be downloaded and tested by other research groups. ANALYSES OF MULTIMODAL PREDICTORS To analyse

the direction, magnitude, and significance of the models selected predictors after controlling for covariates, we used a non-penalized implementation of logistic regression in Python’s

statsmodels package. In addition, we included the average SVM weight values from the outer 10-fold cross-validation loop from our ML pipeline (Supplementary Information 3.1 and Supplementary

Table 2). Further, we conducted ordinary least-squares regression sub-analyses of variables including age, gender, BMI, overall psychotropic medication load, and individual psychotropic

medication use on the biomarkers that were selected in the final multimodal model. This was done to elucidate an understanding of variables that may be associated with these biomarkers and

help explain their inclusion in the multimodal model. In addition, we conducted an exploratory analysis of patient characteristics for those taking a specific type of medication selected in

the multimodal model. This was done to determine: (a) whether this medication proxied for the treatment or prevention of a specific comorbid illness, (b) was taken as a polytheraputic

treatment strategy for MDD, or (c) was acting as a proxy for the presence of unmedicated patients that were relapsing at higher rates than the rest of the sample. For these additional

analyses and results, see Supplementary Information 3.3–3.5. RESULTS MULTIMODAL MODEL Within 2 years of hospitalization for an acute episode of MDD 102 patients (26.8%) were rehospitalized

for a depressive episode, while 278 (73.2%) were not. Our best performing model was our multimodal model (test AUC = 67.74). For our multimodal solution, 10 predictors were selected for

optimal predictions by the elastic net. Those with a positive association with rehospitalization included the number of previous inpatient depressive episodes, individual CES-D items 5 (Last

week I had trouble concentrating) and 3 (In the last week I could not get rid of my mood, although my friends/family tried to cheer me up), Pittsburgh Sleep Quality Index (PSQI) item 7

(During the past month, how often have you had trouble staying awake while driving, eating meals, or engaging in social activity?), the PSQI sleep quality index (global score), taking

diazepines, oxazepines, thiazepines and oxepines, and right hippocampal volume. Predictors with a negative association included cholesterol (mmol/l), taking thyroid medications: yes/no, and

how often do you drink alcohol? (At most once a week/more than once a week). See Supplementary Figs. 1–3. Our final multimodal model provided an increase in prognostic certainty of 23.21%

(prognostic summary index = (PPV+NPV)–100). Further, a patient classified as being re-hospitalized by our model was 83% more likely to be rehospitalized than a patient who was not

(\({\mathrm{positive}}\;{\mathrm{likelihood}}\;{\mathrm{ratio}} = \frac{{\mathrm {sensitivity}}}{{1 - \mathrm {specificity}}}\)). Finally, our multimodal classifier was statistically

significant after permutation testing (_p_ _=_ 9.99−e05) (Fig. 1, Table 3). For individual modality models, see Supplementary Information 3.2. DISCUSSION The current study is the first of

its kind to investigate the role of multiple predictor modalities for MDD rehospitalization trajectory modelling. Overall, our multimodal model provided a 23.21% increase in prognostic

certainty for patient rehospitalization classification while providing an isolated subset of multimodal predictors for analysis. Furthermore, our multimodal model was statistically

significant after permutation testing. Of clinical importance is the positive (PPV) and negative predicted values (NPV) of our multimodal model and how they can be used to inform clinical

decision making. As PPV was low (PPV = 41.64), using this model to confirm clinician suspicion of rehospitalization is not supported. On the contrary, NPV was modestly high (NPV = 81.57).

Suggesting that the model could be used to confirm a clinician’s suspicion of low rehospitalization risk, potentially identifying the patient as suitable for less assertive follow-up. As few

studies are yet to consider a diverse range of biomarker modalities for rehospitalization trajectory modelling, the discriminative ability of the included modalities, as well as their

interaction with clinical markers is of interest. First, our best performing model was multimodal, including a range of clinical, blood-biomarker, and structural imaging predictors. A recent

work by Koutsouleris et al.45 also demonstrated the superiority of multimodal models, finding that a combination of clinical and neuroimaging markers was most predictive of functional

outcomes in a cohort of patients at risk for psychosis with recent onset depression. Our findings build on this work as well as provide preliminary evidence for the discriminative ability of

blood biomarkers on a well-defined and clinically meaningful dichotomous endpoint. Given that clinical predictors have consistently demonstrated their discriminative ability in multivariate

pattern recognition studies12,14,46, it is unsurprising that 8 of 10 predictors in our multimodal solution were clinical. On the contrary, PGRS scores were of no use for rehospitalization

classification (test AUC = 50.5) showing the least discriminative ability of all the modelled modalities. Even with lower _p_ value thresholds (_p_ = 0.05 and 0.01), no changes were seen in

our multimodal model. In addition, unimodal PGRS discrimination was still low at these thresholds (AUC = 54.83 and 54.17 respectively, Supplementary Information 3.2). PGRS scores have

received strong interest in psychiatric research, hoping that the aggregation of multiple single-nucleotide polymorphisms may illuminate genetic differences in psychiatric traits as well as

parse the heterogeneity of outcomes such as medication response47,48. Significant associations with outcomes of interest have been consistently demonstrated, however, commonly explain less

than 1–2% of variance, limiting their current clinical use. Given that PGRS scores could not classify rehospitalization any better than chance in the current work, the discovery of more

variants as well as non-linear modelling techniques may improve their clinical utility in future works49. All other biological modalities performed better, yet still lacked clinically

meaningful discriminative ability when modelled without clinical predictors. Given these findings and those of Koutsouleris et al.45, it appears that biomarkers may be of prognostic use but

likely perform best when modelled with clinical predictors also. Regarding the selection of diazepines, oxazepines, thiazepines, and oxepines and their positive association with

rehospitalization, it is possible that patients on this class of drugs at baseline had a more severe form of illness requiring augmentation with antipsychotic medications, placing them at a

greater risk of rehospitalization. In addition, thyroid medication use was negatively associated with rehospitalization. Besides a small handful of patients, those with a past diagnosis of

hyper/hypothyroidism who were not currently taking thyroid medications had t3, t4, and TSH levels within healthy reference ranges (Supplementary Information 3.3), suggesting that it was not

the presence of unmedicated patients with thyroid disorders relapsing at high rates that were responsible for thyroid medication use and its negative association with rehospitalization.

Considering that nearly all patients were taking antidepressant medications at their baseline assessment, it is possible that a synergistic prophylactic relationship between thyroid and

antidepressant medication use may exist. Such a polytheraputic relationship has been robustly demonstrated in the multisite sequenced alternatives to relieve depression trial (STAR*D)29.

However, prophylactic effects against rehospitalization should be explored in future works. Of interest, right hippocampal volume showed a positive association with rehospitalization. This

was surprising, given the well-established effect of smaller hippocampal volumes in MDD patients compared to healthy controls6. To better understand this observation, we conducted a range of

sub-analysis (Supplementary Information 3.4). In accordance with previous works50,51, we showed that those taking any form of antipsychotic at baseline had larger right hippocampal volumes

than those who were not, albeit, this effect was not significant after controlling for covariates. In addition, we showed that there was a significant gender/diazepine, oxazepine,

thiazepine, and oxepine use interaction effect on right hippocampal volumes, with women currently taking medications from this class having significantly larger right hippocampal volumes

than those who were not. Forty-four percent of these women ended up being rehospitalized between baseline and their 2-year follow-up assessment, compared to only 22% of women who were not.

While we had binary usage data for medication, dosage data were not available. Dosage effects were likely prevalent, but not quantifiable in the current work. Overall, it is plausible that

changes in right hippocampal volume proxied for a complex gender/medication/dosage-specific aetiology not fully represented in our clinical data. The first limitation of the current work is

the models’ scope. As our model was trained on a middle-aged European cohort, it is plausible that some of the selected predictors in our pipeline were unique to this demographic. Predictors

such as the number of previous inpatient episodes will likely be larger on an older cohort, offering more discriminative ability. Therefore, the clinical utility of the model needs to be

considered within this scope. To overcome this limitation, we have provided our trained model online through the Photon AI model repository. Further limitations include the reporting of

nominal significance in our logistic regression model (Supplementary Information 3.1). It is important to consider that the nominally significant effects that were found in our non-penalized

logistic regression model did not survive FDR corrections (Supplementary Table 2). However, our primary interest was the emergent multivariate pattern that demonstrated statistically

significant class separation between cases and controls at _p_ _=_ 9.99−e05. Given the importance and statistical significance of this aggregated multivariate pattern, we deemed it necessary

to conduct an exploratory analysis to illuminate the contribution of each individual predictor after rigorously controlling for known covariates age, gender, smoker status, BMI, and

intracranial volume. Given this, we believe the nominal significance of predictors in our logistic regression model to be of use for hypothesis generation in future works. Regarding clinical

use, while our multimodal model was statistically significant after permutation testing, its balanced accuracy was relatively low (BA = 63.05). Whether this level of performance is

sufficient for clinical use is unknown; however, this BA is similar to that attained in a previous work that is now deployed clinically (BA = 59.60–64.6)14. Further, we argue that it is not

the absolute accuracy of a model that should dictate whether it is of clinical use, but whether or not it outperforms current clinical best practice. For example, as rehospitalization risk

is not formally and routinely quantified by clinicians to inform their clinical decision making, we would not expect clinician prognostication of rehospitalization to be any better than

chance. Therefore, even a modestly performing model would theoretically confer clinical advantage when deployed at scale. In addition, such a model would be free of well-documented

decision-making biases that are known to affect clinician prognostication52 (e.g., anchoring effects), potentially providing further clinical benefit under high workloads commonly seen in

inpatient care. Nonetheless, for these questions to be answered, future works that benchmark clinicians’ prognostic abilities are needed before such incremental utility can be quantified.

Finally, it is important to note that while our multimodal model provided the highest degree of class separation, after testing for post hoc differences between all models and FDR correcting

_p_ values, our multimodal model did not significantly outperform our model containing only clinical predictors (_p_ _=_ 0.15) (Supplementary Information 3.2). Future works of increased

size may be required to elucidate a statistically significant difference given the small predictive discrepancy between the multimodal and clinical model. On the contrary, clinical data of

sufficient depth (for example, that captures the previously discussed gender/medication/dosage effects and their association with right hippocampal volumes) may even mitigate the

contribution of biomarkers in future works. If so, these models will bestow both ease of use and economic advantage, rendering the inclusion of costly and harder to attain biomarkers

ineffectual. To answer these questions a greater dearth of clinical data, discovery samples of increased size, as well as validation samples that are phenotypically and geographically

diverse will be required. In conclusion, the presented findings suggest that the combination of ML techniques with multimodal clinical and biomarker data may lead to an increase in

prognostic certainty compared to chance level. Continued research is required, but ML may be of use to derive models for clinicians to make personalized predictions regarding

rehospitalization risk as well as better inform prophylactic treatment strategies. REFERENCES * Hardeveld, F., Spijker, J., De Graaf, R., Nolen, W. A. & Beekman, A. T. Prevalence and

predictors of recurrence of major depressive disorder in the adult population. _Acta Psychiatr. Scand._ 122, 184–191 (2010). Article CAS PubMed Google Scholar * Paykel, E. S. et al.

Residual symptoms after partial remission: an important outcome in depression. _Psychol. Med._ 25, 1171–1180 (1995). Article CAS PubMed Google Scholar * Ramana, R. et al. Remission and

relapse in major depression: a two-year prospective follow-up study. _Psychol. Med._ 25, 1161–1170 (1995). Article CAS PubMed Google Scholar * Keller, M. B., Lavori, P. W., Lewis, C. E.

& Klerman, G. L. Predictors of relapse in major depressive disorder. _JAMA_ 250, 3299–3304 (1983). Article CAS PubMed Google Scholar * Grilo, C. M. et al. Personality disorders

predict relapse after remission from an episode of major depressive disorder: a 6-year prospective study. _J. Clin. Psychiatry_ 71, 1629–1635 (2010). Article PubMed PubMed Central Google

Scholar * Schmaal, L. et al. Subcortical brain alterations in major depressive disorder: findings from the ENIGMA Major Depressive Disorder working group. _Mol. Psychiatry_ 21, 806–812

(2016). Article CAS PubMed Google Scholar * Appelhof, B. C. et al. Glucocorticoids and relapse of major depression (dexamethasone/corticotropin-releasing hormone test in relation to

relapse of major depression). _Biol. Psychiatry_ 59, 696–701 (2006). Article CAS PubMed Google Scholar * Zobel, A. W., Yassouridis, A., Frieboes, R. M. & Holsboer, F. Prediction of

medium-term outcome by cortisol response to the combined dexamethasone-CRH test in patients with remitted depression. _Am. J. Psychiatry_ 156, 949–951 (1999). Article CAS PubMed Google

Scholar * Berman, R. M. et al. Transient depressive relapse induced by catecholamine depletion: potential phenotypic vulnerability marker? _Arch. Gen. Psychiatry_ 56, 395–403 (1999).

Article CAS PubMed Google Scholar * Gili, M., Vicens, C., Roca, M., Andersen, P. & McMillan, D. Interventions for preventing relapse or recurrence of depression in primary health

care settings: a systematic review. _Prev. Med._ 76, S16–S21 (2015). Article PubMed Google Scholar * Teismann, H., et al. Establishing the bidirectional relationship between depression

and subclinical arteriosclerosis—rationale, design, and characteristics of the BiDirect Study. _BMC Psychiatry_ 14, 3–4. https://doi.org/10.1186/1471-244x-14-174 (2014). * Chekroud, A. M. et

al. Reevaluating the efficacy and predictability of antidepressant treatments: a symptom clustering approach. _JAMA Psychiatry_ 74, 370–378 (2017). Article PubMed PubMed Central Google

Scholar * Drysdale, A. T. et al. Resting-state connectivity biomarkers define neurophysiological subtypes of depression. _Nat. Med._ 23, 28–38 (2017). Article CAS PubMed Google Scholar

* Chekroud, A. M. et al. Cross-trial prediction of treatment outcome in depression: a machine learning approach. _Lancet Psychiatry_ 3, 243–250 (2016). Article PubMed Google Scholar *

Koutsouleris, N. et al. Prediction models of functional outcomes for individuals in the clinical high-risk state for psychosis or with recent-onset depression: a multimodal, multisite

machine learning analysis. _JAMA Psychiatry_. https://doi.org/10.1001/jamapsychiatry.2018.2165 (2018). Article PubMed PubMed Central Google Scholar * Kambeitz, J. et al. Detecting

neuroimaging biomarkers for depression: a meta-analysis of multivariate pattern recognition studies. _Biol. Psychiatry_ 82, 330–338 (2017). Article PubMed Google Scholar * Schmaal, L. et

al. Predicting the naturalistic course of major depressive disorder using clinical and multimodal neuroimaging information: a multivariate pattern recognition study. _Biol. Psychiatry_ 78,

278–286 (2015). Article PubMed PubMed Central Google Scholar * Moayedoddin, B. et al. Prevalence and clinical characteristics of the DSM IV major depression among general internal

medicine patients. _Eur. J. Intern. Med._ 24, 763–766 (2013). Article PubMed Google Scholar * Lopresti, A. L., Maker, G. L., Hood, S. D. & Drummond, P. D. A review of peripheral

biomarkers in major depression: the potential of inflammatory and oxidative stress biomarkers. _Prog. Neuropsychopharmacol. Biol. Psychiatry_ 48, 102–111 (2014). Article PubMed Google

Scholar * Weber, B., Lewicka, S., Deuschle, M., Colla, M. & Heuser, I. Testosterone, androstenedione and dihydrotestosterone concentrations are elevated in female patients with major

depression. _Psychoneuroendocrinology_ 25, 765–771 (2000). Article CAS PubMed Google Scholar * Schweiger, U. et al. Testosterone, gonadotropin, and cortisol secretion in male patients

with major depression. _Psychosom. Med._ 61, 292–296 (1999). Article CAS PubMed Google Scholar * Musselman, D. L., Evans, D. L. & Nemeroff, C. B. The relationship of depression to

cardiovascular disease: epidemiology, biology, and treatment. _Arch. Gen. Psychiatry_ 55, 580–592 (1998). Article CAS PubMed Google Scholar * Wray, N. R. et al. Genome-wide association

analyses identify 44 risk variants and refine the genetic architecture of major depression. _Nat. Genet._ 50, 668–681 (2018). Article CAS PubMed PubMed Central Google Scholar * Knight,

M. J. & Baune, B. T. Cognitive dysfunction in major depressive disorder. _Curr. Opin. Psychiatry_ 31, 26–31 (2018). Article PubMed Google Scholar * Lopresti, A. L., Hood, S. D. &

Drummond, P. D. A review of lifestyle factors that contribute to important pathways associated with major depression: diet, sleep and exercise. _J. Affect. Disord._ 148, 12–27 (2013).

Article PubMed Google Scholar * Murphy, M. J. & Peterson, M. J. Sleep disturbances in depression. _Sleep Med. Clin._ 10, 17–23 (2015). Article PubMed Google Scholar * Chekroud, S.

R. & Chekroud, A. M. Efficacy of resistance exercise training with depressive symptoms. _JAMA Psychiatry_ 75, 1091–1092 (2018). Article PubMed Google Scholar * Schmaal, L. et al.

Cortical abnormalities in adults and adolescents with major depression based on brain scans from 20 cohorts worldwide in the ENIGMA Major Depressive Disorder Working Group. _Mol. Psychiatry_

22, 900–909 (2017). Article CAS PubMed Google Scholar * Nierenberg, A. A. et al. A comparison of lithium and T(3) augmentation following two failed medication treatments for depression:

a STAR*D report. _Am. J. Psychiatry_ 163, 1519–1530 (2006). quiz 665. Article PubMed Google Scholar * Joffe, R. T. & Marriott, M. Thyroid hormone levels and recurrence of major

depression. _Am. J. Psychiatry_ 157, 1689–1691 (2000). Article CAS PubMed Google Scholar * Joffe, R., Segal, Z. & Singer, W. Change in thyroid hormone levels following response to

cognitive therapy for major depression. _Am. J. Psychiatry_ 153, 411–413 (1996). Article CAS PubMed Google Scholar * Sassarini, D. J. Depression in midlife women. _Maturitas_ 94,

149–154. Article PubMed Google Scholar * Schiller, C. E., Meltzer-Brody, S. & Rubinow, D. R. The role of reproductive hormones in postpartum depression. _CNS Spectr._ 20, 48–59

(2015). Article PubMed Google Scholar * Rodgers, S. et al. Serum testosterone levels and symptom-based depression subtypes in men. _Front. Psychiatry_ 6, 61 (2015). Article PubMed

PubMed Central Google Scholar * Almeida, O. P., Yeap, B. B., Hankey, G. J., Jamrozik, K. & Flicker, L. Low free testosterone concentration as a potentially treatable cause of

depressive symptoms in older men. _Arch. Gen. Psychiatry_ 65, 283–289 (2008). Article CAS PubMed Google Scholar * Kische, H. et al. Associations of androgens with depressive symptoms and

cognitive status in the general population. _PLoS One_ 12, e0177272 (2017). Article PubMed PubMed Central CAS Google Scholar * Parekh, A., Smeeth, D., Milner, Y. & Thure, S. The

role of lipid biomarkers in major depression. _Healthcare (Basel)_. 5, https://doi.org/10.3390/healthcare5010005 (2017). Article PubMed Central Google Scholar * Maes, M. et al. Lower

serum high-density lipoprotein cholesterol (HDL-C) in major depression and in depressed men with serious suicidal attempts: relationship with immune-inflammatory markers. _Acta Psychiatr.

Scand._ 95, 212–221 (1997). Article CAS PubMed Google Scholar * Lee, S. H. et al. Genetic relationship between five psychiatric disorders estimated from genome-wide SNPs. _Nat. Genet._

45, 984–994 (2013). Article CAS PubMed Google Scholar * Brainstorm C. et al. Analysis of shared heritability in common disorders of the brain. _Science_ 360,

https://doi.org/10.1126/science.aap8757 (2018). * Zou, H. & Hastie, T. Regularization and variable selection via the elastic net. _J. R. Stat. Soc. Ser. B (Stat. Methodol.)_ 67, 301–320

(2005). Article Google Scholar * Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. _J. Stat. Softw._ 33, 1 (2010).

Article PubMed PubMed Central Google Scholar * Platt, J. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In: Smola, A. J., Barlett,

P. L., Scholkopf, B. & Schuurmans, D. (eds.) _Advances in Large Margin Classifiers_. Vol. 10, 61–74 (The MIT press, Cambridge, Massacheussets, London, England, 1999). * Niculescu-Mizil

A., Caruana R. Predicting good probabilities with supervised learning. In: _Proceedings of the 22nd International Conference on Machine Learning_. 625–632 (ACM, 2005). * Koutsouleris, N. et

al. Prediction models of functional outcomes for individuals in the clinical high-risk state for psychosis or with recent-onset depression: a multimodal, multisite machine learning analysis.

_JAMA Psychiatry_ 75, 1156–1172 (2018). Article PubMed PubMed Central Google Scholar * Koutsouleris, N., Kahn, R. S. & Chekroud, A. M. Multisite prediction of 4-week and 52-week

treatment outcomes in patients with first-episode psychosis: a machine learning approach (vol 3, pg 935, 2016). _Lancet Psychiatry_ 4, 95 (2017). Article Google Scholar * Amare, A. T. et

al. Association of polygenic score for schizophrenia and HLA antigen and inflammation genes with response to lithium in bipolar affective disorder: a genome-wide association study. _JAMA

Psychiatry_ 75, 65–74 (2018). PubMed Google Scholar * Amare, A. T. et al. Association of the polygenic scores for personality traits and response to selective serotonin reuptake inhibitors

in patients with major depressive disorder. _Front. Psychiatry_ 9, 65 (2018). Article PubMed PubMed Central Google Scholar * Zheutlin, A. B. et al. Multivariate pattern analysis of

genotype–phenotype relationships in schizophrenia. _Schizophrenia Bull._ 44, 1045–1052 (2018). Article Google Scholar * Bodnar, M. et al. The effect of second-generation antipsychotics on

hippocampal volume in first episode of psychosis: longitudinal study. _BJPsych Open._ 2, 139–146 (2016). Article PubMed PubMed Central Google Scholar * Chakos, M. H. et al. Duration of

illness and treatment effects on hippocampal volume in male patients with schizophrenia. _Br. J. Psychiatry_ 186, 26–31 (2005). Article PubMed Google Scholar * Saposnik, G., Redelmeier,

D., Ruff, C. C. & Tobler, P. N. Cognitive biases associated with medical decisions: a systematic review. _BMC Med Inf. Decis. Mak._ 16, 138 (2016). Article Google Scholar Download

references ACKNOWLEDGEMENTS The BiDirect Study is funded by the German Federal Ministry of Education and Research (BMBF (01ER0816 and 01ER1506, and 01ER1205)). Additional funding for the

analysis was provided by the German Research Foundation (DFG, grant FOR2107 DA1151/5-1 and DA1151/5-2 to U.D.; SB-TRR58, Projects C09 and Z02), the Interdisciplinary Center for Clinical

Research (IZKF) of the medical faculty of Münster (grant Dan3/012/17 to U.D., SEED 11/18 to NO) and the Deanery of the Medical Faculty of the University of Münster. In addition, this work

was supported with supercomputing resources provided by the Phoenix HPC service at the University of Adelaide. The funding sources had no role in study design, collection, analysis, or

interpretation of the data, in the writing of the report, or in the decision to submit the article for publication. AUTHOR INFORMATION Author notes * These authors contributed equally: Micah

Cearns, Nils Opel AUTHORS AND AFFILIATIONS * Discipline of Psychiatry, School of Medicine, University of Adelaide, Adelaide, Australia Micah Cearns & Scott Clark * Department of

Psychiatry, University of Münster, Münster, Germany Nils Opel, Claas Kaehler, Udo Dannlowski & Bernhard T. Baune * Interdisciplinary Centre for Clinical Research (IZKF), University of

Münster, Münster, Germany Nils Opel * Centre for Healthy Brain Ageing, UNSW, Sydney, Australia Anbupalam Thalamuthu * Institute of Clinical Radiology, University of Münster, Münster, Germany

Walter Heindel * Institute of Clinical Chemistry and Laboratory Medicine, University Medicine Greifswald, Greifswald, Germany Theresa Winter * Integrated Research Biobank, University

Medicine Greifswald, Greifswald, Germany Theresa Winter * Institute of Epidemiology and Social Medicine, University of Münster, Münster, Germany Henning Teismann, Heike Minnerup & Klaus

Berger * Department of Psychiatry, Melbourne Medical School, The University of Melbourne, Melbourne, Australia Bernhard T. Baune * The Florey Institute of Neuroscience and Mental Health, The

University of Melbourne, Melbourne, Australia Bernhard T. Baune Authors * Micah Cearns View author publications You can also search for this author inPubMed Google Scholar * Nils Opel View

author publications You can also search for this author inPubMed Google Scholar * Scott Clark View author publications You can also search for this author inPubMed Google Scholar * Claas

Kaehler View author publications You can also search for this author inPubMed Google Scholar * Anbupalam Thalamuthu View author publications You can also search for this author inPubMed

Google Scholar * Walter Heindel View author publications You can also search for this author inPubMed Google Scholar * Theresa Winter View author publications You can also search for this

author inPubMed Google Scholar * Henning Teismann View author publications You can also search for this author inPubMed Google Scholar * Heike Minnerup View author publications You can also

search for this author inPubMed Google Scholar * Udo Dannlowski View author publications You can also search for this author inPubMed Google Scholar * Klaus Berger View author publications

You can also search for this author inPubMed Google Scholar * Bernhard T. Baune View author publications You can also search for this author inPubMed Google Scholar CORRESPONDING AUTHOR

Correspondence to Bernhard T. Baune. ETHICS DECLARATIONS CONFLICT OF INTEREST The authors declare that they have no conflict of interest. ADDITIONAL INFORMATION PUBLISHER’S NOTE Springer

Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION RIGHTS AND PERMISSIONS OPEN

ACCESS This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format,

as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third

party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the

article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright

holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. Reprints and permissions ABOUT THIS ARTICLE CITE THIS ARTICLE Cearns, M., Opel, N., Clark, S. _et

al._ Predicting rehospitalization within 2 years of initial patient admission for a major depressive episode: a multimodal machine learning approach. _Transl Psychiatry_ 9, 285 (2019).

https://doi.org/10.1038/s41398-019-0615-2 Download citation * Received: 30 April 2019 * Revised: 08 October 2019 * Accepted: 20 October 2019 * Published: 11 November 2019 * DOI:

https://doi.org/10.1038/s41398-019-0615-2 SHARE THIS ARTICLE Anyone you share the following link with will be able to read this content: Get shareable link Sorry, a shareable link is not

currently available for this article. Copy to clipboard Provided by the Springer Nature SharedIt content-sharing initiative