Transdiagnostic individualized clinically-based risk calculator for the automatic detection of individuals at-risk and the prediction of psychosis: external replication in 2,430,333 US patients

Oliver, Dominic; Wong, Chiew Meng Johnny; Bøg, Martin; Jönsson, Linus; Kinon, Bruce J.; Wehnert, Allan; Jørgensen, Kristian Tore; Irving, Jessica; Stahl, Daniel; McGuire, Philip; Raket, Lars Lau; Fusar-Poli, Paolo

doi:10.1038/s41398-020-01032-9

Transdiagnostic individualized clinically-based risk calculator for the automatic detection of individuals at-risk and the prediction of psychosis: external replication in 2,430,333 US patients

Article
Open access
Published: 29 October 2020

Volume 10, article number 364, (2020)
Cite this article

Download PDF

You have full access to this open access article

Translational Psychiatry

Transdiagnostic individualized clinically-based risk calculator for the automatic detection of individuals at-risk and the prediction of psychosis: external replication in 2,430,333 US patients

Download PDF

Dominic Oliver ORCID: orcid.org/0000-0002-8920-3407¹^na1,
Chiew Meng Johnny Wong²^na1,
Martin Bøg³,
Linus Jönsson^3,4,
Bruce J. Kinon⁵,
Allan Wehnert³,
Kristian Tore Jørgensen³,
Jessica Irving¹,
Daniel Stahl ORCID: orcid.org/0000-0001-7987-6619⁶,
Philip McGuire^7,8,
Lars Lau Raket^3,9 &
…
Paolo Fusar-Poli ORCID: orcid.org/0000-0003-3582-6788^1,7,10

2376 Accesses
16 Citations
17 Altmetric
Explore all metrics

Abstract

The real-world impact of psychosis prevention is reliant on effective strategies for identifying individuals at risk. A transdiagnostic, individualized, clinically-based risk calculator to improve this has been developed and externally validated twice in two different UK healthcare trusts with convincing results. The prognostic performance of this risk calculator outside the UK is unknown. All individuals who accessed primary or secondary health care services belonging to the IBM^® MarketScan^® Commercial Database between January 2015 and December 2017, and received a first ICD-10 index diagnosis of nonorganic/nonpsychotic mental disorder, were included. According to the risk calculator, age, gender, ethnicity, age-by-gender, and ICD-10 cluster diagnosis at index date were used to predict development of any ICD-10 nonorganic psychotic disorder. Because patient-level ethnicity data were not available city-level ethnicity proportions were used as proxy. The study included 2,430,333 patients with a mean follow-up of 15.36 months and cumulative incidence of psychosis at two years of 1.43%. There were profound differences compared to the original development UK database in terms of case-mix, psychosis incidence, distribution of baseline predictors (ICD-10 cluster diagnoses), availability of patient-level ethnicity data, follow-up time and availability of specialized clinical services for at-risk individuals. Despite these important differences, the model retained accuracy significantly above chance (Harrell’s C = 0.676, 95% CI: 0.672–0.679). To date, this is the largest international external replication of an individualized prognostic model in the field of psychiatry. This risk calculator is transportable on an international scale to improve the automatic detection of individuals at risk of psychosis.

The Biopsychosocial Approach: Towards Holistic, Person-Centred Psychiatric/Mental Health Nursing Practice

Bipolar Disorder and Suicide: a Review

Article 18 January 2020

Deep learning and machine learning in psychiatry: a survey of current progress in depression detection, diagnosis and treatment

Article Open access 24 April 2023

Introduction

Under standard care, clinical outcomes in psychosis are suboptimal; prevention and early intervention are essential to improve outcomes of this disorder¹. Primary indicated prevention of psychosis revolves around the ability to detect, assess and care for individuals at risk of psychosis. The Clinical High Risk state for Psychosis (CHR-P)² includes individuals who present with attenuated psychotic symptoms, impaired functioning³ and help-seeking behavior. Twenty percent of these individuals develop a psychotic disorder within two years⁴. Primary indicated prevention of psychosis through specialized CHR-P clinical services⁵ is uniquely positioned to alter the course of the disorder and improve outcomes¹.

The impact of the CHR-P approach is contingent on effective identification of individuals at risk of developing psychosis. Because of complex interactions between help-seeking behaviors, recruitment strategies and referral pathways⁶, detection of at-risk individuals is currently inefficient: only 5%⁷–12%⁸ of first-episode cases are identified by specialized or youth mental health CHR-P services. Moreover, these services are only available to a limited number of individuals, with only 48 services mapped worldwide⁹. To overcome these problems, a transdiagnostic, individualized, clinically-based risk calculator has been developed in the South London and Maudsley (SLaM) NHS Trust boroughs of Lambeth and Southwark (n = 33,820)⁷. This prognostic model uses core predictors that were selected on a priori meta-analytical knowledge¹⁰ (age, gender, ethnicity, primary index diagnosis and age*gender interaction), that are routinely collected in clinical care, to forecast individual level of psychosis risk up to six years. This model leverages electronic health record (EHR) data, therefore allowing for the automatic detection of at-risk individuals. This prognostic model has shown adequate performance in a first external validation in the SLaM boroughs of Lewisham and Croydon (n = 54,716, Harrell’s C = 0.79)⁷ and in a second external validation in the Camden and Islington NHS Foundation Trust (C&I; n = 13,702, Harrell’s C = 0.73)¹¹, with Harrell’s C demonstrating the probability that a randomly selected patient who experienced an event will have a higher score than a patient who did not. This prognostic model is also currently being piloted for real-world implementation in clinical routine in the UK¹².

Despite these promising results, it is not yet clear whether this prognostic model is transportable to international healthcare settings. External validation studies are scarce in psychiatry, undermining the translational impact of research discoveries. This study aims to investigate the international external validity of the original transdiagnostic, clinically-based, individualized risk calculator using large scale EHRs from the US.

Materials and methods

Design

Retrospective cohort study using Electronic Health Records (EHRs) conducted according to the REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) statement¹³ (see checklist reported in Table S1).

Data source

The IBM^® MarketScan^® Commercial Database (hereafter Commercial) contains data from approximately 65 million people from multiple geographically dispersed US states, who are covered by employer-sponsored health insurance plans. This data includes all medical and pharmaceutical claims for these individuals and their dependents (Methods S1). It provides contemporaneous and ‘real-world’ data on both routine primary and secondary mental healthcare.

Study population

All patients accessing primary or secondary healthcare between 1 January 2015 and 31 December 2017 who received an ICD-10 primary index diagnosis of a nonorganic and nonpsychotic mental disorder (Methods S2). To ensure correct diagnosis classification, a lookback period of six months was applied to each patient (Methods S3).

Follow-up

Follow-up started at the time of the ICD-10 index diagnosis and ended when a transition to psychosis was recorded, or when the patient dropped out of the EHR (as documented by the last entry on Commercial).

Model specifications

The original transdiagnostic, clinically-based, individualized risk calculator was developed using a retrospective cohort study leveraging EHRs of the SLaM boroughs of Lambeth and Southwark, firstly validated in the SLaM boroughs of Croydon and Lewisham⁷ and secondly validated in C&I¹¹ in the UK. In summary, a Cox model was used to predict the hazard ratio of developing any psychotic disorder over time (see Methods S2 for definition) as primary outcome of interest. The predictors included age (at the time of the index diagnosis), gender, age*gender, self-assigned ethnicity, and cluster index diagnosis (ICD-10 diagnostic spectra: acute and transient psychotic disorders (ATPD), substance use disorders, bipolar mood disorders, nonbipolar mood disorders, anxiety disorders, personality disorders, developmental disorders, childhood/adolescence onset disorders, physiological syndromes, mental retardation). Self-assigned ethnicity and index diagnoses were operationalized as indicated in Tables S2 and S3. A weighted sum of covariates with the model weights from the Cox model resulted in the Prognostic Index (PI). From this, the risk of the individual developing a psychotic disorder within a time period (between one and six years) could be calculated¹⁴.

Since this model was originally developed on a retrospective cohort⁷, it excluded cases with an onset of psychosis within the first three months to minimize the short-term diagnostic instability of baseline ICD-10 index diagnoses. However, during the subsequent implementation study^12,15 an updated version of the model was adapted for prospective use (i.e., not excluding transitions occurring in the first three months), demonstrating similar prognostic performance (Table S4). Furthermore, a lookback period was additionally used in this study (see Methods S3), to minimize the risk of misclassification of index diagnosis date. The specifications of the present model are fully detailed in Table S5.

A main difference compared to the SLaM dataset was that there were no patient-level ethnicity data in Commercial. To mitigate this issue, aggregate ethnicity coefficients were generated for patients who had Metropolitan Statistical Area (MSA) and state-level ethnicity data using Integrated Public Use Microdata Series (IPUMS) census data (www.ipums.org). The geographical information from IPUMS were matched with the geographical data available for each patient in the study population from Commercial, assigning each patient with a vector of ethnic weights for each level of the ethnicity predictor. For example, if a patient were matched for New York (NY) state and Ithaca, NY MSA and was diagnosed in 2016, the proportions of White individuals in the MSA in the year of index diagnosis was 0.82, Black individuals was 0.03, Asian individuals was 0.10, Mixed individuals was 0.03 and Other was 0.01. For comparability purposes we also reported the performance of the original model⁷ (i) without ethnicity as a predictor and (ii) with computed aggregate ethnicity using census data¹⁶ (Table S6).

Statistical analysis

Model external validation followed the guidelines of Royston and Altman¹⁷, Steyerberg et al.¹⁸, and the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD)¹⁹. The study protocol is uploaded in the Research Registry database (www.researchregistry.com, researchregistry5130).

For a general overview of prognostic modeling methods, including external validation procedures, see our recent review²⁰. To interpret the performance of a risk model in the context of external validation, it is essential to first quantify the similarities between development and validation samples²¹. External validity only assesses model transportability if validation samples have a different case-mix, with the greater the difference in the case-mixes, the greater the possibility of generalizing to other populations. Thus, we investigated the extent to which the SLaM and Commercial datasets comprised patients with sets of prognostically relevant predictors in common, comparable time to event outcomes with roughly similar follow-up times, and the same clinical condition observed in similar settings²².

As a first step, we described the Commercial patient population, including the configuration of clinical services and compared with SLaM. Baseline clinical and sociodemographic characteristics of the sample (including missing data) were described by means and SDs for continuous variables, and absolute and relative frequencies for categorical variables²².

In a second step, we visually compared the two Kaplan–Meier failure functions, showing the number of patients developing a psychotic disorder, as well as those still at risk, over time. The overall cumulative risk of psychosis onset in Commercial was visualized with the Kaplan–Meier failure function (1—survival)²³ and Greenwood 95% confidence intervals (CIs)²⁴. Curves that vary noticeably may indicate systematic differences within the study populations²².

In a third step, we reported the spread (SD) and mean of the PI in the two datasets. An increased (or decreased) variability of the PI would indicate more (or less) heterogeneity of case-mix between the two datasets, and therefore, of their overarching target populations²¹. Differences in the mean PI indicate differences in overall (predicted) outcome frequency, reflecting case-mix severity between the two datasets (and revealing the model’s calibration-in-the-large in the Commercial database)²¹. Continuous variables were tested with independent t-tests.

We then performed the formal external validation, assessing the prognostic accuracy of the model in the Commercial database²². Accordingly, the regression coefficients obtained from our model developed in SLaM (see Table S6) were applied to each case in the external Commercial database, to generate the PI in the Commercial database. In the case of ethnicity, the aggregate ethnic weights were multiplied by their respective regression coefficients to provide an aggregate coefficient for that patient. The sum of an individual’s regression coefficients resulted in an individualized PI. The greater the PI, the higher the risk of the individual developing a psychotic disorder.

Since we were interested in discrimination, the primary outcome measure for this study was the external model performance (accurate predictions discriminate between those with and those without the outcome)¹⁸, defined with the Harrell’s C-index¹⁷. Harrell’s C is a recommended measure for external validation of Cox models according to established guidelines¹⁷. Harrell’s C is the probability that for a random pair of “case” and “control,” the predicted risk of an event (PI) is higher for the “case”²⁵. In addition, we estimated the overall model performance¹⁸ using the Brier score (average mean squared difference between predicted probabilities and actual outcomes, which also captures calibration and discrimination aspects)¹⁸. Calibration (agreement between observed outcomes and predictions)¹⁸ was assessed using the regression slope of the PI^17,18.

As a further exploratory step, we updated the model using the regression slope on the PI as a shrinkage factor for recalibration, in line with the Royston et al. guidelines²².

All analyses were conducted in R version 3.3.2²⁶. using the survival package, and significance was set to P < .05.

Results

Commercial sample characteristics

A total of 3,828,791 patients accessing primary or secondary healthcare between January 2015 and December 2017 received an ICD-10 primary index diagnosis of a nonorganic and nonpsychotic mental disorder. 2,430,333 (63.5%) of these individuals could be matched with ethnicity data, and were included in the analysis, as detailed in the study flow-diagram (Fig. 1). Patients accessing Commercial and included in this study had an average age of 34.2 years (95% CI: 34.19–34.23), 59% were female, and White ethnicity was particularly common in patients’ MSAs (79%). The most frequent index diagnosis was anxiety disorders (45%). Full sociodemographic information is provided in Table 1.

**Fig. 1: Flowchart of the study population.**

Table 1 Sociodemographic characteristics of the commercial dataset compared with the SLaM dataset.

Full size table

Differences between the commercial and SLaM databases

Sociodemographic and service configuration differences

The most important difference is that while the SLaM database contains data on individuals accessing publicly funded secondary mental healthcare, Commercial is limited to individuals covered by employer-sponsored health insurance plans. Compared to the full population, incidence of psychosis may be rarer in those covered by private insurance such as in the Commercial dataset. Similar to the C&I Trust that was the basis of the second external replication study, Commercial did not include CHR-P services; therefore, there were no CHR-P diagnoses. Additional differences are that Commercial data incorporates both primary and secondary healthcare, compared to solely secondary healthcare in SLaM and C&I, as well as the aggregation of ethnicity data as discussed in “Methods” section. The average patient’s age in the Commercial was 0.2 years lower than in SLaM (p = 0.03). Compared with SLaM, there was a lower incidence of ATPD, substance use disorders, bipolar mood disorders, personality disorders, developmental disorders, physiological syndromes and mental retardation in the Commercial dataset. Conversely, there were higher rates of nonbipolar mood disorders, anxiety disorders and childhood/adolescence onset disorders. Finally, there were fewer males in Commercial than in SLaM (Table 1).

Cumulative risk of psychosis in commercial compared with the SLaM derivation dataset

The average follow-up time in Commercial was 460.89 days (SD = 280.04) compared with 1580.64 days (SD = 927.72) in SLaM. There were 24,941 (1.03% of the sample size) events (transition to psychosis) in Commercial compared with 1,273 (3.72% of the sample size) in SLaM. The average time to transition to psychosis in those who transitioned was 199.77 days (SD = 204.48) in Commercial compared to 664.03 days (SD = 621.04) in SLaM. The two-year cumulative risk of psychosis in the Commercial was 1.43% (95% CI: 1.41–1.45%, with the last transition being observed at 819 days), compared to 2.57% (95% CI: 2.40%–2.75%, with the last transition being observed at 3,246 days) in SLaM. The cumulative incidences curves (Kaplan–Meier) from the Commercial and SLaM datasets are compared in Fig. 2. Mean values of the PI within the Commercial and SLaM databases were −1.51 and −1.18, respectively (P < .001). SD of the PI in the Commercial and SLaM databases were 0.70 and 0.94, respectively (P < .001).

**Fig. 2: Cumulative incidence for the risk of psychotic disorders in Commercial Database and SLaM derivation database.**

External validation in the commercial database

The comparative model performance in the SLaM dataset using aggregate ethnicity data was 0.761 (Table S5). In the Commercial dataset, the model predicted significantly better than chance, with a Harrell’s C of 0.676 (95% CI: 0.672–0.679, Harrell’s C in SLaM = 0.79). The two-year Brier score was 0.013 (two-year Brier score in SLaM = 0.012). The model did not show major calibration issues, with a regression slope close to 1: 0.93, 95% CI: 0.91–0.94 (P < .001).

Updating the model optimized calibration (regression slope = 1) but conferred no substantial improvement in model performance (full model specifications are appended in Table S6).

Discussion

This is the largest ever replication study of a risk prediction model in psychiatry. The study demonstrates that the transdiagnostic, individualized risk calculator was able to detect individuals at risk of psychosis in an international setting with a prognostic discriminative performance that was significantly above chance.

To our knowledge, this is the largest ever external replication study of a risk calculator not only in early psychosis but also in clinical psychiatry. Importantly, this study included 24,941 events (transitions to psychosis) which are over one hundred times the minimum recommended amount of 100 events required to produce accurate estimates of external prognostic accuracy^20,27. The previous largest external validation study of this kind was our first external replication of this calculator conducted in SLaM (n = 33,820)⁷, followed by a validation study of a calculator that predicts major depressive disorder (n = 29,621)²⁸ and by another calculator that predicts risk of violent crime in patients with severe mental illness (n = 16,387)²⁹, all smaller than our sample size of 2,40,333. This is a substantial achievement given that prognostic modeling in psychiatry is affected by a severe scarcity of replication efforts³⁰, to the point that replication has become equally as—or even more—important than discovery³¹. A systematic review and meta-analysis of clinical prediction models for predicting the onset of psychosis in CHR-P people uncovered 91 studies, none of which performed a true external validation of an existing model³². This is the only transdiagnostic clinical prediction model to be externally validated in three different populations (Lewisham & Croydon SLaM NHS Trust, C&I and now Commercial); another risk prediction model for use in CHR-P patients has also received three independent validations^33,34,35. A full list of individualized risk prediction models that have been externally replicated in the field of early psychosis is detailed in Table 2.

Table 2 Individualized clinical prediction models that have been externally validated for early psychosis.

Full size table

The additional strength of this study is that it provides further empirical support for the use of EHRs in the context of precision psychiatry. Transporting risk prediction models across different EHRs representing heterogeneous clinical settings is complex because they reflect underlying differences in the patient population. A first empirical challenge is the availability of predictors and outcomes. The vast majority of predictors were available in the Commercial database, with the exception of ethnicity; patient-level ethnicity variables were computed to compensate for this. There was also a shorter follow-up time in Commercial compared to SLaM, as ICD-10 was only integrated into United States healthcare on 1 October 2015. Use of ICD-9 diagnoses was considered to extend follow-up but converting diagnostic clusters to ICD-9 proved inexact and therefore inappropriate. A second challenge is to quantify the differences between development and validation databases to interpret the performance of a risk model in the context of external validation²¹. For example, compared with SLaM, where the model was developed, there were apparent differences in sociodemographic characteristics in Commercial (fewer males and fewer patients of Black ethnicity and different frequency of ICD diagnoses, reflected by smaller spread of the PI) and time to event (shorter). Furthermore, similar to our second replication in C&I¹¹, there were no CHR-P services in Commercial and, therefore, no CHR-P designations. However, as ATPD diagnoses are typically not made in CHR-P or early intervention services³⁶, the number of ATPD diagnoses in Commercial are unlikely to be affected by this difference in service configuration. Because of this case-mix, the incidence of psychosis was about half in Commercial (1.43/2.57 at two years, reflected by a lower mean value of the PI). The most important difference is that, while previous replications were performed in data collected from publicly funded secondary mental healthcare alone, the Commercial database was composed of both primary and secondary healthcare data composed of commercially insured patients. Given such relevant differences, it was expected that the risk calculator could not be easily transported to the Commercial setting and that it would achieve a lower prognostic performance and calibration than that observed in the first two external validations.

Despite these differences in clinical setting and populations, the overall prognostic accuracy of the transdiagnostic, clinically-based risk calculator remained significantly above chance. As expected, the level of prognostic performance (Harrell’s C = 0.68) was suboptimal and lower than our previous external validation (Harrell’s C = 0.73)¹¹. Yet, this level of accuracy is comparable to that of structural neuroimaging methods (i.e., gray matter volume) to detect a first-episode of psychosis at the individual level, with accuracies ranging from 0.5 to 0.63³⁷. A recent machine-learning study externally validated a risk calculator to predict treatment outcome in depression in 151 patients. The study reported a one year prognostic accuracy of 0.59 and concluded that, if implemented at scale, performance even only significantly above chance can be considered to be clinically useful³⁸. Given that our risk calculator has been developed on real-world EHR data, it offers the potential for automatically screening large mental health populations. Psychiatry is undergoing a digital revolution³⁹, and there is an ongoing expansion of EHR adoption worldwide. More to this point, this risk calculator was evidently developed with a clear vision of future implementation as decision support in clinical routine and is currently being piloted in this capacity^12,15. For example, it uses simple predictors that can easily be understood by clinicians, as compared to complex black-box machine-learning-derived algorithms⁴⁰. Furthermore, harnessing data from EHRs is cheaper than other methods such as patient recruitment, because most of the predictors are available as part of clinical routine. There are no competing algorithms (CHR-P instruments are not usable for screening purposes)⁴¹ to screen the at-risk population at scale. Other risk prediction tools in early psychosis have shown promise, however they predominantly rely on clinical symptom scores^42,43, which means they are more financially and labor intensive than this tool; potential for automation is therefore limited. Moreover, these tools are focused on identifying transition to psychosis and are reliant on prior identification of CHR-P, whereas our tool is able to predict psychosis risk transdiagnostically outside of this designation. Thus, there is potential benefit in utilizing this risk calculator to screen for psychosis risk in large numbers.

There is scope for optimization of the current risk calculator through stepped risk stratification and model refinement. As a first step, this risk calculator could be deployed in a screening pathway where an individual’s risk is calculated upon entry into secondary mental health services. Individuals flagged by our risk calculator as being at risk for psychosis would progress to a more thorough clinical CHR-P assessment in the context of a staged sequential risk assessment^44,45. This could supplement other detection strategies targeting the general population, such as the Youth-Mental Risk and Resilience study (YouR-Study)⁴⁶, which provided the first evidence of digital detection tools improving identification of psychosis in the general population. A potential further step would be combining the risk calculator with additional information (environmental, genetic or biomarkers) to improve prognostic accuracy further^44,45,47, refine estimates of individuals’ risk and stratify them accordingly. This is in keeping with the current clinical staging model of early psychosis, which aims to improve preventative care and reduce the duration of untreated psychosis to improve outcomes¹. In addition to its clinical utility, this risk calculator could improve CHR-P research by aiding recruitment for much needed large-scale international collaborations in the vein of the HARMONY project, incorporating NAPLS (https://campuspress.yale.edu/napls/), PRONIA (https://www.pronia.eu/) and PSYSCAN (http://psyscan.eu), and the proposed 26-site ProNET cohort study. Furthermore, this prognostic model can be refined. In companion studies, we have tested whether using machine-learning methods and expanding the range of⁴⁸, or redefining⁴⁹, predictors might improve the prognostic accuracy of this risk calculator.

The limitations of this study are largely inherited from the original study. We did not employ structured psychometric interviews to ascertain the type of emerging psychotic diagnoses at follow-up. However, we predicted psychotic disorders rather than specific ICD-10 diagnoses, a category which has good prognostic stability⁵⁰. Therefore, while the psychotic diagnoses in our analyses are high in ecological validity (i.e., they represent real-world clinical practice), they have not been subjected to formal validation with research-based criteria. However, the use of structured diagnostic interviews can lead to selection biases, decreasing the transportability of models⁵¹. There is also meta-analytical evidence indicating that within psychotic disorders, administrative data recorded in clinical registers are generally predictive of true validated diagnoses⁵².

Other limitations were inherent in the Commercial database, mostly due to the lack of patient-level ethnicity data and a short follow-up time. These two issues reduced the prognostic performance of the model a priori, in particular considering that risk for psychosis may well extend beyond two years⁵³. It is therefore possible that prognostic performance of this model in the longer term may actually be better than the performance reported here. A further limitation is that the study team for this replication is not completely independent from the team who completed the original study⁵⁴, which is particularly relevant given the support of a pharmaceutical company. However, Lundbeck has no financial interests nor patents on this project. As this study involved a large commercial dataset and a refined version of the model, it was logistically impossible to conduct this research independently from the original team. To mitigate against this overlap, we adhered to the Royston²², RECORD¹³, and TRIPOD¹⁹ guidelines to ensure transparency. Finally, although we welcome further external validation studies, it must be noted that even strong replication does not automatically imply the potential for successful adoption in clinical or public health practice. Ideally, randomized clinical trials or economic modeling are needed to assess whether our risk calculator effectively improves patient outcomes.

Conclusion

The largest international external replication of an individualized prognostic model in psychiatry confirms that precision medicine in this discipline is feasible even at large scale. The transdiagnostic, individualized, clinically-based risk calculator is potentially transportable on an international scale to improve the automatic detection of individuals at risk of psychosis. Further research should refine the model and test the benefit of implementing this risk prediction model in clinical routine.

References

Fusar-Poli, P., McGorry, P. D. & Kane, J. M. Improving outcomes of first-episode psychosis: an overview. World Psychiatry 16, 251–265 (2017).
Article Google Scholar
Fusar-Poli, P. The clinical high-risk state for psychosis (CHR-P), Version II. Schizophr. Bull. 43, 44–47 (2017).
Article Google Scholar
Fusar-Poli, P. et al. Disorder, not just state of risk: meta-analysis of functioning and quality of life in people at high risk of psychosis. Br. J. Psychiatry 207, 198–206 (2015).
Article Google Scholar
Fusar-Poli, P. et al. Heterogeneity of psychosis risk within individuals at clinical high risk: a meta-analytical stratification. JAMA Psychiatry 73, 113–120 (2016).
Article Google Scholar
Fusar-Poli, P., Byrne, M., Badger, S., Valmaggia, L. R. & McGuire, P. K. Outreach and support in south London (OASIS), 2001-2011: ten years of early diagnosis and treatment for young individuals at high clinical risk for psychosis. Eur. Psychiatry 28, 315–326 (2013).
Article CAS Google Scholar
Fusar-Poli, P. et al. The dark side of the moon: meta-analytical impact of recruitment strategies on risk enrichment in the clinical high risk state for psychosis. Schizophr. Bull. 42, 732–743 (2016).
Article Google Scholar
Fusar-Poli, P. et al. Development and validation of a clinically based risk calculator for the transdiagnostic prediction of psychosis. JAMA Psychiatry 74, 493–500 (2017).
Article Google Scholar
McGorry, P. D., Hartmann, J. A., Spooner, R. & Nelson, B. Beyond the “at risk mental state” concept: transitioning to transdiagnostic psychiatry. World Psychiatry 17, 133–142 (2018).
Article Google Scholar
Kotlicka-Antczak, M. et al. Worldwide implementation of clinical services for the prevention of psychosis: the IEPA early intervention in mental health survey. Early Interv. Psychiatry. https://doi.org/10.1111/eip.12950 (2020).
Radua, J. et al. What causes psychosis? An umbrella review of risk and protective factors. World Psychiatry 17, 49–66 (2018).
Article Google Scholar
Fusar-Poli, P. et al. Transdiagnostic risk calculator for the automatic detection of individuals at risk and the prediction of psychosis: second replication in an independent national health service trust. Schizophr. Bull. 45, 562–570 (2019).
Article Google Scholar
Fusar-Poli, P. et al. Real world implementation of a transdiagnostic risk calculator for the automatic detection of individuals at risk of psychosis in clinical routine: study protocol. Front. Psychiatry 10, 109 (2019).
Article Google Scholar
Benchimol, E. I. et al. The reporting of studies conducted using observational routinely-collected health data (RECORD) statement. PLoS Med. 12, e1001885 (2015).
Article Google Scholar
Fusar-Poli, P. et al. Development and validation of a clinically based risk calculator for the transdiagnostic prediction of psychosis. JAMA Psychiatry 74, 493–500 (2017).
Article Google Scholar
Oliver, D. et al. Real-world implementation of precision psychiatry: transdiagnostic risk calculator for the automatic detection of individuals at-risk of psychosis. Schizophr. Res. https://doi.org/10.1016/j.schres.2020.05.007. (2020).
Office for National Statistics (ONS). Ethnic Groups by Borough. Opinion Research and General Statistics (GLA). (2018).
Royston, P. & Altman, D. G. External validation of a Cox prognostic model: principles and methods. BMC Med. Res. Methodol. 13, 33 (2013).
Article Google Scholar
Steyerberg, E. W. et al. Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiology 21, 128–138 (2010).
Article Google Scholar
Collins, G. S., Reitsma, J. B., Altman, D. G. & Moons, K. G. M. Transparent reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): the TRIPOD statement. Ann. Intern. Med. 162, 55–63 (2015).
Article Google Scholar
Fusar-Poli, P., Hijazi, Z., Stahl, D. & Steyerberg, E. W. The science of prognosis in psychiatry: a review. JAMA Psychiatry 75, 1289–1297 (2018).
Article Google Scholar
Debray, T. P. A. et al. A new framework to enhance the interpretation of external validation studies of clinical prediction models. J. Clin. Epidemiol. 68, 279–289 (2015).
Article Google Scholar
Royston, P., Parmar, M., Altman, D. G. External Validation and Updating of a Prognostic Survival Model. (Department of Statistical Science, University College London, London, 2010).
Kaplan, E. L. & Meier, P. Nonparametric estimation from incomplete observations. J. Am. Stat. Assoc. 53, 457 (1958).
Article Google Scholar
Lazarus-Barlow, W. S. & Leeming, J. H. The natural duration of cancer. Br. Med. J. 2, 266–267 (1924).
Article CAS Google Scholar
Hosmer, W. & Lemeshow, S. Applied Survival Analysis: Regression Modeling of Time to Event Data. (Wiley & Sons, New York, NY, 1999).
Google Scholar
R. Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, Vienna, Austria, 2014).
Collins, G. S., Ogundimu, E. O. & Altman, D. G. Sample size considerations for the external validation of a multivariable prognostic model: a resampling study. Stat. Med. 35, 214–226 (2016).
Article Google Scholar
Nigatu, Y. T., Liu, Y. & Wang, J. External validation of the international risk prediction algorithm for major depressive episode in the US general population: the PredictD-US study. BMC Psychiatry 16, 256 (2016).
Article Google Scholar
Fazel, S. et al. Identification of low risk of violent crime in severe mental illness with a clinical prediction tool (Oxford Mental Illness and Violence tool [OxMIV]): a derivation and validation study. Lancet Psychiatry 4, 461–468 (2017).
Article Google Scholar
Szucs, D. & Ioannidis, J. P. A. Empirical assessment of published effect sizes and power in the recent cognitive neuroscience and psychology literature. PLoS Biol. 15, e2000797 (2017).
Article CAS Google Scholar
Ioannidis, J. P. A. Evolution and translation of research findings: from bench to where. PLoS Clin. Trials 1, e36 (2006).
Article Google Scholar
Studerus, E., Ramyead, A. & Riecher-Rössler, A. Prediction of transition to psychosis in patients with a clinical high risk for psychosis: a systematic review of methodology and reporting. Psychol. Med. 47, 1163–1178 (2017).
Article CAS Google Scholar
Carrión, R. E. et al. Personalized prediction of psychosis: external validation of the NAPLS-2 psychosis risk calculator with the EDIPPP project. Am. J. Psychiatry 173, 989–996 (2016).
Article Google Scholar
Osborne, K. J. & Mittal, V. A. External validation and extension of the NAPLS-2 and SIPS-RC personalized risk calculators in an independent clinical high-risk sample. Psychiatry Res. 279, 9–14 (2019).
Article Google Scholar
Zhang, T. et al. Validating the predictive accuracy of the NAPLS-2 psychosis risk calculator in a clinical high-risk sample from the SHARP (Shanghai At Risk for Psychosis) program. Am. J. Psychiatry 175, 906–908 (2018).
Article Google Scholar
Minichino, A. et al. Unmet needs in patients with brief psychotic disorders: too ill for clinical high risk services and not ill enough for first episode services. Eur. Psychiatry 57, 26–32 (2019).
Article Google Scholar
Vieira, S. et al. Using machine learning and structural neuroimaging to detect first episode psychosis: reconsidering the evidence. Schizophr. Bull. https://doi.org/10.1093/schbul/sby189. (2019).
Chekroud, A. M. et al. Cross-trial prediction of treatment outcome in depression: a machine learning approach. Lancet Psychiatry 3, 243–250 (2016).
Article Google Scholar
Baker, J. T., Germine, L. T., Ressler, K. J., Rauch, S. L. & Carlezon, W. A. Digital devices and continuous telemetry: opportunities for aligning psychiatry and neuroscience. Neuropsychopharmacology 43, 2499–2503 (2018).
Article Google Scholar
Castelvecchi, D. Can we open the black box of AI? Nature 538, 20–23 (2016).
Article CAS Google Scholar
Fusar-Poli, P. et al. At risk or not at risk? A meta-analysis of the prognostic accuracy of psychometric interviews for psychosis prediction. World Psychiatry 14, 322–332 (2015).
Article Google Scholar
Cannon, T. D. et al. An individualized risk calculator for research in prodromal psychosis. Am. J. Psychiatry 173, 980–988 (2016).
Article Google Scholar
Zhang, T. et al. Prediction of psychosis in prodrome: development and validation of a simple, personalized risk calculator. Psychol. Med. 49, 1990–1998 (2019).
Article Google Scholar
Oliver, D., Radua, J., Reichenberg, A., Uher, R. & Fusar-Poli, P. Psychosis polyrisk score (PPS) for the detection of individuals at-risk and the prediction of their outcomes. Front. Psychiatry 10, 174 (2019).
Article Google Scholar
Schmidt, A. et al. Improving prognostic accuracy in subjects at clinical high risk for psychosis: systematic review of predictive models and meta-analytical sequential testing simulation. Schizophr. Bull. 43, 375–388 (2017).
Google Scholar
McDonald, M. et al. Using online screening in the general population to detect participants at clinical high-risk for psychosis. Schizophr. Bull. 45, 600–609 (2019).
Article Google Scholar
Oliver, D. et al. Real-world digital implementation of the Psychosis Polyrisk Score (PPS): a pilot feasibility study. Schizophr. Res. https://doi.org/10.1016/j.schres.2020.04.015. (2020).
Fusar-Poli, P. et al. Clinical-learning versus machine-learning for transdiagnostic prediction of psychosis onset in individuals at-risk. Transl. Psychiatry 9, 1–11 (2019).
Fusar-Poli, P. et al. Transdiagnostic individualized clinically based risk calculator for the detection of individuals at risk and the prediction of psychosis: model refinement including nonlinear effects of age. Front. Psychiatry 10, 313 (2019).
Article Google Scholar
Fusar-Poli, P. et al. Diagnostic stability of ICD/DSM first episode psychosis diagnoses: meta-analysis. Schizophr. Bull. 42, 1395–1406 (2016).
Article Google Scholar
Webb, J. R. et al. Specificity of incident diagnostic outcomes in patients at clinical high risk for psychosis. Schizophr. Bull. 41, 1066–1075 (2015).
Article Google Scholar
Davis, K. A. S., Sudlow, C. L. M. & Hotopf, M. Can mental health diagnoses in administrative data be used for research? A systematic review of the accuracy of routinely collected diagnoses. BMC Psychiatry 16, 263 (2016).
Article Google Scholar
Nelson, B. et al. Long-term follow-up of a group at ultra high risk (“prodromal”) for psychosis: the PACE 400 study. JAMA Psychiatry 70, 793–802 (2013).
Article Google Scholar
Ioannidis, J. P. A. Scientific inbreeding and same-team replication: type D personality as an example. J. Psychosom. Res. 73, 408–410 (2012).
Article Google Scholar
Fusar-Poli, P. et al. Deconstructing pretest risk enrichment to optimize prediction of psychosis in individuals at clinical high risk. JAMA Psychiatry 73, 1260–1267 (2016).
Article Google Scholar
Koutsouleris, N. et al. Multisite prediction of 4-week and 52-week treatment outcomes in patients with first-episode psychosis: a machine learning approach. Lancet Psychiatry 3, 935–946 (2016).
Article Google Scholar
Leighton, S. P. et al. Predicting one-year outcome in first episode psychosis using machine learning. PLoS ONE 14, e0212846 (2019).
Article CAS Google Scholar
Leighton, S. P. et al. Development and validation of multivariable prediction models of remission, recovery, and quality of life outcomes in people with first episode psychosis: a machine learning approach. Lancet Digital Health 1, e261–e270. (2019).
Article Google Scholar
Irving, J., Patel, R., Oliver, D., Colling, C., Pritchard, M. & Broadbent, M. et al. Using natural language processing on electronic health records to enhance detection and prediction of psychosis risk. Schizophr Bull (2020).

Download references

Acknowledgements

D.O. is supported by the UK Medical Research Council (MR/N013700/1) and King’s College London member of the MRC Doctoral Training Partnership in Biomedical Sciences. D.S. was part funded by the National Institute for Health Research (NIHR) Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King’s College London. The views expressed are those of the authors and not necessarily those of the NHS, the NIHR, or the Department of Health. P.F.-P. is supported by a research grant from H. Lundbeck A/S. These funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information

These authors contributed equally: Dominic Oliver, Chiew Meng Johnny Wong

Authors and Affiliations

Early Psychosis: Interventions and Clinical detection (EPIC) Lab, Department of Psychosis Studies, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, SE5 8AF, UK
Dominic Oliver, Jessica Irving & Paolo Fusar-Poli
Lundbeck Singapore Pte, Ltd., Singapore, 307591, Singapore
Chiew Meng Johnny Wong
H. Lundbeck A/S, Valby, Denmark
Martin Bøg, Linus Jönsson, Allan Wehnert, Kristian Tore Jørgensen & Lars Lau Raket
Karolinska Institutet, Stockholm, Sweden
Linus Jönsson
Lundbeck Pharmaceuticals LLC, Deerfield, IL, 60015, USA
Bruce J. Kinon
Department of Biostatistics, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, SE5 8AF, UK
Daniel Stahl
OASIS Service, South London and the Maudsley NHS National Health Service Foundation Trust, London, SE5 8AZ, UK
Philip McGuire & Paolo Fusar-Poli
Department of Psychosis Studies, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, SE5 8AF, UK
Philip McGuire
Clinical Memory Research Unit, Lund University, Lund, Sweden
Lars Lau Raket
Department of Brain and Behavioural Sciences, University of Pavia, 27100, Pavia, Italy
Paolo Fusar-Poli

Authors

Dominic Oliver
View author publications
You can also search for this author in PubMed Google Scholar
Chiew Meng Johnny Wong
View author publications
You can also search for this author in PubMed Google Scholar
Martin Bøg
View author publications
You can also search for this author in PubMed Google Scholar
Linus Jönsson
View author publications
You can also search for this author in PubMed Google Scholar
Bruce J. Kinon
View author publications
You can also search for this author in PubMed Google Scholar
Allan Wehnert
View author publications
You can also search for this author in PubMed Google Scholar
Kristian Tore Jørgensen
View author publications
You can also search for this author in PubMed Google Scholar
Jessica Irving
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Stahl
View author publications
You can also search for this author in PubMed Google Scholar
Philip McGuire
View author publications
You can also search for this author in PubMed Google Scholar
Lars Lau Raket
View author publications
You can also search for this author in PubMed Google Scholar
Paolo Fusar-Poli
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.F.-P. developed the original model, validated it, and conceived this study. D.O., M.B., L.J., K.T.J., and P.F.-P. developed the protocol. D.O. and C.M.J.W. wrote all analysis scripts and led the analyses. D.O. drafted the first version of this manuscript. M.B., L.J., B.J.K., A.W., K.T.J., J.I., D.S., and L.L.R. advised on data organization, cleaning and statistical analysis. D.O., P.F.-P., and P.M. interpreted the results of the analyses. All authors approved the final manuscript.

Corresponding author

Correspondence to Paolo Fusar-Poli.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Oliver, D., Wong, C.M.J., Bøg, M. et al. Transdiagnostic individualized clinically-based risk calculator for the automatic detection of individuals at-risk and the prediction of psychosis: external replication in 2,430,333 US patients. Transl Psychiatry 10, 364 (2020). https://doi.org/10.1038/s41398-020-01032-9

Download citation

Received: 15 June 2020
Revised: 03 September 2020
Accepted: 04 September 2020
Published: 29 October 2020
DOI: https://doi.org/10.1038/s41398-020-01032-9
Springer Nature Limited

Transdiagnostic individualized clinically-based risk calculator for the automatic detection of individuals at-risk and the prediction of psychosis: external replication in 2,430,333 US patients

Abstract

Similar content being viewed by others