Introduction

Psychotic experiences (PE) are symptoms deemed comparable to the positive symptoms of psychosis, but not sufficiently severe to warrant a diagnosis upon clinical presentation. The predominant viewpoint is that there is a phenotypic continuum of PE whereby psychosis and bipolar disorder are the most extreme manifestations along the spectrum1. PE are prevalent (5.8–7%), signify a susceptibility to a broad spectrum of adverse mental health outcomes and are thought to represent a transdiagnostic indicator of psychopathological severity2,3,4. As such, there is increasing recognition that PE merit their own platform for research to further understand their emergence, distribution and possible prognostic relevance (outside of their risk for the development of psychosis).

Several studies have shown that, consistent with diagnosed psychosis and schizophrenia, PE have a multidimensional structure5,6. The number and structure of the dimensions varies across studies (subject to analysis and measures)6. Genetic heritability has been shown to range from 33 to 57% depending on the dimension of PE. Allelic variants also differ in the type of PE they confer the greatest risk for 6,7. Dimensions also vary in their severity, associated distress and risk for the development of further mental health difficulties6.

The current study

In summary, the literature tells us that not all types of PE are created equal. However, it remains poorly understood to date how assorted environmental risks (e.g. cannabis, brain injury or trauma) confer risk for different dimensions of PE. While several studies acknowledge individual environmental risk factors for the development of PE, rarely do they attempt to accumulate them to explore the environmental risk profile for a specific dimension of PE8,9. To help address this, we designed the Oxford Wellbeing Life and Sleep Survey (OWLS) survey to examine the prevalence of established environmental risk factors for psychosis and their efficacy in predicting the occurrence of six PE dimensions (negative symptoms, perceptual abnormalities, bizarre ideas (BI), delusional mood (DM), persecutory ideation, and cognitive disorganisation (CD)). To ensure only reliable risk factors were included, we conducted a systematic review of the meta-analyses and systematic reviews of these risk factors (Supplementary Table 1). In addition to the factors revealed by the literature review, sleep and circadian rhythm disruption has been identified as a “nonspecific” but important risk factor for psychosis, meriting its inclusion10,11. Thus, the goals for the present study are (1) to explore the prevalence for established risk factors for psychosis, (2) to see how these risk factors distribute across the number of PE a respondent endorses and (3) to examine how environmental risk factors distribute in their ability to predict different dimensions of PE12.

Methods

Survey sampling

The survey was targeted at the general population (aged 18–65 years) in the Oxford area. As such, advertisements were placed in a diverse number of venues and locations in Oxford, as well as online. However, as this study took place in a university city the sample was inevitably biased towards a younger cohort. Given psychosis with late-life onset has been shown to have a different risk profile to psychosis developed earlier in life and this study is designed to explore the risk factors for emerging PE (which are often during adolescence to early adulthood), a skewed age distribution was not considered a limitation in this study in the same way it may be in other survey-based studies12. The survey was hosted online on the Oxford University network. The survey protocol and contents were approved by the Medical Sciences Interdivisional Research Ethics Committee (MSD-IDREC-C1–2014–054), and all participants gave informed consent online when agreeing to complete the survey.

Environmental risk factor selection

The selection of risk factors presented in the survey was premised upon a systematic ‘meta’ review of the literature, i.e. a review of the available meta-analyses and systematic reviews that globally account for the published data on a specified risk factor for the development of psychosis. Full details of the methodology employed and the risk factors identified in this process can be found in Supplementary Materials 1. Parental communication was the only risk factor highlighted by this review that could not be included, as it is evaluated by means of video recordings and there was no feasible way to replicate this accurately within the survey.

Survey structure and instruments

Sociodemographic characteristics

Data were gathered on gender, age, ethnicity, education level, psychiatric disorder diagnoses, and help-seeking behaviour for any psychiatric disorder listed. An overview of each of the risk factors, sociodemographic characteristics and questionnaires evaluated in the OWLS survey is presented in Supplementary Tables 2 and 3.

Psychotic-like experiences

Prodromal Questionnaire 16 Item Version (PQ16; α = 0.79 for this sample)13. The PQ16 contains 16 items with yes/no responses, yielding a score out of 16. Scoring 6 or above warrants further screening for an at-risk mental state. The questionnaire assesses positive symptoms (perceptual abnormalities, BI, DM, paranoia and CD) and negative symptoms (social anxiety and avolition). An overview of the items and their respective dimensions can be found in Supplementary Table 4. The PQ16 was selected as it does not have ‘hypothetical qualifiers’ or describe beliefs of cultural subgroups (e.g. voodoo) which have been highlighted to produce misleading results in the estimated prevalence of PE12,13.

Axis I symptomatology

Depression Anxiety and Stress Scale (21 item version; α = 0.93 (overall); α = 0.93 (D); α = 0.80 (A); α = 0.87 (S)). This scale was selected as it takes a dimensional view of depression, anxiety and stress. It can be subdivided into three categories (of seven items each). The 21 items are each scored on a 4-point scale from 0 to 314.

Traumatic events

Life-Threatening Experiences Scale (LTE)15. The LTE is a 12-item self-report questionnaire assessing different categories of traumatic life events, selected for their established long-term consequences15. The total score is the number of items endorsed (maximum = 12). Participants are asked to rate their level of stress associated with each event endorsed, choosing one of four options (not stressful, slightly stressful, moderately stressful, and very stressful).

Sleep quality

Pittsburgh Sleep Quality Index (PSQI; α = 0.82)16. The PSQI measures subjective sleep quality over the previous month, yielding a score ranging from 0 to 21. Higher scores represent poorer quality sleep17.

Insomnia

Short Form Sleep Condition Indicator (SCI; α = 0.82)18. The SCI consists of two items: (1) ‘thinking about a typical night in the last month, how many nights a week do you have a problem with your sleep?’; and (2) ‘thinking about the past month, to what extent has poor sleep troubled you in general?’. Possible responses to the first question are ‘0–1’, ‘2’, ‘3’, ‘4’ and ‘5–7’; and to the second question are ‘not at all’, ‘a little’, ‘somewhat’, ‘much’ and ‘very much’. Both answer sets are scored 4, 3, 2, 1 and 0, respectively. The two scores are added to give the SCI score. Lower scores indicate more aggressive insomnia complaints18.

Circadian phase

The Munich Chronotype Questionnaire (MCTQ)19. The MCTQ assesses habitual sleep-wake timing from bedtime to wake-up time and is based on subjects’ judgement of their sleep habits over the last 2 weeks. These timings are assessed separately for work and free days. The responses produce a time-based variable, the mid-sleep point on free days (MSF), which is then corrected for accumulated sleep deficits (MSFsc) during the work-week. The corrected variable, MSFsc, provides a correlate of circadian phase in local clock time with a later mid-sleep point indicating a later circadian phase (synonymous with late or ‘evening chronotype’).

Other environmental risk factors

Based upon the results of the systematic review, the following risk factors were included: infections of the brain; brain injury; cannabis use; childhood abuse (questions taken from Cuijpers et al. 2011)20; childhood bullying; childhood social withdrawal (social withdrawal subscale items 42, 65, 88 and 111 from the Child Behavioural Checklist edited to make appropriate for retrospective report); family history of psychiatric disorders; migrant status; help-seeking behaviours in relation to the PE; latitude position at birth; diagnosis of epilepsy; obstetric complications at birth; paternal age; season of birth; presence of 22q11.2 deletion syndrome diagnosis; and urbanicity (Supplementary Tables 2 and 3)21,22.

Statistical analyses

Before the analyses, all survey data were subjected to quality control. A detailed breakdown of the cleaning process is provided in Supplementary Materials 2. All subsequent statistical analyses were performed within the R statistical environment (Version 3.4.1). Cross-sectional differences were explored across four predefined PE risk levels according to the number of PE endorsed in the sample: (1) a PQ16 score of 0, indicating a very low risk; (2) a PQI6 score between PE 1 and 5, indicating minimal risk; (3) a PQ16 score above 5 (the established cut-off for the PQ16), indicating a moderate or ‘at-risk’ group; and (4) a PQ16 score above 5 and having sought help and with distress associated with these symptoms, indicating the highest risk group. These groups were designed to examine the continuity of risk for PE across the respondents.

To examine specific risk factors for the six dimensions of PE, we built multivariate logistic regression models, using the set of risk factors, demographics, sleep variables, and psychopathology measures as predictor variables. The ‘negative symptoms’ PQ16 dimension was modelled first as a proof of concept, as many of the predictor variables relate to the presence of a mood disorder or depressive/anxious symptomatology. As this model indeed had high predictive power, we extended this approach to the other dimensions.

Given the large number of possible combinations of the predictor variables for inclusion in each model, we performed automated model selection using the Akaike information criterion (AIC) to objectively provide a set of candidate models for further consideration. The AIC measures the relative quality of a collection of models and penalises model complexity, thus discouraging overfitting.

For model quality assessment, and to further discourage overfitting, we built models using 70% of the responses. These ‘training’ data were randomly selected. Upon finalisation, the models were then fit to the remaining 30% (the ‘test’ data), to provide an assessment of their generalisability based on the quality of model fit to data the models had not seen. To measure model accuracy, we calculated classification success rates with both training and test data (i.e. the models’ ability to predict the known response data), and computed receiver operating characteristic (ROC) curves. A more detailed overview of the modelling process and the model quality evaluation can be found in Supplementary Materials 3.

Results

Demographics and distribution of risk factors partitioned by the number of PE endorsed

The sample (n = 1789, 487 men) had a mean age of 27.6 years (range: 16–65). Just under half (n = 789, 44.1%) had a low to medium level of education (completed secondary school at most), while over half the sample (n = 1000; 55.9%) possessed an undergraduate degree or a postgraduate higher qualification (MA or PhD). Many were studying towards a higher level of education (n = 1234; 69.0%), indicating this to be a highly educated sample. A demographic overview partitioned by number of PE endorsed is provided in Table 1.

Table 1 Demographic overview of survey respondents according to the number of psychotic symptoms endorsed (n = 1789).

At least one PE was endorsed by 1417 respondents (79.2% of sample). Of these, 364 (20.3%) endorsed a PQ16 above 5. Overall, 144 (8.1%) respondents endorsed a PQ16 above 5 with associated distress and help-seeking behaviour specific to the PE (Fig. 1; top). The most common experience endorsed was avolition (akin to depression; 36.8%), which was closely followed by social anxiety (33.2%), absorption (33.0%), thought insertion (29.8%) and thought broadcasting (27.6%; Fig. 1; bottom).

Fig. 1: Histogram highlighting the prevalence and types of PE endorsed in this sample.
figure 1

The histogram in blue highlights that at least one PE was endorsed by the majority of the sample (79.2%). The histogram has a sharp downward curve as the number of PEs endorsed rises with 20.3% endorsing 5 or more PEs. The histogram in red highlights which types of PE saw the highest rates of endorsement. The most endorsed PE was Avolition (akin to depression; 36.8%), which was closely followed by social anxiety (33.2%), absorption (33.0%), thought insertion (29.8%) and thought broadcasting (27.6%). This supports the notion of a continuum of PE seen in the normal population.

An increasing prevalence across the four predefined PE risk groups was observed for all risk factors for which a sufficient incidence rate was present (Fig. 2; Tables 1, 2). Among these 22 variables, chi-square tests revealed that all risk factors were unequally distributed between the risk groups including adverse childhood experiences, cannabis use, the need for care, and diagnoses of non-psychotic mental health disorders. Demographic factors, however, were not significantly differently distributed across the risk groups (Supplementary Table 5). Altogether, these data imply that OWLS survey respondents replicate observations based on the psychosis-proneness continuum concept.

Fig. 2: Bar charts highlighting the prevalence of risk factors with respect to the number of PE endorsed.
figure 2

An increasing prevalence across the four predefined PE risk groups was observed for all risk factors for which a sufficient incidence rate was present. Among these 22 variables, chi-square tests revealed that all risk factors were unequally distributed between the risk groups. Demographic factors (ex: ethnicity), however, were not significantly differently distributed across the risk groups.

Table 2 Distribution of risk factors across different risk levels (n = 1789).

Examining model performance for the PQ16 dimensions

Multivariate logistic regression models were built for the six PE dimensions (Supplementary Materials 3). Correct classification rates ranged between 66 and 79% across the six models for both the training and test data (Supplementary Table 6). ROC curves (and their AUC values) showed that each model reliably discriminated between participants who endorse and those who do not endorse at least one item for that PE dimension (Fig. 3, Supplementary Table 6).

Fig. 3: Receiver operating characteristic (ROC) curves for each of the six logistic regression models for survey respondents in the training dataset.
figure 3

For each plot, the points show how the true positive rate (sensitivity) and the false positive rate (1-specificity) vary as the threshold for classification into the two groups is varied.

The relationship between risk factors and PE dimensions

The risk factors included in each of the selected PE dimension models are detailed in Table 3. The odds ratios associated with each risk factor in the models are represented in Fig. 4.

Table 3 Model summaries for negative symptoms, perceptual abnormalities, persecutory ideation, bizarre ideas, delusional mood and cognitive disorganisation.
Fig. 4: Odds ratios and corresponding confidence intervals for the modelled risk factors for the different dimensions of PE.
figure 4

For each model, the risk factors that significantly increase or decrease the probability of endorsing psychotic symptoms are displayed with green confidence intervals. Risk factors that do not significantly impact this probability (but are included in the model of best fit) are displayed with red confidence intervals. If the risk factor is not included in the selected model, no odds ratio is presented. The top six windows display the binary risk factors (e.g. using cannabis), which possess much wider CIs than the continuous risk factors (e.g. depression scores) displayed in the bottom six windows.

For negative symptoms (Table 3), depression, anxiety, social withdrawal and sleep quality were found to be significant predictor variables. The odds ratios imply a one-point increase in depression, anxiety, social withdrawal or poorer sleep quality was associated with a respective 13%, 6%, 21% and 10% increase in the probability of endorsing negative symptoms, assuming all other variables are kept constant (Fig. 4).

Perceptual abnormalities were best predicted by anxiety, traumatic events and social withdrawal. A one-point increase in anxiety, number of traumatic events endorsed or social withdrawal was associated with a respective 8%, 13% and 7% increases in the probability of endorsing perceptual abnormalities.

Persecutory ideation was predicted by depression, anxiety and social withdrawal, with one-point increases associated with respective 4%, 7% and 11% increases in the probability of endorsing persecutory ideation. Furthermore, psychological abuse, having used cannabis and bullying were all found to significantly increase the probability of endorsing persecutory ideation by 45%, 46% and 44%, respectively, assuming all other variables are kept constant. Participants identifying as female had a reduced risk (OR = 0.60, 0.41–0.88, p = 0.0094), being 40% less likely to endorse persecutory ideation than men. Comorbidities and distress from traumatic events did not significantly impact the probability of endorsing persecutory ideation (p = 0.06 and 0.08, respectively).

Like negative symptoms and persecutory ideation, BI were associated with by depression, anxiety, traumatic events and social withdrawal, with 5, 8, 13 and 16% increases in the probability of endorsing BI for a one-point increment in each of the respective predictors. Furthermore, physical abuse during childhood and cannabis use were found to be significant predictors, with a respective 52% and 43% increased probability of endorsing BI for those who have used cannabis or experienced physical abuse. Small effects of insomnia (SCI) and emotional neglect were also observed.

DM was predicted by anxiety, social withdrawal and traumatic events, with one-unit increases in the predictor variables leading to 4%, 10% and 19% respective increases in the probability of endorsing DM. Other significant predictors for DM were psychological abuse and circadian phase (MSFsc).

Finally, CD was predicted by anxiety, social withdrawal and traumatic events. Having used cannabis also predicted CD, with cannabis users having an estimated 32% increased probability of endorsing CD. Furthermore, CD was also predicted by sleep quality and age, with a one-point decrease in sleep quality predicting a 10% increased probability of endorsing CD, and with a 1-year age increase predicted to give a 3% decrease in the probability of endorsing CD.

The overall contribution and overlap of factors predicting each PE dimension is presented in Fig. 5. Ethnicity, season of birth, paternal age, urbanicity, first-degree relatives with a family history of mental illness, presence of any psychiatric comorbidity or the treatment for comorbid mental health problems did not significantly predict an increased (or decreased) risk of any of the PE dimensions (Figs. 4 and 5).

Fig. 5
figure 5

Six psychotic experience dimensions with predictive overlapping and distinct risk factors.

Discussion

This cross-sectional survey examined how risk factors for psychosis distribute across the number of PE a respondent endorses and whether heightened PE indicates heightened need for care. It then sought to further explore whether environmental risk factors predict different dimensions of PE.

Our results support the hypothesis that PE present as heterogeneous, subclinical features in the general population, with certain experiences more abundantly endorsed (i.e. avolition and social anxiety) than others (i.e. visual, auditory or olfactory sensory-perceptual abnormalities). The results also lend further support for the phenotypic continuum of PE with those reporting higher numbers of PE also reporting more mental health diagnoses, sleep disturbances and need for care (Table 1). Furthermore, childhood risks (e.g. sexual abuse and bullying), a family history, obstetric complications, cannabis use and exposure to trauma also appear to be more prevalent in those who endorse higher numbers of PE (Table 2; Supplementary Table 5), which again would support the notion of a continuum and indicate that endorsing more PE might indicate heightened psychopathological severity. As this is cross-sectional data, this must be interpreted with caution and causal inferences are not possible.

While many environmental risk factors have been described to increase the likelihood of developing psychotic and affective symptoms upon their exposure and over time23, to date there have been no reports of environmental risk factors differentially predicting assorted dimensions of PE in adults. We sought to bring clarity to this by examining probability models and their reliability in correctly identifying the occurrence of dimensionally assessed PE. Figure 5 illustrates the distribution of risk factors across the six dimensions of PE.

Overall, we found that anxiety and social withdrawal during childhood are shared across all six dimensions. Traumatic life events predicted the occurrence of four dimensions (BI, DM, CD and perceptual abnormalities). These results are perhaps unsurprising given all three risk factors have long been hailed as important triggers for psychosis12. Furthermore, both childhood trauma and stressful life events appear to contribute to the persistence of PE over time24.

Depressive symptoms and cannabis use each predicted three dimensions of PE. Importantly, cannabis was not found to predict perceptual abnormalities or DM, commonly regarded as the most discernible symptoms of psychosis. One explanation for this may be the low numbers of perceptual abnormalities endorsed, or perhaps that cannabis presents as a risk factor for only certain dimensions of PE.

Risk factors that uniquely predicted the occurrence of a single PE dimension were childhood physical abuse for bizarre ideation, bullying for persecutory ideation and circadian phase for DM. Protective risk factors were age (decreasing the risk of CD) and gender (with women at a lower risk of persecutory ideation). Overall, childhood abuse (psychological and physical) and cannabis presented the highest severity of risk.

Sleep phenotypes distributed differently across different dimensions. Poor sleep quality increased the probability of endorsing negative symptoms and CD, yet late circadian phase predicted the occurrence of DM. That sleep quality and circadian phase are predictive of different PE dimensions is not unexpected given the heterogeneous sleep-wake phenotypes observed in patients diagnosed with schizophrenia and bipolar disorder25,26. This may relate to the variants of circadian clock genes which produce different physiological susceptibilities and phenotypic effects, independent from sleep regulation. Given this survey is cross-sectional and only contains subjective report, considerably more research is warranted to substantiate this claim.

CD in our survey is similar to the core characteristics of dissociative experiences, which appear to show strong ties to sleep27. Given the suggested mediating role of dissociative experiences on auditory hallucinations28, they present an interesting avenue to investigate the role of sleep in the development of PE29.

The analyses outlined here present broader implications for the differences in outcomes found between cross-sectional observations and regression analyses. On a cross-sectional basis, differences in group prevalence (number of psychotic symptoms and help seeking) are seen for most risk factors examined here (e.g. family history, obstetric complications, child abuse, etc.), except for urbanicity, ethnicity, season of birth, migrant status and paternal age. However, only about half of the risk factors translated to the multivariate regression models. Psychiatric diagnoses, which presented stark differences cross-sectionally and family history, a widely accepted genetic risk, did not significantly predict the occurrence of any dimension of PE. Similar differences have been reported before. For example, Cannon et al. reported that despite being established risk factors for psychosis, stressful life events, traumas, age and family history, did not predict actual transition to a psychotic episode using their individualised risk calculator30. However, family history of psychosis had a very pronounced effect on the onset of schizophrenia and non-affective psychosis in another study, while urbanicity had no effect12. Similarly, the impact of a winter/spring birth on the prevalence of schizophrenia has been highlighted in a meta-analysis31, yet negative results have been reported for impact of a winter/spring birth on the detection of PE32.

Given the heterogeneity across studies in what predicts risk, the results presented here suggest that variation may result (in part) from a difference in high/low affinity between risk factors and dimensions of PE. More refined approaches could certainly help inform why psychosis-spectrum disorders are so heterogeneous in their clinical presentation, and why individuals develop PE subsequently to mental health disorders (and vice versa)3. More broadly, dimensional research (including the study described here) is integral for the implementation of personalised medicine within the treatment of mental health disorders. Schematics (such as Fig. 5) could provide helpful roadmaps to clinicians as to what risks most often co-occur with which symptoms or PE. In turn, more targeted interventions to the symptomatic profile of the patient could be offered.

Limitations

While this study has many strengths, i.e. comprehensive risk factor search, uniform data collection and high-resolution questionnaires, it has also a number of important limitations. Students are overrepresented in this sample and we did not examine the relationship status of respondents, both of which have been shown to be associated with endorsement of PE and limit generalisability2. The negative symptoms dimension (which is comprised of two items examining anhedonia and social anxiety) is primarily predicted by depression and anxiety symptoms (alongside sleep quality and social withdrawal during childhood). This model was first run as a proof of concept to explore the viability of dimensional analyses but is limited in that the predictors are correlates of the outcome measure. Geographical dispersion and ethnical dispersal is low for this survey, both have also been shown to impact the prevalence of PEs2,12. Risk factors, such as alcohol and psychoactive drugs, as well as cognitive abilities and genetic risk factors (other than family history), all known to increase the risk of psychosis and PEs could not be examined in survey format30. We relied on self-report that has been shown to overestimate the occurrence of PEs, thereby not permitting cross-examination of reported experiences being indeed psychotic33. The questionnaires addressed whether a certain experience occurred during the last year but not the number of times it was experienced, which can vary to a considerable degree33. Finally, the rates of PE presented here are above what has been previously reported using other instruments (7–12%) but are in line with previous reports of the same PE questionnaire in similar populations23,34. This could reflect a sampling bias, as the survey was advertised as relating to wellbeing or it could be reflective of the wording of the PQ16, which encourages a more liberal response style comparative to other measures, or indeed, could reflect both23. As with any cross-sectional multidimensional dataset, causality cannot be inferred from the predictive direction of effects in the models described here. Furthermore, it is possible that effects of the environment may involve genes or gene-environment interactions or correlations (which again cannot be captured within the results presented here).

Conclusions and future directions

The OWLS survey is the first of its kind: a survey designed to tackle the question of whether empirically robust environmental risk factors can predict the occurrence of dimensionally assessed PE, or whether the majority of these risks are only observable using cross-sectional comparisons. Previous studies have used larger samples but lower resolution measures, or have reanalysed national survey data, but the OWLS survey is the first to specifically target risk factors for psychosis and examine in detail what their relationship holds to PE dimensions. It is also the first survey to examine the independent roles sleep and circadian phase may play in the dimensionally assessed psychotic symptoms using high-resolution measures and modelling analyses, which deserves greater attention. Further research aimed at replicating the specificity of risk factors to certain dimensions of PE may be of real benefit to understanding the heterogeneity of presentations observed in clinical practice. Future work should consider longitudinal follow-up surveys to understand the role of risk factors in predicting outcomes, including transition to psychosis, other mental health diagnoses, number of PEs, and need for care.