Evidence from comprehensive independent validation studies for smooth pursuit dysfunction as a sensorimotor biomarker for psychosis

Meyhoefer, Inga; Sprenger, Andreas; Derad, David; Grotegerd, Dominik; Leenings, Ramona; Leehr, Elisabeth J.; Breuer, Fabian; Surmann, Marian; Rolfes, Karen; Arolt, Volker; Romer, Georg; Lappe, Markus; Rehder, Johanna; Koutsouleris, Nikolaos; Borgwardt, Stefan; Schultze-Lutter, Frauke; Meisenzahl, Eva; Kircher, Tilo T. J.; Keedy, Sarah S.; Bishop, Jeffrey R.; Ivleva, Elena I.; McDowell, Jennifer E.; Reilly, James L.; Hill, Scot Kristian; Pearlson, Godfrey D.; Tamminga, Carol A.; Keshavan, Matcheri S.; Gershon, Elliot S.; Clementz, Brett A.; Sweeney, John A.; Hahn, Tim; Dannlowski, Udo; Lencer, Rebekka

doi:10.1038/s41598-024-64487-6

Article
Open access
Published: 15 June 2024

Volume 14, article number 13859, (2024)

Scientific Reports

Inga Meyhoefer^1,2,3,
Andreas Sprenger⁴,
David Derad⁴,
Dominik Grotegerd¹,
Ramona Leenings¹,
Elisabeth J. Leehr¹,
Fabian Breuer¹,
Marian Surmann¹,
Karen Rolfes¹,
Volker Arolt^1,2,
Georg Romer⁵,
Markus Lappe^2,6,
Johanna Rehder⁶,
Nikolaos Koutsouleris^7,8,9,
Stefan Borgwardt^10,11,
Frauke Schultze-Lutter^3,12,13,
Eva Meisenzahl³,
Tilo T. J. Kircher¹⁴,
Sarah S. Keedy¹⁵,
Jeffrey R. Bishop¹⁶,
Elena I. Ivleva¹⁷,
Jennifer E. McDowell¹⁸,
James L. Reilly¹⁹,
Scot Kristian Hill²⁰,
Godfrey D. Pearlson²¹,
Carol A. Tamminga¹⁷,
Matcheri S. Keshavan²²,
Elliot S. Gershon¹⁵,
Brett A. Clementz¹⁸,
John A. Sweeney^23,24,
Tim Hahn¹,
Udo Dannlowski¹ &
…
Rebekka Lencer^1,2,10

Abstract

Smooth pursuit eye movements are considered a well-established and quantifiable biomarker of sensorimotor function in psychosis research. Identifying psychotic syndromes on an individual level based on neurobiological markers is limited by heterogeneity and requires comprehensive external validation to avoid overestimation of prediction models. Here, we studied quantifiable sensorimotor measures derived from smooth pursuit eye movements in a large sample of psychosis probands (N = 674) and healthy controls (N = 305) using multivariate pattern analysis. Balanced accuracies of 64% for the prediction of psychosis status are in line with recent results from other large heterogenous psychiatric samples. They are confirmed by external validation in independent large samples including probands with (1) psychosis (N = 727) versus healthy controls (N = 292), (2) psychotic (N = 49) and non-psychotic bipolar disorder (N = 36), and (3) non-psychotic affective disorders (N = 119) and psychosis (N = 51) yielding accuracies of 65%, 66% and 58%, respectively, albeit slightly different psychosis syndromes. Our findings make a significant contribution to the identification of biologically defined profiles of heterogeneous psychosis syndromes on an individual level underlining the impact of sensorimotor dysfunction in psychosis.

Introduction

Given the generally weak associations between clinically defined psychiatric diagnoses and specific neurobiological alterations of the central nervous system, the development and validation of biomarkers has been a major goal in psychiatric research for decades¹. Many studies have combined a large number of variables and/or multiple biomarkers using multivariate pattern recognition approaches^{2,3,4,5,6,7,8,9}. There is growing interest in parameters affecting the stability of these results, including internal and external validation procedures as well as the sample sizes of training and validation samples, as a small validation sample size must be regarded as a major risk of misestimation^10,11,12,13. This finding might explain why larger data sets tend to display weaker (presumably closer to “true”) accuracies (e.g. in the classification of depressive patients vs. healthy controls, 60–65% accuracy based on structural MRI data in N = 2240 participants¹¹ or 54–56% accuracy based on different neuroimaging modalities in N = 1809 participants¹⁴) than many previous findings in small samples (e.g. Refs.^15,16).

One well-established and quantifiable biomarker in psychosis research is smooth pursuit eye movements (SPEM). SPEM testing involves having individuals visually track a small moving object, relying on continuous sensorimotor processing of perceptual motion signals into dynamic adjustments of motor actions¹⁷. Thus, specific SPEM parameters reflect the ability of the brain to continuously receive visual motion information and simultaneously generate, monitor and adjust motor output accordingly to provide a clear visual percept of a moving object of interest. Since as early as 1908, numerous studies have emphasized SPEM dysfunctions as a biomarker for schizophrenia and other psychotic disorders, indicating specific impairments of visual sensorimotor processing not only in stable but also in early states of the disorder^{18,19,20,21,22,23,24,25,26}.

The assessment of SPEM was recently included in studies initiated by the Bipolar-Schizophrenia Network on Intermediate Phenotypes (B-SNIP) consortium aiming to develop a biologically valid framework (e.g. biologically defined phenotypes) for psychotic disorders (i.e. stable probands with schizophrenia, schizoaffective disorder, or psychotic bipolar-I disorder)^{9,27,28,29,30}. With regard to psychosis symptoms in the B-SNIP1 sample, Reininghaus and colleagues³¹ reported evidence of a transdiagnostic dimension underlying affective and non-affective psychotic symptoms. In line with this, results from the first recruitment period of the B-SNIP study (B-SNIP1, N = 674) indicate SPEM deterioration not only in schizophrenia but also in probands with schizoaffective and bipolar disorder with psychotic symptoms²⁰. These findings, consistent with smaller sample studies²⁵, imply that SPEM deficits can be regarded as a transdiagnostic biomarker for psychosis.

To determine the specificity of the relationship between psychotic symptoms and SPEM performance, it is essential to study probands with disorders that lack psychotic symptoms, such as non-psychotic affective, substance, attention-deficit/hyperactivity, and obsessive–compulsive disorders. Such studies revealed either intact SPEM performance or only minimal SPEM deficits^{32,33,34,35,36}. As sample sizes were rather small for most of these studies, however, conclusions remain unclear. Underlining its usefulness as a biomarker, subtle SPEM deficits were found not only in chronically ill but also in first-episode patients^23,37,38 and in unaffected first-degree relatives of schizophrenia patients^20,39. Such subtle SPEM deficits are reflected by specific impairments of certain SPEM measures, e.g. during SPEM initiation, while other SPEM measures, e.g. sustained eye velocity, appear unimpaired, indicating that certain compensation mechanisms within the oculomotor systems, e.g. derived from prediction, are in play^20,40. Thus, we would expect that similar SPEM disturbances would also be present in a clinical high-risk state for psychosis⁴¹, which has not been investigated so far.

To provide comprehensive internal and external validation in the present study, we developed a machine-learning based model that was trained on a set of traditional measures characterizing specific SPEM subfunctions, i.e. mean eye velocity, initial eye acceleration and initiation latency²⁰, in the large sample of the B-SNIP1 study. We then applied several external validation steps to determine stability and specificity in an independent sample of psychosis probands (external validation-1: B-SNIP2 sample), in bipolar probands with and without psychosis symptoms (external validation-2: Psychosis and Affective Research Domains and Intermediate Phenotypes (PARDIP) sample), in probands with predominately affective disorders as well as psychosis probands (external validation-3: DFG-Forschergruppe 2107 (FOR2107) sample) and, following an exploratory approach, in clinical high risk as well as recent-onset psychosis and depression states (external validation-4: Personalised Prognostic Tools for Early Psychosis Management (PRONIA) sample), see Fig. 1. Our aim was to develop an algorithm based on SPEM characteristics which allows evaluation of psychosis-related sensorimotor transformation function on an individual level. Ideally, such SPEM characteristics can be assessed in a short 5-min test ensuring practical utility.

Results

Demographics, clinical characteristics, and SPEM descriptive information for proband groups by study can be found in Tables 1, 2. With regard to the B-SNIP1 sample, there were no significant differences for age (T(977) = 1.20, p = 0.23) and sex (χ²(1) = 1.16, p = 0.28) between psychosis probands and healthy controls. However, healthy controls yielded higher cognition scores than psychosis probands (T(953) = 5.93, p < 0.001). Correlations between SPEM performance and possible confounds, i.e. cognition scores or chlorpromazine equivalents, were negligible, see Supplementary Tables 8–10.

Table 1 Descriptive information and clinical characteristics for proband groups by study.

Table 2 Descriptive results of smooth pursuit eye movements for proband groups by study.

Machine training and internal validation: B-SNIP1

The model distinguished psychosis probands from healthy controls by SPEM variables with a mean balanced accuracy of 63.96% (p < 0.001, Table 3; for further result parameters, refer to Supplementary Table 4). On average, 53% of the psychosis probands and 75% of the control subjects were correctly classified (sensitivity = 52.97%, specificity = 74.96%, Table 3). Mean likelihood ratios⁴² were 2.18 for a positive test result and 0.63 for a negative test result.
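The relationship between these figures can be checked with a few lines of Python. This is a minimal sketch, not the authors' analysis code: the function names are illustrative, and the likelihood ratios are computed here from the mean sensitivity and specificity. The paper reports mean likelihood ratios, presumably averaged over validation folds, so the values need not match exactly (a ratio of means is generally not the mean of ratios).

```python
def balanced_accuracy(sensitivity, specificity):
    """Balanced accuracy is the unweighted mean of sensitivity and specificity."""
    return (sensitivity + specificity) / 2


def likelihood_ratios(sensitivity, specificity):
    """Standard positive and negative likelihood ratios of a binary test."""
    lr_positive = sensitivity / (1 - specificity)
    lr_negative = (1 - sensitivity) / specificity
    return lr_positive, lr_negative


# Sensitivity and specificity as reported in the text (as proportions).
reported = {
    "B-SNIP1": (0.5297, 0.7496),
    "B-SNIP2": (0.5601, 0.7405),
    "PARDIP": (0.6818, 0.6286),
    "FOR2107": (0.4314, 0.7361),
}

for sample, (sens, spec) in reported.items():
    bac = balanced_accuracy(sens, spec)
    lr_pos, lr_neg = likelihood_ratios(sens, spec)
    print(f"{sample}: BAC = {bac:.4f}, LR+ = {lr_pos:.2f}, LR- = {lr_neg:.2f}")
```

For B-SNIP1 the ratio computed from the mean sensitivity and specificity is slightly below the reported mean positive likelihood ratio of 2.18, which is consistent with the averaging caveat above.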

Table 3 Prediction accuracies for all samples and model results for the comparison of chronic psychosis probands vs. controls.

External validation-1: B-SNIP2

Validation in the B-SNIP2 sample included n = 666 psychosis probands and n = 289 healthy controls (n = 64 participants could not be entered into the machine due to at least one missing value). Emphasizing high validity, the B-SNIP1 derived model discriminated psychosis probands from healthy controls in the independent B-SNIP2 sample with a balanced accuracy of 65.03% (see Table 3 and Supplementary Table 4). About 56% of the psychosis probands and 74% of the control subjects were correctly classified (sensitivity = 56.01%, specificity = 74.05%, Table 3).

External validation-2: PARDIP

For the PARDIP sample, n = 44 bipolar probands with psychosis symptoms, n = 33 bipolar probands without psychosis symptoms and n = 70 healthy controls were included in the validation procedure (n = 9 participants were excluded due to at least one missing value). Our trained model could distinguish bipolar probands with psychosis symptoms from healthy controls with a balanced accuracy of 65.52% (Table 3 and Supplementary Table 4). About 68% of the bipolar probands with psychosis were correctly classified as psychosis probands and 63% of the control subjects were correctly classified as healthy controls (sensitivity = 68.18%, specificity = 62.86%, Table 3). Furthermore, about 61% of the bipolar probands without psychosis symptoms were classified as controls (which means that they are closer to the healthy non-psychotic than the psychosis category, Table 3).

External validation-3: FOR2107

To validate the machine in predominately affective psychopathology, data from n = 94 probands with major depression and n = 25 probands with bipolar disorder, both groups without psychotic symptoms, from the FOR2107 consortium were entered into the analyses. Using the B-SNIP1 machine, nearly 81% of the probands with major depression and 60% of the probands with bipolar disorder were classified as being closer to the healthy non-psychotic than the psychosis category, Table 3.

As proof of principle, we also validated the B-SNIP1 machine on n = 51 psychosis probands and n = 72 healthy controls from FOR2107 revealing a balanced accuracy of 58.37% (Table 3 and Supplementary Table 4). In detail, about 43% of the psychosis probands and 74% of the control subjects were correctly classified (sensitivity = 43.14%, specificity = 73.61%, Table 3).

External validation-4: PRONIA

Validation in high-risk and recent-onset psychotic or depressive disorder could be computed in n = 11 probands with recent-onset psychosis, n = 17 probands with recent-onset depression, n = 19 participants with clinical high risk of psychosis, and n = 16 controls (PRONIA study). Emphasizing the validity of the machine, about 94% of the controls were categorized as healthy. However, in contrast to previous results in chronically ill psychosis probands, only 18% of the recent-onset psychosis probands were classified as psychosis patients (Table 3). Interestingly, the machine labeled nearly 42% of the participants with clinical high risk of psychosis as psychosis probands (Table 3). Of the probands with recent-onset, non-psychotic depression, 76% were classified as healthy controls (Table 3).

Effects of sample size on model performance

Training models in reduced (randomly selected 50% of the B-SNIP1 sample) and larger (combined B-SNIP1 and B-SNIP2) samples showed that balanced accuracies (50% B-SNIP1 = 62.28%, B-SNIP1 = 63.96%, B-SNIP1 + B-SNIP2 = 65.87%), specificities (50% B-SNIP1 = 78.48%, B-SNIP1 = 74.96%, B-SNIP1 + B-SNIP2 = 80.94%) and sensitivities (50% B-SNIP1 = 46.08%, B-SNIP1 = 52.97%, B-SNIP1 + B-SNIP2 = 50.80%) were rather unaffected by sample size (Supplementary Tables 6, 7 and Supplementary Fig. 1).

Discussion

In the current study we examined a set of traditional SPEM measures (i.e. predictive eye velocity maintenance gain, early eye velocity maintenance gain, initial eye acceleration, and eye latency; Leigh & Zee¹⁷; Lencer et al.²⁰) and their interactions as quantifiable biological indicators of psychosis-related visual sensorimotor dysfunction in large samples of probands with psychotic disorders. This is an important approach since identified SPEM deteriorations point to specific deficits in the transformation of sensory motion signals into motor action being associated with alterations in occipito-parieto-frontal networks^24,43.

To overcome limitations of classical frequentist statistics, we implemented multivariate pattern analyses (e.g. supervised machine learning approaches)⁴⁴ using internal (i.e. a hold-out subsample consisting of participants that were not used for training) and external (i.e. an independent dataset) validation in sufficiently large data samples¹¹ to allow for clinically relevant single-subject statements pointing to sensorimotor transformation deficits. Most importantly, we not only trained and internally validated the machine-learning algorithm in a single sample but also applied and externally validated the machine in an independent large sample of psychosis probands and healthy controls (external validation-1: B-SNIP2), in a sample of bipolar probands with and without psychotic symptoms (external validation-2: PARDIP), in a sample of probands with affective disorders without psychotic symptoms and psychosis probands (external validation-3: FOR2107), and in a sample with recent-onset psychosis or depression and clinical high risk of psychosis (external validation-4: PRONIA). Our main finding shows high consistency for the identification of psychosis probands vs. healthy controls by these sensorimotor indicators throughout the four different samples (B-SNIP1: 63.96%, B-SNIP2: 65.03%, PARDIP: 65.52%, FOR2107: 58.37%). However, it is important to consider that our model performed notably better in accurately classifying controls as controls (specificities in the different samples ranged from 63 to 75%) than psychosis probands as psychosis probands (sensitivities ranged from 43 to 68%).
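The distinction between internal hold-out validation and external validation can be illustrated with a deliberately simple sketch. It does not reproduce the multivariate model used in the study: a one-feature threshold classifier is swapped in for illustration, all values are made-up toy numbers, and the names `fit_threshold` and `predict` are hypothetical. The point is only that the model is fitted once on training data and then applied unchanged to data it has never seen.

```python
from statistics import mean


def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity for binary labels (1 = psychosis)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    n_pos = sum(y_true)
    n_neg = len(y_true) - n_pos
    return 0.5 * (tp / n_pos + tn / n_neg)


def fit_threshold(gains, labels):
    """Toy 'model': midpoint between the class means of a single feature."""
    mean_pos = mean(g for g, y in zip(gains, labels) if y == 1)
    mean_neg = mean(g for g, y in zip(gains, labels) if y == 0)
    return (mean_pos + mean_neg) / 2


def predict(gains, threshold):
    """Assumption for this toy example: lower gain means predicted psychosis."""
    return [1 if g < threshold else 0 for g in gains]


# Toy data only; not taken from the study.
train_x, train_y = [0.70, 0.80, 0.85, 0.90, 0.95, 1.00], [1, 1, 1, 0, 0, 0]
holdout_x, holdout_y = [0.75, 0.92, 0.88, 0.97], [1, 1, 0, 0]
external_x, external_y = [0.60, 0.82, 0.91, 0.93, 0.99, 0.86], [1, 1, 1, 0, 0, 0]

# The threshold is fitted on the training data only and then frozen.
threshold = fit_threshold(train_x, train_y)

internal_bac = balanced_accuracy(holdout_y, predict(holdout_x, threshold))
external_bac = balanced_accuracy(external_y, predict(external_x, threshold))
print(f"internal hold-out BAC = {internal_bac:.2f}, external BAC = {external_bac:.2f}")
```

No parameter is re-estimated on the hold-out or external data, which is what makes the resulting accuracies an estimate of generalization rather than of fit.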

Although a balanced accuracy score of nearly 64% as derived from our training sample (B-SNIP1) may be regarded as insufficient for SPEM performance to be used as a single screening instrument for determining psychosis-related sensorimotor transformation function, it significantly exceeds chance level and remains within the range of expectable results in similarly sized heterogeneous psychiatric samples¹¹. Additionally, a likelihood ratio for a positive test result of 2.18 could be interpreted as a small (but important) change in probability⁴². Our second key finding emphasizes the generalization to new data when applying the model to an independent cohort of chronically ill psychosis probands and healthy controls. Regarding the first external validation in the B-SNIP2 sample (external validation-1), our machine yielded a comparable (even slightly higher) balanced accuracy of 65.03% when discriminating the two groups. This result is particularly meaningful due to (a) the independence of both data sets and (b) slight differences in the SPEM task design, underlining the robustness of classification results by our model. A third cohort with chronically ill psychosis probands and healthy controls was derived from the FOR2107 consortium (external validation-3) and could be classified correctly with a balanced accuracy of 58.37%.

Our findings support the original suggestions by Diefendorf and Dodge⁴⁵ to use SPEM as a neurobiological diagnostic tool that comes with multiple advantages, including standardized measurements and brief 5-min testing feasible even for severely impaired patients. Here, we applied a constellation of SPEM tasks consisting of full-ramp and foveo-petal step-ramp trials at a constant velocity of 18.7° of visual angle per second. These specific SPEM tasks allow the computation of the four key measures to evaluate SPEM performance and can be recommended for future studies. Our results add to previous findings based on traditional group analyses in indicating that SPEM is a valuable psychosis-related biomarker of sensorimotor integrity that is useful even at the single-subject level²⁰. Besides its diagnostic value, this biomarker bears highly relevant information for establishing personalized treatment regimes.

Very recently, St Clair and colleagues⁴⁶ applied a multiclass machine-learning model to differentiate patients with schizophrenia, bipolar affective disorder, major depressive disorder, and healthy controls on the basis of 98 eye movement measures (including several SPEM variables). The model was tested in two validation sets, achieving balanced accuracies for schizophrenia patients of 73% and 75%. Both validation sets were relatively small (test-1 internal validation: n = 30 schizophrenia, n = 35 bipolar, n = 33 depression, n = 35 controls; test-2 external validation: n = 60 schizophrenia, n = 184 controls), which entails an increased risk of misclassification¹¹. To avoid this common shortcoming, we used a large internal validation sample and applied our machine to several extensive independent data sets. Of note, the task from St Clair and colleagues took about 15 min in total, yielding a total of 98 eye movement measures⁴⁷ derived from free viewing, fixation duration, and smooth pursuit tasks⁴⁶, which limits its clinical practicability.

To further determine the model’s specificity regarding the relationship between psychotic symptoms and SPEM performance we applied the machine to other patient groups. To this regard, there has been an extensive discussion concerning similarities and differences between schizophrenia and bipolar disorder⁴⁸. Machine-learning models based on brain data have been used to discriminate both patient groups⁴⁹, though often merging data from bipolar patients with and without history of psychotic episodes⁵⁰.

Similarly, St Clair and colleagues⁴⁶ did not specify psychosis symptoms in those patients suffering from bipolar disorder and major depression which we found has a significant impact as demonstrated by our external validation-2 sample from the PARDIP. In line with the idea of the relationship between SPEM deterioration and psychotic psychopathology, our machine classified about 68% of the bipolar probands with psychosis correctly as psychosis patients, while 61% of the bipolar probands without psychosis symptoms were classified as healthy (which means that they are closer to healthy individuals). Underlining its generalizability, 60% of the bipolar probands without psychotic symptoms from the FOR2107 study (external validation-3) were also rated closer to the healthy non-psychotic category.

Broadening the perspective of specificity regarding SPEM deficits in affective disorders, we found that nearly 81% of probands suffering from major depression without psychotic episodes (FOR2107 study, external validation-3) were classified as healthy indicating closer affiliation to the non-psychotic category. This result is in line with previous findings of only minor impaired SPEM performance from traditional group statistics³⁶ and multivariate pattern analyses based on brain data indicating major depression and schizophrenia as two end points of an interjacent continuum⁵⁰.

Our external-validation sample 4 from the PRONIA study was used to test our model in young probands being at clinical high risk for psychosis or experiencing a first psychotic or first depressive episode. Interestingly, about 42% of probands with clinical high risk of psychosis were categorized as psychosis probands which might support the idea of an underlying susceptibility of SPEM deficits in the psychosis spectrum⁵¹. Indeed, the specific SPEM measures of predictive and early maintenance gain indicated the worst performance in this proband group compared to all three other PRONIA groups (see Table 2). However, this group is extremely heterogeneous as indicated by large standard deviations in the early and maintenance gains (see Table 2). Note, transition rates for CHR to ROP are about 25% within 3 years indicating a high heterogeneity of CHR subjects regarding susceptibility to psychosis⁵². In contrast, in the relatively small (n = 11) and heterogeneous sample of recent-onset psychosis probands our machine only classified two probands (18%) as belonging to the psychosis group. Despite the small sample size, this observation points to possible differences in SPEM performance between recent-onset and chronic states of psychosis (see also Table 1 for information about illness duration) as discussed previously⁵³. That study observed subtle impairments of immediate sensorimotor processing in first-episode psychosis patients with only short duration of treatment, e.g. after 6 weeks, which appeared to be compensated by predictive drive to pursuit. In more detail, first-episode patients demonstrated slightly worse performance in the pure-ramp task (comparable to the step-ramp task in the current study) but were unaffected in the oscillating task (comparable to the triangle wave task in the current study). Deficits were discussed as possible medication effects with regard to their serotonergic antagonism of brainstem sensorimotor systems. 
However, as in the present study, no associations between SPEM variables and medication dosage were found⁵³. Indeed, in our ROP group (which might be comparable to the first-episode patients after short duration of treatment from the study by Lencer and colleagues⁵³), early maintenance gain, which is driven by immediate sensorimotor processing, was considerably reduced while predictive maintenance gain was unaffected (see Table 2). Notably, 76% of probands with recent-onset depression and 94% of healthy controls from the PRONIA sample were correctly classified as not belonging to the psychosis group.

Despite the clear strengths of the study, some limitations need to be discussed: (1) SPEM results for initial eye acceleration and latency differed between laboratories/recording devices (Supplementary Table 11). To estimate the impact of these two variables on the prediction of our machine, we additionally trained a machine in the B-SNIP1 sample using only the two eye velocity gain measures as predictors. The machine was able to distinguish psychosis probands from healthy controls with a balanced accuracy of 61.90% (Supplementary Table 12), which is close to the main result using all SPEM variables (63.96%). However, laboratory conditions and/or recording devices may have an impact on the measurement of SPEM initial eye acceleration and latency that could have affected prediction results. (2) As we trained the machine in a sample of chronically ill psychosis probands, possible effects of medication have to be taken into account. Although we found only small and inconsistent correlations between SPEM and chlorpromazine equivalents, effects of medication cannot be fully ruled out⁵³. (3) Furthermore, we found significant differences in cognition scores between psychosis probands and healthy controls in the B-SNIP1 sample. There might be effects of cognitive skills that cannot be entirely discarded. (4) Despite our comprehensive validation samples, our machine was not validated in a group of MDD with psychosis. (5) There is a discrepancy between sensitivity (53%) and specificity (75%), implying that our model is particularly suitable for correctly identifying healthy probands as healthy. (6) No follow-up data from the PRONIA study are available to evaluate transition rates of those CHR participants with poor SPEM performance.

Our comprehensive findings support SPEM as an indicator of sensorimotor transformation impairments relevant to patients suffering from chronic psychosis. Thus, our machine learning algorithm based on the performance in a 5-min SPEM task can help to obtain an overview of sensorimotor transformation profiles on an individual level that might inform treatment decisions in rehabilitation contexts, e.g. regarding sensorimotor remediation strategies.

Future studies should broaden this biomarker approach by combining indicators of sensorimotor function with multiple other relevant neurobiological measures, e.g. brain structure indices, to improve individual prediction accuracies and to inform personalized therapeutic decisions for psychotic disorders. Additionally, future studies should address the question of whether SPEM impairments can indicate illness progression independently of illness duration.

Methods

Subjects

SPEM data from five independent samples were included in the following analyses (Fig. 1):

B-SNIP1

First, the machine was trained and internally validated with SPEM data from the B-SNIP1 sample consisting of n = 674 chronically ill psychosis probands (n = 265 schizophrenia, n = 178 schizoaffective, and 231 bipolar with psychotic symptoms) and 305 healthy controls. Participants were recruited by the B-SNIP consortium across five sites in the US (Baltimore, Boston, Chicago, Dallas, Hartford; Tamminga et al.²⁷). Diagnoses were derived by a consensus of experienced clinicians based on all available clinical information and the Structured Clinical Interview for DSM-IV⁵⁴. Inclusion criteria comprised (1) age between 15 and 65 years; (2) reading score of the Wide Range Achievement Test ≥ 60⁵⁵; (3) no history of a neurologic disorder; (4) normal or corrected to normal vision (minimum of 20/40 acuity), (5) no history of substance abuse within the last month or substance dependence within the last three months, and negative urine toxicology on study day. Additionally, healthy controls were not allowed to have a personal or family history (first-degree) of psychotic or bipolar disorders, to have a history of recurrent mood disorder or to exhibit a history of psychosis spectrum personality traits⁵⁶. The protocol of the study was approved by institutional review boards at each of the study sites and participants provided written informed consent. For group differences in SPEM performance see Lencer et al.²⁰.

Second, the remaining study samples were used (a) as external validation data for the machine trained in the B-SNIP1 sample and (b) for investigating psychosis-related specificity of SPEM against probands with predominately affective disorders.

External validation-1: B-SNIP2

B-SNIP2 is the follow-up to B-SNIP1. SPEM data were available from n = 727 chronically ill psychosis probands (n = 288 schizophrenia, n = 264 schizoaffective, and 175 bipolar with psychotic symptoms) as well as n = 292 healthy controls recruited in Boston, Chicago, Dallas, Hartford, and Athens (GA). Inclusion criteria were identical to B-SNIP1. For further details on B-SNIP2 eye movements see Huang et al., 2021⁶², but SPEM data have not been published so far.

External validation-2: PARDIP

SPEM data from the multisite PARDIP consortium were available from n = 49 bipolar probands with psychotic symptoms (BPwP), n = 36 bipolar probands without psychotic symptoms (BPwoP), and n = 71 healthy controls. The PARDIP study took place in Dallas, Boston, and Hartford. It was nested within the B-SNIP consortium using similar inclusion criteria but, importantly, there was no overlap between PARDIP and B-SNIP participants. For further information on inclusion criteria and group differences in SPEM performance, see Brakemeier et al.⁵⁷.

External validation-3: FOR2107

In collaboration with the bicentric FOR2107 project (https://for2107.de/, Kircher et al.⁵⁸), SPEM data were measured in n = 94 probands with major depressive disorder without psychotic symptoms (MDwoP), n = 25 bipolar probands without psychotic symptoms (BPwoP), n = 51 psychosis probands, and n = 72 healthy controls at the Münster site.

External validation-4: PRONIA

Following an exploratory approach for testing the validity of our machine developed in stable probands with chronic psychosis, SPEM data were also collected in collaboration with the multisite PRONIA consortium (https://www.pronia.eu/, Koutsouleris et al.⁵⁹) from n = 11 probands with recent-onset psychosis, n = 19 probands with a high clinical risk for the development of psychosis, n = 17 probands with recent-onset depression, and n = 16 healthy controls at the Münster site.

All patients were medicated as prescribed by their doctors, except for regular or current sedative medication, which was an exclusion criterion (see chlorpromazine equivalents at time of testing in Table 1). Note that, prior to inclusion, ROP patients from the PRONIA sample had not been allowed to take any antipsychotic medication for longer than 90 days (within the past 24 months) with a daily dose rate at or above the minimum dosage of the DGPPN S3 guidelines⁶⁰.

Participants gave written informed consent according to the Declaration of Helsinki. Each study was approved by the respective local ethics committee.

Eye movement measurement and task

At all sites, the SPEM target consisted of a small stimulus (0.5°) moving back and forth in the horizontal plane at 18.7°/s constant velocity displayed on a monitor to constitute full-ramp trials within triangle wave tasks and foveo-petal step-ramp trials⁶¹. Participants were instructed to follow the stimulus with their eyes as accurately as possible while sitting in front of the monitor with their heads stabilized using a chin and forehead restraint. Across all studies, eye movements were recorded in a quiet and darkened room.

For B-SNIP1, B-SNIP2, and PARDIP samples (for further details refer to Brakemeier et al.⁵⁷; Huang et al.⁶²; Lencer et al.²⁰), participants were seated 60 cm from a 22-inch CRT monitor (1360 × 768 resolution; 150 Hz refresh rate) and eye movements were recorded using an Eyelink II (SR Research Ltd., Ontario/Canada) recording device at 500 Hz sampling rate. The stimulus comprised a red cross in a box covering 0.5° moving horizontally between ± 12° across the screen.

In the B-SNIP1 and PARDIP studies, 48 full-ramp and 32 foveo-petal step-ramp trials⁶¹, both at a constant velocity of 18.7° of visual angle per second, were applied to assess SPEM performance. In full-ramp trials, the stimulus moved back and forth with constant velocity in a triangular waveform. During step-ramp trials, the target started from the central position, stepped either to the right or the left (2.4° of visual angle, in randomized order) and then moved in the opposite direction towards the periphery at a constant velocity of 18.7° of visual angle per second. The stimulus re-crossed the central line after 133 ms, allowing the initiation of SPEM without a necessary catch-up saccade⁶¹. Additionally, some trials with target velocities of 9.7° and 26.6° of visual angle per second, as well as trials with intervals in which the target was blanked, were displayed to enhance attention (30% of trials) but were not included in the analyses. To ensure data quality, additional calibration trials were presented between blocks of trials. SPEM measurement was conducted identically across sites.
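The step-ramp timing can be checked with a short calculation. The sketch below divides the step size by the target velocity, which gives roughly 128 ms rather than the reported 133 ms. One possible reading, which is an assumption here and not stated in the text, is that the crossing time is quantized to whole frames of the 150 Hz display; the function name and the rounding rule are illustrative only.

```python
import math


def recross_time_ms(step_deg, velocity_deg_s, refresh_hz=None):
    """Time for the target to travel back over the initial step.

    If refresh_hz is given, the time is rounded up to a whole number of
    display frames (an assumption made for illustration, not a reported
    property of the stimulus software).
    """
    t_s = step_deg / velocity_deg_s
    if refresh_hz is not None:
        t_s = math.ceil(t_s * refresh_hz) / refresh_hz
    return 1000.0 * t_s


ideal_ms = recross_time_ms(2.4, 18.7)                      # continuous motion
framed_ms = recross_time_ms(2.4, 18.7, refresh_hz=150)     # whole frames at 150 Hz
print(round(ideal_ms, 1), round(framed_ms, 1))
```

Under this assumption the frame-quantized value agrees with the 133 ms given above, but other explanations (for example rounding in the original task description) cannot be ruled out from the text.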

For B-SNIP2, a slightly different task order was applied. Here, 48 full-ramp trials at a constant velocity of 18.7° of visual angle per second and a total of 48 foveo-petal step-ramp trials (32 × 18.7°, 8 × 9.7°, and 8 × 26.6° of visual angle per second; randomized for direction; Rashbass⁶¹) were presented in two test sets (each consisting of 48 trials) comprising six alternating blocks of either eight full-ramp or eight step-ramp trials. Step-ramp trials at 9.7° and 26.6° of visual angle per second were shown to enhance attention but were not included in the analyses. To ensure data quality, additional calibration trials were displayed between blocks of trials. SPEM measurement was conducted identically across sites.

For the FOR2107 and PRONIA samples, eye movements were recorded using an Eyelink 1000 (SR Research Ltd., Ontario/Canada) recording device at a 500 Hz sampling rate. Participants were seated 60 cm from a 22-inch CRT monitor (1360 × 768 resolution; 150 Hz refresh rate). Stimulus and task were identical to B-SNIP2.

Eye movement data processing

All SPEM data were analyzed using identical routines in MATLAB (The MathWorks, Natick, MA) developed by one of the authors (AS). Eye position data were filtered using a one-dimensional Gaussian filter (30 Hz), and smoothed eye velocity was subsequently computed with central median differentiation of 9 ms^20,57,63. Sections containing saccades and blinks were automatically detected and excluded from the computation of SPEM variables. To verify the automatic calculations, individual velocity traces were checked by visual inspection.

To assess the different sensorimotor aspects of SPEM performance, the following variables were computed^20,36,57 (see Fig. 2, adapted from Ref.⁵⁷):

Predictive maintenance gain during continuous pursuit was calculated from triangular wave tasks as the ratio of median eye velocity to target velocity from middle sections (300–840 ms after stimulus direction reversal) over all full-ramp trials (total duration of a ramp is 1200 ms). Predictive maintenance gain highly depends on predictive drive, i.e. cognitive input to the pursuit system for sustained SPEM under closed-loop conditions.
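The gain definition can be illustrated with a minimal Python sketch. It assumes velocity traces sampled at 500 Hz that are sign-corrected so that movement in the target direction is positive, with saccade and blink samples already marked as `None`. Pooling all samples across ramps before taking the median is an assumption; the original MATLAB routines may aggregate differently. The function name `maintenance_gain` is hypothetical.

```python
from statistics import median

SAMPLE_RATE_HZ = 500     # sampling rate reported for the eye trackers
TARGET_VELOCITY = 18.7   # target velocity in deg/s


def maintenance_gain(ramps, window_ms=(300, 840)):
    """Ratio of median eye velocity to target velocity within a time window.

    ramps: list of per-ramp velocity traces (deg/s); sample 0 is the
    stimulus direction reversal, None marks excluded samples.
    """
    start = window_ms[0] * SAMPLE_RATE_HZ // 1000
    stop = window_ms[1] * SAMPLE_RATE_HZ // 1000
    # Pool the valid samples of the middle section over all ramps (assumption).
    pooled = [v for ramp in ramps for v in ramp[start:stop] if v is not None]
    if not pooled:
        return None  # no usable pursuit samples
    return median(pooled) / TARGET_VELOCITY


# Two synthetic 1200 ms ramps with a constant eye velocity of 16.83 deg/s.
example = [[16.83] * 600, [None] * 150 + [16.83] * 450]
gain = maintenance_gain(example)
```

The same function with `window_ms=(350, 550)` and traces aligned to stimulus onset would correspond to a gain computed over a 350–550 ms window in step-ramp trials.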

In contrast, measures from foveo-petal step-ramp tasks represent rapid sensorimotor transformations using immediate visual motion and early performance feedback.

These measures included, first, early maintenance gain, computed as the ratio of median eye velocity to target velocity from middle sections (350–550 ms after stimulus onset) over all unpredictable step-ramp trials, thus reflecting early eye velocity under visual feedback control⁶⁴. Typically, early maintenance gain is considerably lower than sustained predictive maintenance gain.

Second, for the computation of initial eye acceleration under open-loop conditions, when visual feedback is not yet available, eye velocity was smoothed using a Savitzky-Golay finite impulse response filter (polynomial order of 3 and a frame length of 63). The onset of eye acceleration was defined as eye velocity exceeding a noise threshold for at least 20 ms. The threshold was set at 3.2 standard deviations above mean resting eye velocity, which was calculated from 200 ms before to 100 ms after ramp onset (Carl & Gellman⁶⁵). Initial eye acceleration was then computed as the slope of a robust linear regression (robustfit in MATLAB) in a 100 ms time window starting at acceleration onset, over all trials.

Third, eye latency was determined as the time elapsed between the onset of stimulus movement and the onset of eye acceleration⁶⁵, over all trials.
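The onset, acceleration, and latency definitions can be sketched in Python as follows. This is a simplified illustration, not the authors' MATLAB code: it assumes a single, already smoothed velocity trace sampled at 500 Hz (2 ms per sample), skips the Savitzky-Golay step, and uses an ordinary least-squares slope as a plain stand-in for the robust regression. All function names are hypothetical.

```python
from statistics import mean, stdev

DT_MS = 2  # 500 Hz sampling -> 2 ms per sample


def pursuit_onset(velocity, ramp_idx, k_sd=3.2, min_ms=20):
    """First sample from which velocity stays above threshold for min_ms."""
    # Resting velocity: 200 ms before to 100 ms after ramp onset.
    base = velocity[ramp_idx - 200 // DT_MS: ramp_idx + 100 // DT_MS]
    threshold = mean(base) + k_sd * stdev(base)
    need = min_ms // DT_MS
    run = 0
    for i in range(ramp_idx, len(velocity)):
        run = run + 1 if velocity[i] > threshold else 0
        if run == need:
            return i - need + 1
    return None


def ols_slope(ys, dt_s):
    """Least-squares slope of ys over time (stand-in for robust regression)."""
    ts = [i * dt_s for i in range(len(ys))]
    t_bar, y_bar = mean(ts), mean(ys)
    num = sum((t - t_bar) * (y - y_bar) for t, y in zip(ts, ys))
    den = sum((t - t_bar) ** 2 for t in ts)
    return num / den


def acceleration_and_latency(velocity, ramp_idx, window_ms=100):
    """Return (initial acceleration in deg/s^2, latency in ms) for one trial."""
    onset = pursuit_onset(velocity, ramp_idx)
    if onset is None:
        return None, None
    segment = velocity[onset: onset + window_ms // DT_MS]
    accel = ols_slope(segment, DT_MS / 1000)
    latency_ms = (onset - ramp_idx) * DT_MS
    return accel, latency_ms
```

An ordinary least-squares slope is more sensitive to outlying samples than a robust fit, so the substitution matters for noisy traces; it is used here only to keep the sketch free of external dependencies.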

Psychometric, cognitive, and clinical measures

Psychosis-related symptoms

For the B-SNIP1, B-SNIP2, PARDIP, and PRONIA studies, psychosis-related symptoms were rated using the Positive and Negative Syndrome Scale (PANSS)⁶⁶, while the FOR2107 study used the Scale for the Assessment of Positive Symptoms (SAPS) and the Scale for the Assessment of Negative Symptoms (SANS)⁶⁷. To ensure comparability, SANS and SAPS scores were converted to PANSS scores⁶⁸ (see Supplementary Table 1).

Depression

Depressive symptoms were quantified with the Montgomery–Åsberg Depression Rating Scale (MADRS; Montgomery & Åsberg⁶⁹) in the B-SNIP1, B-SNIP2 and PARDIP studies and using the original Beck Depression Inventory in the 1978 version⁷⁰ in the FOR2107 sample. For PRONIA, the Beck Depression Inventory-II (BDI-II)⁷¹ was applied. Severity gradation (MADRS⁷², BDI⁷¹) is given in Supplementary Table 2.

Mania

For the B-SNIP1, B-SNIP2, PARDIP, and FOR2107 samples, mania was estimated using the Young Mania Rating Scale⁷³. Mania was not assessed in the PRONIA sample.

Cognitive abilities

A total score indicating cognitive abilities was estimated using the Wide Range Achievement Test 4 (WRAT4⁵⁵) in the B-SNIP1, B-SNIP2, and PARDIP samples. For the FOR2107 study, the Multiple-Choice Vocabulary Test, version B (MWT-B⁷⁴) was used. Scores were converted to the IQ scale⁷⁴. For the PRONIA sample, the matrix reasoning subtest of the Wechsler Adult Intelligence Scale⁷⁵ was applied to evaluate cognition.

Statistical analyses

Machine learning approach

The machine learning model was trained in the B-SNIP1 sample to distinguish psychosis probands from healthy (non-psychotic) controls using the PHOTONAI software⁷⁶ and scikit-learn toolboxes⁷⁷. A k-fold nested cross-validation procedure was applied to separate the data used to train the model from the data used for internal validation. To obtain the most informative model, parameters were optimized in an inner cycle (10 folds), and the best-performing model, chosen by highest balanced accuracy ([sensitivity + specificity]/2, which takes imbalanced data sets into account), was deployed to an outer cycle (3 folds). Special attention was given to ensuring that there was (1) no information leakage between training and validation data⁷⁶ and (2) a sufficiently large validation set to provide stable and meaningful results for unseen (external) samples¹¹. For specifications of the best model, see Supplementary Table 3.
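The selection metric and the fold structure can be made concrete with a small Python sketch. It computes balanced accuracy as (sensitivity + specificity)/2 and generates index splits for 3 outer and 10 inner folds. The splitting code is a plain illustration under simplifying assumptions (random, non-stratified folds); it does not reproduce the PHOTONAI implementation, and the function names are hypothetical.

```python
import random


def balanced_accuracy(y_true, y_pred, positive=1):
    """(sensitivity + specificity) / 2 for a binary problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    return (tp / (tp + fn) + tn / (tn + fp)) / 2


def k_fold_indices(indices, k, rng):
    """Yield (train, test) index lists for k random folds."""
    shuffled = list(indices)
    rng.shuffle(shuffled)
    folds = [shuffled[i::k] for i in range(k)]
    for i in range(k):
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, folds[i]


def nested_splits(n_samples, outer_k=3, inner_k=10, seed=0):
    """Outer folds estimate performance; inner folds tune parameters.

    Inner folds are drawn only from the outer training indices, so the
    outer test data never influence model selection.
    """
    rng = random.Random(seed)
    for outer_train, outer_test in k_fold_indices(range(n_samples), outer_k, rng):
        inner = list(k_fold_indices(outer_train, inner_k, rng))
        yield outer_train, outer_test, inner
```

Because balanced accuracy averages the two class-wise rates, a classifier that labels every participant as a psychosis proband scores 0.5 regardless of how unequal the group sizes are.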

For each of the models, the following preprocessing steps were applied: (1) SPEM variables were standardized by scaling. (2) Missing values (predictive maintenance gain = 0%, early maintenance gain = 0.51%, initial eye acceleration = 1.23%, eye latency = 0.51%) were imputed with the median of the corresponding variable. (3) To account for the different group sizes (674 psychosis probands and 305 healthy controls), data were balanced either by randomly undersampling the majority class or by oversampling the minority class using SMOTE⁷⁸. (4) Principal component analysis was applied to reduce the dimensional space.
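Steps (1) to (3) can be sketched in plain Python as follows. The sketch covers standardization, median imputation, and random undersampling only; SMOTE oversampling and principal component analysis are omitted. It assumes missing values are represented as `None`, and the function names are hypothetical. In a cross-validated pipeline, the scaling and imputation statistics would be estimated on the training folds only.

```python
import random
from statistics import mean, median, pstdev


def standardize(column):
    """Z-scale a variable, leaving missing values (None) in place."""
    observed = [v for v in column if v is not None]
    mu, sd = mean(observed), pstdev(observed)
    return [None if v is None else (v - mu) / sd for v in column]


def impute_median(column):
    """Replace missing values with the median of the observed values."""
    fill = median(v for v in column if v is not None)
    return [fill if v is None else v for v in column]


def undersample_majority(labels, seed=0):
    """Indices of a class-balanced subset, drawn by random undersampling."""
    rng = random.Random(seed)
    by_class = {}
    for i, y in enumerate(labels):
        by_class.setdefault(y, []).append(i)
    n_min = min(len(ix) for ix in by_class.values())
    keep = []
    for ix in by_class.values():
        keep.extend(rng.sample(ix, n_min))
    return sorted(keep)


latency = impute_median(standardize([150.0, None, 170.0, 190.0]))
labels = [1] * 674 + [0] * 305          # group sizes given in the text
balanced = undersample_majority(labels)  # 305 participants per group
```

Undersampling discards majority-class data, whereas SMOTE synthesizes new minority-class points; treating the choice between the two as a tunable option is consistent with the description above.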

Predictors included the four SPEM variables described above (i.e. predictive maintenance gain, early maintenance gain, initial eye acceleration, and eye latency). Multiple classifiers with default parameters (support vector machine, random forest, Gaussian naive Bayes, logistic regression, AdaBoost) were then used to optimize representation of the underlying data and to discriminate group membership (i.e. psychosis proband or healthy control). Additionally, for the support vector machine, kernel (linear, rbf) and regularization (C = [0.1, 0.3, 0.5, 0.7, 0.9, 1]) parameters were optimized.
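The classifier search space described here can be enumerated with a few lines of Python. The sketch lists one default configuration per classifier plus the support vector machine grid; the labels are descriptive strings, not PHOTONAI or scikit-learn identifiers, and preprocessing options such as the choice of balancing strategy are left out, so the real search space is larger.

```python
from itertools import product

# Descriptive labels only (hypothetical names, not library identifiers).
CLASSIFIERS = ["svm", "random_forest", "gaussian_nb", "logistic_regression", "adaboost"]
SVM_GRID = {"kernel": ["linear", "rbf"], "C": [0.1, 0.3, 0.5, 0.7, 0.9, 1]}


def candidate_configs():
    """List (classifier, parameters) pairs evaluated in the inner loop."""
    configs = []
    for name in CLASSIFIERS:
        if name == "svm":
            for kernel, c in product(SVM_GRID["kernel"], SVM_GRID["C"]):
                configs.append((name, {"kernel": kernel, "C": c}))
        else:
            configs.append((name, {}))  # library default parameters
    return configs


configs = candidate_configs()
```

Each candidate would be scored by balanced accuracy on the inner validation folds, with the best one carried forward to the outer loop.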

Statistical inference was examined using permutation tests⁷⁹. To this end, true results were compared to a permutation distribution created from 1000 random rearrangements of the two group labels (healthy controls vs. psychosis group) relative to the predictors.
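The logic of such a label-permutation test can be sketched in Python. The generic function below recomputes a performance score after each shuffle of the group labels and reports the share of permuted scores that reach the observed one. The scoring function `midpoint_rule_score` is a toy, one-feature stand-in for training and evaluating a model; it and the add-one p-value convention are assumptions, not details taken from the study.

```python
import random


def permutation_p_value(score_fn, features, labels, n_permutations=1000, seed=0):
    """Return (observed score, p-value) from a label-permutation test."""
    rng = random.Random(seed)
    observed = score_fn(features, labels)
    shuffled = list(labels)
    reached = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)  # break the link between labels and predictors
        if score_fn(features, shuffled) >= observed:
            reached += 1
    # Add-one convention avoids reporting a p-value of exactly zero.
    return observed, (reached + 1) / (n_permutations + 1)


def midpoint_rule_score(x, y):
    """Toy scorer: balanced accuracy of a cut-off midway between class means."""
    pos = [v for v, t in zip(x, y) if t == 1]
    neg = [v for v, t in zip(x, y) if t == 0]
    mean_pos, mean_neg = sum(pos) / len(pos), sum(neg) / len(neg)
    cut = (mean_pos + mean_neg) / 2
    pred = [1 if (v > cut) == (mean_pos > mean_neg) else 0 for v in x]
    sens = sum(1 for p, t in zip(pred, y) if t == 1 and p == 1) / len(pos)
    spec = sum(1 for p, t in zip(pred, y) if t == 0 and p == 0) / len(neg)
    return (sens + spec) / 2
```

With 1000 permutations, the smallest p-value this convention can return is 1/1001, which is why the number of permutations limits the resolution of the test.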

Additionally, we trained machine learning algorithms to separate the psychosis proband groups within the B-SNIP1 sample from one another. In line with the idea of SPEM deterioration across the whole psychosis spectrum, results for distinguishing individual proband groups were close to chance level (balanced accuracies: schizophrenia vs. schizoaffective probands 52.65%, schizophrenia vs. bipolar probands 52.48%, schizoaffective vs. bipolar probands 51.00%, Supplementary Table 5).

External validation of the model was investigated by applying the best-performing model from B-SNIP1 to the B-SNIP2 (external validation-1), PARDIP (external validation-2), FOR2107 (external validation-3), and PRONIA (external validation-4) samples. Here, in accordance with the idea that there is a specific relationship between SPEM performance and psychosis syndromes, we also applied the model to non-psychotic psychiatric patient groups, expecting them not to be classified as psychosis probands (and thus to be closer to the healthy non-psychotic control group).

To examine the effect of sample size on model performance, additional models were trained and internally validated in a randomly selected half of the B-SNIP1 sample and in the combined B-SNIP1 and B-SNIP2 samples.

Kendall’s tau correlation coefficients were computed between SPEM measures and chlorpromazine equivalents⁸⁰. Additionally, correlations were calculated between SPEM measures and WRAT4 scores as well as z-scores of the Brief Assessment of Cognition in Schizophrenia (BACS; Keefe et al.⁸¹). Analyses were computed in the B-SNIP1 sample. Results are reported at a Bonferroni–Holm-corrected alpha level, adjusted within each study across all four SPEM variables^82,83.
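The Bonferroni–Holm step-down procedure across four tests can be sketched in Python. The p-values in the example are invented for illustration and are not results from the study; the function name is hypothetical.

```python
def holm_bonferroni(p_values, alpha=0.05):
    """Return a list of booleans: True where the null hypothesis is rejected.

    P-values are tested in ascending order against alpha / (m - rank);
    testing stops at the first p-value that fails its threshold.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    reject = [False] * m
    for rank, i in enumerate(order):
        if p_values[i] <= alpha / (m - rank):
            reject[i] = True
        else:
            break  # all larger p-values are retained as well
    return reject


# Hypothetical p-values for the four SPEM variables (illustration only).
decisions = holm_bonferroni([0.010, 0.040, 0.030, 0.005])
```

For four variables, the smallest p-value is compared against 0.05/4 = 0.0125, the next against 0.05/3, and so on, which makes the procedure uniformly less conservative than a plain Bonferroni correction.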

Data availability

The data can be provided by Rebekka Lencer pending scientific review and a completed material transfer agreement. Requests for the data should be submitted to: Rebekka Lencer, lencer@uni-muenster.de.

References

García-Gutiérrez, M. S. et al. Biomarkers in psychiatry: Concept, definition, types and relevance to the clinical reality. Front. Psychiatry 11, 1–14 (2020).
Miranda, L., Paul, R., Pütz, B., Koutsouleris, N. & Müller-Myhsok, B. Systematic review of functional MRI applications for psychiatric disease subtyping. Front. Psychiatry https://doi.org/10.3389/fpsyt.2021.665536 (2021).
Quaak, M., van de Mortel, L., Thomas, R. M. & van Wingen, G. Deep learning applications for the classification of psychiatric disorders using neuroimaging data: Systematic review and meta-analysis. NeuroImage Clin. 30, 102584 (2021).
Steardo, L. et al. Application of support vector machine on FMRI data as biomarkers in schizophrenia diagnosis: A systematic review. Front. Psychiatry 11, 1–9 (2020).
Bracher-Smith, M., Crawford, K. & Escott-Price, V. Machine learning for genetic prediction of psychiatric disorders: A systematic review. Mol. Psychiatry 26, 70–79 (2021).
Rashid, B. & Calhoun, V. Towards a brain-based predictome of mental illness. Hum. Brain Mapp. 41, 3468–3535 (2020).
Shatte, A. B. R., Hutchinson, D. M. & Teague, S. J. Machine learning in mental health: A scoping review of methods and applications. Psychol. Med. 49, 1426–1448 (2019).
Schnack, H. G. Improving individual predictions: Machine learning approaches for detecting and attacking heterogeneity in schizophrenia (and other psychiatric diseases). Schizophr. Res. 214, 34–42 (2019).
Clementz, B. et al. Psychosis biotypes: Replication and validation from the B-SNIP consortium. Schizophr. Bull. 48, 56–68 (2022).
Varoquaux, G. Cross-validation failure: Small sample sizes lead to large error bars. Neuroimage 180, 68–77 (2018).
Flint, C. et al. Systematic misestimation of machine learning performance in neuroimaging studies of depression. Neuropsychopharmacology 46, 1510 (2021).
Varoquaux, G. et al. Assessing and tuning brain decoders: Cross-validation, caveats, and guidelines. Neuroimage 145, 166–179 (2017).
Arbabshirani, M. R., Plis, S., Sui, J. & Calhoun, V. D. Single subject prediction of brain disorders in neuroimaging: Promises and pitfalls. Neuroimage 145, 137–165 (2017).
Winter, N. R. et al. Quantifying deviations of brain structure and function in major depressive disorder across neuroimaging modalities. JAMA Psychiatry 79, 879 (2022).
Shi, D. et al. Machine learning of schizophrenia detection with structural and functional neuroimaging. Dis. Mark. https://doi.org/10.1155/2021/9963824 (2021).
Jo, Y. T. et al. Diagnosing schizophrenia with network analysis and a machine learning method. Int. J. Methods Psychiatr. Res. https://doi.org/10.1002/mpr.1818 (2020).
Leigh, J. R. & Zee, D. S. The Neurology of Eye Movements (Oxford University Press, 2015). https://doi.org/10.1093/med/9780199969289.001.0001.
Levy, D. L., Sereno, A. B., Gooding, D. C. & O’Driscoll, G. A. Eye tracking dysfunction in schizophrenia: Characterization and pathophysiology. Curr. Top. Behav. Neurosci. 4, 311–347 (2010).
O’Driscoll, G. A. & Callahan, B. L. Smooth pursuit in schizophrenia: A meta-analytic review of research since 1993. Brain Cogn. 68, 359–370 (2008).
Lencer, R. et al. Pursuit eye movements as an intermediate phenotype across psychotic disorders: Evidence from the B-SNIP study. Schizophr. Res. 169, 326–333 (2015).
Morita, K., Miura, K., Kasai, K. & Hashimoto, R. Eye movement characteristics in schizophrenia: A recent update with clinical implications. Neuropsychopharmacol. Rep. 40, 2–9 (2020).
Wolf, A., Ueda, K. & Hirano, Y. Recent updates of eye movement abnormalities in patients with schizophrenia: A scoping review. Psychiatry Clin. Neurosci. 75, 82–100 (2021).
Lencer, R. et al. Sensorimotor transformation deficits for smooth pursuit in first-episode affective psychoses and schizophrenia. Biol. Psychiatry 67, 217–223 (2010).
Lencer, R. et al. Altered transfer of visual motion information to parietal association cortex in untreated first-episode psychosis: Implications for pursuit eye tracking. Psychiatry Res. 194, 30–38 (2011).
Sweeney, J. A. et al. Eye tracking dysfunction in schizophrenia: Characterization of component eye movement abnormalities, diagnostic specificity, and the role of attention. J. Abnorm. Psychol. 103, 222–230 (1994).
Holzman, P. S., Proctor, L. R. & Hughes, D. W. Eye-tracking patterns in schizophrenia. Science 181, 179–181 (1973).
Tamminga, C. A. et al. Clinical phenotypes of psychosis in the bipolar-schizophrenia network on intermediate phenotypes (B-SNIP). Am. J. Psychiatry 170, 1263–1274 (2013).
Clementz, B. et al. Testing psychosis phenotypes from bipolar-schizophrenia network for intermediate phenotypes for clinical application: Biotype characteristics and targets. Biol. Psychiatry Cogn. Neurosci. Neuroimaging 5, 808–818 (2020).
Xiao, Y. et al. Subtyping schizophrenia patients based on patterns of structural brain alterations. Schizophr. Bull. 48, 241–250 (2022).
Mothi, S. S. et al. Machine learning improved classification of psychoses using clinical and biological stratification: Update from the bipolar-schizophrenia network for intermediate phenotypes (B-SNIP). Schizophr. Res. 214, 60–69 (2019).
Reininghaus, U. et al. Transdiagnostic dimensions of psychosis in the bipolar-schizophrenia network on intermediate phenotypes (B-SNIP). World Psychiatry 18, 67–76 (2019).
Takahashi, J. et al. Eye movement abnormalities in major depressive disorder. Front. Psychiatry https://doi.org/10.3389/fpsyt.2021.673443 (2021).
Bey, K. et al. Schizotypy and smooth pursuit eye movements as potential endophenotypes of obsessive-compulsive disorder. Eur. Arch. Psychiatry Clin. Neurosci. 269, 235–243 (2019).
Kathmann, N., Wagner, M., Rendtorff, N., Schöchlin, C. & Engel, R. R. Information processing during eye tracking as revealed by event-related potentials in schizophrenics, alcoholics, and healthy controls. Schizophr. Res. 16, 145–156 (1995).
Ross, R. G., Olincy, A., Harris, J. G., Sullivan, B. & Radant, A. Smooth pursuit eye movements in schizophrenia and attentional dysfunction: Adults with schizophrenia, ADHD, and a normal comparison group. Biol. Psychiatry 48, 197–203 (2000).
Lencer, R. et al. Smooth pursuit deficits in schizophrenia, affective disorder and obsessive-compulsive disorder. Psychol. Med. 34, 451–460 (2004).
Hutton, S. B. et al. The relationship between antisaccades, smooth pursuit, and executive dysfunction in first-episode schizophrenia. Biol. Psychiatry 56, 553–559 (2004).
Sweeney, J. A., Haas, G. L. & Li, S. Neuropsychological and eye movement abnormalities in first-episode and chronic schizophrenia. Schizophr. Bull. 18, 283–293 (1992).
Clementz, B., Sweeney, J. A., Hirt, M. & Haas, G. Pursuit gain and saccadic intrusions in first-degree relatives of probands with schizophrenia. J. Abnorm. Psychol. 99, 327–335 (1990).
Reilly, J. L., Lencer, R., Bishop, J. R., Keedy, S. & Sweeney, J. A. Pharmacological treatment effects on eye movement control. Brain Cogn. 68, 415–435 (2008).
Fusar-Poli, P. et al. The psychosis high-risk state. JAMA Psychiatry 70, 107 (2013).
Jaeschke, R., Guyatt, G. H. & Sackett, D. L. Users’ guides to the medical literature. III. How to use an article about a diagnostic test. B. What are the results and will they help me in caring for my patients?. JAMA 271, 703–707 (1994).
Lencer, R. & Trillenberg, P. Neurophysiology and neuroanatomy of smooth pursuit in humans. Brain Cogn. 68, 219–228 (2008).
Perna, G., Grassi, M., Caldirola, D. & Nemeroff, C. B. The revolution of personalized psychiatry: Will technology make it happen sooner?. Psychol. Med. 48, 705–713 (2018).
Diefendorf, A. R. & Dodge, R. An experimental study of the ocular reactions of the insane from photographic records. Brain 31, 451–489 (1908).
St Clair, D. et al. Eye movement patterns can distinguish schizophrenia from the major affective disorders and healthy control subjects. Schizophr. Bull. Open https://doi.org/10.1093/schizbullopen/sgac032 (2022).
Bestelmeyer, P. E. G. et al. Global visual scanning abnormalities in schizophrenia and bipolar disorder. Schizophr. Res. 87, 212–222 (2006).
Grunze, H. & Cetkovich-Bakmas, M. “Apples and pears are similar, but still different things”. Bipolar disorder and schizophrenia: discrete disorders or just dimensions? J. Affect. Disord. 290, 178–187 (2021).
Claude, L. A., Houenou, J., Duchesnay, E. & Favre, P. Will machine learning applied to neuroimaging in bipolar disorder help the clinician? A critical review and methodological suggestions. Bipolar Disord. 22, 334–355 (2020).
Koutsouleris, N. et al. Individualized differential diagnosis of schizophrenia and mood disorders using neuroanatomical biomarkers. Brain 138, 2059–2073 (2015).
Ivleva, E. I. et al. Smooth pursuit eye movement, prepulse inhibition, and auditory paired stimuli processing endophenotypes across the schizophrenia-bipolar disorder psychosis dimension. Schizophr. Bull. 40, 642–652 (2014).
Salazar de Pablo, G. et al. Probability of transition to psychosis in individuals at clinical high risk: An updated meta-analysis. JAMA Psychiatry 78, 970–978 (2021).
Lencer, R. et al. Effects of second-generation antipsychotic medication on smooth pursuit performance in antipsychotic-naive schizophrenia. Arch. Gen. Psychiatry 65, 1146–1154 (2008).
First, M. B., Spitzer, R. L., Gibbon, M. & Williams, J. B. W. Structured Clinical Interview for DSM-IV Axis I Disorders-Patient Edition (SCID-I/P, Version 2.0) (Biometrics Research Department, New York State Psychiatric Institute, 1995).
Wilkinson, G. S. & Robertson, G. J. Wide Range Achievement Test 4 (WRAT4) (Psychological Assessment Resources, 2006).
Pfohl, B., Blum, N. & Zimmerman, M. Structured Interview for DSM-IV Personality: SIDP-IV (American Psychiatric Pub, 1997).
Brakemeier, S. et al. Smooth pursuit eye movement deficits as a biomarker for psychotic features in bipolar disorder—Findings from the PARDIP study. Bipolar Disord. 22, 602–611 (2020).
Kircher, T. et al. Neurobiology of the major psychoses: A translational perspective on brain structure and function-the FOR2107 consortium. Eur. Arch. Psychiatry Clin. Neurosci. 269, 949–962 (2019).
Koutsouleris, N. et al. Prediction models of functional outcomes for individuals in the clinical high-risk state for psychosis or with recent-onset depression: A multimodal, multisite machine learning analysis. JAMA Psychiatry 75, 1156–1172 (2018).
Schwarzer, J. M. et al. The impact of visual dysfunctions in recent-onset psychosis and clinical high-risk state for psychosis. Neuropsychopharmacology https://doi.org/10.1038/s41386-022-01385-3 (2022).
Rashbass, C. The relationship between saccadic and smooth tracking eye movements. J. Physiol. 159, 326–338 (1961).
Huang, L. Y. et al. Antisaccade error rates and gap effects in psychosis syndromes from bipolar-schizophrenia network for intermediate phenotypes 2 (B-SNIP2). Psychol. Med. https://doi.org/10.1017/S003329172000478X (2021).
Sprenger, A. et al. The role of prediction and anticipation on age-related effects on smooth pursuit eye movements. Ann. N. Y. Acad. Sci. 1233, 168–176 (2011).
Lisberger, S. G., Morris, E. J. & Tychsen, L. Visual motion processing and sensory-motor integration for smooth pursuit eye movements. Annu. Rev. Neurosci. 10, 97–129 (1987).
Carl, J. R. & Gellman, R. S. Human smooth pursuit: Stimulus-dependent responses. J. Neurophysiol. 57, 1446–1463 (1987).
Kay, S. R., Fiszbein, A. & Opler, L. A. The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophr. Bull. 13, 261–276 (1987).
Andreasen, N. C. The scale for the assessment of negative symptoms (SANS): Conceptual and theoretical foundations. Br. J. Psychiatry 155, 49–52 (1989).
Van Erp, T. G. M. et al. Converting positive and negative symptom scores between PANSS and SAPS/SANS. Schizophr. Res. 152, 289–294 (2014).
Montgomery, S. A. & Åsberg, M. A new depression scale designed to be sensitive to change. Br. J. Psychiatry 134, 382–389 (1979).
Beck, A. T. & Steer, R. A. Internal consistencies of the original and revised beck depression inventory. J. Clin. Psychol. 40, 1365–1367 (1984).
Beck, A. T., Steer, R. A. & Brown, G. K. Manual for Beck Depression Inventory-II (Psychological Corporation, 1996).
Müller, M. J., Szegedi, A., Wetzel, H. & Benkert, O. Moderate and severe depression: Gradations for the Montgomery-Åsberg depression rating scale. J. Affect. Disord. 60, 137–140 (2000).
Young, R. C., Biggs, J. T., Ziegler, V. E. & Meyer, D. A. A rating scale for mania: Reliability, validity and sensitivity. Br. J. Psychiatry 133, 429–435 (1978).
Lehrl, S. Mehrfachwahl-Wortschatz-Intelligenztest MWT-B [Multiple Choice Vocabulary Test, Version B] (Spitta, 2005).
Wechsler, D. Wechsler Adult Intelligence Scale-Fourth Edition (WAIS–IV) (NCS Pearson, 2008).
Leenings, R. et al. PHOTONAI—A Python API for rapid machine learning model development. PLoS One 16, e0254062 (2021).
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Chawla, N. V., Bowyer, K. W., Hall, L. O. & Kegelmeyer, W. P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002).
Collingridge, D. S. A Primer on quantitized data analysis and permutation testing. J. Mix. Methods Res. 7, 81–97 (2013).
Andreasen, N. C., Pressler, M., Nopoulos, P., Miller, D. & Ho, B. C. Antipsychotic dose equivalents and dose-years: A standardized method for comparing exposure to different drugs. Biol. Psychiatry 67, 255–262 (2010).
Keefe, R. et al. Norms and standardization of the brief assessment of cognition in schizophrenia (BACS). Schizophr. Res. 102, 108–115 (2008).
Wright, S. P. Adjusted p-values for simultaneous inference. Biometrics 48, 1005–1013 (1992).
Holm, S. A simple sequentially rejective multiple test procedure. Scand. J. Stat. 6, 65–70 (1979).


Acknowledgements

We thank all participants who contributed their time and effort to participate in this study. Additionally, we thank Gunvant Thaker, MD, for his many scientific contributions to the B-SNIP consortium, especially his role in supporting and initiating the pursuit studies reported here, as well as collecting data.

Funding

Open Access funding enabled and organized by Projekt DEAL. This work was supported as follows: B-SNIP study: National Institute of Mental Health (grant numbers MH103366, MH096900, MH103368, MH077851, MH096913, MH078113, MH096942, MH077945, MH096957). PARDIP study: National Institute of Mental Health (grant numbers MH077851, MH077852, MH077862, MH077945, MH078113, MH096900, MH096913, MH096942 and MH096957). FOR2107 study: German Research Council (Deutsche Forschungsgemeinschaft, DFG, grant numbers KI 588/14-1, KI 588/14-2, KR 3822/7-1, KR 3822/7-2, NE 2254/1-2, DA 1151/5-1, DA 1151/5-2, SCHW 559/14-1, SCHW 559/14–2, WO 1732/4-1, WO 1732/4-2, AL 1145/5-2, CU 43/9-2, GA 545/7-2, RI 908/11-2, WI 3439/3-2, NO 246/10-2, DE 1614/3-2, HA 7070/2-2, JA 1890/7-1, JA 1890/7-2, MU 1315/8-2, RE 737/20-2, PF 784/1-2, KI 588/17-1, CU 43/9-1). PRONIA study: EU-FP7-HEALTH (agreement number 602152). Additionally, the current work was supported by the fund Innovative Medical Research of the University of Münster Medical School (IM, EJL; grant number ME 1 2 18 05) and the German Research Council (RL; Deutsche Forschungsgemeinschaft, DFG, grant number LE 1122/7-1).

Author information

Authors and Affiliations

Institute for Translational Psychiatry, University of Muenster, Albert Schweitzer Campus 1, Build. A9a, 48149, Muenster, Germany
Inga Meyhoefer, Dominik Grotegerd, Ramona Leenings, Elisabeth J. Leehr, Fabian Breuer, Marian Surmann, Karen Rolfes, Volker Arolt, Tim Hahn, Udo Dannlowski & Rebekka Lencer
Otto-Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Muenster, Muenster, Germany
Inga Meyhoefer, Volker Arolt, Markus Lappe & Rebekka Lencer
Department of Psychiatry and Psychotherapy, Medical Faculty, Heinrich-Heine University, Duesseldorf/LVR, Duesseldorf, Germany
Inga Meyhoefer, Frauke Schultze-Lutter & Eva Meisenzahl
Department of Neurology, University of Luebeck, Luebeck, Germany
Andreas Sprenger & David Derad
Department of Child Adolescence Psychiatry and Psychotherapy, University of Muenster, Muenster, Germany
Georg Romer
Institute of Psychology, University of Muenster, Muenster, Germany
Markus Lappe & Johanna Rehder
Department of Psychiatry and Psychotherapy, Ludwig-Maximilian University Munich, Munich, Germany
Nikolaos Koutsouleris
Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
Nikolaos Koutsouleris
Max-Planck-Institute of Psychiatry Munich, Munich, Germany
Nikolaos Koutsouleris
Department of Psychiatry and Psychotherapy, University of Luebeck, Luebeck, Germany
Stefan Borgwardt & Rebekka Lencer
Department of Psychiatry, Psychiatric University Hospital, University of Basel, Basel, Switzerland
Stefan Borgwardt
Department of Psychology, Faculty of Psychology, Airlangga University, Surabaya, Indonesia
Frauke Schultze-Lutter
University Hospital of Child and Adolescent Psychiatry and Psychotherapy, University of Bern, Bern, Switzerland
Frauke Schultze-Lutter
Department of Psychiatry and Psychotherapy, Philipps-University Marburg, Marburg, Germany
Tilo T. J. Kircher
Department of Psychiatry and Behavioral Neuroscience, University of Chicago, Chicago, USA
Sarah S. Keedy & Elliot S. Gershon
Department of Experimental and Clinical Pharmacology and Department of Psychiatry and Behavioral Sciences, University of Minnesota, Minneapolis, USA
Jeffrey R. Bishop
Department of Psychiatry, The University of Texas Southwestern Medical Center, Dallas, TX, USA
Elena I. Ivleva & Carol A. Tamminga
Departments of Psychology and Neuroscience, Bio-Imaging Research Center, University of Georgia, Athens, GA, USA
Jennifer E. McDowell & Brett A. Clementz
Department of Psychiatry and Behavioral Sciences, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
James L. Reilly
Department of Psychology, Rosalind Franklin University of Medicine and Science, Chicago, IL, USA
Scot Kristian Hill
Departments of Psychiatry and Neuroscience, Yale School of Medicine, and Olin Research Center, Institute of Living/Hartford Hospital, Hartford, CT, USA
Godfrey D. Pearlson
Department of Psychiatry, Harvard Medical School, Beth Israel Deaconess Medical Center, Boston, MA, USA
Matcheri S. Keshavan
Huaxi MR Research Center (HMRRC), Department of Radiology, West China Hospital of Sichuan University, Chengdu, China
John A. Sweeney
Department of Psychiatry and Behavioral Neuroscience, University of Cincinnati College of Medicine, Cincinnati, USA
John A. Sweeney

Authors

Inga Meyhoefer
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Sprenger
David Derad
Dominik Grotegerd
Ramona Leenings
Elisabeth J. Leehr
Fabian Breuer
Marian Surmann
Karen Rolfes
Volker Arolt
Georg Romer
Markus Lappe
Johanna Rehder
Nikolaos Koutsouleris
Stefan Borgwardt
Frauke Schultze-Lutter
Eva Meisenzahl
Tilo T. J. Kircher
Sarah S. Keedy
Jeffrey R. Bishop
Elena I. Ivleva
Jennifer E. McDowell
James L. Reilly
Scot Kristian Hill
Godfrey D. Pearlson
Carol A. Tamminga
Matcheri S. Keshavan
Elliot S. Gershon
Brett A. Clementz
John A. Sweeney
Tim Hahn
Udo Dannlowski
Rebekka Lencer

Contributions

Conceptualization: B-SNIP and PARDIP (Sarah S. Keedy, Jeffrey R. Bishop, Elena I. Ivleva, Jennifer E. McDowell, James L. Reilly, Scot Kristian Hill, Godfrey D. Pearlson, Carol A. Tamminga, Matcheri S. Keshavan, Elliot S. Gershon, Brett A. Clementz, John A. Sweeney), FOR2107 (Tilo T. J. Kircher, Udo Dannlowski), PRONIA (Nikolaos Koutsouleris, Eva Meisenzahl, Frauke Schultze-Lutter, Stefan Borgwardt), Rebekka Lencer, Inga Meyhoefer.
Methodology: Tim Hahn, Rebekka Lencer, John A. Sweeney, Andreas Sprenger, Inga Meyhoefer.
Investigation: B-SNIP and PARDIP (Sarah S. Keedy, Jeffrey R. Bishop, Elena I. Ivleva, Jennifer E. McDowell, James L. Reilly, Scot Kristian Hill, Godfrey D. Pearlson, Carol A. Tamminga, Matcheri S. Keshavan, Elliot S. Gershon, Brett A. Clementz, John A. Sweeney), FOR2107 (Tilo T. J. Kircher, Udo Dannlowski), PRONIA (Marian Surmann, Karen Rolfes, Volker Arolt, Georg Romer), Fabian Breuer, David Derad, Johanna Rehder.
Visualization: Inga Meyhoefer.
Supervision: B-SNIP and PARDIP (Sarah S. Keedy, Jeffrey R. Bishop, Elena I. Ivleva, Jennifer E. McDowell, James L. Reilly, Scot Kristian Hill, Godfrey D. Pearlson, Carol A. Tamminga, Matcheri S. Keshavan, Elliot S. Gershon, Brett A. Clementz, John A. Sweeney), FOR2107 (Tilo T. J. Kircher, Udo Dannlowski), PRONIA (Nikolaos Koutsouleris, Eva Meisenzahl, Frauke Schultze-Lutter, Stefan Borgwardt), Dominik Grotegerd, Ramona Leenings, Elisabeth J. Leehr, Markus Lappe.
Writing (original draft): Inga Meyhoefer, Rebekka Lencer.
Writing (review and editing): all authors.

Corresponding author

Correspondence to Rebekka Lencer.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Meyhoefer, I., Sprenger, A., Derad, D. et al. Evidence from comprehensive independent validation studies for smooth pursuit dysfunction as a sensorimotor biomarker for psychosis. Sci Rep 14, 13859 (2024). https://doi.org/10.1038/s41598-024-64487-6

Received: 11 March 2024
Accepted: 10 June 2024
Published: 15 June 2024
DOI: https://doi.org/10.1038/s41598-024-64487-6
Springer Nature Limited
