Hearing non-existent voices is a potential marker of psychosis development, and may create subjective distress and anxiety with corresponding functional decline and mental health problems. However, voice-hearing (VH) is quite common in the age period when psychosis develops, complicating clinical interpretation [1]. The prevalence of VH and auditory hallucinations (AVH) in the general population has been estimated to be around 4–7.5% [2,3,4,5]. Importantly, there seems to be a developmental effect, with psychotic symptoms (predominantly AVH) occurring in 17% of children aged 9–12 but only in 7.5% of adolescents aged 13–18 years [6]. A previous study from the present dataset found that 10.6% of a population-based sample of 9646 adolescents reported often hearing a voice speaking thoughts out loud [7]. However, only 5.3% reported that they experienced being troubled by voices, suggesting two groups of VH adolescents; a relatively large group hearing voices without feeling distressed, and a smaller group being distressed by the voices.

Both these groups may be referred to early psychosis intervention services, but the existence of distress may guide further treatment decisions [8], since distress may predict transition to psychosis [9]. Distress during VH, related to the existence of a non-realistic, psychotic interpretation of the voices that consequently increases anxiety, vigilance and personal suffering, is of clinical importance. VH may be considered to be several different phenomena: a symptom of psychosis [10], a symptom of psychosis risk; often depicted as “high-risk” (HR) “ultra-high risk”, “prodromal state” and “at-risk mental state” for psychosis, as well as a “psychosis-(like) experience” (PLE/PE), or as a normal phenomenon [8, 11,12,13]. Not being distressed suggests a non-psychotic interpretation; e.g. “I am hearing voices because I am tired” or “Sometimes my mind plays tricks on me” (for details, see [14, 15]). However, one may question the emphasis on distress, as there is limited knowledge on whether non-distressed voice-hearers are different to voice-hearers who are distressed by their voices. Do these groups differ in psychosocial characteristics known to be related to the development of psychosis? Research on adults demonstrates that there is a considerable high number of voice-hearers in the population who do not need psychiatric care [16, 17]. By examining characteristics of different voice-hearers in early development, more light could be shed on the possible symptom trajectories of voice-hearing individuals. If the non-distressing VH experience is a variation of normal perceptual phenomena, while the distressed adolescents are on a (non-deterministic) pathway to psychosis, the latter group should possibly have more psychosis-related characteristics, typically; (1) comorbid mental health symptoms (2) risk factors for psychosis (3) predictors of psychosis conversion, and (4) fewer psychosocial moderators and resilience factors.

Several mental health conditions often co-exist with psychosis. The most prevalent are affective symptoms [18], anxiety [19] and illicit substance abuse [20,21,22,23,24,25,26]. General markers of sub-optimal mental health are typically present before psychosis onset, such as anxiety and depressive disorders [11] and dysregulated activation [27]. Self-harm is particularly prevalent in early untreated psychosis phases [28], and is an important marker of mental distress in adolescents. Psychosocial variables may alone or in interaction with other variables or genetic vulnerability heighten the risk for psychosis [29], e.g. illicit substance use [30]. Cannabis has been implicated in psychosis development in prospective studies [31, 32], especially for moderate to severe use and/or if started in the early teens [33,34,35]. Similarly, other substances acting on dopamine receptors have been linked to psychosis development, especially (meta)amphetamines [36]. There is also an increasing international consensus that childhood trauma is a risk factor for psychosis [37,38,39,40,41] and for PEs [42], with reports of an almost three–fold greater likelihood of experiencing psychotic symptoms after childhood trauma [43], with a dose–response relationship [39, 40, 43,44,45,46]. A dose–response relationship of a history of bullying and psychotic experiences has also been reported [47].

Predictors of conversion to psychosis include attenuated positive symptoms and functional decline in areas of school and social functioning. General and social functioning has been shown to be significantly lower for child and adolescent patients with psychotic-like experiences than without [48, 49]. Negative symptoms and social withdrawal [50] predict psychosis, typically seen as reduced school attendance in youngsters. In a five year prospective study conversion to psychosis was best predicted by disorganized communication, suspiciousness, verbal memory deficits, and decline in social functioning [51]. Furthermore, cognitive deficits [52, 53] and being part of high-risk populations [54, 55] are shown to be predictive of psychosis conversion [56,57,58]. For adolescents, cognitive difficulties are typically reflected by lower school grades.

Finally, there may be moderators of psychosis risk related to psychological resilience; family support, self-efficacy and experience of self-worth. Smith and colleagues [59] found that lower self-esteem was related to severity and negative content of AVH, and negative self-evaluations have been shown to be strongly related to psychosis even when depression has been controlled for [60, 61]. Reduced self-efficacy—an individual's belief in his or her capacity to execute behaviors necessary to produce specific performance attainments [62]—has been shown also in patients at-risk for psychosis [63]. Family support is important for avoiding new episodes of psychosis, [60, 64, 65], but is decreased and related to symptomatic and functional improvement in HR patients as well [49, 66].

By including information regarding the risk factors and characteristics described above, the present study aims to test the validity of a distress-based distinction of the concept of VH. We expect to find a more psychosis-associated psychosocial profile among the distressed adolescents compared to the voice-hearers not distressed by the voices; more comorbid mental health symptoms (anxiety, affective symptoms, dysregulation of activation, self-harm), risk factors for psychosis (experience of trauma, bullying, illicit drug use) and predictors of psychosis conversion and markers of mental distress (social functioning, school grades, days absent from school, distractibility) and fewer psychosocial moderators and resilience factors (family support, self-efficacy, experience of self-worth).



The present study included a population-based sample of adolescents participating in the fourth wave of the Bergen Child study (see for details), headed by the Regional Centre for Child and Youth Mental Health and Child Welfare, Norce Norwegian Research Center. The sample included all adolescents attending high school during the spring 2012, mostly students born between 1993 and 1995. The adolescents received information about the study to their homes and/or via their school e-mail, and completed the internet-based questionnaire at school upon consent. The study was approved by the Regional Committee for Medical and Health Research Ethics in Western Norway. The current study is based on the first version of quality-assured data files released in May 2012. Information about sex and year of birth were based on the personal identity number in the Norwegian National Population Register. Of the included adolescents (n = 19,121), 11,092 completed or filled in parts of the questionnaire. Some subjects (n = 746) were removed due to missing personal identity numbers and unreliable answer profiles resulting in a total sample of 10,346 adolescents in the present study. The sample has previously been found to be representative of Norwegian high school youth [67].



Presence of VH was assessed with two items from extended Launay-Slade Hallucinations Scale (LSHS-E) [68]; (1) “I often hear a voice speaking my thoughts aloud”; and (2) “I have been troubled by hearing voices in my head”. Responses are given on a 5-point scale (0 = “certainly does not apply to me”, 1 = “possibly does not apply to me”, 2 = “unsure”, 3 = “possibly applies to me” and 4 = “certainly applies to me”). Factor analyzes of the LSHS-E have shown that both of these items load on an underlying “auditory hallucinations” factor [68, 69], but in addition, item 2 load on a factor associated with psychopathology [70, 71], and has been associated with higher degree of negative emotionality than item 1 [72]. The five response categories to the two VH items were recoded to responses “0″ and “1″ being coded as “No endorsement” (0) and responses “3″ and “4″ being coded as endorsement (1). Adolescents answering “2″ were coded as unsure and were excluded from the group analysis (n = 473). The following three groups were examined (1) No VH (no endorsement, both items) (2) VH (endorsement of hearing voices only) and (3) Distressing VH (endorsement of distressing voices). This procedure resulted in a score in one of these three response categories only, thus the subjects were only allocated to one of the three groups.

Predictor variables

The predictor variables were based on selected items about psychosocial, demographic and clinical status and characteristics chosen from the web-based questionnaire based on previous studies on factors related to early psychosis, see Table 1 for details. To validate the constructs and content, seven of the predictor variables were derived using confirmatory factor analyzes (factor loadings not given, goodness of fit: χ2 = 17,665.639, p < 0.001, CFI =0.92, TLI =0.91, RMSEA =0.048, 90% RMSEA CI 0.047–0.049, RMSEA close fit =1.000). For the other seven predictor variables factor analysis was not required since the relevant questions were clearly defined with narrow connotations. Six of these variables were based on one single item and one was based on 4 items. The non-factor analyses variables were; (1) Experience of trauma; 4 items about negative life events from a previous Norwegian national survey [73] (2) Experience of bullying; from the national Olweus bullying survey [74] (3) Illicit drug use; developed for the web-based questionnaire [75] (4) Self-harm; from the Child and Adolescent Self-harm in Europe (CASE) Study [76] (5) Anxiety; from the Screen for Child Anxiety Related Disorders; SCARED [77] (6) School grades; mean grades from official registers from the Hordaland County, excluding Physical Education, and (7) Days absent from school (web-based questionnaire).

Table 1 Overview of predictor variables

The factor analyses-based variables were (1) Family support; nine items from the Resilience Scale for Adolescents; READ [78]; (2) Self-efficacy; three items from READ (3) Distractibility, four items from the adult ADHD Self-Report Scale; ASRS [79], two items from The Strengths and Difficulties Questionnaire; SDQ [80] and one item from the Short Mood and Feelings Questionnaire; SMFQ [81]; (4) Dysregulation of activation; three items from ASRS, two items from SDQ and one items from SMFQ (5) Experience of self-worth; seven items from SMFQ (6) Affective symptoms; seven items from SMFQ, and (7) Social functioning; three items from the Autism symptom self-report; ASSERT [82] and two items from READ.


IBM SPSS Statistics 23 was used for descriptive analyses [83]: mean, standard deviation (SD) and skewness. Factor scores were estimated through confirmatory factor analysis with the program Mplus 7.4 [84]. Since the outcome variable was nominal with three categories, a multinomial regression model was analyzed with SPSS. All predictors were included simultaneously in one block [85], and give adjusted estimates for all predictor variables. Thus, this model analyzes the effect of “hearing voices” and “being troubled by voice hearing” in contrast to no voice hearing as the reference category.


Descriptive results

Of the 10,346 adolescents, 53.5% were girls. The mean age was 17.70 years (SD =1.60). The prevalence of VH was No VH = 80.6% (N = 7954), VH =13.9% (N = 1371), and Distressing VH = 5.6% (N = 548), yielding a total of n = 9873 (excluding adolescents answering “unsure”). The descriptive results for the predictor variables are presented in Table 2 and 3. The results indicate relatively low prevalence of endorsement in general, except for Anxiety with 15% endorsement.

Table 2 Frequencies of the categorical predictor variables
Table 3 Means and standard deviations of the continuous predictor variables

Prediction of voice-hearing

The results from the predictor model of VH are presented in Table 4. The outcome variable was specified with the predictor effect on “VH” and “distressing VH” against the “No VH” as the reference category. Models for subjects between 16–19 and 16–25 years old are presented. No evidence for high multicollinearity between the predictor variables emerged, indicated by VIF with the highest value of 4.75. One predictor correlation was found to be high (r = 0.86), with 73% explained common variance.

Table 4 Results from multinomial regression for the categories “voice-hearing” and “distressed voice-hearing” in contrast to “no voice-hearing” (reference category)

Some variables were related to the category “VH” in contrast to “no VH”. These were Experience of trauma, School grades, Distractibility, Social dysfunction, Affective symptoms, Gender, and Age. Including a broader age span resulted in almost identical estimates as the model based on subjects 16–19 years old. This means that not distressing VH was more plausible among subjects with trauma experiences, higher mean academic characters, more problems regarding distractibility and social functioning, higher levels of affective symptoms, younger age and among boys in contrast to girls. The strongest statistical relation was Social dysfunction with odds ratio (OR) at 2.21, which means 121% increased risk for VH for each increase in this variable.

Regarding distressing VH, the following predictors were found to be statistically significant: experience of bullying, trauma exposure, negative self-worth, less family support, stronger levels of distractibility, dysregulation of activation, negative self-efficacy, self-harm, anxiety, and lower age. The strongest risk factor was found to be negative self-worth.


Hearing thoughts out aloud without being disturbed by them versus being distressed by VH was mainly predicted by different constellations of psychosocial characteristics, suggesting that VH with and without distress represent two different groups of adolescents. Overall, this supports the emphasis put on distress in clinical practice. Not being disturbed by the voices was primarily related to social dysfunction, in addition to the experience of trauma, distractibility, affective symptoms, higher school grades, male gender and older age. Perceiving the voices as disturbing was related to the experience of bullying and trauma, negative self-worth, and self-efficacy, less family support, dysregulation of activation, distractibility, self-harm, anxiety and younger age. Negative self-worth was the strongest predictor.

As expected, the adolescents with disturbing VH were characterized by most of the typically depicted comorbid mental health symptoms in psychosis (anxiety, dysregulation of activation, self-harm), risk factors for psychosis (experience of trauma and bullying), predictors of psychosis conversion and markers of mental distress (distractibility) and fewer psychosocial moderators and resilience factors (family support, self-efficacy, experience of self-worth). Some psychosis-related psychosocial variables were not, however, related to distressing voices in this population-based sample, i.e. worse school grades, more school absence, reduced social functioning, illicit substance use and more affective symptoms. This may be explained by characteristics of the sample, as most attended high school. Adolescents with major issues related to these variables may not be part of the present sample because they probably have dropped out of school.

Adolescents not distressed by voices displayed a somewhat different profile of psychosocial characteristics as compared to their no VH peers. Social dysfunction was the strongest predictor, followed by better grades and male gender. Distractibility was related to voice-hearing in both groups, and this is consistent with several studies finding reduced attentional control in all voice hearers regardless of diagnosis and treatment needs [86]. In addition, the importance of experience of trauma was seen for both groups, in line with studies showing childhood trauma in broader groups than psychosis only [41]. The effect of lower age yielding more VH seen in previous studies [6] was also seen in both groups. The relationship to better grades for the not distressed by voices group seems paradoxical. It can be speculated that this may reflect high-or over-achieving youngsters with suboptimal healthy work-rest balance, social life and mental overload. Youngsters within a high-functioning autism/Asperger spectrum continuum with low social functioning, male gender, better grades and voice hearing could also belong in this group. Increased school related stress and a more competitive school environment have been suggested as factors for explaining increases in student mental health problems in Norway, supporting the first speculation, and in line with a somewhat increased affective symptom load.

The difference between the distressed and non-distressed group seems to be related to negative self-apprehensions, less support from family and possibly peers via bullying, all together potentially leading to reduced ability to use psychological and social coping strategies. Dysregulation of activation including questions on restlessness, anxiety and self-harm creates a picture of youngsters in emotional distress. The relationship between these variables and the development of psychosis should be studied in more detail. The present study does not test causality; whether these characteristics are an effect of the troubling voices or vice versa. The topic clearly awaits longitudinal studies.

Other questions also remain unanswered, representing limitations of the present study. We do not know whether being distressed or disturbed by the voices actually is a sign of a HR or a psychotic state, which is dependent on a more comprehensive understanding of the individual interpretations of the voices than is possible via a web-based questionnaire. For instance, there may be a few individuals within the autism-spectrum in the present sample (0.45% broader defined Autism Spectrum Disorders; [82]) who may have very concrete interpretations of “voice speaking my thoughts aloud”, but since these youngsters also have a vulnerability for psychosis exclusion would create a biased sample. The current paradigm was not set out to test whether the voice hearing was part of other mental health disorders with frequent voice hearing. Some of the risk factors reported in the present study clearly overlap with other disorder, such as borderline and dissociative disorders. We do not know what the phenomenon of hearing voices out loud actually stands for, and we do not have data on the frequency or content of these experiences, variables which have been shown to identify voice-hearers in treatment [16]. When adolescents actually seek help for their voices, they present with a high level of suffering, a reduced level of functioning and severe and/or comorbid psychopathology [87].

There is a trade-off between the comprehensiveness of topics covered by the web-based questionnaire, the sample size and the in-depth understanding of the different phenomena in question. Finally, the study did not test the relative importance of the selected features in explaining the hearing voices categories, and whether the distress-related distinction is categorical or continuous. The strengths of the present study are the high number of participants and the representativeness of the sample via the population-based inclusion [67], supported by the high rate (93%) of 16–18 year olds that attend high-school in Norway [88]. In addition, the sample size enables the possibility of examining several important variables simultaneously with high levels of statistical power.

In conclusion, the findings support the validity of the clinical distinction between adolescents being disturbed by VH or not as seen in high quality early intervention services. Since VH in itself is a relatively frequent experience in adolescence, examining the amount of personal distress related to these alternative perceptual experiences may aid referral practices and treatment decisions in services for psychosis or high risk for psychosis.