Somatization is one of the most common issues in health care services, associated with substantial functional impairment and health care utilization [13]. Their valid and reliable acquisition is urgently necessary. Somatoform symptoms often account for sick leave and are characterized by long duration and medically unexplained symptoms [47]. The most frequently reported symptoms are fatigue, low energy, sleeping trouble, and pain (back pain, headaches, abdominal pain, and chest pain) [8, 9]. Medically unexplained symptoms are one of the key features of somatoform disorders. Although they are currently treated as both categorical (in terms of the diagnosis of somatoform disorders) and dimensional (in terms of quantitative measures of somatization/somatic symptom reporting), little is known about the empirical latent structure of medically unexplained symptoms. Accordingly to recent study results, the latent structure of somatization/somatic symptom reporting as assessed by the PHQ-15 is dimensional in both primary care and student samples [10].

Estimated prevalence rates of undifferentiated somatoform disorders vary between 8.6%-25.6% in primary care, depending on the chosen screening instrument and whether pain is taken into account or not [8, 1113]. Recent reported data on somatoform symptom clusters in the general population are still scarce [14]. Wittchen and colleagues (2011) reported in their systematic review a 12-month prevalence of somatoform disorders of 6.3% in the EU with little evidence for considerable cultural or country variation [15]. 4-week, 12-month and lifetime prevalence rates of any somatoform disorder in the German general population was reported with 7.5%, 11.0%, and respectively 16.2% [16].

Most clinicians nowadays evaluate, whether or not the reported somatoform complaints are associated with distress and psychological impairment, both predictors for somatoform disorders [17]. Screening instruments can add valuable diagnostic information, yet they vary considerably in length and diagnostic focus (for an overview of measures used in clinical trials of somatoform disorders see [18]). Patients often complain about the amount of items, which can lead to a difficult doctor-patient relationship and lower self-perception of quality of life and life satisfaction [1, 1921]. The reported impairment of every day functioning can be even higher when the patients are affected by comorbid conditions as depression and/or anxiety, which occurs in up to 43% with increasing number of physical symptoms [11, 17, 22]. A difficult encounter, as perceived by the clinician, may be another predictor of psychiatric comorbidity in patients who have somatoform symptoms [22]. The collaboration between patients and their doctors might also carry the risk of shaping, reinforcing, and legitimizing somatoform syndromes [23]. Hence it is important to take standardized assessment of somatization into account. These measures can have a variety of uses, including screening, early pre-diagnosis, assessment of severity, and gauging treatment decisions of both clinicians and patients.

The PHQ-15 is a self-administered somatic symptoms subscale, derived from the full Patient-Health-Questionnaire [7, 24]. Relatively brief, it screens for 15 somatic symptoms that account for more than 90% of the physical complaints reported in the outpatient setting (exclusive of self-limited upper respiratory symptoms) [20]. The PHQ-15 is a valid measure, which has been used in 40 studies so far in different health care settings (for an overview see [11]). Valid and reliable measures for the assessment of somatic symptoms, as the PHQ-15, have been used in psychiatric research and routinely in clinical practice so far (i.e. primary care). Normative data, which could be used to compare a subject's scale score with those determined from a general population reference group, are still scarce and restraint to relative risk factors [25]. The obtained findings could be further utilized as reference categories in community studies and open-access web-based screening tools [15, 26, 27].

In this study we provide normative data for the PHQ-15 for different age groups and both genders. In addition we address the relations of somatic symptoms with depression, and quality of life and life satisfaction to provide further evidence for the construct validity in a general population. According to previous results, we expect that higher PHQ-15 scores will be associated with worsening quality of life and life satisfaction as well as with increased depression [11].


Study sample

Nationally representative face-to face household surveys were conducted in Germany between 2003 and 2008 (n=5,031), representative of the German general population, with the assistance of an institute specialized for demographic research (USUMA, Berlin) according to the German law of data protection (§30a BDSG) and with written consent. Previously ethics were weighted to the respective interests of the public and of the individuals concerned following §823 (BGB) of the Civil Code of Law and in accordance with the guidelines in the Declaration of Helsinki. Representativeness was assured through a weighting process. Age, gender, and educational level were the major criteria for representativeness according to the register of the Federal Elections. Two callbacks had to be without success before an address was considered a failure. The sampling procedure consisted of sample points, household, and persons in the last stage. Target households within the sample points were determined using the random-route procedure: choosing sample point areas within Germany, randomly choosing households within these areas, and randomly choosing target persons within these households.

Sample characteristics

Attempts were made to contact 8008 persons. The set of questionnaires was administered to a sample of 5031 persons. Therefore the response rate was 62.8%. The main reasons for non-participation (37.2%) were: the general information request was refused (15.8%), the interview request was refused (7.9%), or there was no one at home for three times in a row (7.3%).

Sociodemographic characteristics of the sample are reported in Table 1. The analysis of the distribution of the data yielded skewness and kurtosis values of somatization of 1.63 and 3.29, respectively. We therefore decided to investigate group differences for sociodemographic characteristics using non-parametric tests.

Table 1 Demographic characteristics of the study sample and associations with PHQ-15 scores (N=5,031)

There were significant gender, age, education level, employment status, and income effects in the general population associated with a higher PHQ-15 score. The most marked group and the lowest groups were considered calculating the value of Cohen’s d using the means and standard deviations. As noted in Table 1, the calculated effect sizes were moderate for income and education, and high for age. Gender and employment status yielded small effect sizes.


Somatization (PHQ-15)

Somatization was measured using the somatic symptom module of the PHQ, the PHQ-15 [7, 28]. The items include the most prevalent DSM-IV somatization disorder somatic symptoms [29]. Subjects were asked for the last 4 weeks to rate the severity of 13 symptoms as 0 (“not bothered at all”), 1 (“bothered a little”), or 2 (“bothered a lot”). Two additional physical symptoms - feeling tired or having little energy, and trouble sleeping – are contained in the PHQ-9 depression module. For scoring, response options for these two symptoms are coded as 0 (“not at all”), 1 (“several days”), or 2 (“more than half the days” or “nearly every day”).

Thus, the total PHQ-15 score ranges from 0 to 30 and scores of ≥5, ≥10, ≥15 represent mild, moderate and severe levels of somatization. The reliability and validity of the PHQ-15 are high in clinical and occupational health care settings [2, 7, 11].

Depression (PHQ-9)

Depression was assessed with the PHQ nine item depression module (PHQ-9) [30]. Each of the nine PHQ depression items corresponds to one of the DSM-IV Diagnostic Criterion A symptoms for major depressive disorder [29]. Subjects were asked how often, over the last two weeks, they have been bothered by each of the depressive symptoms. Response options are “not at all”, “several days”, “more than half the days”, and “nearly every day”, scored as 0, 1, 2 and 3, respectively. PHQ-9 scores range from 0 to 27, with scores of ≥5, ≥10, ≥15, representing mild, moderate and severe levels of depression severity [31]. Psychometric properties of the PHQ-9 are well documented (for an overview see [11]).

Quality of life (SF-12)

The SF-12 is an ubiquitary adopted generic questionnaire on the subjectively perceived health-related quality of life and records the overall subjective state of health of adults for different diseases, in relation to their physical, psychological, and social aspects [32]. A longer version of the SF-12, the SF-20, was already previously used to assess functional impairment in combination with the PHQ-15 [7, 17].

SF-12 scales are namely: general health, physical functioning, role physical, bodily pain, vitality, social functioning, mental health, role emotional, yielding the summary scales physical- and mental health.

Life satisfaction (SWLS)

Satisfaction with life was measured with the Satisfaction With Life Scale, designed to measure global cognitive judgments of satisfaction with one's life, and consists of five items [33, 34]: “In most ways my life is close to my ideal”, “The conditions of my life are excellent”, “I am satisfied with my life”, “So far I have gotten the important things I want in life”, and “If I could live my life over, I would change almost nothing”.

Respondents indicated the extent to which they agreed with each item on a seven-point Likert scale ranging from “strongly agree” to “strongly disagree”. Translations of the SWLS into various languages are available and psychometric properties have been reviewed [35].

Internal consistencies

The parameter of internal consistency (Cronbach’s α) for the PHQ15 scale reached the value of α =0.82, for the PHQ-9 α=0.88 respectively. The Satisfaction with Life Scale showed a very good Cronbach’s α of 0.91. Cronbach’s α for the mental component scale (MCS) was 0.84, 0.91 respectively for the physical component scale (PCS) of the SF-12.

Data analysis

For reliability, internal consistency of the PHQ-15 was assessed. Base rates for single symptoms were calculated using frequency analysis. Descriptive statistics included analyses of prevalence. To determine prevalence rates, a cut-off score of ≥10 was used on the PHQ-15 because the range of ≥10 up to 30 reflects medium and high somatic symptom severity, respectively [7]. The selection of this cut-off score resulted in previous studies in a sensitivity of 80.2% und specifity of 58.5% for a somatoform disorder [3]. For construct validity, we investigated PHQ-15 scale intercorrelations with the PHQ-9 [7, 30], the SF-12 [32], and the Satisfaction With Life Scale [33]. In addition, we investigated group differences for sociodemographic characteristics using χ 2 -test and Kruskal-Wallis-test, respectively. Based on results from previous studies with the PHQ-15, we expected that women would have higher somatization scores compared with men and that levels of somatization increase with age and lower levels of education [8]. To provide normative data for the PHQ-15, we generated age- and gender specific percentiles for the PHQ-15 total score. Sample size was sufficient to be divided into gender-specific age groups comprising 10 years each. Statistical analyses were conducted using SPSS with an α-level of 1%. According to previous other studies with the PHQ-15 [11, 17], missing values were replaced with the mean value of the remaining items if the number of missing items was below 20%. If the number of missing items in the scale exceeded 20%, the sum score was not computed and counted as missing.


Prevalence of somatization syndromes

By using the cut-off scores described below, the total prevalence of somatization syndromes at a moderate to high level was estimated to be 9.3%; 8.1% of the men and 10.3% of women had a PHQ-15 sum score ≥10.

Base rates of single symptoms

The gender-stratified prevalence rates of the individual symptoms are shown in Figure 1. The most common symptoms were various types of pain (back pain, headache, pain of the joints and extremities) with prevalence rates >35%, if symptom reporting of any degree of severity was considered for both genders. Highest rates for severe symptom rating were found for the same symptoms (>4%). Further 2.4% of the total sample complained about sleeping trouble and 1.4% of a lack of energy nearly every day.

Figure 1
figure 1

Gender-stratified base rates of somatoform symptoms. Symptoms for which the subject had been “bothered a lot” are indicated by the black part of the bar and defined as severe.

Construct validity

The intercorrelations between the PHQ-15 total score and the PHQ-9 depression scale, the SF-12 for the assessment of quality of life (physical and mental factor), and the Satisfaction with Life Scale are summarized in Table 2. Intercorrelations with somatization were highest with depression (r=0.75 p<0.001), followed by the physical component summary scale of health related quality of life (r=−0.64 p<0.001), and the subscale “bodily pain” respectively (r=−0.68 p<0,001). Intercorrelation of depression was higher with the mental component summary scale of health related quality of life (r=−0.68 p<0.001) than the physical component (r=−0.48 p<0,001) compared to somatization. Two items, “feeling tired or having little energy” and “trouble falling asleep, or sleeping too much” represent shared questions between the PHQ-9 and the PHQ-15. Omitted from the somatization scale, the intercorrelation reduced from 0.75 to 0.65 (p<0.001).

Table 2 Intercorrelations of somatization, depression, life satisfaction, and health related quality of life (N=5,031)

Both somatization and depression were significantly related to life satisfaction.

The associations of the PHQ-15 scores with demographic characteristics are shown in Table 1. As hypothezised, PHQ-15 scores increased with age, and women exhibited higher scores than men. Also in accordance with the hypotheses, scores for somatization syndromes were higher in subjects with lower educational levels compared to subjects with higher educational levels. No differences were found in terms of relationship or employment status.

Normative data

Table 3 summarizes the normative data for the different age levels and both genders. Percentiles from this table can be used to compare an individual subject’s PHQ-15 score with those determined from the general population reference group based on age and gender.

Table 3 Normative data from the general population for the PHQ-15

For example, a PHQ-15 score of 11 in a 30-year-old man indicates a percentile rank of 93.4% in the total population and of 98.9% in a group of subjects of the same age and gender. Likewise, a PHQ-15 score of 11 in a 30-year-old woman corresponds to a percentile rank of 93.4% in the total population and of 94.9% in the same age and gender group.


A main result of this study was the standardization of the PHQ-15 with the provision of normative data from the general population. Given that age and gender specific comparative data were generated based on subgroups consisting of n=156 to 542 subjects each, the sample sizes were sufficient to provide normative data. Results of a standardization study of the Patient Health Questionnaire-4 (PHQ-4) on depression and anxiety, yielded that the German general population could be considered comparable to the American general population [36]. The prevalence rate of 9.3% for somatization syndromes corresponds to previous results of surveys in the general population reporting on any somatoform disorder [16] and can be considered for further exploration for the presence of the spectrum of subclinical to full somatoform disorder in clinical practice [3, 7]. Previous studies in the general population on base rates for somatoform symptoms report similar frequencies and dominance of various types of pain [9, 14]. In primary care the different pain symptoms are also the most prominent ones, accompanied by “lack of energy” and “trouble sleeping” as an indicator of exacerbation [8].

The present study, including more than 5000 subjects, gives evidence that the PHQ-15 is not only a reliable and valid self-report measure for somatization in health care settings but also in the general population. Specifically, the intercorrelations of the PHQ-15 with the PHQ-9 depression scale (r = 0.65-0.75), the SF-12 quality of life scale (r = −0.53-0.68), and the life satisfaction scale (r = −0.37) are similar to intercorrelations between these concepts in other studies suggesting further construct validity of the PHQ-15 [11]. In the original PHQ-15 validation study, which comprised of 6,000 unselected primary care patients, higher PHQ-15 scores were also strongly associated with worsening function on all six SF-20 scales - a longer version of the SF-12 used in the present study -, as well as increased disability days and health care utilization [7, 17]. The impact on the physical component scale of the SF-12 was higher for somatization than for depression. The expressed mental component scale showed higher associations with depression than somatization. The high association of somatization and depression in the present study might be partly explained by the overlap of two items in the PHQ-15 and PHQ-9 (“lack of energy”, “sleep disturbance”). Yet these results of concurrent validity are supported by a former study of the PHQ-15 in relation to depression and general mental health [37]. The comorbidity of somatic, anxiety and depressive symptoms (the “SAD” triad) is well-established [11, 20]. Still the concordance could not be found in immunological parameters, where results suggest different immune alterations in somatization syndrome and depression [38]. What is known, is that physiological activity (i.e. heart rate, tension) is high in patients with somatization and may interact with psychological processes [39].

The controverse discussion on the classifying of somatoform disorders, respectively syndromes, would have gone beyond the purpose of this study (for an overview see [18]). Although the PHQ-15 does not explicitly ask for “medically unexplained symptoms”, it is highly associated with clinician-rated somatoform disorder symptom counts [40].

Yet a potential limitation of this general population study is that it did not include standard criterion interviews, which would have allowed for calculating specificity and sensitivity for optimal cut point and construction of a receiver operating characteristic (ROC). The sensitivity and specificity of the PHQ-15, as measured by the concordance with the SCID-I diagnosis of somatoform disorders, has previously been established as 78% and 71%, respectively, in primary care [41]. Another limitation might be that normative data were not reported according to the socioeconomic status.

Reviews have identified effective behavioural and pharmacological interventions for somatoform disorders [4245], and guidelines are close to be published (e.g. S3-guideline). Reported “green flags” or prognostic factors are so far: (a) proactive coping strategies of the patient, e.g. optimism, motivation for psychotherapy; (b) healthy lifestyle, e.g. balanced diet, relaxation, exercising, and enough sleep; (c) social support; and (d) a good doctor-patient relationship with shared decision making.

Reducing the burdens and enhancing early detection of mental disorders in general requires major shifts in research, clinical practice, and public health by incorporating multidisciplinary models of intervention. The good news is that such changes are under way, as reflected, for example by the experts drafting Research Roadmaps (see for the European Union and the U.S. (see


Somatization is one of the most common issues in health care services, associated with substantial functional impairment and health care utilization. Somatization syndromes occur in 9.3% of the general population. Thus validate acquisition of somatoform symptoms is necessary in several health care settings. The PHQ-15 is a good basis for this task. Normative data for the PHQ-15 in the general population were generated for both genders and different age levels and can be used for the interpretation and comparisons with other populations.