Diagnostic utility of a one-item question to screen for depressive disorders: results from the KORA F3 study

Blozik, Eva; Scherer, Martin; Lacruz, Maria E; Ladwig, Karl-Heinz

doi:10.1186/1471-2296-14-198

Diagnostic utility of a one-item question to screen for depressive disorders: results from the KORA F3 study

Research article
Open access
Published: 23 December 2013

Volume 14, article number 198, (2013)
Cite this article

Download PDF

You have full access to this open access article

BMC Family Practice Aims and scope Submit manuscript

Diagnostic utility of a one-item question to screen for depressive disorders: results from the KORA F3 study

Download PDF

Eva Blozik^1,2,
Martin Scherer¹,
Maria E Lacruz³,
Karl-Heinz Ladwig^3,4 &
the KORA study group

2640 Accesses
15 Citations
2 Altmetric
Explore all metrics

Abstract

Background

Screening for depressive disorders in the general adult population is recommended, however, it is unclear which instruments combine user friendliness and diagnostic utility. We evaluated the test performance of a yes/no single item screener for depressive disorders (“Have you felt depressed or sad much of the time in the past year?”) in comparison to the depressive disorder module of the Patient Health Questionnaire (PHQ-9).

Methods

Data from 3184 participants of the population-based KORA F3 survey in Augsburg/ Germany were used to analyse sensitivity, specificity, ROC area, positive likelihood ratio (LR+), negative likelihood ratio (LR-), positive predictive value (PPV), and negative predictive value (NPV) of the single item screener in comparison with “depressive mood” and “major depressive disorder” defined according to PHQ-9 (both interviewer-administered versions).

Results

In comparison to PHQ-9 “depressive mood”, sensitivity was low (46%) with an excellent specificity (94%), (PPV 76%; NPV 82%; LR + 8.04; LR- .572, ROC area .702). When using the more conservative definition for “major depressive disorder”, sensitivity increased to 83% with a specificity of 88%. The PPV under the conservative definition was low (32%), but NPV was 99% (LR + 6.65; LR- .196; ROC area .852). Results varied across age groups and between males and females.

Conclusions

The single item screener is able to moderately decrease post-test probability of major depressive disorders and to identify populations that should undergo additional, more detailed evaluation for depression. It may have limited utility in combination with additional screening tests or for selection of at-risk populations, but cannot be recommended for routine use as a screening tool in clinical practice.

View this article's peer review reports

Current major depressive syndrome measured with the Patient Health Questionnaire-9 (PHQ-9) and the Composite International Diagnostic Interview (CIDI): results from a cross-sectional population-based study of adults in Germany

Article Open access 10 April 2015

Utility of the PHQ-9 to identify major depressive disorder in adult patients in Spanish primary care centres

Article Open access 09 August 2017

The diagnostic accuracy of the Patient Health Questionnaire-2 (PHQ-2), Patient Health Questionnaire-8 (PHQ-8), and Patient Health Questionnaire-9 (PHQ-9) for detecting major depression: protocol for a systematic review and individual patient data meta-analyses

Article Open access 27 October 2014

Background

Depressive disorders are a major burden for the healthcare systems worldwide leading to loss of productivity, functional decline, and increased mortality [1–6]. The daily functioning and overall health of patients with depression can be improved when patients receive appropriate therapies [7, 8]. Screening alone does not improve the health of patients with undiagnosed depressive disorders [9–12] but screening combined with patient-support programs, such as regular nurse follow-ups and close monitoring of adherence to therapy, seems to be useful [13]. Therefore, the U.S. Preventive Services Task Force recommends screening for depressive disorders in the general adult population when there are staff-assisted depression care supports in place to assure accurate diagnosis, effective treatment, and follow-up [14]. Additionally, screening for depressive disorders is recommended in populations at risk such as those with a family or personal history of depressive disorders, multiple medical problems, unexplained physical symptoms, chronic pain, or use of medical services that is more frequent than expected even if no depression care supports are available [15].

For screening purposes, different instruments exist [16]. Administering and evaluating comparatively long screening instruments can be time-consuming and it may thus be difficult to implement them in busy clinical settings [17]. Simple tests focusing explicitly on depressive disorders and without the need for additional computation on the clinician’s side seem to have the highest probability that this information is integrated into the clinical decision-making process [18]. In the context of comprehensive research evaluations long instruments may increase respondent burden [19]. This is why research teams searching for the shortest possible measure proposed and evaluated screeners consisting of one or two items [20]. Williams et al. presented a simple and easy-to-administer single item question (“Have you felt depressed or sad much of the time in the past year?”) and reported good sensitivity and less specificity as compared to the Center for Epidemiologic Studies Depression Screen (CES-D) [21] using a diagnostic SCID interview (Structured Clinical Interview for Diagnostic and Statistical Manual of Mental Disorders) as the criterion (85% vs 88% and 66% vs 75%, respectively) [11]. In contrast, Corson et al. found this single item to be specific (88%) but less sensitive (78%) when using the 9-item Patient Health Questionnaire (PHQ-9) [22] algorithm for major depression as the reference standard [20].

Given these discrepancies and given the fact that the previous studies were conducted in very specific study populations (predominantly female Hispanics or veterans in the USA), this study evaluates measures of test performance of the Williams et al. single-item screener in comparison to PHQ-9 in a population-based sample of adults from Germany. The aim of this study is to conclude on the utility of this single item screener to screen for depression in the general population.

Methods

Study design and subjects

The data stem from the city of Augsburg (Bavaria, Germany) and from surrounding districts covering about 600,000 inhabitants drawn from mixed urban and rural areas whose demographic and socioeconomic characteristics roughly reflect those of the average middle European population in general. The present analysis investigates data from the population-based KORA F3 survey conducted in 2004/05 within the framework of the ongoing KORA project (Cooperative Health Research in the Augsburg Region, Germany), a research platform for population-based health research [23]. The KORA F3 survey is a follow-up survey to the MONICA S3 survey conducted in 1994/95—at that time one cooperative centre within the worldwide WHO MONICA (Monitoring Trends and Determinants on Cardiovascular Diseases) project investigating the general and cardiovascular health of diverse populations. For the MONICA S3 survey, a stratified random representative sample of 6481 eligible subjects was drawn in 1994/95 from the population, of whom a total of 4856 subjects (response rate: 74.9%) participated in the S3 baseline survey. By the F3 follow-up study one decade later (2004/05), a total of 405 (8%) subjects had died. Furthermore, subjects were considered ineligible for inclusion in the F3 follow-up survey if they lived too far outside the study region or were completely lost to follow-up (n = 222, 5%), or had demanded deletion of their address data (n = 270, 6%). Of the remaining 3959 eligible subjects, 161 could not be contacted, 295 were unable to come because they were too ill, and 497 were not willing to participate, resulting in an interim total of 3006 participants in the F3 follow-up survey (response rate: 76% of S3 participants). Furthermore, additional efforts were made to reach those 1300 eligible subjects from the original S3 sampling frame who had not participated in the S3 baseline survey. Thus, another 178 (14%) participated in the present KORA F3 study, for a total sample size of 3184 (overall response rate: 49.12%). Written informed consent was obtained from each study participant and the study was approved by the ethics committee of the Bavarian Medical Association.

Instruments

All participants underwent a standardized face-to face interview including the Patient Health Questionnaire and the single item screener and an extensive medical examination. The interviews were performed by experienced study nurses at the KORA Study Centre, Augsburg. Before start of the study, they received an extended training program and were certified thereafter. All interviews were taped and subjected to a routine quality assessment in the KORA data centre to avoid bias. At study halftime, all interviewers were recertified. Depression was assessed in an interview version of the 9 item depression module of the Patient Health Questionnaire (PHQ-9) [24]. Patients rate the frequency of symptoms of depression over the past 2 weeks on an ordinal scale (0 = not at all, 1 = several days, 2 = more than half the days, 3 = nearly every day). The 9 items are based on the 9 DSM-IV criteria for the diagnosis of depression. The total score ranges from 0 to 27. In order to be congruent with the DSM-IV criteria, the algorithm developed and validated by Spitzer et al. was used for classification: “Major depressive disorder” was defined as having at least five questions answered with “more than half the time in the past two weeks”, of which at least one of the first two questions (little interest in doing things, feeling depressed) had to be included. Participants were labelled to have “depressive mood” when 2 to 4 questions were answered with “more than half the time in the past two weeks”, also including one of the first two questions of the PHQ-9 questionnaire [24]. PHQ-9 was used as reference standard in this study because it has been shown to have a sensitivity of 88% and a specificity of 88% for major depression compared with diagnostic SCID interviews [24] as well as concurrent validity, high internal consistency, and test-retest reliability [25].

The single item screener “Have you been depressed or sad most of the past year?” uses a yes/no response format [11]. Based on a frequently used question for medical history taking, this single item question has been developed in the context of a randomised controlled trial of case finding for depression. The sample was predominantly female and Hispanic and was recruited at family and internal medicine clinics in the United States. Consecutive patients were randomly assigned to be asked the single item screener, to fill out the 20-item (CES-D), or to usual care. Corson et al. reported a LR + of the single item screener of 6.77 and an area under the ROC of .83 (95% confidence interval (CI) .79, .87) [20]. The single item screener was administered directly in advance to the PHQ-9.

Statistical analyses

Firstly, the distribution of socio-demographic and clinical characteristics across the study sample was calculated for description of the study population. Secondly, we calculated several measures of test performance of the single item screener in comparison to the reference standard PHQ-9. This was done for the PHQ-9 “depressive mood” definition as well as for the “major depressive disorder” definition based on a 2×2 table (see Table 1). Specifically, we calculated the prevalence of persons with “depressive mood” and of “major depressive disorder”. Sensitivity (the proportion of persons having depression according to the PHQ-9 who test positive in the single item screener), specificity (the proportion of persons without the disease according to the PHQ-9 who test negative in the single item screener), receiver operating characteristic (ROC) area, the positive likelihood ratio (LR+, the probability of a person who has the disease according to the PHQ-9 and tests positive in the single item screener divided by the probability of a person who does not have the disease and tests positive), the negative likelihood ratio (LR-, the probability of a person who has the disease and tests negative divided by the probability of a person who does not have the disease and tests negative), the positive predictive value (PPV, the proportion of persons testing positive in the single item screener who have the disease), and the negative predictive value (NPV, the proportion of persons testing negative in the single item screener who do not have the disease) of the single item screener in comparison with either PHQ-9 depressive disorder definition were calculated, including 95% confidence intervals. The ROC is a graphical plot of the fraction of true positives out of the total actual positives (sensitivity) vs. the fraction of false positives out of the total actual negatives (1-specificity), at various threshold settings. The area under the ROC is a measure for test accuracy with a value of 1 representing a perfect test and an area of 0.5 representing a worthless test. These analyses were repeated stratified for age group (34–44, 45–54, 55–64, 65–74, 75–85 years) and gender (female, male), two variables known to be linked with a different prevalence of depressive disorders in the general population [26]. Additionally, the proportion of false positive test results was calculated using the PHQ-9 “depressive mood” definition. All analyses were done using STATA version 11.0 (Stata Corporation, College Station, Texas, USA).

Table 1 2 × 2 table of the single item screener using the “depressive mood” definition and the “major depressive disorder” definition of the 9-item Patient Health Questionnaire (PHQ-9) as reference standard

Full size table

Results

Table 2 depicts the socio-demographic and clinical characteristics of the study sample. The proportion of male and female participants was almost equal with all age groups included being adequately represented. 21.63% of male participants and 33.93% of female participants were categorised to have “depressive mood” according to the established PHQ-9 definition. “Major depressive disorder” was prevalent in 4.46% of men and in 8.37% of women. The prevalence of depressive disorders of either definition increased with advancing age.

Table 2 Socio-demographic and clinical characteristics of the study sample

Full size table

The prevalence of “depressive mood” increased from 20% (95% CI 17–23.3) in persons aged 34-to 44 years to 34% (29–39.4) in persons older than 75 years. Sensitivity of the single item screener was low across all age groups and genders, though it increased from 37.5% (29.1-46.5) to 52.5% (43.2-61.6) with advancing age. Specificity was >90% in all subgroups investigated, with very high values of >95% in persons younger than 55 years and in males. An area under the curve (AUC) of.702 (.685-.719) in the ROC analysis of the total sample was moderately good (Table 3).

Table 3 Prevalence and test performance of the single item screener using the “depressive mood” definition of the 9-item Patient Health Questionnaire (PHQ-9) as reference standard (95% confidence interval)

Full size table

An LR + of >10 indicates that the post-test probability of having “depressive mood” is considerably increased. LR+ > 10 have been detected in our analysis in the younger age groups and in the male study population, but not in the higher age groups or in females, resulting in a LR + of 8.04 (6.71-9.64) for the total sample. LR- indicate the ability of the single item screener to decrease the post-test probability of having “depressive mood”, the conventional cut-point being LR- < .1. LR- in our analysis ranged from 0.523 (.432-.632) to 0.809 (.555-.667) indicating no reasonable decrease in post-test probability. PPVs correspond to a probability of having “depressive mood” in the presence of a positive single item screener of >70% in all subgroups investigated. NPVs ranging from 77.8% (75.4-80.1) to 86% (82.9-88.8) relate to fairly high probability to be healthy when the single item response is negative (Table 3). The proportion of false-positive test results (single item screener positive, but no diagnosis of “depressive mood” in PHQ-9) was 130/2269, i.e. 5.7%, ranging from 3.5% in the 34–44 age group up to 8.9% in the 75–85 age group.

When using the more conservative classification of PHQ-9, 6.5% (5.6-7.4) in the total sample were identified as having a “major depressive disorder” (3.8% (2.4-5.5) in the 34–44 age group, N = 24; 10% (7.4-14.1) in the >75 age group, N = 37). In comparison to this PHQ-9 definition, the single item screener demonstrated fairly good sensitivity with 75% (53.3-90.2) in the low-prevalence age group of 34–44 up to 86.5% (71.2-95.5) in those >65 years of age. Specificity of 87.5% (86.3-88.7) in the total sample was also fairly good with comparably low specificity in those subgroups with comparably high sensitivity and vice versa (e.g. specificity of 92.2% (89.8-94.2) in the 34–44 age group and 83.3% (78.8-87.3) in the >75 age group). As compared to the “depressive mood” definition, using the “major depressive disorder” definition resulted in a significantly higher ROC area of .852 (.825-.879) (Table 4).

Table 4 Prevalence and test performance of the single item screener using the “major depressive disorder” definition of the 9-item Patient Health Questionnaire (PHQ-9) as reference standard (95% confidence interval)

Full size table

The single item screener is not useful for ruling in major depressive disorder, as the LR + in the total sample is 6.65 (5.93-7.46) and for most subgroups far away from >10. The ability of ruling out major depressive disorder is much better with a LR- of .196 (.145-.265) in the total sample. However, in none of the subgroups investigated, the LR- was < .1. Given the low prevalence of major depressive disorder (according to PHQ-9), the PPVs and NPVs as shown in Table 4 must be interpreted with care, as a prevalence of >15% is considered to be adequate for this type of analysis. Albeit, PPVs of about 30% indicate a quite low probability of having major depressive disorder in the presence of a positive single item screener (resulting a high number of false positives), whereas it is almost sure that a person does not have a major depressive disorder in the presence of a negative test result (NPV in the total sample 98.7% (98.1-99.1)).

Discussion

Interpreting the clinical meaning of the test result of a simple yes/no single item question (“Have you been depressed or sad most of the past year?”) in comparison to the 9-item PHQ instrument is complex: In the presence of a positive test result, the likelihood of the person having a clinically relevant depressive disorder is considerably increased (LR + 8.04 in comparison to PHQ-9 “depressive mood”, LR + 6.65 in comparison to PHQ-9 “major depressive disorder”). A person presenting with a positive single item screener would therefore be in need for a more detailed evaluation of depressive symptoms. In the presence of a negative test result, a major depressive disorder is relatively unlikely (LR- 0.196 in comparison to PHQ-9 “major depressive disorder”), though the presence of a major depressive disorder cannot completely excluded. However, a negative test result does only minimally decrease the likelihood of a person having depressive mood (LR- 0.572 in comparison to PHQ-9 “depressive mood”). As a result of the varying prevalence of depressive disorders across age groups and between females and males, we detected differences in test performance measures across these strata. However, the differences were not clear enough to recommend the single item for specific use in certain groups of patients.

When associating this study with previous research, our results for sensitivity (83% in comparison with PHQ-9 “major depressive disorder”) are comparable with Williams et al. [11] (85%) and slightly higher than those of Corson et al. (78%) [20]. With respect to specificity, the present study (88%) and the results of Corson et al. (88%) are concordant, both done in comparison with PHQ-9 “major depressive disorder”. However, when Williams et al. investigated the single item screener in comparison to SCID interviews specificity was considerably lower (66%). Given the fact that the PHQ-9 has been shown to have a specificity of 80-90% in comparison to SCID interviews, [24] the previous findings seem plausible.

However, poor specificity as compared to the gold standard translates into high rates of false-positive test results. There is a vivid discussion on whether current criteria for clinical diagnosis of depression are medicalising sadness [27] or whether - in contrary - there are still many people missing on life saving treatment [28]. The debate also includes whether screening for depression increases over diagnosis or whether it is an effective public health measure [14, 29]. We did not detect substantial differences in the rates of false-positives between the single item screener and PHQ-9 (5.7% of single item test results in comparison to PHQ-9 “depressive mood”). However, as stated above, we did not compare against the gold standard, and there is a considerable amount of false-positives when applying the PHQ-9 which we were not able to detect in the present study [22, 24, 25].

In comparison to PHQ-9, the main limitation of the single item screener is the relatively low ability to detect less-than-severe depressive disorders. Therefore, the utility of the single item in clinical context is very limited. It might be used as a first step of a screening procedure in combination with other, more detailed assessment instruments. For example, such a two-step screening procedures has been recommended by the American Heart Association for patients with coronary heart disease [30]. Elderon et al. evaluated this recommendation using the PHQ-2 and the PHQ-9 sequentially and found this procedure to be highly specific, poorly sensitive, but predictive of poor coronary outcomes [31]. Similar two-step screening procedures may also be applied in other settings or other patient populations.

In contrast to clinical settings, the single item screener may be helpful for selection of specific patient populations if the absence of a depressive disorder (negative test result) or the presence of a major depressive disorder (positive test result) is selection criterion and if space, time or resources for more comprehensive questionnaires are limited.

When interpreting this study, several limitations need to be considered. This is a secondary analysis of data of the large, population-based KORA cohort study which has not specifically been designed for the research question addressed in the present manuscript. SCID interviews which were not available in this project are considered to be the gold standard for diagnosing depressive disorders in research contexts. However, we used PHQ-9 as the reference standard which has been shown to have good concordance with clinical diagnosis of depression [32]. Additionally, all participants lived in Bavaria so that there may be cultural differences in the prevalence and diagnostic identification of depressive disorders as compared to other countries. Moreover, some of the persons who were eligible for the study were not willing to participate (S3 baseline survey response rate = 74.9%), and some of those who participated at baseline, had dropped out for the F3 follow-up (F3 follow-up survey response rate: 76% of S3 participants) so that selection bias cannot be excluded. However, the demographic and socioeconomic characteristics of the underlying population roughly reflect those of the average middle European population in general [23]. In addition, the reader should keep in mind that the PHQ-9 assesses depressive symptoms within the last 2 weeks, whereas the single item screener inquires about the past year. So, the PHQ-9 is in line with a diagnosis of depression according to the DSM-IV or DSM-V criteria, when the single item screener includes a global assessment of a much longer interval but does not inquire detailed aspects of depression. Another limitation is that reliability of the single item screener, e.g. test-retest performance has not been evaluated so far and should be included in future research.

Conclusions

In comparison to PHQ-9, the single item screener proposed by Williams et al. is able to moderately decrease the likelihood of major depressive disorders and to identify populations that should undergo additional, more detailed depression screening measures. However, in comparison to PHQ-9 the single item screener has a low ability to detect less-than-severe depressive disorders and can therefore not be recommended for routine use as a screening tool in clinical practice.

References

Ustün TB, Ayuso-Mateos JL, Chatterji S, Mathers C, Murray CJ: Global burden of depressive disorders in the year 2000. Br J Psychiatry. 2004, 184: 386-392. 10.1192/bjp.184.5.386.
Article PubMed Google Scholar
Broadhead WE, Blazer DG, George LK, Tse CK: Depression, disability days, and days lost from work in a prospective epidemiologic survey. JAMA. 1990, 264: 2524-2548. 10.1001/jama.1990.03450190056028.
Article CAS PubMed Google Scholar
Wells KB, Stewart A, Hays RD, Burnam MA, Rogers W, Daniels M, Berry S, Greenfield S, Ware J: The functioning and well-being of depressed patients: results from the medical outcomes study. JAMA. 1989, 262: 914-919. 10.1001/jama.1989.03430070062031.
Article CAS PubMed Google Scholar
Hays RD, Wells KB, Sherbourne CD, Rogers W, Spritzer K: Functioning and well-being outcomes of patients with depression compared with chronic general medical illnesses. Arch Gen Psychiatry. 1995, 52: 11-19. 10.1001/archpsyc.1995.03950130011002.
Article CAS PubMed Google Scholar
Covinsky KE, Fortinsky RH, Palmer RM, Kresevic DM, Landefeld CS: Relation between symptoms of depression and health status outcomes in acutely ill hospitalized older persons. Ann Intern Med. 1997, 126: 417-425. 10.7326/0003-4819-126-6-199703150-00001.
Article CAS PubMed Google Scholar
Whooley MA, Browner WS: Association between depressive symptoms and mortality in older women. Arch Intern Med. 1998, 158: 2129-2135. 10.1001/archinte.158.19.2129.
Article CAS PubMed Google Scholar
Coulehan JL, Schulberg HC, Block MR, Madonia MJ, Rodriguez E: Treating depressed primary care patients improves their physical, mental, and social functioning. Arch Intern Med. 1997, 157: 1113-1120. 10.1001/archinte.1997.00440310079008.
Article CAS PubMed Google Scholar
Katzelnick DJ, Simon GE, Pearson SD, Manning WG, Helstad CP, Henk HJ, Cole SM, Lin EH, Taylor LH, Kobak KA: Randomized trial of a depression management program in high utilizers of medical care. Arch Fam Med. 2000, 9: 345-351. 10.1001/archfami.9.4.345.
Article CAS PubMed Google Scholar
Callahan CM, Hendrie HC, Dittus RS, Brater DC, Hui SL, Tierney WM: Improving treatment of late life depression in primary care: a randomized clinical trial. J Am Geriatr Soc. 1994, 42: 839-846.
Article CAS PubMed Google Scholar
Dowrick C, Buchan I: Twelve month outcome of depression in general practice: does detection or disclosure make a difference?. BMJ. 1995, 311: 1274-1276. 10.1136/bmj.311.7015.1274.
Article CAS PubMed PubMed Central Google Scholar
Williams JW, Mulrow CD, Kroenke K, Dhanda R, Badgett RG, Omori D, Lee S: Case-finding for depression in primary care: a randomized trial. Am J Med. 1999, 106: 36-43. 10.1016/S0002-9343(98)00371-4.
Article PubMed Google Scholar
Whooley MA, Stone B, Soghikian K: Randomized trial of case-finding for depression in elderly primary care patients. J Gen Intern Med. 2000, 15: 293-300. 10.1046/j.1525-1497.2000.04319.x.
Article CAS PubMed PubMed Central Google Scholar
Wells KB, Sherbourne C, Schoenbaum M, Duan N, Meredith L, Unützer J, Miranda J, Carney MF, Rubenstein LV: Impact of disseminating quality improvement programs for depression in managed primary care: a randomized controlled trial. JAMA. 2000, 283: 212-220. 10.1001/jama.283.2.212.
Article CAS PubMed Google Scholar
U.S.A. U.S. Preventive Services Task Force: Screening for Depression in Adults. Available at http://www.uspreventiveservicestaskforce.org/uspstf/uspsaddepr.htm [website]. Accessed June 03, 2013
Whooley MA, Simon GE: Managing depression in medical outpatients. N Engl J Med. 2000, 343: 1942-1950. 10.1056/NEJM200012283432607.
Article CAS PubMed Google Scholar
Pignone MP, Gaynes BN, Rushton JL, Burchell CM, Orleans CT, Mulrow CD, Lohr KN: Screening for depression in adults: a summary of the evidence for the U.S. Preventive services task force. Ann Intern Med. 2002, 136: 765-776. 10.7326/0003-4819-136-10-200205210-00013.
Article PubMed Google Scholar
Nutting PA, Rost K, Dickinson M, Werner JJ, Dickinson P, Smith JL, Gallovic B: Barriers to initiating depression treatment in primary care practice. J Gen Intern Med. 2002, 17: 103-111. 10.1046/j.1525-1497.2002.10128.x.
Article PubMed PubMed Central Google Scholar
Gilbody S, Sheldon T, House A: Screening and case-finding instruments for depression: a meta-analysis. CMAJ. 2008, 178: 997-1003. 10.1503/cmaj.070281.
Article PubMed PubMed Central Google Scholar
Ulrich CM, Wallen GR, Feister A, Grady C: Respondent burden in clinical research: when are we asking too much of subjects?. IRB. 2005, 27: 17-20. 10.2307/3563957.
Article PubMed Google Scholar
Corson K, Gerrity MS, Dobscha SK: Screening for depression and suicidality in a VA primary care setting: 2 items are better than 1 item. Am J Manag Care. 2004, 10: 839-845.
PubMed Google Scholar
Weissman MM, Sholomskas D, Pottenger M, Prusoff BA, Locke BZ: Assessing depressive symptoms in five psychiatric populations: a validation study. Am J Epidemiol. 1977, 106: 203-214.
CAS PubMed Google Scholar
Kroenke K, Spitzer RL, Williams J: The PHQ-9. Validity of a brief depression severity measure. J Gen Intern Med. 2001, 16: 606-613. 10.1046/j.1525-1497.2001.016009606.x.
Article CAS PubMed PubMed Central Google Scholar
Holle R, Happich M, Löwel H, Wichmann HE, MONICA/KORA Study Group: KORA–a research platform for population based health research. Gesundheitswesen. 2005, 67: S19-S25. 10.1055/s-2005-858235.
Article PubMed Google Scholar
Spitzer RL, Kroenke K, Williams JB: Validation and utility of a selfreport version of PRIME-MD. JAMA. 1999, 282: 1737-1744. 10.1001/jama.282.18.1737.
Article CAS PubMed Google Scholar
Kroenke K, Spitzer RL: The PHQ-9: a new depression diagnostic and severity measure. Psychiatric Ann. 2002, 32: 509-515.
Article Google Scholar
Wittchen HU, Jacobi F, Rehm J, Gustavsson A, Svensson M, Jönsson B, Olesen J, Allgulander C, Alonso J, Faravelli C, Fratiglioni L, Jennum P, Lieb R, Maercker A, van Os J, Preisig M, Salvador-Carulla L, Simon R, Steinhausen HC: The size and burden of mental disorders and other disorders of the brain in Europe 2010. Eur Neuropsychopharmacol. 2011, 21: 655-679. 10.1016/j.euroneuro.2011.07.018.
Article CAS PubMed Google Scholar
Parker G: Is depression overdiagnosed? Yes. BMJ. 2007, 335: 328-10.1136/bmj.39268.475799.AD.
Article PubMed PubMed Central Google Scholar
Hickie I: Is depression overdiagnosed? No. BMJ. 2007, 335: 329-10.1136/bmj.39268.497350.AD.
Article PubMed PubMed Central Google Scholar
U.K. National Institute for Health and Care Excellence: NICE Guidance on Depression in Adults. Available at: http://publications.nice.org.uk/depression-in-adults-cg90/guidance#step-1-recognition-assessment-and-initial-management [website]. Accessed June 03, 2013
Lichtman JH, Bigger JT, Blumenthal JA, Frasure-Smith N, Kaufmann PG, Lespérance F, Mark DB, Sheps DS, Taylor CB, Froelicher ES, American Heart Association Prevention Committee of the Council on Cardiovascular Nursing; American Heart Association Council on Clinical Cardiology; American Heart Association Council on Epidemiology and Prevention; American Heart Association Interdisciplinary Council on Quality of Care and Outcomes Research; American Psychiatric Association: Depression and coronary heart disease: recommendations for screening, referral, and treatment: a science advisory from the American heart association prevention committee of the council on cardiovascular nursing, council on clinical cardiology, council on epidemiology and prevention, and interdisciplinary council on quality of care and outcomes research: endorsed by the American psychiatric association. Circulation. 2008, 118: 1768-1775. 10.1161/CIRCULATIONAHA.108.190769.
Article PubMed Google Scholar
Elderon L, Smolderen KG, Na B, Whooley MA: Accuracy and prognostic value of American heart association: recommended depression screening in patients with coronary heart disease: data from the heart and soul study. Circ Cardiovasc Qual Outcomes. 2011, 4: 533-540. 10.1161/CIRCOUTCOMES.110.960302.
Article PubMed Google Scholar
Kendrick T, Dowrick C, McBride A, Howe A, Clarke P, Maisey S, Moore M, Smith PW: Management of depression in UK general practice in relation to scores on depression severity questionnaires: analysis of medical record data. BMJ. 2009, 338: b750-338. 10.1136/bmj.b750.
Article PubMed Google Scholar

Pre-publication history

The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2296/14/198/prepub

Download references

Acknowledgements

The KORA research platform and the MONICA/KORA Augsburg studies are financed by the Helmholtz Zentrum München, German Research Center for Environmental Health, which is funded by the German Federal Ministry of Education, Science, Research, and Technology (Berlin, Germany), and by the State of Bavaria.

An abstract of a previous version of this manuscript has been presented orally at the German Congress for Health Services Research (Deutscher Kongress für Versorgungsforschung) taking place at Bonn, Germany in October 2010.

Author information

Authors and Affiliations

Department of Primary Medical Care, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
Eva Blozik & Martin Scherer
Institute of Social Medicine, University of Lübeck, Lübeck, Germany
Eva Blozik
Helmholtz Zentrum München, German Research Center for Environmental Health, Institute of Epidemiology-II, Neuherberg, Germany
Maria E Lacruz & Karl-Heinz Ladwig
Department of Psychosomatic Medicine and Psychotherapy, Klinikum rechts der Isar, Technische Universität München, Munich, Germany
Karl-Heinz Ladwig

Authors

Eva Blozik
View author publications
You can also search for this author in PubMed Google Scholar
Martin Scherer
View author publications
You can also search for this author in PubMed Google Scholar
Maria E Lacruz
View author publications
You can also search for this author in PubMed Google Scholar
Karl-Heinz Ladwig
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

the KORA study group

Corresponding author

Correspondence to Eva Blozik.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

EB and MS designed the study and drafted the manuscript. EB performed the statistical analyses. MEL and KHL reviewed the study design and helped to interpret data and to draft the manuscript. All authors read and approved the final manuscript.

Eva Blozik, Martin Scherer contributed equally to this work.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Blozik, E., Scherer, M., Lacruz, M.E. et al. Diagnostic utility of a one-item question to screen for depressive disorders: results from the KORA F3 study. BMC Fam Pract 14, 198 (2013). https://doi.org/10.1186/1471-2296-14-198

Download citation

Received: 19 June 2013
Accepted: 12 December 2013
Published: 23 December 2013
DOI: https://doi.org/10.1186/1471-2296-14-198

Diagnostic utility of a one-item question to screen for depressive disorders: results from the KORA F3 study