Background

Mental health disorders, including anxiety and post-traumatic stress disorder (PTSD) are among the leading contributors to global disability adjusted life years, comprising five of the top twenty contributing disorders [1]. The World Health Organization International Classification of Disease (ICD-11) defines anxiety as a disorder in which there is an extreme and excessive focus on an “anticipated threat” and defines PTSD as a disorder that results from exposure to one or more “horrific events”, both of whose symptoms include apprehension, motor tension and autonomic overactivity [2]. In 2017, it was estimated that over 264 million people experienced an anxiety disorder, with the global prevalence for both anxiety disorders and PTSD ranging from 2.5 to 7% by country [2,3,4]. Both anxiety and PTSD are widespread common mental disorders (CMDs) that have been shown to cause significant negative health outcomes within various populations and contribute to a large portion of the global disease burden [5, 6]. There are noteworthy discrepancies in quality of life between people diagnosed with anxiety and/or PTSD and those who are not diagnosed with either, such as increased years lived with disability and decreased life expectancy [7,8,9]. Additionally, there is evidence suggesting that the presence of an anxiety disorder or PTSD increases the likelihood of comorbidity with other severe health conditions, such as major depressive disorder and substance use disorder [10, 11].

Anxiety and PTSD in low to middle income countries (LMICs) are highly prevalent and require further study given that access to care is hindered by availability and stigma [12,13,14]. Prevalence of these disorders is higher within LMICs; roughly 83% of people with mental illnesses globally are living within LMICs [15]. In many LMICs, there is no robust mental healthcare system in place and the number of mental health professionals is sparse [16]. Assessment and diagnosis of psychiatric illnesses thus often falls to primary care and general practitioners who have little training in mental health [16]. Use of brief screening tools have been proposed as a way to improve identification and management of mental health problems, and may be useful in LMICs, especially among populations with elevated risk (e.g., pregnant women, refugees/displaced persons, and youth) within LMIC communities [17,18,19].

Despite multiple screening instruments for CMDs, there are significantly fewer screening instruments for anxiety and PTSD that have been validated in LMIC populations. Screening instruments that have been validated exclusively in high-income countries may not perform equivalently in LMIC populations, as anxiety and PTSD often present differently in different cultural contexts. For example, in sub-Saharan Africa, anxiety and PTSD are described through somatic symptoms as well as spiritual descriptions [20]. Furthermore, differences in clinical presentation may render screening tools less accurate in LMICs. Thus, optimum cut-off scores validated in high income populations may not apply in LMIC populations. For instance, in a sample of 75 participants from Tajikistan [21], the optimal cut-off of 1.88 for the Harvard Trauma Questionnaire (HTQ), a measure of PTSD, was substantially lower than the standard cut-off score of 2.5 that has been recommended in previous studies in high-income countries [22]. Failure to apply suitable cut-off scores may lead to an imbalance of positive and negative screening results. If chosen cutoffs are too high, actual cases of anxiety and PTSD may not reach the threshold for further assessment and diagnosis; thus, cases will be missed. Conversely, if chosen cutoffs are too low, there may a very large number of positive screens requiring substantial resources for further assessment, and healthcare systems may not be able to manage the load.

Although there has been an increasing interest in studying mental health within LMICs, there are still large gaps related to screening tools to assess mental health disorders, especially anxiety and PTSD. The most recent systematic review investigating screening tools for CMDs in LMICs was published in 2016 [23]. Of the 273 validations included, 236 were validated tools for CMDs or depressive disorders while only 24 and 13 validated tools for anxiety and PTSD, respectively. Therefore, the objective of this study was to conduct a systematic review of screening tools for anxiety and PTSD within LMIC populations.

Methods

Aim: To validate screening tools for anxiety disorders and PTSD in LMICs.

We published a study protocol in advance in the PROSPERO registry (CRD42019121794).

Search strategy and study selection

We systematically searched four databases (MEDLINE, EMBASE, Global Health and PsychINFO) from inception to April 22, 2020 (see Fig. 1).

Fig. 1
figure 1

Search strategy

Inclusion criteria

Our eligibility criteria required that studies: (1) screen specifically for anxiety (generalized anxiety disorder or anxiety disorders not otherwise specified) and/or PTSD; (2) provide estimates of sensitivity and specificity for a given cut-off value for one of the included disorders; (3) were conducted in a LMIC (based on the World Bank Classification) [24]; and (4) compare screening results to a validated reference standard. Reference standards included unstructured clinical diagnostic interviews as well as structured clinical interviews including the Mini International Neuropsychiatric Interview (MINI and MINI-KID) [25], Structured Clinical Interview for DSM (SCID, SCID-1 and NetSCID) [26, 27], Composite International Diagnostic Interview (CIDI and CIDI-PHCV) [28], Clinical Interview Schedule-Revised (CIS-R) [29], Psychiatric Assessment Schedule (PAS) [30], Kiddie Schedule for Affective Disorders and Schizophrenia (K-SADS and K-SADS-PL) [31] and Clinician-Administered PTSD Scale (CAPS and CAPS-5) [32, 33]. LMIC populations residing in a LMIC at the time of study were included. No search restrictions were put on age, gender or comorbidities.

Exclusion criteria

We excluded papers that did not report sensitivity, specificity and cut-off value; that were not published in English; and that involved populations originally from an LMIC residing outside a LMIC at the time of the study. Persons from an LMIC residing in another LMIC at the time of the study were included (e.g., refugee populations and displaced persons).

Literature review

Abstracts returned from the search were reviewed separately by two independent reviewers for inclusion, with any discrepancies resolved by discussion and use of a third senior reviewer as needed. For abstracts meeting inclusion criteria, full-text articles were retrieved and reviewed by two separate reviewers for final inclusion, with discrepancies resolved by discussion and use of a third senior reviewer as needed. We also searched the reference lists of relevant systematic reviews for additional articles to add to our full-text review.

Quality appraisal

To assess study quality, we used a modified version of Greenhalgh’s ten item checklist previously used in a study by Ali et al. [23] Elements of the quality checklist are provided in Fig. 2. Credit was given for translation if a previously validated translated version of the tool or reference standard was used, or if the tool was administered in English. Studies of ‘very good’ quality fulfilled all the quality criteria. Studies deemed ‘good’ quality fulfilled criteria 1 through 3 in addition to at least one other criterion from 4 to 5. ‘Fair’ quality studies did not avoid work-up bias and ‘acceptable’ quality studies did not perform receiver operating characteristic curve (ROC) analysis to determine a normal range from the results. ‘Unblinded’ studies include studies that reported the interviewers were not blinded to the screening results; if the study did not specify whether the screening tool administrators and interviewers were blinded to each other’s results, we considered it unblinded but clarified this designation was unconfirmed.

Fig. 2
figure 2

A modified Greenhalgh’s ten item checklist, adapted from Ali et al. [23]

Data abstraction and analysis

Numerical data was abstracted by one reviewer and checked by a separate reviewer to ensure quality extraction. Data abstraction sheets included extraction of the screening tool and disorder, number of participants, DSM version, screening tool administrator, language, region, population study characteristics and age, country, gold standard, area under the curve (AUC), cut-off score, sensitivity and specificity. If multiple screening tools and/or cut-offs were used, data was extracted for each cutoff, for each tool, separately. If values were split by population, the value most representative of the total was chosen (e.g., community values for data split by hospital inpatient unit). If multiple cut-offs were given without AUC, we extracted the set of values for the cutoff that maximized Youden’s J [34]. Results were presented separately by disorder, screening tool and cut-off value. As anxiety and depression were combined in many screening tools, a third category of mixed anxiety and depression was included.

For validations of screening tools for the same disorder that used identical cut-off values, bivariate random-effects meta-analytic models were fitted to provide estimates of pooled sensitivity and specificity for the cut-off value.

Results

Study selection

Of 6322 unique citations identified from the database search, 6188 were excluded after title and abstract review and five additional papers from the reference lists of relevant systematic reviews were added. Of 140 included for full-text review, 81 were excluded, leaving 59 eligible articles inclusive of 77 screening tools (see Fig. 3). The most common reasons for exclusion were not screening for the disorder of interest, not comparing to a gold standard, and failing to provide either sensitivity/specificity data or a threshold for screening.

Fig. 3
figure 3

Flow chart of study selection

Quality appraisal

Two studies met all the criteria of the modified Greenhalgh’s ten item checklist and deemed ‘very good’ quality while 20 studies were deemed to be ‘good’ quality, due to lack of reporting the confidence intervals for sensitivity, specificity or AUC. Two studies were ‘fair’ quality for not avoiding work-up bias and five were deemed ‘acceptable’ for failing to perform ROC analysis. A total of 29 studies were labelled ‘unblinded’ for failing to specify if they blinded the researchers or for explicitly stating they were not blinded (see Table 1).

Table 1 Quality rating statistics

Description of included studies

The final 59 studies selected included a total of 77 screening tools. There were 46 validations of screening tools for anxiety disorders, 19 for PTSD and 12 for anxiety and depression (see Table 2).

Table 2 Screening tool validation by disorder category

A minority of studies accounted for children and adolescent validations (10) despite a relatively young demographic present in LMICs [35]. The majority of validations studied adults (36), with a select few including adolescents and adults (6) (see Table 3). Particularly well-represented groups included the general population and clinical outpatients (13), perinatal populations (6), psychiatric patients (7) and those with another psychiatric comorbidity (7) (see Table 3). Of the 19 validations for PTSD, only four studied children and adolescents.

Table 3 Distribution by age a population characteristic

The majority of screening tool validations were in Asia (35) followed by Africa (20), the Americas (5) and Europe (1) (see Table 4). The best represented regions include South and West Asia, as well as South and East Africa, with a noticeable gap in Middle and Northern Africa. There were no studies from the Oceanic region.

Table 4 Number of Studies by Region and Country

The most commonly used tools to screen for generalized anxiety disorder were the Kessler-10 (K-10) and the Generalized Anxiety Disorder-7 item scale (GAD-7), totaling seven and six validations respectively. The Hopkins Symptom Checklist-25 item scale (HSCL-25), Hospital Anxiety and Depression Scale (HADS) and Hospital Anxiety and Depression Scale anxiety subscale (HADS-A) were validated almost equally while the majority of tools only had one validation (see Table 5). PTSD had far fewer validations (19) with a wide range of tools receiving between one and three validations, similar to the screening tools validated for both anxiety and depression.

Table 5 Screening Tool by Disorder and Number of Validations

Each included study is listed in Table 6 by region, screening tool and study quality with the respective sensitivity, specificity and cut-off for each disorder. Continentally, Asia had the most validations (35) and the majority of studies were considered unblinded (29). Due to incomplete reporting, we could meta-analyze results from only two studies, which involved the GAD-7; using a cut-off ≥10; sensitivity = 76%, specificity = 64%.

Table 6 Included studies listed by continent, sub-region, screening tool/disorder and quality

Discussion

This review aimed to examine the screening tools that have been validated to detect anxiety and PTSD in LMICs. The most commonly validated tools were the K-10 and GAD-7 for anxiety and the HTQ and the Posttraumatic Diagnostic Scale (PDS) for PTSD. It is difficult to recommend one screening tool for anxiety and PTSD respectively, as various tools and cut-off values were tested, and sensitivities and specificities varied based on region, country and screening tool. Indeed, only two studies tested the same tool using the same cut-off value and reported sufficient information to allow us to quantitatively synthesize the results. Locally validated screening tools for anxiety and PTSD need further evaluation in well-designed studies to assess whether they can improve the detection and management of these common disorders.

A total of 46 validated screening tools were found for anxiety disorders. The most common tool used to screen for anxiety disorders was the Kessler-10 followed by the GAD-7, which had wide ranges of sensitivities (57–94%) and specificities (53–94%) varying by region and sample size. While previously the HADS-A was recommended [23], our updated review found that it was not as widely validated as the GAD-7 and Kessler-10, although it had consistent specificities (72–79%) with a range of sensitivities (38–86%). The Kessler may have an added time-efficiency component, as it is possible to screen for multiple common mental disorders, whereas screening tools such as the HADS-A target anxiety specifically. The GAD-7 reported some of the highest sensitivities for detection of generalized anxiety disorder. Other anxiety disorders, including agoraphobia, panic disorder and social anxiety disorder were less commonly validated. Our results are consistent with a previous systematic review [23] and indicate using the GAD-7, K-10 or HAD-A yield good sensitivities and specificities while taking population-specific characteristics into account. Future research is needed to validate screening tools for these anxiety disorders in more regions.

The number of validations for PTSD increased from 10 to 19 since 2013 [23]. The HTQ and PDS were the most commonly validated tools for PTSD, and sensitivities were generally high. Our findings add that in addition to the previously recommended HTQ, the PDS should be considered in screening for PTSD [23]. Unfortunately, many tools were validated only once, preventing our combining them for analytic purposes. Only four PTSD validations describe children and adolescents, despite recent events that have displaced thousands of youth [93]. The prevalence of PTSD remains high in LMICs and is expected to rise given increasing civil unrest and war [19, 94]. The year 2018 saw the highest recorded number of displaced persons globally leading the authors to emphasize more attention into detection and treatment of PTSD [95].

Anxiety and depression had the fewest validations across our search [11] though were not the target of our validation given the existing literature on depression alone [23]. All tools with the exception of the HSCL-25 had only one validation. The only independently developed screening tool of all the studies was for anxiety and depression, developed in Zambia. These disorders commonly occur together, and further research is needed to determine which tools are best suited to a region’s mental health screening needs.

We searched four databases with a robust library of psychiatric publications available. We also placed minimal exclusion criteria on our searches so as to maximize the number of studies returned, and we additionally reviewed relevant systematic reviews for additional relevant papers. At every stage of the process from title/abstract screen to data abstraction, two reviewers assessed each article and numerical data point to reduce human error. Our search strategy and protocol were published in PROSPERO and were not altered from the time of submission, with the exception that we did not calculate diagnostic odds ratios (DORs), as they provide no guidance to clinicians on what screening tool and cut-off threshold would be most appropriate to use in clinical practice. Rather, we reported sensitivity and specificity of each screening tool and cutoff separately, to better describe the accuracies of individual tools and cut-offs.

Our extraction was limited by the individual papers’ specific data reporting. Varying prevalence of an individual study may affect the cut-off score, sensitivity and specificity of screening tools, and some studies did not publish prevalence. Providers should reference the prevalence of each specific disorder to ascertain whether the cut-off is applicable to their respective population. The majority of studies did not provide sensitivities and specificities for multiple cut-off values. Reporting multiple cut-off values and their respective sensitivity and specificity estimates would allow providers to decide which cut-off they would choose to optimize screening for their setting. A lower cut-off with a higher sensitivity may be desired if cases are not to be missed and false negatives reduced. A higher cut-off with a higher specificity may be desired if false positives are to be minimized. Furthermore, reporting multiple cut-off values and their respective sensitivity and specificity estimates would also allow researchers to better synthesize accuracy results across multiple studies in meta-analysis. In the present study, only two validations with identical cut-off scores for the GAD-7 could be combined via meta-analysis as no other validations of the same disorder with identical cut-off values provided sufficient information to conduct a meta-analysis (i.e., 2 × 2 table numbers). Studies used various versions of the DSM and ICD. While the symptomatology for psychiatric diagnoses have not changed significantly, providers should reference which version was used when conducting the validation of the screening tool (see Table 6).

Our review was also limited by the available publications on mental health screenings in LMICs. The entire region of Middle and North Africa, constituting over 300 million people, was not represented by a single validation while other regions such as South-East Asia were fairly well-represented. Cultural and linguistic factors may influence screening tool validation yet further discussion may be best served for individual validation papers. Most studies were rated in the lowest quality category of the modified Greenhalgh scale as they were unblinded, or downgraded to unblinded due to incomplete reporting. This is a severe limitation in the design of studies that may impact validation results; future studies should ensure adequate blinding in addition to the remainder of the quality checklist.

Our study did not look at CMDs or depression specifically, although we did consider anxiety and depression when screened for together. We chose to focus on anxiety and PTSD as they are less well-represented in the realm of LMIC validated screening tools. Additionally, anxiety and PTSD are becoming more important with the current displacement of millions of people due to civil unrest, socioeconomic upheaval and war.

The number of validated screening tools for mental health disorders as a whole has increased since 2013 [23]. However, no large increase in the number of validations for specific disorders was seen, and most screening tools from our search were validated only once. We advise researchers and providers to refer to Table 6 for a summary of validations for locations and disorders of interest and to use this table to identify their region of interest, find their disease focus of interest, and then identify what tools have been identified by the highest quality evidence.

Conclusions

Mental health disorders are highly prevalent yet are frequently stigmatized and disregarded as medical diseases. Validated screening tools for anxiety and PTSD in LMIC have made considerable progress, with validations for both disorders almost doubling since the prior systematic review completed in December 2013 [23]. The increase in validated screening tools generally followed a regional pattern, with more emerging in countries already represented. For example, more tools have been validated in South Africa without an increase in validations in Botswana, Lesotho, Namibia or Swaziland. Middle and Northern Africa were also not well-represented by either anxiety or PTSD screening tools. The authors recognize that it may be near impossible to validate screening tools in areas of intense conflict and instability but acknowledge the need to evaluate screening tools in these areas.

The age distribution among screening tools was heavily biased towards the adult population. Children and adolescents accounted for only four of 19 validations for PTSD and six of 58 for anxiety and anxiety and depression. Given that age is skewed towards a younger population in LMICs [35], it is imperative that more research focuses on identifying anxiety and PTSD disorders in a pediatric population, especially in areas of increased civil war and conflict.

Use of brief screening instruments can bring much needed attention and research opportunities to various at-risk populations in LMICs. Many screening tools for anxiety and PTSD have been validated in LMICs, but there remain regions and subgroups of individuals for which more research is needed. Locally validated screening tools for anxiety and PTSD should be further evaluated in clinical trials to determine whether their use can reduce the burden of disease.