Background

While depression has been cited as a prominent and growing contributor to the global burden of disease [1], the construct and experience of depression may differ cross-culturally, with extensive variation in the expression of emotions, prominence of somatic symptoms and experience of and coping with suffering [2]. Despite this variation, evidence suggests that “depressive states can be studied as a feature of local forms of suffering” across the globe [2] and that symptoms associated with depression are present in all cultures [3]. The universality of depressive conditions combined with cross-cultural variation in the experience of depression mean that their identification and treatment are not culturally neutral. Strategies and tools that are culturally valid are essential.

The integration of mental health services into primary care is a recommended approach to address a large and growing gap in treatment [4, 5]. In the time-and resource-constrained context of primary care, identifying appropriate measures is needed to improve the use of limited resources and access to care [6].

Vietnam is a lower middle-income country with a diverse population of approximately 90 million people [7] and 54 distinct ethnic groups [8]. Over 4 million Vietnamese also live abroad [9]. Mental health services in Vietnam have been largely concentrated in tertiary psychiatric facilities and have focused on schizophrenia and epilepsy [1012]. The availability of trained mental health practitioners in Vietnam is very low, with approximately 1 per 100,000 persons [13]. Primary health care service delivery is provided by 10,750 commune health stations that operate on a catchment system and staff approximately 47,000 primary care providers throughout the country [13]. The nature and scope of services and capacity in the primary care sector varies throughout the country. The government of Vietnam launched the National Community Mental Health Care (CMHC) project in 2001, with the goal of integrating mental health services into primary care. In practice, the CMHC emphasizes the provision of medications for patients with mental disorders although delivery and resources vary throughout the country [13]. Although depressive disorders are prevalent in Vietnam [11, 14, 15] evidence about their treatment is limited, suggesting that little has been done to build treatment capacity, especially at the primary care level. The government has recently shown interest in improving and expanding community-based services for depression [16].

This scoping review examines measures of depression that have been used in Vietnam or among the Vietnamese diaspora. The review assesses the validity of the measures as reported in the literature, taking into account the whether their cultural validity has been tested, the context in which they were validated or used, and their appropriateness for use in primary care with the objectives of: (1) making recommendations regarding the use of depression measures in primary care with Vietnamese populations and cross-culturally; (2) identifying gaps in the existing evidence.

Review

Scoping reviews are appropriate to map “the relevant literature in the field of interest”, allowing for findings to be summarized and gaps identified [17]. Scoping reviews are rigorous, with methods that allow for replication, but findings are not synthesized or aggregated to the extent customary in systematic reviews. They allow for the inclusion of diverse study designs and involve an iterative search process where search terms may evolve during the review [17]. This method allows for identifying a broad range of studies.

We searched PubMed, PsychInfo and IDRC databases on June 12th and June 16th 2014. Search terms included: depression, mental disorder and mental health; scales, measures and screening; Vietnam and Vietnamese; we excluded ‘veterans’ given the many studies on American veterans [18]. Studies were included that validated or used measures deemed appropriate for use in general populations. The review included studies that explicitly validated instruments and those that used instruments in research, as several studies did not validate the instruments but nonetheless produced findings relevant to this review. Given the limited volume of research in this area, the search was not limited by date of publication.

Figure 1 shows the search and selection processes undertaken in this review. The database search identified 91 articles, which was reduced to 78 after removing duplicates. Initial screening eliminated 6 additional titles. Seventy-two full-text articles were reviewed for eligibility, with 44 then excluded. Studies were excluded because they: used data from the Vietnam Era Twin Study and were not related to Vietnamese populations; focused on neurological disorders or genetic markers; focused on specific rather than general populations or measures (e.g. scales to measure post-partum depression); or were not related to depression. Twenty-eight articles were first included in the review, with 11 measures initially identified. One measure and its source paper were excluded post hoc, since it compared mental health status in immigrants in Europe without discussing considerations specific to the measure or its use in Vietnamese populations [19]. The final set was 27 papers, with 10 measures identified.

Fig. 1
figure 1

PRISMA diagram: search and selection process using Pubmed, PsychInfo and IDRC databases

Validation of psychometric measures is a broad methodological domain, with validity referring to “how well one can legitimately trust the results of a test as interpreted for a specific purpose” [20] or as “the ability of the instrument to measure the attributes of the construct under study” [21]. Validity typically addresses construct, content and criterion validity, and measures of reliability [20, 21]. Considerations such as semantic equivalence, which examines linguistic and idiomatic equivalence [22, 23], may also be included. Construct validity is the “degree to which an instrument measures the construct it is intended to measure” [21]. Content validity refers to whether the items in a measure reflect the full range of characteristics of the construct being examined. Criterion validity examines how a measure performs in relationship to another instrument or test. Finally, reliability examines the consistency with which a measure accurately identifies the construct of interest. Reliability is necessary but not sufficient for validity [20, 21]. Cook and Beckman [20] recommend that conclusions about an instrument’s validity should be based on several data types. The studies we reviewed employed various methods, some more comprehensive than others, to draw conclusions about validity.

Socio-cultural context is an essential consideration when assessing and understanding depression. This review includes measures that were developed for use in Western countries and subsequently applied with Vietnamese populations. We consider both the studies’ statistical results and qualitative observations about the appropriateness and cross-cultural validity of the measures in the context of the populations studied. We also consider factors influencing the appropriateness of measures for use in primary care. While the included studies did not all take place in primary care settings, characteristics of the measures, including the instrument’s length and accessibility, were identified as relevant to the measures’ appropriateness for use in such settings. Several measures assess common mental disorders (CMD) including depression, anxiety and somatic symptoms using sub-scales within one instrument. This analysis emphasizes the measurement of depression symptoms, but as symptoms of CMD often co-occur, attention is also given to the role that other symptoms might play in the Vietnamese experience of depression. Several studies validate or use more than one measure of depression. A list of the measures and their characteristics is found in Table 1.

Table 1 Overview of depression measures

Results

Population

The studies reviewed measures used in Vietnam and among populations of Vietnamese living abroad. In several cases, different studies present data from the same study sample, while some present data from more than one study site or sample. Of the studies undertaken in Vietnam, seven present data from the north [2430], five from the central provinces [6, 14, 24, 27, 31], and eight from the south [6, 15, 24, 27, 3135]. In addition to studies conducted among general adult populations, two studies focused on older adults [35, 36], one on youth [14], five on mothers [24, 25, 27, 30, 37], one on men [28] and one on ‘Amerasians’, who are identified as a vulnerable population due to discrimination [38]. Of the studies taking place outside of Vietnam, one was in Taiwan among Vietnamese women [39], two were in Australia among adult immigrants and refugees [34, 40], and eight were in the United States (US), mostly with refugees or immigrants [38, 4147]. One study identified the study population as Vietnamese Americans [47].

Validation

The detailed results of the review are presented in Table 2. Of the ten measures found, eight were explicitly validated, including the: CES-D, CIDI, DASS, GHQ, iHSCL, PVPS, SRQ-20 and the VDS. Four of the measures were validated by one study each, while the iHSCL was validated by three studies (all in the US), the PVPS was validated by two (one in Australia and one in south Vietnam), the SRQ-20 was validated by three studies (in north, central and south Vietnam), and the VDS was validated by four (three in the US, one in south Vietnam). Of the validated measures, only five were validated for content validity (CES-D, CIDI, iHSCL, PVPS, SRQ-20), while seven were validated for construct validity (CIDI, DASS, GHQ, iHSCL, PVPS, SRQ-20, and VDS). Six measures were assessed for criterion validity (CIDI, DASS, iHSCL, PVPS, SRQ-20, and VDS) and four were tested for reliability (CES-D, DASS, PVPS, SRQ-20). The results of content, construct and criterion validity and reliability testing as assessed by each study are described in Table 2.

Table 2 Reported results on validity and reliability (unless otherwise reported, all values are for depression subscales)

Characteristics of measures

While most studies did not assess the acceptability of the measures for use in primary care, the characteristics of the measures, including their brevity and accessibility, can help to inform decisions about their use in such settings. Of the validated measures, four were explicitly developed for use in non-clinical or community settings (DASS, GHQ, iHSCL and SRQ-20). The CES-D was developed for use in epidemiological studies, while the CIDI requires certification by the WHO-WMH, but training is unavailable in Vietnamese [48]. The PVPS is 53 items long, making it lengthy for use in time-sensitive settings. In the US contexts in which it was used, the iHSCL is described as easy to use, with clear language, and acceptable to target populations [43, 46] Based on its use in Vietnam, the brevity of the SRQ-20, the ease of administration and training, and cost-effectiveness are described as benefits of the measure [6, 31].

Discussion

While there are numerous methods of assessing validity, definitions of “validation” and its associated methodology vary [20]. For the purposes of this review, we have taken into account the following criteria when assessing the extent to which validity of measures for use in primary care among Vietnamese populations has been tested: the extent to which a measure has been validated, including testing for construct and content validity in addition to criterion validity and reliability; the populations in which measures were tested, including the diversity of the populations and whether the measures were tested in Vietnam or among the diaspora; the length of the measures and ease of use.

While the review identified 11 studies that explicitly validated at least one measure of depression, the methods used and criteria on which conclusions were drawn were variable. While some studies reported minimally on content validity and reliability [47], others applied extensive testing [40]. The potential variation in cultural meaning and experience of depression in Vietnamese populations is an important issue that requires further study [40, 41, 49]. Consequently, testing only criterion validity and reliability does not assess whether depression as defined by the measure appropriately captures the culture-specific construct of depression. In studies that validate Western-developed measures for use in Vietnamese populations, assessing content and construct validity are critical aspects in determining if a measure is in fact valid. Many studies in this review examined the validity of measures in terms of their ability to determine caseness, but did not test the construct validity of the measure. Stratton et al.’s [6] psychometric evaluation of the Self Reporting Questionnaire-20 (SRQ-20) was a response to this gap in the literature. In addition, when criterion validity was assessed it was often based on comparison with international psychiatric diagnostic assessments [30, 40, 45] that have not been validated for use in Vietnam. The absence of somatic symptoms in these measures was associated with underreporting of depression in Vietnamese subjects [33, 50]. Assessing validity based on comparisons with these measures may be problematic.

The review included two measures that were developed for use among Vietnamese populations [40, 51] and one that was adapted for use among South Asian populations [41, 50], raising the question of whether indigenously derived measures are superior to adapted Western measures. The findings of this review suggest that a balance of culture-specific and universal constructs is essential for identifying depression in Vietnamese populations [41, 50]. Whether this can be accomplished sufficiently by modifying Western measures is debatable. In their work on depression in Zimbabwe, Patel et al. [52] found that Western derived measures could be used cross-culturally with sufficient attention to the translation of culture-specific concepts. All instruments would benefit from extensive evaluation of content and construct validity prior to being used in new populations.

This review includes studies that took place in Vietnam, or which took place among the Vietnamese diaspora. There are sizeable Vietnamese populations living in many countries, whose experience as migrants distinguishes them from Vietnamese living in Vietnam. The studies included in this review focused almost exclusively on immigrant or refugee populations and did not explicitly validate the measures for use among populations of Vietnamese descent living in these countries. While Vietnamese diaspora populations would certainly benefit from culturally appropriate measures of depression in primary care, consideration of the diversity of experience among these populations should be made when selecting appropriate measures. Vietnam itself is a diverse country with linguistic and cultural differences between regions, among rural and urban populations and among the diverse minority ethnic populations. For this reason, the validation of measures within specific populations should be considered before use. Many measures were only validated in one region of Vietnam or among one population sub-group. This should be considered when selecting measures for use in the country. Notably, no studies validated measures among minority population groups. This is a significant gap and additional research to validate measures for use in these populations is required.

Special considerations are also required for the use of depression measures in primary care environments, which often involve time and resource constraints and rely on non-specialist health staff. Measures must be brief, easy to use and to interpret, cost-effective and accessible. The PVPS, while extensively validated and based on a culturally relevant construct of CMD, is 53-items long with a 26-item depression subscale, making it lengthy for use in primary care environments.

Drawing definitive conclusions regarding the most appropriate measures for use among Vietnamese populations is difficult. The iHSCL may be most appropriate for use among immigrant or refugee populations. It includes 25 items, with a 15-item depression subscale, and has been found to be easy to use, with clear language. While it was found acceptable by refugees living in the US it has been only minimally tested in Vietnam and should be further validated for use in the country and for use among more established Vietnamese communities. The SRQ-20, which was validated and used in several regions of Vietnam, was found to be brief, easy for training and use and cost-effective [6], and thus appears most suitable for use in primary care settings in Vietnam. There are, however, concerns about differences between men and women when responding to the measure [6, 26] and it has not been validated among minority populations. Further research about differences in validity between urban and rural populations is also recommended.

Limitations

This review includes only studies published in English. While we intended to include Vietnamese journals in this review, we found that as these journals are not indexed it was not feasible to include them. Further research in this area would benefit from including research published in Vietnamese journals. It was not possible to conduct a quantitative comparison across measures. The results therefore describe statistical results as reported in the studies and qualitative assessments.

Conclusions

Assessing the validity of depression measures, their cultural acceptability and their appropriateness for use in primary care requires a balance of standard validity testing, assessment within the specific sociocultural context, and criteria such as brevity and cost-effectiveness. Vietnam is not only ethnically diverse but also has linguistic and cultural diversity by region that mean that, even when found to be valid in one population or region, a measure might require additional testing for use elsewhere. The increasing influence of Western culture, particularly in urban areas, may also lead to variation among the validity of measures between rural and urban populations. As demonstrated through testing of the SRQ-20, differences in gender in Vietnam may also mean that measures that can be used universally in Western populations will require gender-specific assessment in Vietnam.

The variability in methods used to validate instruments and populations in which the measures were tested makes definitive conclusions about the most effective measure difficult. Studies or services wishing to implement depression screening in primary care have an extensive body of literature on which to draw to make decisions about the most appropriate instrument for their purposes. Specific consideration, however, to population, region and gender specific factors should be given when using these measures in practice. Pilot testing of measures validated in one region or population to be used in another is recommended in order to ensure conceptual and linguistic equivalence of measures across the diverse Vietnamese population.

While a discussion of the validation of depression measures in other contexts was beyond the scope of this review, the findings have implications beyond Vietnamese populations. We found little consistency in the validation of depression measures, with limited attention to construct and content validity, both of which are important to determine cross-cultural validity. To guide further studies of validation of psychometric measures in cross-cultural contexts and in primary care, enhanced methodological guidelines for validation, which include testing of all types of validity and which draw on anthropological methods to avoid pitfalls of testing against non-validated Western psychiatric assessments, would benefit researchers conducting validation studies and health services wishing to employ the most appropriate measure.

Abbreviations

Depression measures

BDI: Beck Depression Inventory; CES-D: Centre for Epidemiologic Studies-Depression Scale; CIDI: Composite International Diagnostic Interview; DASS: Depression and Anxiety Stress Scale; FMoMH: Four Measures of Mental Health; GHQ: General Health Questionnaire; HSCL: Hopkins Symptom Checklist; iHSCL: Indochinese Hopkins Symptom Checklist; PVPS: Phan Vietnamese Psychiatric Scale; SRQ-20: Self-Reporting Questionnaire-20; VDS: Vietnamese Depression Scale

Statistical tests

Se: sensitivity; Sp: specificity; AUC: area under the receiver operating curve; PPV: positive predictive value; NPV: negative predictive value; PCC: probability of correct classification; K: kappa coefficient; EFA: exploratory factor analysis; CFI: Comparative Fit Index; TLI: Tucker–Lewis Index; RMSEA: root mean square error of approximation; PCA: principal component analysis; MT-MM: multitrait-multimeasure

Contextual acronyms

NIMH: National Institute of Mental Health; HCMC: Ho Chi Minh City; VN: Vietnamese; MMORPG: Mix multi-player online role playing game; MKD: Mekong Delta Region; IDRC: International Development Research Centre; DSM: Diagnostic and Statistical Manual; US: United States; WHO: World Health Organization; POW: Prisoner of War; DIS: Diagnostic Interview Schedule for DSM