Background

Depressive and anxiety disorders are common worldwide, affecting, respectively, 322 million and 264 million, equivalent to 4.4% and 3.6% of the world’s population [1]. The prevalence of depressive and anxiety symptoms and their concurrence is even higher ranging from 9.3% to 27.2% [2,3,4]. Low- and lower-middle income countries witness the highest prevalence where poverty and economic pressure prevail [5]. Depression and anxiety are associated with several social and physical problems such as disturbed family relations, high suicidality (more than 800,000 per year), poor academic performance, and use of illicit drugs [1,2,3,4, 6, 7].

More than three quarters of drug users have symptoms of depression and anxiety (Moody, Franck, & Bicke, 2016); the severity of symptoms varies between those in long-term residential treatment and those in outpatient treatment [8]. Their psychological distress is associated with low abstinence self-efficacy, craving, treatment failure, relapse, and continued use [9, 10], but the frequency of drug use decreases when negative emotions decline during treatment [11]. Therefore, measuring emotional negativity in people with substance use disorders is essential to evaluate and improve treatment outcomes and possibly prevent relapse [12].

Although a large number of scales that measure anxiety and/or depression exists, these measures fail to differentiate between both constructs [13]. The Depression Anxiety Stress Scale (DASS) is a 42-item tool for assessing the discrete features of depression, anxiety, and stress [14]. The aim of its development was to reduce the measurement overlap between depression and anxiety that threatens the purity of standalone measures of depression or anxiety [15]. Its 3-factor structure ideally matches the tripartite classification of depression and anxiety symptoms: non-specific symptoms of general distress, anhedonia/low positive affect specific to depression, and somatic arousal specific to anxiety [16]. Because the short version (DASS-21) is relatively easy to administer, it has been broadly used for research and clinical purposes in various groups and settings [15, 17, 18]—including people with substance use disorders [19].

Psychometric properties of the full version are well-established [14, 20]. However, for the DASS-21 several studies reported inconsistent findings in different cultural contexts (including English speaking ones where it was originally developed) and via various quality assessment techniques [5, 13, 15, 17, 18, 21,22,23,24,25,26,27,28]. Similarly, examinations of scalability and item functioning indicate that the Arabic version of the DASS-21 contains some problematic items with regard to discrimination, level of difficulty, and invariance across different groups [26, 29]. Meanwhile, the factor analysis of the Arabic DASS-21 has not been examined neither remedial actions were carried out to address erroneous items. Therefore, the current study aims to report the results of factor analysis, reliability testing, and convergent validation testing of the Arabic DASS-21.

Convergent or construct validity examines whether constructs that should be related are related [30]. Deriving from previous reports which indicate that self-stigma—negative self-views: awareness of stereotypes (negative public attitudes toward substance users), personal agreement with public stereotypes, self-occurrence (application of negative public attitudes to self), and shame (loss of self-esteem because of flawed self-evaluations) [31]—lead to unpleasant emotions [32], bivariate correlation between the DASS-21 and Self-Stigma in Alcohol Dependence Scale was used to test convergent validity (described in details latter in the Methods) [30]. Specifically, we hypothesized that scores on the stigma subscales measuring awareness of public stereotypes and agreement to them would be only weakly correlated with the DASS-21 score, because awareness of public stereotypes and agreement to them reflect knowledge and cognitive processes rather than emotions. In contrast, we hypothesized that scores on the stigma subscales measuring stereotype self-occurrence and drug-related shame (which reflect affective aspects of stigma that are more related to psychological symptoms) would positively correlate with scores on the DASS-21.

Methods

Participants

This study recruited people who were being treated for substance use disorders and were inpatients at a government psychiatric hospital in Alexandria, Egypt. Eligible patients were included if they could read and write, were free from severe mental disorders (e.g., schizophrenia, suicidal ideation), and gave written informed consent. Of 420 inpatients, 51.2% were eligible to participate, but only 35.5% took part in the study. The sample comprised 149 participants (95.3% men, mean age = 32.5 years, SD = 6.8 years, age range: 19–60 years). Heroin, synthetic drugs, and cannabis were the most commonly used drugs 80.5, 79.2, and 75.2%, respectively (see Table 1 for sociodemographic and clinical characteristics of the participants). The current study is a secondary analysis of a former study on self-stigma of substance use disorders [33]. The board of research ethics of Faculty of Nursing, University of Alexandria, approved the study, and participants provided a written informed consent prior to data collection.

Table 1 Sociodemographic and clinical characteristics (n = 149)

Instruments

The depression anxiety stress scale–21 (DASS–21)

The DASS-21 has 21 items in 3 subscales of 7 items each. They ask about depressive symptoms (e.g., feeling down-hearted and blue), anxiety symptoms (e.g., feeling close to panic), and general stress symptoms (e.g., having a tendency to over-react to situations). Response options are on a 4-point scale (0 = did not apply to me at all and 3 = applied to me most of the time). Higher scores indicate more psychological distress [34].

The self-stigma in alcohol dependence scale (SSAD)

The SSAD uses 5-point Likert-type responses. It has four subscales of 16 items each. It measures four aspects of stigma: awareness of public stereotypes, stereotype agreement, stereotype self-occurrence, and drug-related shame [31]. This scale was translated into Arabic and modified by substituting “substance dependence” for “alcohol dependence” since participants used alcohol and other drugs. Estimates of internal consistency for the translated version were adequate (coefficient alpha = 0.81, 0.86, 0.83, and 0.84 for the four subscales). These indices of stigma were used for validation testing. All questionnaires were self-administered.

Statistical analyses

Data from the DASS-21 were analyzed in 2 stages. First, to check if the data fit the 3-factor structure of the DASS-21 set by Lovibond and Lovibond (1995), items’ loadings of the DASS-21 were examined after exploratory factor analysis (EFA), retaining the number of factors with an eigenvalue greater than one. Then, a set of EFA was conducted using maximum-likelihood extraction and direct oblimin rotation with Kaiser normalization. The number of factors was set to 4, 3, and 2. The χ2 and df values were obtained from Bartlett’s test of sphericity and also from the Goodness-of-Fit test, and they were used to compute the normed fit index (NFI), the comparative fit index (CFI), the Tucker-Lewis index (TLI) (all should be > 0.95), and the root mean square error of approximation (RMSEA, which should be < 0.08) [35]. Parallel analysis — principal components analysis with raw data permutation — [36] was performed to determine the number of components to extract. In the second stage, items 2, 6, 14, and 18 were removed because their factor loadings were less than 0.4 [37] and then ran EFA and parallel analysis.

For the full DASS-21 and the modified version (DASS-17), reliability indices were computed: alpha coefficients, corrected item-total correlations, and the values of alpha if an item was deleted. As for validation tests, correlations between the SSAD subscale scores and the DASS-21 and DASS-17 scores were computed to test the hypotheses described above. The data were analyzed with SPSS version 22.

Results

Factor analysis

In the initial EFA of the DASS-21, 5 factors had eigenvalues greater than 1 (6.48, 1.65, 1.41, 1.26, and 1.12). The scree plot had a rather distinct “elbow” at the second factor (Fig. 1). Table 2 shows loadings for both the 3-factor and 1-factor solutions. In the 3-factor model, almost all items had their highest loadings on the first factor. The only exceptions were items 2 and 3, which cross-loaded on the third and second factors, respectively. In the 1-factor model, similar to the 3-factor model, items 2, 6, 14, and 18 had the lowest loadings (all below 0.4) and the lowest communalities (0.196, 0.133, 0.108, and 0.145). The model fit indices indicated failure of the DASS-21 items to load on 4, 3, or 2 factors—the NFI, in particular, was substantially low: 0.84, 0.80, and 0.75 for 4, 3, and 2 factors, respectively (Table 3).

Fig. 1
figure 1

Scree plot of eigenvalues from un-rotated EFA of the DASS-21

Table 2 Results of exploratory factor analysis of the DASS-21 and the DASS-17 (extraction by maximum likelihood)
Table 3 Partial confirmatory factor analysis for the DASS-21

In the parallel analysis (Table 4), only 1 component had an eigenvalue that was above the 95th percentile of eigenvalues of 1000 random datasets of the same dimension. The eigenvalue of that component was 6.48, which is equal to the eigenvalue of the first factor extracted in the initial EFA, which explained 30.9% of the common variance.

Table 4 Parallel analysis (principle components analysis with the method of raw data permutation) of the DASS-21 and the DASS-17

As shown in Table 2, factor analysis after removal of the four items with low loadings resulted in acceptable loadings of the 17 remaining items on one factor. That one factor explained 34.9% of the variance. Similarly, parallel analysis indicated that only one factor should be retained (the right-hand side of Table 4).

Reliability and validation tests

Reliability reflects the accuracy, precision, and consistency of a test score. Coefficient alpha is the most popular reliability coefficient. Though alpha of 0.70 can be considered reliable, the higher its value the more reliable the scale is [38]. Coefficient alpha was almost the same in the DASS-17 and the DASS-21 (0.881, and 0.883, respectively). Removal of any item from the DASS-17 would reduce its reliability (Table 5). For the DASS-17, the mean of inter-item correlations was slightly higher (0.283) than for the original scale (0.249).

Table 5 Means, standard deviations, internal-consistency reliability, and correlations of the DASS-21 and DASS-17 scores with stigma variables

As hypothesized, for stereotype awareness and stereotype agreement the correlations with DASS scores were all weak: from − 0.007 to 0.172 (Table 5). Also as hypothesized, for stereotype self-occurrence and for shame the correlations with DASS scores were all positive, and they were moderately strong: from 0.417 to 0.495 (Table 5). All four SSAD subscale scores correlated more strongly with DASS-17 scores than with DASS-21 scores.

Discussion

A 17-item version of the DASS performed better than the 21-item version, and this DASS-17 provided a unidimensional index of overall psychological distress. Other studies have reported a 3-factor structure of the DASS-21 [15, 22, 23, 39,40,41]. However, in at least one study, items were forced to load in a manner that served the original structure of Lovibond and Lovibond (1995): for the Malay DASS-21, Nur Azma et al. (2014) developed four CFA models, although all had low fit indices. Some studies relied mainly on the criterion of an eigenvalue above 1 to decide how many factors to retain [39, 41], which can overestimate the number of factors that should be retained [36]. Further, in six studies, several items (up to 11 items) had cross-loadings i.e., loaded on more than one factor [13, 17, 25, 39,40,41] while some items had low loadings (below 0.3) [13, 39, 41], which can indicate model-to-data misfit i.e., a considerable number of items of the scale do not differentiate between its three main constructs.

High correlations among the DASS-21 factors have been reported in different age and ethnic groups [13, 15, 22, 28, 41]. In the study by Norton [15] those correlations were extremely high (r’s = .920–.974), which casts great doubt on the 3-factor structure of the DASS-21. Moreover, other studies indicated that the DASS-21 has a “bifactor” structure: a general factor of overall psychological distress and 3 specific factors of depression, anxiety, and stress [18, 42, 43]. The presence of a general factor resulted in a better fit and is consistent with the high correlations among depression, anxiety, and stress [13, 42, 43]. In some instances, that general factor “accounted for the greatest proportion of common variance in the DASS-21 item scores” [13, 18].

Taken together, the previous reports of a bifactor structure of the DASS-21, of high inter-factor correlations, and of item cross-loadings all support the plausibility of a 1-factor solution, which was obtained for the DASS-17 in this study. Results of parallel analysis after removal of four items with low loadings and communalities (items 2, 6, 14, and 18), indicated that the DASS-17 was unidimensional (Table 4). This is consistent with the previous finding that items 2 and 18 had very low item-discrimination indices (< 0.2) [26].

In agreement with our results, the DASS-21 was unidimensional in Latino college students near the US-Mexico border [24], nursing students in Brunei [13], and in Australian adolescents [44]. Symptoms of depression, anxiety, and anger are strongly linked and may be not easy for young people to distinguish [44]. Similarly in adults, Tran et al. (2013) found that all items of the Vietnamese DASS-21 (except item 18) loaded on one factor, and they concluded that the DASS-21 was useful only for identifying people who had both symptoms of depression and anxiety. Similar findings were reported from a study of a non-clinical population of adults in the United States [45]. The Turkish version of the DASS-21 could be used to differentiate people with depressive disorder and anxiety disorders from healthy controls, but not to differentiate patients with a diagnosis of major depression from patients with anxiety disorders [27]. Therefore, the DASS-21 can be used to assess the severity and frequency of negative emotional states, but not to measure, separately, the severity of depression or of anxiety.

Although deleting items from a scale can be associated with reduction of reliability, removal of four items in the current study decreased the value of coefficient alpha only from 0.883 to 0.881, which is considered very trivial indicating that deleted items did not really contribute to reliability of the scale. Despite the slightly lower alpha value, the DASS-17 might even perform better than the DASS-21 given the improvement of item-total correlations and inter-item correlation reported in Table 5—suggesting better convergent validity of the DASS-17. On the other side, the DASS scores correlated as hypothesized with drug-related stigma (Table 5), and stigma is strongly associated with both depression and anxiety [31]. Some drug users also have other common risk factors for both depression and anxiety: unemployment, dysfunctional relations, poor family and support networks, serious infections, financial difficulties, incarceration, and homelessness [8, 46,47,48]. In addition, depression, anxiety, and post-traumatic stress disorders are highly co-morbid in this group [49]. Thus it would not be at all surprising for symptoms of depression and anxiety to be strongly interwoven in this study’s participants, which might account for the unidimensionality in the present study. This argument is supported by findings of a former report, which indicated that scores of the total DASS-21 could predict depressive disorders among people with substance use disorders at sensitivity and specificity levels of 78–89% and 71–76%, respectively [19]. This could well be what has been called “common mental disorders of depression and anxiety” [5] or “anxious-depression symptomatology” [24], which were described in former studies that assessed the validity of this scale. In final, the short form of the Arabic DASS scale may not differentiate between anxiety and depression disorders (at least among people with substance use disorders) as it was intended to do; however, it might be a suitable measure of negative emotions that overlap between anxiety and depression.

Conclusion

Even though the DASS-21 has been reported to successfully measure three different latent variables in some groups, the Arabic DASS-21 was unidimensional in the present context. Also, removing four items with poor loadings resulted in a unidimensional 17-item scale of overall psychological distress. DASS-17 scores were as reliable as the DASS-21 scores. Both the DASS-21 and DASS-17 performed as hypothesized on validation tests using the SSAD subscales. On the convergent-validation tests, correlations between the DASS-17 and stigma aspects (stereotype agreement, self-occurrence, and shame) were better than those of the DASS-21.

Because the participants were a convenience sample from a public hospital and were mainly men who used multiple drugs, they may not be representative of all Egyptian drug users. Further validation testing and studies in other populations would certainly be useful.