Introduction

Perceived stigma can be a problem for individuals of all ages with behavioral health problems. Research focused on older adults has found that older patients are often reluctant to discuss their behavioral health problems with a health care provider. As with individuals in other age groups, some older adults may reject the diagnosis of depression or another behavioral health disorder due to the stigma attached to it. A psychiatric diagnosis may spark concerns of a potential loss of independence and fears of being institutionalized (Wagenaar et al. 2002). The problem of stigma not only affects the identification of behavioral problems in older adults, but treatment adherence as well. Greater perceived stigma toward individuals with behavioral health problems is associated with a greater likelihood of treatment discontinuation (Sirey et al. 2001).

In 1963, Goffman defined stigma as an “attribute that is deeply discrediting” and reduces the bearer “from a whole and usual person to a tainted, discounted one” (as cited in Bambauer and Prigerson 2006). Perceived or self stigma is the belief that most people will devalue and discriminate against individuals who use behavioral health services or who have a behavioral health disorder (Link et al. 1989). Factors that have been found to be associated with stigma include, sex, with males being more prone to the negative aspects of stigma (Reynolds and Kupfer 1999), though it is possible that females are more prone to feelings of stigma in relation to alcohol use/abuse (Gomberg 1988). Ethnicity and race are associated with stigma though understudied. There is evidence to suggest in relation to stigma associated with alcohol use disorder that perceived stigma is higher for non-White compared with non-Hispanic white race/ethnicity (Keyes, et al. 2010) and in another study compared to Whites, higher for Blacks, and highest for Asians and Hispanics (Smith et al. 2010). Aside from background characteristics, there exists a relationship between stigma and quality of life, where respondents with lower quality of life experience greater perceived stigma (Mechanic et al. 1994).

Specifically, the stigma surrounding behavioral health care has been widely studied in younger populations. Less, however, is known about the impact of stigma on the health care utilization of older adults. The general topic of stigma and the negative outcomes associated with it has received more attention in the past decade following the 1999 Surgeon General’s Report on mental health. This report states that stigma is the most formidable obstacle to further progress in the field of mental illness (Substance Abuse and Mental Health Services Administration, Center for Mental Health Services, National Institutes of Health, & National Institute of Mental Health 1999). In 2003, the World Health Organization (WHO) and the Old Age Psychiatry section of the World Psychiatric Association (WPA) jointly distributed a technical consensus paper on the problem of stigma and discrimination against older adults with mental disorders. This report called for further research in the area (Graham et al. 2003).

Despite the call for further research, few instruments to measure stigma, specifically with older adults, have been developed. One questionnaire does exists, however it focuses on bereaved older adults (Bambauer and Prigerson 2006). Currently, there is no general stigma questionnaire for older adults. In light of findings that older adults in need of behavioral health services are acutely affected by perceived stigma, measures developed specifically for this population would be of great use in understanding the problem of perceived stigma. In response to this shortcoming, the research group associated with the Primary Care Research in Substance Abuse and Mental Health for Elderly (PRISM-E) study with funding from substance abuse and mental health services administration (SAMHSA) developed the SAMHSA Mental Health and Alcohol Abuse Stigma Assessment (see Table 1). With older adults in mind, this measure was designed to assess two attributes of stigma: stigma towards mental illness and stigma toward related services, though it is general in nature and could be used with any age sample with appropriate validation. This new SAMHSA Stigma Assessment has been assessed for reliability and validity, but has not undergone any extensive psychometric testing.

Table 1 SAMHSA mental health and alcohol use stigma assessment items

To make comparisons of stigma between groups or change in stigma over time, it is import to test whether the questionnaire and items meet the assumption of measurement invariance. When testing this assumption we can use time or group membership as possible violators of this assumption. The assumption of invariance requires that the relationships between the observed items and the latent attribute (theoretical construct) remain constant across measurement occasions and/or groups. In other words, two people with the same level of, say stigma, have the same probability of responding a certain way to the items of the questionnaire, and knowing what group or measurement occasion they are from has no affect on this probability. If this assumption is violated and measurement bias present, comparisons may be invalid regardless of whether the scale has a high Chronbach’s alpha or reliability (Mellenbergh 1989; Meredith 1993).

Different statistical methods are available for testing the assumption of invariance with respect to group membership, time, and other violating variables. The method we chose is confirmatory factor analysis (CFA). With CFA we can test the structure of the items in relation to the latent attributes and also test if the items are invariant across groups and/or time. If the results suggest that the assumption of measurement invariance holds, that is, the items are only measuring the attribute of stigma and are not influenced by group specific attributes or changing respondents’ response patterns, then the latent means can be confidently compared for true differences or change. There is a large body of literature that has investigated invariance with respect to multiple-groups (Vandenberg and Lance 2000; Schmitt and Kuljanin 2008). Longitudinal invariance, on the other hand has received less attention.

Response shift is one definition for describing a violation of invariance in longitudinal data (Howard and Dailey 1979; Sprangers and Schwartz 1999). In general, response shift can be thought of as a special case of measurement bias (non-invariance) that can occur when investigating change within an attribute of interest (Oort 2005; Oort et al. 2005). A researcher investigating response shift might expect some items of the measurement instrument to be biased, especially when a catalyst between measurement occasions occurs for all respondents (for example, treatment or planned intervention) that is expect to cause change.

In the current study we compared the latent means of stigma across three measurement occasions and two treatment groups. Such comparisons are only valid and meaningful if the assumption of measurement invariance is met. To investigate this assumption, and assess whether the psychometric properties of the Stigma Assessment are invariant over time and across treatment groups we follow the procedure presented in King-Kallimanis et al. (King-Kallimanis et al. 2010). This procedure has previously only been applied to longitudinal data, in the current study we extend the methodology by testing for measurement invariance across both time and groups. We do not expect to meet the assumption of invariance given that respondents are exposed to a catalyst in the form of treatment. The implication of this expectation is that results associated with mean differences could be incorrect. To circumvent incorrect conclusions, we will illustrate how a violation of this assumption can be overcome and meaningful comparisons of stigma can be made and highlight any differences in conclusions had invariance not been tested.

Method

The data used in this study are from the larger Primary Care Research in Substance Abuse and Mental Health for Elderly (PRISM-E) study. PRISM-E was a multi-site, randomized, comparative trial examining two models of care for persons aged 65 and older with symptoms of depression, anxiety, and at-risk drinking. Participants randomized into the integrated behavioral health care group received behavioral health services located in the same practice as their primary care physician. Participants randomized into the enhanced referral care group received behavioral health services at a specialty behavioral health clinic that was geographically separate from their primary care provider (Levkoff et al. 2004).

Measures

SAMHSA Mental Health and Alcohol Abuse Stigma Assessment

The questionnaire contains seven items with a five-point response scale with options ranging from “Not at all” to “Extremely”; an additional response option is included to allow patients to respond “Don’t know/Refused” (see Table 1). The scale requires administration by an interviewer as the scale utilizes gateway items. That is, the first two items determine the wording of the following five items, and these items focus on the subject of stigma, namely mental health stigma only, alcohol abuse stigma only or both. Two items are reversed scores, so that higher scores are indicative of greater stigma. Limited psychometric information on the Stigma Assessment exists. This is the only published article based on PRISM-E that uses the full Stigma Assessment.

Diagnosis

Participants were assessed at all measurement occasions for symptoms of depression, anxiety, and at-risk drinking using a variety of measures (for further details see Levkoff et al. 2004). Anxiety symptoms were assessed using the MINI Panic Disorder Scale, MINI Generalized Anxiety Scale (MINI; Sheehan et al. 1998), and the Beck Anxiety Inventory (BAI) (Beck et al. 1988); for depression symptoms the MINI Major Depression Scale, MINI Dysthymia Scale, MINI Depression History Scale (MINI; Sheehan et al. 1998), and the Center for Epidemiological Studies Depression Scale (CES-D) (Radloff 1977) were used; and for at-risk drinking The Short Michigan Alcoholism Screening Test – Geriatric Version (Blow et al. 1998) were used. In our analysis diagnosis was dichotomized into two variables: mental health symptoms (yes/no) and at-risk drinking (yes/no) and only baseline information was included.

Health Related Quality of Life – The Medical Outcomes Study SF-36

This questionnaire is comprised 36 items that can be summed into the mental health quality of life (MENT HRQoL) and the physical health quality of life (PHYS HRQoL) component scores. These scores range from 0–100, with higher scores indicating higher levels of quality of life. The SF-36 is a well validated scale with good psychometric properties (Ware et al. 1993).

Background Characteristics

Age, sex, education and race are included in the analysis. Education was categorized into three groups, ‘less than high school’, ‘high school’ and ‘more than high school’. Race was also categorized into three groups, ‘White’, ‘Black’ and ‘Other’. Other is not tested in relation to measurement invariance due to the diversity of this category, it is included so as not to lose power.

Sample

The final PRISM-E study group comprised 2,022 participants aged 65 or older, with a baseline measurement and three follow-up measurements (3, 6 and 12 months from baseline). In the current analysis we do not investigate the measurement occasion at 12 months as the Stigma Assessment was not included in the research protocol for that assessment period. We excluded participants who missed an interview (n = 539) or had a missing mental health symptoms or at-risk drinking information (n = 58), age, or gender. Patients’ with missing data on items 1 and 2 of the Stigma Assessment were also excluded as these items determined the wording of items 3–7 (n = 527). When partial data were present on the SF-36 and/or items 3–7 of the Stigma Assessment, or a “Don’t know/refused” response was present, the values were imputed using the expected maximization procedure available in SPSS. These exclusions resulted in a final sample of 1,198 patients.

The final sample was predominately male (75.39%) due to increased participation from Veteran Affairs facilities, and had a mean age of 73.16 (±6.07). Over three quarters (77.22%) of the sample reported mental health symptoms, a third with at-risk drinking (30.65%). Only, 7.87% of the sample experienced both symptoms of mental health and at-risk drinking (see Table 2).

Table 2 Descriptive statistics for respondent characteristics

Statistical Analysis

The analysis is an application of methods presented in King-Kallimanis et al (2010). However, we extend these methods to include not only the longitudinal data structure, which includes three measurement occasions, but also multiple groups, the integrated behavioral health care group and the enhanced referral care group.

This procedure follows three main steps that require fitting a series of structural equation models (SEM) and assessing their fit. To achieve this, the maximum likelihood estimation method is used, while the items are not continuous, previous work suggests that with a 5 point Likert scale, estimates will not be too affected (Dolan 1994). Three goodness-of-fit statistics are used to assess the model, the chi-square test of exact fit, the root mean square error of approximation (RMSEA) and the expected cross-validation index (ECVI). A non-significant value of the chi-square test indicates good model fit; however the test is sensitive to small deviations between the data and the model. Therefore, we also consider the RMSEA, where as a rule of thumb, values of less than .05 suggest close fit and values less than .08 suggest satisfactory fit (Browne and Cudeck 1992). And also the ECVI, which is not a stand alone index, but can be used when comparing nested models, as we do in our paper. A smaller value suggests improved overall model fit (Browne and Cudeck 1992).

To assess the appropriateness and significance of changes made to the model we use the chi-square difference test, with a Bonferroni correction, which tests the difference in the fit of the null and alternative models. When using the chi-square difference test to assess the invariance of the items, we use global tests, where all factor loadings and intercepts of the items are simultaneously tested (King-Kallimanis et al. 2010). In the presence of a high level of power an additional method is required to assess the significance of model modifications. We therefore consider standardized observed parameter changes (OPC) as a means to guard against chance findings in Step 2 and Step 3 of the procedure. The OPC is the difference between the standardized parameter of interest (factor loading or intercept) in the null model versus the same standardized parameter in the alternative model. These are scaled for comparison, when a factor loading OPC is greater than 0.1 or an intercept OPC is greater than 0.2 (Cohen 1988) and the chi-square difference test is significant we consider the parameter possibly biased. All analyses are conducted using Mx32 (Neale 2004).

Procedure

  1. Step 1:

    Establishing a measurement model.

    SEM was used to fit an appropriate confirmatory factor model using two (integrated behavioral health care and enhanced referral care) variance-covariance matrices for the five items of the Stigma Assessment scale at three measurement occasions. For both groups and all measurement occasions the same factors and pattern of factor loadings were modeled and all parameters were free to be estimated. To assess areas of misfit the standardized residuals were investigated. Any model changes were assessed using the chi-square difference test. Before moving to Step 2, we required that the final model had satisfactory fit as determined by the goodness of fit statistics.

  2. Step 2:

    Testing measurement invariance across measurement occasions and groups.

    To test measurement invariance, the final model from Step 1 was used and the factor loadings and intercepts were simultaneously constrained to be equal at all time points and across both groups. A strong indicator of measurement bias is a significant deterioration in the fit of this fully constrained model when compared to the model where all parameters are fully estimated. However, if the model does not significantly deteriorate, we still investigate the possible presence of measurement bias. This is because one biased parameter at one time point, in one group may not cause significant deterioration in the overall model fit, yet may be substantially important. In order to investigate whether any observed items(s) were biased the equality constraints imposed were removed and assessed using a series of global tests. In the first iteration there were five global tests, one for each item. In the first test, the equality constraints of the factor loadings and intercepts of the first item are removed and the model compared with that of fully constrained model using the chi-square difference test and the OPCs. Once all five tests are conducted, if there is an item with a significant chi-square difference test and at least one OPC meeting our criteria, this item’s equality constraints are permanently removed. In the presence of more than one item meeting these criteria, the item associated with the largest improvement in overall model fit will be considered biased. This procedure is continued iteratively, until no large OPCs remain that are associated with a significant chi-square difference test. Any biases identified will be further explored using the partitioning formula of Oort (2005) and the size of the effects will be assessed using Cohen’s d effect sizes (Cohen 1988).

  3. Step 3:

    Testing measurement invariance with respect to exogenous variables.

    Invariance, in this step, was tested by extending the final model in Step 2 to include additional exogenous variables which may induce bias on the observed items. In theory the relationship between the exogenous and observed items should be fully explained via their relationships with the latent variable, if this does not hold true, bias has been found. We included the subject of stigma (Items 1 and 2) of the Stigma Assessment at all measurement occasions, age, gender, race, education, mental health symptoms and at-risk drinking and the Physical and Mental HRQoL component scores of the SF-36. In this model these exogenous variables were correlated with the latent constructs, at all three measurement occasions, and their relationship with the observed items, should be explained via these correlations.

    Like in Step 2, we used global tests to investigate the presence of measurement bias between the exogenous variables and observed items. For each exogenous variable we conducted five global tests. This involved adding additional parameters to the model both longitudinally and for both groups. When all direct effects were tested, any effects producing a significant chi-square difference test and large OPCs were included in the model. In the presence of more than one effect meeting these criteria, the effect that produced the largest improvement to the overall model fit was included in the model. This process continued in an iterative manner until no more large OPCs with a significant chi-square difference test were found.

Results

  1. Step 1:

    Establishing a measurement model.

    As the structure of the Stigma Assessment had not previously been investigated, we hypothesized two possible measurement models. A uni-dimensional measure of stigma and, based on the content of the items, a two-dimensional measure. We first tested the uni-dimensional model with all five items loading on the single latent variable of stigma (Model 1, Table 3). The chi-square test was significant and the RMSEA also suggested poor fit. Next, we tested the two-dimensional model. The chi-square test was still significant but the RMSEA suggested satisfactory fit (Model 1.F).

    Table 3 Overall goodness-of-fit and chi-square difference test (n = 1198)

    The two dimensions were defined as: 1) Perceived Stigma: which included items measuring level of embarrassment and the perception of a negative response from others regarding seeking behavioral health treatment and 2) Comfort Level: which included items measuring the respondents’ level of comfort in talking with a service provider about their own mental health or alcohol abuse problems. This final measurement model is depicted in Fig. 1.

    Fig. 1
    figure 1

    SAMHSA Mental Health and Alcohol Abuse Stigma Assessment measurement model for three measurement occasions and one group. Note - Abbreviations – Edu. – Education; MH Sympt. – Mental health symptoms; Ment HRQoL – Mental health-related quality-of-life; Phys HRQoL – Physical health-related quality-of-life

  2. Step 2:

    Testing measurement invariance across measurement occasions and groups.

    In this step all factor loadings and intercepts were constrained to be equal across the three measurement occasions and the two groups. The fit of this model significantly deteriorated when compared to the unconstrained measurement model (χ2diff(30) = 83.61, p < 0.001).

    When investigating bias using global tests and OPCs, we found that the removal of the equality constraints for Item 4 (referring to difficulty in starting treatment if other people knew) led to large OPCs and a significant global test (improvement in model fit) (χ2diff(10) = 36.21, p < 0.001). No more significant improvements were found.

    We conducted a post hoc analysis to see whether the bias was associated with time, group membership or both. The bias is considered to be response shift as the parameters are invariant across groups, but not across measurement occasions. The bias seems predominately associated with the intercepts, with the intercept at each measurement occasion increasing relative to Perceived Stigma. This suggests that when Perceived Stigma decreases over time, Item 4 did not decrease as much as we would have expected, this is regardless of treatment groups.

  3. Step 3:

    Testing measurement invariance with respect to exogenous variables.

    In this step we used the final model from Step 2 and included the additional exogenous variables. In this model (Model 3) these twelve exogenous variables were correlated with the latent constructs, Perceived Stigma and Comfort Level at all three measurement occasions. The largest correlations were between both subjects of stigma (Item 1 and 2) and Perceived Stigma (range: 0.36–0.20). There were also small negative correlations between MENT HRQoL and Perceived Stigma and Comfort Level. A similar correlation pattern was seen for PHYS HRQoL. The correlations between age, sex, education, mental health symptoms and at-risk drinking and the latent constructs were very small (<0.01). Finally, the correlations between race and both Perceived Stigma and Comfort Level were negative and small (range: −0.23– −0.08).

    Using Model 3 as the reference model a total of five biases were found, four of these biases were related to Item 3 of the Stigma Assessment, the other bias was related to Item 6. In the sixth iteration no additional significant effects were found.

    Ad hoc analyses indicated that four of the five biases detected were constant across time and over treatment groups. The remaining bias was considered response shift as it was biased over time, but not across groups. The biases of Item 3 with respect to the two subjects of stigma (Item 1 and 2), gender and Mental HRQoL were in the same direction as the correlations and suggest that: people embarrassed by mental health reported greater stigma, people embarrassed by alcohol abuse reported greater stigma, females reported greater stigma and people with worse Mental HRQoL reported greater stigma, than would be expected given their “true” stigma scores (i.e., common factor scores). Mental HRQoL and Comfort Level, were negatively correlated, as higher scores on Comfort Level are indicative of greater un-comfort. The fifth bias on Item 6 with respect to Mental HRQoL was in the same direction as this correlation, and suggested that people with better Mental HRQoL reported less un-comfort speaking with a mental health profession than would be expected given their “true” Comfort Level score (see Table 4 for selected final parameter estimates).

    Table 4 Selected parameter estimates for the final model after all biases were accounted for

Measuring True Change in the Stigma Assessment

After accounting for the bias found in Step 2 and 3 we assessed the impact of these biases on Perceived Stigma and Comfort Level, both between treatment groups and longitudinally. The effect sizes (Cohen 1988) of true change for the biased Item 4 is small/medium, −0.29 and −0.28 for the integrated and referral groups. Not taking into consideration the effect of bias, we would have concluded that the effect size for change was of small magnitude -.18 for the integrated group and -.15 for the control group, however a large amount of this change is attributable to bias (0.11 in the integrated group and 0.12 in the referral group). The biases associated with the exogenous variables had little impact on their correlations with Perceived Stigma and Comfort Level.

When we consider the latent means of Perceived Stigma and Comfort Level we see in Fig. 2a and b that there is little difference between the means of the original model, before bias has been accounted for, and the means after all biases have been included at the end of Step 3.

Fig. 2
figure 2

a Perceived Stigma mean change by group and between models. b . Comfort level mean change by group and between models

Discussion

In this study we found a measurement model that included two latent constructs, Perceived Stigma and Comfort Level. This suggests that the Stigma Assessment Scale is not uni-dimensional and therefore when considering change in stigma, the five items should not be summed together to make a total Stigma score. In addition, six biases were found when investigating measurement invariance in the longitudinal/ multi-group data of the Stigma Assessment Scale. However, despite these findings, the substantive impact of accounting for these biases on stigma over time and across treatment groups in our analysis was small.

In Step 2, when assessing bias with respect to measurement occasions and groups, we found response shift. This was found in the item assessing whether it would be difficult for the respondent to start treatment if others knew they were to have treatment. In general, overall Perceived Stigma decreased over the course of the study, however Item 4 did not decrease as much as we would have expected given the overall decrease in Perceived Stigma. Apparently, in comparison with their overall Perceived Stigma, respondents initially are reserved when answering this item. However, after treatment they feel more open to report their true level of stigma on this item, in other words, they have recalibrated the response scale anchors. The impact of this bias on substantive conclusions was small. Without considering this bias, we would have erroneously concluded that there was small change in this item over time. After the bias was accounted for in this item, we actually see that there is very little true change and most of the observed change seen is attributable to the bias. That is, respondents’ stigma towards starting treatment if others knew, did not change, regardless of treatment group.

When testing the assumption of measurement invariance, a limitation in using SEM is that we are only able to identify biased items when they are in the minority. Perceived Stigma is represented by three observed items; with one biased item, this constitutes a third of the items. Even though this is a minority it is still difficult to be confident that the bias observed is truly associated with Item 4 (difficulty in starting treatment if other people knew). If a bias had been associated with an item measuring Comfort Level, with just two observed items, we would not be able to tell which of the two items were truly biased.

In Step 3, when testing measurement invariance with respect to exogenous variables, we found five examples of bias. Four of these biases were related to Item 3 that assessed whether the respondents thought people would think differently of them if they received treatment. This item, which should primarily measure Perceived Stigma, is particularly vulnerable to bias as it is also significantly indicative of other respondent characteristics. This may be due to the very general wording of the item. The biases associated with this item are, both subjects of stigma (mental illness and alcohol abuse), gender and Mental HRQoL. The effect of Mental HRQoL was in the same direction for both groups, but was significantly larger in the integrated care group. The other bias was associated with the comfort the respondent feels in talking with a mental health counselor. Respondents with better Mental HRQoL reported greater comfort speaking with a mental health professional than those with low Mental HRQoL, even if their Comfort Levels are similar.

As discussed above the biases found were interpretable; however, accounting for these biases had little impact on the interpretation of the final results of the Stigma Assessment in this example. Comfort Level did not differ significantly between treatment groups or significantly change across measurement occasions. Perceived Stigma declined over the course of treatment in both groups. The rate of change was not significantly different between treatment groups; however, there was a significant decrease over the course of treatment. As it turns out stigma may be more difficult to change than has been previously thought. After the Surgeon General’s report on stigma (Substance Abuse and Mental Health Services Administration et al. 1999) there was a strong public relations effort to inform the public about the underlying neurobiology of mental illnesses. In a study that assessed the effect of this campaign on stigma the authors found that while there was a greater understanding of the neurobiology of mental illness, stigma towards mental illness and substance dependence went unchanged (Pescosolido et al. 2010). Stigma towards mental illness is resistant to change, in the current study, the integration of care into primary care practice had little impact on the stigma the participants felt.

While a satisfactory measurement model was found, it required two factors, Perceived Stigma and Comfort Level. With just five items assessing stigma we may be missing other import conceptualizations of stigma. In a review of general stigma measurements, Link and colleagues (2004), point out the numerous components of stigma and the instruments that have already been designed to assess these. In the absence of an elder specific measure of stigma it may be advantageous to adapt one of the more comprehensive measures than to further pursue the Stigma Assessment as a specific elder measurement of stigma.

In addition to problem of missing constructs within the Stigma Assessment, there is also the problem of the use of Items 1 and 2 to determine the wording of the remaining items. Questionnaires employing such devices always require the questionnaire to be administered by a research assistant and require a strict administration protocol so as not to introduce interviewer bias. In this analysis we found bias associated with both these items and one of the items associated with Perceived Stigma. The items seem to induce bias, in future elder specific stigma scales, items could be included that pertain to stigma related to alcohol abuse and stigma associated with mental health rather than using items to determine the wording of the remaining items.

The development of valid and reliable measures of behavioral health stigma for use in an older adult population is important for many reasons. Early identification and treatment is key as older males seem particularly affected by the stigma associated with a behavioral health diagnosis, which is cause for concern because older males have the highest rate of completed suicide (Reynolds and Kupfer 1999). Health care practitioners and social service providers need to have the ability to identify those older adults in need of behavioral health services who might be averse to seeking treatment in order to provide targeted outreach.

As more attention focuses on the stigma associated with behavioral health, being able to accurately measure the impact of research studies such as PRISM-E is paramount. As research funding gets more competitive, the ability to employ a psychometrically sound instrument developed specifically for use in a specific population, such as older adults, may be viewed as a way to better inform the evidence base than study protocols using more generalized measures of stigma.

With proper identification techniques, interventions designed to lessen the feelings of perceived stigma in an individual with behavioral health challenges will play an important part in ensuring older adults get treatment they need. Improving self- esteem and self-efficacy has been linked to treatment access and adherence (Link et al. 2002).

One important limitation of the current study is related to the respondents who agreed to receive behavioral health treatment and participate in the PRISM-E study. This group of participants may differ in a number of ways from those individuals who declined to participate in the study. Namely, those who declined may have greater feelings of stigma which have a greater influence over their behavioral health treatment choices than those who agreed to participate in the study. If this occurred, the sample would be biased toward those whose feelings of stigma have less of an impact on their behavioral health care utilization. Stigma may be more of a barrier to treatment for those who declined to participate in the study than for those who agreed to participate. Another limitation is lack of a formal clinical diagnosis of depression, anxiety and alcohol use/abuse, while the measures used to screen for these conditions, in general, have good agreement with the DSM-IV, it is possible that some people were included who would not have met the criteria for a diagnosis and vice versa. However, this is just the first step in further understanding, the psychometric characteristics of the Stigma Assessment and the respondent characteristics that are related to stigma over time.

Future research endeavors to understand the complex relationship between feelings of stigma and behavioral health outcomes will add to the research base and help to reduce the impact of perceived stigma as a barrier to seeking and receiving behavioral health treatment. By developing well validated and psychometrically sounds instruments to assess stigma we will be more able to break through the barrier of stigma and ensure that all older adults receive the care that they need.