Background

As of 2017, there were 36.9 million people living with HIV (PLWH) globally, 19.6 million of whom lived within Eastern and Southern sub-Saharan Africa (SSA) [1]. In Zambézia Province, Mozambique, HIV prevalence is estimated to be 15% within the general population, with higher estimated rates in pregnant women [2]. Given the well-documented association between higher maternal HIV viral load and higher likelihood of infant HIV diagnosis [3], understanding the role of empathy and partner empathy - within the medical, social, economic, and cultural setting of rural Mozambique – on retention in care, adherence to treatment, and maternal-to-child transmission among PLWH is essential to decreasing HIV/AIDS-related morbidity and mortality in Mozambique and globally. With the increased focus on couples-based interventions to improve HIV outcomes in SSA, [4,5,6,7,8,9,10] understanding the interpersonal skills of each member within a couple, such as empathy, may be key mediators of the couples-based intervention efficacy. For example, partners who are better able to understand each other’s perspective may be better able to support them in adherence to antiretroviral medication regimes. However, to our knowledge, an empathy scale has never been validated in or adapted anywhere in SSA, which limits our ability to measure the impact of empathy on retention in HIV care in this setting.

Introduction to empathy, partner empathy, and the interpersonal reactivity index

Empathy, the ability to understand another’s emotions and act accordingly, has a cognitive domain — the ability to understand the experiences of others — and an affective domain — emotional responses to others’ experiences [11,12,13]. Empathy is considered essential to understanding the mental states and experiences of others, a key component of social behavior that allows individuals to function within groups [12, 13]. Specific to this study, empathy is recognized as a key mediator of health intervention fidelity and, within the context of HIV, higher physician empathy is associated with improved patient outcomes [14,15,16,17]. It stands to reason that partner empathy would increase emotional support, treatment adherence, retention in care, and relationship quality – all of which contribute to reducing mother-to-child transmission of HIV.

Recently, there has been increased interest in examining empathy among partners, given that partner interdependence within a dyad results in survey item correlation among partners [18,19,20]. These new methods recognize how one partner’s affective empathy influences the other partner’s cognitive empathy and vice versa [20]. They also show that one partner’s empathy is related to supportive behaviors directed towards the other partner [21]. These between-partner relationships demand a methodological approach that recognize subject interdependence (i.e., does not assume subject independence).

The Interpersonal Reactivity Index (IRI) was developed to assess the distinct cognitive and affective empathy domains [11]. Péloquin and Lafontaine [19] developed and used a modified version of the IRI to assess empathy among couples [19]. Levesque used a dyadic model to further validate the updated IRI scale [19, 20]. We could not, however, find evidence of IRI validation among couples outside of North America. This may limit its assessment of empathy in individuals in other regions, such as within SSA, in individuals with lower levels of education, and within the context of a dyadic relationship outside of North America. This has important implications for interventions targeted to improve health outcomes, including interventions for people living with HIV in SSA.

The current study

This study uses baseline data, specifically demographic data and the IRI, from an ongoing cluster randomized controlled trial — Homens para Saúde Mais (HoPS+) [Men for Health Plus] — that assesses the impact of involving HIV-positive male partners in routine prenatal care for women living with HIV [5]. The HoPS+ trial represents a unique opportunity to take the first step in adapting a measure that will allow researchers to assess empathy, as well as changes associated with behavioral interventions, in Mozambique and, with subsequent studies, in SSA.

The purpose of this study was to adapt the IRI among study participants in the ongoing HoPS+ study through an exploratory factor analysis, dyadic confirmatory factor analysis, and dyadic measurement invariance testing. We used a dyadic approach to account for how each partner’s perceived empathy impacts the other partner’s perceived empathy. This informs understanding of how supportive behaviors, shaped by between-partner interdependence, may augment engagement with and outcomes from HoPS+ and other behavioral interventions. Further, we assess correlations between the adapted IRI measure and related demographic and psychological (e.g., depression) factors, to provide further convergent and divergent validity evidence for the adapted measure. These results will help us evaluate the effect of our intervention on male and female empathy in males and pregnant women living with HIV in Mozambique and SSA and lay the groundwork for future assessments of dyadic constructs, including empathy, in SSA.

Methods

Participant selection

The HoPS+ study protocol is described in detail elsewhere [5]. Briefly, trained local study personnel collected baseline age, sex, education, marital status, occupation, IRI, and Patient Health Questionaire-9 data using a REDCap® (Research Electronic Data Capture) survey administered to pregnant women living with HIV and their seroconcordant male partner from 24 clinic sites in Zambézia Province [5, 22].

This analysis included the first 666 couples, or 1332 individual participants, from all HoPS+ study sites beginning on the date of study initiation (November 16th, 2017) through June 13th, 2019, when data were downloaded from our REDCap® database [22]. Our final study population, after excluding 147 participants (42 complete couples and 51 additional individuals) for missing data, included 1185 individuals (567 complete couples and 51 additional individuals) from 24 sites (Table 1). This included 595 (50.2%) females and 590 (49.8%) males with a median age of 25 (Interquartile Range (IQR) 21–30) and 5 years (IQR 2–7) of education. The most common occupation was farming (46.9%). Three districts, Pebane (30%), Inhassunge (16.5%), and Namacurra (19.3%), were overrepresented in the validation sample as compared with the excluded sampled population (18.4, 1.4, and 8.8%% respectively, p-value < 0.001), likely because when we randomly selected HoPS+ sites, more were located in these three districts than the other two. Furthermore, because of their size, recruitment was initially faster than the more remote, smaller districts. Mungia (43.1%) and Chuabo (37.6%) were the most commonly spoken languages, based on the most popular language(s) in each study district.

Table 1 Patient Demographic Information

Setting

Zambézia Province, located in north-central Mozambique, is home to approximately 4.4 million people from five primary ethnic groups (Chuabo, Macua-Lomwe, Manhaua, Merenge, and Senas) who speak at least four languages in our study area [5]. It has some of the poorest health and development indicators in Mozambique. Mozambique’s literacy rate is 47%; only 28% of women are literate (vs. 60% of men) and these numbers are lower in rural communities [23]. The majority of inhabitants are subsistence farmers [23]. Nationwide, 40% of the population live in poverty, but 80% of those poor live in rural areas like Zambézia [24]. In addition to these contextual conditions, the HIV prevalence in the province is estimated to be 15%, one of the highest in the country [2].

Scale translation and adaptation

This is a novel setting for employing the IRI, and this population presented unique challenges in study implementation. Measures were translated (and back translated to confirm meaning was maintained) from English to Portuguese (a shared language among translators, and a commonly spoken language in Mozambique) and then from Portuguese to Muniga, Chuabo, Lomue, and Nharringa. At least seven trilingual interpreters carefully reviewed each study question and made modifications relevant to the local sociocultural and linguistic context. Specific phrases, including feeling “touched” were not translated verbatim, but were replaced with similar, locally relevant concepts. The final measure was subsequently field tested at each study site before enrolling participants. During interviewer-assisted survey implementation (due to low levels of literacy among participants), statements and response categories were read aloud in each participant’s preferred language on enrollment. Responses were captured by the study counselor. Twenty-four trained counselors fluent in the local language and Portuguese were trained to capture participant responses over two 5-day training sessions. All counselors were supervised in the field by a study manager, who provided regular booster trainings to ensure consistency of survey delivery.

Instruments

Interpersonal reactivity index

The IRI consists of four empathy domains with seven questions each (28 questions total) [11]. The fantasy scale (FS) assesses one’s ability to place oneself in fictional situations; the perspective-taking scale (PT) reflects one’s ability to understand another person’s point of view; the empathic concern scale (EC) measures one’s ability to have caring feelings towards another individual; and the personal distress scale (PD) characterizes an individual’s own negative feelings when witnessing adverse events in others [11]. The fantasy and perspective taking scales constitute the cognitive component of empathy, while the empathic concern and personal distress scales constitute the affective component of empathy [11]. More recent research, albeit conducted in North America, further supports that distinct cognitive and affective empathy domains undergird the IRI scale [25,26,27,28,29]. This includes the development of two-factor empathy scales [25, 28] and imaging and molecular research that suggest distinct, but interrelated, cognitive and affective neural circuitry [27, 29].

Each item is scored on a 5-point Likert-like scale ranging from “Does not describe me well” (0) to “Describes me very well” (4). Although the original IRI contained nine reverse scored items, in the above-described scale adaptation, all questions were positively phrased and scored to avoid confusion during translation and survey administration as well as to improve response accuracy. Previous studies report Cronbach’s alpha values for IRI subscales from 0.70–0.83 and correlation coefficients of 0.01–0.37 between subscales [11, 30,31,32].

Patient health Questionaire-9

The Patient Health Questionaire-9 (PHQ-9) measures the nine attributes that characterize major depressive disorder [33]. Participants rate each attribute from ‘Not at all’ (0) to ‘Nearly every day’ (3) and were considered depressed if they scored 10 or greater. The PHQ-9 has been validated to screen PLWH in SSA for depressive symptoms [34,35,36] and has been used to measure depressive symptoms in Mozambique [37]. Participants who disclosed suicidal ideation (item 9 on the PHQ-9) were immediately assessed by our trained counselors and referred to the psychologist based at each site for additional psychological services.

Missing data

We excluded participants with missing IRI data on more than eight questions (~ 30% of answers), an average of more than two questions per subscale (n = 147; 11%). We believed that these criteria excluded potentially biased data from interactions among interviewer-interviewee pairs who had difficulty administering or understanding the survey, given the low levels of education among our participants. Those missing more than one PHQ-9 item (n = 124; 10.5%) were excluded from our analysis as described above. We used a stricter threshold for the PHQ-9 because it has previously been used in Mozambique and because it was a shorter measure.

Missing IRI and PHQ-9 data for the participants with eight or fewer missing IRI items (n = 276; 23.3%) and one or fewer missing PHQ-9 items (n = 112; 9.4%) were imputed over 10 data sets using non-missing empathy survey questions with the multivariate imputation by chained equations (mice) package version 3.4.0 [38]. We used a chi-squared test to assess the differences between included and excluded participants by sex, marital status, district, highest education, occupation, and predicted language (based on the most frequently spoken language in each district because we did not collect individual level data on language).

Included (n = 1185) and excluded (n = 147) participants did not statistically differ by age or occupation (Table 1). However, there were statistically significant differences by relationship status – a higher percentage of included participants self-identified as in a domestic partnership (37.8 to 24.5%) and fewer self-identified as married (21 to 33.3%). Included and excluded participants also had different levels of education and were differentially representative of districts (p < 0.001) and predicted languages (p < 0.001) (Table 1).

Data preparation

IRI questions were translated into several new languages and administered to participants in Zambézia, who were markedly different from the college-educated and junior high school students in previous IRI validations [19, 20, 30,31,32]. Although it has been used in South Africa as a composite 28-item scale or in its 4-subscale form [39, 40], to our knowledge the IRI has never been validated in or adapted to SSA. Given the new context, language translation, and rephrasing of negatively worded items, we did not feel comfortable making the a priori configural assumptions necessary to start with a confirmatory factor analysis of the IRI. We therefore hypothesized that the factor structure might differ from the previously identified four-factor structure [11]. We randomly split, without replacement, the full sample of 666 couples into two groups to identify (in the first phase) and then confirm, via confirmatory factor analysis (in the second phase) the factor structure of the IRI in discrete dyadic samples. Data were treated as ordinal in both EFA and CFA validation analyses. All data cleaning and analysis was conducted in R, version 3.5.1 (R Foundation for Statistical Computing) (2018-07-02) [41].

Exploratory factor analysis

The exploratory factor analysis (EFA) included 400 individuals (200 dyads split for the analysis). After re-examining the question translations for this analysis, we removed questions number 4 and 9 from the pool of items subjected to analyses, due to discrepancies after translation that changed the meaning of the questions (Supplementary Table 1). We performed maximum-likelihood exploratory factor analyses on the polychoric correlation matrices of two, three, and four-factor solutions consistent with the affective and cognitive attributes of empathy and IRI for couples scale (two factor-model suggested by extant theory) [19, 25,26,27,28,29], the results of a parallel analysis on the imputed datasets (three factor-model suggested by parallel analyses) [42, 43], and previous versions of the IRI (four factors in the initial validation) [11]. We used an oblique promax rotation and selected items for a particular factor if the loading was greater than 0.40 and unique to one factor (i.e., the question did not load greater than 0.40 for another factor; if so, it was discarded). We assessed internal factor consistency with Cronbach’s alpha to compare our results with previous IRI psychometric evidence.

Dyadic confirmatory factor analysis and measurement invariance

We used a dyadic confirmatory factor analysis (CFA), which included 466 dyads distinct from those analyzed in the EFA, to capture the interdependence between partner responses to items on the IRI (Fig. 1). This perspective presupposes that the latent construct being modeled achieves measurement invariance [44]. This means that the latent construct — empathy in this case — means the same thing and is measured the same way across partners. Thus, measurement invariance needs to be tested as part of the dyadic model validation. We examined three progressively restrictive levels of measurement invariance — configural invariance, metric invariance, and scalar invariance — using latent variable analysis (lavaan version 0.6–3) [45]. Statistical power for invariance testing may be “reasonable” with a sample of 200 or more dyads and “adequate” with 400 dyads [46]. Robust absolute and incremental fit indices are reported using standard benchmarks [47,48,49,50].

Fig. 1
figure 1

A Conceptual Diagram of the Dyadic Empathy Confirmatory Factor Analysis. Cognitive and Affective dimensions of empathy are uniquely modeled for male and female partners. Dashed lines from each empathy construct to the corresponding items (boxes) indicate the relation between the construct and the items used to model it. Circles pointing to items represent each item’s error term. Covariances, represented by the curved and dashed lines between the constructs, capture the relationships between partners’ dimensions of empathy. Double-headed arrows between error terms capture the interdependence between partners’ responses to survey items

Configural invariance assesses statistical equivalence of factor loading patterns across both partners. Configural invariance provides evidence that, when underlying latent constructs are measured between partners, items are organized in a similar fashion (i.e., the same set of items measure the same latent constructs for both partners). Constructs exhibiting metric invariance confirm statistically equivalent factor loading values across latent constructs and actors. Metric invariance provides evidence that items are similarly related to the underlying constructs across both actors. Finally, scalar invariance confirms that items are expected to have the same value across actors when the corresponding latent construct equals zero. In other words, the scaling of items is equivalent across partners. Scalar invariance indicates whether the amount of variation in each group is equivalent, and that each groups’ mean differences are interpretable [51, 52]. Power analyses for invariance testing suggests the use of changes in alternative fit indices (ΔAFI). Because the sensitivity of model fit indices may vary by sample size and model specification, simulations on sample sizes larger than 300 suggests evaluating model differences via changes in the confirmatory fit index (ΔCFI) between .002 and .01, and changes in the root mean squared error of approximation (ΔRMSEA) between .007 and .015. ΔRMSEA has been shown to be more sensitive than ΔCFI [46, 53]. Thus, due to the uncertainty of the novel context and language translation, we chose to use the more conservative options of ΔCFI > .002, and did not consider ΔRMSEA (though we do report it) for evaluating measurement invariance.

We report adjusted (robust) and unadjusted fit indices for each configural, metric, and scalar invariance model (see Table 4). Robust fit indices reflect modifications to traditional fit indices to account for failure to meet distributional assumptions, such as non-normality [47, 48]. These adjustments to absolute (Root Mean Squared Error of Approximation - RMSEA) and incremental (Comparative Fit Index (CFI) & Tucker Lewis Index (TLI)) fit indices make use of the Satorra-Bentler scaling constant to modify the equations of the naïve fit estimates [54]. Simulation studies have found these adjustments to be robust across a variety of unmet distributional assumptions and attenuate the overestimation of fit quality.

Demographic comparisons of cognitive and affective empathy

To further probe the construct validity of the resulting factor model, we assessed inter-scale correlation, correlation between subscales and the continuous PHQ-9 scores, and the relationship between each scale and sex, age, education, and depression. We used unpaired t-tests to assess the relationships between each subscale and the dichotomous measure of sex, age (> 26 years-old), education (> 7 years), and depression (> 9 PHQ-9 score is depressed). We hypothesized that participants who were older (> 26) would have lower empathy scores given their exposure to the war of independence and/or civil war and its sequelae which devastated Zambézia Province [55, 56]; that women would have higher empathy scores consistent with previous validations [30,31,32]; that those with more depressive symptoms – due to a higher degree of internalization of difficult emotional situations [57,58,59,60] – and that participants with higher levels of education would have higher levels of empathy, which has been observed in other settings [55]. We then assessed mean empathy subscale scores by district and predicted language with a one-way analysis of variance.

Results

Exploratory factor analysis

All the exploratory factor analyses (343 individuals) resulted in factor structures that mixed items from the four IRI subscales, and, to a lesser extent, between the cognitive and affective attributes of empathy (Table 2). In the absence of a theoretical framework to explain mixing of cognitive and affective items, we selected the two-factor solution (distinct cognitive and affective empathy factors) to minimize these discrepancies. This aligns with more recently developed empathy scales and the number of factors on the IRI for couples scale, which allows for improved interpretability of the two-factor solution instead of the three- or four-factor solutions, which had subscale mixing that limited interpretability [25, 28]. Across ten imputed datasets, factor correlations ranged from 0.56 to 0.59. We then removed cognitive questions that loaded on the predominantly affective scale and vice versa to maintain the integrity of previous empathy scale frameworks [11, 30,31,32].

Table 2 Interpersonal Reactivity Index Exploratory Factor Analysis Question Loading Across Ten Multiply Imputed Iterations (n = 343)

The first factor contained ten items that crossed the 0.40 factor loading threshold in at least two of the ten multiply imputed datasets. Seven of those ten items were from cognitive subscales (fantasy and perspective taking). This seven-item subscale had an intra-scale Cronbach’s α of 0.78 and represents the cognitive empathy subscale (Table 3). The second factor contained nine items that crossed the 0.40 threshold in at least two of the imputed datasets. Six of those nine items were from affective subscales (personal distress and empathetic concern) with an intra-scale Cronbach’s α of 0.73. These six items represent the affective empathy subscale (Table 3).

Table 3 Intra-Scale Correlation for the Cognitive and Affective Scales

Confirmatory factor analysis and measurement invariance

Following the two-factor solution found by the EFA, we sought to confirm this factor structure within partner dyads using dyadic CFA and invariance testing. We did this by imposing gradually increased levels of measurement invariance (Table 4). Results were aggregated across ten multiply imputed data sets. First, we found that the configural model (n = 396 dyads, 792 individuals) – where the constellation of items and latent construct was compared for male vs female partners – offered the following fit to the data: CFI = 0.903, TLI = 0.887, RMSEA = 0.059. Second, we found that the model constrained to metric invariance had a similar, but marginally (and unexpectedly, because more constrained models generally are not a better fit to the data) better fit than the baseline configural model (CFI = 0.914, TLI = 0.904, RMSEA = 0.056, ΔCFI = 0.011 > 0.002). Lastly, we found that the scalar model provided a satisfactory fit, but did not fit the data as well as the metric model and did not exhibit scalar invariance (CFI = 0.897, TLI = 0.903, RMSEA = 0.057, ΔCFI = 0.017 > 0.002) [46]. Therefore, we determined that the two-factor model exhibited metric, but not scalar, invariance. Factor correlations ranged from 0.03 to 0.31 across imputed datasets.

Table 4 Adjusted and Unadjusted Fit Indices for Dyadic Invariance Testing Confirmatory Factor Analysis

After EFA, application of the underlying IRI theoretical framework, and dyadic CFA, our finalized empathy instrument included 13 of the 28 original IRI items. The cognitive and affective subscales suggested by EFA and supported by dyadic CFA were moderately correlated within each participant (Pearson’s rho: 0.621, 95% CI: 0.585, 0.655) (Table 3).

Demographic comparisons

The exploratory and dyadic confirmatory factor analyses suggested a two-factor solution. To further probe the construct validity of this two-factor IRI model, we examined demographic and psychological differences on this adapted measure. These comparisons are confounded by the inability to establish scalar invariance (i.e., men and women interpreted the scaling of the items differently). Any statistically significant sex differences should therefore be interpreted with caution. We found unexpectedly lower mean scores on the cognitive subscale among women (Mean difference 95% CI: − 0.205, − 0.026) and those with a PHQ-9 score of 9 or lower (not depressed) at baseline (Mean difference 95% CI: − 0.352, − 0.120) and higher mean cognitive empathy score in participants 26 or younger (Mean difference 95% CI: 0.010, 0.196) (Table 5). We observed lower affective empathy scores among females (Mean difference 95% CI: − 0.186, − 0.0004) and those with more than 7 years of education (Mean difference 95% CI: − 0.247, − 0.024) and higher affective empathy scores in participants 26 or younger (Mean difference 95% CI: 0.010, 0.200) (Table 5). There is at least one district and one language that has a mean affective and cognitive empathy score that is statistically different from the others (Table 5). There was a low positive correlation (rho = 0.152, 95% CI: 0.093–0.211) between continuous PHQ-9 scores and cognitive empathy (Table 3).

Table 5 Demographic Comparisons of the Cognitive and Affective Scales

Discussion

Our exploratory factor analysis, consideration of the distinct cognitive and affective domains of empathy, and dyadic CFA suggested that a refined, shortened IRI scale is an acceptable measure of empathy in this population of adult partners living with HIV in Zambézia Province, Mozambique. Instead of the original four factor structure, exploratory factor analyses suggested, and the CFA with dyadic invariance testing confirmed, two empathy subscales: an affective subscale (with items assessing personal distress and empathetic concern) and a cognitive scale (with items assessing perspective taking and fantasy), which aligns with more recently developed empathy scales and recent data that describe the neurological processes underlying empathy [25,26,27,28,29]. The obtained subscales were moderately correlated with each other and had comparable intra-scale reliabilities among women and men.

This analysis takes an important initial step toward advancing our capacity to assess empathy changes associated with behavioral interventions in SSA, building upon insights from one province in Mozambique. This paper assesses empathy within romantic partners via a dyadic approach to more accurately capture the interdependence of this relational process. In turn, this yields a more precise assessment of partner empathy, which augments our capacity to assess how the HoPS+ intervention (in this case) or other couples-based interventions may change partner empathy, which itself may contribute to positive “downstream” treatment outcomes. For example, more empathic partners may better enable adherence to medication protocols, attending clinic care, and emotional support. This may also lead to interventions that target this important mediator of health intervention fidelity [13,14,15,16,17].

Our analysis established metric invariance, suggesting that the IRI items loaded similarly for male and female partners within the dyadic CFA analytic framework. Despite having a satisfactory fit, scalar invariance could not be established because of a statistically significant decreased fit from the metric invariance model (ΔCFI = 0.017 > 0.002). While the CFI and TLI fall below the commonly targeted thresholds of model fit proposed by Hu & Bentler [49], we feel the use of robust adjusted estimates, adoption of conservative ΔAFI thresholds, translation into four new languages, and application to an Mozambican sample with a median 5 years of formal education (as opposed to college-educated college-aged students) makes this fit a satisfactory introduction of this scale to a new population. More work can and should be done to further refine the measurement of cognitive and affective empathy in SSA.

Although our two-factor IRI had higher inter-scale correlations (0.62) within each participant than previous IRI or IRI for couple validations (range of 0.01–0.44), we had very similar intra-scale correlations and the dyadic CFA showed inter-scale correlations of 0.03–0.31 among partners [11, 19, 20, 30,31,32]. We cannot rule out that the higher intra-scale correlations are due to the new context for IRI questions.

The lack of scalar invariance across sex suggests that partners may exhibit different baseline values, limiting the degree to which mean differences (t-tests) can be interpreted across partners. In practice, this implies that partners with a latent empathy value of zero should not be expected to have the same average response on survey items. Thus, results from sex comparisons of empathy levels (e.g., men were observed to have higher levels of affective and cognitive empathy) ought to be considered with this in mind. Furthermore, similar empathy across both sexes aligns with our experience working with the communities in Zambézia Province. Higher empathy among younger individuals and higher empathy among those with more education aligns with previous research [55]. In contrast to a review that suggests higher affective empathy with depression – secondary to an increased focus on the self during a depressed state [58], we found no meaningful differences in affective empathy among patients with and without depression. However, our sample size is more than double that of all the reviewed studies combined and our participants are members of the community instead of mixed (hospitalized, on medications, etc.) and therefore are unlikely to act in the same way as those previously described [58].

Our study has several other limitations. Due to translation discrepancies, we excluded two items from the original IRI prior to starting our analyses. We excluded additional items when they loaded on the “incorrect” subscale (cognitive items on affective and vice versa), however, we feel it was reasonable to apply substantive considerations in this way, given the differences between this population and the populations on which the IRI was originally validated. Furthermore, having a more succinct scale will reduce missing responses when this scale is used in other settings. Additionally, about 11% of our participants were excluded from the analyses because they had more than two missing items per subscale. This resulted in a reduction of our sample size and statistical differences between the included and excluded participants.

Finally, the inclusion of four language groups in seven study districts resulted in substantial variability without a sufficient sample to conduct language- and/or district-specific analyses. Though we would have liked to have accounted for dyads within a language- or a district-nested model, the relatively low number of strata and sample sizes within some strata prevented these multilevel analyses. As our data collection continues, this may be a possibility for future research.

Conclusions

In conclusion, our findings provide support for a two-subscale version of the IRI that measures cognitive and affective empathy among HIV-positive adults living in Zambézia Province, Mozambique. Given the association between empathy and health intervention fidelity, this new scale should be useful in assessing the effectiveness of interventions designed to increase social support among couples, family, and groups. This scale validation will help us measure the effect of the HoPS+ intervention on male and female empathy within the context of partner dyads, which, in turn, is hypothesized to impact adherence to treatment, retention in care, and maternal-to-child HIV transmission. Furthermore, this dyadic approach provides inroads to assess how constructs such as empathy, stigma, physician trust, and social support (other constructs measured in HoPS+) are related to each other and positive treatment outcomes for PLWH. Future applications and validations will allow health researchers to develop interventions that target empathy within partner dyads as a mediator of health intervention uptake in HIV and other chronic health conditions. Subsequent scale validations/adaptations would benefit from testing this scale in different languages and in different regions in Mozambique and/or SSA.