Introduction

Patient reported experience measures (PREMs) are health care quality indices derived from the perspective of patients. Although patient experiences are distinct from more classic measures for clinical effectiveness (e.g. decrease in blood pressure) and patient safety, these measures are interrelated [1, 2]. Therefore, understanding what improves or undermines patient experiences can help achieve treatment goals and improve quality of care.

Social concordance is a concept of similarity based on demographic characteristics. Most research in this field has focused on the influence of ethnic concordance, gender concordance or language concordance on health care consultation outcomes. Studies focusing on gender concordance are inconclusive whether PREMs are affected by the doctor-patient gender composition. For example, female gender concordance can be associated with better communication [3], higher health care provider score [4], more harmonious connection [5] and more agreement on advice [6]. Gender concordance is also associated with patients experiencing more overall satisfaction [7], although this finding is refuted in a large comprehensive American study [8]. This study, containing over 100.000 patient experience survey’s, showed no association between gender concordance and Press Ganey scores (reflecting the likelihood of recommending the physician to others). Also, other studies find that gender concordance is not related to experienced participatory decision-making, encounter quality, trust, quality of life or quality of care [9,10,11,12,13]. Research on age concordance is scarce, but existing evidence suggests associations with enhanced health care quality perceptions [7]. In conclusion, whether doctor-patient concordance regarding gender and age can influence PREMs is still under debate.

In this study, we will investigate the influence of both gender concordance and age concordance on doctor-patient communication, patient involvement in decision making, experienced comprehensiveness of care and patient satisfaction in primary care. It is hypothesized that gender and age concordance are associated with higher scores on PREMs.

The extent to which gender and age concordance affect PREMs could also be culturally determined. In particular gender, which is a social construct, can have different interpretations among different countries [14]. Countries differ in gender norms (e.g. masculinity norms or ideas how man and women should relate to each other) [15], possibly influencing how gender concordance affects patient-doctor interaction. Therefore, we will investigate whether associations between gender concordance and PREMs vary between countries with a different degree of gender equality. It is hypothesized that in countries with a lower gender equality index (GEI), gender stereotypes are stronger resulting into a higher likelihood of gender concordance/discordance influencing PREMs. This aligns with the convergence hypothesis, which states that health inequalities between man and women decrease as gender equality increases [16, 17].

Methods

Data source

For this study, the ‘Quality and Costs of Primary Care in Europe’ database (QUALICOPC) is used, provided by the Netherlands Institute for Health Services Research (Nivel). The QUALICOPC is a multinational survey held among GPs and their patients, aimed at gathering extensive information on the participating general practices, the professional behavior of GPs and the experiences of their patients [18, 19]. Recently, this database has also been used by other researchers studying the association of migration concordance with PREMs [20].

Data collection procedure

Between 2011 and 2013, data was collected across 31 European countries, Australia, Canada and New Zealand. The primary objective was to establish a nationally representative sample of GPs in each participating country, with a predefined target sample size of 220 GPs per country (Cyprus, Iceland, Luxembourg and Malta had a target of 80 GPs). In most countries (19/34), this was achieved by drawing a random list sample from the national register of GPs. To prevent clustering, only one GP per practice was included. In countries where a national register was not available (5/34), a multistage sampling procedure was used. This involved combining registers from different regions within the country and subsequently selecting GPs through a random process. Notably, in larger countries with discernible variations in healthcare systems across regions (4/34), GPs were exclusively sampled from nationally representative regions. Conversely, in smaller countries (3/34), the entire GP population was approached for participation. In Italy and Norway, GPs were included by opportunity sampling. The overall mean response rate was 38%. Importantly, the participating GPs demonstrated representativeness concerning age and gender in comparison to the broader GP population within their respective countries [21].

In each participating practice, one GP and nine adult patients filled in a questionnaire. Patients who just had a face to face consultation with their GP were consecutively invited by trained fieldworkers to fill in a questionnaire about their experiences resulting from the consultation. This was continued until nine questionnaires per practice were collected. Participation was voluntary and informed consent was obtained. The average response rate of patients was 74%. In total, 63.887 patients completed a questionnaire and 7.438 GPs completed a questionnaire. 62.478 (97.8%) of the patient experience questionnaires were successfully matched with the GP questionnaires. All included respondents were 18 years or older and all information was made anonymous. More details about the study protocol, the questionnaire background and survey design have been published elsewhere [21,22,23].

Variables

Of the questionnaire directed to the GPs, only the information concerning gender, age and practice location (city/suburbs/town/urban–rural/rural) was used. The questionnaire directed to the patients consisted of statements which could be answered yes/no and were designed to measure, amongst others, the latent variables ‘doctor-patient communication’ (5 items) and ‘comprehensiveness of care’ (2 items). Moreover, 2 items were selected which were interpreted as measuring ‘patient satisfaction’. The variable ‘patient involvement in decision making’ was measured with one item (dichotomous). For the multiple item latent variables, scales were constructed using multilevel latent variable analyses in a four-level model (items nested within patients, nested within GPs, nested within countries). The reliability of these scales are reported in Appendix A. Gender concordance was considered a four category variable, containing these dyads: male/male, male/female, female/male and female/female (gender GP/gender patient). Age concordance was defined as a maximum age difference of five years between patient and GP, which is similar to how this construct was defined in other studies [7, 24]. Age discordance was divided into two groups: patients being substantially younger than their GP (‘younger patient/older GP’ dyad) and more involvement in decision making and patients being substantially older than their GP (‘older patient/younger GP’ dyad).

Countries were given a score on the gender equality index (GEI) using the indexation of 2015 published by the European Institute of Gender Equality [25], based on datapoints originating from 2012. No GEI scores were available for Iceland, Norway, Switzerland, Turkey, Australia, Canada, New Zealand and FYR Macedonia. Scores ranged from 50.10 (Greece) to 79.70 (Sweden), with higher scores corresponding with more gender equality. Countries were categorized into ‘low GEI’ (index < 55), ‘average GEI’ (index 55–65) and ‘high GEI’ (index > 65). Table 1 shows all countries and their corresponding GEI.

Table 1 Categorization of countries based on Gender Equality Index (GEI)a

Statistics

Due to clustering of observations at the level of countries and GPs, we performed multi-level analysis [26], using three different data levels: patients (1) nested within GPs (2) nested withing countries (3). We predicted doctor-patient communication, patient satisfaction and comprehensiveness of care using linear regressions and patient involvement in decision making using logistic regressions (with a fixed intercept and random effects at the country and GP level). Main predictors in our models are ‘gender concordance’ and ‘age concordance’ and the control variables are ‘GP age’, ‘patient age’, ‘chronic condition’ (yes/no), ‘origin of patient’ (native/not native), ‘education’ (primary/secondary/post-secondary), ‘self-reported level of household income’ (low/average/high) and ‘location of practice’ (big city/town/mixed rural/rural). Comparisons between low, middle and high GEI countries were made by adding interaction terms (gender concordance * GEI-categories) to the model. All analyses were performed using STATA 17.0.

Results

Descriptive statistics

About half of the questionnaires were filled in after the patient visited a male GP (47,8%) and most of the included patients were female (61.2%). Half of the consultations were gender concordant (54.3%) and 20.2% were age concordant. All gender dyad characteristics and age dyad characteristics are reported in Table 2. Notably, the age of the patient varies greatly between the three age dyads. Prevalence of one or more chronic conditions is also higher in the dyad with older patients and the dyad with younger patients contained more higher educated individuals.

Table 2 Descriptive statistics of gender dyads and age dyads

Scores for communication, satisfaction, involvement and comprehensiveness were negatively skewed (towards high scores). Figure 1 shows that especially in consultations with a female GP, there are notable differences in PREM-scores between concordant and discordant gender dyads. In consultations with a male GP, scores given by male and female patients are more or less equal. Figure 2 shows that patients aged younger than their GP report the lowest scores on communication, satisfaction, involvement and comprehensiveness, whereas patients aged older than their GP report better communication and higher satisfaction.

Fig. 1
figure 1

Score for each PREM per gender dyad

Fig. 2
figure 2

Score for each PREM by age dyad

Main results

Female concordance is associated with higher scores on communication (p ≤ 0.01) and more involvement in decision making (p ≤ 0.01) compared to the other dyads (Table 3). The female/female dyad is associated with higher patient satisfaction compared to the male/female dyad (p = 0.01) and the male/male dyad (p = 0.02), but not compared to the female/male dyad. Findings are similar for comprehensiveness of care: the female/female dyad is associated with higher scores compared to both dyads with a male GP (p = 0.00), but not compared to the female/male dyad. Patient satisfaction and comprehensiveness of care are thus associated with the gender of the GP (higher scores when the GP is female) and not with gender concordance.

Table 3 Multi-level multivariate regression predicting the associations between gender and age concordance on four PREMs

Age concordance was not associated with experienced communication. Consultations with an older patient/younger GP dyad were associated with higher satisfaction (p = 0.01) and less experienced comprehensiveness (p = 0.02) compared to consultations with age concordance. Consultations with a younger patient/older GP dyad were associated with lower involvement scores compared to consultations with age concordance (OR 0.92, p = 0.02).

In contrast to the former model (Table 3), there are no associations between gender concordance and all four PREMs in both low and high GEI countries (Table 4). In both low and high GEI countries, consultations with a female GP were associated with higher scores for communication and satisfaction. Male concordance was associated with lower involvement in decision making compared to female concordance in low GEI countries whereas in high GEI countries, both dyads with a female GP were associated with higher involvement in decision. There are no associations between gender dyads and comprehensiveness of care in low GEI countries, however consultations with a female GP were associated with better experienced comprehensiveness compared to consultations with a male GP in high GEI countries. Our findings do not support the hypothesis that associations between gender concordance and PREMs are stronger in countries with a low GEI.

Table 4 Multi-level multivariate regression predicting the associations between gender and age concordance on four PREMs, accounting for Gender Equality Index (GEI)a,b

Model properties

The extent to which the combination of independent variables used in the model (Table 3) explain the variance of communication, satisfaction and involvement in decision making on the country and GP level is limited (ranging from 0.3% to 4.4%). The model explains 15.7% of the variance in comprehensiveness of care on the country level. After adding the GEI variable to the model (and excluding the countries of which a GEI was not available), the variance of the communication, satisfaction and involvement in decision making was explained to a bigger extend (Table 4). This was especially true on the country level (reduction of variance ranging from 7.0% to 27.1%).

Discussion

Summary

Female concordance is associated with better communication and more involvement in decision making. The age concordant dyad is associated with more experienced comprehensiveness and less satisfaction (compared to the older patient/younger GP dyad) and with more involvement in decision making (compared to the younger patient/older GP dyad). Communication was not associated with age concordance. The hypothesis that countries with a lower GEI would demonstrate stronger associations between gender concordance and PREMs could not be supported.

Strengths and limitations

The combination of a large number of respondents gathered across a large number of countries and the ability to take into account the nested nature of our respondents, strengthens the reliability of our results. The response rates among GPs and patients, at 38% and 74% respectively, introduce a risk of selection bias. Although the participating GPs generally reflected the age and gender distribution of the broader GP population in their respective countries, the representativeness regarding other characteristics, such as ethnicity, remains uncertain. Additionally, it is possible that participating GPs have greater interest in research or have more time availability compared to their non-participating counterparts, potentially influencing our study outcomes. Varying GP response rates (e.g., less than 10% in Sweden, over 70% in Spain) impact result generalizability in those particular countries. Notably, one in four patients opted not to participate, with the reasons for non-participation uncollected. Possible explanations for patient non-participation may include time constraints and privacy concerns. The latter, in particular, raises the possibility that GP visits for specific sensitive issues (e.g., psychological or sexual problems) may be underrepresented in our study sample. However, the precise impact of this possible underrepresentation on the measured patient experiences is challenging to ascertain.

The surveys contained patient experiences reported directly after consultation, which strengthens their validity. Moreover, the questions used to operationalize patient experiences were derived from various validated sources. However, all of them consisted out of yes/no answers which is less nuanced than for example Likert-scales. The distribution of scores were negatively skewed meaning most respondents reported very high scores, which is a common finding in post-consultation evaluation studies [8]. Most PREMs were measured by integrating multiple items into a scale, creating latent constructs. Although this method helps to provide more valid variables, this also makes the interpretation of scores somewhat abstract.

We used the gender equality indexation of the European Institute of Gender Equality, which is among the most comprehensive indexations available. However, this report did not contain indexations for 8 out of the 34 countries included in this study. These countries were therefore excluded in our model, lowering the statistical power.

Comparison with existing literature

Research on gender concordance and patient experiences show mixed results. It should however be taken into account that there are a lot of derivates of patient experiences, making it challenging to compare these study results. Some studies find associations between female concordance and specific PREMs [3,4,5,6,7], such as physician communication or experienced agreement on advice. Other studies find no significant association between gender concordance and PREMs [8,9,10,11,12,13]. Some of these findings can be explained by lack of statistical power [9] or the use of too broad outcome variables [10]. Nonetheless, the study of Takeshita et al. [8] was large and comprehensive and showed no association between gender concordance and one specific PREM: the likelihood of recommending the physician to a friend (on a 1 to 5 scale). Although this appears to contrast our findings, it should be noted that our studied PREMs are dissimilar. Moreover, as in most studies finding significant associations, especially the female-female dyad appears to be linked with higher scores on PREMs. This distinction was not specifically reported in the study of Takeshita et al.

Associations between age concordance and PREMs are poorly studied. In an observational study, age concordance was part of the construct social concordance (also entailing race, gender and education concordance). Social concordance was associated with higher satisfaction of care, but it is not clear to what extent age concordance contributed to this finding [7]. Our finding that age concordance can be associated with patient experiences is consistent with the idea that proximity in demographics may positively influence patient experiences.

This is the first study which takes the cross-national variability of gender equality into account when focusing on gender concordance. We hypothesized that more gender equality would diminish effects of gender concordance. However, we did not find a pattern to substantiate this. Possibly, the impact of specific gender dyads on consultations outcomes mainly arises from interpersonal factors and are to a lesser extent related to sociocultural norms.

How social concordance between patient and GP influences health care consultations is not clear. However, there are multiple conceivable explanations. First, sharing demographic characteristics can give the persons involved a suggestion of shared identity, ideas and beliefs [5, 12, 27], smoothening for example shared decision making. Secondly, social concordance is associated with a more patient-centered communication style applied by GPs [28,29,30], which is marked by GPs more actively attempting to understand the patient perspective and reaching shared understanding. Lastly, doctors treating patients who are more like themselves might experience less professional uncertainty because they can better relate to the patient’s problems [5]. These processes could occur simultaneously.

We suggest that future research should focus on more qualitative views on what actually happens in the consultation room. For example, performing qualitative analysis on video recorded consultations could give more insight into consultations room dynamics within each gender dyad. Also, interviewing GPs or patients after a consultation could be an interesting opportunity to unravel how social concordance affects consultation room dynamics.

Conclusion

We conclude from our study results that age concordance and female/female gender concordance can positively affect PREMs regardless of the gender equality of the countries concerned, although differences were small. This contributes to the idea that the mere act of matching basic demographic characteristics, especially matching a female patient to a female GP, can have a beneficial impact on the experience of healthcare.