Introduction

Previous studies on cohorts of rheumatoid arthritis (RA) patients have examined gender differences on parameters such as the course of disease [1], paraclinical markers (RF/anti-CCP status) and other clinical characteristics [2], treatment response [3], symptom burden [3], reporting of symptoms [4] and coping strategies [5].

According to both Danish and international guidelines, methotrexate (MTX) is the first choice of treatment in rheumatoid arthritis (RA) [6], with studies demonstrating a reduced risk of death in patients treated with MTX [7]. Treatment commenced within the first year of diagnosis has proven essential for disease and symptom control [8], targeting low disease activity and eventually remission [9]. Further, one recent study concluded that patient’s belief of illness improved after 1 year of treatment with MTX [10]. However, a large group of RA patients does not respond clinically to MTX or develop gastrointestinal or liver toxicity issues [11]. Despite clear recommendations for MTX use and prescription, dosing of MTX for RA patients seems to be underutilized or suboptimal [12].

There is no agreement in the literature on gender differences in the treatment effect of MTX. While some studies report men responding better to MTX in both early and established RA [13, 14], others report a low response to MTX in women [15]. Such differences have been attributed to hormonal factors [16] or a generally worse global prognosis and increased disability in women with RA [17]. Other studies proved no gender differences concerning MTX response [16, 18].

Depression is one of the most prevalent comorbidity factors amongst patients with RA, and the presence of clinical depression may negatively influence patients’ experience of RA [19], coping ability [19] and treatment adherence [20]. Feelings of loneliness may also be important in an RA trajectory [21]. Loneliness may thus indicate a risk for poorer health, including depression and increased perception of pain [22], and it can negatively affect treatment adherence [23].

Sexual dysfunction can emerge as a consequence of RA symptoms such as fatigue, pain and joint stiffness because of side effects from medical treatment or due to psychological factors such as distress, depression or loneliness [24].

Associations between RA and PROMs, such as depression, loneliness, physical function and sexual functioning, are complex. However, characterizing male and female MTX users and non-users concerning their mental, physical and sexual functioning may improve our understanding of various adverse RA trajectories. Against this background, this study investigates sexual functioning, loneliness, depression, fatigue and physical function in RA patients who, respectively, use and do not use MTX. This may shed light on the emergence of clinically meaningful adversity and vulnerability in specific subgroups of RA patients. Furthermore, the study aims to investigate possible gender differences and gender-specific associations about sexual functioning, loneliness, depression, fatigue and physical function in subgroups of both MTX users and non-users.

Material and methods

Study design and patient involvement

The present study was cross-sectional, with consecutive enrolment of patients with RA. As PROMs, the study used selected validated questionnaires (described in detail in the next section) distributed electronically to participants as a joint survey when accepting participation in the study. The electronic survey design was discussed with two patient research partners affiliated with the department of rheumatology, but the questionnaires were not changed.

Data from questionnaires were supplemented with clinical and paraclinical data (described in detail in the next section) collected retrospectively from medical records at each participant’s diagnosis and again after 1 year. Patients were recruited at the outpatient department of rheumatology at a Danish University Hospital, and the electronic survey was distributed to 380 patients between April and September 2018. The collection of data ended in October 2018. Data from the medical records were collected using a structured guide.

Further, the STROBE guidelines were used in the reporting of the study [25].

Participants

Inclusion criteria included people over 18 years diagnosed with RA (diagnosis codes: M059, seropositive RA; M060, seronegative RA and M069 RA without specification) who could speak and understand Danish without cognitive impairment. Also, inclusion required the availability of medical record data regarding MTX treatment/non-treatment 1 year after diagnosis.

A sample size of 150–200 participants was considered sufficient based on previous cross-sectional studies involving, e.g., the Health Assessment Questionnaire (HAQ) score [26], Becks Depression Index (BDI) [27] and Bristol Rheumatoid Arthritis Fatigue Numeric Rating Scale (BRAF NRS) [28]. All participants signed an informed consent form following both oral and written information and procedures which align with the ethical standards of the Danish Code of Conduct for Research Integrity [29].

Data collection

Retrospective evaluation of medical records

The following data were collected from the electronic patient record at two points in time: (1) treatment with MTX or other synthetic and biologic disease-modifying antirheumatic drug (DMARD) 1 year after diagnosis and (2) seropositivity with IgM-RF- and/or anti-CCP-status; CRP; number of tender joints (NTJ); number of swollen joints (NSJ); Disease Activity Score (DAS)28CRP (based on 28 tender and swollen joints and level of CRP); Charlson’s Comorbidity Index (CCI) and Health Assessment Questionnaire (HAQ) including fatigue (visual analogue scale, VAS), pain (VAS), Patient Global Assessment (PtGA) (VAS) and Physician Global Assessment (PhGA) (VAS) at the time of diagnosis. Disease duration was calculated from the date of diagnosis in the medical record to the date of answering the electronic survey. Data were collected from the electronic medical record, where all clinical and paraclinical data were documented prospectively.

Questionnaires included in the electronic survey

Patient-related outcomes from the questionnaire

The questionnaire included 11 sociodemographic items: age, gender (male, female and other), cohabitation status (living alone and living together with children and/or other adults), employment status (working and not working), years of education (< or > 12 years of education), daily smoking (cigarettes/cheroots/cigars/pipe), weekly alcohol consumption (units of alcohol/week) (the cut-off values for alcohol consumption being > seven units of alcohol per week (women) and > 14 units of alcohol per week (men), height (m), weight (kg) and body mass index (BMI) (weight(kg)/height(m)2) and comorbidity measured by response categories from Charlson´s Comorbidity Index (CCI).

Changes in sexual functioning questionnaire (CSFQ-14)

CSFQ-14 is a 14-item gender-specific questionnaire examining changes in sexual functioning due to illness and/or treatment in five domains (desire/frequency, desire/interest, arousal/excitement, orgasm/completion and pleasure). The CSFQ-14 uses a 5-point Likert scale, and the scores range from 14–70, with low scores reflecting a high extent of sexual dysfunction. The cut-off point indicating sexual dysfunction is 47 for men and 41 for women [31]. The CSFQ-14 has been validated in Danish and in a Danish setting [32].

The University of California Los Angeles (UCLA) loneliness scale

The UCLA Loneliness Scale is a 20-item questionnaire with responses on a 4-point Likert scale (never, rarely, sometimes or always) [33]. Scoring ranges from 20 to 80 points, with a higher score reflecting higher self-rated loneliness [33]. There is no cut-off point, but some studies have categorized the UCLA scores into four categories: mean score 20–34 (low degree of loneliness), 35–49 (moderate degree of loneliness), 50–64 (moderately high degree of loneliness) and 65 to 80 (high degree of loneliness) [34].

The UCLA Loneliness scale is considered the gold standard for measuring loneliness indirectly and has good psychometric properties [35].

Beck’s depression inventory (BDI)

The BDI is a 21-item questionnaire with responses on a 4-point Likert scale (none, mild, moderate and severe) measuring symptoms of depression. Scores range from 0–63, with high scores reflecting a higher extent of depression [36]. Diagnostic categories are ≤ 13 (none or minimal depression), 14–19 (mild depression), 20–28 (moderate depression) and ≥ 29 (severe depression) [36]. The revised version of the BDI was tested on 248 outpatients (Cronbach's alpha = 0.86) [37]. It has been translated into Danish.

Bristol rheumatoid arthritis fatigue numerical rating scale (BRAF NRS)

The BRAF NRS contains three items (fatigue, effect on life and coping ability) measured from 0–10 on numeric rating scales: i) fatigue severity describing the average level of fatigue (no fatigue–totally exhausted), ii) effect of fatigue on your life (no effect–a great deal of effect) (items are measured from 0–10, with 0 being no problems) and iii) coping with fatigue (not coping at all well–coping very well) (items are measured from 0–10, with 10 indicating no problems in coping with fatigue) [38]. The BRAF-NRS has been validated in Danish [39].

The health assessment questionnaire (HAQ)

The HAQ score is a widely used scoring instrument to measure physical disability in rheumatology. It covers eight domains (dressing, arising, eating, walking, hygiene, reach, grip and daily activities), and it has 20 items with four response options each (without any difficulties, with some difficulties, with many difficulties and unable to do). Scores range between 0 and 3, with a higher mean score indicating increased severity [40]. Three visual analogue scales (VAS) (range: 0–100 mm) on pain, fatigue and the total impact of RA (PtGA VAS) are included in the HAQ instrument but not part of the eight domains and, therefore, not included in the score calculation. The HAQ has been validated repeatedly [41], including adaptation in Danish and a Danish setting [42].

Statistical methods

A prespecified statistical analysis plan (SAP) was established with a statistician. Linear regression models were applied to compare the numerical outcome measures (HAQ, BRAF-NRS, BDI, UCLA Loneliness Scale and CSFQ) between male and female patients as well as between MTX users and MTX non-users, and in a combined model including an interaction between gender and MTX use. Hence, all our analyses were performed on the whole cohort, considering the desired comparisons by specifying proper interaction structures between gender and MTX status. Furthermore, logistic regression was applied for current smoking and alcohol consumption as outcome measures comparing MTX users and MTX non-users. Unadjusted analyses were conducted, as well as analyses adjusted for patient age, gender, years of education, employment and cohabitation status.

Due to non-normally distributed residuals in the linear regressions (evaluated by quantile–quantile plot and Shapiro–Wilks test resulting in p-values < 0.001 for most models), bootstrapping with 1,000 repetitions was applied to estimate confidence intervals and p-values. Interaction terms between gender and MTX usage were used to investigate possible gender differences in the secondary outcomes.

The main analysis was conducted on available data to manage missing data. The analyses were repeated for smoking and alcohol consumption by ordinal logistic regression considering the amount of smoking/alcohol consumption. Furthermore, we repeated the analyses on measured pain, fatigue and PtGA scales and for CCI at enrolment rather than at the time of diagnosis.

All statistical analyses were performed using STATA.

Results

Of the 380 participants invited, 286 patients (69 men and 217 women) were included in the analyses (Fig. 1). Participants’ mean age was 45.0 (SD 10.8) years at the time of diagnosis and 56.6 (SD 10.0) years at the completion of the survey. The mean disease duration was 12.1 (SD 8.3) years. Most participants had received more than 12 years of school education (73.0%). Half of the respondents were currently unemployed (50.1%), and one-fourth lived alone (24.1%) (Table 1).

Fig. 1
figure 1

Flowchart of inclusion and analyses. Vertical lines show patient flow. Horizontal lines illustrate groups for comparison by statistical analysis: A Women compared to men in both groups/in general B MTX users compared with MTX non-users C Female MTX non-users compared to male MTX non-users D Female MTX users compared to male MTX users E Male MTX users compared to male MTX non-users F Female MTX users compared to female MTX non-users

Table 1 Clinical and demographic characteristics and patient-reported outcome measures

The demographic characteristics and disease activity measures are listed in Table 1. The majority of participants (67.8%) were MTX users (n = 194; 52 men and 142 women), while 92 participants (17 men and 75 women) were MTX non-users (Table 1).

Correlation between PROMs

As shown in Table 2, several PROMs were correlated. The HAQ total score collected in the survey correlated with VAS fatigue, VAS pain, VAS global, the HAQ collected at the time of diagnosis and the Beck Depression Inventory (BDI).

Table 2 Spearman’s correlation coefficients between patient-reported outcome measures (PROMs) selected at diagnosis and enrolment in the study

BDI correlated with all BRAF measures, the total UCLA score, VAS fatigue and VAS global.

The CSFQ did not correlate with other measures, and the UCLA Loneliness score only correlated with the BDI as mentioned above.

Comparing women and men (overall)

Overall, analyses adjusted for age, years of education, employment and cohabitation status found significant mean differences in several categories between women and men (Table 3, column A): women had a significantly higher mean score than men in the HAQ score at diagnosis (mean difference 0.4; CI 0.2–0.7) and in several other parameters at the time of enrolment in the study: HAQ score (mean difference 0.4; CI 0.2–0.5), fatigue score (VAS) (mean difference 8.8; CI 2.4–15.3), PtGA (VAS) (mean difference 7.4; CI 0.8–14.0), BDI (mean difference 2.8; CI 1.0–4.5), BRAF NRS severity (mean difference 1.0; CI 0.3–1.6) and BRAF NRS effect (mean difference 1.1; CI 0.4–1.7). Further, women had a significantly lower mean CSFQ score than that of men (mean difference -12.2; CI -14.8- -9.5) and a higher percentage of women than men in the MTX user group (60.2% vs. 36.5%) and in the MTX non-user group (57.2% vs. 23.5%) had a mean score indicating a sexual dysfunction (Table 1).

Table 3 Adjusted differences between women and men, MTX users and MTX non-users, female and male MTX users, male MTX users, and male MTX non-users, and female MTX users and female MTX non-users

Comparing MTX users and MTX non-users (overall)

The analysis adjusted for gender, age, years of education and cohabitation status revealed a significantly higher score on the UCLA Loneliness Scale amongst MTX users than amongst MTX non-users (mean difference 4.15; CI 1.5–6.9). No other significant differences were found between these two groups (p-values between 0.051 and 0.652) (See Table 3, column B).

Comparing women and men (MTX users)

Comparing the two MTX user groups (Table 3, column C), stratified by gender, we observed differences between women and men on several parameters: in women, we found an increased mean HAQ score (mean difference 0.4; CI 0.2–0.6), increased mean fatigue score (VAS) (mean difference 11.2; CI 4.0–18.4), increased mean pain score (VAS) (mean difference 8.7; CI 1.4–16.0), increased mean PtGA (VAS) (mean difference 9.2; CI 1.5–17.0), increased mean BRAF-NRS severity (mean difference 1.1; CI 0.3–1.8), increased mean BRAF-NRS effect (mean difference 1.1; CI 0.3–1.9), increased mean BRAF-NRS Coping (mean difference 0.9; CI 0.1–1.7) and decreased mean CSFQ score (mean difference -13.3; CI -16.3- -10.3) compared to men. Furthermore, the risk of smoking and consuming more alcohol than recommended by the Danish Health Authority was half for women in comparison with men (OR 0.5; CI 0.3- 0.9).

Comparing women and men (MTX non-users)

When comparing female MTX non-users with male MTX non-users (Table 3, column D), there was a significant difference in the CSFQ mean score, indicating increased sexual dysfunction in women (mean difference -9.2; CI -15.0—-3.5) compared to men. No other differences were statistically significant.

Comparing male MTX users with male MTX non-users and female MTX users with female MTX non-users

Comparing male MTX users with male MTX non-users and female MTX users with female MTX non-users (Table 1), no differences in mean age (57.1 vs. 57.6 and 55.9 vs. 57.5), the proportion of seropositivity (82.7 vs. 76.5 and 78.9 vs. 85.3) or mean DAS-28CRP (4.3 vs. 3.7 and 4.6 vs 4.7) were observed.

Both male and female MTX users had lower mean disease duration than that of male and female MTX non-users, respectively (9.5 vs. 12.9 and 9.8 vs. 18.2), and there was a difference between the HAQ mean score at the time of diagnosis in both the male and the female groups (0.7 vs. 0.4 and 1.0 vs. 1.6).

The psychosocial factors at enrolment showed significant differences between male and female MTX users and MTX non-users in UCLA Loneliness mean score (36.2 vs. 32.8 and 39.0 vs. 35.0), CSFQ mean score (49.1 vs. 46.7 and 36.4 vs. 37.4) and BDI mean score (7.0 vs. 5.6 and 10.5 vs. 8.9).

Men in the MTX user group had a lower mean BMI than that of men with MTX non-use (26.8 vs. 28.1), and a difference was also present in the frequency of current smoking (46.2% vs. 58.8%) and alcohol consumption per week (40.4% vs. 52.9%). However, adjusted analyses showed no significant change in any parameters or scores comparing men using MTX with men not using MTX (Table 3, column E). The results must be considered in the light of the small sample size included in these analyses.

More women in the MTX-user group than those in the MTX non-user group had a CCI of 0 (64.6% vs 45.3%), indicating no comorbidities at the time of diagnosis. The adjusted analyses found a significant change in the UCLA Loneliness score between female MTX users and female MTX non-users (mean difference 4.6; CI 1.5–7.7) (Table 3, column F).

Discussion

This study amongst RA patients investigated differences between MTX users and MTX non-users in selected PROMs and compared these between men and women. Participants were characterized by a low level of disability (mean HAQ score 0.8 at enrolment), a low comorbidity index, a low BDI and low scores of loneliness. In half of the participants, the CSFQ score indicated sexual dysfunction.

We found statistically significant gender differences in physical functioning, sexual dysfunction, depression and fatigue at the time of enrolment.

The results revealed a higher HAQ score for women than men, both at the time of diagnosis and at the time of study enrolment. Similar gender differences have been found in other studies, with higher scores reported by women in the HAQ, the fatigue score (VAS) and the PtGA score (VAS) [indicating a lower physical function], increased fatigue and a more negative disease impact in women than that in men [2, 43]. According to some studies, such gender differences might be explained by a tendency for female patients to evaluate symptoms more severely than male patients [4, 44], while men might tend to underestimate their symptoms [45]. The preponderance of female patients may result in outcome measures more likely being sensitive to women’s experiences. However, other explanations should be considered since clinical RA measurements may reflect biological differences (e.g., increased pain receptors in women), with a potentially more severe symptom picture in women [4, 46].

In line with other studies [24, 27], we found an increased Beck’s Depression Index (indicating more depressive symptoms) and a lower score in CSFQ (indicating more sexual dysfunction) in women with RA compared to men. A Danish study amongst the general population found a female predominance in depression, but differences were only significant concerning minor depression [47].

Gender differences in sexual dysfunction are present in the general population as well. For example, a Danish population study [48] found that 3.4% of partnered men vs 9.9% of partnered women reported hypoactive sexual desire disorder within the past 4 weeks.

The higher depression scores and low CSFQ scores in our study could add to the lowered well-being amongst female patients shown in some studies [27, 49]. However, other studies found no gender difference in the prevalence of depression amongst patients with RA [1, 50]. These differences may be due to scale measurement bias and would be worthy of further exploration using a male-specific depression scale that addresses ways through which men traditionally express depression (i.e., anger, aggression, substance abuse, self-sabotage and risk-taking behaviours) [51]. Again, this warrants future attention to the psychometric properties of the instruments used in clinical settings. Thus, clinicians should be aware that gender bias may be present when interpreting the scores of the HAQ or the BDI [43].

The multivariate analyses showed significant differences in self-reported loneliness between MTX users and MTX non-users. A relation between loneliness and MTX treatment (or other treatments for RA) has not previously been identified, while one study found living alone to be negatively associated with treatment adherence, whereas good mental health was positively associated with treatment adherence [52]. High loneliness scores may negatively impact medication taking and decrease adherence to treatment. In contrast, low loneliness scores could indicate more social support and help increase treatment adherence [23]. Since this sample’s overall loneliness scores were low, this must be considered when assessing the results.

The study found significant differences between women and men using MTX, with poorer outcomes for women in several parameters. A recent systematic literature review [53] analysed a large number of studies and several predictive models concerning MTX response within 3 to 6 months from the initial diagnosis, and it concluded that female gender, along with current smoking and/or rheuma-factor positivity, was a vague predictor of non-response to MTX. Almalag et al. [54] found female gender to be significantly associated with MTX intolerance and connected this difference to gender-specific pharmacokinetics. Female gender as a predictor of non-response/not using MTX aligns with our findings, although we did not test for RF positivity in the adjusted analyses.

In the adjusted analyses, smoking was found to be less likely in women than that in men––both in general and comparing male MTX users with female MTX users. This is in line with gender differences in smoking amongst the general Danish population [55]. Unlike one previous study [13], we did not observe any association between smoking and poor MTX use/response.

Our study reported significantly higher pain scores (VAS) in female MTX users than those in male MTX users. Results from other studies are contradictory regarding pain as a predictor of MTX response. One study found no impact on pain response [56], while an association of increased pain levels with a negative response to MTX therapy was reported in another study [54].

Comparisons between female and male MTX non-users showed increased disability measured by the HAQ score and increased sexual dysfunction for women not using MTX. These differences may reflect the general gender differences mentioned above since other studies, including ours, report increased sexual dysfunction in female RA patients(25) and increased HAQ scores in women compared to men [57]. Another explanation for the significant differences between women using MTX and women not using MTX (and female and male MTX users), along with the lack of differences between male MTX users and male MTX non-users, might be that the included PROMs lacked sensitivity to detect relevant changes in men.

The low number of male participants makes it difficult to reach firm conclusions on gender differences, and consequently, caution is needed in group comparisons. However, the study’s female-to-male ratio reflects the clinical population of RA patients (one male to four females).

Selection bias may have occurred, as participants were recruited after consultation in the outpatient department. This may have favoured participants with more disease activity/more severe arthritis. Conversely, patients declining to participate may have been experiencing more negative impacts of RA.

The data on MTX use could have been more detailed about reasons for MTX non-use, and data details were generally limited due to incomplete medical records. Further, the cross-sectional study design did not allow for exploring causal effects.

In conclusion, the results revealed better physical function at the time of diagnosis and a higher loneliness score in MTX users compared to MTX non-users in general. There were no significant differences in Beck’s Depression Index or the CSFQ score. However, women reported worse scores than men in several PROMs. Gender differences were also evident when comparing MTX users to MTX non-users, with female MTX non-users reporting worse PROMs than male MTX non-users. This may reflect that female RA patients are situated in more vulnerable positions than male RA patients. Still, it may also imply that the clinical measures used in rheumatology are more sensitive to adversities affecting women than men. Thus, gender differences should be considered when assessing patients’ symptoms and needs and when selecting or developing instruments to monitor various PROs.