Introduction

Lumbar disc herniation frequently presents with lower back pain accompanied with sciatica [5]. It has a naturally favorable clinical course, with spontaneous recovery occurring in up to 87% of cases within 3 months [24]. Sometimes surgical treatment is warranted if acute symptoms are intolerable or neurologically severe or agonizing pain continues for an extended time period. Early surgery within 6 to 12 weeks after onset of symptoms leads to faster short-term recovery compared to conservative treatment, but the difference diminishes in 1-year follow-up [16].

Patients undergoing surgery for lumbar disc herniation suffer from disability and drastically lowered health-related quality of life prior to surgery, while after surgery, the health-related quality of life improves rapidly during the first year [8, 12]. However, previous studies suggest that little or no improvement occurs during the second year [6], and no improvement between 2 and 7 years after surgery [21]. In these studies, the patients’ health-related quality of life remained lower than reported general population values at all follow-up time points. If the health-related quality of life remains reduced for an extended period, the disease burden is considerable, especially for young patients as they have long expected life ahead of them.

Previously, we reported the long-term outcome of an adult patient cohort who underwent surgery for lumbar disc herniation below the age of 40 years [18]. Our results showed that despite a 26% need for further surgery during the median 18-year follow-up period, the patients were highly satisfied with the outcome of their surgery, and they demonstrated a favorable employment status and reported Oswestry Disability Index scores similar to previously reported normative values.

In the present study, we evaluated the health-related quality of life of the patient cohort 18 years after they had undergone surgery for lumbar disc herniation. To our knowledge, no studies have reported over a 7-year outcome after surgery for lumbar disc herniation measured by health-related quality of life. The primary objective of the study was to compare the health-related quality of life of the patient cohort to that of an age and gender–matched Finnish population sample. The secondary objective was to discover which patient characteristics predispose the patient to worse health-related quality of life compared to the other patients.

Methods

Data collection

The study was conducted in the Department of Neurosurgery at Helsinki University Hospital. Helsinki University Hospital (HUS) is a publicly funded hospital in southern Finland, consisting of 23 hospitals or clinics. The ethical committee of HUS approved the study prior to its initiation.

The patient cohort included all 316 adult patients below the age of 40 years without history of previous lumbar spine surgery who underwent surgery for lumbar disc herniation at the department during the period from 1990 to 2005 and responded to health-related quality of life questionnaires. The medical records of all patients undergoing surgery for lumbar disc herniation within the years 1995–2005 were examined to identify candidates for the study (n = 615). Patients with history of previous lumbar surgery as well as patients whose symptoms were not caused by a lumbar disc herniation were excluded (n = 89). All patients had radiological confirmation of lumbar disc herniation prior to surgery, and it was conducted with a magnetic resonance imaging (MRI). At this point, 526 candidate patients were identified. In addition to confirming eligibility to the study, the medical records were utilized to collect patient characteristics such as age, body mass index, duration of symptoms, smoking, and possible further lumbar spine surgeries during the follow-up period.

These patients were contacted to gather long-term follow-up data. Contacting patients nearly two decades after surgery was possible due to the Finnish population register system which allowed us to request the patients’ current addresses from the Digital and Population Data Services Agency. Subsequently, the patients were sent information on the study, the informed consent form, and two questionnaires.

The first questionnaire was the EuroQol-5D-3L (EQ-5D-3L) questionnaire which is a frequently utilized instrument to measure health-related quality of life [4, 7]. In the EQ-5D-3L questionnaire, five dimensions of health (mobility, self-care, usual activities, pain/discomfort, depression/anxiety) are assessed on a 3-level scale (no problems, moderate problems, extreme problems). The answers can be converted in a single value, EQ-index, by utilizing valuation sets. The valuation set ranges from 0 (worst possible health) to 1 (best possible health) [15]. Additionally, the respondents report their self-perceived health with a visual analogue scale, EQ-VAS, ranging from 0 (worst imaginable health) to 100 (best imaginable health).

The second questionnaire was a general questionnaire inquiring the patient’s smoking prior to the index surgery and nowadays, and possible further lumbar spine surgeries after the index surgery. The information on further lumbar surgeries was used to complement the data collected from the medical records. Non-responding patients were sent one reminder. Three hundred sixteen patients (60% of all candidates) responding to the questionnaires were included in the study. The data collection protocol is summarized in Fig. 1.

Fig. 1
figure 1

The data collection and propensity score matching protocol. In total, 615 adults under 40 years underwent surgery for lumbar disc herniation from 1990 to 2005. Of these patients, 89 were excluded because of history of lumbar surgery. The patients were contacted to collect follow-up data, and 316 patients responded. The patients who completed all parts of the EQ-5D questionnaire were matched with two same gender and age participants from Health 2011 study

Patient cohort

The total number of patients included in the patient cohort was 316, with a median follow-up of 19.1 (IQR 6.0) years (Table 1). The cohort included 171 males (54%) and 145 females (46%). The median age at the time of the index surgery was 33.5 years (IQR 7.4), and 28% of the patients were younger than 30 years at that time. At the time of the index surgery, the patients’ average body mass index (BMI) was 23.9 (SD 4.3) and 30% of the patients were smokers. The duration of symptoms prior to the index surgery was less than 6 months for 55% of the patients, 28% had symptoms for 6 to 12 months, and the symptoms had persisted for over 12 months for 17% of the patients. At the end of the follow-up period, 150 patients (27%) had undergone at least one additional lumbar surgery.

Table 1 Patient cohort characteristics

Of the initially identified 526 candidate patients, 316 (60%) returned the EQ-5D-3L questionnaire, and 310 patients completed every part of the questionnaire. The patients who completed the follow-up questionnaire had significantly different characteristics than the non-responding candidate patients. They were older both at the time of the surgery (p = 0.02) and at the follow-up (p = 0.004), more often female (p = 0.001), and non-smokers at baseline (p = 0.006) and had lower baseline BMI (p = 0.001) (Table 1).

General population sample and propensity score matching

A general population sample was used to compare patient cohort’s health-related quality of life to that of the general population. General population health surveys are conducted frequently in Finland by the Finnish National Institute for Health and Welfare (THL). THL carried out a large population study, Health 2011 Survey, in 2011 [1]. The sample of the Health 2011 Survey included 8135 individuals aged 29 years or over representing the Finnish adult population, and 5903 persons participated in the comprehensive study protocol, including a thorough health examination, laboratory tests, and multiple questionnaires. One of the questionnaires was EQ-5D-3L, allowing us to compare the health-related quality of life between our patient cohort and a general population sample.

To form a general population sample comparable to our patient cohort, propensity score matching was conducted to create an age and gender–matched general population sample [2]. Each member of our patient cohort who had completed every part of the EQ-5D-3L questionnaire (n = 310) was matched with two participants from the Health 2011 study population according to age and sex. The matching was done with the statistical software R (R Core Team, 2021) by utilizing the extension package MatchIt [11]. After propensity score matching, the matched general population sample consisted of 620 participants with the same median age (54 years) and gender distribution (46% female) as in the patient cohort.

Statistical methods

All statistical tests were conducted with R, version 4.0.5. A t test was done to analyze the difference in means of independent continuous variables between two dependent categorical variables, and the Levene test was run prior to the t test to choose the correct assumption of equality of variance for the t test. One-way ANOVA was utilized if the number of categorical variables was higher than 2. Ordinal variables were analyzed with the Mann–Whitney U and Kruskal–Wallis tests. The correlation between two variables was measured with the Spearman rank correlation test. The Fisher exact test was used to analyze the 2 × 2 contingency tables. A p value less than 0.05 was considered significant for all statistical tests.

Results

EQ-5D-3L dimensions

The patient cohort reported mobility problems as often as the general population sample (p = 0.51) (Fig. 2). Within the patient cohort, patients who were overweight at the time of the surgery reported more problems (p = 0.004) (Table 2). Additionally, an increasing number of lifetime lumbar spine surgeries (p = 0.002) and longer duration of symptoms (p = 0.013) prior to the index surgery correlated with more frequent problems in mobility. Other baseline characteristics, gender (p = 0.71), age (p = 0.61), and smoking (p = 0.64), were not significantly associated with mobility.

Fig. 2
figure 2

Patient cohort and matched general population cohort responses to the EQ-5D questionnaire. The patient cohort included 316 patients while the general population sample size included 620 age and gender–matched respondents. In the questionnaire, the degree of problems experienced by the respondent on each aspect of health is reported on a 3-level scale (no problems, moderate problems, severe problems). The difference between the groups was analyzed with the Mann–Whitney U test. *p < 0.05

Table 2 Patient responses to the EQ-5D-3L questionnaire by baseline characteristics of subgroups

Similarly, there was no overall difference in the self-care dimension of EQ-5D between the patient cohort and general population sample (p = 0.41, Fig. 2), but patients with a higher number of lumbar surgeries (p = 0.002) and longer duration of symptoms experienced more problems (p = 0.037). Age (p = 0.69), gender (p = 0.25), BMI (p = 0.11), and smoking at the time of the surgery (p = 0.51) did not have an impact on self-care dimension scores (Table 2).

On the other hand, in usual activities, the patient cohort had a trend towards more problems than the general population sample (p = 0.074, Fig. 2). Similarly again, a higher number of lumbar surgeries (p = 0.026 and longer duration of symptoms (p = 0.017) were correlated with more frequent problems. Furthermore, females reported more problems (p = 0.006), while age (p = 0.63), smoking (p = 0.61), and BMI (p = 0.24) at baseline had no effect on the outcome (Table 2).

In the pain and discomfort dimension, fewer patients reported problems compared to the general population sample participants (p = 0.012, Fig. 2). An increasing number of lumbar spine surgeries correlated with an inferior outcome also in this dimension (p = 0.030), as well as a longer duration of symptoms prior to surgery (p = 0.010). Age (p = 0.98), BMI (p = 0.19), and smoking (p = 0.17) at baseline did not affect the results, but women (p = 0.017) reported more problems than men (Table 2).

In the depression and anxiety dimension, the patient cohort reported as much problems as the general population sample (p = 0.23, Fig. 2). All baseline characteristics were insignificant as risk factors in this dimension (Table 2).

EQ-index

The mean EQ-index score in the patient cohort was 0.86, while it was 0.84 in the general population sample (p = 0.054) (Table 3). The patient cohort had statistically significantly slightly higher scores in the age group 50–59 years (0.86 vs. 0.82, p = 0.021), but there were no significant differences in the other age groups (Fig. 3).

Table 3 EQ-index scores
Fig. 3
figure 3

EQ-index scores in different age groups in the patient cohort and in the general population sample. Blue triangles show the mean score for the patient cohort (n = 310) and red dots that of the general population sample (n = 620). Error bars display the 95% confidence interval for the values. Mean scores in the patient cohort and in the general population sample in age groups were as follows: 30–49 (0.85 vs. 0.87, p = 0.39), 50–59 (0.86 vs. 0.82, p = 0.021), and 60–69 (0.88 vs. 0.83, p = 0.086). Student’s t test was conducted to calculate the p value. *p < 0.05

A higher number of lifetime lumbar surgeries correlated with lower EQ-index score (p = 0.049) (Table 3). Patients without further surgery reported a mean score of 0.88; for those with one additional surgery, it was 0.83; and the mean score was 0.80 for those with two or more further surgeries.

Furthermore, prolongation of symptoms prior to the index surgery was correlated with progressively worsening EQ-index score (p = 0.013). The mean scores for different symptom duration groups were as follows: under 6 months, 0.88; 6 to 12 months, 0.85; and over 12 months, 0.81. Additionally, males reported higher scores than females (0.88 vs. 0.84, p = 0.021) while the other baseline characteristics, age (p = 0.64), smoking (p = 0.18), and BMI (p = 0.33), had no effect on the scores.

EQ-VAS

EQ-VAS scores were available for the patient cohort but not in the general population sample. The mean EQ-VAS score for patients was 81.6 (Table 4). Patients who had symptoms for less than 6 months and 6 to 12 months reported similar scores (82.3 vs. 83.5), while patients suffering from symptoms for over 12 months reported a significantly lower score of 77.8 compared to patients with shorter duration of symptoms (p = 0.048). A higher number of lumbar surgeries correlated with worse self-perceived health; the mean EQ-VAS score for patients without further surgery was 82.7, it was 80.3 for patients with one additional surgery, and for patients with at least two further surgeries, it was 76.2 (p = 0.070). Other baseline characteristics, age (p = 0.25), gender (p = 0.068), BMI (p = 0.30), and smoking (p = 0.39), were not significantly associated with EQ-VAS.

Table 4 Patient responses to EQ-VAS

Discussion

The results on long-term health-related quality of life after surgery for lumbar disc herniation have been sparse, and the few reports have indicated that the patients have impaired health-related quality of life even years after surgery. To our best knowledge, this study is the first one to present results indicating that long-term health-related quality of life after surgery for lumbar disc herniation is similar compared to the general population.

This is demonstrated by the patients reporting EQ-index scores comparable to an age and gender–matched general population sample. The mean EQ-index score in the patient cohort was even statistically significantly slightly better that that in the age and gender–matched general population sample (0.86 vs. 0.84, p = 0.02 for difference). However, as the minimal clinically important difference (MCID) for EQ-5D index score has been proposed to be 0.07 [25], there was no clinically significant difference between these groups. Nevertheless, these results demonstrate that health-related quality of life of the patients is not worse compared to the general population in long-term follow-up.

Previous studies have shown that patients experience quality of life inferior to the normative values 1 year [8, 12], 2 years [6, 10], and even 7 years after surgery for lumbar disc herniation [21]. This study suggests that the health-related quality of life eventually reaches normative values. However, in the earlier studies, the average age of the patients was in early 40s compared to early 30s in our study. Furthermore, some earlier studies [12, 21] included patients who underwent a macrodiscectomy while our study cohort only included patients who underwent a microdiscectomy. In the study by Fisher et al. [8], the patient cohort had an average duration of symptoms of 51 weeks before surgery which may affect the results as long duration of symptoms has a negative influence on the long-term outcome [20]. These differences may limit the comparability of our results with the previous studies.

We observed a significant correlation between an increasing number of further lumbar surgeries and deteriorating quality of life and a clinically significant difference in quality of life when comparing patients without any further surgery and patients with at least two additionally surgeries (0.87 vs. 0.80). This is in accordance with earlier studies which have reported lower EQ-index scores [9, 14] and lower SF-36 bodily pain and physical function scores [13] for patients who underwent further lumbar spine surgery during the follow-up period. Additionally, Lubelski et al. [14] found that patients who underwent third surgery after revision discectomy had drastically impaired health-related quality of life. Nonetheless, EQ-index of 0.80 can be regarded as a good outcome as the general population value was 0.84.

Furthermore, patients whose symptoms prior to the index surgery exceeded 12 months reported worse quality of life than the other patients even nearly two decades after the index surgery. Their mean EQ-index score was 0.80, while it was 0.87 for patients whose symptom duration was less than 6 months and 0.83 for patients with the symptom duration between 6 and 12 months. The correlation between lengthening of symptoms and lowering EQ-index was statistically significant, but the difference is clinically significant only between the patients at the ends of the spectrum. This concurs with an earlier study which reported that, 1 year after surgery, patients who had symptoms for over 6 months reported worse EQ-index scores than patients with a shorter symptom duration [12]. Regardless of the chosen treatment, prolonged symptoms lead to inferior outcome, as in a 4-year observational follow-up study the symptom duration exceeding 6 months was associated with inferior SF-36 bodily pain and physical component scores with both treatment options [17]. However, a randomized trial reported that for patients who had symptoms for over 4 months, surgery led to better recovery from leg pain in 6-month follow-up [3].

The strength of the present study was the reliable conduction of the long-term follow-up which was achieved due to two factors. First, the HUS hospital district serves a population base of over 1 million people and its 23 hospitals have a common system of electronic medical records. This provided a reliable collection of patient history data from the medical records. Second, the Finnish personal register system allowed us to contact all patients still living in Finland even decades after the index surgery. This gave all patients an equal opportunity to participate in the study, minimizing selection bias.

The weakness of the study is that 60% of patients responded to the questionnaires 19 years after the index surgery. Wang et al. [26] reported that the response rate decays with time and 61% is the average rate after 5 to 10 years for cohort studies; hence, this study had reasonable response rate compared to previous studies. Nevertheless, the fairly high number of non-responders could affect the generalizability of our results. We did observe that the responding patients were older and more likely female and had lower BMI. Similar observations have been previously reported in Finnish population studies [23]. We did not account for education level in our study which could affect the results as people with higher education level respond more frequently [23] and report higher quality of life [19]. However, similar responding biases influenced the collection of general population sample (Health 2011), and it is therefore likely that our results provide a reliable picture of the differences in health-related quality of life between the patient cohort and the general population. Furthermore, two previous studies have reported that the short-term outcome after spine surgery is similar for responders and non-responders [6, 22].

Conclusions

In conclusion, the results show that patients who underwent surgery for lumbar disc herniation as young adults reported health-related quality of life comparable to the general population nearly two decades after surgery. However, patients who had undergone repeated lumbar spine surgery experienced significantly inferior health-related quality of life compared to the other patients. This finding highlights the need to discover ways to prevent the cumulation of lumbar surgeries. Possible ways could include identifying the patients who have high risk for further surgery, correct patient selection for surgery, optimal surgical treatment, and postsurgical rehabilitation.