Introduction

Male voiding dysfunction is common in the general population, particularly in the elderly. It is often, although not necessarily rightly so, attributed to the presence of benign prostatic hyperplasia (BPH). The extent of voiding dysfunction is typically assessed by quantifying signs such as a reduced peak urinary flow (Q max) or symptoms as measured in the International Prostate Symptom Score (IPSS). Based upon factor analysis [1], the IPSS is often subdivided into a voiding and a storage sub-score. Of note, the IPSS is somewhat biased toward voiding symptoms, as four of the seven question belong to the voiding sub-score. While a pathophysiological link between BPH and the associated obstruction and lower urinary tract symptoms (LUTS) implies mainly the presence of voiding/obstructive symptoms, storage/irritative symptoms are also common in such men [2].

Male LUTS are often associated with a reduced disease-specific quality of life (QoL). This can be assessed by a variety of questionnaires among which the QoL question of the IPSS is used most often [3] although more complex instruments such as the SF-36 questionnaire may be more informative [4]. Such research shows that a greater IPSS statistically is strongly associated with a reduced QoL prior to treatment [5, 6]. Many studies imply that storage symptoms may contribute to this reduced QoL to a greater extent than voiding symptoms across both genders [4, 710], even among men considered to have major obstruction as they are waiting for BPH-related surgery [11] but limited other studies in men report a stronger association of voiding than storage symptoms with a reduced QoL [12]. Whether a reduced Q max and voiding symptoms as assessed in the IPSS and its voiding sub-score can be considered equivalent in this regard has not been evaluated thoroughly.

Thus, the existing literature clearly demonstrates an association of both voiding and storage symptoms with QoL at baseline, with the latter exhibiting somewhat stronger correlations. However, a cause–effect relationship is difficult to evaluate based upon statistical associations at baseline alone. Stronger scientific arguments could come from analysis of similar relationships after treatment but only very limited data on this are available [13]. Even more importantly, evidence for a cause–effect relationship could come from exploring associations between treatment-associated changes of voiding and storage sub-scores on the one and changes of QoL on the other hand. While various treatment forms have consistently shown QoL improvement in LUTS/BPH patients [14], the relationship between improvements of voiding and storage sub-scores with those of QoL, to the best of our knowledge, has not been studied before. Therefore, the present study was done to primarily explore such relationships after treatment in a large sample of men with LUTS/BPH.

Patients and methods

This is a retrospective analysis of a single-center database generated during routine care at the Dept. of Urology of the Radboud University Nijmegen Medical Centre (Nijmegen, Netherlands), which includes patients diagnosed with LUTS/BPH between 1992 and 2002. The diagnosis and treatment decisions were not based on formalized criteria but on the medical judgment of one urologist (JdlR) based upon digital rectal examination, estimation of prostate volume by means of transrectal ultrasound, uroflowmetry, urinalysis and IPSS including its QoL question.

Our analysis of associations at baseline includes all patients in the database for whom relevant data were available (n = 2,611). Post-treatment analyses are based on patients with at least one assessment of the IPSS 6–12 months after initiation of treatment where at least 150 patients were available for a given form of treatment. This time frame was chosen because it should yield a reasonably stable post-treatment situation. The following interventions yielded sufficient patient numbers for further analysis: watchful waiting (WW, n = 421), α-blockers (n = 297), transurethral microwave thermotherapy (TUMT, n = 356) and transurethral resection of the prostate (TURP, n = 184). This excluded 696 patients because they had received treatments other than the above, and 265, 198, 72 and 122 in the WW, α-blockers, TUMT and TURP groups, respectively, because no post-treatment data were available. α-Blocker treatment consisted of standard doses of alfuzosin, tamsulosin or terazosin. TUMT was administered with the Prostatron machine (EDAP-TMS, France) on an ambulatory basis, and TURP was performed under spinal or general anesthesia. If more than one assessment was done within 6–12 after initiation of treatment, the results closest to 9 months after initiation were used.

A descriptive analysis of the treatment groups at baseline and after 6–12 months of treatment is presented as mean ± SD. From the IPSS a voiding sub-score based upon the symptoms incomplete emptying, intermittency, weak stream, and straining was calculated and a storage sub-score based upon frequency, urgency, and nocturia [1]. As both the voiding and the storage sub-score of the IPSS correlated only poorly with Q max at baseline (r = 0.162 and 0.126, respectively, n = 2611), we have used all three parameters in parallel as explanatory variables to explore their relationships with QoL using multiple regression analysis. Specifically, we have addressed three questions: (1) What is the statistical association of Q max and the voiding and storage sub-score to QoL at baseline and after 6–12 months of treatment (secondary aim)? (2) Do the various treatments differ in their effects on Q max, voiding and storage (tertiary aim)? For this analysis, the possibly confounding factors age, and baseline values of the respective parameter were used as covariates in a multiple regression analysis. Moreover, the α-blocker group was used as the reference treatment because most data from double-blind, randomized studies exist for this treatment form which is known to improve voiding and storage sub-scores to a similar extent [1523]. (3) What is the statistical association of the treatment-associated alterations of Q max, voiding and storage sub-scores to the associated improvement in QoL score (primary aim)? All statistical analyses were performed with the statistical software SPSS for Windows (version 11.5.1, SPSS Inc. Chicago, IL, USA.), and a p < 0.05 was considered as statistically significant.

Results

In multiple regression analysis, baseline Q max and voiding and storage sub-scores of the IPSS were significantly but weakly associated with QoL with the storage sub-score exhibiting the strongest association among the three explanatory variables (Table 1). Thus, all other factors being equal, a Q max difference of 1 ml/s or 1 point in each sub-score statistically explained only a difference of 0.010, 0.074 and 0.120 points in the QoL score.

Table 1 Multiple regression analysis of the relationship between Q max, voiding and storage score on the one and QoL on the other hand

As expected, patients assigned to WW, α-blocker, TUMT and TURP had different baseline characteristics. Lower baseline Q max correlated with increasing invasiveness of treatment chosen, i.e. was highest in the WW and lowest in the TURP group (Table 2). The voiding and storage sub-scores at baseline were lower in the WW than in the other three groups, but no major differences were seen between the latter three.

Table 2 Baseline and post-treatment characteristics according to allocated treatment

Increasing invasiveness of treatment was associated with increasing improvements of Q max and voiding and storage sub-scores (Table 2), i.e. all three parameters improved with an order of effectiveness of WW < α-blocker <TUMT < TURP. In order to explore in more detail the relationship between a form of treatment and the extent of improvement of Q max and voiding and storage sub-scores, we have applied multiple regression analysis using age and baseline value of the respective parameter as co-explanatory variables (Table 3). This analysis demonstrated small but significant adverse effects of age on the improvement of Q max and the storage sub-score, whereas improvements of the voiding sub-score were not significantly affected. As expected, respective baseline values had a strong effect on the improvement of all three parameters, i.e. each ml/s or point at baseline statistically explained approximately 0.5 ml/s or 0.6 points of improvement. Compared to α-blocker treatment as the reference group, WW had rather similar effects, i.e. they were numerically slightly weaker than those of the α-blocker with the differences reaching statistical significance only for the voiding sub-score. In contrast, TUMT and even more so TURP yielded greater benefit than α-blocker treatment. Interestingly, these differences were more pronounced for Q max and the voiding than for the storage sub-score.

Table 3 Multiple regression analysis of the relationship between form of treatment and other factors and improvement of the Q max and voiding and storage sub-scores of the IPSS

Having characterized the effects of the various treatments on Q max and the IPSS sub-scores, we have applied multiple regression analysis to explore their associations with QoL after treatment (Table 1). This confirmed the associations between all three parameters and QoL already observed at baseline. Finally, and most importantly, we have explored our primary research question, i.e. how treatment-associated alterations of Q max and voiding and storage sub-scores related to alterations of QoL (Table 1). Improvements of all three parameters were significantly but weakly associated with QoL improvements. For example, each treatment-induced change of Q max by 1 ml/s or the voiding or storage sub-score by 1 point was associated with changes of QoL by 0.033, 0.109 and 0.119 points, respectively.

Discussion

The present analysis was primarily designed to explore whether the hypothesis of LUTS as a cause of reduced QoL and specifically the relative roles of voiding and storage LUTS can be further supported by post-treatment data.

Critique of methods

For the selection of post-treatment data we have limited ourselves to patients with at least one assessment at 6–12 months after initiation of treatment to reflect a somewhat stable clinical situation. Moreover, we have limited ourselves to patients which had received one of the treatment forms for which at least 150 patients were available to yield meaningful group sizes. Patients with and without follow-up data did not differ in a meaningful way in their baseline values, and all treatment groups had comparable attrition rates (data not shown). Treatment allocations had been based upon the clinical judgment of one urologist, yielding the expected heterogeneity in pre-treatment symptoms between groups. Where applicable this has been taken account as co-variables in our analyses, as randomized comparisons of e.g. WW and TURP would be neither feasible nor ethical [24].

Our QoL assessments are based upon the QoL question of the IPSS. While more complex instruments such as the SF-36 may have benefits over single-item questionnaires, the QoL question of the IPSS has been used in the majority of previous studies in the field and consistently shown good correlations with more complex instruments and hence is considered to be a valid QoL assessment with major utility due to its simplicity [14].

Multiple biological mechanisms including degeneration of the bladder upon long-standing obstruction may contribute as causes of LUTS and hence QoL, but the present database does not allow this type of analysis.

In line with previous studies [25, 26] our analysis showed only limited correlation between both voiding or storage sub-scores with Q max, and hence Q max has been included as a separate item into our analyses. All of these factors should be taken into account in the interpretation of our data.

Baseline data

In line with many previous studies [4, 711], our baseline data confirm that the storage sub-score exhibits a somewhat stronger association with QoL than the voiding sub-score and each of them a stronger association than Q max. The extent of the difference in strength of association between the voiding and storage sub-scores may even be under-estimated because the IPSS is biased toward voiding symptoms [1] and hence one point of the storage sub-score is somewhat more difficult to achieve than one point of the voiding sub-score. This validates our database and findings for the subsequent analysis of post-treatment data.

Treatment effects

As expected, the various treatment forms varied considerably in their efficacy with the more invasive treatments (TUMT, TURP) causing greater improvement than α-blockers despite similar baseline symptoms. Of note, the superiority of TUMT and TURP related not only to the voiding but also, albeit to a slightly smaller extent, to the storage symptoms. Possible roles of more recent medical approaches to LUTS, particularly storage LUTS, such as muscarinic receptor antagonists [27] could not be evaluated as part of our studies as none of our patients had received such medication.

A statistical association of two parameters at baseline provides only very limited evidence for a cause–effect relationship. Therefore, the main aim of the present analysis was to explore the statistical associations of voiding and storage symptoms with QoL after treatment, and even more importantly how well improvements in either sub-score correlate to those of QoL. Very limited earlier findings have indicated that storage symptoms correlate somewhat better with QoL than voiding symptoms, with Q max yielding little association [13]. Our data, based upon multiple treatment forms with different efficacy, confirm those findings. Our data importantly extend those earlier studies by demonstrating that reductions of both voiding and storage sub-scores correlate with QoL improvements, with the latter yielding somewhat tighter associations. As with the baseline data, the extent of the difference in strength of association between the voiding and storage sub-scores may even be under-estimated because the IPSS is biased toward voiding symptoms [1]. In contrast, Q max shows only little association with QoL after treatment.

Conclusions

While many previous studies have demonstrated inverse correlations between voiding and storage symptoms and QoL at baseline, our study for the first time shows that improvements of symptoms correlate with improvements of QoL. This considerably strengthens the logical assumption that symptoms and QoL are indeed related in a cause–effect relationship. Our study also shows that both voiding and storage symptoms correlate with QoL with the latter exhibiting slightly tighter correlations, and that such association can be found not at baseline but also after treatment. The relevance of this conclusion is underscored by its apparent applicability to various medical and surgical treatment forms.