Introduction

The clinical picture of idiopathic normal pressure hydrocephalus (iNPH) comprises disturbed gait and balance, cognitive impairment and urinary incontinence, treatable by CSF shunting [1]. In modern studies, around 80% of treated patients experience improved symptoms [2]. In spite of extensive research into a wide range of radiological [3, 4], biochemical [5] and hydrodynamic [6,7,8] markers, to date no specific test can adequately diagnose iNPH or predict which patients will benefit from shunt surgery. Reliable tests for the prediction of outcomes are warranted.

Concerning symptoms and signs, several studies have suggested that either the presence of the complete triad [1] or a clinical picture with a predominant gait disturbance [9], are prognostically favorable. Agerskov et al. showed that the sign of shuffling gait was predictive of improvement post-shunting [1]. Earlier studies have been small or limited with regard to studied symptomatology. The POiNT (Prediction of Outcome in iNPH) study was designed to evaluate the predictive value of clinical, CSF and MRI markers on outcome after shunt surgery, and this paper aims to explore the predictive value of a broad panel of symptoms and signs typical for iNPH using an outcome measure specifically developed for iNPH patients.

Methods

Study design and inclusion

The study design was prospective, with consecutive inclusion of patients from two Swedish centers: Sahlgrenska University Hospital in Gothenburg and Östersund Hospital. The inclusion period in Gothenburg was 1 January 2014 to 30 June 2016, and in Östersund 2 October 2014 to 31 December 2016. All 272 patients (Gothenburg n = 258 and Östersund n = 14) investigated for suspected iNPH in these two centers were assessed for inclusion. A total of 143 patients (Gothenburg, n = 130 and Östersund, n = 13) were included in the study. Inclusion criterion was a diagnosis of iNPH [5] treated with shunt surgery. Patients who, for different reasons, did not undergo postoperative clinical assessment with the iNPH scale were excluded. A flow-chart is presented in Fig. 1. Demographic and baseline data for the 143 patients are shown in Table 1.

Fig. 1
figure 1

Flow chart of inclusion in the study. iNPH idiopathic normal pressure hydrocephalus; CSF cerebrospinal fluid; MRI magnetic resonance imaging; SDH subdural hematoma; AD Alzheimer’s disease; ALS amyotrophic lateral sclerosis; MS multiple sclerosis

Table 1 Baseline demographic and clinical data, with comparisons between improved and non-improved patients

Clinical examinations

All patients underwent clinical examinations by a neurologist, a neuropsychologist, and a physiotherapist, MRI of the brain with standardized sequences, including perfusion protocols, and lumbar puncture for assessment of ICP and routine analyses pre- and in median five months, postoperatively. Ventricular CSF was also sampled perioperatively, and from the Rickham reservoir, postoperatively. Results from the radiological and CSF biochemical investigations will be published separately. Decisions for surgery were made at multidisciplinary conferences. Whenever the diagnosis was considered uncertain, mostly due to indistinct or atypical symptomatology, the CSF tap test or a lumbar infusion test was used to support the diagnosis.

At the baseline examination, the patients’ body mass index (BMI) and any previously diagnosed comorbidity of hypertension, diabetes and cardiovascular disease were registered.

The clinical ratings included the iNPH scale introduced by Hellström [10], where the total score is the mean of four domain scores: gait, balance, cognitive function, and continence. The presence of impaired function within each domain was defined as a gait score ≤ 90, a balance score ≤ 80, a neuropsychology score ≤ 80, and a continence score ≤ 80. Patients were defined as “Improved” if postoperatively their iNPH scale had improved by at least five points, otherwise they were considered “Non-improved” [10]. The threshold of five points for clinically relevant improvement has been used in previous studies [3, 11,12,13,14], originally based on clinical experience. The iNPH scale score is derived from a range of standardized quantitative tests and qualitative scales, measured by three different examiners of the three different professions (physician, physiotherapist, neuropsychologist). As the iNPH scale was used as the outcome measure in this study, the tests and scales that are comprised in that scale were not included in the analyses of the clinical variables’ predictive value.

Functional disability was rated with the modified Rankin Scale (mRS) [15] (0–5).

In addition to the ratings and assessments that constitute the iNPH scale, several other observations were included in the pre- and postoperative examinations of the patients. Gait and motor function were assessed by the Timed-up-and-go test [16], the time and number of steps to walk three meters backwards, the number of times the patients were able to stand up from a chair during the 30 s, [17] and the number of steps needed to turn 180°. Balance was measured by the number of seconds achieved in Romberg’s test (maximum 60 s) along with the direction of postural imbalance. Neuropsychological functions were assessed by the Mini-Mental State Examination (MMSE) [18], Identical forms test (test of perceptual speed and accuracy, score 0–60), and Bingley memory test: a task where the patients are asked to memorize pictures of 12 different items—the test is performed twice and the score represents the mean number of items retained (score 0–12) [19]. The number of hours of sleep per 24 h was registered, and, finally, the presence or absence of the following clinical neurological signs were recorded: broad-based or shuffling gait, freezing of gait, paratonic rigidity in the lower extremities, cerebellar ataxia, and focal neurological signs.

All clinical variables were assessed pre- and postoperatively. All examinations were performed within the frame of routine care. The majority of the patients were assessed by the same specialist neuropsychologist and physiotherapist, and consultant or resident neurologists, in the latter case supervised by consultants.

At the baseline evaluation patients and/or their next-of-kin were interviewed about the duration of symptoms perceived as iNPH symptoms. They were also asked which was the first presenting symptom (impaired gait/balance/memory/continence, several simultaneously, headache, or other), and about the dominating subjective complaint at present (impaired gait/balance/memory/continence, headache, or other).

Treatment

The insertion of ventriculoperitoneal shunts was performed in 139 patients, and ventriculo-atrial shunts in four patients. The choice of operation technique was based on the surgeons’ assessment of individual patients’ prerequisites. PS Medical Strata, Medtronic (Goleta, California, USA) valves with an antisiphon device (ASD) and a standard setting of 1.5 were used in all cases.

All shunts were deemed working at the postoperative assessment. Shunt function was assessed by evaluation of clinical symptoms and CT or MRI. If doubts regarding shunt patency remained following CT or MRI, a radionuclide shuntography [20] or a lumbar infusion test was performed.

Complications

Before the postoperative follow-up, 12 patients (8.4%) had complications following shunt surgery. Eight patients (5.6%) had major complications requiring new surgery: six had distal mechanical shunt failures (misplacement or obstruction) corrected by surgical revisions, one had a subdural hematoma requiring surgical evacuation and temporary ligation of the shunt, and one had a CNS infection requiring extraction and later reinsertion of a shunt. Four patients (2.8%) had minor complications: two had hygromas requiring upregulation of their shunt valve opening pressure, one had both septicemia related to a urinary catheter placement and gastrointestinal bleeding, prompting termination of antithrombotic treatment, and one had a postoperative transient fever.

Standard protocol approvals, registrations, and patients consents

The study was performed in accordance with the Declaration of Helsinki and was approved by the Regional Ethical Review Board in Gothenburg (Dnr 328-14, T439-15). All patients or their next of kin gave written consent to inclusion in the study.

Statistics

For comparisons of ordinal or continuous variables between improved and non-improved patients, the Mann–Whitney U test was used. For comparisons between pre- and postoperative data, the Wilcoxon signed-rank test was used for ordered categorical and not normally distributed continuous variables, and the student’s t-test was used for normally distributed continuous variables. Proportions were compared by use of Fisher’s exact test or the Chi-square test.

To assess the predictive value of baseline clinical variables, ROC analyses were performed and odds ratios with 95% confidence intervals were calculated by univariable logistic regression analyses. The dependent variable was an improvement by ≥ 5 points on the iNPH scale (yes/no). Variables with < 5 observations were not tested. Further, correlations between the baseline variables and the delta-iNPH-scale score (i.e., the difference between pre- and postoperative scores) were tested by Pearson’s correlation analysis, and univariable linear regression analyses were performed for variables with a significant correlation with the delta-iNPH-scale score. All significance tests were two-tailed, and alpha was set to 0.05, without corrections for multiple testing. For inclusion in multivariable logistic regression analysis, the significance level was set at p < 0.1, but no variable reached this significance level. Analyses were performed with IBM SPSS version 29.

Data availability

Anonymized data not published within this article will be made available upon request from any qualified investigator.

Results

At the postoperative examination, in the median five (IQR 4–7) months after surgery, 104 (73%) of patients were improved by at least five points on the iNPH scale. There was no difference in the baseline demographic or clinical variables between the 104 improved and the 39 non-improved patients (Table 1).

Regarding the anamnestic information, the reported symptom duration did not differ between the groups (Table 2). Of the 39 non-improved patients, 26% (n = 10) had reported memory impairment as the dominating subjective complaint at the baseline clinical evaluation, compared to 7.8% (n = 8) of the improved patients (p = 0.009). The percentages of patients reporting gait or balance difficulties as dominating subjective complaint did not differ significantly in improved vs non-improved patients, neither when analyzed separately nor merged (total of 69 + 46 = 115 patients, p = 0.154). A report of headache as a presenting symptom was seen in 3 patients, all subsequently non-improved by shunt surgery; apart from that, no other debut symptom was over-represented in improved or non-improved patients (Table 2).

Table 2 Anamnestic data at baseline clinical evaluation, with comparisons between improved and non-improved patients

There were postoperative significant improvements in each of the domains of the iNPH scale as well as in the total iNPH scale score, with a median increment from 53 to 69 points (p < 0.001) (Table 3). The modified Rankin Scale improved from a median 3 to 2 (IQR 2–3 both pre- and postoperatively, p < 0.001), and 36% (n = 48) had improved by at least one scale score. Improvement was seen in all registered symptoms and signs except for MMSE, Bingley’s memory test, freezing of gait, cerebellar dystaxia, and focal neurological signs (Table 3). The number of hours of sleep per 24 h diminished from a mean of 9.2 to 8 (p < 0.001). Results in the Identical forms test were improved. Half as many patients had retropulsion or shuffling gait after surgery compared to before. The median number of seconds managed in Romberg’s test increased from 20 to 60 (p < 0.001) and the median number of steps to turn 180° was improved from four to three. Further, the proportions of patients with paratonic rigidity in the legs and with broad-based gait were smaller, postoperatively. The number of seconds and steps needed in the TUG test as well as in the 3 m backwards walk test decreased (< 0.001 for all). The median number of times patients were able to stand up from a seated position in 30 s increased from six to eight (p < 0.001) (Table 3).

Table 3 Results from pre- and postoperative examinations in 143 iNPH patients

None of the baseline demographic or clinical recordings were found to predict beneficial post-surgical outcome in univariable logistic regression analyses with a significance level of < 0.10 (Table 4). Of the anamnestic information, a report of dominant subjective complaint of memory problems at baseline was significantly predictive of non-improvement (Table 4), remaining significant when adjusted for age and total iNPH scale score; adjusted OR 0.26, 95% CI 0.09–0.68, p = 0.007. The ROC analyses yielded AUC values close to 0.5 for all variables, meaning that the studied clinical variables’ predictive value was low (Table 4).

Table 4 The demographic, clinical and anamnestic factors’ predictive value for postoperative improvement by at least five points on the iNPH scale, univariable logistic regression analyses and ROC analyses

A few significant associations were noticed but they were all weak. Older patients improved less than younger in univariable linear regression analysis: R = − 0.18, p = 0.031, β = 0.3. The number of affected domains (1–4) did correlate with improvement: R = 0.19, p = 0.021, β = 2.8. Finally, the presence of impairment in all four symptom domains correlated with improvement: R = 0.21, p = 0.014, β = 4.8. None of the other baseline variables correlated significantly with the change in total iNPH scale score.

There was no difference in improvement rates between patients with or without any complication before the postoperative investigation (p = 1.0). Neither did improvement rates differ for patients with minor (p = 0.30), nor major (p = 0.45) complications compared to those without.

Discussion

Here, we made a substantial effort to identify clinical variables of predictive value. Our study included well over 100 patients, it was prospective and consecutive, and involved the registration of around 30 different symptoms and signs, comorbidities and BMI. To the best of our knowledge, this is the largest study of its kind so far addressing this question.

It has been a major challenge for previous research to predict outcomes after shunt treatment in NPH and to identify the approximate 20% of patients not improved by shunt surgery [2, 11]. Available CSF dynamic or drainage tests all show low negative predictive values [7, 8] and the use of CSF and MRI biomarkers have, so far, not improved the identification of non-responders [3, 4, 21,22,23], which is why better predictive markers are highly warranted. Since the diagnosis of iNPH is based on recognition of the typical clinical syndrome in combination with radiological markers, the use of symptoms and signs for prediction is both practical and appealing.

In our sample of diagnosed and operated iNPH patients, none of the preoperative clinical tests could aid in telling apart patients who later benefitted from shunt surgery from those who did not: comparisons of clinical baseline data between improved and non-improved patients revealed no preoperative differences. The odds ratios for improvement were non-significant for all those baseline variables. Only the anamnestic information of memory impairment being the dominating subjective complaint, was found to have a predictive value, with a lower improvement rate of 44% in that group of 18 patients, compared to 73% in the whole cohort. These 18 patients do not stand out from the remaining cohort of 125 patients with regard to clinical baseline variables: their age was similar, p = 0.58, and they had a median MMSE of 25; IQR 20–27, without significant difference from the remaining cohort, p = 0.21. Neither was any significant difference seen in their preoperative total iNPH scale score, p = 0.54, nor in the neuropsychology or gait domain score of the iNPH scale, p = 0.11 and p = 0.31. Impairment was seen in all four symptom domains of the iNPH scale at baseline in ten of those 18 patients, three domains in four, two domains in three patients, and in one impairment was seen only in the neuropsychology domain.

Three of the patients (2%) reported headache as the presenting symptom—none of them later improved after shunt surgery. A possible interpretation of this finding would be that headache is not a typical presenting symptom of shunt-responsive iNPH. But, the reliability of the finding is low: interviewing patients about what was the presenting symptom generally yields uncertain answers.

Weak, yet statistically significant, correlations were found between the degree of improvement and age and with the number of affected symptom domains. The postoperative change in the iNPH scale score would be 0.3 points lower per additional year of age in a univariable linear regression model. Higher age as a negative predictive factor has been reported previously [24, 25], whereas others did not find less beneficial results in older patients [1, 26, 27]. Considering the high rates of improvement also in older patients, and the weak correlation with age reported here, our opinion is that age should not be a determinant factor in the decision making of shunting a patient.

The finding that the more complete the clinical syndrome the better the outcome, as well as the positive prognostic value of the presence of the full tetrad, are interesting results that corroborate a previous study from our group [1]. These findings are reasonable given the potential reversibility of all symptoms in iNPH—the more symptoms that can be reversed, the better the improvement—and also gives support to the notion that the mechanism generating the iNPH symptom in some way is related to reversibility. The results also add value in the clinical setting: patients with more paramount symptoms should be considered eligible for surgery. However, all these significant associations were weak, adding no substantial support to the identification of possible responders or non-responders in the clinical routine. None of the other clinical factors could predict how much a patient would improve. Neither was the presence of diabetes, cardiovascular disease nor the patients’ BMI associated with the outcome. Although an increased burden of cerebrovascular disease and vascular risk factors are described in patients with iNPH [28,29,30,31,32], the impact on outcome after shunt surgery seems to be limited [11, 33,34,35,36,37]. In a previous study of 979 iNPH patients from the Swedish Hydrocephalus Quality Registry, we found neither cardiovascular nor cerebrovascular disease to have a negative influence on improvement within five years after shunt surgery [38]. In line with our findings, Klinge et al. recently reported no association between comorbidity burden and outcome in 208 iNPH patients after shunt surgery [39].

Notably, in the present study, symptom duration was not associated with outcome, which might imply that iNPH does not progress into an irreversible state. On the other hand, disease duration is difficult to ascertain, which could explain why the well-documented negative effect of time on prognosis was not identified in this study [40].

As none of our findings are really of any help in identifying possible responders or non-responders, clinical measures should probably be used mainly for diagnostics and not for prediction of outcome. Another reasonable conclusion of this work is that patients lacking the full panel of typical iNPH symptoms should also be considered for shunt surgery as should patients with a heavy symptom burden, since the symptom severity did not predict whether the patients improved or not.

One possible reason for the lack of strong predictive associations is that factors not assessed here are related to reversibility and improvement. We only included data on the comorbidity of hypertension, diabetes, and cardiovascular disease, whereas e.g., Alzheimer’s disease, parkinsonian disorders or orthopedic conditions such as hip, knee or spinal arthrosis—all with potential influence on outcome—were not included. However, patients were selected for surgery based on iNPH typical symptoms and the absence of signs of comorbidities thought to explain symptomatology or affect outcome in a significant way. As accounted for in Fig. 1, we also chose to exclude patients after the postoperative follow-up if they had progressed substantially in comorbid conditions fogging the assessment of whether the iNPH symptoms had improved or not. However, the remaining included patients are also older people with a range of comorbidities that affect the performance level in the different outcome measures—an inherent confounder in all studies on iNPH patients. Future studies should include a more detailed assessment of the prognostic value of a broader range of comorbidities [41].

Further, no detailed assessment of radiological changes or biochemical markers was included here: all patients fulfilled radiological criteria for iNPH and had no significant evidence of other neurodegenerative disorders in the CSF. We plan to explore the predictive value of MR perfusion and diffusion, as well as of CSF biomarkers, in the same patient sample in future publications.

Another possible factor related to the lack of improvement could be a suboptimal shunt function. We used Medtronic Strata programmable valves with an ASD and a standard setting of 1.5. In our clinical routine, if a patient is not improved, shunt function is tested and if the shunt is found working, reduction of the opening pressure is considered, weighing patients’ individual risk vs benefit. We cannot exclude that patients in the unimproved group would have improved if the opening pressure had been further reduced. However, there is good evidence that lowering the opening pressure below 12 cm H2O does not improve symptoms and signs but heightens the risk of complications [42,43,44]. The pressure setting in this study is substantially lower than 12 cm H2O in the supine position, which is why we assume that it is probably not the pressure setting that explains why some are non-responders. Volumetric imaging studies have shown positive correlations between postoperative improvement and reduction of ventricular volume [45], pointing at shunt CSF flow, shunt-induced ICP reduction, and brain elasticity or stiffness as factors that can be involved in reversibility and outcome.

The response to shunt surgery in this patient sample needs to be commented: 104 patients improved at least 5 points on the iNPH scale and another 27 showed a postoperative change of 0–4 points, i.e. 92% (131/142) were either improved or showed no further decline in their symptoms according to the chosen cutoff. In light of recent reports on AD treatment aimed at slowing down disease progression with e.g. lecanemab, where researchers describe findings of a 64% probability of active treatment being better than placebo [46], one is inclined to consider shunt surgery for iNPH a remarkable treatment.

Symptoms and signs were measured in detail pre- and postoperatively. All the gait and balance measurements improved, as did one of the applied cognitive tests not included in the iNPH scale, the modified Rankin scale, and the number of hours needed to sleep. The proportions of patients with abnormal neurological signs were smaller postoperatively, e.g., broad-based or shuffling gait, freezing of gait, paratonic rigidity in the lower extremities and retropulsion in Romberg’s test. This supports the view that the profile of symptoms and signs in iNPH is unique and clinically different from other degenerative disorders such as Alzheimer’s disease, Parkinson’s disease, and atypical parkinsonian disorders [1]. The high degree of reversibility and the uniformity of the improvement pattern render these clinical measures important when considering a diagnosis of iNPH.

We believe that the merits of this study are considerable and that it is important to conceptualize and incorporate the “negative” result into the understanding of NPH pathophysiology. We have not identified a reversibility factor in the constellation of symptoms and signs, nor in its severity. In some patients, symptoms and signs for some reason become irreversible. Contrary to what could be expected, this irreversibility is not related to the duration of the disease, nor to the severity of its symptoms. Neither is there any support for a biochemical functional limit; CSF biomarker studies have not been able to identify specific levels of impaired or altered metabolism capable of telling responders and non-responders apart, or of predicting the magnitude of improvement [5, 47, 48]. What is more confusing is the fact that improvement in iNPH is associated with a normalization of the pathological CSF biomarker profile; an increase in amyloid precursor proteins [21]. There is a bulk of evidence in support of a metabolic disturbance in predominantly periventricular regions of the brain, including pons, mesencephalon, thalamus and basal ganglia, and frontal periventricular white matter in iNPH related to the CSF dynamic disturbance [21, 47]. It is reasonable to assume that functional disturbances on a cellular level in these regions, including neuronal dysfunction, determine reversibility. These changes present clinically with the typical iNPH phenotype with characteristic symptoms and signs. Over time, the functional disturbance may increase, causing a gradual clinical deterioration, still potentially reversible. Around 20% of iNPH patients do not respond to shunt treatment [2], indicating that, for some reason or at some point, the disturbance becomes less reversible and is no longer influenced by normalization of CSF dynamics. This is probably what happens during the natural course of disease if left untreated, with developing irreversible damage to the brain [40]. The search for markers of this process needs to be pursued. Several studies have pointed at the brain stem region as a generator of typical symptoms in iNPH [49,50,51,52]. One of these studies investigated wakefulness in relation to cerebral blood flow in periventricular areas including the mesencephalon, finding an increase in blood flow in patients who no longer had impaired wakefulness post-shunt surgery [50], theoretically connected to the finding in the present paper that the number of hours asleep decreases. Is it possible that there is a primary symptom generator located at the brain stem level that influences the overall degree of reversibility? We believe that future studies of biochemical and metabolic function should focus on these brain stem regions. Here, we are measuring secondary systems, not the primary symptom generator.

The strengths of this study are its prospective design with consecutive inclusion of patients, the large patient sample, including a considerably large group of unimproved patients, the meticulous assessment of clinical symptoms and signs, and the use of an outcome measure developed for iNPH. The inclusion criteria, the proportion of improved patients, and the profile of improvement reported here are the same as in contemporary studies, supporting the representativeness of our study.

The limitations, some of which have been mentioned earlier, consist of our lack of data for better evaluating the influence of comorbidities, the lack of a systematic routine for adjusting shunt opening pressure to optimal clinical performance, and the number of dropouts and exclusions from postoperative follow-up. As patients were examined within the frame of routine care, blinding to whether the patients had had surgery or not was not possible, and the inter-rater reliability was not measured. Another limitation is that the used traditional statistical methods are not able to evaluate potential predictive values of combinations of variables, such as a machine learning approach might have enabled.

As with morphological radiological markers, there is a potential circular reasoning in studying the predictive value of clinical characteristics and symptoms, that are in fact required for the diagnosis of the disease. Further, the symptoms are what the treatment aims to improve, consequently they are what we measure to evaluate the result of the treatment. To do this as accurately as possible, the clinical tests used in the iNPH scale were only used to measure the outcome, and a wide range of other clinical variables were tested for predictive potential.

Finally, a more liberal approach to shunting a wider range of patients might have enabled clearer results on prognostic factors; however, all decisions were made as part of our general clinical routine, and we do not propose that the indications for shunt surgery should be widened for this purpose.

Conclusion

This study confirms that the recorded clinical signs, symptoms, and impairments in the adopted clinical tests are characteristic findings in iNPH, based on that most of them improved after shunt surgery. However, our clinical data did not enable predictions of whether or not patients would respond to shunt surgery, indicating that the phenotype is unrelated to the reversibility of the iNPH state and should mainly support the diagnosis. Absence of specific signs should not be used to exclude patients from treatment.