Introduction

Antenatally detected renal pelvic dilatation and/or antenatally detected hydronephrosis is associated with a definite health burden [1]. This brief editorial, which is drafted in response to the study by Herthelius et al., looks at the results of the reported long-term follow-ups [2,3,4,5], compares between similar studies and discusses the direct and indirect future implications of such precious data on healthcare.

The authors of this index study have reported a long-term follow-up of a cohort of children diagnosed with antenatal renal pelvic dilation[4]. The median/mean follow-up is 13.5 years approximately. Such data are indeed scarce and the study is a welcome addition. Whilst the study is rigorous and well designed, it is admittedly small and discordant with its handful of peers. In my view, this study generates a few significant, guiding conversations for the paediatric nephro-urological community with perhaps, a few sub-conversations within.

Conversation 1: What is the CKD risk and safe, optimal follow-up of mild renal pelvic dilatation? (ARPD ≤ 7 mm or SFU 1)

The findings of this study perhaps argue for a limited follow-up for patients whose pelvic dilation is less than or equal to 7 mm. Similar suggestions have recently been made in other publications [6, 7]. In a study focussed on unilateral dilatation and need for surgery for pelviureteric junction obstruction (PUJO), Yang et al. confirmed that SFU grade 1 dilatations did not progress to needing intervention [2]. Nevertheless, the highlight of this study is the correlation to a lack of long-term kidney morbidity which few other studies comment on (Table 1). In perhaps the only other study to add to these data, Costa et al. also report no CKD in the low-severity cohort (SFU 1–2) in their study [5]. Clinicians should feel more reassured by this.

Table 1 Comparison between studies reporting long-term outcomes of antenatally detected RPD or ANH

Conversation 2: What is the optimal management of moderate to severe renal pelvic dilatation? (ARPD ≥ 7 mm or SFU 2–4)

Sub-conversation 1: What is the risk of permanent kidney damage and CKD risk in moderate/severe dilatation (> 7 mm, SFU grades 3 and 4)?

This index study argues indirectly perhaps for a structured surveillance of the other arm (group B) mentioning a higher risk of kidney damage albeit with no reported CKD. [4] In the study, 36% cases (approximately 20% of total) in group B (>7mm RPD) had progression/new evidence of kidney damage. This is a significant finding of this study. Of note, the odds for permanent kidney damage in dilatation > 15 mm or CAKUT diagnosis are impressively high (8.9 and 14.0 respectively) and these must receive appropriate visibility and clinicians’ attention. The study findings regarding CKD appear in some discordance to Costa et al., which reports a somewhat higher risk of CKD surrogates at peri-pubertal age [5] (Table 1). Although important, it is not an easy task to explain these differences; the effect of cohort size and puberty may be two important factors. Nevertheless, these differences are significant and need to be further explored.

Sub conversation 2: Is there a long-term risk of recurrence of significant hydronephrosis after initial resolution?

One per cent of the patients in the study by Matsui develop a recurrence of severe hydronephrosis [3]. Most needed surgical correction since they had a diagnosis of PUJO in almost all cases. The authors of the current study report a 5% recurrence in group B. Whilst they have attempted to explain the difference, it is possible that there may be actually very little difference, if any. This is because 2/40 (5%) cases in the study equates to about 2% if the denominator is changed to include the entire eligible dataset which is then similar to Matsui et al. [3]. A spin-off from both these studies is that about 20% (2/10) of PUJO patients who need surgery declare themselves late despite extensive antenatal scanning support. Once again, it is reassuring that this risk is embedded almost entirely within the moderate to severe dilatation groups in both studies and should allow us to leave the counselling and follow-up of mild cases (< 7 mm or SFU 1) unaffected with confidence.

Sub conversation 3: Are we in a better position to estimate the incidence of needing surgery in antenatally detected renal pelvic dilatation or hydronephrosis?

Compared with similar studies, this study reports far reduced proportional need for surgical intervention (Table 1) [8]. Twenty to 30% children needed surgery in the other two long-term studies [2]. Whilst the reasons are not obvious, it is possible that they may be related to slightly different management paradigms and/or a selection bias (which the authors have done their best to exclude through a limited sensitivity analysis). It is therefore possible that the long-term outcomes are related to a friendlier cohort of patients compared with other studies. Furthermore, different studies have used different predictive features with some success. For example, Jung et al. reported higher predictive values of antenatal dimensions than post-natal dimensions [9]. Both these issues therefore, continue to be a space that is evolving and needs future attention.

Conversation 3: Could the cost to healthcare be reduced by less intensive screening of selected antenatal renal dilatation/hydronephrosis?

Healthcare dollars are precious and economic analyses of care models are gaining importance. With the large-scale application of antenatal scanning, the cost of healthcare is likely to go up and the policy makers may become concerned about the value it provides when applied across the board in a non-selective manner. Markov models have been reported for antenatally diagnosed PUJO [10]. An estimated cost of 90 million USD annually on this front has been mentioned by the authors of the American consensus statement [11]. Although not directly measured or mentioned, the findings of this study have the potential of relating to the issue of the cost of healthcare in different ways. Reduced imaging tests and reduced clinical encounters are an obvious gain. Additionally, the study may provide a better insurance cover for the mild cases in some medical systems, thus affecting the utilization of the public tax dollar. These additional benefits must be systematically explored going further to show our commitment to minimizing the healthcare expenditure and cutting out the low-value encounters (healthcare waste).

Conversation 4: What thresholds/grading systems should be used? Quantitative, qualitative or others

This study is based on a quantitative system of categorizing the imaging information using APD, ureteric dilatation etc. Discussion of management of antenatal renal pelvic dilatation includes many grading/classification systems: some qualitative, some quantitative, others semi-quantitative [12]. Although the authors of the index study have used a quantitative system consistently, it must be remembered that this is not the most popular system in current use [12]. Nevertheless, they have successfully determined 7 mm as the threshold below which the long-term risks are minimal. Comparison with the consensus document by Nguyen et al. is interesting [11]. Their inclusion criteria for this arm overlap significantly with UTDA1 and are lesser in severity than the lowest category in the postnatal group UTDP1, as has been rightly acknowledged by the authors of the index study [11]. Other authors have proposed antenatal thresholds with similar aims: 6, 8 and 10 mm of AP diameters at 20 week, 20–30 weeks and after 30 weeks [13]. Some others have tried to differentiate mild pelvic dilatation from hydronephrosis in order to improve timely detection of obstructive uropathies [14]. The use of varying thresholds and multiple grading systems is an issue in itself and is further complicated by less than ideal inter-observer agreement [15, 16]. Back et al. have recently advocated for increased consistency in reporting calyceal dilatation [16]. The question for us going forward will be: How can we influence clinical practice to assert uniformity?

Conversation 5: Is the regional variation in CKD and need for intervention real? A closer look at discordance between studies

The current study also expresses some noteworthy discordance with the long-term modelling study from Brazil [5] (Table 1). The study which includes 447 participants had a much higher incidence of obstructive nephropathy (specifically PUJO), higher modelled incidence of CKD and composite events, including proteinuria and hypertension, as has been rightly acknowledged by the authors. The cohort also had a higher incidence of UTI and this was correlated to an increased risk of kidney disease. This was all evident at a median follow-up of 6 years. The study from China appears to exclude group A patients (≤7mm RPD) and has a median follow-up of 142 months [2]. Nevertheless, they have almost a 25% incidence of PUJO in the included cohort, which is similar to group B in the index study (>7mm RPD) [4], and suggests a much higher kidney damage percentage extrapolated from the split kidney function data. They did not report data on creatinine, proteinuria and hypertension. Thus, three long-term studies appear to tell us slightly different stories, albeit with a significant overlap at the less severe end of renal pelvic dilatation. Another Scandinavian study reported about 6–7% (8/125) risk of surgical intervention at long-term follow-up [17]. These data point to some heterogeneity and the obvious (but unlikely) thought that arises is regarding the possibility of underlying geographical variations. I wonder whether synthesizing the available long-term studies into a meaningful meta-analysis with pre-specified sensitivity analyses would be a worthwhile exercise and would help clarify these variations.

Conversation 6: Innovation/research and the way forward

Data linkage and artificial intelligence

Recent studies using data linkage have added very useful information to the armamentarium of practising paediatric urologists, nephrologists and materno-fetal medicine experts [1]. They confirm a significantly higher medicalization risk for children with persistent pelvic dilatation through the pregnancy. Such studies have the potential to capture the ground realities from a slightly different angle and can further inform the debate on counselling and minimal management. Deep learning algorithms and other artificial intelligence-based approaches are entering this space too [18]. They will undoubtedly have an impact on our understanding of the problem in the future.

Do we need to do more to understand the etio-pathological mechanisms?

The aim of science is to be able to explain significant phenomena. In that context, have we done enough to explain renal pelvic dilatation from an etiological perspective? Perhaps not and therefore it appears that observational studies rule the roost. Some elegant animal studies provoked thoughts which admittedly remain difficult to pursue in human environments, such as the theory of premature urachal closure [19]. With advances in genetics, epigenetics and molecular medicine, we should perhaps advocate more for pushing further along this path.

In conclusion, the authors deserve praise for bringing these data to our attention. We owe it to them to start a few clinico-academic conversations with the hope of working towards a better, more standardized care model some time into the future.