Introduction

Granulomatosis with polyangiitis (GPA) is a heterogeneous multisystem disorder causing granulomatous inflammation and necrotising vasculitis in small- and medium-sized vessels. The disease is associated with Anti-Neutrophil Cytoplasmic Antibodies (ANCA), predominantly targeting proteinase 3 (PR3-ANCA). The upper and lower respiratory tract and the kidneys are the organ systems most commonly involved [13]. GPA usually runs a relapsing-remitting course with damage accrual. The relapse rates are high, particularly after withdrawal of immunosuppressive therapies [4, 5].

Sinonasal inflammation is common in GPA, with a reported cumulative incidence of at least 80% [4, 6]. Clinical and imaging features indicate an aggressive disease course of the chronic rhinosinusitis (CRS) related to GPA, with pronounced damage to cartilage and bone [7]. Imaging, predominantly paranasal sinus computed tomography (CT) scans, is important for diagnosis and follow-up of CRS in GPA. By CT, sinonasal mucosal thickening and fluid may represent disease activity, while inflammatory bone thickening (defined as osteitis) and destructions presumably represent CRS-related sinus damage.

When GPA patients were asked to rank disease-related challenges in daily life, sinonasal symptoms reached top five together with fatigue, pain, musculoskeletal symptoms, and breathing difficulty [8], explaining why CRS-related features were prioritised items in the newly developed and validated ANCA-associated vasculitis patient-reported outcome measures (AAV-PRO) [9]. Nonetheless, the research on CRS in GPA is still quite limited [6, 10].

Damage accrual, determined by time-dependent increase in the Vasculitis Damage Index (VDI), appears to have major impact on morbidity and mortality in GPA [11]. It is not clear, however if VDI has sufficient resolution to capture extent and development of organ-restricted damage, such as the damage caused by CRS. Follow-up by paranasal sinus CT scans may be a better method to assess sinonasal damage development in GPA. Recently, we developed and applied methods suitable for evaluation of serial paranasal sinus CT scans of GPA patients. We used these methods to assess time-dependent CRS changes, with thickening of the bones of the sinus walls as the main marker for sinus damage [12, 13].

The aim of this study was to identify clinical and radiological baseline predictors of severe CRS outcome, i.e. osteitis, in patients with GPA. Specifically, we analysed the potential impact of clinical variables from time of diagnosis and baseline paranasal sinus CT findings on osteitis. With new knowledge on predictors, we may be able to identify patients at high risk for CRS-related damage already at baseline and provide targeted treatment at early disease stages. Hopefully, this might contribute to reduce the huge negative impact of CRS on GPA patients’ perceived quality of life.

Methods

GPA patient cohort and study timeline

We have described details on the GPA patient cohort in previous paranasal CT studies [12, 13]. Briefly, the cohort included all the GPA patients from the Norwegian systemic vasculitis and connective tissue disease registry (NOSVAR), a consent-based patient registry run by the Department of Rheumatology at Oslo University Hospital. Pre-specified inclusion criteria were as follows: (I) clinical ANCA vasculitis classifiable as GPA according to the European Medicines Agency (EMA) algorithm [14], and (II) at least two sets of archived sinus CT scans available for analysis, with minimum 1 year between time of baseline CT and time of last CT. Specific to this study, we included patients who had complete electronic medical record (EMR) available from time of GPA diagnosis. We assessed changes in radiological features during the study observation period, defined from the time of baseline CT to the time of last CT in each patient (Fig. 1).

Fig. 1
figure 1

Plot showing the study timeline and osteitis development from baseline to end of follow-up. a Timeline plot. Timepoint zero was defined as the year of diagnosis. Median differences between the year of diagnosis and the time of the baseline and the last CT are shown as diamond-shaped points, and the interquartile ranges are shown as horizontal error bars. The mean study observation period with error bars of 2 standard deviations is shown at the bottom. b Groups of the study. At baseline, there were two groups. Three different osteitis trajectories ended in three different outcome groups. The connection lines show how the patients changed groups during the study observation period

Clinical parameters assessed at time of GPA diagnosis

From each patient, we reviewed and recorded complete EMR information from time of diagnosis to registration end on 31 December 2016 in a pre-specified medical history form. Form items included the following: date of GPA symptom onset, date of GPA diagnosis assessed by an experienced physician, and type and extent of organ involvement at time of diagnosis defined by the Birmingham Vasculitis Activity Score, version 3 (BVAS) [15]. We also recorded smoking status and the findings on routine blood tests, ANCA tests, and urine analyses. We were able to collect information on medication for the entire observation period for 87 patients. For the remaining 29 patients, only baseline data were available. We defined aggressive first-line treatment as either cyclophosphamide or rituximab treatment or a combination of both. We further chose to stratify the patients according to the definition of the Wegener’s Granulomatosis Etanercept Trial (WGET) research group of limited and severe disease [16], as recommended by EULAR [10] where limited disease equals no organ or life-threatening disease while kidney involvement usually is classified as severe disease.

We retrieved data on paranasal sinus surgery from the EMR and from the Norwegian Patient Registry, which is a mandatory national registry that includes patient identifiable data on surgical procedures from 2008 [13].

Evaluation and scoring of CRS-related CT findings

We scored CRS-related CT features using scoring systems developed for CRS assessment as previously described [12]. Briefly, we assessed mucosal thickening or fluid (mucosal disease) using the Lund-Mackay score (LM-score) [17], where a completely clear sinus is scored as 0 and a completely opacified sinus as 2, and all degrees of mucosal disease in between are scored as 1. In addition, the ostiomeatal complex is scored as 0 for an open passage and 2 for an obliterated one. For ten sinuses (pairs of frontal, maxillary, and sphenoid sinuses, and pairs of anterior and posterior ethmoid sinuses) and two ostiomeatal complexes, the maximum LM-score is 24.

We assessed bone thickening, i.e. osteitis, using the Global Osteitis Scoring Scale (GOSS) [18], where ten individual sinuses are scored from zero (no osteitis) to 5 (>50% of the sinus wall is >5 mm thick), giving a maximum score of 50. We scored bony destruction as present (score 1) or absent (score 0) in 18 defined structures (10 sinuses, 3 pairs of conchae, nasal septum, and hard palate). Saddle nose deformity was evaluated using the lateral contour of the nose seen on the lateral scout view of the CT scans.

We followed the osteitis trajectories of the cohort, and in line with our two previous studies [12, 13], we stratified the patients into three groups based on the osteitis development over time: (I) no osteitis, (II) stable osteitis, and (III) progressive osteitis. Specific to this study, the patients were stratified by osteitis severity at the end of study observation, using a modification of the GOSS severity grading proposed by Georgalas et al. [18]. Three groups were defined: (I) no osteitis (GOSS = 0), (II) mild to moderate osteitis with GOSS <35 and no single sinus with score >4, and (III) severe osteitis with GOSS ≥35 or GOSS <35 and at least one sinus score = 5 (Fig. 1).

Statistical analyses

Normally distributed variables were given in mean (SD) and skewed variables in median (range) where range is from the minimum value to the maximum value. Proportions were given in number (percentage). Mean difference with 95% confidence interval was calculated by t-distribution. Due to skewed data, we calculated correlation by Spearman’s rank correlation coefficient with a bias-corrected and accelerated (BCa) bootstrap 95% confidence interval.

Logistic regression modelling was used to assess the strength of association between selected covariates and osteitis at last CT, which was our CRS-related outcome variable. Some of the patients had undergone sinus surgery either before or during observation. We excluded patients after they had undergone surgery in the present modelling data set, to avoid time-varying covariates. Patients who had at least 1 year of observation before their first surgery were included with their preoperative observation time (n = 102). Covariates were baseline mucosal thickening or fluid (mucosal disease) scored by LM-score, renal involvement at baseline by BVAS, age at diagnosis, sex, disease duration at the baseline CT, and the time between baseline CT and last CT (i.e. observation period). Medication was not included in the model as it is a time-varying covariate. Level of significance was set at <0.05.

Results

GPA patient cohort, demographics, and clinical characteristics at time of diagnosis

The GPA cohort consisted of 116 patients. Mean age at diagnosis was 45 (SD = 19) years and 54% were men. ANCA was positive in 109 patients. Sixty-four patients (55%) had severe GPA as defined by the WGET research group criteria [16] (Table 1). At the time of GPA diagnosis, 37/116 (32%) patients had concurrent involvement of upper and lower airways and kidneys, while 26/116 (22%) had combined upper and lower airway diseases, with no renal involvement. Symptoms and/or signs of upper airway disease were present in 107/116 (92%) of the GPA patients, with involvement of the nose and/or sinuses in 102 of them. Sinus involvement at baseline was seen more often in men (76%, 48/63) than in women (58%, 31/53), and more men (63%; 40/63) than women (42%; 22/53) had baseline kidney involvement.

Table 1 Demographics, baseline clinical data, and baseline CT scores in total cohort

Changes in paranasal sinus CT findings from baseline to end of follow-up

We evaluated 697 sinus CT scans from the 116 patients, with median 5 CT scans per patient. From 68/116 patients, we had baseline CT scans performed within 1 year from the time of GPA diagnosis (Fig. 1).

At the baseline CT scan, we found that of the 116 patients, 95 (82%) had mucosal thickening or fluid in the sinuses (defined by LM-score >0), 29 (25%) had bony destructions (Table 1), and 53 (46%) had osteitis (Fig. 1). The patients who had osteitis at baseline had a longer delay between year of diagnosis and year of baseline CT (mean difference = 2.8; 95% confidence interval 0.7 to 4.9).

To determine changes in CRS-related damage during the median 5-year study observation period, we scored osteitis severity (GOSS) on all CT scans performed during follow-up. In line with previously published data on this cohort [12], we found that the degree of osteitis either remained stable or progressed during observation. In this cohort, 53 patients had osteitis at the baseline CT, 26 of these had stable osteitis during observation, while 27 had progressive osteitis (Fig. 1). Time-course analyses of the 63 patients with no signs of osteitis at the baseline CT showed that 45 patients remained free from osteitis during observation, seven developed a mild osteitis that remained stable across the observation period, and 11 displayed progressive osteitis (Fig. 1).

At the end of the study period, defined for each patient as the time of the last sinus CT scan, outcomes regarding CRS-related damage were as follows: 45 patients had no osteitis, 34 patients had moderate osteitis, and 37 had developed severe osteitis (Fig. 1).

Analyses of additional CT damage parameters showed that 15/116 (13%) had nasal septum perforation (n=12) and/or saddle nose deformity (n=8) already at baseline. At the end of follow-up, 30/116 (26%) patients had saddle nose and/or nasal septum perforation, with saddle nose in 16 and nasal septum perforation in 26. Sixty percent (18/30) of the patients who had saddle nose and/or nasal septum perforation at end of observation had at the time of diagnosis limited disease, according to the definition of WGET.

Associations between baseline radiological features and outcome of CRS

As expected, we found the most pronounced increase in osteitis from baseline to end of study in the patient subset with a severe osteitis outcome (Fig. 2). Regarding mucosal disease, we found gradually increasing median baseline LM-score from the no osteitis group via the moderate to the severe osteitis group (Table 2 and Fig. 2). Consistently, the baseline LM-score correlated with GOSS at last CT (ρ=0.64 with 95% confidence interval 0.50 to 0.75).

Fig. 2
figure 2

The levels of the three CT parameters at baseline and last CT. Upper panel shows bone thickening (i.e. osteitis) measured by Global Osteitis Scoring Scale (GOSS), number of destructed sinonasal bone structures in the middle panel, and sinonasal mucosal disease measured by the Lund-Mackay score (LM-score) in the bottom panel. The variables are grouped by the degree of osteitis severity at last CT. The observations are shown as red dots

Table 2 Demographics, baseline clinical data, and CT scores at baseline and last CT grouped by osteitis severity at last CT

Additional file 1: Figure S1 shows the relationship between longitudinal measurements of LM-score compared to GOSS in GPA cohort patients with no history of paranasal sinus surgery, available clinical data on medical treatment, and a timespan from baseline clinical data to first CT of <2 years (n=68). The figure indicates distribution of baseline LM-score in the patients stratified by osteitis trajectories. We identified increasing osteitis following a CT scan with a high LM-score in individual patients. Additional file 2: Figure S2 shows the same patients grouped by three levels of baseline LM-score, with timelines for osteitis development and treatment with rituximab/cyclophosphamide showed for individual patients. We were not able to show any clear effects of the treatment on the osteitis development.

Finally, the patients with severe osteitis outcome also had more bony destruction including saddle nose deformity than the no osteitis and moderate osteitis outcome groups (Fig. 2, Table 2), but it should be noted that surgical interventions in sinuses, which often include removal of bony structures, were more prevalent in the severe osteitis group (Table 2).

Associations between baseline demographics, clinical characteristics, and outcome of CRS

When patients were stratified by outcome of CRS, we found that the group ending with severe osteitis (n=37) were younger at disease onset than the patients with no osteitis (n=45) or moderate osteitis (n=34) and included more smokers (Table 2). In patients who were <50 years at disease onset, we found that 40% (25/63) had a severe osteitis outcome, compared to 23% (12/53) in patients with onset >50 years (Fig. 3). Severe osteitis outcome was slightly more frequent in men (35%, 22/63) than in women (28%, 15/53). Overall, we found that types and extent of organ involvement at baseline were quite similar across the osteitis outcome groups (Table 2). However, in patients with renal involvement at baseline (as defined by BVAS), severe osteitis outcome was less frequent (24%, 15/62) than in patients with no renal disease (41%, 22/54). We found a similar difference with disease classified using WGET, where 28% (18/64) with severe disease and 37% (19/52) with limited disease developed severe osteitis (Fig. 3).

Fig. 3
figure 3

Associations to the degree of osteitis severity at last CT. Demographic data including age at diagnosis (a), baseline clinical data (b), and total sinus surgery (c). Limited/severe disease is by the definition of the WGET research group

We had complete data sets on the aggressive first-line therapies (cyclophosphamide or rituximab) given during observation in 87 GPA cohort patients. In these 87 patients, we found that 100% (44/44) of those with baseline kidney disease and 79% (34/43) with no kidney disease at baseline received aggressive first-line treatment. Thus, only nine patients never had any treatment with cyclophosphamide or rituximab. Of these nine patients, five had no osteitis at last CT and four had moderate osteitis. The aggressive first-line treatment group of patients (n=78) were distributed between all three osteitis severity groups: 41% (32/78) had no osteitis at last CT, 33% (26/78) had moderate osteitis, and 26% (20/78) had severe osteitis.

Surgery is associated to osteitis [13, 20]. Consistently, we found a large difference in osteitis outcome between GPA patients with or without a history of sinus surgery (Fig. 3). Therefore, we removed the surgery patients (7/87) from the group of patients having complete data on medication. Osteitis development and timing of cyclophosphamide or rituximab medication are shown in these patients in Additional file 2: Figure S2. After removing the surgery patients, a lesser proportion of the patients had severe osteitis (13/80, 16%). We selected the patients who had aggressive first-line treatment (71/80) and stratified them on kidney disease and found that a higher proportion of patients without baseline kidney disease had severe osteitis at last CT (29%, 8/28) than of patients with baseline kidney disease (12%, 5/43).

Baseline clinical and radiological features as risk factors for osteitis

Patients were excluded after sinus surgery from the modelling, as detailed in the “Methods” section. We excluded surgery and medical treatment as covariates in the model since they were time-varying variables.

A logistic regression model was fitted with baseline covariates and the outcome variable which divided the patients into two groups (no osteitis and a merged moderate and severe osteitis group). We found a clear association between baseline LM-score and the osteitis outcome variable, where the odds of having osteitis at last CT increased by 33% for one-unit increase in baseline LM-score with 95% confidence interval from 17% increase in odds to 57%. Baseline renal disease and sex were included in multivariable analyses but were not significant (Table 3).

Table 3 Univariable and multivariable logistic regression models with outcome variable osteitis at last CT. Estimates (odds ratio), 95% confidence intervals (CI), and p values

Discussion

CRS is highly prevalent in GPA and a major contributor to perceived disease burden. Still, we know little about risk factors for severe CRS outcomes in GPA. In this study, the main finding was that baseline sinonasal CT findings consistent with mucosal disease increased the risk for development and progression of CRS-related damage (osteitis) during follow-up. We found no effect of age at diagnosis or disease duration at the time of the baseline CT (OR = 1), but the results indicated, albeit not statistically significant, that baseline kidney involvement reduced the odds.

Our finding of a positive correlation between baseline mucosal disease measured by LM-score and development of osteitis measured by GOSS in GPA is novel. A previous cross-sectional study described associations between LM-score and osteitis in CRS [18], but this was not investigated in studies on GPA-related CRS [7].

In this study, we used sinonasal osteitis rather than destructions as primary marker of CRS-related damage, because we had previously observed in individual patients that it was highly challenging to differentiate bony destructions caused by CRS from post-surgical bone changes [13]. Since no prospective studies exist on CRS in GPA, we do not know if CRS-related damage correlates with sinonasal symptom severity. However, we do know that sinonasal complaints and sinonasal osteitis both are common in GPA patients [6, 8]. In CRS, a prospective follow-up study showed that patients with severe osteitis at baseline experienced less symptom improvement after the surgical intervention than patients without osteitis [19].

In addition to osteitis, we focused on saddle nose deformity and nasal septum perforation as clinically important organ damage. Few GPA studies have assessed these nose complications by paranasal CT scans [7], and only one of these studies included more than 15 patients [20]. This cross-sectional study from a tertiary otolaryngology centre by Grindler et al. reported CT findings consistent with saddle nose in 34% (25/74) of GPA patients and septal perforation in 35% [20]. For comparison, by end of follow-up, we identified septal perforation and saddle nose in 22% (26/116) and 14% (16/116) of patients, respectively. This difference indicates probably selection for GPA patients with severe sinonasal disease in Grindler’s cohort.

Although no variables other than baseline mucosal disease were significant in the logistic regression model, some clinical variables did show a different distribution between the osteitis groups. Specifically, patients with no renal involvement and/or limited GPA (by the WGET definition) at baseline seemed to have increased risk for severe osteitis development. The potential negative association between baseline kidney involvement and severe CRS is, to our knowledge, a new observation. It is, however, indirectly supported by the finding of low frequency of renal disease (27%) in the abovementioned GPA cohort from Grindler et al. [20], where a high proportion of patients had severe CRS with osteitis (78%).

The positive correlation between baseline kidney disease and severe osteitis might be due to more aggressive systemic treatment of the patients with kidney disease. In our cohort, 90% (78/87) of the patients with complete data on medical treatment had cyclophosphamide and/or rituximab treatment. This is probably the major reason why we were not able to show any differences in osteitis development related to treatment. The different frequency of severe osteitis between patients with or without baseline kidney disease remained when comparing the cyclophosphamide/rituximab-treated patients. The amount of cyclophosphamide or rituximab were not accounted for.

Finally, it seemed that younger age at diagnosis (<50 years as compared to ≥50 years) associated with development of severe osteitis during observation (Fig. 3). The difference was perhaps due to longer disease duration in the younger patients with mean increased disease duration of 4.8 years with 95% confidence interval 2.2 to 7.3 years. However, twice as many younger patients had undergone sinus surgery which could reflect a more aggressive disease in the younger and partly explain the higher level of osteitis. In the logistic regression model, where the surgery patients were excluded, the age at diagnosis had no effect (odds ratio = 1). The potential effect of sex on osteitis was unclear, with trends in opposite directions in the uni- and multivariable analyses. We do, however, believe that the relatively even distributions of males across the three categories of osteitis at study end (as shown in Fig. 3) argue against sex differences regarding severe CRS damage.

Limitations and strengths

The major strength of this study is that it contains detailed analyses of a large number of paranasal sinus CT scans obtained over a relatively long observation period from a largely unselected cohort of GPA patients. Due to the unselected nature of the cohort, we were able to identify groups of patients with different CRS trajectories, ranging from patients with no osteitis ever to patients with progressive changes and severe osteitis outcome.

An obvious weakness, which relates to the observational study design, is that we did not have baseline CT from time of GPA diagnosis in all the patients. Additionally, we could not develop a prediction model for osteitis because of too little data, and we were not able to fit any model with more than one significant variable.

Furthermore, the study did not take into account the underlying systemic activity of GPA at time of CT scanning. As previously mentioned, BVAS is the recommended measure of disease activity in GPA. We have retrospectively scored BVAS at baseline in this study, but did not have data on BVAS from the time of each CT scan. However, the efficacy of BVAS as a disease activity measure for the sinonasal area in GPA is under debate since BVAS has been considered too coarse. Decker et al. [21] proposed therefore an endoscopy-based disease activity score in GPA, but this required follow-up by otorhinolaryngologists.

We think that our clear association between baseline mucosal disease and osteitis at last follow-up could suggest LM-score as a potential useful indicator of activity. Our longitudinal data on mucosal disease measured by LM-score and osteitis also suggests that LM-score measured at each follow-up CT might be useful as a disease activity measure. This needs confirmation in independent GPA populations in prospective studies with longitudinal data and preferably comparison to both endoscopy findings and BVAS.

Conclusion

We found a clear association between baseline paranasal sinus mucosal disease and osteitis at last follow-up (odds ratio = 1.33 with 95% confidence interval 1.17 to 1.57). No other variables reached the significance level of 5%, but there seemed to be a negative association between osteitis and kidney disease even in patients who had cyclophosphamide or rituximab treatment, indicating that development of severe CRS damage is not evenly distributed across the GPA population.