Introduction

Celiac disease (CD) is a chronic immune-mediated enteropathy triggered by dietary gluten in genetically predisposed individuals [1,2,3] with a prevalence of around 1% in the general population [1,2,3,4].

The clinical presentation of CD is very heterogeneous, ranging from an overt malabsorption syndrome, to mild intestinal/extra-intestinal symptoms, or to asymptomatic patients [1,2,3]. Positive endomysial (EmA)/tissue transglutaminase (tTA) antibodies and villous atrophy (VA) on correctly oriented duodenal biopsies are still recommended for the diagnosis of CD in adults in most countries [1,2,3], although great interest has been dedicated to the possibility of a biopsy-sparing approach also in adults [5, 6]. A strict lifelong gluten-free diet (GFD) is the cornerstone for treatment of CD, leading to complete resolution of symptoms/histological lesions in the vast majority of patients, and preventing long-term morbidity and mortality associated to CD [1,2,3,4, 7,8,9,10,11]. However, it has been reported that in up to 30% of patients symptoms and/or histological lesions may not resolve completely despite being on a GFD [12,13,14,15,16,17,18].

These clinical scenarios have a wide spectrum of etiologies, which may be related or unrelated to CD. Ongoing gluten ingestion (voluntary or inadvertent) has been reported to be the prevalent cause of persistent symptoms in CD. However, other etiologies may also be responsible, ranging from purely functional gastrointestinal disorders, to life-threatening malignant complications of CD [12,13,14,15,16,17,18]. Despite their wide clinical variability, these scenarios are very often grouped together under the umbrella term non-responsive CD (NRCD). The literature provides data on the underlying etiology of NRCD, with particular interest on refractory CD and malignant complications of CD, for which risk factors and natural history have also been delineated [11, 12, 19,20,21]. Conversely, very little is known on the natural history of symptoms and disorders unrelated to complicated CD occurring in celiac patients on a long-term GFD and how this may influence modalities for organizing the follow-up of these patients. Although major international guidelines suggest maintaining regular follow-up consultations for all adult celiac patients, there is no widespread consensus on timing and modalities for organizing it cost-effectively [1,2,3]. Therefore, the present study aims to: (1) evaluate occurrence and etiologies of symptoms leading to clinically relevant events (CREs), both related and unrelated to CD, in adult celiac patients on long-term follow-up; (2) identify any relevant predictors of CREs during follow-up; 3) suggest possible follow-up strategies in adult celiac patients.

Patients and Methods

Study Population and Setting

This is a single-centre retrospective study of adult patients (≥ 18 years old) with biopsy-proven CD on a long-term GFD and follow-up, which aims to describe the natural history and identify the etiologies and predictors of persistent/recurrent symptoms despite a GFD. Patients enrolled in this study all underwent follow-up duodenal biopsy, as part of our standard of care in the last 20 years [22].

Enrolment and Exclusion Criteria

Patients who were directly diagnosed with CD at our center between January-2000 and November-2019, and followed-up in clinic until May-2021 were the focus of the present study. All patients underwent follow-up duodenal biopsy and regular dietary assessment of GFD adherence, either via interview by expert personnel or using a validated questionnaire we previously developed [23]. In the last 20 years our center has provided care to over 800 patients, as previously described [24]. To avoid biases, for the purpose of the present study, we included only patients directly diagnosed by our center who had at least one follow-up biopsy. In other words we excluded all referred patients and any patients who did not undergo follow-up duodenal biopsy for various reasons.

CD was diagnosed based on a certain degree of VA on duodenal biopsies from the second part of the duodenum and positive EmA and/or tTA while on a gluten-containing diet [1,2,3].

Data Collection

Patients’ medical records were retrospectively reviewed between June-2021 and September-2021, and the following data were collected: age at diagnosis of CD, presenting symptoms of CD according to Oslo classification [25], first-degree family history of CD, results and time of follow-up duodenal biopsies and EmA testing, number of follow-up medical consultations, duration of follow-up, GFD adherence throughout follow-up, and occurrence of clinical events, both related and unrelated to CD during follow-up.

Evaluation of Symptoms and Clinical Events During Follow-Up

Symptoms/signs (persistent, recurrent or new symptoms/signs) at any time during follow-up requiring any diagnostic testing, treatment, emergency room access, or hospitalization were considered CREs for the purpose of our study. Their etiologies and outcomes were also evaluated.

Persistent symptoms were defined as those already present at diagnosis of CD, which did not improve significantly or resolve during follow-up despite a GFD [22]; recurrent symptoms were those that relapsed despite initial resolution/improvement on a GFD; symptoms which developed for the first time while on a GFD, were considered ‘new onset symptoms’. Data on symptoms was collected regardless of whether a relationship with CD was suspected or not.

Diagnostic tests performed for scheduled follow-up of chronic conditions in the absence of symptoms, including follow-up of persistent VA, or in the context of a screening program were not considered as clinical events. This strategy was adopted in order to provide the most objective evaluation of CREs throughout follow-up in a retrospective study.

Clinical characteristics of patients with at least one event were compared to those of patients without events to identify predictors of developing events. Patients who had at least one clinically relevant event during follow-up as defined above were considered patients who developed events while those without any events were considered event-free. All events observed throughout follow-up were analysed to determine their aetiology and to estimate overall incidence of events.

Criteria to Assess GFD Adherence

All patients were instructed immediately after diagnosis of CD by expert personnel on how to follow a strict GFD correctly. Throughout follow-up, GFD adherence was assessed, until 2008, by dietary interview by expert personnel, after 2008 using a five-point validated score we previously developed [23, 24]. Patients scoring 0–2 were considered poorly adherent, while those scoring 3–4 were considered adherent.

Statistical Analysis

Statistical analysis was performed using R version 4.1.2 (R Core Team (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/). Categorical variables were summarized as total counts and percentages. Univariate analysis of categorical variables was performed with Fisher’s exact test. The Cochran-Armitage test was used to evaluate trends in occurrence of events according to patient age groups. Continuous variables were summarized as mean and standard deviation or median and interquartile range. Testing for normality of data was performed using the Shapiro–Wilk test. For univariate analysis of continuous variables among groups the Mann–Whitney U test was used. Spearman rank correlation was used to evaluate for correlation among variables. Incidence rate of events and exact binomial 95% confidence intervals (95% CI) were calculated. Two-sided p-values < 0.05 were considered statistically significant. A multivariable Cox model was developed to identify predictors of event-free follow-up and to estimate hazard ratios and 95% CI. Harrel’s c statistic and 95% CI were calculated to evaluate model discriminatory ability. Variables for multivariable Cox model analysis were selected based on results of univariate analysis and on the basis of clinical relevance. Kaplan–Meier curves for event-free follow-up were generated and the logrank test was used to compare event-free follow-up rates among groups. Mortality among patients with and without events during follow-up was compared using the logrank test.

Results

One-hundred-and-eighty-nine celiac patients (133 F, mean age at diagnosis 36 ± 13 years) who underwent routine follow-up duodenal biopsy and were followed-up for a median of 112 months (IQR 73–164) were enrolled. At follow-up duodenal biopsy (after a median of 16 months since diagnosis, IQR 13–20 months), 29 patients (15.4%) still showed a certain degree of VA. Among these 29 patients, 2 (6.9%) had complicated CD, 9 (31.0%) had poor GFD adherence, while the remaining 18 (62.0%) showed initial but incomplete histological improvement in the context of good GFD adherence. GFD adherence at time of follow-up biopsy was good in the vast majority of patients (160/176, 90.9%, 13 data missing). Among patients without complications who underwent a further follow-up duodenal biopsy, histological recovery occurred in 75% of the patients while 25% still had atrophy due to poor GFD adherence.

At time of follow-up duodenal biopsy 95 patients overall (50.3%) had ongoing symptoms, of which 64/95 (37.0%) had persistence of symptoms present at diagnosis despite a GFD and 42/95 (24.6%) developed new symptoms which were not present at diagnosis. Only 11 of the 29 patients with VA at follow-up biopsy (37.9%) had persistent symptoms at time of follow-up biopsy.

Demographic and Clinical Characteristics of Patients With and Without Clinical Events

Eighty-eight out of 189 patients (47%) had at least one clinically significant event (symptoms requiring additional diagnostic testing, medical treatment, emergency room access or hospitalization) during follow-up, while the remaining 101 patients (53%) had no events. As shown in Table 1, patients who had at least one clinical event during follow-up were older at diagnosis (p = 0.02) and more likely to have a classical pattern of CD at presentation (p = 0.01). They were also more likely to have persistent VA at follow-up duodenal biopsy (p = 0.03). As shown in Fig. 1, there was a trend for increasing prevalence of events with increasing age at diagnosis (p < 0.01). No relationship was found between GFD adherence at time of follow-up duodenal biopsy and occurrence of events during follow-up (p = 0.44). In total, 5 patients died over follow-up, after a median of 146 months from diagnosis of CD (IQR 122–161). All patients that died had at least one clinical event during follow-up, while none of the patients who had no clinical events died during follow-up (mortality 5.7% vs 0%, p = 0.03). As would be expected, patients with clinical events had a significantly higher number of follow-up medical consultations in our Unit (median 4 vs 2, p < 0.01) and a slightly, although not statistically significant, longer duration of clinical follow-up than those with no events (median 125 vs 100 months, p = 0.06). Likewise, total number of events correlated with number of follow-up medical consultations (Spearman rank correlation, rho = 0.49, p < 0.001).

Table 1 Demographic and clinical characteristics of patients with and without clinical events
Fig. 1
figure 1

Prevalence of patients with and without clinical events according to age group at diagnosis of celiac disease. *Cochran-Armitage trend test, p < 0.01

Clinical Events

Symptoms and Etiologies

In total 157 clinically significant events occurred throughout follow-up in all patients, resulting in an incidence of events of 83.5/1000 person-years (95% CI 70.9–97.6). Median time to event from diagnosis of CD was 50 months (IQR 26–107). Events, as shown in Fig. 2, were most prevalent in the first 5 years from diagnosis of CD and decreased thereafter (overall event-free follow-up rates were 65% at 5 years and 51% at 10 years). In managing the 157 events, which occurred, a total of 92 outpatient medical treatments, 63 diagnostic investigations and 13 emergency room or hospital admissions were required. The most common symptoms leading to clinical events were gastroesophageal reflux disease symptoms (30 cases, 19%), diarrhea (24, 15%), dyspepsia (21, 13%), and constipation (15, 10%). Figure 3 shows the underlying etiologies of clinical events the most common being gastroesophageal reflux disease (29, 18%) and functional gastrointestinal disorders (47, 30%) such as functional constipation (16, 10%), irritable bowel syndrome (15, 10%) and functional dyspepsia (14, 9%). Poor adherence to a GFD accounted for 4% of total events and complications of CD for only 2%.

Fig. 2
figure 2

Kaplan–Meier curve showing distribution of events over time from diagnosis of celiac disease

Fig. 3
figure 3

Bar chart showing etiology of clinical events. GERD gastroesophageal reflux disease, GI gastrointestinal, GFD gluten-free diet CD coeliac disease, IBD inflammatory bowel disease, COPD chronic obstructive pulmonary disease. Functional GI disorders include: functional constipation, functional dyspepsia, irritable bowel syndrome, functional diarrhea. Two events which occurred in the context of poor GFD adherence (due to Helicobacter Pylori infection and dyslipidemia) were not included under poor GFD adherence as they were unrelated in nature to GFD adherence. Other etiologies included: colonic diverticular disease, colon polyps, dyslipidemia, dermatitis herpetiformis relapse, small intestinal bacterial overgrowth, COPD exacerbation, diabetic ketoacidosis in type 1 diabetes mellitus, acute gastroenteritis, ANCA-associated vasculitis, multifactorial anemia, hypercalcaemic syndrome, spondyloarthritis, osteoporosis, NSAID abuse, erosive gastritis, recurrent aphthous stomatitis

Diagnostic Tests

Of the 63 diagnostic tests performed, the most common ones were upper GI endoscopy with biopsies (28 cases, 44%), colonoscopy (13 cases, 21%) and abdominal ultrasonography (11 cases, 17%). The most common symptoms leading to diagnostic testing were diarrhoea (16 cases, 25%), dyspepsia (13 cases, 21%), abdominal pain (12 cases, 19%) and hematochezia (6 cases, 10%). Irritable bowel syndrome (12 cases, 19%), functional dyspepsia (10 cases, 16%), hemorrhoids (6 cases, 10%), gastroesophageal reflux disease/oesophagitis (5 cases, 8%) and lactose intolerance (4 cases, 6%) were the most common etiologies for symptoms requiring diagnostic investigations.

Need for Medical Treatment

Medical treatment was required in 92 events. The most common reasons for medical treatment were gastroesophageal reflux disease/oesophagitis (29 cases, 32%), micronutrient deficiencies (16 cases, 17%) including iron, folate and vitamin B12 deficiency with or without consequent anemia, functional constipation (15 cases, 16%), Helicobacter pylori infection (11 cases, 12%). Symptoms resolved after treatment in 75% of cases while they persisted in the remaining 25%, with functional constipation (43%), iron-deficiency anemia (21%), and H pylori infection (14%) being the most common to persist.

Emergency Room Access/Hospital Admission

Eight patients required emergency room/hospital admission during follow-up, with a total of 13 such events. Three patients were admitted to hospital due to persistence or relapse of severe malabsorption, all due to development of complications of CD (enteropathy-associated T-cell lymphoma, B-cell lymphoma, type 1 refractory CD). Two of these patients had primary unresponsiveness to a GFD (VA at follow-up biopsy and persistent malabsorption) while the last patient initially responded clinically and histologically to a GFD and after several years developed a complication [20]. Other reasons for emergency room access/hospital admission were not directly related to CD and included biliary colic, diabetic ketoacidosis and subsequent diagnosis of type 1 diabetes mellitus, acute gastroenteritis, ANCA-associated vasculitis, small intestinal bacterial overgrowth and exacerbation of chronic obstructive pulmonary disease (one patient each). Finally, one patient attended the emergency room and was subsequently hospitalized 4 times for reasons unrelated to CD (including hypercalcaemic syndrome, spondyloarthritis flare, multifactorial anemia). GFD adherence was good in all patients who attended the emergency room or were hospitalized.

Predictors of Clinical Events

Multivariable Cox analysis identified age at diagnosis ≥ 45 years (HR 1.68, 95%CI 1.05–2.69, p = 0.03) and classical pattern of CD at diagnosis (HR 1.63, 95% CI 1.04–2.54, p = 0.03) as independent predictors of developing clinical events during follow-up. However, persistence of VA at follow-up duodenal biopsy did not reach statistical significance (HR 1.55, 95%CI 0.91–2.63, p = 0.11). Harrell’s c was 0.60 (95%CI 0.54–0.66).

Distribution of Clinical Events During Follow-Up According to Age and Clinical Pattern at Diagnosis of CD

As shown in Fig. 4 occurrence of events during follow-up differs significantly (p < 0.01) according to age group (< 45 years, ≥ 45 years) and clinical pattern of CD at diagnosis (classical vs non-classical/silent). Overall, at 5 years follow-up, 46% of classical patients diagnosed at age ≥ 45 years were event-free, while event-free rates were 62% for non-classical/silent patients ≥ 45 years at diagnosis, 60% for classical patients < 45 years old at diagnosis, and 80% for non-classical/silent patients < 45 years old at diagnosis. At 10 years, event-free rates were 25%, 47%, 51% and 60% for these four groups, respectively.

Fig. 4
figure 4

Event-free follow-up rates according to age at diagnosis and clinical pattern of celiac disease at diagnosis (Oslo classification [25]). Group 1: Patients diagnosed at age ≥ 45 years with classical CD. Group 2: Patients diagnosed at age ≥ 45 years with non-classical/silent CD. Group 3: Patients diagnosed at age < 45 years with classical CD. Group 4: Patients diagnosed at age < 45 years with non-classical/silent CD

Discussion

This retrospective study, conducted on data collected over the last two decades in a referral centre, provides a real-world overview on the etiologies, natural history, and clinical predictors of persistent, recurrent or newly developing symptoms both related and unrelated to CD in adult celiac patients on a long-term GFD. Major findings include the identification of celiac patients at risk of having CREs over follow-up, potentially providing new clinical perspectives on follow-up modalities for adult CD.

Firstly, we have shown that functional gastrointestinal disorders were the leading cause (30% of cases) of clinical events during follow-up, whereas etiologies strictly related to CD, which included inadequate adherence to a GFD and malignant complications of CD were rare, accounting, respectively, for only 4% and 2% of all events. Although functional gastrointestinal disorders are not life-threatening conditions, their impact on patients’ quality of life is burdensome [26, 27]. Their association with CD has been known for a long time [18, 28, 29]; however, mechanisms underlying functional symptoms in celiac patients on a long-term GFD are still unclear [12,13,14,15,16,17,18, 30,31,32,33]. Based on our results, the high prevalence of functional gastrointestinal disorders, gastroesophageal reflux disease and micronutrient deficiencies while on a GFD suggest a possible role for dietary quality rebalancing, as crucial intervention, in addition to maintaining a strict lifelong GFD adherence. Recently, it has been reported that gene expression in the small intestine of treated celiac patients differed from healthy controls, particularly for genes involved in the transport of micronutrients [34, 35], therefore this may have implications for nutritional supplementation, especially after diagnosis.

Secondly, one third of patients had at least one clinically relevant event in the first five years after diagnosis. Just under 15% of patients without previous events subsequently developed events between five and ten years from diagnosis. Very few developed events after 10 years. This suggests that strict follow-up immediately after diagnosis is necessary, and then it can be organized on a case-by case basis. We have shown that age (> 45 years old) and classical pattern at diagnosis of CD were independent predictors of CREs. These parameters may be used to stratify patients into three subgroups according to their risk of developing CREs during follow-up, regardless of severity or etiological relationship with CD. These results complete our previous findings that age at diagnosis and clinical pattern of CD were risk factors for the development of malignant complications of CD [11].

Currently, international guidelines recommend regular follow-up in adult CD, but timing and modalities of follow-up are not standardised [1,2,3]. Implications for clinical practice of our results include the development of personalized and cost-effective modalities for the follow-up of adult celiac patients, including timing of follow-up, role of healthcare practitioners, and decentralization of care for those at low risk. We believe a possible cost-effective proposal may be as follows: (1) for patients diagnosed at age ≥ 45 years with classical symptoms maintenance of regular annual follow-up specialist medical consultation; (2) for patients diagnosed at age < 45 years with non-classical/silent presentation annual specialist medical consultation for the first two years since diagnosis and subsequent discharge to general practice or dietitians with referrals as necessary; (3) annual follow-up for the first five years and then on a case-by-case basis in the remaining patients. These last two scenarios account for the vast majority of celiac patients in our cohort.

This study has some limitations, which are predominantly related to the retrospective and single-centre design in a referral center, limited sample size and lack of standardized methods to retrospectively assess and categorize the heterogeneous symptoms, which develop in celiac patients. The referral centre setting may limit generalizability of our results, given the lack of data on patient outcomes in other settings such as general practice and other secondary care settings. However, it is extremely difficult to obtain data on celiac patients followed-up in these settings.

We did not find adherence to a GFD or persistence of VA on follow-up duodenal biopsy as predictors of CREs. In our cohort, only a minority of patients had clinical events attributable to poor adherence to a GFD. This is in contrast with the current literature, which reports voluntary/involuntary dietary lapses as the most common cause of NRCD (persistent/recurrent symptoms and/or villous atrophy) [12,13,14,15,16,17,18]. Reasons behind this may include discrepancies in methods for assessing GFD adherence, cultural differences due to the culinary and social background and strict instruction received by our patients on how to maintain lifelong rigorous dietary adherence [23, 24].

It should be noted that although persistence of VA at follow-up duodenal biopsy did not reach significance on multivariate analysis, this may be due to our limited sample size. Nonetheless, the literature provides discordant data on the relationship between persistent VA during follow-up and risk of poor long-term outcomes in celiac patients [16, 36, 37].

In conclusion, this study has delineated the natural history, etiologies and predictors of CREs in patients with CD on a long-term GFD and provided a proposal for a cost-effective optimization of follow-up care of these patients. Although requiring a confirmation on larger sample sizes, and integration with emerging methods for implementing adherence to a GFD such as gluten peptides [38], we hope these results can help clinicians to deliver the best quality of care to celiac patients. Finally, our results may potentially be relevant in selecting patients for no-biopsy diagnostic strategies for adult CD, considering the growing interest in this approach [5, 6].