Introduction

Invasive mold infections (IMI) remain associated with high mortality rates, with a 1-year mortality of 32% (even higher in certain populations) [1,2,3], requiring early diagnosis and prompt initiation of appropriate treatment to increase odds of survival [4]. The early diagnosis of invasive aspergillosis (IA), the most frequent IMI, is a challenge, with signs and symptoms being either unspecific or appearing at a very late disease phase. Histopathological identification of fungal elements within a typical tissue reaction remains the gold-standard method, but the material is rarely accessible, and fungal culture form respiratory specimens has a low sensitivity [5]. Non-culture-based fungal biomarkers such as the detection of galactomannan (GM) or other Aspergillus antigens, beta-D-glucan (BDG) and molecular diagnostic tests such as polymerase chain reaction (PCR) and even next generation sequencing are now used as a diagnostic mycological criteria, often in combination [6]. However, their availability varies between geographical regions [7]. In addition, these tests are far from perfect and present several limitations, including the possibility of false positive and false negative results, and the low sensitivity in patients receiving mold-active prophylaxis or treatment [8, 9]. Performance can also vary depending on the tested matrix or on the immunological status of the patients [10]. The predictive value and performance of the currently available biomarkers and tests for early diagnosis of IA may be enhanced by combination with sensitive and specific immunological markers [11].

Cytokines and chemokines which are centrally involved in protective immunity against Aspergillus species (spp.) and other molds [12, 13] have been studied singularly or in combination as diagnostic markers for invasive fungal infections [14]. In addition, some studies suggest that increased or decreased concentrations of certain cytokines or chemokines in serum are associated with an heightened risk of IA, so they could be used as a trigger to initiate diagnostic investigation or as potential biomarkers for an early antifungal treatment [15]. Yet, the early predictive potential of immunological markers has not been investigated in detail, and most of the studies often focused on the role of specific cytokines or chemokines leading to bias and missing potentially relevant mechanisms involved.

Therefore, in this study, we set out to systematically explore the early predictive role of immunological serum biomarkers in patients with IA by (i) investigating the proteomic profile of a cohort of individuals not receiving anti-mold prophylaxis, and (ii) using an unbiased approach without pinpointing to preselected cytokines.

Methods

This retrospective case–control study from the European Conference of Medical Mycology (ECMM) Immunologic Markers for Treatment Monitoring and Diagnosis in Invasive Mold Infection Working Group aimed to identify immunologic markers for early diagnosis of IA in a cohort of hematology patients naïve to mold-active therapy, of which serial serum samples were available.

Study Design, Participants, and Sample Collection

For the discovery cohort, we retrospectively evaluated longitudinal serum samples from 33 cases with proven or probable IA and 66 matched controls from two matched control cohorts (an infected and a non-infected cohort) without IMI. Patients were classified as per EORTC-MSGERC consensus definitions [6]. Patients with proven or probable IA were selected as cases. High-risk hematology patients (with underlying hematological malignancy, of which some underwent an allogeneic hematopoietic cell transplantation) that underwent a diagnostic workup for clinically suspected invasive fungal infection, but never received anti-mold therapy due to lack of supportive evidence but had microbiological and clinical evidence of bacterial or viral non-fungal infection were considered as “infected controls”. High-risk hematology patients without evidence of infection and without fever were considered as “non-infected controls”.

We also obtained samples from an independent cohort of 20 cases of proven or probable IA and 20 matched controls without evidence of infection, all with underlying hematological malignancies.

All controls were matched in terms of presence of neutropenia, primary underlying disease, and treatment with corticosteroids and/or other immunosuppressants within a week of inclusion. Prior to the diagnosis, none of the patients received anti-mold active prophylaxis.

Samples were collected between April 2017 and April 2021 at the Belgian National Reference Centre for Mycosis (University Hospitals Leuven, Leuven, Belgium). More details on the cohort and a flux diagram of the sample collection is provided in the supplementary material (Online Supplement and Supplementary Fig. 1).

For the cases, samples were selected as close as possible to the date of the test that confirmed diagnosis: e.g., date of positive culture, date of confirming Aspergillus PCR, or date of serum or BAL fluid GM that triggered the start of treatment, called the “diagnostic sample” (Timepoint 3). In addition, we selected two serum samples obtained more than 10 days before (Timepoint 1) and two serum samples obtained within 10 days (Timepoint 2) before this diagnostic sample. For the “infected controls”, the sample that was drawn on the day that the patient had one of the following events (in descending order): a bronchoscopy, a high-resolution CT scan of the chest or a fever greater than 38.2 °C, was selected as day 0, and two serum samples in time before this day 0. For the “non-infected controls”, serial samples were selected from an at-random 4-week period during their immunocompromised timeframe within 6 months after the start of an intensive chemotherapy schedule or a hematopoietic stem cell transplantation.

This prospective, noninterventional, systematic sampling was approved by the institutional ethics committee under the number S61797. All patients provided written informed consent before inclusion.

Proteomic Analysis

Samples were analyzed at the Radboud University Medical Centre, Nijmegen, The Netherlands. A panel of 92 inflammatory markers was measured in serum samples using proximity extension assay (Olink Proteomics AB, Uppsala, Sweden). The OLINK Inflammation Panel includes proteins that are crucial for anti-fungal host immune response: Interleukin(IL)-4, IL-6, IL-8, IL-10, IL-15, IL-17A, IL-22, IL-1β, IL-1 receptor antagonist (IL-1Ra), soluble IL-2 receptor (sIL-2r), TNF, IFN γ, and RANTES (chemokine ligand 5) (for the complete list: https://www.olink.com/products/target/inflammation/) through a Proceek Multiplex proximity extension assay [16]. In this assay, the target proteins are recognized by antibody pairs coupled to cDNA strands. As soon as those probes come in proximity with the target protein the binding happens, enabling the probes to anneal and amplify during real-time polymerase reaction.

Internal controls were used to minimize variation within runs. Detected proteins were normalized and measured on a log2-scale as normalized protein expression value (NPX). The NPX value is different for each protein due to the sensitivity of each of the probes, with range and estimated inversion from NPX value to absolute amount (ng/μL) available on the Olink website (https://olink.com/faq/what-is-npx/). As part of the quality control, proteins detected in less than 75% of the samples and samples that deviated more than 0.3 NPX from the median were removed. Protein concentrations under the detection threshold were replaced with the protein’s lower limit of detection (LOD). NPX values were compared between cases and controls at three different time points: more than 10 days before diagnosis, less than 10 days before diagnosis and at the day of diagnosis.

Statistical Analysis

Protein levels of cases and controls were compared using a linear regression model using the Limma package [17] in R. Correction for multiple testing was applied by the Benjamini–Hochberg method, and a false discovery rate (FDR) < 0.05 was considered statistically significant. Based on the significant proteins, a random forest model was created in R using the pROC package and the AUC of the model was calculated. Wald and ANNOVA test were performed using the nparLD package in R using the f1.ld.f1 function [18]. In addition to these tests, we also tested a model with the hypothesis of no simple time effects. These analyses were restricted to the 48 individuals for which we had measurements for the three time points. All analyses were performed in R (version 4.0.2, R Foundation for Statistical Computing, Vienna, Austria).

Results

Demographics

Clinical and microbiological information on the 33 patients from the discovery cohort and the 20 patients from the independent cohort can be found in Supplementary Table 1. A summary of demographic characteristics of cases and control cohorts is reported in Supplementary Table 2. The independent cohort was associated with significantly higher rates of neutropenia (a neutrophil count < 500/µL for ≥ 10 days at time of diagnosis our temporally related) (85% versus 35%), systemic corticosteroid use (43% vs 12%), active GVHD (15% versus 2%), but lower rates of T-cell suppressive treatments (20% vs 38%) in compared to the discovery cohort (Supplementary Table 2).

To focus on early diagnosis, samples were stratified into three groups: samples obtained (a) more than 10 days before the diagnosis of IMI, (b) less than 10 days before diagnosis, and (c) at the time of diagnosis (± 1 day) (Fig. 1A; stratification of the validation cohort Fig. 1B).

Fig. 1
figure 1

Overview of the stratification of the samples into three groups: samples obtained (a) more than 10 days before the diagnosis of IMI, (b) less than 10 days before diagnosis, and (c) at the time of diagnosis (± 1 day) (Fig. 1A: stratification of the discovery cohort, Fig. 1B: stratification of the independent cohort)

Discovery Cohort

IA Cases Versus Non Infected Controls

In the differential abundance analysis of serum samples collected more than 10 days before diagnosis (timepoint 1), six proteins showed significantly higher concentrations in the cases group compared to the non-infected controls: IL-6, CCL20, IL-17C, CXCL1, CCL23, and CX3CL1. Nine proteins had lower concentrations: TRANCE, TWEAK, EN-RAGE, TRAIL, SCF, CD6, IL-12B, OSM, and CXCL5 (Fig. 2A). We did not observe any proteins significantly different between cases and non-infected controls in the group with less than 10 days before diagnosis (timepoint 2) after correcting for multiple testing (Fig. 2B). Abundance analysis on the day of diagnosis (timepoint 3) revealed 30 differentially regulated proteins, with a higher concentration of IL-8, CCL20, IL-6, CCL23, TNFS14, CXCL1, MMP-10, IL-18, MCP-3, CDCP1, CCL3, IL-18R1, IL-10, HGF, CD40, ADA, IL-17C, VEGFA, CXCL6, IL-17A, IL-10RB, LIF-R, IL-15RA, TNF, CSF-1, PD-L1, and CCL19, and a lower expression of SCF, TRAIL, and Tlt3L (Fig. 2C).

Fig. 2
figure 2

Discovery cohort: ATimepoint 1: Abundance analysis more than 10 days before diagnosis comparing cases with non-infected controls. Proteins with significant different concentrations are in red. Six proteins showed significantly higher concentrations: IL-6, CCL20, IL-17C, CXCL1, CCL23, and CX3CL1. Nine proteins had lower concentrations: TRANCE, TWEAK, EN-RAGE, TRAIL, SCF, CD6, IL-12B, OSM, and CXCL5. B Timepoint 2: Abundance analysis in serum from samples less than 10 days before diagnosis comparing cases with non-infected controls. No significant differences between cases and controls were found (dots in black not significant). C Timepoint 3: Abundance analysis at time of diagnosis comparing cases with non infected controls. 30 proteins were expressed significantly differently. There was a higher concentration of IL-8, CCL20, IL-6, CCL23, TNFS14, CXCL1, MMP-10, IL-18, MCP-3, CDCP1, CCL3, IL-18R1, IL-10, HGF, CD40, ADA, IL-17C, VEGFA, CXCL6, IL-17A, IL-10RB, LIF-R, IL-15RA, TNF, CSF-1, PD-L1, and CCL19, and a lower concentration of SCF, TRAIL, and Tlt3L

The concentrations of IL-17C were increased in the case group at all three time points compared to the non-infected controls. In longitudinal analysis, we observed a general difference in the global patterns of IL-17C between cases and non-infected controls (p-value = 0.0018 for Wald and ANOVA tests) and over time (p-value = 0.03 for Wald and p-value = 0.022 for ANOVA). Additionally, we observed significant differences over time within the case group (p-value = 0.04 and p-value = 0.025), but not within the non-infected control group (p-value = 0.24 and p-value = 0.21) (Fig. 3).

Fig. 3
figure 3

Trajectories over time of IL-17C in cases and controls. Timepoint 1: more than 10 days before the diagnosis of IMI, Timepoint 2: less than 10 days before diagnosis and Timepoint 3: at the time of diagnosis (± 1 day)

We then used the samples from cases and non-infected controls collected more than 10 days before diagnosis to create a machine learning algorithm including the four most deviant proteins from the primary cohort: TRANCE, TWEAK, EN-RAGE, and CCL20. The model was able to classify patients later diagnosed with IMI with 87% accuracy (reporting an error rate of 12.73%) (Supplementary Fig. 2). The protein trajectory over time of these four proteins can be found in Fig. 4.

Fig. 4
figure 4

Trajectory of the four most deviant proteins from the discovery cohort TRANCE, TWEAK, EN-RAGE, and CCL20 over time. Timepoint 1: more than 10 days before the diagnosis of IMI, Timepoint 2: less than 10 days before diagnosis and Timepoint 3: at the time of diagnosis (± 1 day)

IA Cases Versus Infected Controls

When correcting for multiple testing, the comparison between IA cases and infected controls showed no significant differences at any timepoint, including more than 10 days before diagnosis (Supplementary Fig. 3). Without correction for multiple testing, we found some proteins with nominal significant p-value: three for samples taken more than 10 days before diagnosis (TRANCE, ADA, and CST5), four for less than 10 days before diagnosis (CXCL10, IL-12B, HGF, and TNFB), and one at the time of diagnosis (IL-10RB).

Independent Cohort

We performed targeted testing to validate the findings from the discovery cohort in the independent cohort and we found that the increased abundance of IL-17C was replicated with a p-value = 0.024 in samples obtained more than 10 days before diagnosis (timepoint 1) (Fig. 5A). Out of 30 differentially regulated proteins found at time of diagnosis (timepoint 3) in the discovery cohort, nine were also replicated in this independent cohort: PD-L1, MMP-10, IL-10, IL-15RA, IL-18, IL-18R1, CDCP1, CCL19, and IL-17C being more abundant among the cases (unadjusted p-value < 0.05; adjusted p-value > 0.5) (Fig. 5B). The derived machine learning model could not be replicated in the independent cohort, as the accuracy for this cohort was 47% (error rate of 53%).

Fig. 5
figure 5

Independent cohort: A Timepoint 1: Abundance analysis more than 10 days before diagnosis comparing cases with non-infected controls from the independent cohort. B Timepoint 3: Abundance analysis at time of diagnosis comparing cases with non-infected controls from the independent cohort

Discussion

We systematically investigated the diagnostic role of immunological serum biomarkers in IA and found significant differences in 30 inflammatory proteins between patients with or without IA at the time of diagnosis, of which PD-L1, MMP-10, IL-10, IL-15RA, IL-18, IL-18R1, CDCP1, CCL19, and IL-17C were replicated in the independent cohort at the same timepoint. IL-17C was the only protein with a higher concentration at 10 days before the diagnosis of IA which was replicated in the independent cohort at the same timepoint. IL-17C therefore was the only “biomarker” that showed increased concentrations at both timepoints in both cohorts when comparing cases to non-infected controls.

Several layers of host barriers are responsible for the protection against the hundreds to thousands of conidia inhaled every day by humans [19] resulting in elevated levels of cytokines (details outlined in the online supplement). In our study, a higher circulating concentration of IL-17C was observed both in the discovery and the validation cohort, both at the time of diagnosis and in samples 10 days before the diagnosis of IA. IL-17C shares 27% amino acid homology with IL-17A, which is produced mainly by leukocytes (Th4 cells and neutrophils) and is crucial for a protective mucosal anti-fungal host response, with elevated levels in IMI reported before [20,21,22,23,24]. IL-17C specifically binds to the receptor IL-17RA/RE, a complex that is expressed on both epithelial and Th17 cells [25]. Several cell types, including lung epithelial cells, induce the production of IL-17C, which in turn, is responsible for the induction of the pro-inflammatory response at the site of the epithelial barrier and antibacterial functions [26]. IL-17C production activates expression of other proinflammatory cytokines and of chemokines such as CXCL1/2/3 and CCL20, which were also elevated in our cohort at time of diagnosis and 10 days before diagnosis [26]. It has been shown to enhance the expression of cytokines and chemokines in epithelial cells in an autocrine manner, resulting in increased pulmonary inflammation [27]. The role of IL-17C in IA or other mold infections remains still unknown.

IL-17C not only induces inflammation, but also promotes tissue healing. For instance, during oral, skin, and bloodstream candidiasis, mice lacking IL-17C are still able to clear C. albicans suggesting a dispensable protective role for this cytokine [28]. However, in another study, mice that lack IL-17C displayed an increased survival and attenuated kidney damage, although the fungal load was similar to the wild type, thus suggesting an immunopathological role for IL-17C during systemic Candida infection [29]. IL-17C was also involved in the immune response of respiratory epithelial cells: IL-17C heightened inflammatory responses of respiratory epithelial cells infected with Pseudomonas aeruginosa and was found in bronchial tissue of subjects with infection-related lung diseases [30]. Moreover, IL-17C exacerbates chronic obstructive pulmonary disease (COPD) through neutrophil recruitment via upregulation of CXCL1 [31]. CXCL1 expression was also higher in cases versus non-infected controls in our discovery cohort, significant at timepoint 1 and 3, but non-significant at timepoint 2.

To the best of our knowledge, our study is the first to mention a possible role of IL-17C in invasive mold infection: specifically, these data sets IL-17C as an attractive candidate for early biomarkers for detecting IMI already 10 days before the diagnosis. Moreover, this “early” feature of IL-17C that we found is not surprising, as it is expressed by epithelial cells during bacterial infections at earlier timepoints in disease than IL-17A [27, 32]. Therefore, the fact that IL-17C is already upregulated 10 days before patients develop IMI should not be seen as protective factor, since this cytokine might generate an autocrine loop of autoinflammation driving immunopathology. Further studies that specifically investigate the role of IL-17C on mold infections are warranted to pinpoint its pathophysiologic role during invasive fungal infections.

We also found that PD-L1 concentration was increased at time of diagnosis in both cohorts. The checkpoint protein PD-L1 is best known in the field of immunotherapy where immune checkpoint inhibitors are used for treatment of various cancer types. PD-L1 can be stimulated by IFN-γ and is expressed on macrophages. It promotes T-cell apoptosis, anergy and functional exhaustion [33]. Because immune checkpoint inhibitors restore T-cell response, they could also be active against fungal infections [34]. Previous studies showed that during C. albicans infection, PD-L1 is induced by presence of glucans and governs the mobilization of neutrophils through regulating their secretion of CXCL1 and CXCL2 (in this study CXCL1 concentration was also higher in the case group). PD-L1 expression in infected mice strongly correlated with fungal tissue burden and PD-L1 deficiency improved survival, suggesting a negative role of PD-L1 expression on the host immune response inhibiting neutrophil migration from the bone marrow into the infected sites [35, 36]. In mice with mucormycosis and in mice infected with Histoplasma capsulatum, inhibition of the PD-L1 pathway has improved clinical outcomes [37,38,39]. Information specifically for aspergillosis is, however, lacking. These literature findings suggest that PD-L1, instead of being a diagnostic biomarker, could be a marker for treatment decisions. As all high-risk patients included in our cohort had underlying haematological malignancies, knowing more about the possible anti-tumour and anti-fungal synergy of immune checkpoint inhibitors in these patients and its effect on the PD-L1 expression in their serum and BALf would be very interesting. Probably, more information on this will come in the future as trials with immune checkpoint inhibitors and hypomethylating drugs in in patients with AML and MDS are ongoing [34].

We also found that IL-10, IL-15RA, IL-18, and IL-18R1 concentrations were higher at the time of diagnosis in both cohorts. Increased concentrations of these interleukins have been described previously in mice and in patients with IA [13, 40,41,42]. There was no difference between cases and controls in the days before diagnosis, limiting their potential in early diagnosis or screening. Matrix metalloproteinase (MMP)-10, CCL19, and CDCP1 were found in higher concentrations at time of diagnosis in both the discovery and the independent cohort. However, little is known about their role in mold infections. MMPs play a role in regulating the immune response in inflammatory processes and regarding fungal infections this has been described in host immunity to pulmonary Cryptococcus infection [43, 44]. CCL19 plays a role in attracting immune cells to the site of inflammation or infection in general, and this has been shown in the presence of A. fumigatus as well [45, 46]. Less information specifically directed towards (fungal) infections is available for CDCP1 but it has for example been described in association with severe COVID-19 infection [47]. As concentrations of these proteins were not significantly different between uninfected and infected patients, and they are described in literature in many other types of infections (and even in autoimmune disease), one could suggest that these proteins are just markers of general immune activation.

As expected, in our discovery cohort, also IL-6 and IL-8 concentrations were higher in serum of cases than in non-infected controls at the time of diagnosis (IL-6 expression also at more than 10-days before diagnosis) as they are well known to be produced by alveolar macrophages and lung epithelial cells in elevated levels in serum and BALf of patients with IMI [11], which has been studied in vitro [48,49,50] and in clinical case–control studies [13, 20, 51]. Cytokines such as IL-6 and IL-8 have also been shown to be useful in treatment monitoring and outcome prediction of patients with IMI [52, 53]. However, in our study, a higher concentration of IL-6 and IL-8 was not replicated in the independent cohort. Reasons for these divergent findings could include differences in clinical characteristics, with the independent cohort showing a significantly higher rate of neutropenia compared to the discovery cohort, which may impact levels of these inflammatory markers. IL-6 and IL-8 were not significantly higher in serum samples around 10 days before the diagnosis of IMI, limiting their role in screening.

In this study we also identified a model to predict IMI more than 10 days before diagnosis in the discovery cohort. However, the model could not be replicated in the independent cohort, likely due to different clinical characteristics and a small sample size resulting in an underpowered independent cohort.

The major limitation of this study is the relatively small sample size, which is often the case in studies concerning IMI. It is also important to note the lack of significant differences between cases and infected controls, which could be due to the limitations discussed above (including the small sample size). On the other hand, this could also mean that the higher protein concentrations that we found in the comparison between cases and non-infected controls suggest a signal for infection and not specifically a signal for IMI, that could mainly trigger further diagnostic testing. Importantly not only fungal, but also bacterial, viral, and parasitic coinfections including mixed infections are a major threat in patients with underlying haematological malignancies, whereby there are important interactions, e.g. with bacterial or viral infections creating a favorable environment for fungal infections [54,55,56]. Even though being less specific for one particular causative pathogen, these early biomarkers could still have an important role in stratifying diagnostic testing in these high-risk patients, and in decisions upon prophylactic or preemptive therapy. Other limitations confer to the validation cohort, which was recruited at the same center as the derivation cohort and even more importantly significantly differed in a number underlying disease characteristics. Future studies in independent cohorts with clinical characteristics more similar to our discovery cohort are needed to further validate our findings and particularly the derived model.

Conclusion

Our results suggest significant differences in the inflammatory proteins between patients with IA and matched controls without infection at the time of diagnosis, but the prediction capacity significantly decreases in earlier samples taken before the diagnosis of the infection. We found in both our derivation and validation cohort that IL-17C may have potential to be a biomarker for early detection of IA; future studies should confirm our findings before the biomarker can be integrated into clinical routine. Future studies are also needed to validate our predictive model in at-risk patients and to evaluate the potential of inflammatory proteins for treatment stratification and outcome prediction in patients with IA.