Introduction

Endometriosis is a chronic gynecological condition affecting 6%-10% of women of reproductive age [1]. Histologically, endometriosis corresponds to the dissemination of endometrial-like tissue, or lesions, outside the uterus. The reference method to diagnose endometriosis is surgery, through lesion visualisation and anatomical pathology evaluation. Endometriosis staging is currently based on surgeons' observations. The most widely used scoring is from the American Society for Reproductive Medicine (ASRM) that ranges endometriosis from stage I “minimal” to stage IV “severe”, based on lesion localisation, size, appearance and presence of adhesions [2]. Currently, patient management practices promote non-invasive diagnostic methods such as transvaginal ultrasonography and magnetic resonance imaging (MRI) [3]. Not all staging features are accessible with non-invasive diagnostic approaches, which makes the ASRM classification difficult to use in this context. An alternative classification defines three phenotypes: superficial peritoneal lesions (located less than 5 mm below the peritoneum), ovarian endometriomas, and deep infiltrating endometriosis (located more than 5 mm below the peritoneum). The last two can usually be detected by imaging.

Clinically, endometriosis is associated with a wide range of symptoms and consequences: pelvic pain, severe pain during periods (dysmenorrhea), painful sex, painful urination (dysuria) and/or defecation (dyschesia), alternance of diarrhea/constipation, heavy menstrual bleeding, mood disorders, chronic fatigue and infertility [4]. Endometriosis is increasingly considered as a systemic disease rather than a pelvic pathology [5].

Because of its complex pathophysiology, symptoms heterogeneity, and diagnostic requirements, diagnosis delay for endometriosis ranges from 4 to 11 years [5]. Biomarker candidate research is therefore a key avenue to improve diagnosis. Despite extensive efforts, no single or combination of biomarkers has reached clinical validation for endometriosis, as extensively reviewed in Cochrane’s reviews in 2016 [6,7,8,9]. While dozens of potential biomarkers have been investigated across varied biological compartments, few have been validated in independent studies, making their relevance unclear. To address this, we propose an original integrative review of the endometriosis biomarker literature across biological compartments. Our hypothesis is that endometriosis biomarkers recurrently identified across multiple tissues may be particularly relevant and play a more direct role in disease physiopathology, and an integrative multi-tissue approach could highlight and prioritize these candidates, which may eventually lead to enhanced patient care. For all considered studies, we highlight if endometriosis subtypes, menstrual cycle phases, treatments and symptoms were accounted for. We particularly focus on biomarkers reproducibly detected by at least two independent research teams, and found in different biological compartments.

Materials and methods

Literature search

Pubmed and Embase (excluding Medline articles) were searched for English-language articles as follows: Endometriosis in the title AND ‘biomarkers’ as Medical Subject Heading (MeSH) or All Fields terms. The strategy was designed in association with the referral Inter-University Library of Medicine of Université Paris Cité, France. All articles involving human subjects and published between 2005/01/01 and 2022/09/01 were selected for screening. The databases were last consulted on 30 September 2022. The review was conducted in accordance with The PRISMA 2020 statement for systematic review and not registered [10]. PRISMA 2020 Checklist is included in Additional Material.

Eligibility criteria and study selection

The study selection strategy is summarized as a flowchart in Fig. 1. Both clinical and basic research studies were considered. A total of 879 original publications were manually screened. At each step of the selection process, titles/abstracts/methods or full-text were screened by 2 independent reviewers (A.B. and one of the other co-authors) regarding eligibility criteria. Discrepancies concerning studies inclusion were resolved by A.B. and L.D. All review articles, editorial, animal subject studies and publications not written in English were excluded. Articles with the two following mandatory inclusion criteria were selected for further evaluation: i) presence of a control group without endometriosis and without malignant diseases, and ii) available information on endometriosis phenotype(s). For the remaining 447 articles, the full text was screened to extract the biomarkers studied, the significance and the direction of the variation observed between control and endometriosis groups and the biological compartment in which these changes were found. Only biomarkers significantly deregulated in endometriosis were considered. Biomarkers deregulated in ectopic endometrium only, and articles focusing exclusively on ectopic endometrium were excluded from this review because ectopic endometrium has no equivalent tissue in the control group. At this step, we obtained a list of 387 articles with 1107 significantly deregulated biomarkers in endometriosis and affected biological compartment(s). This list from the remaining 387 publications, validated independently by 2 reviewers, was reviewed a third time to extract information and/or adjustments for disease phenotypes, menstrual cycle phases, treatments and symptoms, which represent parameters of major importance in endometriosis. A list of candidate biomarkers per compartment was also created. Identified biomarkers and their instances in each compartment were tallied using a custom Perl script identifying unique character strings (case insensitive). To focus on sustained multi-tissue evidence, we selected articles related to the biomarkers identified by at least 2 different research teams (regardless of biological compartment) and across at least 3 biological compartments. For each candidate biomarker, at least one study per compartment including 30 or more controls and 30 or more patients with endometriosis was mandatory. All extracted data were analysed descriptively.

Fig. 1
figure 1

PRISMA flowchart for the systematic review. Flowchart highlighting the different steps in the selection of articles included in the review, giving details of the inclusion and exclusion criteria and the analyses carried out

Data extraction and analysis

The data related to our final set of candidate biomarkers of interest in this review were extracted from 18 articles (see flowchart in Fig. 1). For each article, the following data were extracted by 2 independent reviewers: the quality of the control groups and the homogeneity of the confounding factors between the groups, the level of expression (mRNA, protein) and the direction and amplitude of variation of the candidate biomarker, the adjustments of the results according to the subtypes of endometriosis, the phase of the menstrual cycle, the treatments and the symptoms and the ROC curve analysis to highlight diagnostic accuracy if available.

Assessment of risk of bias

Due to the great disparity in study designs (clinical and basic research) available for each candidate biomarkers of interest, we did not use ROC curve analysis as a selection criterion to address robustness, but instead included the presence of at least one study per biological compartment including at least 30 individuals per group as a mandatory criterion. For the creation of lists of candidate biomarkers by biological compartments, to avoid computing different aliases of the same gene/protein as different biomarkers, all identified candidate biomarkers were manually curated to unify writing styles and conventions before processing. Aliases were identified through the HUGO Gene Nomenclature Committee (HGNC) Multi symbol checker tool (https://www.genenames.org/tools/multi-symbol-checker/), replacing 76 aliases by their official gene symbols.

Results

Extensive but heterogeneous studies have explored potential endometriosis biomarkers

We systematically searched the PubMed and Embase databases for research articles on endometriosis biomarkers published between January 2005 and September 2022. Of the 879 publications retrieved after exclusion of duplicates, 278 focused on comparisons i) among endometriosis patients or between tissues in endometriosis patients or ii) between endometriosis and cancer patients (Fig. 1). We excluded these articles since their usability for endometriosis diagnosis is limited. 270 articles did not include information on the patient endometriosis phenotypes according to either the rARSM classification or lesions localization (Fig. 1). These articles were also excluded, as we chose to study how each candidate biomarker was potentially relevant to specific subtypes of patients. In total, we retained 447 publications for further analysis (Additional Table 1), of which 387 identified at least one biomarker with significantly modified levels in endometriosis patients compared to controls (Fig. 1).

Cycle phase, treatments and symptoms are rarely accounted or adjusted for

Information on endometriosis phenotypes was a mandatory inclusion criterion in this study, and 73% of selected publications adjusted the results accordingly, either intentionally or indirectly by including only a particular phenotype (Fig. 2). Biomarker levels can vary with menstrual cycle phases [11], but only 47% of selected publications provided information about cycle phase. Just over half took this parameter into account when analysing the results (Fig. 2), and mainly because surgical teams operated on patients either in the follicular phase or in the luteal phase. Regarding treatments, 42% of articles provided some information (Fig. 2). Non-use of hormonal treatments in the 3 to 6 months prior to inclusion is often specified (and sometimes non-use of anti-inflammatory drugs in the days before inclusion), explaining why only 3% of the analysed publications adjust results for hormonal or symptomatic treatments (anti-inflammatory, painkillers, etc.; Fig. 2). Although endometriosis symptoms are very diverse, this aspect is the least documented, with only 32% of analysed articles taking symptoms into account (Fig. 2). Of these, 17% reported subgroup analyses depending on symptoms (Fig. 2). Infertility, a consequence of endometriosis often regarded as a symptom, was the most commonly considered.

Fig. 2
figure 2

Intermediate analyses carried out on 387 articles. Analyses performed i) on the adjustments of the results according to the subtype of endometriosis, the menstrual cycle phases, the treatments and the symptoms and ii) on the different biological compartments studied

One thousand one hundred seven biomarkers identified across nine unequally studied biological compartments

Among 447 retained publications, 387 identified a significant biomarker in at least one biological compartment (Fig. 1 and 2). The majority studied peripheral blood (183 articles, 47,3%) (Fig. 2, Additional Table 2). Other studied compartments were eutopic endometrium (93 articles, 24%), peritoneum and peritoneal fluid (74 articles, 19,1%), ovary (mainly follicular fluid or cumulus cells; 21 articles, 5,4%), urine (12 articles, 3,1%), menstrual blood (2 articles, 0,5%), saliva (2 articles, 0,5%), feces (1 article, 0,3%) and cervical mucus (1 article, 0,3%) (Fig. 2). 444 biomarkers were identified in peripheral blood, 408 in eutopic endometrium, 188 in peritoneum and peritoneal fluid, 96 in compartments related to ovary, 77 in urine, 23 in menstrual blood, 13 in saliva, 6 in feces and 1 in cervical mucus (Additional Table 2). In total, we listed 1107 candidate biomarkers of endometriosis, several of which were identified in different compartments. Interestingly, only a minority were reproducibly found in independent articles within the same compartment (Additional Table 2), questioning the standardisation and reproducibility of endometriosis candidate biomarker studies. Moreover, only 74 were found in several biological compartments by at least two independent research teams (Table 1). These 74 biomarkers significantly modified in endometriosis were classified into molecular subtypes (Table 1) and used for the following selections. The names of the 74 candidate biomarkers identified, their direction of variation in the different biological compartments studied and the cohort sizes of the selected articles are detailed in Table 1.

Table 1 Biomarkers found in different biological compartments by independent teams

Cohort sizes are relatively small except for peripheral blood

The majority of selected studies included between 10 and 50 women in each group with and without endometriosis (Fig. 3). Unsurprisingly, all studies including more than 500 women per group focused on peripheral blood. Biological compartments are also very unequally studied in cohorts with 100 to 500 women per group: 84.3% studied peripheral blood, 8.6% eutopic endometrium, 5.7% peritoneal fluid and 1.4% follicular fluid. Biological compartments that can be assessed non-invasively (urine, menstrual blood, saliva, feces, cervical mucus), have not been studied on a large scale.

Fig. 3
figure 3

Cohort size distribution. Analyses performed among the 387 articles kept and illustrating the distribution of studies according to cohort size

Only 4 reproducible biomarkers have been consistently detected across tissues in large cohorts

We highlighted 4 candidate biomarkers identified by at least two different research teams in 3 biological compartments or more, with at least one well-powered study per compartment including 30 or more controls and 30 or more patients with endometriosis (Table 2). Here, we summarized the main lines of evidence supporting each of these 4 biomarkers as potential diagnostic elements for endometriosis.

Table 2 Candidate biomarkers found by independent teams in at least 3 biological compartments in large cohort

TNF-α

Tumor necrosis factor alpha (TNF-α), a pro-inflammatory cytokine, was consistently reported as increased in larger cohorts of women with endometriosis in two biological compartments: peritoneal fluid [23, 135], and follicular fluid [34]. In the peritoneal fluid, this increase was further consolidated by consistent results from smaller cohorts [17, 31, 32], although two other studies found no significant differences between women with and without endometriosis [21, 51]. How TNF-α changes in peritoneal fluid tie in with endometriosis phenotypes and menstrual phases was unclear, with reported increases in both stages I/II and III/IV [32], only in stage III/IV with no difference between proliferative and secretory phases, or only in patients with endometrioma and in proliferative phase [135] – some of which may reflect inappropriate statistical power to control for Type II errors when stratifying cohorts. Another increase was also detected in the endometrium at the mRNA level during menstrual phase [24]. While TNF-α was also reported as modified in blood, results were inconsistent, reporting increases [13, 29], decreases [12, 30] or no change [32, 39, 44, 60, 67], both within small or large cohorts. When focusing on large cohorts, increases were observed in serum while the decreases were in plasma, suggesting an importance on the blood collection method (Table 2). Unfortunately, diagnostic accuracy of TNFα was only assessed in blood and yielded low specificity and sensitivity, which is unsurprising in light of the discrepancies across studies.

MMP-9 or MMP-9/NGAL

Enzyme matrix metalloproteinase (MMPs), including MMP-9, are involved in extracellular matrix remodeling via proteolytic activity. They play a key role in physiological (like embryogenesis and wound healing) and pathophysiological (invasion and tissue destruction mechanisms) uterine processes [80]. In this review, we observed that MMP-9 levels appeared to be increased in endometriosis in all studies and regardless of the biological compartment studied. Interestingly, fertility status and menstrual cycle phases do not seem to affect the variations of this biomarker [79,80,81,82, 136]. Although still to be confirmed, the diagnostic value of this biomarker seems to be appropriate [79, 82]. We note that a therapeutic approach to reduce MMP-9 level through progesterone supplementation to improve IVF success rates in endometriosis patients showed promising success, suggesting that MMP-9 may have treatment as well as diagnosis value in endometriosis [81].

TIMP-1

TIMP-1, a metalloproteinase inhibitor, is involved in extracellular matrix remodeling which is particularly intense in ovary during follicular development and cyst formation and in endometrium during dynamic cyclic changes across the menstrual cycle [81]. TIMP-1 showed inconsistent regulation between different biological compartments in women with endometriosis, with reported decreases in blood and ovarian tissue and an increase in peritoneal fluid across well-powered cohorts [22, 81]. Although this remains to be confirmed, this candidate biomarker does not appear discriminative for disease stage and fertility status, but seems impacted by menstrual cycle phases [22].

miR451/miR451a

MiRNAs are small endogenous noncoding functional RNAs [122]. As they are released into the circulation, their interest as biomarkers has been the subject of numerous studies, specific miRNA expression patterns are hallmarks for numerous diseases [122, 123]. These associations between miRNA expression profiles and diseases are often obtained by non-targeted screening (microarray or miRNome sequencing), and their mechanistic roles in physiology or pathophysiology are poorly studied. miR-451 seemed to perform well as a diagnostic marker of endometriosis across different biological fluids and study settings, particularly in combination with other miRNAs [123, 124]. This biomarker does not seem to be correlated with endometriosis severity [22, 123]. Understanding the observed discrepancies in the direction of variation will require studies with systematic adjustments for disease severity, menstrual cycle phases, treatments, symptoms, clinical characteristics of the cohorts and associated comorbidities.

Discussion

Endometriosis biomarker research investigated a large variety of biological compartments so far, some of which are relevant to the local mechanisms of endometriosis pathophysiology (eutopic endometrium, peritoneal fluid, ovary, menstrual blood, cervical mucus), while others approached endometriosis as a systemic disorder resulting in body-wide dysregulations (peripheral blood, urine, feces, saliva). A first valuable outcome of our review was that non-invasively accessible biological compartments (urine, menstrual blood, feces, saliva, cervical mucus) remained drastically understudied despite their potential to transform endometriosis diagnosis. These compartments can address disease modifications both at the systematic and local levels, and deserve more focused attention in the future.

We identified a total of 1107 candidate biomarkers across all nine studied biological compartments, suggesting that endometriosis is potentially associated with widespread molecular modifications. However, agreement between studies, protocols and laboratories was strikingly low, with few candidate markers consistently modified within the same compartment and exhibiting similar directions of change. This suggests that many reported candidate biomarkers were either highly dependent on technical considerations, or represent false positives due to unaccounted confounders. Only 4 of these candidates were reproducibly detected across several compartments by different research teams and with appropriately powered cohorts, and we argued that these markers with widespread modifications should be first-line candidates for investigation in more accessible biological compartments.

As highlighted in this review, the relationships between marker variations and biological compartments were often obscured by uneven consideration of fundamental variables such as disease stage, symptoms, treatments, and menstrual cycle phase. Symptoms and treatments were the major missing elements in many study designs. These variables were rarely analyzed and often absent altogether. Regarding symptomatic treatments (painkillers, anti-inflammatory drugs), neither their effectiveness nor their frequency of use was reported. Menstrual cycle phases were also key variables as many metabolic and regulatory pathways vary throughout the cycle, including one-carbon metabolism [137] and miRNAs [128, 138], but were frequently overlooked. Most articles used the revised ASRM classification to rank phenotypes from endometriosis stage I (minimal) to stage IV (severe) [2]. Deep infiltrating endometriosis is then classified as stage III or IV regardless of the presence of endometrioma. However, presence of endometriomas seems decisive for some biomarkers regulation, especially metabolites. A more accurate classification like ERZIAN scoring may allow for better discrimination between different disease phenotypes [2]. All these parameters may contribute to explain the lack of reproducibility between studies, and standardizing data records may help alleviate this issue in future studies.

The top 4 candidate biomarkers of interest identified here belong to different molecular categories (miRNAs, extra-cellular matrix, and cytokines), and are involved in pathophysiological processes common to many diseases, especially extra-cellular matrix remodeling and inflammation. Previous studies have generally combined elements of the same molecular category together [122, 123], but combinations involving different molecular families are more rarely studied [30]. The 4 candidate biomarkers identified in this work were present in blood, an accessible and relevant biological compartment for diagnostic test development. They have never been combined together to test their diagnostic performance in endometriosis, but their association should be evaluated. Comparing sensitivity and specificity across studies to identify potential combinations of markers of interest remains difficult, as designs and cut-offs varied between studies and between biological compartments. We noted that formal meta-analyses of endometriosis biomarkers were largely absent, and will likely remain challenging due to the heterogeneity in study designs and data collection records that we highlighted above, limiting the reusability of available information.

Finally, this systematic study also came with some limitations. First, and despite our best efforts, we may have missed biomarkers that meet our selection criteria but are listed under different aliases during manual literature curation. We however expect that these instances were rare and did not affect the overarching conclusions of this study. Another important limiting factor was the design of the selected studies, which typically excluded rather than accounted for stratifying parameters of interest. Most of the highlighted adjustments were by exclusion of other categories of patients, for example by including only a single endometriosis phenotype in the cohort, or enrolling women in the same phase of the cycle. In this context, rigorously assessing the impact of adjustment and the differential effects of endometriosis subtypes, cycle phases, symptoms and treatments on biomarker levels remained challenging. Another potential source of bias was the heterogeneity of the control groups, a problem widely recognized for endometriosis research. Indeed, supposedly healthy donors may contain asymptomatic endometriosis patients, while most laparoscopically examined controls with confirmed absence of endometriosis had other gynecological or fertility issues. In most cases, these women presented benign comorbidities (e.g. leiomyomas, ovarian cysts) which were not matched with the case group and may impact the levels of certain markers. While these markers were also of interest to eliminate other diseases during endometriosis diagnosis, they addressed a separate question compared to diagnosing endometriosis at large in the population. A final issue that may interfere with reproducibility concerned the methodology of the studies. At this time, few untargeted studies with large discovery and validation cohorts used omics technologies for high-throughput biomarker discovery. The majority of studies focused on a limited panel of predefined targets and many potential biomarkers were therefore not evaluated. We chose to focus on biomarkers reported by independent research teams and in multiple tissues to improve the relevance and the strength of evidence, but numerous biomarkers were probably unconfirmed because they have not been evaluated so far by independent team and in several compartments.

Conclusion

It appears necessary to rethink endometriosis candidate biomarkers research by designing studies that can be integrated at different levels: i) local and systemic biological compartments; ii) different disease phenotypes with improved characterisation; iii) treatments and their impacts; iv) symptoms; and v) menstrual cycle phases. Access to these parameters will require harmonisation of data collection methods following recommendations of the EPHect project [139]. Such harmonisation would enable meta-analyses, yield a considerable increase in cohort sizes, and facilitate investigations into the effects of these stratifying variables. As endometriosis biomarker discovery remains challenging, sensitivity may be improved by combining biomarkers from different molecular pathways. However, combining biomarkers across biological compartments seems unsustainable in clinical practice, and identifying the most relevant biological compartment remains an important challenge. To this regard, our study pinpoints numerous discrepancies in the results obtained in peripheral blood. Local approaches may lead to more consistent results, as is the case in peritoneal fluid, which can unfortunately not be assessed non-invasively. We therefore highlight the need to further investigate non-invasively accessible biological fluids, especially locally accessible such as menstrual fluid or cervical mucus.