Large-scale proteomics reveals precise biomarkers for detection of ovarian cancer in symptomatic women

Ivansson, Emma; Hedlund Lindberg, Julia; Stålberg, Karin; Sundfeldt, Karin; Gyllensten, Ulf; Enroth, Stefan

doi:10.1038/s41598-024-68249-2

Article
Open access
Published: 27 July 2024

Volume 14, article number 17288 (2024)

Scientific Reports

Emma Ivansson¹,
Julia Hedlund Lindberg¹,
Karin Stålberg²,
Karin Sundfeldt³,
Ulf Gyllensten¹ &
…
Stefan Enroth¹

Abstract

Ovarian cancer is the 8th most common cancer among women and has a 5-year survival of only 30–50%. While survival is close to 90% for stage I tumours, it is only 20% for stage IV. Current biomarkers are neither sensitive nor specific enough, and novel biomarkers are urgently needed. We used the Explore PEA technology for large-scale analysis of 2943 plasma proteins to search for new biomarkers using two independent clinical cohorts. The discovery analysis using the first cohort identified 296 proteins that had significantly different levels in malign tumours as compared to benign, and for 269 (91%) of these the association was replicated in the second cohort. Multivariate modelling, including all proteins independent of their association in the univariate analysis, identified a model for separating benign conditions from malign tumours (stage I–IV) consisting of three proteins: WFDC2, KRT19 and RBFOX3. This model achieved an AUC of 0.92 in the replication cohort and a sensitivity and specificity of 0.93 and 0.77 at a cut-off developed in the discovery cohort. There was no statistically significant difference in performance between the replication cohort and the discovery cohort. WFDC2 and KRT19 have previously been associated with ovarian cancer, but RBFOX3 has not previously been identified as a potential biomarker. Our results demonstrate that high-throughput precision proteomics can identify novel plasma protein biomarkers for ovarian cancer detection.

Introduction

Ovarian cancer 5-year survival ranges from 90% when the cancer is discovered in stage I to 20% in stage IV¹. This partly reflects that more aggressive subtypes tend to be diagnosed at later stages, but it has also been suggested that early detection could improve survival. Ovarian cancer is the 8th most common cancer among women today and kills over 200 thousand women per year worldwide². At present, the discovery of ovarian cancer is symptom-driven, and fewer than a third of the cases are discovered at stage I or II¹. A detailed understanding of the aetiology of ovarian cancer could help determine an optimized screening interval in relation to underlying processes, but the precursor states have not yet been precisely identified. For high-grade serous carcinomas, for instance, the presumed precursor state STIC (serous tubal intraepithelial carcinoma) can itself develop slowly over decades from the first occurrence of predisposing genetic mutations³. Recent molecular evidence obtained from patient material suggests that the transition from STIC to ovarian cancer can occur in a relatively short time-span, estimated at 6–7 years^4,5. Computational estimates based on tumour sizes and growth rates⁶ have indicated that ovarian cancer can exist for over 4 years in situ, or as stage I and II, before finally progressing to stages III and IV.

Today, no sufficiently accurate molecular test exists to justify population-based screening. The largest running prospective ovarian cancer screening study, UKCTOCS (United Kingdom Collaborative Trial of Ovarian Cancer Screening)⁷, has evaluated a multi-modal screening strategy for post-menopausal women. This strategy is based on a molecular indication of increased MUCIN-16 (Cancer antigen-125/CA125), which is then followed by a transvaginal ultrasound. Elevated MUCIN-16 was first introduced as an indication for ovarian cancer in 1983⁸ and remains the best single-value biomarker, used both for diagnosis in post-menopausal women and for treatment management⁹. A remaining difficulty with the multi-modal approach is the relatively low sensitivity of MUCIN-16, which means that many cancers are missed. In general, a high specificity can be achieved by analysis of additional biomarkers or by using transvaginal ultrasound (TVU)¹⁰, which can reduce false positives. Today, women who experience pelvic symptoms are typically examined using available molecular biomarker analysis, TVU or computed tomography, with surgery as the final tool for diagnosis. In Sweden, close to four out of five women who undergo surgery for adnexal tumours are diagnosed with benign cysts, not cancer¹¹, and accurate biomarkers are needed to triage symptomatic women and reduce unnecessary diagnostic surgery. MUCIN-16 as a single biomarker has low sensitivity for early-stage cancer and results in a high rate of false positive indications in many benign gynaecological conditions in younger women, such as infections, pregnancy, or endometriosis⁹. Combinations of MUCIN-16 and additional biomarkers such as the WAP Four-Disulfide Core Domain 2 (WFDC2 or HE4), as used in the ROMA-index (Risk of Ovarian Malignancy Algorithm), improve the accuracy.
The ROMA-index is calculated differently depending on menopausal status and was initially reported to have a sensitivity of 0.77 at a specificity of 0.75 in pre-menopausal women, while a sensitivity of 0.92 at a specificity of 0.75 can be achieved in post-menopausal women¹². Recent meta-analyses of the ROMA-index in both pre- and post-menopausal women indicate that the overall sensitivity of the test is in the range of 0.88 to 0.93, with a specificity in the range of 0.89 to 0.94¹³. In addition to MUCIN-16 and WFDC2, which are used in the ROMA-index, a number of studies have indicated that additional protein biomarkers can be informative for, e.g., triaging or early diagnosis of ovarian cancer. The OVA1-test combines five proteins (Apolipoprotein A1, Beta 2 microglobulin, CA125, Transferrin and Prealbumin/Transthyretin) and classifies women into categories of high, intermediate or low risk of ovarian cancer. It was recently evaluated in a multicentre study¹⁴, where a higher proportion of the individuals predicted to be low risk (i.e. benign) by the OVA1-test remained benign during a 12-month follow-up period than of those predicted to be low risk using CA125 alone. In a recent study¹⁵, we showed that combinations of 4 to 7 protein biomarkers selected from over 1450 plasma proteins outperform MUCIN-16 alone in detection of ovarian cancer in symptomatic women. A set of 7 proteins achieved a sensitivity of 0.91 at a specificity of 0.96 in separating benign tumours from malign tumours in stage I and II. This high accuracy was then replicated in an independent cohort. Notably, our data-driven approach for selecting biomarkers did not include MUCIN-16 among the 7 proteins, showing that a broad characterization of protein biomarker candidates without prior assumptions on inclusion can break new ground and go far past the current gold standard in molecular tests for early detection of ovarian cancer.

To examine this hypothesis further, we have characterized a larger number of plasma proteins in two independent Swedish cohorts consisting of women who underwent diagnostic surgery after suspicion of ovarian cancer and who were later diagnosed with either benign or malignant tumours.

Material and methods

Samples

Plasma samples of women with benign and malignant ovarian tumours were collected from either the U-CAN collection¹⁶ at Uppsala Biobank, Uppsala University, Sweden or the Gynaecology tumour biobank¹⁷ at Biobankvast.se, Western healthcare region, Göteborg, Sweden. Samples from the biobanks were included if the patient had a surgical ovarian cancer diagnosis or had been surgically diagnosed with a benign condition after suspicion of ovarian cancer. Patients were excluded if they had received neoadjuvant treatment prior to surgery or if the tumour was pathologically determined to be metastatic, originating from other tissues. The samples from U-CAN in Uppsala were collected between 2012 and 2018 in agreement with all local guidelines and regulations. The samples in the Gynaecology tumour biobank in Göteborg were collected from 2016 to 2018 in agreement with all local guidelines and regulations. The tumours were examined by a pathologist specialized in gynaecologic cancers for histology, grade, and stage according to International Federation of Gynaecology and Obstetrics (FIGO) standards. Both cohorts contained mixed tumour histology. In the U-CAN samples, among the samples with complete histology data, 60.1% were high grade serous (HGS), 8.7% low grade serous (LGS), 7.6% endometroid, 6.0% clear cell, 5.5% carcinosarcoma and the remainder mucinous, non-epithelial, endometroid or mixed. In the Göteborg samples, 70.6% were HGS, 8.2% LGS, 7.0% mucinous, and the remainder clear cell, endometroid, sarcoma, epithelial/clear cell, mucinous/teratoma or unclear histology. All samples were collected at time of diagnosis, from non-fasting, non-sedated patients by a trained nurse. Separated plasma was then frozen and stored at −70 °C on site. In total, 350 samples were used from the U-CAN collection and 171 from the Göteborg collection. Basic statistics for the samples used are presented in Table 1.
The study was approved by the Regional Ethics Committee in Uppsala (Dnr: 2016/145) and Göteborg (Dnr: 201–15) and informed written consent was obtained from all participants following the guidelines of the Declaration of Helsinki.

Table 1 Cohort characteristics.

Proteomics

The samples used here have previously been analysed with the proximity extension assay (PEA)¹⁸ Explore1536¹⁹ assay^15,20 and were here studied using the PEA Explore3072 Expansion assay. The samples were randomized across seven 96-well plates. In brief, the PEA is based on pairs of antibodies equipped with DNA single-strand oligonucleotide reporter molecules (probes) that bind to their respective target if present in the sample. Target binding by both probes in a pair in close proximity generates double-stranded DNA amplicons. The Olink Explore3072 Expansion assay is built upon four separate 384-plex panels focusing on Inflammation, Oncology, Cardiometabolic and Neurology proteins, corresponding to a total of 2943 unique human proteins, and the workflow has been described in detail before²¹. After the initial probe-based immune reaction step in the Explore workflow, the amplicons were extended and amplified in a two-step process. Individual sample index sequences were added during the second step. After this step the samples were pooled. Sequencing libraries were prepared and subsequently sequenced on a NovaSeq 6000 instrument (Illumina, USA) according to the manufacturer’s instructions. After sequencing, the generated BCL files were transformed into count files. The count files were then translated into normalized protein expression (NPX) values through a quality control (QC) and normalization process built around internal and external controls as specified by the manufacturer of the assay. The resulting NPX values are on a log2 scale, and in the logarithmic phase of the curve an increase of one (1) in the NPX value corresponds to a doubling of the protein content. In the resulting data, a high NPX value corresponds to a high protein concentration. Each of the measured proteins has a lower limit of detection (LOD), given on the same NPX scale, which is determined at run time.
Here, each protein measurement with NPX under the LOD was replaced with the plate-specific LOD as indicated in the provided result file. In total, 8.8% of the measurements in the Explore 1536 assay and 31.1% of the measurements in the Explore 3072 expansion assay were found to be under the LOD. All experimental methods were conducted in accordance with relevant guidelines and regulations.
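The two NPX conventions described above (log2 scale, and replacement of sub-LOD values with the plate-specific LOD) can be illustrated with a minimal Python sketch. The function names and the toy values are hypothetical and are not taken from the study data or from the assay manufacturer's software.

```python
def npx_fold_change(delta_npx: float) -> float:
    """Linear fold change implied by an NPX difference (NPX is on a log2 scale)."""
    return 2.0 ** delta_npx


def replace_below_lod(npx_values, lod):
    """Replace every NPX value under the plate-specific LOD with the LOD itself."""
    return [max(value, lod) for value in npx_values]


# Hypothetical NPX values for one protein on one plate, with a made-up LOD of 1.0.
plate_values = [0.4, 1.0, 2.5, 3.5]
cleaned = replace_below_lod(plate_values, lod=1.0)

print(cleaned)               # the value 0.4 is raised to the LOD
print(npx_fold_change(1.0))  # one NPX unit corresponds to a doubling
```

Because values under the LOD are set to the LOD rather than dropped, proteins with many sub-LOD measurements keep all samples but lose variation at the low end of the scale.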

Data analysis and statistics

All calculations were carried out using R²² (4.2.2). Univariate comparisons were done one protein at a time using a two-sided Wilcoxon rank-sum test. The resulting p-values were adjusted for multiple hypothesis testing using the Holm correction method as implemented in the ‘p.adjust’ R-function. For the multivariate analyses, a feature selection was first done using the training cohort only, based on recursive feature selection as implemented by the ‘rfe’ function in the ‘caret’ R-package²³ (version 6.0.91) using ‘nbFuncs’ as functions with method set to ‘repeatCV’ with 4 repeats. The feature selection was carried out by allowing combinations of 2 to 20 individual proteins. A Naïve Bayes model was then trained using the ‘caret’ R-package employing a four-fold cross-validation schema optimising the Laplace correction (‘fL’ parameter) from 0 to 1 in steps of 0.1, with and without kernel and bandwidth adjustment (‘adjust’ parameter) from 1 to 4 in steps of 0.1. The model returned a score in the range 0 to 1, and thresholds (cut-offs) for separating the classes were determined by evaluating the receiver operating characteristics (ROC) on the training cohort at a minimum sensitivity and/or specificity of 0.95 and at the ‘best point’, meaning the point on the ROC-curve closest (by Euclidean distance) to perfect classification. The model performance was evaluated separately in the training and the validation cohorts. No samples from the validation data were used in the training or in the optimization of any of the models. The performance obtained in the validation cohort was compared to the performance obtained in the training cohort based on the area under curve (AUC) statistics and the achieved sensitivity and specificity at the cut-off developed in the training cohort.
AUC-statistics were compared using DeLong’s test, and a two-sided Fisher’s Exact test on a 2 × 2 matrix with true or false negatives or positives was used to compare the obtained sensitivity and specificity between the training and validation cohort. Correlations between proteins were calculated with Spearman’s method using the ‘cor.test’-function. Beeswarm-plots were made using the R-package ‘beeswarm’ (0.4.0)²⁴. The literature search was conducted by searching PUBMED (https://pubmed.ncbi.nlm.nih.gov/) for “ovarian cancer” and (i) the protein short-name, (ii) the protein full name and (iii) the name of the encoding gene. All statistical analyses were conducted in accordance with relevant guidelines and regulations.
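As an illustration of the Holm step-down adjustment mentioned above, the following Python sketch reimplements the procedure from its textbook definition. It is not the authors' code (the study used R's ‘p.adjust’), and the function name and example p-values are hypothetical.

```python
def holm_adjust(p_values):
    """Holm step-down adjustment.

    The smallest p-value is multiplied by n, the next smallest by n - 1,
    and so on. A running maximum keeps the adjusted values monotone,
    and every adjusted value is capped at 1.
    """
    n = len(p_values)
    order = sorted(range(n), key=lambda i: p_values[i])
    adjusted = [0.0] * n
    running_max = 0.0
    for rank, index in enumerate(order):
        candidate = (n - rank) * p_values[index]
        running_max = max(running_max, candidate)
        adjusted[index] = min(1.0, running_max)
    return adjusted


# Hypothetical raw p-values for three proteins.
raw = [0.01, 0.04, 0.03]
print(holm_adjust(raw))
```

In this toy example the largest raw p-value (0.04) inherits the adjusted value of the one ranked before it, which is the effect of the running maximum.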

Results

Multiple single value biomarkers for early detection

A total of 521 samples from two separate clinical cohorts of women diagnosed with benign or malign tumours after suspicion of ovarian cancer were analysed using Explore¹⁹ PEA¹⁸. The first set of samples (Table 1, discovery cohort) was from a cohort collected in Göteborg, Sweden, and the second (Table 1, replication cohort) from the U-CAN biobank in Uppsala, Sweden. Here, 443 samples were analysed using the Olink Explore3072 expansion assay. Some of the samples have previously been characterized with the Olink Explore1536 assay, and results from these analyses have been published^15,20. Both panels (1536 and 3072) each contain 1536 assays. One hundred and seven (107) samples from the discovery cohort and 163 samples from the replication cohort had complete data from both analysis runs. In our previous analysis¹⁵ the Göteborg cohort was used as discovery cohort while the Uppsala cohort was used for replication, and the same assignment was used here. We first analysed the data from the two PEA Explore assays separately to maximize the statistical power in the univariate analysis. This included up to 111 and 237 samples from the discovery and replication cohorts, respectively, for the Explore1536 assay, and 167 and 276 samples from the discovery and replication cohorts for the Explore3072 assay. The univariate analyses were performed in three ways: benign vs early-stage ovarian cancer (stages I and II), benign vs late stage (stages III and IV) and finally benign vs any stage (I–IV). After adjustment for multiple-hypothesis testing in the discovery cohort, a total of 4 proteins were found to be significantly different between benign and early-stage cancer (Fig. 1A), 163 between benign and late-stage (Fig. 1B) and finally, 129 between benign and any stage (Fig. 1C). Several proteins differed significantly in more than one category of comparison, so these 296 associations involved 171 unique proteins in total.
We then analysed the 296 associations in the replication cohort and found that all 296 had fold-changes in the same direction as in the discovery cohort and that 279 (94.3%) of these were nominally significantly different. After adjustment for multiple-hypothesis testing in the replication cohort as well, 269 (90.9%) of the associations remained significant. All univariate results are presented in Supplementary Table 1. Here, MUCIN-16 was found to be among the biomarkers with replicated performance, but ranked as the 139th most significant association by p-value in the discovery data when comparing benign and any stage cancer. RBFOX3 and TCOF1 were the two proteins with the lowest p-values comparing benign with any stage cancer, and the distributions of NPX values for these proteins by stage, in both the discovery and replication cohorts, are shown in Fig. 1D and E. Next, we compared the performance of these two biomarkers as single-value classifiers for separating benign from any stage. In the replication cohort, RBFOX3 had an AUC (area under curve) of 0.85 (95% confidence interval 0.81–0.89, Fig. 1F) and TCOF1 had an AUC of 0.84 (0.80–0.88, Fig. 1G), while an AUC of 0.68 (0.61–0.75) was found for MUCIN-16 as measured by the PEA. In Fig. 1F and G, the selected biomarker is shown as a solid line while the performance of MUCIN-16 (as measured with the PEA) is shown as a dashed line. For both proteins, the AUC was significantly higher than for MUCIN-16 (all p-values < 1.1 × 10^–4, DeLong’s test). Using both cohorts, the clinical CA125 measurements were found to have a moderate (Spearman’s rho = 0.53) but significant (p < 2.8 × 10^–9) correlation with the PEA equivalent. Across both cohorts, the CA125 measurements achieved an AUC of 0.70 (0.56–0.85), 0.93 (0.87–0.98) and 0.87 (0.80–0.94) in separating benign from early stage (I and II), late stage (III and IV) and any stage (I–IV) tumours, respectively.
At the commonly used clinical cut-off of 35 U/ml, CA125 alone achieved sensitivities of 0.78 (0.61–0.94), 1.00 (1.00–1.00) and 0.94 (0.89–0.99) and specificities of 0.44 (0.31–0.59), 0.44 (0.28–0.59) and 0.44 (0.28–0.59), respectively, for the same three categories.
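The AUC, sensitivity and specificity figures reported above can be related to raw marker values with a short Python sketch. It uses the rank-based definition of the AUC (the probability that a randomly chosen malignant sample has a higher value than a randomly chosen benign one, ties counting one half). The function names and CA125-like values are invented for illustration, and calling a value positive when it is at or above the cut-off is an assumption rather than something stated in the article.

```python
def auc(malignant_values, benign_values):
    """Rank-based AUC: fraction of malignant/benign pairs ordered correctly."""
    wins = 0.0
    for m in malignant_values:
        for b in benign_values:
            if m > b:
                wins += 1.0
            elif m == b:
                wins += 0.5
    return wins / (len(malignant_values) * len(benign_values))


def sensitivity_specificity(malignant_values, benign_values, cutoff):
    """Sensitivity and specificity when values >= cutoff are called positive."""
    true_positives = sum(value >= cutoff for value in malignant_values)
    true_negatives = sum(value < cutoff for value in benign_values)
    return (true_positives / len(malignant_values),
            true_negatives / len(benign_values))


# Invented marker values in U/ml, not study data.
malignant = [20, 80, 150, 400]
benign = [10, 30, 40, 90]

print(auc(malignant, benign))
print(sensitivity_specificity(malignant, benign, cutoff=35))
```

The AUC summarises performance over all possible cut-offs, whereas the sensitivity and specificity pair describes a single operating point such as 35 U/ml.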

Combining biomarkers increases precision

We next built multivariate prediction models separating (i) benign vs early stages (I and II), (ii) benign vs late stages (III and IV) and (iii) benign vs any stage (I–IV). For all three models, we used the same methodology, where we first employed feature selection based on all proteins in the discovery cohort. The selected proteins were then used to build and optimize a classifier reporting a risk-score on the scale of 0 to 1. We then set a cut-off for predicting malignancy based on the risk-score, where we aimed to achieve at least 95% sensitivity. We then applied the model to the replication cohort, calculating the risk-score and applying the same cut-off, and evaluated the final performance of the models based on those scores only. In this analysis, the data from the two PEA assays (Explore 1536 and the 3072 expansion, see Methods for details) were merged, keeping only proteins and individuals with no missing values. This resulted in a final data set with 2934 proteins measured in both cohorts, 107 samples in the discovery cohort and 163 samples in the replication cohort.
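The cut-off procedure described above (fix the threshold in the discovery cohort at a sensitivity of at least 95%, then apply it unchanged to the replication cohort) can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' implementation: the function names and risk-scores are hypothetical, and scores at or above the cut-off are assumed to be predicted malign.

```python
def cutoff_for_sensitivity(malignant_scores, target=0.95):
    """Highest cut-off at which at least `target` of malignant scores are >= cut-off."""
    for candidate in sorted(set(malignant_scores), reverse=True):
        detected = sum(score >= candidate for score in malignant_scores)
        if detected / len(malignant_scores) >= target:
            return candidate
    raise ValueError("no malignant scores supplied")


def evaluate(malignant_scores, benign_scores, cutoff):
    """Sensitivity and specificity of a fixed cut-off in any cohort."""
    sensitivity = sum(s >= cutoff for s in malignant_scores) / len(malignant_scores)
    specificity = sum(s < cutoff for s in benign_scores) / len(benign_scores)
    return sensitivity, specificity


# Hypothetical risk-scores (0 to 1) for 20 malign samples in a discovery cohort.
discovery_malignant = [0.20, 0.45, 0.50, 0.55, 0.60, 0.62, 0.65, 0.70, 0.72, 0.75,
                       0.78, 0.80, 0.82, 0.85, 0.88, 0.90, 0.92, 0.95, 0.97, 0.99]
cutoff = cutoff_for_sensitivity(discovery_malignant)  # 19 of 20 detected

# The cut-off is then frozen and applied to hypothetical replication scores.
replication_malignant = [0.40, 0.50, 0.90, 0.95]
replication_benign = [0.10, 0.20, 0.50, 0.30]
print(cutoff, evaluate(replication_malignant, replication_benign, cutoff))
```

Freezing the cut-off before the replication data are scored is what makes the replication sensitivity and specificity an unbiased check of the discovery-cohort threshold.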

The model separating early stages from benign consisted of three proteins, WFDC2, FOLR1 and KRT19, and achieved an AUC of 0.90 in the replication cohort (Fig. 2A), which was not statistically different from the performance in the discovery cohort (AUC = 0.94, p = 0.45, DeLong’s method, Table 2), suggesting robust modelling performance. Compared to the performance of CA125 alone for separating early-stage cancers from benign, the three-protein model achieved a significantly (p < 0.023, DeLong’s method) higher AUC. The model for late stages vs benign was built using the two proteins chosen by the feature selection, RBFOX3 and WFDC2, and achieved an AUC of 0.93 in the replication cohort (Fig. 2B), as compared to 0.97 in the discovery cohort (p > 0.12, DeLong’s method, Table 2). Finally, the model for separating benign from any stage consisted of three proteins, WFDC2, KRT19 and RBFOX3, and had an AUC of 0.92 in the replication cohort (Fig. 2C), as compared to 0.97 in the discovery cohort (p > 0.12, DeLong’s method, Table 2). Although the AUCs achieved by the multivariate protein models comparing late (III and IV) and any stage (I–IV) vs benign were consistently higher than for CA125 alone, these differences were not statistically significant (p > 0.19, DeLong’s method). Lastly, using the discovery cohort we also developed a cut-off for each model requiring at least 95% sensitivity in separating the malign from the benign, which was then applied to the replication cohort as well. For all three models, there was no statistical difference (all p-values > 0.08, Fisher’s Exact test, Table 2) between the cohorts in the point-estimates of sensitivity and specificity at the respective cut-off. Overall, the three models generated here for early, late and any stage vs benign conditions contained only four proteins in total (WFDC2, FOLR1, KRT19 and RBFOX3). We also compared the performance of the benign vs any stage model for separate tumour histologies (Fig. 2D).
When comparing the raw risk-scores between histologies, we found nominally significant (two-sided Wilcoxon rank-sum test) differences between LGS and each of carcinosarcoma, endometroid and HGS (p = 0.037, 0.022 and 0.0021, respectively), with lower values in LGS (Fig. 2D). After adjustment for multiple hypothesis testing, only the difference between HGS and LGS remained significant (q = 0.021). In addition to the raw risk-score distribution, we also compared the fraction of samples predicted as false negatives in the different histologies and found a nominally significant (p = 0.029, Fisher’s exact test) higher proportion in LGS as compared to HGS, although this did not remain significant after adjustment for multiple hypothesis testing (q = 0.29). Lastly, we compared the predictive performance of the model in separating benign from specific histologies (Fig. 2E). Although there is a trend of worse performance in separating LGS from benign as compared to all other comparisons, we found no statistical difference in the estimated AUC between any of the categories (all nominal p-values > 0.063, DeLong’s method).

Table 2 Results of multivariate modelling.

Discussion

There is a strong need to identify biomarkers for ovarian cancer, both to improve the diagnosis of women seeking healthcare and for population screening. A biomarker test separating benign from malign tumours in symptomatic women could reduce the need for diagnostic surgery and thereby reduce the risk of side effects on fertility and of iatrogenic menopause. Population screening could enable early detection of women at risk and provide an opportunity to identify early-stage cancer with a better prognosis. Today, none of the available biomarkers is accurate enough in a screening scenario to safely identify close to all cancers (high sensitivity) without also including a considerable fraction of false positives (low specificity). False positives lead to unnecessary anxiety among women until a benign or healthy diagnosis has been confirmed. In ovarian cancer, the diagnostic work-up results in additional examination and diagnostic surgery, which introduce health care costs and may result in complications and anxiety for the women. The largest running ovarian cancer screening study, the UKCTOCS (United Kingdom Collaborative Trial of Ovarian Cancer Screening)⁷, has suggested using multi-modal testing based on elevated MUCIN-16 followed by transvaginal ultrasound in post-menopausal women. Health-economic studies support that this strategy could be justified for screening²⁵. Recently, the long-term outcome in the UKCTOCS study²⁶ was analysed, and although an increase in early-stage cancer discovery was observed, no clear reduction in mortality could be seen. A similar study in the United States, the Normal Risk Ovarian Screening Study (NROSS), which employed the same multi-modal strategy, did, however, find more promising results¹⁰. A test based on multiple biomarkers could potentially achieve both a high sensitivity and specificity and thereby complement single CA125 or even replace the multi-modal strategy if the accuracy of the test is high enough.
For symptomatic women in Sweden today, where a TVU indicates an adnexal ovarian mass, surgery is used for final diagnosis, but close to 80% of these women have benign conditions. In this scenario, a molecular test with a high sensitivity and a moderate specificity would still be useful for triaging patients and could therefore reduce unnecessary surgery. The OVA1-test achieves a high sensitivity (above 90%)^27,28 at a specificity of 49–69%^27,28 and has in a multicentre study been shown to be clinically useful when compared to using CA125 alone¹⁴. Here, we show that a biomarker signature based on 3 proteins can separate malign from benign conditions at a sensitivity of 93% while retaining a specificity of 77%. This biomarker panel could potentially reduce the number of women in need of diagnostic surgery by up to two thirds, saving health care resources and reducing the risk of complications for the women.
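A rough back-of-the-envelope calculation shows how the reported sensitivity and specificity relate to the "up to two thirds" estimate. The Python sketch below assumes that 20% of the women sent to surgery have a malign tumour (read from the "close to 80%" benign figure above) and that a negative test alone would rule out surgery; both are simplifying assumptions for illustration, and the function name is hypothetical.

```python
def triage_summary(prevalence, sensitivity, specificity):
    """Fraction of women testing negative, and the share of negatives that are truly benign."""
    true_negatives = (1.0 - prevalence) * specificity
    false_negatives = prevalence * (1.0 - sensitivity)
    test_negative = true_negatives + false_negatives
    negative_predictive_value = true_negatives / test_negative
    return test_negative, negative_predictive_value


# Assumed prevalence of 0.20; sensitivity 0.93 and specificity 0.77 as reported.
spared, npv = triage_summary(prevalence=0.20, sensitivity=0.93, specificity=0.77)
print(f"test-negative fraction: {spared:.2f}")
print(f"negative predictive value: {npv:.3f}")
```

Under these assumptions roughly 63% of the women would test negative, which is close to two thirds, and about 98% of those negatives would be truly benign.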

Our investigation is based on close to 3000 characterized proteins and uses a machine learning based approach to select proteins for inclusion in the prediction models, without prior assumptions regarding known association with ovarian cancer and without restriction to proteins with high univariate significance. We have previously developed²⁹ and validated³⁰ a plasma protein biomarker panel consisting of 11 proteins for detection of ovarian cancer. That panel was selected from analyses of up to 983 proteins and achieved AUCs of 0.92 and 0.93 in the validation stage of two separate cohorts comparing benign and malign tumours. In the present study, three out of the four proteins in the final models were also part of the previous 11-biomarker panel, with the fourth being exclusively present on the Explore3072 expansion and not available in our previous analyses. Although these three proteins were measured using the same technology (PEA), it should be noted that in our studies they were characterized using two different versions of the PEA: the Target 96 and the Explore. Target 96 has a PCR-based readout while the Explore assays use next-generation sequencing. Although the techniques are similar, the manufacturer’s webpage lists slightly different performances for the two, with somewhat lower %CVs for the Explore versions but also slight differences in the expected measuring ranges. This in turn could affect the performance of the individual protein assays as part of a combined biomarker panel. On the other hand, the fact that the same three proteins were selected as top candidates, independent of the version of the PEA technology used, testifies to the robustness of the technology as well as the strength of the association with ovarian cancer.

The models presented here were exclusively generated using a discovery cohort followed by validation in an independent replication cohort. The detected models showed similar performance in the two cohorts, suggesting robust behaviour without overestimation of the predictive capabilities. Only four proteins were selected to be included in the final models for separating early, late and any stage from benign conditions.

In a clinical setting, the need to decisively distinguish only early or only late stage from benign conditions is likely uncommon, which is why the any-stage model is likely to be the most relevant. Notably, none of the models developed here included MUCIN-16, even though the protein was available for all models in the feature selection step. As reported above, when comparing the clinically reported CA125 values to MUCIN-16 as measured by the PEA, we found a moderate but significant correlation between the values of the two assays. Technical differences in analytical range between the assays could be a factor in why MUCIN-16 was not included in the models generated. Apart from technical factors, it should also be noted that the comparisons made were between malign and benign conditions and that MUCIN-16 is known to be elevated in several benign gynaecological conditions⁹. This is clear from the clinically measured levels (Table 1), where the mean values among patients with benign diagnoses in both cohorts (130.0 and 99.3 U/ml) are well above the established cut-off of 35 U/ml for ovarian cancer. Several of the four proteins used in our multivariate models here have previously been proposed as biomarkers for ovarian cancer. WFDC2 (also known as HE4) is part of the ROMA-score, and high expression of both KRT19 and FOLR1 has been associated with poor outcome and progression in ovarian cancer^31,32. RBFOX3 (RNA binding fox-1 homolog 3) has a known function in the regulation of alternative splicing of pre-mRNA, is commonly expressed in the central nervous system³³, and has been implicated in, for instance, neuroblastoma³⁴ and reported as elevated in prostate cancer³³. The role of RBFOX3 in ovarian cancer is, however, not well understood, and we have not found any previous literature linking it to ovarian cancer.

We analysed a large set of proteins with a commercial assay in two independent cohorts from two different geographical locations. A major strength of our study is the strict use of one cohort as discovery and the second for replication, both for the univariate and multivariate analyses. These cohorts contained a mixture of histology diagnoses among both the malignant and benign samples, which is reflective of the distribution that should be targeted in a future screening scenario. Our study is, however, limited by the sample size of the cohorts and by the distribution of analysed proteins across assays, which reduced the number of samples available for complete analyses. The sample size also prohibited us from performing detailed stratified analyses of different histologies and/or cancer stages, which could have further improved our models. Our results are also limited by the fact that we analysed only Swedish samples and that the study includes neither symptom-free controls nor samples collected before diagnosis, which could have been used to investigate the performance of the developed risk-score in a screening scenario.

Recent advances in the throughput of ultra-sensitive proteomics technologies, such as the one used here, make it possible to characterize an increasingly large number of plasma proteins using very small quantities of input material. Coupling such analysis technologies with machine learning approaches to detect combinations of biomarkers with robust predictive power is a powerful approach to break new ground and go beyond the current knowledge. The PEA technology has been shown to work well not only in wet plasma but also in dried blood spots^35,36. This opens the possibility of screening based on self-collected dried blood spots, coupled with precise molecular biomarkers, as a cost-efficient solution for early detection and monitoring of ovarian cancer.

Data availability

Raw data are located in controlled access data storage at the Swedish Science for Life Laboratories (SciLifeLab) Data Repositories, accessible at https://doi.org/10.17044/scilifelab.25237765.

References

Torre, L. A. et al. Ovarian cancer statistics, 2018. CA Cancer J. Clin. 68, 284–296 (2018).
Sung, H. et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 71, 209–249 (2021).
Wu, R. C. et al. Genomic landscape and evolutionary trajectories of ovarian cancer precursor lesions. J. Pathol. 248, 41–50 (2019).
Shih, I. M., Wang, Y. & Wang, T. L. The origin of ovarian cancer species and precancerous landscape. Am. J. Pathol. 191, 26–39 (2021).
Labidi-Galy, S. I. et al. High grade serous ovarian carcinomas originate in the fallopian tube. Nat. Commun. 8, 1–11 (2017).
Brown, P. O. & Palmer, C. The preclinical natural history of serous ovarian cancer: Defining the target for early detection. PLoS Med. 6, e1000114 (2009).
Jacobs, I. J. et al. Ovarian cancer screening and mortality in the UK Collaborative Trial of Ovarian Cancer Screening (UKCTOCS): A randomised controlled trial. Lancet 387, 945–956 (2016).
Bast, R. C. et al. A radioimmunoassay using a monoclonal antibody to monitor the course of epithelial ovarian cancer. N. Engl. J. Med. 309, 883–887 (1983).
Sölétormos, G. et al. Clinical use of cancer biomarkers in epithelial ovarian cancer: Updated guidelines from the European Group on Tumor Markers. Int. J. Gynecol. Cancer 26, 43–51 (2016).
Bast, R. C., Han, C. Y., Lu, Z. & Lu, K. H. Next steps in the early detection of ovarian cancer. Commun. Med. 1, 1–3 (2021).
Lycke, M., Kristjansdottir, B. & Sundfeldt, K. A multicenter clinical trial validating the performance of HE4, CA125, risk of ovarian malignancy algorithm and risk of malignancy index. Gynecol. Oncol. 151, 159–165 (2018).
Moore, R. G. et al. A novel multiple marker bioassay utilizing HE4 and CA125 for the prediction of ovarian cancer in patients with a pelvic mass. Gynecol. Oncol. 112, 40–46 (2009).
Cui, R., Wang, Y., Li, Y. & Li, Y. Clinical value of ROMA index in diagnosis of ovarian cancer: Meta-analysis. Cancer Manag. Res. 11, 2545 (2019).
Reilly, G. P. et al. A real-world comparison of the clinical and economic utility of OVA1 and CA125 in assessing ovarian tumor malignancy risk. J. Comp. Eff. Res. https://doi.org/10.57264/cer-2023-0025 (2023).
Gyllensten, U. et al. Next generation plasma proteomics identifies high-precision biomarker candidates for ovarian cancer. Cancers (Basel) 14, 1757 (2022).
Glimelius, B. et al. U-CAN: A prospective longitudinal collection of biomaterials and clinical information from adult cancer patients in Sweden. Acta Oncol. (Madr.) 57, 187–194 (2018).
Region Västra Götaland. Gothia Forum för klinisk forskning: Biobank Väst. https://www.gothiaforum.com/web/en.
Assarsson, E. et al. Homogenous 96-plex PEA immunoassay exhibiting high sensitivity, specificity, and excellent scalability. PLoS One 9, e95192 (2014).
Olink Proteomics AB. PEA-a High-Multiplex Immunoassay Technology with QPCR or NGS Readout. 2020.
Bueno Álvez, M. et al. Next generation pan-cancer blood proteome profiling using proximity extension assay. Nat. Commun. https://doi.org/10.1038/s41467-023-39765-y (2023).
Zhong, W. et al. Next generation plasma proteome profiling to monitor health and disease. Nat. Commun. 12, 1–12 (2021).
R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2020).
Kuhn, M. Building predictive models in R using the caret package. J. Stat. Softw. 28, 1–26 (2008).
Eklund, A. & Trimble, J. The bee swarm plot, an alternative to stripchart (2021).
Menon, U. et al. The cost-effectiveness of screening for ovarian cancer: Results from the UK Collaborative Trial of Ovarian Cancer Screening (UKCTOCS). Br. J. Cancer 117, 619–627 (2017).
Menon, U. et al. Ovarian cancer population screening and mortality after long-term follow-up in the UK Collaborative Trial of Ovarian Cancer Screening (UKCTOCS): A randomised controlled trial. Lancet 397, 2182–2193 (2021).
Fritsche, H. A. & Bullock, R. G. A reflex testing protocol using two multivariate index assays improves the risk assessment for ovarian cancer in patients with an adnexal mass. Int. J. Gynecol. Obstet. 162, 485–492 (2023).
Kumari, S. Serum biomarker based algorithms in diagnosis of ovarian cancer: A review. Indian J. Clin. Biochem. 33, 382 (2018).
Enroth, S. et al. High throughput proteomics identifies a high-accuracy 11 plasma protein biomarker signature for ovarian cancer. Commun. Biol. 2, 221 (2019).
Enroth, S. et al. Data-driven analysis of a validated risk score for ovarian cancer identifies clinically distinct patterns during follow-up and treatment. Commun. Med. 2, 1–13 (2022).
Bax, H. J. et al. Folate receptor alpha in ovarian cancer tissue and patient serum is associated with disease burden and treatment outcomes. Br. J. Cancer 128, 342–353 (2022).
Saha, S. K., Yin, Y., Chae, H. S. & Cho, S. G. Opposing regulation of cancer properties via KRT19-mediated differential modulation of Wnt/β-catenin/notch signaling in breast and colon cancers. Cancers (Basel) 11, 99 (2019).
Uhlen, M. et al. A pathology atlas of the human cancer transcriptome. Science https://doi.org/10.1126/science.aan2507 (2017).
Freire, N. H. et al. Targeting the epigenome of cancer stem cells in pediatric nervous system tumors. Mol. Cell. Biochem. 2023(1), 1–15 (2023).
Broberg, K. et al. Evaluation of 92 cardiovascular proteins in dried blood spots collected under field-conditions: Off-the-shelf affinity-based multiplexed assays work well, allowing for simplified sample collection. BioEssays https://doi.org/10.1002/bies.202000299 (2021).
Björkesten, J. et al. Stability of proteins in dried blood spot biobanks. Mol. Cell. Proteomics 16, 1286–1296 (2017).

Acknowledgements

We are grateful for the willingness of the participants to donate samples for the research conducted here. This study was funded by Sjöbergsstiftelsen (SE, UG, KSu), the Swedish Research Council (SE 2022-00857), the Swedish Cancer Foundation (SE 220604FE, UG 190008PJ, KSu CAN211848) and the Swedish state under the agreement between the Swedish government and the county council, the ALF-agreement (KSu). The funders had no role in the study design or the decision to publish the results.

Funding

Open access funding provided by Uppsala University.

Author information

Authors and Affiliations

Department of Immunology, Genetics, and Pathology, Biomedical Center, SciLifeLab Uppsala, Uppsala University, 75108, Uppsala, Sweden
Emma Ivansson, Julia Hedlund Lindberg, Ulf Gyllensten & Stefan Enroth
Department of Women’s and Children’s Health, Uppsala University, 75185, Uppsala, Sweden
Karin Stålberg
Department of Obstetrics and Gynaecology, Institute of Clinical Sciences, Sahlgrenska Academy at Gothenburg University, 41685, Gothenburg, Sweden
Karin Sundfeldt

Authors

Emma Ivansson
Julia Hedlund Lindberg
Karin Stålberg
Karin Sundfeldt
Ulf Gyllensten
Stefan Enroth

Contributions

Conceptualization: K.St., U.G., K.Su., S.E. Data curation: S.E., E.I. Formal Analysis: E.I., S.E. Funding acquisition: K.Su., U.G., S.E. Investigation: J.H.L., E.I. Methodology: S.E. Project administration: S.E. Resources: K.St., U.G., K.Su. Software: S.E. Supervision: S.E. Validation, Visualization, Writing – original draft: S.E. All authors: Writing – review & editing. S.E., E.I. and U.G. had full access to and verified the underlying data. All authors read and approved the final version of the manuscript.

Corresponding author

Correspondence to Stefan Enroth.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Table 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Ivansson, E., Hedlund Lindberg, J., Stålberg, K. et al. Large-scale proteomics reveals precise biomarkers for detection of ovarian cancer in symptomatic women. Sci Rep 14, 17288 (2024). https://doi.org/10.1038/s41598-024-68249-2

Received: 20 January 2024
Accepted: 22 July 2024
Published: 27 July 2024
DOI: https://doi.org/10.1038/s41598-024-68249-2
Springer Nature Limited

Introduction