Introduction

Ovarian fibromas and adenofibromas form part of the sex-cord stromal family of tumours and are relatively uncommon, accounting for approximately 8% of all diagnosed ovarian tumours [1]. These tumours are benign entities composed in significant part of fibroblasts (pure fibromas), or as compound tumours composed of a mix of fibroblast and epithelial (adenofibroma) or sex-cord (granulosa-stromal tumours) components. Tumours with a cystic epithelial component are termed cystadenomas or cystadenofibromas.

Due to their relative rarity and benign nature these tumours have not been well molecularly characterised, with the majority of studies focussing on immunohistochemistry and cytogenetics. Ovarian fibromas differ from fibromas arising in other organs in that they frequently express hormone receptors (e.g. ER-β, PR, AR) and are typically negative for the characteristic markers of other cells derived from a fibroblast/myofibroblastic origin (e.g. SMA, CD34, CD117, S-100) [1, 2].

Genomic aberrations involving trisomy and tetrasomy of chromosome 12 appear to be particularly prevalent in tumours arising in the female genitourinary tract, including uterine leiomyomas [3, 4], thecomas [5,6,7], fibromas [7,8,9,10] and granulosa cell tumours [11, 12]. The underlying biological driver for this recurrent event has yet to be established. Other genomic aberrations that arise less frequently may be more cell type-specific. Imbalances involving chromosomes 4 and 9 are also common in the fibroma-thecoma subgroup, with chromosome 9 aberrations potentially being associated with cellular fibromas [13].

We previously reported that around 40% of benign serous cystadenomas and cystadenofibromas show copy number aberrations (CNA) exclusively in the stroma [14] and are thus likely misdiagnosed primary fibromas with epithelial inclusion. To investigate this further we undertook molecular characterisation of these epithelial-stromal tumours in comparison to pure fibromas.

Main text

Materials and methods

Tissue samples

Fresh frozen tissue samples were used for copy number, exome and expression analyses. All samples were collected with the patient’s informed consent and the study was approved by the Human Research Ethics Committees at the Peter MacCallum Cancer Centre. Patients with ovarian tumors were identified through hospitals in the Wessex Region, UK (n = 25) [15] and the Australian Ovarian Cancer Study (AOCS) (n = 31) [16, 17]. Pathology review was conducted on cryosections adjacent to the tissue from which DNA was extracted (PA). Microdissections and DNA/RNA extractions were performed as previously described [18]. Samples were selected for inclusion based on availability of tissue for DNA and RNA extraction.

Copy number data

The Affymetrix SNP6.0 Human Mapping (1.8 M probe set) array was utilised for ultra-high resolution allele-specific copy number analysis. Arrays were performed as recommended by the manufacturer with the exception that the input was reduced from the recommended 500 ng to 250 ng by reducing reaction volumes by half for all processes prior to the SNP6.0 PCR step. Reduction in DNA input does not result in any loss in the quality of the data. Copy number analysis was performed as previously described [18], using Partek Genomics Suite v 6.5. Copy number and allele-specific copy number was generated paired (when matching normal available) or unpaired and circular binary segmentation was performed to identify regions of copy number and loss of heterozygosity. Thresholds were > 2.3 for gains, < 1.7 for losses and < 0.75 for homozygous deletions. All SNP data has been made publicly available through Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/ - GSE67189).

Exome sequencing

For each case 500 ng–1 µg of microdissected tumour DNA and matched lymphocyte DNA when available was sheared to < 1000 bp using a Covaris® ultra-sonicator (Covaris®), libraries prepared using the Illumina TruSeq DNA Sample Preparation procedure (Illumina), and enriched for exome sequencing using the SeqCap EZ Human Exome Library v2.0 (Roche NimbleGen). Exomes were sequenced with 100 bp PE reads in pools of three per lane on a HiSeq 2000 (Illumina).

Sequence reads were aligned to the human genome (GRCh37/hg19) using BWA-MEM (v0.7.7-r441) [19, 20]; duplicates marked using Picard (v1.77); local indel realignment and base quality recalibration performed using GATK (v2.7-2-g6bda569) [21, 22]; indel detection performed using GATK Unified Genotyper (v2.7-2-g6bda569), Indel Genotyper, Pindel (v0.2.5a3) [23], and VarScan2 (v2.2.4) [24]; SNV prediction performed using GATK Unified Genotyper, MuTect (v2.7-1-g42d771f) [25], SomaticSniper [26], JointSNVMix2 (v0.8-b2) [27], and VarScan2 (v2.2.4); and variants annotated using Ensembl variant effect predictor v73. Exome bam files are available from the Sequence Read Archive Accession number PRJNA631561 (https://www.ncbi.nlm.nih.gov/sra/PRJNA631561).

Variants were enriched for genuine somatic events by filtering for those called by >=2 variant callers, with the exception of MuTect, which is capable of detecting variants at lower frequencies and therefore all MuTect variants were included; germline allele frequency <=0.01 and tumour allele frequency >=0.05, with >=0.1 difference in allele frequency between tumour and germline; variant observed in <=3 of 250 in-house germ-line exomes. All variants with a tumour allele frequency >=0.1 were taken forward for Sanger sequencing validation.

Expression data

Expression data was generated using the Affymetrix Human Gene 1.0 ST array according to the manufacturer’s recommendations. An input of 300 ng of total RNA was used, as quantified by Nanodrop spectrophotometer. RIN values were determined using the Agilent Bioanalyzer RNA 6000 Nano assay, the average RIN value for the 25 samples was 4.7 (range 1–7.9). Analysis of the data was performed using the Partek Gene Expression workflow. CEL files were processed using RMA normalisation and batch correction. All Gene 1.0 ST data has been made publicly available through Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/-GSE67223).

Results

The clinical features of the ovarian cohort are presented in Table 1. Women with ovarian fibromas compared to benign serous ovarian tumours had very similar median ages and ranges (64, range 35–80 and 61, range 27–80 respectively). Interestingly, there appeared to be a strong preponderance for both fibromas and benign serous tumours to be bilateral or detected on the right ovary (Binomial test for left vs right P = 0.02).

Table 1 Clinical features of cohort

Copy number aberrations

Genome-wide copy number analysis was performed for eight unselected pure fibromas, and compared to copy number data from the stroma of 27 serous cystadenofibromas and 18 serous cystadenomas (collectively referred to as benign serous tumours). CNAs were detectable in 7/8 (88%) of the pure fibromas, with gain of chromosome 12 being the most recurrently observed aberration in five of eight (63%) cases (Additional file 1: Table S1). Other recurrent CNAs in the fibromas were gain of chromosomes 9 or 9q (50% cases), 18 and 21 (20% cases each). CNAs were detected in the stroma of 33% of benign serous tumours. Recurrent gain of chromosome 12 was also observed in 31% of serous cystadenofibromas (8/27) and 17% of serous cystadenomas (3/18), gain of 9q was only observed in single serous cystadenofibroma case, while loss of chromosome 22 was detected in 11% of cystadenofibromas (3/27). No CNAs were detected in the stroma of the normal ovaries.

Expression analysis

Gene expression arrays were used to compare the stromal RNA of three normal ovaries against eight pure fibromas (seven with CNAs), seven cystadenomas (two with CNAs), and seven cystadenofibromas (four with CNAs). Comparing normal ovary to benign serous tumours or fibromas did not identify any differentially expressed genes following multiple testing correction. Comparison of benign serous tumours to fibromas also did not identify any differentially expressed genes. Comparison of samples based on the presence or absence of CNA, or the presence of specific CNA compared to an absence of genomic aberrations (with and without tumour subtyping), did not identify differentially expressed genes that remained significant following multiple testing correction. As these samples are difficult to enrich for neoplastic cells due to a mixture of cell types in the stroma the expression signal from the tumour cells will be diluted, therefore a less stringent approach was taken to identify candidate genes by taking the most significantly altered genes p <=0.001 with a fold change >=2. Through this approach 17 genes were found to be differentially expressed based on the presence of specific CNA (gain 9q and 12, loss of 16q) compared to samples with no genomic CNAs (Table 2).

Table 2 Candidate genes for differential expression

Exome data

Exome sequencing was performed on the stromal DNA of seven cystadenofibromas, one adenofibroma and one cystadenoma (all with CNAs), and two cystadenofibromas and one cystadenoma with no CNAs. Exome sequencing identified 83 putative somatic variants, with an average of 7 mutations per case (range 2–20). It is difficult to enrich for the subpopulation of neoplastic fibroblasts in the stroma, as indicated by the low variant allele frequency of the majority of the variants (Additional file 2: Table S2), and subsequently difficult to validate findings by Sanger sequencing. We undertook Sanger validation of a subset of variants for each case. No recurrent mutations or recurrently mutated genes were identified that went on to validate. In total, 20 somatic variants were able to be validated by Sanger sequencing (Table 3). Of the validated variants, a single nonsense mutation was identified in the DMD gene. The remaining 19 validated variants were all missense variants, the functional impacts of which were assessed using transFIC (Transformed Functional Impact for Cancer) (Table 3). No variants from the three tumours with no CNAs validated by Sanger sequencing.

Table 3 Validated exome variants

The ability to detect variants may be confounded by normal DNA contamination. There was a positive correlation between the number of variants detected (before validation) and the average allele frequency (Spearman’s r = 0.76, P = 0.01, Additional file 3: Figure S1). The mean sequencing depth was 121.3 reads (88.6–186.8) for germline samples and 122.7 reads (72.8–163.6) for somatic samples, therefore the detection of low frequency variants is not substantially compromised by limited depth of coverage.

Discussion

The findings of this study are consistent with previous karyotyping and FISH studies that identified trisomy 12 as the most common chromosomal abnormality identifiable in ovarian fibromas [8,9,10]. Pure ovarian fibromas were found to harbour chromosomal abnormalities more frequently than benign serous ovarian tumours (88% vs. 33%), further supporting our hypothesis that a subset of benign serous tumours are actually fibromas that coincidentally have an associated epithelial cyst. However, benign serous tumours more frequently harboured trisomy 12 as the sole aberration (47% of tumours with aberrations) compared to fibromas (30% of tumours with aberrations). Fibromas also more frequently harboured CNA that were rarely detected in the benign serous tumours such as 9q gain (50% cases), potentially indicating unique underlying biological drivers.

Expression analysis provided some interesting candidates that have previously been associated with neoplasms or fibroblast growth for further investigation. Genes with increased expression in tumours with CNAs compared to those without CNAs included the extracellular matrix (ECM) and signalling molecules HAPLN1 and ANOS1, the antigen and repressor of retinoic acid signalling molecule PRAME, and the cell cycle regulator CKS2. All of these genes have been previously associated with overexpression in other types of neoplasm [28,29,30,31,32,33,34], and PRAME and CKS2 expression have been proposed as markers of poor prognosis in high-grade serous ovarian carcinoma [33, 35].

Genes with decreased expression in tumours with CNAs compared to those without CNAs included the putative DNA/RNA helicase SLFN11, the ECM protein PRELP, the high density lipoprotein component APOD, and the secreted chaperone CLU. Although these have all been linked with altered expression in other neoplasms before, this has typically been upregulation and potentially linked to neoplastic progression [36,37,38,39,40], including in ovarian cancer for APOD and CLU [41, 42]. Data is inconsistent for CLU, as expression was also linked to improved prognosis in high-grade serous ovarian carcinoma [33]. Low expression of SLFN11 has been associated with resistance to chemotherapy in ovarian cancer other cancers due to its role in the DNA damage response [43].

No genes were found to be recurrently mutated, however, two tumours had mutations in histone methytransferases (SETD2 and KMT2D) and one also had a mutation in a demethylase (KDM4A), all previously associated with neoplasia and all mutations predicted to be deleterious. Other mutated genes have also been associated with neoplasia, such as HECW2, SRRT and KLK14, with predicted medium to high deleterious impact.

Limitations

  • Use of Affymetrix Human Gene 1.0 ST array is limited to the probes on the array at the time.

  • Insufficient power to detect differentially expressed genes due to n = 3 normal ovaries.

  • Limited ability to detect mutations due to normal contamination.