Introduction

Breast cancer (BC) is currently the most common cancer among women worldwide [1]. In addition to the harmful impact by industrialization induced pollutants or mental-related endocrine change, BC is well known as a complex and heterogeneous disease driven by genomic variants [2]. Statistically, 5–10% of breast cancers are primarily caused by genetic factors resulting from the accumulation of acquired somatic alterations [3].

Somatic alteration of the phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha (PIK3CA) gene is one of the most prevalent driver genes in breast cancer, occurring at a frequency of 20–40% [4, 5]. PIK3CA gene is located in chromosome 3q26.32 and encodes the PIK3CA, also named the p110α protein, which is an important component of class I phosphoinositide 3-kinase (PI3K). More than 80% of PIK3CA mutations occur at three ‘‘hotspots’’, E542K and E545K in exon 9 encoding the helical domains, and H1047R in exon 20 encoding the kinase domains [6]. Through the PI3K/AKT pathway, altered PIK3CA gene plays a critical role in cell survival, apoptosis, proliferation, motility, and adhesion [7].

PI3K inhibition as a therapeutic strategy for BC has been investigated throughout many clinical trials [8, 9]. Theα-specific PI3K inhibitor Alpelisib (PIQRAY, Novartis) has received accelerated approval by the Food and Drug Administration (FDA) in May 2019 for the treatment of patients with estrogen receptor–positive (HR +)/human epidermal growth factor receptor 2-negative (HER2-) PIK3CA-mutated advanced or metastatic BC based on the SOLAR-1 trial [10]. With the development of next-generation sequencing (NGS) technologies, more recognition on the individual genomic landscape has become possible. Some studies have focused on the altered PIK3CA gene spectrum in different race [11, 12]. However, there is a lack of comprehensive comparison of the PIK3CA gene alteration spectrum between Caucasians and Chinese in a large scale.

In this study, we report the somatic PIK3CA alterations in 412 Chinese BC patients, demonstrate the association between the frequency of PIK3CA alterations and the clinicopathological characteristics. We analyze the alteration spectrum within different molecular subgroups, and compare with the data of The Cancer Genome Atlas (TCGA). The ultimate aim is to provide more precise annotations of tumor genomic alterations of Chinese compared to Caucasian in the context of promoting research and application of PI3K inhibitors.

Materials and methods

Patients and tumor samples

This study was approved by the ethics committee of Guangdong Provincial People’s Hospital (GDPH) [approval number of 2017312 H(RE)]. The written informed consents were obtained from all participants. A total of 412 Chinese patients of Han nationality who were diagnosed with BC at the GDPH from June 1, 2017 to September 27, 2018 were enrolled in the study. The inclusion criteria are as follows: (1) patients were diagnosed with invasive breast cancer and no chemo or radiotherapy before NGS test; (2) complete clinicopathological information were acquired [sex, age, menstrual status, primary tumor size, axillary lymph node status, pathological type, pathological grade, estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2) status, Ki67 status, distant metastasis status]. Specimens were reviewed by the Department of Pathology at GDPH. The breast cancer molecular subtypes were characterized based on the guideline of St Gallen International Expert Consensus (2019) [13]; and (3) qualified tumor tissue sequencing information.

For The Cancer Genome Atlas (TCGA) cohort, the PIK3CA mutation profiles of Caucasian patients with breast cancer were downloaded from the public website (http://genome-cancer.ucsc.edu) and analyzed. A total of 453 patients were eligible for this study.

Tissue DNA extraction

DNA from paraffin-embedded breast cancer tissues was extracted using QIAamp DNA FFPE tissue kit (Qiagen, California, US) according to the manufacturer’s instructions. DNA concentration was measured by Qubit dsDNA assay (Life Technologies, California, US).

NGS library preparation and capture-based targeted DNA sequencing

DNA was subjected to end repair, phosphorylation and adaptor ligation. Fragments of size 200–400 bp were selected by AMPure beads (Agencourt AMPure XP Kit), followed by hybridization with capture probe baits, hybrid selection with magnetic beads and PCR amplification. A high-sensitivity DNA assay was performed to assess the quality and size of the fragments. Indexed samples were sequenced on Nextseq500 sequencer (Illumina, Inc., USA) with pair-end reads.

Sequence data analysis and identification of variants

Sequence data were mapped to the human genome using BWA aligner 0.7.10. Local alignment optimization, variant calling and annotation were performed using GATK 3.2, MuTect, and VarScan. Variants were filtered using the VarScan fpfilter pipeline, with loci of depth less than 100 filtered out. At least 5 supporting reads were needed for INDELs; while 8 supporting reads were needed for SNVs to be called. According to the ExAC, 1000 Genomes, dbSNP, ESP6500SI-V2 database, variants with population frequency over 0.1% were grouped as SNP and excluded from further analysis.

The reported mutations were further discerned by searching dbSNP (http://www.ncbi.nlm.nih.gov/SNP), ClinVar (http://www.ncbi.nlm.nih.gov/clinvar/), COSMIC(http://cancer.sanger.ac.uk/cosmic), BRCA Exchange (http://www.brcaexchange.org/), and Exome Aggregation Consortium (http://exac.broadinstitute.org/) databases along with PubMed publications to detect novel mutations.

Statistical analysis

All the computations were performed using the R software (version3.6.0). Pearson’s Chi-square test and Fisher’s exact test were used to study the correlation between the clinicopathological features and the occurrence of PIK3CA mutation. The results with obtained P value < 0.05 were considered statistically significant.

Results

Association of PIK3CA alterations with clinicopathological features

We examined a total of 412 tumor tissues of Chinese patients with invasive BC. They were divided into PIK3CA altered (N = 188) and wild-type (N = 224) groups. The clinicopathological features and PIK3CA gene status of these patients are summarized in Table 1. PIK3CA alterations were predominantly found in ER + (49.3%, p = 0.024), low ki67 status (58.3%, p = 0.007), and low pathological grade tumors (grade I 80%, grade II 53.4% and grade III 35.9%; p < 0.001). The PIK3CA alteration status was significantly different among luminal A (57.6%), luminal B (44.6%), Her2-enriched (44.9%) and basal-like molecular subtypes (32.1%). No significant association was found between PIK3CA alterations and onset of age, menopausal status, tumor size, lymph node status, metastasis status, pathological stage, pathological type, PR and Her2 status.

Table 1 Clinicopathologic characteristics and PIK3CA status of 412 Chinese patients with invasive breast cancer patients (GDPH cohort)

The characteristics of PIK3CA alterations in GDPH and TCGA-white cohort

In the GDPH cohort, 49 somatic alterations were detected in 188 out of 412 patients, with a gross alteration rate of 45.6% (Fig. 1a). 163 patients had single alterations (mutation or amplification) and 25 had compound alterations. In the TCGA-white cohort, 43 somatic alterations were detected in 157 out of 453 patients (alteration rate of 34.7%) (Fig. 1b), which is lower than that of the GDPH cohort (p < 0.001). 127 patients had single alterations and 30 harbored compound mutations.

Fig. 1
figure 1

PIK3CA somatic alteration distribution across breast cancer population of GDPH and TCGA-white cohorts. a Distribution in GDPH population. b Distribution in TCGA-white population. PIK3CA phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha, GDPH Guangdong Provincial People's Hospital, TCGA The Cancer Genome Atlas

The PIK3CA alteration type and mutation site were studied and illustrated in Fig. 2. The predominant alteration type was missense mutation (GDPH 90.23%, TCGA-white 94%). Both cohorts had similar PIK3CA mutation site. The PIK3CA mutations occurred in several exons (exon1, 2, 4, 5, 7–9, 13, 15, 17, 19, 20), with the most frequent location in exon 9 and 20 for both GDPH (73.7%) and TCGA-white population (66.0%). The PIK3CA mutation sites with mutation rate ≥ 1% in GDPH cohort were showed in Table 2. The fraction of three common mutation sites including p.E545K, p.E542K and p.H1047R were showed in Fig. 2c. The TCGA-white cohort had almost similar common mutations rate among all altered PIK3CA compared to the GDPH cohort (65.6% vs 66.0%, p = 0.95). However, p.H1047R mutation occupied a significantly higher proportion in GDPH cohort than that of TCGA-white cohort, in all cases (19.9% vs 9.9%, p < 0.001) or in common mutations (66.1% vs 43.7%, p = 0.01). In addition, 9 novel mutation sites of PIK3CA were detected in the GDPH cohort (Table 3).

Fig. 2
figure 2

PIK3CA alteration spectrum in breast cancer of the TCGA and GDPH cohorts. a Distribution of mutation across the gene. b Pie charts showing the fraction of alteration types. c Pie charts showing the fraction of mutation sites. d Conserved and functional domains of PIK3CA gene. *Rare mutation: defined as PIK3CA mutations except three common mutation sites: p.E545K, p.E542K and p.H1047R. PIK3CA phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha, GDPH Guangdong Provincial People's Hospital, TCGA The Cancer Genome Atlas, CN_amp copy number amplification

Table 2 PIK3CA mutation sites with mutation rate ≥ 1% in GDPH cohort and corresponding rate in TCGA-white cohort
Table 3 Novel mutation sites of PIK3CA detected in GDPH cohort

Spectrum of PIK3CA alterations across ER/HER2 subtypes

The PIK3CA alteration spectrum was studied in detail by dividing the GDPH specimens into four ER/HER2 subtypes (Fig. 3). PIK3CA alterations occurred at the highest frequency in ER + /HER2 + (51.6%) tumors, followed by the ER + /HER2-(48.7%) and ER-/HER2 + (43.1%) tumors, and the lowest in ER-/HER2-(30.0%) tumors (Table 1). Missense mutation was the primary alteration type for four subtypes, while copy number amplification occupied considerable proportion in ER-/HER2- type (19.05%). Common mutations distributed extremely alike across subtypes with rates among all altered PIK3CA as 66.1% in ER + /HER2 −, 66.7% in ER + /HER + , 64.0% in ER-/HER2 + and 66.7% in ER −/HER2 − subtype. Because of the limited sample size, none of the three common mutations were separately analyzed.

Fig. 3
figure 3

PIK3CA alteration spectrum across four ER/HER2 subgroups in breast cancer of GDPH cohort. A. Distribution of mutation across the gene. B. Pie charts showing the fraction of alteration types. C. Conserved and functional domains of PIK3CA gene. PIK3CA phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha, GDPH Guangdong Provincial People's Hospital, TCGA The Cancer Genome Atlas, CN_amp copy number amplification

Discussion

In the current study, we explored the frequency and spectra of somatic altered PIK3CA gene by high-resolution genomic sequencing in a large cohort of 412 Chinese patients with invasive BC. To the best of our knowledge, this is one of the largest studies to describe the alteration spectrum of PIK3CA gene in Chinese BC patients of Han nationality compared with Caucasian. The frequency of PIK3CA alteration is 45.6% in our study, which is slightly higher than that of TCGA-white data (34.7%) and other literature data (7–36%) in Chinese [14,15,16], 14–45% in other Asians [11, 17, 18], 18–40% in Americans [19] and 13–45% in Europeans [20, 21]. The disparity of PIK3CA gene mutation rate in different studies could be explained by ethnicity, sample size, inclusion criteria or gene sequencing methods.

In accordance with previous studies, the majority of PIK3CA mutations occurred in three hotspot sites, namely E542K and E545K in helical domain and H1047R in kinase domain [11]. Interestingly, there is a significantly larger proportion of p.H1047R mutation in our study than the TCGA Caucasian cohort. The H1047R mutation enhances the interaction of the p110 kinase domain with cell membranes, while the E542K and E545K mutations disrupt the inhibitory interface with p85 [22]. Due to different functional mechanisms, 3 hotspot sites may cause different damage through downstream signaling. An in vitro study showed that the helical domain mutants produced a more aggressive phenotype than kinase domain mutants [23]. However, one study suggested that the mere existence of mutant E545K may not be as harmful as H1047R [24]. As for utilization of antineoplastic drugs, limited studies have focused on the relationship between specific PIK3CA alteration site and effectiveness of drugs [25]. In a retrospective analysis, Binghe Xu, etc. suggested that PIK3CA/H1047R mutations may be a potential biomarker of sensitivity to everolimus, wherein H1047R mutated patients had longer PFS than wild-type or other mutant forms of PIK3CA. Analyses in the neoadjuvant setting suggested that mutations in exon 9 conferred a higher sensitivity to pictilisib when compared with mutations in exon 20 [26]. In general, two puzzles remain to be clarified about clinical significance of PIK3CA alteration sites: (1) extent of harmfulness for different PIK3CA alteration domains; and (2) PIK3CA alteration site as a biomarker for patients who benefit most from PI3K inhibitors.

Another question is the relationship between PIK3CA alterations and clinicopathological characteristics. In present study, PIK3CA alterations happen more in BC with low pathological grade, ER positive and low Ki67 index. The positive correlation between PIK3CA alteration and better prognosis has also been previously reported [20, 27, 28]. A large-scale systematic review of BC clinical studies involved 2587 cases showed that patients with tumors harboring a PIK3CA mutation have a better clinical outcome, especially for postmenopausal women with ER + BC [29]. However, recent molecular profiling data from MBC patients seem to indicate that in advanced HR + /HER2 − BC, a PIK3CA mutation would lead to resistance to chemotherapy and a poor outcome [30]. On all accounts, improved understanding of the clinical impact of PIK3CA alterations is critical to prevent or explain therapeutic failures and develop optimal personalized therapeutics against breast cancer.

Additionally, the relevance between PIK3CA alteration spectrum and ER/HER2 molecular subtypes has also been detected in our study. PIK3CA alterations were more prevalent in ER + tumors and the least in the ER−/HER2−(30.0%) subtype, in agreement with previous studies [5, 21, 31]. Despite the overall low prevalence of PIK3CA mutations in TNBC, the proportion is not negligible. The finding of Jiang etc. [4] stated that Chinese TNBC cases demonstrate 18% PIK3CA mutations, we found the mutation rate of 30%. Therefore, high PIK3CA mutation may be a remarkable feature of Chinese TNBC patients. Meanwhile, ER−/HER2− subtype in our study is enriched for frameshift and DNA copy number amplification, which was similarly observed previously [32], suggesting the genomic instability in TNBC subtypes. Several clinical studies are trying to determine the potential benefit of PIK3CA inhibitors in different BC subtypes. Although the efficacy of additional PIK3CA inhibitor shows lack of significance in the early setting [33, 34], the combined use with existing therapies in heavily pretreated MBC patients demonstrates substantial prospective [10, 35]. On all accounts, a biomarker-driven approach to identify optimal clinical choices in the background of PIK3CA gene alterations is warranted.

In conclusion, our study showed that PIK3CA alteration is a common event in Chinese BC patients and has a unique spectrum compared to that of Caucasians. The distribution patterns of PIK3CA alteration are distinct across four molecular subtypes based on ER/HER status. Future clinical trials should assess PIK3CA alteration characteristics as biomarkers to predict therapeutic response to PI3K inhibitors.