Introduction

Gastric cancers (GCs) are still one of the leading causes of cancer-related mortality. The high mortality of GCs is mainly due to late diagnosis and poor response to the currently available therapeutic agents. One of the reasons that may explain poor clinical outcomes of GCs is their highly heterogeneous nature. Unlike other tumors of the digestive system, GCs exhibit diversity in various pathological factors, including histological type, differentiation, stroma and infiltration patterns. Histological type is one of the most important factors, because it has a close relationship to the aggressiveness of the disease or prognosis of patients with GCs [1]. It is known that patients with poorly differentiated-type GCs have a worse prognosis than those with well-differentiated-type GCs [2]. The histological features of GCs vary from lesion to lesion and within the same tumor due to heterogeneity. Even tumors confined to the mucosa show histologic diversity, which tends to increase with deeper invasion and increased tumor diameter [3]. As mucosal tumors grow or invade into submucosal or deeper layers, the numbers or composites of histological patterns are likely to change [4]. Differentiated-type GCs were reported to progress to undifferentiated type [5]. Mixed-type GCs are defined as differentiated-type cancers containing undifferentiated-type components or undifferentiated-type cancers containing differentiated-type components. Current guidelines stipulate that mixed-type cancers of predominantly the differentiated type are to be dealt with as differentiated-type cancers and those of predominantly the undifferentiated type as undifferentiated type. Takizawa et al. examined surgical specimens of mixed histological-type GCs and found that the frequency of lymphovascular invasion and risk of lymph node metastasis was higher in mixed-type GC than in pure differentiated-type GC, which indicates the higher malignancy of mixed type [6].

Although the biological characteristics correlate with morphological diversities during GC progression [3, 7], histological classifications are not always sufficient to explain the high complexity of GCs. Tumor heterogeneity can also be generated due to genetic alterations, epigenetic abnormalities, interactions between tumor cells and the microenvironment, and interactions between different tumor cell populations [8]. In recent years, there has been outstanding progress in the elucidation of the genomic landscape of GCs due to technical advances and the efforts of international research consortiums such as The Cancer Genome Atlas (TCGA) Research Network [9]. The identification of different molecular subtypes of GCs represents an advance towards the goal of personalized medicine. Nevertheless, in current clinical practice, the choice of therapeutic strategy against GCs still does not regularly consider molecular heterogeneity and is mostly based on tumor stage. Considering the well-known intratumoral heterogeneity in GCs, random tissue submitted for genomic analysis may not fully represent the underlying mechanisms of the whole tumor. Thus, the combined molecular and histological classification defines the pathogenetic mechanism of GCs more accurately. In addition, we raised questions about what molecules would be associated with histological diversities in invasive areas and whether histological features in invasive areas could be predicted preoperatively by mucosal histology or molecular characteristics.

In the present study, we categorized each of the GCs into a mucosal component and invasive component, analyzed the relationship of numbers of histological patterns and composites with clinicopathological parameters and prognosis, and finally searched for GC-associated molecules marking histological diversities.

Materials and methods

Patients and histological evaluation

At first, we collected archival formalin-fixed paraffin-embedded tissues from 1067 consecutive cases who had undergone surgical excision for GC at Hiroshima University Hospital or its affiliated hospitals. 842 GC cases (539 men and 303 women; age range, 31–93 years) were pathologically proven invasive GC with mucosal components, and such cases showing mucosal invasion by retrograde spread from underneath and by lymphatic or vascular permeation were excluded from the present study (Supplementary Fig. 1a–d). All 842 GC cases were histologically reviewed and reclassified by two pathologists (K. S. and W. Y.) based on the latest 15th edition of the Japanese Classification of Gastric Carcinomas. Histological numbers are defined as the numbers of histological patterns within the tumors. The concordance rate for diagnosis between the two pathologists was more than 80%, and when the evaluations differed, a final diagnosis was made by consensus, while the investigators reviewed the specimen with a multi-head microscope. Tumor staging was classified according to the latest 8th edition of the AJCC/UICC TNM classification of malignant tumors. 124 out of 842 GC cases were used for immunohistochemical analysis because of the availability and reliability of their prognostic information. Because written informed consent was not obtained from the patients, identifying information for all samples was removed before analysis to ensure strict privacy protection. This procedure was in accordance with the Ethical Guidelines for Human Genome/Gene Research enacted by the Japanese Government. This study was approved by the Ethical Committee for Human Genome Research of Hiroshima University, Hiroshima, Japan (No. IRINHI66).

Fig. 1
figure 1figure 1

Relationships of histological numbers and composites with patient prognosis. More histological types in the invasive area showed a significantly poorer prognosis (A-C) but not in the mucosal area (D-E). The same result was confirmed only in Stage I GC (F). D > U or U > D type in the invasive area showed a significantly poorer prognosis than D or U type (G) but not in the mucosal area (I). The same result was shown only in T1 GC (I). GCs with gastric mucin or mixed gastrointestinal mucin phenotype in invasive areas showed poorer prognosis than those with other mucin phenotypes (J)

Immunohistochemistry and in situ hybridization

One or more representative tumor blocks, including mucosal areas, invasive areas of the tumor center, lateral and deep invading front, and surrounding non-neoplastic mucosa, from each patient were examined by immunohistochemistry. We performed all of the immunohistochemical analysis on the whole sections. For immunostaining of all markers except EGFR, a Dako Envision Kit (Dako Corporation, Carpinteria, CA) was used according to the manufacturer’s recommendations. The antibodies and their conditions used in the present study are shown in Supplementary Table 1. After endogenous peroxidase activity was blocked with 3% H2O2-methanol for 10 min, sections were incubated with normal goat serum (Dako Corporation) for 20 min to block nonspecific antibody binding sites. The sections were incubated with the primary antibodies for 1 h at room temperature, followed by incubations with peroxidase-labeled anti-rabbit or mouse IgG for 60 min. For immunostaining of EGFR, a Dako EGFR pharmDx™ assay detection system (Dako Corporation) was used. Staining was completed with a 10-min incubation with the substrate-chromogen solution. The sections were counterstained with 0.1% hematoxylin. Appropriate positive and negative control samples were also stained. For Epstein–Barr virus (EBV), slides were stained on an automated slide-staining system (Benchmark XT; Ventana Medical Systems, Inc., Tucson, AZ). A ready-to-use EBER (EBV-encoded small RNA) probe (Ventana/Roche) was used together with ISH-Protease 3 pretreatment for 1 h and probe incubation of 28 min.

Evaluation of positive cases

Immunostaining results were evaluated independently by 3 investigators (K. S., T. I. and G. K.), and when the evaluations differed, a decision was made by consensus, while the investigators reviewed the specimen with a multi-head microscope. Neoplastic tissue was evaluated semiquantitatively at magnifications of × 100 and × 400. The cutoff-point for antibody reactivity necessary to define a result as positive is shown in Supplementary Table 1. HER2 or c-MET staining was scored as previously reported [10] and was considered to be positive for overexpression if the staining was scored as 2 + or 3 + . E-cadherin and ARID1A expression was semiquantitatively scored on a scale of 0 (no staining), 1 + (weak), 2 + (moderate) to 3 + (strong; equivalent to normal epithelium or stromal cells of the stomach). A score of 2 + or less was regarded as loss or reduction [11, 12]. MLH1 or MSH2 was considered to be a loss or reduction of nuclear reactivity together with a positive background reaction in non-neoplastic epithelial or stromal cells.

Table 1 Relationships of histological numbers in mucosal and invasive areas with clinicopathological parameters in 842 GCs

Mucin phenotypes of GCs and molecular subtypes using surrogate markers

We evaluated 124 GCs according to the criteria for classification of mucin phenotypes [13]. Furthermore, these 124 GC cases were divided into four molecular subtypes according to a previous report [14]. The microsatellite-instable (MSI) subtype was defined as tumors showing a loss or reduction of MLH1 of more than 30%. Strong nuclear staining of p53 of more than 50% was used as a surrogate for the chromosomal instability (CIN) subtype, as they frequently showed a TP53 mutation [9]. In addition, overexpression of tyrosine kinase receptors such as HER2, EGFR or c-MET was used as a CIN subtype. The remaining cases showing loss or reduction of E-cadherin were categorized as genomic stability (GS). EBV subtype indicated EBER-positive tumors.

Statistical methods

Analysis of clinicopathological parameters or immunohistochemical staining was performed using the Chi squared test. The Kaplan–Meier method was used to estimate the overall survival, and the difference in survival between groups was examined with a log-rank test. Univariate and multivariate Cox regression was used to evaluate the associations between clinical covariates and survival as described previously [15]. A p value of < 0.05 was considered statistically significant. The SPSS software program (SPSS Inc., Chicago, IL, USA) was used for all statistical analyses.

Results

Clinicopathological characteristics associated with histological numbers

Among the 842 GC cases, histological components are listed in the descending order of their proportion in the mucosa and invasive areas, and the histological combination of both areas totaled 344 patterns (Supplementary Fig. 1a–c, Supplementary Table 2). The most predominant histology of mucosal areas showed a concordance with those of corresponding invasive areas except for signet-ring cell carcinoma most likely turning into poorly differentiated adenocarcinoma (Supplementary Table 3). Histological numbers ranged from 1 to 4 in the mucosal areas and from 1 to 5 in the invasive areas (Supplementary Table 4). Table 1 shows the relationship between the clinicopathological factors and histological numbers. GC cases with more histological numbers in invasive areas showed significantly higher T grade and staging, whereas those in mucosal areas did not show any significant associations. Furthermore, GC cases with a greater increase in histological numbers in invasive areas than mucosal areas showed significantly higher T grade, N grade, M grade and staging (Supplementary Table 5).

Clinicopathological characteristics associated with histological composites

The 842 GC cases were classified into four subgroups according to their histological features: a pure differentiated component with no undifferentiated component (D type); a pure undifferentiated component with no differentiated component (U type); and a histologically mixed type that consisted of both differentiated and undifferentiated components (mixed type). The histological mixed type included the differentiated-predominant mixed type with the undifferentiated component making up less than or equal to 50% (D > U type) and the undifferentiated-predominant mixed type with the undifferentiated component making up more than 50% (U > D type). The 842 GC cases were divided into 445 D type, 142 U type, 144 D > U type and 111 U > D type in the mucosal areas, and 214 D type, 214 U type, 224 D > U type and 190 U > D type in the invasive areas. Characteristically, 235 (53%) of the 445 D-type GC cases in mucosal areas displayed U type, D > U type or U > D type histology in the invasive areas (Supplementary Table 6). Supplementary Table 7 shows the relationship between the clinicopathological factors and histological composites. D > U type or U > D type showed higher T grade, N grade and staging than U type in the invasive areas, whereas these types were only associated with higher N grade in the mucosal areas (Supplementary Table 8). In addition, D > U type or U > D type was characteristically observed in patients aged over 65, whereas U type was more frequent in patients aged less than 65. The male and female ratios gradually decreased significantly from D type to U type via D > U type or U > D type.

Relationships of histological numbers and composites with patient prognosis

We examined the relationships of histological numbers and composites with overall survival in 124 GC cases. The prognosis of patients with more histological numbers in the invasive areas was significantly worse (N = 1 vs 2 ≤ , p = 0.0006; N = 1–2 vs 3 ≤ , p < 0.0001; N = 1–3 vs 4 ≤ , p = 0.0228) (Fig. 1a–c), whereas there was no significant difference in mucosal areas (Fig. 1d–e). The same result was also confirmed only in stage I GC (Fig. 1f). Furthermore, in the 124 GC cases, the prognosis of D > U or U > D type (mixed type) was significantly poorer than that of D or U type (pure type) in invasive areas (Fig. 1g) but not in mucosal areas (Fig. 1h). The same result was shown only in T1 GC (Fig. 1i). Therefore, we performed the following analysis of histological composites using the classifications of mixed type and pure type. Table 2 shows the relationships of histological composites in mucosal and invasive areas with clinicopathological parameters in 842 GCs. Indeed, the results with the classifications of mixed type and pure type displayed almost the same tendency as those with four subgroups.

Table 2 Relationships of histological composites in mucosal and invasive areas with clinicopathological parameters in 842 GCs

In a univariate analysis, more histological numbers and mixed type in invasive areas showed a significantly poorer prognosis (Table 3), but they were not independent factors in the multivariate analysis of the 124 GCs from stage I to IV. However, when the GCs were limited to stage II and III, more histological numbers were an independent factor of poor prognosis (N = 1–2 vs 3 ≤ , p = 0.0325).

Table 3 Univariate and multivariate Cox regression analysis of histological numbers and composites with overall survival in 124 GCs

Cancer-related molecules and relation of infection status with more histological numbers and mixed types in invasive areas

To clarify the molecular characteristics of GC with more histological numbers and mixed types, we investigated their association with various cancer-related molecules in invasive areas (Table 4 and Supplementary Table 1). In the 124 cases examined, molecules of each of the cancer stem cell (CSC) markers were detected as follows: CD44 in 56 (45%) cases, CD133 in 21 (17%) cases and ALDH1 in 78 (63%) cases, and the expressions of these molecules were significantly related with more histological numbers or mixed type. In addition, molecules of each of the tyrosine kinase receptors were detected as follows: HER2 in 28 (23%) cases, EGFR in 24 (19%) cases and c-MET in 24 (19%) cases, and the expressions of all of these molecules were associated with mixed type. In 5 of 19 mixed-type HER2-positive GCs, HER2 expression was observed in both differentiated and undifferentiated components. Other molecules such as MMP-7, laminin 5γ2 or E-cadherin were also associated with more histological numbers or mixed type. Furthermore, H. pylori infection was significantly associated with both more histological numbers and mixed type, whereas EBV infection did not correlate with either of them. Meanwhile, GCs with gastric mucin or mixed gastrointestinal mucin phenotype in invasive areas showed poorer prognosis than those with other mucin phenotypes (p = 0.0436) (Fig. 1j). They were also significantly associated with more histological numbers and mixed histological type, although each mucin-associated molecule was not associated with either of them.

Table 4 Relation of histological numbers and composites in invasive areas with cancer-related molecules or infection status in invasive areas of 124 GCs

Histological characteristics and cancer-related molecules in mucosal areas predicting more histological numbers and mixed types in invasive areas

There were significant associations between more histological numbers and mixed type in both the invasive and mucosal areas (p < 0.0001). More histological numbers in mucosal areas was also significantly related more histological numbers and mixed type in invasive areas. Mucosal mixed type was associated with invasive mixed type (Supplementary Table 9). Next, we analyzed cancer-related molecules that are predictive of histological diversities in invasive areas from the immunohistochemical expression of mucosal areas (Table 5). As a result, the mucosal expressions of CD44, HER2, c-MET, laminin 5γ2 or retained E-cadherin were significantly associated with more histological numbers and mixed types in invasive areas. There were no significant correlations between mucosal mucin phenotypes and histological diversities in invasive areas.

Table 5 Relation of histological numbers and composites in invasive areas with cancer-related molecules in mucosal areas of 124 GCs

Molecular subtypes related with histological diversities and other cancer-related molecules

Finally, we analyzed the molecular subtypes marking histological diversities. The 124 GC cases were putatively classified into five molecular subtypes based on the expression pattern of surrogate markers: 51 cases (41%) of CIN, 29 cases (23%) of GS, 13 cases (10%) of MSI, 6 cases (5%) of EBV and 25 cases (20%) of the others subtype (Table 6). The molecular subtypes of each predominant histology included poorly differentiated adenocarcinoma in GS and EBV and well- or moderately differentiated tubular adenocarcinoma in MSI. The CIN subtype of GC showed significant associations with both more histological numbers and mixed histological type, whereas the GS subtype of GC showed a significant relationship with pure type (Table 6). In the analysis of the relationship between molecular subtypes and other cancer-related molecules, CSC markers such as CD133 and ALDH1, and VEGF-A were characteristically overexpressed in CIN subtype, and conversely more infrequent in GS subtype compared with other subtypes. Loss or reduction of ARID1A or PD-L1 overexpression was frequent in the MSI and EBV subtypes. In addition, H. pylori infection was observed in the CIN subtype but was infrequent in the GS subtype (Supplementary Table 10).

Table 6 Relationships of molecular classification in invasive areas of 124 GCs with histological numbers and composites using surrogate markers

Discussion

In the present study, we analyzed the relationship of histological numbers and composites in GCs with clinicopathological parameters and prognosis from morphological and molecular standpoints. As predicted, GCs with more histological numbers and mixed type in invasive areas showed higher stage and poorer prognosis, which accorded with the results of previous reports [6, 16]. In elderly patients, the differentiated type predominates in early GCs, but the histologic diversity increases with progression to advanced cancer, resulting in higher proportions of undifferentiated type [17]. In the present study, histological diversities were increased more significantly in the elderly patients aged over 65, and the male and female ratios gradually decreased significantly from D type to U type through D > U type and U > D type, similar to other previous reports [18, 19].

Cancer is the result of the accumulation of multiple molecular abnormalities that only occur and perpetuate in long-lived and self-renewing cells such as adult stem cells [20]. A previous study in mice revealed that the induction of genetic or epigenetic events specifically in gastric stem cells can lead to the development of GCs [21]. This malignant transformation of gastric stem cells also occurs as a result of H. pylori infection and CD44 induction [22,23,24]. In the present study, the expression of CSC markers and H. pylori infection were significantly associated with more histological numbers and mixed type in invasive areas. Previous studies showed that several GC-related molecules could be useful as an adjunct to TNM staging [25, 26]. However, GC heterogeneity appears to have had a negative impact on the response to therapies against HER2, FGFR and EGFR and potentially CCND1 and MYC [27]. The present study showed that overexpressions of tyrosine kinase receptors such as HER2, EGFR or c-MET were all associated with mixed type in invasive areas. The extent of the heterogeneity of HER2 expression within primary GCs varied widely across studies, from 5 to 69%, and those between primary GCs and metastatic sites also varied [8]. Regarding the mixed type of GCs, some authors reported that HER2 expression was restricted to the intestinal component [28], whereas others observed HER2 expression in both intestinal and diffuse components, supporting the hypothesis of clonal origin [29]. The present study also showed that HER2 expression was observed in both components. Several reports disclosed that the two components of the mixed type of GCs displayed similar DNA copy changes and CpG island hypermethylation status [30, 31] and had a higher number of methylated genes than pure diffuse or intestinal GCs [31]. These results indicate that the mixed type of GCs might emerge from a clonal origin.

The discrepancies between endoscopic biopsy and postoperative histology range from 25 to 53%, probably because the endoscopic biopsy samples only a small amount of the cancerous lesion, and there is much difference in tumor microenvironment interaction between mucosal and invasive areas [32,33,34,35]. Previous reports showed that HER2 expression is more frequent in the luminal tumor layers rather than deeper ones [36] and that HER2 assessment in biopsy tissues might predict the HER2 status of the whole tumor [37]. In addition to HER2, we showed that CD44, c-MET, laminin 5γ2 or retained E-cadherin were also useful markers predicting more histological diversities in invasive areas from endoscopic biopsy specimens. CD44 and c-MET are also the target molecules for gastric tumor-initiating cell-targeted therapeutic strategies, and their expressions are associated with chemoresistance and poor prognosis [35].

Genomic instability has often been proposed as a major driver of intratumoral heterogeneity and as a potential marker of its extent. Raynaud et al. reported that tumors with the greatest intratumoral heterogeneity exhibited high numbers of both clonal and subclonal mutations, suggesting that genomic instability emerged early and was sustained and fostered during tumor evolution [38]. They also revealed that this type of molecular phenotype was common in the CIN subtype of GC and in cancers of the lung, bladder and head and neck, and skin melanoma [38]. In the present study, the CIN subtype showed significant associations with both more histological numbers and mixed histological type, whereas the GS subtype showed a significant relationship with pure type. Loss or reduction of ARID1A and PD-L1 expression was frequently observed in the MSI and EBV subtypes, similar to the results of a previous report [39].

There is very limited information on the nature and the mechanisms of intratumor morphological heterogeneity in human cancers. Epithelial–mesenchymal transition (EMT) is reported to induce a variety of malignant features in cancer cells, such as cancer stemness, chemoresistance, immune evasion, altered metabolism, and blocked senescence [40]. EMT and CSC were reported to be involved in the role of morphological diversity of invasive breast carcinomas [41]. The changes in cell adhesion occurring within EMT might lead to morphologically different architectural arrangements of tumor cells. In the present study, we showed the relationship of histological diversities such as histological numbers and composites with clinicopathological significance and identified characteristic molecules marking these histological diversities. Our study is limited by its retrospective design and the small number of cases with each tumor molecular subtype. We hope that the present data can contribute to the early diagnosis and prevention, and effective treatment of GC.