Introduction

The epithelial–mesenchymal transition (EMT) was originally defined by developmental biologists as a morphological conversion occurring at specific sites in embryonic epithelia to give rise to individual migratory cells [1]. EMT is a fundamental process in the development of most metazoans and is primarily involved in the shaping of embryos. In mammals, EMT has been associated with the formation of the parietal endoderm [2]. It is also directly involved in the formation of the mesoderm and definitive endoderm at the primitive streak during gastrulation [3]. Neural crest cells emerge from the dorsal neural epithelium through EMT before undergoing extensive migration and differentiation into many derivatives [4]. EMT has also been implicated in the ontogeny of other structures including somites and heart endocardium [5].

Although EMT was recognized in the late nineteenth century, it is only recently that some of the molecular mechanisms in the developing embryo have been revealed. Most current studies are performed in vitro with epithelial cell lines, which can be converted into fibroblast-like cells under specific culture conditions. However, not all normal or malignant cell lines share all characteristics with embryonic epithelia. In many instances, epithelial cell lines can be refractory to EMT, perhaps owing to the inaccessibility of scatter factors or to intrinsic inhibitory mechanisms. Conversely, culture conditions do not always allow epithelial cells to achieve full polarity and can facilitate dispersion. The definition of an EMT, and the requirements to execute one, in vitro are at variance with those in vivo and therefore cannot exactly recapitulate these events. It is therefore not surprising to find studies differing in their stringency for the various criteria for defining an EMT.

The current criteria for defining an EMT in vitro [1, 6] involve the loss of epithelial cell polarity, the separation into individual cells and subsequent dispersion after the acquisition of cell motility. EMT presumably involves the disassembly of tight junctions, adherens junctions and desmosomes as well as the reorganization of cell substrate adhesion complexes. After the loss of cell polarity, the cytoskeleton is significantly remodeled. A shift from cytokeratin intermediate filaments to vimentin is considered to be an important criterion for EMT, although vimentin is not necessarily a reliable marker of mesenchymal cells (transcription of the vimentin gene is particularly sensitive to serum components). The epithelial and mesenchymal phenotypes also show particular transcription profiles including cytoskeletal components and extracellular matrix components. It is likely that several other types of protein will be found to be associated with EMT in only one of the two states. This minireview addresses the question of whether EMT can be involved in breast cancer progression and whether these tumor types can benefit from a molecular understanding of EMT [7].

EMT signal transduction pathways

EMT can be induced in vitro in several epithelial cell lines by growth factors activating tyrosine kinase surface receptors. These factors include scatter factor/hepatocyte growth factor, fibroblast growth factors, epithelial growth factor (EGF) family members, and insulin-like growth factors 1 and 2 [7]. For instance the human 184 breast cell line can respond to EGF [8]. In most cases the Ras/mitogen-activated protein kinase (MAPK) pathway has a crucial role in inducing EMT, but in some cell lines the transient activation of Src, phosphoinositide 3-kinase (PI3K) and Rac has an effect on particular aspects of EMT. At least two normal breast cell lines (NMuMG and EpH4: two normal murine mammary cell lines) and other lines respond specifically to transforming growth factor-β (TGF-β), although one of them (the EpH4 line) undergoes EMT when expressing the oncogenic H-Ras [911]. MAPK and PI3K have also been implicated in TGF-β signaling and more direct signalling, both through the conventional Smad pathways and other as yet unknown pathways [12]. These signaling pathways, and particularly the cooperativity between Ras and TGF-β signalling, are also observed in the EMT process in squamous carcinoma of the skin, which can progressively acquire a fibroblast-like morphology in later stages of the disease [13]. The classical TGF-β signaling pathway leading to inhibition of cell growth or even the induction of apoptosis is abrogated by activated Ras in part through Raf/MAPK or PI3K activation. In addition, it has been clearly shown in the skin carcinogenesis model that nuclear accumulation of Smad2 by oncogenic Ras is required for progression towards the spindle cell carcinoma stage.

ECM components, including collagens and laminin 5, can also induce EMT in some cell lines [14]. Current research is aimed at identifying gene targets within these pathways. One class of potential targets is the metalloproteases, which are important in morphogenesis. Their function in mammary gland branching morphogenesis [15, 16] and tumor formation has been described recently [17]. Interestingly, only the expression of stromelysin 1 (matrix metalloprotease [MMP]-3) in the Scp2 mammary cell line is sufficient to induce EMT in vitro and tumorigenesis in vivo [18]. The targeted expression of MMP3 in the mammary gland can induce premalignant and malignant lesions, forming poorly differentiated tumors with a fibroblast-like morphology in a proportion of the carcinoma cells [19].

These data suggest that EMT is involved in early or later steps of breast malignancy development. However, breast malignancies are a heterogeneous group of different morphological entities. Pathological characteristics of breast carcinomas need to be defined so that the phenotype can be correlated with the molecular alterations observed in vitro or in animal models.

Pathological characteristics of breast malignancies

Epithelial tumors (carcinomas) are the most frequent type of breast tumor. The first step in the histological study of breast carcinomas is to determine whether the tumor is confined to the glandular component of the organ (in situ carcinoma) or whether it invades the stroma (invasive carcinoma). Then, the histological subtype within each category is determined and prognostic indicators such as stage, grade and the presence of vascular invasion are assessed. The stage of the tumor is evaluated according to its size and the presence of axillary lymph node metastases. The grade of a carcinoma is an estimation of its differentiation.

The grading of breast carcinomas is based solely on the invasive portion of the tumor. The grading procedure involves a three-tiered system for describing the tumor structure. The first criterion is the evaluation of tubule formation. The absence of tubules with only cord sheets and isolated cells might reflect an incomplete EMT process. The second criterion is the nuclear grade assessed by comparison with the sizes of nuclei in normal cell and between carcinoma cells. The third criterion is the mitotic count expressed as the number of mitoses per 10 high-magnification fields [20].

The term invasive carcinoma encompasses numerous entities differing from each other by morphological characteristics related to the degree of differentiation and the organization of the cells.

The most common form (75% of cases) is ductal invasive carcinoma. The cells can grow in irregular or rounded sheets, or nests or cords of solid clusters of cells, frequently interspersed with isolated cells. The presence of gland lumens, poorly or well formed, reflects the degree of differentiation of the tumor. Those glands lacking myoepithelial cells are not delimitated by a basement membrane. The amount of stroma and in particular the amount of the inflammatory infiltrate varies considerably between individual ductal invasive carcinomas. An association with in situ carcinoma is observed in almost 70% of cases [2123].

The second large category of invasive carcinoma is the group of lobular carcinomas, which account for 10–15%. These tumors are composed of carcinoma cells, isolated or organized in single file or narrow cords, usually with an abundant surrounding stroma. Gland formation is not a feature of classical infiltrative lobular carcinoma. An in situ component is associated with this form in nearly 90% of cases [24]. Additional patterns of infiltrative lobular carcinomas have been recognized: they have different architectural and cytological patterns but cohesive cells. These patterns include the solid pattern, composed of large nests of closely packed but non-cohesive cells separated by thin vascular channels, the alveolar pattern characterized by clusters of 20 cells arranged in round nests separated by small amounts of stroma, and the pleomorphic pattern composed of cells showing a higher degree of nuclear atypia and more mitotic figures [23].

In addition to the ductal and the lobular types of breast carcinoma, other rare types have been described, each accounting for less than 5% of the total. The group of metaplastic carcinomas is of particular interest with regard to EMT. This is because these carcinomas are believed to be of epithelial origin. There are two main categories: one composed of intermixed cells with epithelial morphology but exhibiting a glandular differentiation associated with a squamous metaplasia, and the second composed of epithelial cells with glandular differentiation tightly mixed with non-epithelial cells. The non-epithelial cells are composed of spindle shape cells, bone and cartilaginous cells or both. The spindle cells often seem to be merged with epithelial cells, yet they represent the majority cell type characterized by morphological heterogeneity. Cytokeratin and more specifically high-molecular-mass cytokeratins and vimentin are observed within both the spindle cell and epithelial cell components [23].

Some other rare types of breast carcinoma are associated with a more favorable clinical outcome. These are generally well-differentiated tumors such as tubular carcinoma composed of angular glands lined by a single layer of cylindrical cells; mucinous carcinoma, a tumor that produces large amounts of extracellular mucus; and cribriform carcinoma composed of large clusters of cells with glandular differentiation [25]. Some other lesions show an undifferentiated morphology. An example is the medullary carcinoma characterized by large syncytial sheets of large cells with atypical nuclei and high rates of mitosis, mixed with an abundant inflammatory stroma [26].

This non-exhaustive overview of the pathology of breast carcinomas illustrates the broad diversity of the morphological aspects of breast tumors. This diversity is in part related to the differentiation state of the carcinoma cells. The prognosis is evaluated from the pathological type, the stage and the grade of its tumor and this is used to determine the individual therapeutic scheme. Unfortunately, the evaluation of prognosis is still inaccurate. A better knowledge of the EMT pathways and of the genes involved in breast carcinomas might be of great value in improving our understanding of these tumors and consequently allowing a more reliable prognosis for patients.

Involvement of EMT in breast carcinoma

Loss of heterozygosity at 16q22.1 is relatively frequent in breast carcinoma, implicating E-cadherin as a tumor suppressor gene. E-cadherin, the prototype type 1 epithelial cadherin, has been studied extensively in EMT. In vivo, E-cadherin is downregulated specifically at sites of EMT such as gastrulation in Drosophila and in several vertebrates including the mouse. Numerous studies have described a partial or complete loss of E-cadherin during carcinoma progression, which is correlated with an unfavorable prognosis [27, 28] and confirming that E-cadherin is a caretaker of the epithelial state. Several distinct mechanisms of E-cadherin downregulation have been described. Mutations have been found in the E-cadherin gene in about 50% of lobular carcinomas of the breast [29]. Most mutations lead to non-functional proteins. In accordance with the Knudson two-hit hypothesis, most of these mutations are found in tumors with loss of heterozygosity (LOH) of the E-cadherin wild-type locus. A recent study examined in depth whether the two-hit hypothesis applies to sporadic ductal invasive carcinoma of the breast [30]. No mutations were found in this series, confirming previous studies. However, LOH in the E-cadherin locus was not significantly associated with hypermethylation in the other allele, suggesting the existence of other mechanisms for E-cadherin gene extinction.

Genetic screening in Drosophila led to the identification of snail, a gene involved in the control of gastrulation [31]. snail transcripts are specifically expressed in invaginating mesodermal cells just before their EMT. Snail and a closely related member of this zinc finger transcriptional repressor, named Slug, have also been found in vertebrates. There is convincing evidence that Slug is required for gastrulation and neural crest emigration in Xenopus and in the chick embryo, and that Snail has a similar role in the mouse [32]. The important discovery was made that Snail can downregulate transcription of the E-cadherin gene through its interaction with E boxes in the proximal region of the promoter [33]. Slug can also bind to the same region of the promoter, although with lower affinity (A Cano, personal communication). Other transcription factors also inhibit the transcription of E-cadherin genes: an example is the zinc finger protein SIP1, a downstream target gene in the TGF-β-mediated induction of EMT in the NMuMG cell line [34]. Snail expression has been analyzed by in situ hybridization in breast carcinomas and compared with that of E-cadherin. Snail is expressed mostly in dedifferentiated tumors and is correlated with grading. In heterogeneous tumors, Snail is expressed in carcinoma cell islands devoid of E-cadherin. It is found in all ductal invasive carcinomas with lymph node involvement. However, Snail was not found in the small number of lobular carcinomas investigated by Blanco et al. [35]. Another study reports Snail expression in tumors in which the E-cadherin promoter is hypermethylated rather than in tumors with LOH in the E-cadherin locus [30]. Another recent study analyzed the roles of the various E boxes in the control of E-cadherin transcription: the findings stress the importance of E-box C and thus are at variance with previous studies. In addition, in breast cell lines, Slug expression is more tightly correlated than Snail with a lack of E-cadherin expression [36].

Clearly there is a need for more extensive analysis of breast tumor samples, but significant progress has already been made toward understanding one key aspect of EMT in breast cancers. The phenotype of breast cancer micrometastases in lymph nodes and in the bone marrow suggests that EMT occurs within the primary tumors [37]. Furthermore, if these cells are at the origin of secondary tumors, it would indicate that there is a potential reversal of the phenotype because E-cadherin can be re-expressed in the metastatic lesion, which has a generally more differentiated phenotype than the primary tumor [38, 39]. This is indeed consistent with the notion that E-cadherin is regulated mostly by epigenetic mechanisms. It also suggests that LOH at one locus, together with a mutation or a definitive extinction of transcription at the other locus, explains only a small fraction of all breast carcinomas.

There are several breast carcinoma fibroblast-like lines that express the classical type 1 N-cadherin [40] and possibly some type 2 cadherins. The expression of N-cadherin de novo in breast carcinoma cells induces an EMT [41]. The mechanism by which N-cadherin can overcome the maintenance of the epithelial state by E-cadherin is not known, although a domain in N-cadherin essential for this effect has been identified [42]. E-cadherin can be down-regulated by other mechanisms including the recently described mechanism of ubiquitination and endocytosis of E-cadherin in epithelial cells stimulated by EMT-inducing growth factors [43].

Concluding remarks

Epithelial cell plasticity is a major feature of embryonic development. Epithelial cell intercalation during the convergence–extension movement, or during the process of cavitation and branching morphogenesis, implies local phenotypic changes in the cells undergoing these processes. The development of the mammary gland probably employs these mechanisms. EMT is one of the most drastic aspects of epithelial cell plasticity. Some of the molecular programs of EMT might be involved in the development of the mammary gland, particularly at terminal end buds or possibly in lateral branching [44]. However, EMT is also likely to be important in tumor progression. One of the best markers of EMT associated with breast cancer is the loss of E-cadherin, and this is controlled in part by Snail family members, as is EMT associated with development. Other transduction pathways might be found in the mammary tumors in which TGF-β or tyrosine kinase surface receptor ligands are produced.

Recent DNA chip profiling has already proved powerful for tumor classification: a subset of breast carcinomas display molecular markers of the basal cell phenotype [45] associated with myoepithelial cell differentiation [46]. This subgroup has the poorest prognosis of all the groups studied as assessed by hierarchical clustering [45]. These findings do not preclude the possibility that these tumors contain myoepithelial cells rather than carcinoma cells with basal characteristics. A recent study shows that murine HC11 clones with basal cell characteristics acquire a motile phenotype in vitro and invasive properties in vivo when exposed to EGF [47]. The plasticity of human breast carcinoma has also been investigated in vitro by establishing cell lines and assaying them for their tumorigenic properties. At least one particular line has a myofibroblastic phenotype, suggesting the intriguing possibility that in some cases the myofibroblasts in the stroma of breast tumors might be derived from the carcinoma cells [48].

A subset of genes, rather than a particular single gene, can be used as a prognostic marker, and this approach shows great promise for predicting metastatic outcome [49]. Various laboratories are currently trying to identify genes more specifically associated with EMT, tumor invasion and tumor metastasis, either with cell line models or with tumors of diverse grades and stages. It is expected that new candidate genes will be validated in the near future and that we will learn significantly more about the effects of EMT on the progression of breast carcinomas.