Methodology to analyze gene expression patterns of early mammary development in pig models

In mammary gland development, normal stem cell activity occurs in the embryonic stage and postnatally. Research supports that certain breast cancers contain a small sub-population of cells that mimic stem-like activity. It is believed stem cell activation in the mutated mature human mammary tissue is what drives quiescent epithelial cells to convert to mesenchymal states initiating migration, invasion, and metastasis in breast cancer. The goal of the work reported herein was to investigate early mammary development gene expression in the postnatal pig using fine needle biopsy methods in order to establish a reliable model for human breast cancer detection. Tissue samples were collected from pig mammary glands beginning at Day 11 of age through Day 39 in order to capture early postnatal-growth gene expression. Based on the initial clustering analysis, two distinct clusters of gene expression profiles occurred before and after Day 25 of mammary development. Gene set enrichment analysis (GSEA) ontology indicated the cellular processes that changed after Day 25, and many of these processes were implicated in epithelial–mesenchymal transition (EMT) signaling events. Gene expression in the postnatal pig was compared with the Epithelial–Mesenchymal Transition gene database (dbEMT) confirming the presence of EMT activity in this early developmental program. Information from this study will provide insight into early postnatal mammary gland development. In addition, mechanisms exploited by mutated mammary epithelial cells leading to cancer initiation and growth may be detected considering that mutated mammary epithelial cells can reactivate early developmental signals.


Introduction
Due to ethical considerations and minimal tissue acquisition from young human mammary glands, our current knowledge of human breast development has been limited. Therefore, interpretation of developing human breast morphology is based on analogies in the mouse. Additionally, cancer research progress has been hampered by lack of anatomically and physiologically relevant animal models, because large animal models more closely recapitulate human cancer and diseases [1]. Transitional animal models could bridge the gap between diagnostic discoveries and human clinical trials. The pig is currently advancing as a transitional model bridging the gap from mouse studies to human trials.
Comparing the morphology of developing mammary glands, humans and pigs develop terminal ductal lobular units (TDLU) and epithelial proliferation is concentrated in the endbuds or TDLU's. The mouse morphology has sparse ducts and alveolar tissue and epithelial proliferation occurs in ductal endbuds and alveoli [2]. This research report promotes the use of an innovative procedure (Dunn Biopsy) by collecting mammary tissue from the pig weekly beginning at 11 days of age and ending at 39 days of age. This methodology is described in the pending U.S. Provisional Patent No. 62/256,416 titled Dunn Biopsy: A method of fine needle biopsy of mammalian tissue over time as a diagnostic tool. This biopsy technique provides a representative sample of tissue that was not altered due to genetic modifications, introduction of exogenous cells, or surgical techniques. This technique allows for repeated sampling from the same subject over time eliminating genetic variability from different individuals. Due to this naturally occurring state, our model provides an alternative analogy of breast development morphology and gene expression.
Previous studies indicate that stem cell activity which occurs in early postnatal development may be reactivated in the mutated human adult mammary gland as a trigger to initiate cell proliferation in early breast cancer [3][4][5]. These mechanisms involved in early postnatal growth period could provide insight into the initiation of quiescent cells in the mature human adult. In developing tissue, epithelial cells undergo a transdifferentiation process known as the epithelial-mesenchymal transition (EMT) where they alter cell states from epithelial to mesenchymal [6]. The reverse program, mesenchymal-epithelial transition (MET), occurs during development to allow the formation of distinct tissues [6,7].
Early-stage tumors exhibit epithelial phenotypes, and highly aggressive tumors display mesenchymal features; therefore, the EMT-MET phenomenon can be linked to tumor progression and malignancy. Since cancer cells can interconvert between phenotypes of epithelial or mesenchymal states [8] design of research strategies have been limited. Given the similar phenotypic attributes of the EMT-MET program in developing tissue and epithelial-based cancers, it is plausible that understanding biological mechanisms in natural, transdifferentiating states may provide critical information regarding initiation and metastasis.
By evaluating cellular mechanisms in their natural environment, the Dunn Biopsy technique could establish a comprehensive database allowing investigators to disentangle neonatal mammary developmental processes. The focus of this research was to evaluate gene expression patterns of mammary development in postnatal pigs using a fine needle biopsy method. This information may provide insight into mechanisms exploited by mutated mammary epithelial cells leading to cancer initiation and growth.

Animals
Animals were maintained in certified Association for Assessment and Accreditation of Laboratory Animal Care facilities (AAALAC), and animal handling adhered to the Institutional Animal Care and Use Committee (IACUC) approved protocols and guidelines. Three Yorkshire/Berkshire cross-bred gilts were used for this study and housed with sow and litter mates until weaning. At 21 days of age, the three animals were moved to a nursery where they remained through the final sampling.

Dunn biopsy procedure
Tissue samples were collected from pig mammary glands using the Dunn Biopsy method beginning at 11 days of age continuing weekly through Day 39. Mammary glands chosen for each biopsy procedure were randomized to prevent selection bias and remove differences in the mammary chain of pigs. Animals were manually restrained for preparation and biopsy procedure by holding the animal in a prone position on a cushioned surface. The mammary gland was triple washed with chlorhexidine and iodopovidone prior to administering 2 ml of lidocaine at the sample site. Mammary tissue was collected with a 14 g biopsy needle using the Dunn Biopsy technique (U.S. patent application number 62/256,416). Tissue samples were cut in half and stored in 10% neutral buffered formalin for immunohistochemistry and RNAlater (Sigma Aldrich) for RNA isolation and sequencing.

Histology preparation
Samples placed in 10% neutral buffered formalin remained at room temperature overnight followed by ethanol dehydration, xylene clearing and paraffin embedding (FFPE). Blocks were sectioned at 6 μm for hematoxylin and eosin (H&E) staining in order to confirm samples collected were representative of mammary parenchyma and to evaluate tissue morphology.

RNA isolation, library preparation, sequencing
Samples in RNAlater (Sigma Aldrich) were stored at − 20 °C until RNA isolation. Biopsy samples were prepared using Qiagen TissueRupter, treated with Qiagen on-column DNase, and RNA was isolated using Qiagen RNeasy kit. Samples were individually normalized and an Illumina compatible sequencing library was prepared robotically on a Microlab STAR with Illumina's TruSeq stranded total RNA library prep kit. The libraries were assessed for size on a 2100 Agilent Bioanalyzer and sequence data collected on an Illumina HiSeq2500. Each sample was sequenced to a depth of at least 22 million reads. Quality metrics, using FastQC [9] were run on all samples. Trimming of low-quality bases was carried out with Trimmomatic [10]. Trimmed reads were aligned to the Sus scrofa 11.1 genome using STAR, a splice aware RNA-seq aligner [11]. Transcripts were then quantified using the FeatureCounts [12] tool in the subreads [13] package in R [14]. DESeq [15] was then used for differential expression analysis and gene expression normalization. Gene ontology analysis was performed using GSEA [16,17] software and association with compared to the dbEMT database [18] for a more targeted analysis.

A change in morphology occurs after 25 days of pig mammary gland development
Postnatal time points from Day 11 through Day 39 of age were selected for data collection of mammary gland tissue in order to evaluate early postnatal gene expression. Tissue was collected using the Dunn Biopsy fine needle methodology which allows for genetic expression comparisons from the same animal's tissues at different time points (Fig. 1a). This methodology removes many of the confounding effects that can be caused by collecting tissue from different animals to perform developmental analyses over time. In order to analyze gene expression changes of early pig mammary gland development, RNA-seq was performed on the biopsied mammary tissue (Fig. 1b). RNA extraction quality for all time-points and the percent mapped after trimming are illustrated in Fig. 2a and b respectively.
Clustering of whole transcriptome gene expression analysis indicated a profound change in global gene expression that occurs after Day 25 of pig mammary gland development (Fig. 3a). This is consistent with histological examination where the appearance of TDLU's that were identified in Day 32 and Day 39 (Fig. 3b). Due to this substantial change of genetic expression and morphology prior to and after Day 25 of age, we decided to further probe what processes were altered after this point in development.

Gene expression changes after 25 days of development are related to the epithelial mesenchymal transition
Investigations of mammary gene expression patterns led to the discovery of highly implicated EMT-MET developmental program events. In order to delineate what processes were altered due to noticeable gene expression changes, we performed a Gene Set Enrichment Analysis (GSEA). This analysis revealed several processes which significantly changed after Day 25 of gland development. Noticeably, we found processes down regulated after Day 25 associated with actin cytoskeleton, actin mediated cell contraction, regulation of calcium ion import, and contractile fiber (Fig. 4a). These processes are all implicated in different aspects of breast cancer progression.
In addition to the gene expression changes known to be implicated in breast cancer, there are substantial changes to pathways involved in EMT. For this reason, we decided to further probe genes involved in EMT by analyzing their expression within our dataset. We identified EMT genes by examining those found in the dbEMT. This analysis (Fig. 4b) revealed the overall expression of EMT associated genes a b Fig. 1 Methods for data collection and analyses: a schematic illustration of the Dunn Biopsy methodology sampling from the same animal over time. Biopsy tissue was collected for each time point in RNAlater and 10% formalin for downstream applications. b RNA Seq was mapped using STAR to the Sus scrofa 11 reference genome and quantified using FeatureCounts using the Sus scrofa 11 transcriptome annotation file. The data was then processed for ontologies and changes in expression profile over time replicates results determined from the total gene set (Fig. 3a) where samples after Day 25 are more similar to each other than those earlier in development. This confirmed that changes occurring in the developing pig mammary gland included EMT associated events, which may be involved in driving other gene expression changes. Gene expression changes in EMT-inducing transcription factors were analyzed to identify drivers of EMT related processes. The analyzed genes included CDH1, TWIST, TEAD, SNAI, VEGF, and SOX (Fig. 4c). The two genes with highest levels of upregulation in later time points included PRRX1 and TEAD1. PRRX1 has been shown to promote EMT in cancer, and TEAD1 enhances proliferation in cancer. The two genes with the strongest downregulation included CDH1 and ZEB2. CDH1 encodes E-cadherin which is highly implicated in the EMT pathway. Mutation of this gene leads to predisposition of certain cancers. ZEB2 has been shown to maintain stemness and is associated as an E-cadherin repressor. EMT related transcription identified through this pig development model indicate the pig mammary gland is an ideal candidate as a model system for mammary gland and cancer development.

Discussion
The focus of this research was to evaluate gene expression patterns of mammary gland development in postnatal pigs using the Dunn Biopsy method. To our knowledge, this is the first study that reports genetic expression patterns in young pig mammary glands from the same animals over time. This information may provide insight into mechanisms exploited by mutated mammary epithelial cells leading to cancer initiation and growth.
Puberty in pigs is defined as the first observed standing reflex in the presence of a boar [19]. The pig industry determines a pig has reached puberty as the age when the animal can be successfully bred, which can vary based on the physical composition of the animal [20]. Considering body size and physical attributes contribute to sexual maturation, the average pig would reach puberty between 120 and 150 days of age. Pig growth and development including the estrous cycle have been established, however mammary gland development prior to the onset of puberty has not been reported. Based on the results from this study, there are genetic and morphological changes that occur beginning as young as 25 days of age. The data support early pre-pubertal influences occur in the pig mammary gland based on the presence of TDLU's in animals at 32 and 39 days of age.
Whole transcriptome gene expression clustering analysis indicated a profound change in global gene expression after Day 25 of age which was consistent with the histological appearance of TDLU's after Day 25. This indicates the importance of EMT-MET mediated events in early mammary gland development. Processes that are highly implicated in EMT and cancer progression were identified by GSEA in the pig model, with significant downregulation after Day 25. These processes included actin cytoskeleton, actin-mediated cell contraction, regulation of calcium ion transport, and contractile fiber. The actin cytoskeleton is a primary contributor to cell shape, however remodeling of the actin cytoskeleton is associated with loss of cell-cell contact and linked to poor prognosis in breast cancer [21][22][23]. Actin-mediated cell contraction when downregulated decreases intercellular adhesion usually as a result of the loss of E-cadherin (CDH1). These events increase the opportunity for remodeling and carcinoma migration due to alterations in cell structure and adhesion [24]. The down regulation of calcium ion transport has been implicated in cancer progression due to the imperative need for calcium signaling in normal cellular behavior [25]. Additionally, EMT processes are associated with altered calcium transport [26]. The motile forces in cell migration are dependent on contractile fibers and the deregulation of these fibers are necessary for cell plasticity and motility in EMT and cancer progression [27]. The overall expression of EMT associated genes replicates the results determined from the total gene set where samples collected up to Day 25 are more similar to each other than those later in development after Day 25. These confirmed EMT associated events occurred in a similar pattern when compared with global gene expression in the early postnatal pig mammary model. EMT-related transcription factors implicated in human breast cancer were identified in our model including CDH1, TWIST, TEAD, SNAI, VEGF, and SOX [28][29][30][31][32][33][34]. The two genes with highest levels of upregulation in later time points include PRRX1 and TEAD1. PRRX1 has been shown to promote EMT and is involved in stem-like activities associated in cancer progression [35,36]. TEAD1 is involved in apoptotic resistance in cancer and mediates cell growth and EMT [29,30]. The two genes with the most significant downregulation include CDH1 and ZEB2. CDH1 encodes E-cadherin which is highly implicated in the EMT pathway. Mutation of this gene leads to predisposition of certain cancers [28] and increased EMT mediated events [37]. In addition, the loss of E-cadherin is associated with metastasis and poor prognosis in invasive breast cancer [38]. ZEB2 downregulation has been linked to stem cell properties and EMT events [39] and is known to be an E-cadherin repressor [40]. Given these findings, we report the gene expression patterns in the postnatal pig correlate to EMT mediated events in the human adult. The Dunn Biopsy sampled pig mammary gland serves as a translational model to evaluate EMT and EMT-processes that are implicated in cancer.
Considering 80% of life-threatening cancers are derived from epithelial tissue [41] the information reported herein warrants further investigation. Early stage tumors exhibit epithelial phenotypes and highly aggressive tumors display mesenchymal features [42], therefore the EMT-MET phenomenon can be linked to tumor progression and malignancy. Investigating aggressive subtypes of breast cancer indicates a positive association with mesenchymal features [43]. The EMT process in breast cancer is also linked to potential drug resistance [44] and disease recurrence [45]. Given the similar phenotypic attributes of the EMT-MET program in developing tissue and epithelial-based cancers, it's plausible that understanding biological mechanisms through repetitive biopsied samples in natural transdifferentiating states of mammary development may provide critical information regarding initiation and metastasis.
Experimental models provide worthwhile insight to many human physiological processes including the progression of cancer, however no individual animal model can truly mimic all complex mechanisms. Therefore, different animal models need to be studied to provide additional information regarding development, disease progression, and response to treatment [46]. The pig mammary gland is an ideal alternative model system to study global changes of mammary development in order to evaluate the process of breast cancer progression.

Conclusion
Using the Dunn Biopsy method to evaluate tissue changes over time has provided a new methodology to evaluate gene expression changes in postnatal mammary glands. Based on the high relevance of EMT related activity identified in this study, these results indicate the pig mammary gland is an ideal candidate as a model system for investigating mammary gland development and breast cancer progression. Further studies will continue to evaluate the pig model for gene expression and how mutated cells can reactivate early developmental processes. Application of this technique may provide insight of the initiating steps involved in cancer progression.

3
need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creat iveco mmons .org/licen ses/by/4.0/.