Identification of novel ΔNp63α-regulated miRNAs using an optimized small RNA-Seq analysis pipeline

Sakaram, Suraj; Craig, Michael P.; Hill, Natasha T.; Aljagthmi, Amjad; Garrido, Christian; Paliy, Oleg; Bottomley, Michael; Raymer, Michael; Kadakia, Madhavi P.

doi:10.1038/s41598-018-28168-5

Identification of novel ΔNp63α-regulated miRNAs using an optimized small RNA-Seq analysis pipeline

Article
Open access
Published: 03 July 2018

Volume 8, article number 10069, (2018)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Identification of novel ΔNp63α-regulated miRNAs using an optimized small RNA-Seq analysis pipeline

Download PDF

Suraj Sakaram¹,
Michael P. Craig¹,
Natasha T. Hill¹,
Amjad Aljagthmi¹,
Christian Garrido¹,
Oleg Paliy¹,
Michael Bottomley²,
Michael Raymer³ &
…
Madhavi P. Kadakia¹

2936 Accesses
6 Citations
3 Altmetric
Explore all metrics

Abstract

Advances in high-throughput sequencing have enabled profiling of microRNAs (miRNAs), however, a consensus pipeline for sequencing of small RNAs has not been established. We built and optimized an analysis pipeline using Partek Flow, circumventing the need for analyzing data via scripting languages. Our analysis assessed the effect of alignment reference, normalization method, and statistical model choice on biological data. The pipeline was evaluated using sequencing data from HaCaT cells transfected with either a non-silencing control or siRNA against ΔNp63α, a p53 family member protein which is highly expressed in non-melanoma skin cancer and shown to regulate a number of miRNAs. We posit that 1) alignment and quantification to the miRBase reference provides the most robust quantitation of miRNAs, 2) normalizing sample reads via Trimmed Mean of M-values is the most robust method for accurate downstream analyses, and 3) use of the lognormal with shrinkage statistical model effectively identifies differentially expressed miRNAs. Using our pipeline, we identified previously unrecognized regulation of miRs-149-5p, 18a-5p, 19b-1-5p, 20a-5p, 590-5p, 744-5p and 93-5p by ΔNp63α. Regulation of these miRNAs was validated by RT-qPCR, substantiating our small RNA-Seq pipeline. Further analysis of these miRNAs may provide insight into ΔNp63α’s role in cancer progression. By defining the optimal alignment reference, normalization method, and statistical model for analysis of miRNA sequencing data, we have established an analysis pipeline that may be carried out in Partek Flow or at the command line. In this manner, our pipeline circumvents some of the major hurdles encountered during small RNA-Seq analysis.

Deregulated microRNAs in triple-negative breast cancer revealed by deep sequencing

Article Open access 10 February 2015

Genome-Wide Analysis of c-MYC-Regulated mRNAs and miRNAs and c-MYC DNA-Binding by Next-Generation Sequencing

Expanding the repertoire of miRNAs and miRNA-offset RNAs expressed in multiple myeloma by small RNA deep sequencing

Article Open access 19 February 2019

Introduction

MiRNAs are small non-coding RNAs of approximately 18–22 nucleotides in length that bind to the 3′ UTR regions of target mRNA to translationally repress or degrade them¹. A single miRNA is capable of targeting hundreds of genes and it is estimated that they may regulate over a third of all mammalian genes. Thus, the dysregulation of several miRNAs can have strong biological effects on entire gene networks^1,2. MiRNAs regulate a number of cellular processes that are dysregulated in cancer, such as proliferation, differentiation, apoptosis, motility and invasion. Further, changes in miRNA expression profiles reflect the developmental lineage and differentiation state of cancers, and are thus being used as cancer biomarkers^3,4. In recent years, improvements in Next Generation Sequencing (NGS) have made it possible to sequence small RNA species like miRNA with unprecedented sensitivity and dynamic range. Despite the increasing use of small RNA-Sequencing (small RNA-Seq) to identify potential biomarkers and therapeutic targets for cancer, there is no consensus on a data analysis pipeline for miRNA-Seq data^5,6,7.

Standard NGS data analysis assumptions and algorithms used in mRNA sequencing experiments are routinely used in small RNA-Seq experiments despite inherent differences in read length, read depth and coverage between mRNA and miRNA datasets⁸. For example, seed lengths greater than 25 nucleotides are commonly used in RNA-Seq analyses, however, these seed lengths are longer than the average miRNA length and are not appropriate for aligning miRNA. Alignment of small RNA-Seq data requires the use of shorter seed lengths. Consequently, this increases the likelihood of individual reads mapping to multiple locations thereby increasing the uncertainty in mapping and quantitating reads. Further, given the uniform length of miRNAs, it is possible that normalization methods such as RPKM, which correct for differences in read length, may negatively impact analysis of miRNA datasets. Despite these limitations, the choice of alignment index, quantitation reference, and normalization method to identify differentially expressed miRNAs from small RNA-Seq data have not been fully evaluated. Validating a standard pipeline for small RNA-Seq data is critical since each step of processing impacts downstream analysis and identification of statistically significant differentially expressed (DE) miRNAs.

In this study, small RNA-Seq was used to identify novel ΔNp63α-regulated miRNAs in keratinocytes by comparing those in which ΔNp63α was silenced relative to non-silencing controls (NSC). ΔNp63α, a member of the p53 family, has been shown to modulate the expression of miRNAs involved in various cellular processes including the regulation of keratinocyte differentiation, cell migration, tumor growth, cell cycle arrest, apoptosis, and metabolism^9,10,11. ΔNp63α regulates miRNAs by directly regulating their transcription, and indirectly through the regulation of other transcription factors, such as the Runt-related transcription factor 1 (RUNX1), which in turn regulates miRNAs^12,13. Importantly, ΔNp63α has also been shown to regulate the global post-transcriptional processing of miRNAs by transcriptionally activating DGCR8 in the Drosha complex and potentially through direct interaction with the Drosha PY-WW domain¹⁴. Thus, the dysregulation of ΔNp63α in epithelial cancers alters miRNA expression through a variety of indirect and direct mechanisms, and it is our hope that ΔNp63α-regulated miRNAs may serve as novel cancer biomarkers or therapeutic targets.

To identify miRNAs that are differentially expressed between keratinocytes lacking ΔNp63α versus expressing ΔNp63α, we optimized key parameters in the analysis to develop a standard pipeline for analyzing small RNA-Seq data. In this study, we identified several novel miRNAs regulated by ΔNp63α using our optimized pipeline. A sub group of these miRNAs were validated by RT-qPCR further supporting the pipeline we established. Analysis of the functional roles of these miRNAs and their targets will facilitate a deeper understanding of ΔNp63α’s role in both maintaining epidermal integrity and determining tumorigenic fate. Our results will provide non-bioinformaticians, who rely on sequencing analysis software for their research, an optimized small RNA-Seq pipeline to expedite data analysis, thereby enabling researchers to focus on the biological significance of their findings. Furthermore, the pipeline parameters chosen herein (e.g. alignment and quantitation to the miRBase reference, TMM normalization and use of an LNS model for identification of differentially expressed miRNAs) may be implemented at the command line with the use of open-source tools for small RNA-Seq analysis.

Results

Small RNA Sequencing for miRNAs in HaCaT cells with p63 knockdown

To examine miRNAs regulated by ΔNp63α, we transfected HaCaT cells with either non-silencing control siRNA (NSC) or p63 specific siRNA. These cells express ΔNp63α, the most physiologically relevant isoform of p63 expressed in the basal layer of the skin^15,16,17. All three biological replicates of HaCaT cells transfected with siRNA against p63 (sip63) showed 80% or greater reduction in p63 transcript levels by RT-qPCR (Fig. 1A) and no detectable p63 protein by immunoblot (Fig. 1B) relative to non-silencing control (NSC) (representative data shown), thus confirming p63 knockdown. Bioanalyzer measurements showed that our samples had an average of 7–11% RNA of 10–40 nucleotides in length, which we considered to be miRNAs per the manufacturer’s guideline. After size-selection and library preparation, barcoded cDNA libraries prepared from each of the 3 biological replicates of HaCaT cells transfected with NSC and sip63 were pooled and sequenced yielding over 6 million reads per sample (data not shown). Mean read lengths between 20 and 25 base pairs were obtained, consistent with expected miRNA length.

Analysis of small RNA-Seq data

The general workflow for small RNA-Seq analysis used in this study, including alignment, quantitation, normalization, and differential gene expression analysis is shown in Fig. 2. The choice of alignment index, quantitation reference, normalization method, and statistical probability distribution model are known to affect differential expression analyses of miRNA datasets, thus highlighting the need for careful consideration of pipeline parameters during small RNA-Seq analysis^8,18,19. Figure 2 shows the key steps in the data analysis workflow evaluated during small RNA-Seq pipeline optimization.

To approximately determine the total miRNA content in each sample, trimmed reads were first aligned and quantitated to the miRBase reference to obtain the total miRNA read count for each sample. Reads that did not align using the miRBase reference were re-aligned to the whole genome reference and quantitated using the RefSeq reference. Re-aligning to a different reference in this manner served to identify as many potential miRNAs as possible and aided in the quantitation of other RNA species. In combination with the miRNA reads quantitated on the first pass, we estimate that roughly 71 ± 6% of reads were miRNAs and 9 ± 4% were snoRNAs, with the remainder likely comprised of degradation products and other low-quality reads.

To determine the optimal alignment index and quantitation reference, we assessed the differences in total aligned reads, quantitated reads, and the number of unique miRNAs in each of the 6 combinations tested as shown in Table 1. As expected, the total number of aligned reads was greatest when aligned to the whole genome (hg38) and lowest when aligned to the miRBase reference, reflecting the number of annotations present in each reference. For reads aligned to either whole genome or RefSeq, the number of quantitated reads was higher when using RefSeq as the quantitation reference. This is attributed to the fact that size selection for small RNA during our library preparation retained other types of small RNAs (e.g. piRNA, snoRNA, etc.) as well as partially degraded mRNA. As expected, the number of aligned reads and quantitated reads were identical when miRBase was used as the alignment reference, independent of quantitation reference. Despite differences in the total number of aligned reads between pipelines, all pipelines identified a large number of unique miRNA, a more direct indication of pipeline performance. Interestingly, in all 6 conditions tested, the number of unique miRNAs quantitated was the highest when miRBase was used as the alignment and quantitation reference.

Table 1 Effect of alignment reference on read quantification.

Full size table

To further assess the difference between alignment indexes, we looked at the frequency of raw read count values for each index when quantitated using miRBase (Fig. 3). When aligned to Whole genome and RefSeq, a significant number of miRNAs received 0 counts, indicating that approximately 63% and 30% of miRNAs contained in the miRBase reference, respectively, were not quantitated (Fig. 3A,B). By contrast, when aligned to miRBase, a majority of the miRNAs (~88%) received counts between 10 and 1,000 reads (Fig. 3C). Thus, more miRNAs were quantitated using miRBase as the alignment index. To further compare the distribution of raw read counts, box plots were generated from Relative Log Expression (RLE) values calculated for each of the miRBase quantitated datasets. Assuming that gross variation in raw read count distributions is primarily due to differences in library preparation and sequencing efficiency, one would expect sample replicates to have similar median values and roughly similar distributions of raw read counts such that the median RLE values are distributed around zero²⁰. While reads aligned to whole genome or RefSeq show more variation in median RLE values, the miRBase aligned reads show a median RLE which was naturally centered at zero and a uniform distribution across samples (Fig. 4). Together, these results clearly demonstrate that miRBase is the optimal alignment index and quantitation reference.

Next, to determine the optimal normalization method for our miRNA datasets, miRNA read counts were normalized using each of four normalization methods: RPKM, TPM, TC, and TMM. To compare the effect of each normalization algorithm, box plots were generated from RLE values of normalized reads (Fig. 5). One would expect that effective normalization would produce similar median values within treatment groups and roughly similar distributions of raw read counts for all samples²⁰. Normalization of miRBase aligned and quantitated data using RPKM or TPM showed increased variation in the median RLE compared to TC or TMM normalized data. Normalization by TC reduced variance but failed to stabilize the median RLE values across samples. TMM outperformed the other normalization methods by reducing variance and stabilizing the median RLE values around zero (Fig. 5).

Finally, Principal Components Analysis (PCA) was performed to assess the similarity of miRNA expression profiles among samples. Figure 6 displays the PC1-vs-PC2 scatter plots of PCA output for each tested normalization procedure. In all cases, sip63 and NSC groups separated along the PC1 axis, indicating that the differences between groups accounted for the largest observed variance in the dataset. Variance among samples within each group was distributed along the PC2 axis. While the dispersion of samples and corresponding Davies-Bouldin (DB) index measures were comparable for RPKM, TPM, and TC methods, TMM normalization led to an improved separation of sip63 and NSC samples in PCA space (as evidenced by the lower DB index and p value, see Fig. 6). Due to the median values being most closely centered around zero, from the data presented by the PCA analyses, TMM normalization method appears to be superior.

Altogether, thus far, we have demonstrated that alignment and quantitation of trimmed reads using miRBase results in mapping the greatest number of unique miRNAs. We further showed that normalizing raw reads via TMM is optimal for downstream analyses. To identify differentially expressed miRNAs by ΔNp63α, we focused on the normalized reads obtained from this pipeline configuration.

Identification of Differentially Expressed miRNAs

Differential expression (DE) analysis of the miRNAs was performed between the NSC and sip63 samples using the LNS model. Selection criteria used to identify these miRNAs were reads ≥10 in each sample, FC ≥1.5, and p ≤ 0.05. The full list of 79 miRNA that are predicted to be positively and negatively regulated by ΔNp63α are provided in Supplementary Tables 1 and 2, respectively. These tables show the average read counts for NSC and sip63 triplicates, p-values calculated using the LNS model and fold change. Of the 79 miRNAs identified, 58 miRNAs were positively regulated whereas 21 miRNAs were negatively regulated by ΔNp63α. Figure 7A shows a heat map of the differences in expression level for these 79 differentially expressed miRNAs in all 6 samples after both samples and features were clustered using a Euclidean distance metric and average-linkage algorithm. NSC and sip63 samples clustered together with up- or down-regulation of a majority of miRNA correlating with sample type. This list of miRNA was compared to a previously published dataset of p63-regulated miRNAs summarized in our previous study²¹. Eight of the currently identified miRNA were previously shown to be regulated by ΔNp63α and served as positive controls (miR-185–5p, miR-205-5p, miR-130b-3p, miR-203a-5p, and miR-429)^10,22,23,24. Additionally, seven ΔNp63α-regulated miRNA have been specifically identified in HaCaT cells: miR-17, miR-18a, miR-20b, miR-30a, miR-106a, miR-143, miR-455-3p^25,26,27,28. Of these, only miR-455-3p and miR-18a-5p met the reads, fold change and p-value cutoff used in the chosen pipeline. miR-20b was detected at very low levels in our samples and did not meet our strict filtering criteria. Consistent with previous reports, miR-17, miR-30a, miR-106a and miR-143 were identified as being positively regulated by p63 but did not reach statistical significance in our dataset and therefore excluded from our final list (Supplementary Table 1).

Pathway analysis of differentially expressed miRNAs

miRNAs shown to be regulated by ΔNp63α (Supplementary Tables 1 and 2) were subjected to pathway analysis. mRNA targets for these 79 miRNAs were identified in Ingenuity Pathway Analysis using a database of experimentally validated targets. A functional analysis of these targets indicated significant enrichment of genes involved in cell proliferation, movement, and apoptosis (p < 0.001) (Fig. 7B). These cellular functions have previously been identified as being affected by silencing of ΔNp63α and are consistent with the known functional role of ΔNp63α in maintaining epithelial stemness and promoting cancer progression.

Using IPA Upstream Regulator analysis tools, p63 (TP63) was identified as a significant upstream regulator of the experimentally validated targets of the 79 differentially expressed miRNA (p < 0.001). Thirty of these mRNA targets were previously known targets of p63 listed in the IPA Knowledge Base. These mRNA were linked to the 16 corresponding p63-regulated miRNA known to target them in the p63 signaling network shown in Fig. 7C. Among the 30 mRNAs predicted to be downstream of p63, 4 are involved in apoptosis and 7 in cell cycle (Fig. 7C).

Validation of statistically significant DE miRNAs

From the list of 79 miRNAs with significant changes in expression (p ≤ 0.05, reads ≥10 for all samples), those with reads greater than 250 in each of the NSC samples were subjected to validation to assess the robustness of the established pipeline. Out of these, a sub-group of miRNAs not shown to be previously regulated by p63 were shortlisted for validation for novel discovery. miR-149-5p, 18a-5p, 19b-1-5p, 20a-5p, 590-5p, 744-5p and 93-5p were validated by RT-qPCR. RT-qPCR results revealed that all 7 miRNAs were downregulated when ΔNp63α was silenced, thus confirming the DE results predicted by small RNA-Seq that these miRNAs are positively regulated by ΔNp63α (Fig. 8).

Testing of small RNA-Seq pipeline using a publicly available dataset

A previous study by Zhao et al.²⁹ performed on the Illumina platform identified 173 miRNA which were differentially expressed between early and mid-gestational fetal keratinocytes. Moreover, the authors validated a total of 10 by RT-PCR. While 4/10 are known miRNAs, 6 represent novel miRNA not listed in miRBase v18. Reanalysis of the FASTQ files obtained from the same study (SRA GSE65342) was performed using our optimized pipeline parameters except for using hg19 and miRBase v18 references to be consistent with that study. Our analysis led to the identification of 161 differentially expressed miRNA, 79 of which were also identified by Zhao et al. and includes the 4 known microRNA validated by PCR in that study. Of the remaining 94/173 not detected in our analysis, 52 of them had low read counts (<10 in all samples) and would not have been selected for analysis based on our stringent filtering criteria. Although there is a large concordance in the results obtained following analysis by both pipelines, it is not uncommon that we see unique microRNAs picked up by different analysis methods. This analysis provides additional empirical data in support of the current pipeline design and function with biological data collected using a different sequencing platform.

Discussion

In this study, we sought to optimize a user-friendly pipeline for small RNA-Seq analysis which could be utilized with little to no command line experience. We used HaCaT cells where ΔNp63α was either silenced (sip63) or not silenced (NSC) to optimize the pipeline’s parameters. The small RNA-Seq pipeline optimized and implemented in this study utilizes miRBase v21 as the alignment index and quantitation reference, TMM normalization, and an LNS model for DE analysis. This validated pipeline was used to identify novel ΔNp63α-regulated miRNAs which have predicted roles in cancer signaling. Further characterization of these miRNAs and their mRNA targets may provide mechanistic insight into the progression of non-melanoma skin cancer.

In our analyses, alignments were performed using Bowtie as it is regarded to be “an ultrafast, memory-efficient alignment program for aligning short DNA sequences”³⁰. The Johns Hopkins site, which hosts the Bowtie and Bowtie 2 software describes Bowtie as being optimal for sets of short reads where (a) many of the reads have at least one good, valid alignment, (b) many of the reads are relatively high-quality, and (c) the number of alignments per read is small (close to 1)³¹. Accordingly, Bowtie is generally thought to outperform other alignment algorithms for sequences less than 50 bp. While studies such as Ziemann et al. have shown that Bowtie 2 outperformed Bowtie in both precision and accuracy, Tam et al. has shown that Bowtie and Bowtie2 are similar in accuracy, specificity, and sensitivity as measured by congruence to RT-PCR data (supplemental materials, Tam et al.)^20,32. Bowtie 2 was developed to handle gapped alignments, however, that is not needed for aligning short sequences that span the entire length of the miRNA it resembles.

Using the whole genome and RefSeq indexes for alignment yielded a large number of reads which did not map to known miRNA sequences, suggesting that many reads aligned to non-miRNA loci when these indexes were used (Table 1, Fig. 3). This is likely the result of the presence of other small RNA species that remained after the size exclusion steps of library preparation but may also include a subset of miRNA reads which were misaligned. In either case, these errors would result in artificially low read counts. Quantitation made using the miRBase reference effectively resolved these issues. It is therefore recommended that alignment and quantitation of small RNA-Seq datasets should be performed using the miRBase reference and Bowtie aligner to effectively quantitate unique miRNAs while avoiding the computational time required for aligning to the whole genome or RefSeq indexes. This approach facilitates the rapid quantitation of known miRNAs, allowing researchers to investigate expression level changes and pursue validation experiments. However, it’s worth noting that alignment to the whole genome would be required if the researcher is interested in identifying undiscovered miRNAs.

Previous RNA-Seq pipeline studies have shown that the choice of normalization method can affect the estimation of miRNA abundance and subsequent identification of differentially expressed miRNAs^20,33. Total count normalization assumes that most RNA are unchanged across samples and scales datasets toward a common distribution, thus TC may be ineffective in situations in which highly expressed miRNAs are differentially expressed³⁴. Similarly, the intended utility of TPM and RPKM normalizations in correcting for sequencing biases attributed to read length has been questioned in mRNA-Seq analyses and may actually increase variance in miRNA datasets due to similar biases³⁵. All three methods assume similarities between read count distributions and function in a similar manner if corrections for read length bias are not necessary. TMM normalization, by contrast, relies on the assumption that most genes are not differentially expressed, and frequently outperforms other methods when datasets differ in composition^20,35. Further, TMM is robust for lower coverage data where a high number of genes with zero counts is expected³⁶. Since miRNAs are of uniform length, with the majority appearing to be expressed at very low basal levels, TMM normalization would seem appropriate. Our analysis supports the robustness of the TMM method in that it effectively stabilized the median distribution irrespective of the alignment used and identified a panel of miRNAs which were ultimately validated by RT-qPCR (Fig. 5).

Differential expression analysis performed using an LNS response distribution model identified a core set of miRNAs in our sip63 samples which were differentially expressed relative to NSC samples (Fig. 7A). Since sample sizes in NGS experiments are generally low and the read count data is non-normally distributed and continuous in nature, we recommend that LNS should be selected over other response distribution models.

Of the novel p63-regulated miRNAs identified in HaCaT cells and validated in this study (Fig. 8), several have known roles in cancer. It is important to note that differential expression of these miRNAs was validated in HaCaT cells, and p63-regulation may be cell type specific. miR-18a-5p plays an oncogenic role in nasopharyngeal cancer by regulating E-cadherin and K-ras³⁷. miR-19b-1-5p is downregulated in CD44 + cervical cancer cells which express increased p63 levels, although no link to p63 was implied³⁸. miR-20a-5p targets p63 to regulate p53 and PTEN expression, although the feedback regulation of mir-20a-5p by p63 has not been shown³⁹. miR-590-5p attenuates the TGFβ signaling pathway through down-regulation of SMAD3, and may regulate cell proliferation, apoptosis and migration⁴⁰. Lastly, miR-93-5p is elevated in colorectal cancer and is known to target WNK lysine deficient protein kinase 1 (WNK1) to inhibit the invasive potential of triple-negative breast cancer cells⁴¹. Nothing has been published regarding the functional roles or regulation for the remaining miRNAs identified in this study. It is our hope that these miRNAs and additional targets identified by our small RNA-Seq pipeline will be a source of biomarkers and therapeutic targets for p63-related pathologies and provide critical insight into the role played by p63 in cancer.

The analysis presented herein utilized small RNA libraries which were sequenced on the Ion Torrent platform. Lahens et al. demonstrated that Illumina and Ion Torrent platforms in RNA-Seq datasets yielded >80% agreement in differential expression with low read depth likely contributing to differences between the platforms⁴². Since our proposed pipeline filters out low read depth miRNAs (<10 reads per sample), we expect that it would perform similarly using Illumina datasets. However, the Lahens study also reported that the choice of alignment reference yielded some differences that were platform-specific. Thus, the pipeline parameters utilized herein should be empirically tested for other sequencing platforms.

Our investigation of the various alignment and quantitation references (whole genome, RefSeq and miRBase) and normalization methods (TC, RPKM, TPM, and TMM) highlights the potential impact of each on the analysis of small RNA-Seq data. While it is important to experiment with pipeline parameters to accommodate differences in sample library composition and confirm data output by RT-qPCR, we propose that miRBase alignment using Bowtie, quantitation with miRBase, and normalization with TMM as performed in Partek Flow provides a robust pipeline for small RNA-Seq analysis, circumventing the need for command line experience.

Materials and Methods

Cell culture, reagents, and plasmids

HaCaT, a non-tumorigenic immortalized human keratinocyte cell line, was obtained from Dr. Nancy Bigley (Wright State University, Dayton, OH). Cells were maintained in DMEM Hyclone media (GE Healthcare Life Sciences, Pittsburg, PA) supplemented with 8% fetal bovine serum, 100 U/mL penicillin, and streptomycin at 37 °C in 5% CO₂.

siRNA Knockdown

HaCaT cells were transfected with siRNA against p63 (sip63) or non-silencing control (NSC) (Qiagen, Valencia, CA) using Lipofectamine RNAi-Max (Thermo Fisher Scientific, Carlsbad, CA) as previously described⁴³.

Immunoblot analysis

Whole cell extracts were prepared by washing cells in cold 1% Phosphate-Buffered Saline (PBS) and lysing in phosphatase inhibitor buffer (50 mM Tris-HCl [pH 8.0], 120 mM NaCl, 5 mM EGTA, 1 mM EDTA, 5 mM NaPP, 10 mM NAF, 30 mM PMSF, 0.2 mM PMSF, 1 mM Benzamidine, 0.1% NP-40, 100 nM NaVO₄) supplemented with protease inhibitor cocktail (catalog #P8340, Sigma-Aldrich, St. Louis, MO). Protein was quantitated by Bicinchoninic Assay (BCA) according to the manufacturer’s instructions (Thermo Fisher Scientific, Fremont, CA). Equivalent amounts of protein were separated on 10% SDS-PAGE and transferred to Polyvinylidene difluoride (PVDF) membrane at 350 mA for 1 hour. ΔNp63α and β-actin were detected using rabbit polyclonal anti-p63 (N2C1, GeneTex, Irvine, CA) or mouse monoclonal anti-β-actin (AC15, Santa Cruz Biotechnology, Dallas, TX) antibodies, respectively. HRP-tagged secondary antibodies (Promega, Madison, WI) were used to enable chemiluminescent detection with the Western Lightning Plus-ECL Chemiluminescent Substrate kit (Perkin Elmer, Waltham, MA).

Small RNA-Seq library preparation and sequencing

Total RNA was isolated from HaCaT cells using the mirVana^TM Paris Kit (Thermo Fisher Scientific, Carlsbad, CA) and enriched for small RNA through size selection ethanol washes. For each library, 4 ng of miRNA was hybridized and ligated to Ion Adapters v2 (Ion Total RNA-Seq Kit v2, Life Technologies, Carlsbad, CA). Small RNA samples were reverse transcribed to cDNA using adapter specific Ion RT primers v2 (Life Technologies, Carlsbad, CA). Purified cDNA samples were size-selected and amplified by PCR followed by further purification and size selection. cDNA samples were barcoded using Platinum PCR SuperMix High Fidelity polymerase with IonXpress RNA 3′ Barcode primer and unique 5′ Ion Xpress RNA-Seq Barcode Primers using the Ion Xpress RNA-Seq Barcode 1–16 Kit (catalog #4471250, Life Technologies, Carlsbad, CA). Yield and size distribution of the cDNA libraries were assessed using the Agilent 2100 Bioanalyzer DNA1000 chip (catalog #5067–1504, Agilent Technologies, Santa Clara, CA). Total barcoded cDNA within the 50–300 base pair range was considered to be derived from small RNA. 7.5 picomoles of each barcoded library were pooled and clonally amplified onto Ion Sphere^TM Particles (ISPs) according to the manufacturer’s protocol (Ion PI Template OneTouch^TM 200 Template Kit v3) and enriched using the Ion OneTouch 2 ES system (Life Technologies, Carlsbad, CA). Clonal amplification of the cDNA libraries onto ISPs yielded 18.4% templated Ion Sphere Particles (ISPs), well within the manufacturer stated optimal range of 10% to 25% (Ion Sphere^TM Quality Control assay, Life Technologies, Carlsbad, CA). Enriched ISPs were sequenced on the Ion Proton Next Generation Sequencing system using the Ion P1 chip v2 Kit and Ion PI ^TM Sequencing 200 kit v3 (catalog #4482321 and #4488315, Life Technologies, Carlsbad, CA) with 500 Sequencing flows.

Small RNA-Seq data analysis

The general workflow for small RNA-Seq analysis used in this study, including alignment, quantitation, normalization, and differential gene expression analysis is shown in Fig. 2. All analyses were performed in Partek® Flow® software, version 5.0 (Copyright 2016, Partek Inc., St. Louis, MO).

Alignment and Quantification

Sequenced reads were assigned to their respective samples based on corresponding IonXpress barcodes and output in FASTQ format with their associated base quality scores using the Ion Torrent Suite version 4.0.2. FASTQ files were uploaded into Partek Flow software (Partek Inc., St. Louis, MO) for processing. Unaligned reads were trimmed from the 3′ end to a fixed length dependent on the positional base at which the PHRED quality score fell below 20 (Fig. 2A). A minimum read length filter retaining reads greater than 15 bases in length was used in this study and is within the range of 10–26 bases as previously reported^44,45,46,47.

Trimmed reads were aligned to either Whole Genome, RefSeq Transcripts 80 (2-6-2017), or miRBase mature miRNAs version 21 of the latest human genome assembly, hg38 (GRCh38) (Fig. 2B) using the Bowtie aligner³⁰. A seed mismatch limit of 1 and minimum seed length of 10 were used. A seed mismatch of 1 was chosen to avoid discarding reads potentially containing inaccurate base calls made by the sequencer. These settings are consistent with the standard recommendation provided by Partek Flow software (verbal communication from Partek Inc., St. Louis, MO). The alignment reporting option was set to 200 alignments per read in order to maximize the predictive power of the Expectation Maximization (EM) algorithm without being overly computationally intensive⁴⁸.

Raw read counts were obtained by quantitating aligned reads to either RefSeq Transcripts 80 (2-6-2017) or miRBase version 21, using a modified version of the EM algorithm implemented by Xing et al. in which isoform expression levels are quantified across the whole genome at the same time (Fig. 2C)⁴⁹. Details of the Partek EM algorithm can be found in the White Paper on RNA-Seq Methods⁵⁰. The EM algorithm is used to resolve ambiguous mappings (i.e. reads aligning well to multiple loci) for improved estimation of true expression read counts. It assigns an initial estimate of transcript abundance derived from uniquely mapped reads and then employs a Bayesian approach to calculate the most likely alignment for reads that map to multiple locations on the reference genome. These alignments are used to re-compute global transcript abundances, which are utilized to probabilistically re-align and resolve ambiguous mappings. This process is iterated until the algorithm converges, at which point the reads assigned to a particular locus are counted, resulting in final raw read counts.

Normalization

Raw read counts were normalized using Total Count (TC), Reads Per Kilobase per Million (RPKM), Transcripts Per Million (TPM), or Trimmed Mean of M values (TMM) normalization methods^{34,35,36,51,52} (Fig. 2D). Since miRNAs with 0 read counts would impede statistical calculations when performing differential expression analysis, an offset of 1.0 was added to all normalized read counts. This facilitated reporting for all annotated miRNAs at this stage. The offset did not result in inclusion of miRNAs with near zero read counts in the final DE lists since these lists were filtered to include only miRNAs with minimum read count values of greater than or equal to 10 reads in each of the profiled samples.

Differential Expression (DE) Analysis

Normalized read counts for each miRNA were statistically modeled using Partek Flow’s Gene Specific Analysis (GSA) approach. The GSA approach uses the data to select the best model (Normal (N), Negative Binomial (NB), Lognormal (LN), and Lognormal with Shrinkage (LNS)) for each miRNA based on the lowest corrected Akaike Information Criterion corrected (AICc)⁵³. Because the LNS model yielded the lowest AICc for a majority (72%) of miRNAs identified, it was selected as the default model for DE analysis. A low expression filter based on Lowest Average Coverage (LAC) was set with a cutoff of 4, thereby excluding miRNAs with a geometric mean across all samples below this value (Fig. 2E). LNS utilizes an empirical Bayes method that estimates gene-specific dispersion by combining information about variance from other genes to improve the estimation process, resulting in improved DE detection and lower false-positive rates⁵⁴. The LNS model is similar to the “limma trend” method from the limma package previously reported to be a robust model for differential expression analysis and is generally beneficial when the number of experimental replicates is low^46,55. P-values were generated using the F statistic. P-values less than or equal to 0.05 were deemed significant.

Reverse Transcription Quantitative Polymerase Chain Reaction (RT-qPCR)

Total RNA was extracted from HaCaT cells transfected with siRNA against p63 (sip63) and non-silencing control (NSC) using the mirVana^TM Paris^TM Isolation kit according to the manufacturer’s protocol (Thermo Fisher Scientific, Carlsbad, CA). 1 µg of total RNA input was used for cDNA synthesis using the q-Script cDNA supermix (Quanta Biosciences, Beverly, MA). TaqMan based RT-qPCR was performed as previously described⁴³. Assays on Demand (AODs) for p63 (Hs00978340_ml) and GAPDH (4325792) (Thermo Fisher Scientific, Carlsbad, CA, USA) were used. p63 expression was normalized to GAPDH according to the manufacturer’s instructions (Thermo Fisher Scientific, Carlsbad, CA). For miRNA quantitation, 10 ng of RNA was used for cDNA synthesis using the TaqMan miRNA reverse transcription kit according to manufacturer’s instructions (Applied Biosystems, Japan Ltd). qPCR was performed using the Universal TaqMan master mix (2X) and miRNA-specific TaqMan AODs for hsa-miR-149-5p (002255), hsa-miR-18a-5p (002422), hsa-miR-19b-1-5p (002425), hsa-miR-20a-5p (000580), hsa-miR590-5p (001984), hsa-miR-744-5p (002324), hsa-miR-93-5p (001090) and RNU48 (001006) (Thermo Fisher Scientific, Carlsbad, CA). MiRNA expression was normalized to RNU48 according to the manufacturer’s instructions (Thermo Fisher Scientific, Carlsbad, CA). Student’s t-tests were used to determine significant differences in sip63 samples relative to NSC controls.

Ingenuity Pathway Analysis (IPA) of DE miRNA

The functional roles and signaling networks of p63-regulated miRNA were identified using IPA (Qiagen Inc., Valencia, CA, https://www.qiagenbioinformatics.com/products/ingenuitypathway-analysis). The algorithms developed for use in IPA have been previously described⁵⁶. The input dataset for this analysis was the list of 79 significant DE miRNAs obtained from alignment and quantitation to the miRBase index and normalization by TMM after filtering on p ≤ 0.05, fold-change ≥1.5 and read counts ≥10 in all samples. MiRNAs were associated with their respective experimentally validated mRNA targets by querying the Ingenuity Pathways Knowledge Base (Qiagen Inc., Valencia, CA). Functional profiling was performed to identify cellular processes which showed enrichment for these mRNA. The upstream analysis function in IPA was used to filter the list of experimentally validated mRNA targets to only those with known links to p63 (TP63) in the IPA Knowledge Base. The IPA network connection tools were then used to display known functional connections between differentially expressed miRNA and these mRNA targets. Signaling network generation was performed using the Path Designer tools in IPA (Qiagen, Inc., Valencia, CA).

Statistical analysis

Relative log expression (RLE) plots were generated according to established methods⁵⁷. Briefly, 1) read counts were log₁₀ transformed, 2) the median of log expression values for each miRNA across all samples was calculated, 3) RLE values were calculated by subtracting this median value from each of the miRNA log read count value for each sample to obtain an RLE matrix, and 4) a box plot of RLE values was generated for each NSC and sip63 sample.

Principal components analysis (PCA) was performed to assess the overall variability in the miRNA datasets. The statistical significance of the separation of sip63 and NSC samples in PCA space was determined by a permutation test of the Davies-Bouldin (DB) index measure run with 1,000 iterations⁵⁸. DB index compares the intra-cluster distances among samples to the distance between cluster centroids; smaller values indicates a better separation of samples belonging to different groups. Statistical significance of group separation in PCA space was assessed by permutation analysis of Davies-Bouldin index as described⁵⁹.

Hierarchical cluster analysis (HCA) was performed on the subset of miRNA genes that satisfied differential expression criteria. The dataset was normalized by the root-mean-square algorithm applied across genes, and experiments were median-centered. HCA was run with Euclidean distance measure and average linkage clustering option.

Data Availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Lewis, B. P., Burge, C. B. & Bartel, D. P. Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell 120, 15–20 (2005).
Article PubMed CAS Google Scholar
Irani, S. miRNAs Signature in Head and Neck Squamous Cell Carcinoma Metastasis: A Literature Review. Journal of dentistry 17, 71–83 (2016).
PubMed PubMed Central Google Scholar
Wojcicka, A., Kolanowska, M. & Jazdzewski, K. Mechanisms In Endocrinology: MicroRNA in diagnostics and therapy of thyroid cancer. European journal of endocrinology 174, R89–98 (2016).
Article PubMed CAS Google Scholar
Lu, J. et al. MicroRNA expression profiles classify human cancers. Nature 435, 834–838 (2005).
Article ADS PubMed CAS Google Scholar
Lin, L. J., Lin, Y., Jin, Y. & Zheng, C. Q. Investigation of key microRNAs associated with hepatocellular carcinoma using small RNA-seq data. Molecular biology reports 41, 4341–4349 (2014).
Article PubMed CAS Google Scholar
Alisoltani, A., Fallahi, H., Shiran, B., Alisoltani, A. & Ebrahimie, E. RNA-seq SSRs and small RNA-seq SSRs: new approaches in cancer biomarker discovery. Gene 560, 34–43 (2015).
Article PubMed CAS Google Scholar
Kou, Y., Qiao, L. & Wang, Q. Identification of core miRNA based on small RNA-seq and RNA-seq for colorectal cancer by bioinformatics. Tumour biology: the journal of the International Society for Oncodevelopmental Biology and Medicine 36, 2249–2255 (2015).
Article CAS Google Scholar
Conesa, A. et al. A survey of best practices for RNA-seq data analysis. Genome biology 17, 13 (2016).
Article PubMed PubMed Central CAS Google Scholar
Hill, N. T. et al. 1alpha, 25-Dihydroxyvitamin D(3) and the vitamin D receptor regulates DeltaNp63alpha levels and keratinocyte proliferation. Cell death & disease 6, e1781 (2015).
Article CAS Google Scholar
Ratovitski, E. A. Tumor Protein p63/microRNA Network in Epithelial Cancer Cells. Current genomics 14, 441–452 (2013).
Article PubMed PubMed Central CAS Google Scholar
Lin, C. et al. The microRNA feedback regulation of p63 in cancer progression. Oncotarget 6, 8434–8453 (2015).
PubMed PubMed Central Google Scholar
Ortt, K., Raveh, E., Gat, U. & Sinha, S. A chromatin immunoprecipitation screen in mouse keratinocytes reveals Runx1 as a direct transcriptional target of DeltaNp63. Journal of cellular biochemistry 104, 1204–1219 (2008).
Article PubMed CAS Google Scholar
Masse, I. et al. Functional interplay between p63 and p53 controls RUNX1 function in the transition from proliferation to differentiation in human keratinocytes. Cell death & disease 3, e318 (2012).
Article CAS Google Scholar
Chakravarti, D. et al. Induced multipotency in adult keratinocytes through down-regulation of DeltaNp63 or DGCR8. Proceedings of the National Academy of Sciences of the United States of America 111, E572–581 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Koster, M. I., Kim, S., Mills, A. A., DeMayo, F. J. & Roop, D. R. p63 is the molecular switch for initiation of an epithelial stratification program. Genes & development 18, 126–131 (2004).
Article CAS Google Scholar
Koster, M. I. & Roop, D. R. The role of p63 in development and differentiation of the epidermis. Journal of dermatological science 34, 3–9 (2004).
Article PubMed CAS Google Scholar
Koster, M. I. & Roop, D. R. Transgenic mouse models provide new insights into the role of p63 in epidermal development. Cell cycle 3, 411–413 (2004).
Article PubMed CAS Google Scholar
Ramskold, D., Kavak, E. & Sandberg, R. How to analyze gene expression using RNA-sequencing data. Methods in molecular biology 802, 259–274 (2012).
Article PubMed CAS Google Scholar
Oshlack, A., Robinson, M. D. & Young, M. D. From RNA-seq reads to differential expression results. Genome biology 11, 220 (2010).
Article PubMed PubMed Central CAS Google Scholar
Tam, S., Tsao, M. S. & McPherson, J. D. Optimization of miRNA-seq data preprocessing. Briefings in bioinformatics 16, 950–963 (2015).
Article PubMed PubMed Central CAS Google Scholar
Stacy, A. J., Craig, M. P., Sakaram, S. & Kadakia, M. DeltaNp63alpha and microRNAs: leveraging the epithelial-mesenchymal transition. Oncotarget 8, 2114–2129 (2017).
Article PubMed Google Scholar
Rivetti di Val Cervo, P. et al. p63-microRNA feedback in keratinocyte senescence. Proceedings of the National Academy of Sciences of the United States of America 109, 1133–1138 (2012).
Article ADS PubMed PubMed Central Google Scholar
Tran, M. N. et al. Thep63 protein isoform DeltaNp63alpha inhibits epithelial-mesenchymal transition in human bladder cancer cells: role of MIR-205. The Journal of biological chemistry 288, 3275–3288 (2013).
Article PubMed CAS Google Scholar
Luo, Z. et al. An in silico analysis of dynamic changes in microRNA expression profiles in stepwise development of nasopharyngeal carcinoma. BMC medical genomics 5, 3 (2012).
Article ADS PubMed PubMed Central CAS Google Scholar
Wu, N. et al. The miR-17 family links p63 protein to MAPK signaling to promote the onset of human keratinocyte differentiation. PloS one 7, e45761 (2012).
Article ADS PubMed PubMed Central CAS Google Scholar
Han, F., Wu, Y. & Jiang, W. MicroRNA-18a Decreases Choroidal Endothelial Cell Proliferation and Migration by Inhibiting HIF1A Expression. Medical science monitor: international medical journal of experimental and clinical research 21, 1642–1647 (2015).
Article CAS Google Scholar
Luo, W., Li, G., Yi, Z., Nie, Q. & Zhang, X. E2F1-miR-20a-5p/20b-5p auto-regulatory feedback loop involved in myoblast proliferation and differentiation. Scientific reports 6, 27904 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Zhang, Z. et al. MiR-455-3p regulates early chondrogenic differentiation via inhibiting Runx2. FEBS letters 589, 3671–3678 (2015).
Article PubMed CAS Google Scholar
Zhao, F. et al. Dynamic Expression of Novel MiRNA Candidates and MiRNA-34 Family Members in Early- to Mid-Gestational Fetal Keratinocytes Contributes to Scarless Wound Healing by Targeting the TGF-beta Pathway. PloS one 10, e0126087 (2015).
Article PubMed PubMed Central CAS Google Scholar
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome biology 10, R25 (2009).
Article PubMed PubMed Central CAS Google Scholar
Langmead, B. et al. Bowtie, http://bowtie-bio.sourceforge.net/bowtie2/faq.shtml (2018).
Ziemann, M., Kaspi, A. & El-Osta, A. Evaluation of microRNA alignment techniques. Rna 22, 1120–1138 (2016).
Article PubMed PubMed Central CAS Google Scholar
Hoffmann, R., Seidl, T. & Dugas, M. Profound effect of normalization on detection of differentially expressed genes in oligonucleotide microarray data analysis. Genome biology 3 (2002).
Bullard, J. H., Purdom, E., Hansen, K. D. & Dudoit, S. Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments. BMC bioinformatics 11, 94 (2010).
Article PubMed PubMed Central CAS Google Scholar
Dillies, M. A. et al. A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis. Briefings in bioinformatics 14, 671–683 (2013).
Article PubMed CAS Google Scholar
Robinson, M. D. & Oshlack, A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome biology 11, R25 (2010).
Article PubMed PubMed Central CAS Google Scholar
Luo, Z. et al. miR-18a promotes malignant progression by impairing microRNA biogenesis in nasopharyngeal carcinoma. Carcinogenesis 34, 415–425 (2013).
Article PubMed CAS Google Scholar
Xiao, S., Zhou, Y., Jiang, J., Yuan, L. & Xue, M. CD44 affects the expression level of FOSlike antigen 1 in cervical cancer tissues. Molecular medicine reports 9, 1667–1674 (2014).
Article PubMed CAS Google Scholar
Fang, W. W. et al. MicroRNA-20a-5p contributes to hepatic glycogen synthesis through targeting p63 to regulate p53 and PTEN expression. Journal of cellular and molecular medicine 20, 1467–1480 (2016).
Article PubMed PubMed Central CAS Google Scholar
Jafarzadeh, M. & Soltani, B. M. Hsa-miR-590-5p Interaction with SMAD3 Transcript Supports Its Regulatory Effect on The TGF beta Signaling Pathway. Cell J 18, 7–12 (2016).
PubMed PubMed Central Google Scholar
Niu, Y. et al. Identification of reference genes for circulating microRNA analysis in colorectal cancer. Scientific reports 6, 35611 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Lahens, N. F. et al. A comparison of Illumina and Ion Torrent sequencing platforms in the context of differential gene expression. BMC genomics 18, 602 (2017).
Article PubMed PubMed Central Google Scholar
Kommagani, R. et al. Regulation of VDR by deltaNp63alpha is associated with inhibition of cell invasion. Journal of cell science 122, 2828–2835 (2009).
Article PubMed PubMed Central CAS Google Scholar
Williams, C. R., Baccarella, A., Parrish, J. Z. & Kim, C. C. Trimming of sequence reads alters RNA-Seq gene expression estimates. BMC bioinformatics 17, 103 (2016).
Article PubMed PubMed Central CAS Google Scholar
McCormick, K. P., Willmann, M. R. & Meyers, B. C. Experimental design, preprocessing, normalization and differential expression analysis of small RNA sequencing experiments. Silence 2, 2 (2011).
Article PubMed PubMed Central CAS Google Scholar
Auer, P. L. & Doerge, R. W. A Two-Stage Poisson Model for Testing RNA-Seq Data. Stat Appl Genet Mol 10 (2011).
Marco, A. & Griffiths-Jones, S. Detection of microRNAs in color space. Bioinformatics 28, 318–323 (2012).
Article PubMed CAS Google Scholar
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC bioinformatics 12, 323 (2011).
Article PubMed PubMed Central CAS Google Scholar
Xing, Y. et al. An expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs. Nucleic Acids Res 34, 3150–3160 (2006).
Article PubMed PubMed Central CAS Google Scholar
White Paper: RNA-Seq Methods. Reprint at http://www.partek.com/Tutorials/microarray/User_Guides/RNASEQ.pdf (2010).
Mortazavi, A., Williams, B. A., McCue, K., Schaeffer, L. & Wold, B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nature methods 5, 621–628 (2008).
Article PubMed CAS Google Scholar
Wagner, G. P., Kin, K. & Lynch, V. J. Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples. Theory in biosciences = Theorie in den Biowissenschaften 131, 281–285 (2012).
Article PubMed CAS Google Scholar
Burnham, K. P. & Anderson, D. R. Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach. 2 edn, (Springer New York, 2004).
Wu, H., Wang, C. & Wu, Z. A new shrinkage estimator for dispersion improves differential expression detection in RNA-seq data. Biostatistics 14, 232–243 (2013).
Article PubMed Google Scholar
Law, C. W., Chen, Y. S., Shi, W. & Smyth, G. K. voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome biology 15 (2014).
Kramer, A., Green, J., Pollard, J. & Tugendreich, S. Causal analysis approaches in Ingenuity Pathway Analysis. Bioinformatics 30, 523–530 (2014).
Article PubMed CAS Google Scholar
Gandolfo, L. C. & Speed, T. P. RLE Plots: Visualising Unwanted Variation in High DimensionalData. ArXiv e-prints, 1–9 (2017).
Davies, D. L. & Bouldin, D. W. A cluster separation measure. IEEE transactions on pattern analysis and machine intelligence 1, 224–227 (1979).
Article PubMed CAS Google Scholar
Shankar, V. et al. The networks of human gut microbe-metabolite associations are different between health and irritable bowel syndrome. Isme J 9, 1899–1903 (2015).
Article PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

This work was supported by the National Cancer Institute [1R01CA154715] and the Department Office of Naval Research [N00014-16-3150].

Author information

Authors and Affiliations

Biochemistry and Molecular Biology, Wright State University, Dayton, OH, 45435, USA
Suraj Sakaram, Michael P. Craig, Natasha T. Hill, Amjad Aljagthmi, Christian Garrido, Oleg Paliy & Madhavi P. Kadakia
Math and Microbiology, Wright State University, Dayton, OH, 45435, USA
Michael Bottomley
Computer Science and Engineering, Wright State University, Dayton, OH, 45435, USA
Michael Raymer

Authors

Suraj Sakaram
View author publications
You can also search for this author in PubMed Google Scholar
Michael P. Craig
View author publications
You can also search for this author in PubMed Google Scholar
Natasha T. Hill
View author publications
You can also search for this author in PubMed Google Scholar
Amjad Aljagthmi
View author publications
You can also search for this author in PubMed Google Scholar
Christian Garrido
View author publications
You can also search for this author in PubMed Google Scholar
Oleg Paliy
View author publications
You can also search for this author in PubMed Google Scholar
Michael Bottomley
View author publications
You can also search for this author in PubMed Google Scholar
Michael Raymer
View author publications
You can also search for this author in PubMed Google Scholar
Madhavi P. Kadakia
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.K. conceived and designed the experiments. S.S., N.T.H. and A.A. performed the experiments. S.S., M.P.C. and C.G. performed NGS analysis. O.P. performed PCA analysis and hierarchical clustering. M.B. provided statistical support. M.R. provided bioinformatics support. All authors contributed to the analysis and interpretation of the data, co-wrote the manuscript and approved the final version.

Corresponding author

Correspondence to Madhavi P. Kadakia.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Dataset 1 and Dataset 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sakaram, S., Craig, M.P., Hill, N.T. et al. Identification of novel ΔNp63α-regulated miRNAs using an optimized small RNA-Seq analysis pipeline. Sci Rep 8, 10069 (2018). https://doi.org/10.1038/s41598-018-28168-5

Download citation

Received: 08 February 2018
Accepted: 15 June 2018
Published: 03 July 2018
DOI: https://doi.org/10.1038/s41598-018-28168-5
Springer Nature Limited

Identification of novel ΔNp63α-regulated miRNAs using an optimized small RNA-Seq analysis pipeline

Abstract

Similar content being viewed by others

Deregulated microRNAs in triple-negative breast cancer revealed by deep sequencing

Genome-Wide Analysis of c-MYC-Regulated mRNAs and miRNAs and c-MYC DNA-Binding by Next-Generation Sequencing

Expanding the repertoire of miRNAs and miRNA-offset RNAs expressed in multiple myeloma by small RNA deep sequencing

Introduction

Results

Small RNA Sequencing for miRNAs in HaCaT cells with p63 knockdown

Analysis of small RNA-Seq data

Identification of Differentially Expressed miRNAs

Pathway analysis of differentially expressed miRNAs

Validation of statistically significant DE miRNAs

Testing of small RNA-Seq pipeline using a publicly available dataset

Discussion

Materials and Methods

Cell culture, reagents, and plasmids

siRNA Knockdown

Immunoblot analysis

Small RNA-Seq library preparation and sequencing

Small RNA-Seq data analysis

Alignment and Quantification

Normalization

Differential Expression (DE) Analysis

Reverse Transcription Quantitative Polymerase Chain Reaction (RT-qPCR)

Ingenuity Pathway Analysis (IPA) of DE miRNA

Statistical analysis

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Dataset 1 and Dataset 2

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation