Avoid common mistakes on your manuscript.
Dear Editor,
iPSCs are typically derived using four transcription factors Oct4 (also known as Pou5f1), Klf4, Sox2 and c-Myc (OKSM) (Takahashi and Yamanaka, 2006). Major directions in this field are focused on understanding the reprogramming mechanism, optimizing reprogramming methods to improve iPS cell (iPSC) quality, and applications of the cells for regenerative medicine. Over the past few years, great efforts have been put into finding novel genes involved in the reprogramming process. Indeed, increasing numbers of factors are found to play important roles in improving reprogramming efficiency (Theunissen and Jaenisch, 2014). However, it is still unclear how many more genes are implicated in this process, and methods are needed to systematically evaluate each gene’s relevance to reprogramming.
Phenotype-driven genetic screens with loss-of function and gain-of function methods have been successfully used to discover reprogramming factors. DNA transposons, RNA interference and CRISPR/Cas9-based systems have been powerful tools for loss-of function screening. A series of reprogramming factors have been identified with these loss-of-function methods (Ding et al., 2012; Kearns et al., 2014; Woltjen et al., 2016). However, the use of gain-of-function screening in reprogramming has been very limited. cDNA libraries, cell extract of pluripotent stem cells and high-throughput small molecule screens have also been exploited to find pluripotency genes (Abujarour et al., 2010; Singhal et al., 2010; Ying et al., 2008). These methods can only examine several genes at a time. A gain-of-function screen on the whole genome-scale would be very useful for saturation screens of reprogramming factors.
In this study, we aimed to systematically identify genes that participate in cellular reprogramming by establishing a gain-of-function screening strategy with PB vectors combined with the next-generation sequencing (NGS) and define each gene’s relevance to reprogramming.
The piggyBac transposon (PB) has been demonstrated to be an efficient mutagenesis tool in mammalian cells and mice (Ding et al., 2005; Wu et al., 2007) Compared with the commonly used retroviral vectors, PB transpositions in mammalian cells are more efficient and random (Wang et al., 2008). Therefore, we constructed a PB screening vector pFind1 that enabled a large-scale functional screening strategy for genes related to reprogramming (Fig. 1A and Fig. S1A). Although pFind1 is designed as a dual function vector, when used in cultured diploid cells, it primarily functions as a gain-of-function vector. To determine an efficient screening scheme, we first aimed to find a practical combination of reprogramming factors that alone generate no or minimal number of iPSC clones, but pFind1 addition can significantly increase colony numbers. We began by co-transfecting our cells with OK (Oct4 and Klf4) or OS (Oct4 and Sox2) combination (Fig. S1B), since OK or OS combinations alone rarely induced MEFs into iPSCs (Takahashi and Yamanaka, 2006). Neither OS + pFind1 nor OK + pFind1 showed significant difference of reprogramming efficiency compared with OK or OS alone (Fig. 1B).
We further tested pFind1 and three-factor combinations KSM (Klf4, Sox2 and c-Myc) or OKS (Oct4, Klf4 and Sox2) (Fig. S1B). KSM did not reprogram fibroblasts into iPSC colonies, and only partially reprogrammed cell clusters could be found (Fig. 1C) (Shu et al., 2013; Takahashi and Yamanaka, 2006). However, when both KSM and pFind1 were introduced into MEFs, we detected colonies that exhibited similar morphology to mouse ES cells and expressed alkaline phosphatase (AP) (Fig. S1C) and pluripotency marker Oct4 (Fig. 1D), demonstrating that insertion of pFind1 promoted cellular reprogramming. Five colonies were picked to identify genomic insertion sites. Among the genes identified, Smad3 has been reported to play an important role in maintaining mouse ESC stability (Li et al., 2013). This result from KSM + pFind1 screening indicates that our PB vectors can be used to find factors that improve quality of iPSCs. Because of the low reprogramming efficiency (0.001‰), KSM plus pFind1 could only provide very few colonies for high-throughput screening. We next tested alternative screening scheme with OKS and pFind1 combination.
To test the reliability and validity of OKS/pFind1 strategy, we used a reporter cell line Oct4-GFP MEFs harboring a stably integrated green fluorescent protein (GFP) cassette driven by an Oct4 promoter and enhancer to monitor iPSC generation. iPSC clones derived from OKS alone or OKS + pFind1 (Fig. 1E) showed similar morphology. We next estimated whether the addition of pFind1 affected reprogramming efficiency. As a positive control, we supplied Brg1/Baf155 chromatin remodeling complex proteins that had previously been reported to increase the number of GFP+ colonies up to 4-fold (Singhal et al., 2010). An empty vector was used as a blank control. On these blank control plates, colonies first appeared around 10 days post transfection (dpt). In contrast, iPSC colonies appeared a few days earlier in the experiment plates that were transfected with OKS, pFind1 and pCAG-PBase. When GFP+ colonies were counted at 14 dpt, we found that pFind1 exhibited a clear increase in reprogramming efficiency (Fig. 1F).
We next manually picked 300 iPSC clones from the experiment plates at 8 dpt in order to catch pFind1-facilitated colonies. All these clones were expanded for further analysis. Through inverse-PCR followed by sequencing analysis, we identified pFind1 genomic insertion sites (Fig. S1D). In these ~300 iPSC clones we identified 213 genes including known pluripotency genes, such as Nr5a1, Klf5, mir205 (Table S1). To detect whether pFind1 can modulate expression of genes adjacent to insertion sites, we examined three random iPSC clones (pFind1-iPS2, pFind1-iPS8 and pFind1-iPS10) derived with OKS and pFind1 in details. We identified pFind1 insertion sites (Fig. 1G) and analyzed expression of neighboring genes. Increased expression of Stox2 (a gene encoding Storkhead Box 2 protein) was significant in pFind1-iPS2 with pFind1 insertion (Fig. 1H), compared with other clones that have completely different integration sites (not adjacent integration) and iPSC clones induced with OKS only (OKS-iPS). These results indicated that the pFind1 insertions activated neighboring genes at the transcription level. Further, we found that the Id2 gene 120 kb away from insertion site was also up-regulated by pFind1 (Fig. 1I), suggesting that the effect range of pFind1 could reach 120 kb. Together, these results demonstrate that pFind1 vector can be used for large-scale screening for reprogramming related genes.
Because the identification of pFind1 insertion sites in single colonies is tedious and rate limiting, we designed a high-throughput method to detect pFind1 insertion sites with NGS (Fig. S2A). We collected nearly 500,000 iPSC colonies derived with pFind1 and OKS. To detect a potential genomic insertion bias of pFind1, we concurrently performed a control with the pFind1 vector alone. pFind1 integration sites were mapped through a modified inverse PCR protocol that created fragments compatible with high-throughput sequencing.
After filtering and processing the original deep sequencing data, we aligned the sequences that were adjacent to pFind1 insertion sites to UCSC database (http://genome.ucsc.edu). pFind1 insertions were distributed among all chromosomes (Fig. S2B). We calculated frequencies of the pFind1 insertion in each chromosome, which were used to create distribution histograms of insertion frequencies for 19 mouse autosomes and 2 sex chromosomes (Figs. 2A and S3A). Through statistical analysis of genes located around pFind1 integration sites, a ranking list of 12,634 genes was obtained by counting pFind1 hit times around each gene (Table S2). In the control experiment of MEFs transfected with pFind1 only, pFind1 insertions showed dramatically different patterns (Fig. S3B), suggesting that the different pFind1 insertions in our screening experiment resulted from pFind1 enhanced reprogramming.
Depending on the location of pFind1 insertion sites, dual function design of pFind1 can result in either loss-of-function or gain-of-function of the gene. We isolated genes that were disrupted (Table S3) and genes that were up-regulated (Table S4). Since our sequencing results indicated that most pFind1 insertions are located in intergenic regions, we mostly focused on gain-of-function aspect in the screen. We ranked these genes that were up-regulated by pFind1 via calculating pFind1 insertion frequency around each gene. To get a global picture of these genes, functional annotations were performed using GO (Gene Ontology) analysis (Fig. 2B) and KEGG (Kyoto encyclopedia of genes and genomes) analysis (Fig. 2C). Many enriched genes were related to well known self-renewal signaling pathways such as Jak-STAT signaling pathway (Raz et al., 1999). The most significantly enriched genes were related to innate immunity pathways. These results suggest innate immunity pathways may be involved in reprogramming.
To further validate our gene ranking list, by comparing with a previously published list of genes that could potentially increase iPSC quality (Wu et al., 2014), we selected 10 genes covering different segments in Table S4 to examine their reprogramming capability. We electroporated each candidate gene’s overexpression vector along with OKS into MEFs and calculated efficiency of iPSC generation. All these genes can promote generation of iPSCs to various extents with a trend that high ranking genes markedly enhanced iPSC generation whereas low ranking genes only had slight effect, although there was no strict correlation (Fig. 2D). These validation results suggest that our gene ranking list from the screen can indeed provide a meaningful reference to evaluate each gene’s reprogramming ability in the future.
In the current study, we developed a new screening method for large-scale screening reprogramming-related genes based on PB transposon technology and NGS. We illustrated the utility of this method by rapid and systematic screening of mouse genome for reprogramming factors. Several features of our method are worth mentioning. First, the use of PB transposon mutagens allowed us to quickly generate a large collection of iPSCs with different insertions covering the entire mouse genome. Second, the gain-of-function design of pFind1 enabled us to perform a genetic screen for reprogramming factors in a sensitized background. Third, combined with deep sequencing and bioinformatic analysis, the distribution of pFind1 insertions can be easily identified, allowing one to address gene function on a large scale. The frequency of PB insertion in our study is not an absolute measure of a gene’s reprogramming ability, but it can very well provide a good starting point for further assessment of each gene’s relevance to reprogramming. This method should have broader applications to reveal genes participating in other biological processes.
References
Abujarour R, Efe J, Ding S (2010) Genome-wide gain-of-function screen identifies novel regulators of pluripotency. Stem Cells 28:1487–1497
Ding S, Wu X, Li G, Han M, Zhuang Y, Xu T (2005) Efficient transposition of the piggyBac (PB) transposon in mammalian cells and mice. Cell 122:473–483
Ding L, Poser I, Paszkowski-Rogacz M, Buchholz F (2012) From RNAi screens to molecular function in embryonic stem cells. Stem Cell Rev 8:32–42
Kearns NA, Genga RM, Enuameh MS, Garber M, Wolfe SA, Maehr R (2014) Cas9 effector-mediated regulation of transcription and differentiation in human pluripotent stem cells. Development 141:219–223
Li P, Chen Y, Meng X, Kwok KY, Huang X, Choy KW, Wang CC, Lan H, Yuan P (2013) Suppression of malignancy by Smad3 in mouse embryonic stem cell formed teratoma. Stem Cell Rev 9:709–720
Raz R, Lee CK, Cannizzaro LA, d’Eustachio P, Levy DE (1999) Essential role of STAT3 for embryonic stem cell pluripotency. Proc Natl Acad Sci USA 96:2846–2851
Shu J, Wu C, Wu Y, Li Z, Shao S, Zhao W, Tang X, Yang H, Shen L, Zuo X et al (2013) Induction of pluripotency in mouse somatic cells with lineage specifiers. Cell 153:963–975
Singhal N, Graumann J, Wu G, Arauzo-Bravo MJ, Han DW, Greber B, Gentile L, Mann M, Scholer HR (2010) Chromatin-remodeling components of the BAF complex facilitate reprogramming. Cell 141:943–955
Takahashi K, Yamanaka S (2006) Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell 126:663–676
Theunissen TW, Jaenisch R (2014) Molecular control of induced pluripotency. Cell Stem Cell 14:720–734
Wang W, Lin C, Lu D, Ning Z, Cox T, Melvin D, Wang X, Bradley A, Liu P (2008) Chromosomal transposition of piggyBac in mouse embryonic stem cells. Proc Natl Acad Sci USA 105:9290–9295
Woltjen K, Kim SI, Nagy A (2016) The piggyBac transposon as a platform technology for somatic cell reprogramming studies in mouse. Methods Mol Biol 1357:1–22
Wu S, Ying G, Wu Q, Capecchi MR (2007) Toward simpler and faster genome-wide mutagenesis in mice. Nat Genet 39:922–930
Wu S, Wu Y, Zhang X, Capecchi MR (2014) Efficient germ-line transmission obtained with transgene-free induced pluripotent stem cells. Proc Natl Acad Sci USA 111:10678–10683
Ying QL, Wray J, Nichols J, Batlle-Morera L, Doble B, Woodgett J, Cohen P, Smith A (2008) The ground state of embryonic stem cell self-renewal. Nature 453:519–523
Footnotes
We thank Changxin Wu, Xiaofeng Wang and Rui Liu for help in bioinformatics analysis, Qian Zhao and Jingxin Wang for everyday support of this project, and Lara Carroll for reading the manuscript. This work was supported by the National High Technology Research and Development Program (2013AA102502), Transgenic Research Grant 2013ZX08010-001, and the Project for Extramural Scientists of State Key Laboratory of Agrobiotechnology 2015SKLAB6-15.
Xi Zhang, Xinglin Wei, Yuanyuan Wu, Yuzhe Wang, Cheng Tan, Xiaoxiang Hu, Ning Li, Mario R. Capecchi and Sen Wu declare that they have no conflict of interest. All national and institutional guidelines for the care and use of laboratory animals were followed.
Author information
Authors and Affiliations
Corresponding author
Additional information
Xi Zhang, Xinglin Wei, and Yuanyuan Wu have contributed equally to this work.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Zhang, X., Wei, X., Wu, Y. et al. Genome-wide piggyBac transposon mediated screening reveals genes related to reprogramming. Protein Cell 8, 134–139 (2017). https://doi.org/10.1007/s13238-016-0332-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13238-016-0332-z