Positional distribution of transcription factor binding sites in Arabidopsis thaliana

Yu, Chun-Ping; Lin, Jinn-Jy; Li, Wen-Hsiung

doi:10.1038/srep25164

Article
Open access
Published: 27 April 2016

Volume 6, article number 25164, (2016)
Scientific Reports

Chun-Ping Yu¹, Jinn-Jy Lin²,³,⁴ & Wen-Hsiung Li¹,²,⁵

Abstract

Binding of a transcription factor (TF) to its DNA binding sites (TFBSs) is a critical step in initiating the transcription of its target genes. It is therefore of interest to know where the TFBSs of a gene are likely to be located in the promoter region. Here we studied the positional distribution of TFBSs in Arabidopsis thaliana, for which many known TFBSs are now available. We developed a method to identify the locations of TFBSs in the promoter sequences of genes in A. thaliana. We found that the distribution is nearly bell-shaped, with a peak at 50 base pairs (bp) upstream of the transcription start site (TSS), and that 86% of the TFBSs are in the region from −1,000 bp to +200 bp with respect to the TSS. Our distribution was supported by chromatin immunoprecipitation (sequencing and microarray) data and by DNase I hypersensitive site sequencing data. When TF families were considered separately, differences in positional preference were observed between families. Our study of the positional distribution of TFBSs seems to be the first in a plant.

Introduction

Binding of a transcription factor (TF) to its DNA binding sites is a critical step to initiate the transcription of its target genes. Typically, a TF binding site (TFBS) is 5 to 15 base pairs (bp) long within the promoter of its target gene and a TF protein usually can recognize a set of similar DNA sequences with varying degrees of binding affinity. In view of the importance of TFBSs in gene regulation, it is useful to know how the TFBSs of a gene are spatially distributed in its promoter region. Such a study for a large number of genes may shed light on how many TFBSs on average regulate a gene and where to find the TFBSs of a new TF.

The spatial distribution of TFBSs has been studied in yeast and human^1,2, revealing that TFBSs are not uniformly distributed over the promoter region but tend to lie in the vicinity of the transcription start site (TSS) of the gene. In Saccharomyces cerevisiae, TFBSs are enriched in the region from 200 bp to 100 bp upstream of the TSS (from −200 bp to −100 bp), with a sharp peak at 115 bp upstream². In human, according to the ChIP-chip data of nine TFs, the distribution of TFBSs is a mixture of two distributions: a bell-shaped distribution with a narrow peak within 300 bp upstream of the TSS and a uniform background distribution¹. In plants, TFBSs are enriched in the 200 bp region upstream of the TSS for the stress-responsive genes in Arabidopsis thaliana³. However, as that study was limited in scale, we conducted a more extensive one.

Studying the spatial distribution of TFBSs in an organism requires a large number of known TFBSs in that organism. In plants, such data are available only for Arabidopsis thaliana, so we studied the distribution in this plant. A second requirement is a large number of target genes. For this purpose we developed a method for predicting the target genes of a TF whose TFBS is known.

A common experimental approach to identify the TFBSs of a TF in a genome is chromatin immunoprecipitation-sequencing (ChIP-seq) assay⁴. It utilizes a TF-specific antibody to capture in vivo cross-linked DNA-protein complexes, which contain the TF protein, DNA fragments and other chromatin-associated proteins. The DNA fragments are then sequenced to determine the TFBSs (peaks) of the TF over the genome. Many ChIP-seq studies have been made on A. thaliana^5,6,7. An alternative assay is sequencing of DNase I hypersensitive sites (DHS-seq)^8,9, which detects open chromatin regions that are sensitive to DNase I enzyme to infer the regulatory regions in a genome. In this study, we also used these two types of data to study the positional distribution of TFBSs in A. thaliana and we compared the distributions obtained from the three types of data.

Results

Statistics of TFBSs in the promoters of predicted target genes of a TF

We collected 586 TFBSs for 400 Arabidopsis TFs from the TF databases and literature (see Materials and Methods). As in Yu et al.¹⁰, for a predicted TFBS in the promoter of a putative target gene of a TF to be considered a target site of the TF it has to pass the conservation test in the following three species: Arabidopsis lyrata, Brassica oleracea and Brassica rapa. Among the annotated 33,602 genes in A. thaliana (TAIR10), 17,573 genes were found to have orthologs in all of the above three species. We found that 547 of the 586 TFBSs passed the conservation test and that 15,781 of the 17,573 genes contained at least one of the 547 TFBSs in their promoter region (Supplementary Table S1), which is defined as the region from 2,000 bp (−2,000 bp) upstream of the transcription start site (TSS) to 200 bp (+200 bp) downstream of the TSS of the gene. On average, a gene has 5.4 TFBSs, which belong to 4.1 TF families and a TFBS is, on average, present in 162.4 of the 15,781 genes.

Positional distribution of TFBSs

First, we consider the occurrence of a TFBS sequence in the promoter region of a gene. The presence of a TFBS sequence at a nucleotide site in a promoter was predicted by FIMO¹¹ (p-value < 10⁻⁴). Note that this occurrence refers to the chance occurrence of the sequence without requiring that it passes the conservation test. This distribution can be considered the random distribution of TFBS sequences in the promoter region. As expected, this distribution (the probability density) is rather flat except for a small peak at −50 bp with respect to the TSS (the grey line in Fig. 1a; see also Supplementary Fig. S1). The probability density at the peak (5.2 × 10⁻⁴) is only 1.3-fold higher than that at the position −2,000 bp (~4.1 × 10⁻⁴).
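The bookkeeping behind such a positional distribution can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the 50-bp bin width and the coordinate convention (negative values are upstream of the TSS) are assumptions made for illustration, and the motif hits themselves would come from a scanner such as FIMO. For reference, a perfectly uniform distribution over the 2,200-bp promoter window would have a density of 1/2,200 ≈ 4.5 × 10⁻⁴ per bp, close to the flat level reported above.

```python
from collections import Counter

# Promoter window used in the text: -2,000 bp to +200 bp relative to the TSS.
PROMOTER_START, PROMOTER_END = -2000, 200

def relative_position(site_pos, tss, strand):
    """Signed distance of a site from the TSS; negative means upstream.

    For genes on the minus strand, upstream lies at higher genomic
    coordinates, so the sign of the offset is flipped.
    """
    offset = site_pos - tss
    return offset if strand == "+" else -offset

def positional_density(rel_positions, bin_width=50):
    """Per-bp density of sites in each bin, keyed by the bin's left edge.

    bin_width is an illustrative choice, not a value from the paper.
    """
    kept = [p for p in rel_positions if PROMOTER_START <= p <= PROMOTER_END]
    if not kept:
        return {}
    counts = Counter(
        (p - PROMOTER_START) // bin_width * bin_width + PROMOTER_START
        for p in kept
    )
    total = len(kept)
    return {edge: n / (total * bin_width) for edge, n in sorted(counts.items())}
```

Applying `positional_density` to all motif occurrences would give the analogue of the grey line, and applying it only to occurrences that pass the conservation test would give the analogue of the blue line.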

Second, we consider the positional distribution of the TFBS sequences that have passed the conservation test. We call this the positional distribution of TFBSs. Compared to the random distribution (the grey line in Fig. 1a), this distribution is much more similar to a bell-shaped distribution with a peak at −50 bp (the blue line in Fig. 1a). This distribution shows that the majority of the TFBSs (63%) occur within the region from −400 bp to +200 bp (kurtosis = 0.36). (Kurtosis, here taken as excess kurtosis, measures whether a distribution is more peaked or flatter than the normal distribution, which has kurtosis = 0.) Moreover, the tail of the distribution in the upstream direction drops quickly and the probability density becomes negligibly small beyond (upstream of) −2,000 bp (Supplementary Fig. S2). In comparison, the distribution drops at a much slower rate on the downstream side of the TSS. However, the region downstream of the TSS may include the 5′ UTR and part of the coding region, which are likely functional, so a TFBS-like sequence downstream of the TSS may have a higher probability of passing the conservation test than one upstream of the TSS. Note that at +200 bp the distribution is already close to the random distribution. At any rate, it should be kept in mind that the inferred distribution on the downstream side of the TSS is likely higher than the actual distribution. Under the distribution given by the blue line in Fig. 1a, the cumulative probability of TFBSs is 4.4% at −1,500 bp (Fig. 1b). This is the probability that a TFBS occurs in the region from −2,000 bp to −1,500 bp or, in other words, the probability that a TFBS would be missed if the promoter region were defined as −1,500 bp to +200 bp with respect to the TSS. Under the same distribution the cumulative probability is 14.3% at −1,000 bp and 31.8% at −500 bp (Fig. 1b). We also estimated that the cumulative probability of having a TFBS upstream of −2,000 bp is only 0.0004 (Supplementary Fig. S2).
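The two summary statistics used above, the cumulative probability in the upstream tail and the kurtosis, can be sketched as follows. This is an illustrative Python sketch, not the authors' implementation: the function names are hypothetical, and the population (biased) form of excess kurtosis is an assumption, since the text does not say which estimator was used. The 86% figure quoted in the abstract follows from the cumulative probability at −1,000 bp: 100% − 14.3% ≈ 86%.

```python
def excess_kurtosis(values):
    """Population excess kurtosis: m4 / m2**2 - 3 (0 for a normal distribution).

    Assumption: the biased, population form is used; a bias-corrected
    sample estimator would give slightly different values.
    """
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return m4 / (m2 ** 2) - 3.0

def cumulative_upstream(rel_positions, cutoff, window_start=-2000):
    """Fraction of sites lying in [window_start, cutoff), i.e. the share
    that would be missed if the promoter were defined to start at cutoff."""
    in_tail = sum(1 for p in rel_positions if window_start <= p < cutoff)
    return in_tail / len(rel_positions)
```

With TSS-relative site positions as input, `cumulative_upstream(positions, -1500)` corresponds to the 4.4% value in the text and `cumulative_upstream(positions, -1000)` to the 14.3% value.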

Third, we add the condition that a TFBS is the target of a specific TF, only if the expression profile of the gene is correlated with that of the TF gene, that is, the Pearson Correlation Coefficient (PCC) is >0.8 (see Materials and Methods). The distribution under this additional condition, which is denoted by the green line in Fig. 1a, is very similar to the blue line. Thus, adding this condition does not change the distribution much. From the above analyses, we conclude that the blue line in Fig. 1a can be taken as the distribution of TFBSs in Arabidopsis thaliana.

When the distribution is given for each TF family separately, some distributions become sharply bell-shaped, showing a peak upstream of the TSS (Fig. 2a), very close to the TSS (Fig. 2b) or downstream of the TSS (Fig. 2c). For example, the positional preference of ERF (kurtosis = 2.9) and that of GATA (kurtosis = 1.2) (Fig. 2c) are in the region from +100 bp to +200 bp, those of E2F/DP and CAMTA (kurtosis > 2.5) (Fig. 2b) are in the region from −100 bp to +100 bp and those of bZIP, bHLH and BES1 (kurtosis > 2.1) (Fig. 2a) are in the region from −100 bp to −50 bp. On the other hand, for a number of TF families, including AP2, TALE and C2H2, the distribution becomes flatter (kurtosis < 0) (Fig. 2d). Thus, different TF families appear to have different positional preferences.

Positions of DH and ChIP sites

One may also gain insight into the positional preference of TFBSs from the positional distributions of DNase I hypersensitive sites (DHSs)^12,13 or chromatin immunoprecipitation (ChIP) sites¹⁴. The DH sites we collected from the literature comprised ~62,000 peaks with an average site length of 311.7 bp and a total length of 19.2 Mbp (16.1% of the genome size) (see Materials and Methods). The ChIP-seq and ChIP-chip peaks (both denoted ChIP henceforth) comprised ~56,000 peaks with an average length of 327.4 bp. Since the peaks in the two datasets are several hundred base pairs long, the distance was calculated from the midpoint of a DH or ChIP site to the TSS of its nearest gene. The positional distributions of DH and ChIP sites (the green and orange lines in Fig. 3) are similar to that of TFBSs (the blue line in Fig. 3), with peaks near −100 bp. This observation largely supports the positional distribution of TFBSs we obtained above. However, the peaks for DH and ChIP sites are lower than that for TFBSs, perhaps for two reasons. First, a DH or ChIP peak gives only an approximate position for the TFBS: the longer the peak, the less precise the position. Second, the DH and ChIP sites were not required to pass the conservation test, whereas the TFBSs were. Note also that the peaks for DH and ChIP sites are located slightly more upstream than the TFBS peak, perhaps partly because, as mentioned above, the conservation test favors TFBS sequences located downstream of the TSS.

As the ChIP dataset covered 27 TFs in 13 TF families, we grouped the ChIP sites into families. Because a fairly large number of sites is needed to obtain a reliable distribution, we selected the 7 TF families that had >1,000 sites to examine positional preferences between TF families. Overall, the positional distributions of the ChIP sites in a family are flatter than the corresponding positional distributions of TFBSs (Fig. 4), because the average length of ChIP sites (~300 bp) is considerably longer than that of a TFBS. The peaks of the ChIP site distributions tend to lie farther upstream than those of the corresponding TFBS distributions, perhaps because the longer ChIP sites make it more difficult to determine the precise TFBS position and because the ChIP sites were not subjected to the conservation test. The positional distributions of ChIP sites for individual TFs indicate that the TFBSs of different TF families have different positional preferences.

Discussion

We used ~500 known TFBSs in A. thaliana to study the positional distribution within the promoter sequences of ~15,800 genes. The inferred distribution (kurtosis = 0.36) is somewhat sharper than the normal distribution (kurtosis = 0). It has a peak at 50 bp upstream of the TSS and the majority of TFBSs (86%) lie in the region from −1,000 bp to +200 bp with respect to the TSS. When the 41 TF families were considered separately, different positional preferences were found between families and 11 TF families showed a flatter distribution (kurtosis < 0) than the normal distribution. However, even for the flattest distribution, 79% of the TFBSs are in the region from −1,000 bp to +200 bp. Thus, this is a good region in which to look for the TFBS(s) of a TF in A. thaliana.

In this study, we only used the TFs whose TFBSs have been verified by experiment, so most of the TFBS sequences (518/586 = 88%) were assigned to unique TFs. Two TFBS sequences that are assigned to two TFs in the same TF family usually have substantial differences. Nevertheless, two TFs in the same family can share a target gene for two reasons. First, two sequences on the promoter of the gene are predicted to be similar to the two TFBSs, respectively. This is the same as the situation where two TFs in different TF families share a target gene. In this case, there are two TF binding sites. Second, a single sequence on the promoter of the gene is predicted to be similar to both TFBSs, according to the prediction method such as FIMO, which we used. This can happen because a TF usually recognizes a set of similar sequences and all prediction methods allow for such sequence variation. In this case, we count only one TF binding site.

Our distribution of TFBS in Arabidopsis and the published distributions in yeast and human all show positional preference of TFBSs with respect to the TSS and all have a probability peak upstream of the TSS. However, the probability for a TFBS to lie downstream of the TSS is negligible in yeast but substantial in Arabidopsis and human. As mentioned above, the method used to infer the distribution in Arabidopsis may produce a bias for the region downstream of TSS. For human the distribution was based on ChIP-chip data and as mentioned above this type of data might also have a bias for the downstream region of TSS. Therefore, the distributions in Arabidopsis and human should be reexamined when more suitable data becomes available.

For the human study¹, the authors proposed that the distribution consists of a narrow peak and a uniform distribution in all 9 TFs studied, especially the homeobox (HB) TFs, and they hypothesized that a TF switches its TFBS between the proximal and the distal region. In our study, there are five TF families that possess the homeodomain (HB or HD). The distributions for the HB-PHD, HB-other, TALE and WOX families showed a low peak and a long tail (Fig. 2d), whereas the HD-ZIP family showed a sharp peak and a short tail (Fig. 2a), suggesting that some homeodomain TFs in Arabidopsis have a dual binding distribution like some TFs in human.

As the amount of TFBS data used in this study is still limited and as there are still TF families in A. thaliana for which no data about the TF binding specificity is available, a more detailed study should be conducted in the future to check the accuracy of the present inferences and to expand the study to other TFs.

Materials and Methods

Collection of TF-TFBS pairs

For Arabidopsis TFBSs, we collected 478 TFBSs for 359 TFs from four databases (TRANSFAC, JASPAR, AthaMap and CIS-BP^15,16,17,18) and 108 TFBSs for 63 TFs from Franco-Zorrilla et al.¹⁹. Together, these two collections provided 586 non-redundant TFBSs for 400 TFs (Supplementary Table S1), all experimentally verified using PBM (protein binding microarray), SELEX (systematic evolution of ligands by exponential enrichment), ChIP-seq or ChIP-chip.

TFBS conservation test

For each of the collected TFBSs, we examined whether a TFBS-like sequence is present in the promoter sequence of a gene, using the software FIMO¹¹ (p-value < 10⁻⁴). The promoter sequence of a gene is defined as the region from −2,000 bp to +200 bp relative to the TSS of the gene. We then used the following three species to test the conservation of a TFBS in Arabidopsis thaliana (TAIR10): Arabidopsis lyrata (v.1.0), Brassica oleracea (v2.1) and B. rapa (IVFCAASv1). The orthologous relationships between A. thaliana and the three reference species were obtained from the Ensembl Plants ortholog definitions and we determined a one-to-one relationship by two criteria: (i) the sequence identity between the target and the query is >50% and (ii) among the potential orthologs, the chosen gene has the highest average sequence identity with the Arabidopsis gene. If no such ortholog was found for one of the three species, we discarded the gene from our analysis. For a gene under study, the promoter sequences in the three reference species were identified by alignment to the promoter sequence of the orthologous gene in A. thaliana. We required that, in the promoter sequence alignment of all four species, a TFBS-like sequence be located within 100 bp of the Arabidopsis thaliana TFBS and on the same strand¹⁰. Supplementary Fig. S3 shows that in the alignment of promoter sequences of A. thaliana and the three other species, 79% of the TFBSs have identical positions in the alignment and in less than 10% of the cases the TFBS position has moved more than 10 bp.
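The conservation criterion described above can be written down as a short check. The following Python sketch is illustrative only: the `Hit` record, the alignment-column coordinate and the function name are hypothetical, and treating "within 100 bp" as an inclusive bound is an assumption. It takes motif hits (for example, from FIMO) that have already been mapped to columns of the four-species promoter alignment.

```python
from typing import NamedTuple

class Hit(NamedTuple):
    """A motif hit mapped onto the multi-species promoter alignment
    (hypothetical record type, introduced for illustration)."""
    motif: str
    aln_pos: int   # column in the promoter alignment
    strand: str    # "+" or "-"

# The three reference species named in the text.
REFERENCE_SPECIES = ("A. lyrata", "B. oleracea", "B. rapa")

def is_conserved(site, hits_by_species, max_shift=100):
    """True if every reference species has a hit for the same motif,
    on the same strand, within max_shift alignment columns of the
    A. thaliana site (inclusive bound is an assumption)."""
    for species in REFERENCE_SPECIES:
        candidates = hits_by_species.get(species, [])
        if not any(
            h.motif == site.motif
            and h.strand == site.strand
            and abs(h.aln_pos - site.aln_pos) <= max_shift
            for h in candidates
        ):
            return False
    return True
```

A site with no matching hit in any one of the three species fails the test, mirroring the requirement that conservation hold in all three reference species.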

Target gene prediction

To assess whether a gene carrying a TFBS is regulated by the putative cognate TF, we used transcriptome (RNA-seq) data. For Arabidopsis transcriptomes, we collected three data sets: 5 transcriptomes from stomatal development²⁰, 6 transcriptomes from developing flowers²¹ and 10 transcriptomes from the apical meristem during flower initiation²². In total, we collected 21 transcriptomes.

In the 21 transcriptomes, a gene was defined as expressed, and retained in subsequent analyses, if its raw RPKM value was >1 in at least two transcriptomes. Under this criterion, there were 32,799 expressed genes in the 21 transcriptomes. For each transcriptome, the gene expression levels were then normalized by upper-quartile normalization²³. A predicted target gene of a TF is said to be co-expressed with the TF gene if the Pearson Correlation Coefficient (PCC) between their expression levels in the 21 transcriptomes is >0.8.
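The expression filter, normalization and co-expression test described above can be sketched in a few lines of standard-library Python. This is a hedged illustration rather than the authors' pipeline: the function names are hypothetical, and the details of the normalization (scaling each transcriptome by the upper quartile of its non-zero values, using the default quantile method of the `statistics` module) are assumptions not stated in the text.

```python
import math
import statistics

def is_expressed(rpkm_values, threshold=1.0, min_samples=2):
    """Gene is kept if raw RPKM > threshold in at least min_samples transcriptomes."""
    return sum(v > threshold for v in rpkm_values) >= min_samples

def upper_quartile_normalize(sample):
    """Scale one transcriptome by its upper quartile.

    Assumption: the quartile is taken over non-zero values only.
    """
    nonzero = [v for v in sample if v > 0]
    uq = statistics.quantiles(nonzero, n=4)[2]  # third cut point = upper quartile
    return [v / uq for v in sample]

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0  # a constant profile carries no correlation signal
    return cov / (sx * sy)

def is_coexpressed(tf_profile, gene_profile, min_pcc=0.8):
    """Co-expression criterion from the text: PCC > 0.8."""
    return pearson(tf_profile, gene_profile) > min_pcc
```

In use, each of the 21 transcriptomes would be normalized first, and `is_coexpressed` would then be applied to the 21-value profiles of a TF gene and each of its predicted targets.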

DHS and ChIP-seq datasets

Two datasets of DNase I hypersensitive sites (DHSs) were collected. First, a processed dataset of DHS peaks was downloaded from Zhang et al.¹³, who used two-week-old leaf tissues and closed flower buds of A. thaliana. This dataset included 55,500 peaks (FDR < 0.01) with an average length of 311.7 bp. The second set consisted of the DHS peaks in roots of 7-day-old seedlings of A. thaliana¹². This set provided a total of 43,500 peaks with an average length of 188.8 bp. Across these two datasets, two peaks were merged if they overlapped, resulting in a total of 62,300 peaks with a total length of 19.2 Mbp (Supplementary Table S2).
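Merging overlapping peaks from the two DHS datasets is a standard interval-merging step, sketched below in Python. The function names and the tuple layout are hypothetical, and the coordinate convention (closed intervals, so peaks that share an endpoint count as overlapping) is an assumption; with half-open coordinates the comparison would be `<` instead of `<=`.

```python
def merge_peaks(peaks):
    """Merge overlapping peaks.

    peaks: iterable of (chrom, start, end) tuples pooled from all datasets.
    Returns a sorted list in which overlapping peaks on the same
    chromosome have been collapsed into one interval.
    """
    merged = []
    for chrom, start, end in sorted(peaks):
        if merged and merged[-1][0] == chrom and start <= merged[-1][2]:
            last = merged[-1]
            merged[-1] = (chrom, last[1], max(last[2], end))
        else:
            merged.append((chrom, start, end))
    return merged

def total_length(peaks):
    """Summed length of a set of (chrom, start, end) peaks, in bp."""
    return sum(end - start for _, start, end in peaks)
```

Pooling both peak lists, calling `merge_peaks` and then `total_length` yields the kind of peak count and length summation reported in the text.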

For the chromatin immunoprecipitation (ChIP) data in A. thaliana, we downloaded integrated data of binding profiles for 27 TFs¹⁴, which used techniques of microarray (ChIP-chip) or deep sequencing (ChIP-seq). It contained a total of 56,600 peaks with an average length of 327.4 bp (Supplementary Table S3).

For the positional distributions of DHS and ChIP-seq peaks, we calculated the distance between the mid position of the peak and the TSS of its nearest gene in either the forward or the reverse strand.
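The distance calculation described above can be illustrated with a small Python sketch. It is not the authors' code: the function name and input layout are hypothetical, and the sign convention (negative values are upstream of the nearest gene's TSS, taking the gene's strand into account) is an assumption chosen to match the coordinates used elsewhere in the text.

```python
import bisect

def nearest_tss_offset(peak_start, peak_end, tss_list):
    """Signed distance from a peak's midpoint to the nearest TSS.

    tss_list: non-empty list of (position, strand) tuples for one
    chromosome, sorted by position and covering genes on both strands.
    Returns a negative value when the midpoint lies upstream of the TSS.
    """
    mid = (peak_start + peak_end) // 2
    positions = [pos for pos, _ in tss_list]
    i = bisect.bisect_left(positions, mid)
    # Only the TSSs immediately left and right of the midpoint can be nearest.
    candidates = tss_list[max(i - 1, 0): i + 1]
    pos, strand = min(candidates, key=lambda t: abs(t[0] - mid))
    offset = mid - pos
    return offset if strand == "+" else -offset
```

Collecting these offsets over all DH or ChIP peaks gives the values from which a positional distribution such as those in Fig. 3 can be drawn.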

Additional Information

How to cite this article: Yu, C.-P. et al. Positional distribution of transcription factor binding sites in Arabidopsis thaliana. Sci. Rep. 6, 25164; doi: 10.1038/srep25164 (2016).

References

1. Koudritsky, M. & Domany, E. Positional distribution of human transcription factor binding sites. Nucleic acids research 36, 6795–6805, 10.1093/nar/gkn752 (2008).
2. Lin, Z., Wu, W. S., Liang, H., Woo, Y. & Li, W. H. The spatial distribution of cis regulatory elements in yeast promoters and its implications for transcriptional regulation. BMC genomics 11, 581, 10.1186/1471-2164-11-581 (2010).
3. Zou, C. et al. Cis-regulatory code of stress-responsive transcription in Arabidopsis thaliana. Proceedings of the National Academy of Sciences of the United States of America 108, 14992–14997, 10.1073/pnas.1103202108 (2011).
4. Johnson, D. S., Mortazavi, A., Myers, R. M. & Wold, B. Genome-wide mapping of in vivo protein-DNA interactions. Science 316, 1497–1502, 10.1126/science.1141319 (2007).
5. Kaufmann, K. et al. Chromatin immunoprecipitation (ChIP) of plant transcription factors followed by sequencing (ChIP-SEQ) or hybridization to whole genome arrays (ChIP-CHIP). Nature protocols 5, 457–472, 10.1038/nprot.2009.244 (2010).
6. Zhu, J. Y., Sun, Y. & Wang, Z. Y. Genome-wide identification of transcription factor-binding sites in plants using chromatin immunoprecipitation followed by microarray (ChIP-chip) or sequencing (ChIP-seq). Methods in molecular biology 876, 173–188, 10.1007/978-1-61779-809-2_14 (2012).
7. Ricardi, M. M. et al. Genome-wide data (ChIP-seq) enabled identification of cell wall-related and aquaporin genes as targets of tomato ASR1, a drought stress-responsive transcription factor. BMC plant biology 14, 29, 10.1186/1471-2229-14-29 (2014).
8. Boyle, A. P. et al. High-resolution mapping and characterization of open chromatin across the genome. Cell 132, 311–322, 10.1016/j.cell.2007.12.014 (2008).
9. Neph, S. et al. An expansive human regulatory lexicon encoded in transcription factor footprints. Nature 489, 83–90, 10.1038/nature11212 (2012).
10. Yu, C. P. et al. Transcriptome dynamics of developing maize leaves and genomewide prediction of cis elements and their cognate transcription factors. Proceedings of the National Academy of Sciences of the United States of America 112, E2477–2486, 10.1073/pnas.1500605112 (2015).
11. Grant, C. E., Bailey, T. L. & Noble, W. S. FIMO: scanning for occurrences of a given motif. Bioinformatics 27, 1017–1018, 10.1093/bioinformatics/btr064 (2011).
12. Sullivan, A. M. et al. Mapping and dynamics of regulatory DNA and transcription factor networks in A. thaliana. Cell reports 8, 2015–2030, 10.1016/j.celrep.2014.08.019 (2014).
13. Zhang, W., Zhang, T., Wu, Y. & Jiang, J. Genome-wide identification of regulatory DNA elements and protein-binding footprints using signatures of open chromatin in Arabidopsis. The Plant cell 24, 2719–2731, 10.1105/tpc.112.098061 (2012).
14. Heyndrickx, K. S., Van de Velde, J., Wang, C., Weigel, D. & Vandepoele, K. A functional and evolutionary perspective on transcription factor binding in Arabidopsis thaliana. The Plant cell 26, 3894–3910, 10.1105/tpc.114.130591 (2014).
15. Bulow, L., Steffens, N. O., Galuschka, C., Schindler, M. & Hehl, R. AthaMap: from in silico data to real transcription factor binding sites. In silico biology 6, 243–252 (2006).
16. Mathelier, A. et al. JASPAR 2014: an extensively expanded and updated open-access database of transcription factor binding profiles. Nucleic acids research 42, D142–147, 10.1093/nar/gkt997 (2014).
17. Matys, V. et al. TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes. Nucleic acids research 34, D108–110, 10.1093/nar/gkj143 (2006).
18. Weirauch, M. T. et al. Determination and inference of eukaryotic transcription factor sequence specificity. Cell 158, 1431–1443, 10.1016/j.cell.2014.08.009 (2014).
19. Franco-Zorrilla, J. M. et al. DNA-binding specificities of plant transcription factors and their potential to define target genes. Proceedings of the National Academy of Sciences of the United States of America 111, 2367–2372, 10.1073/pnas.1316278111 (2014).
20. Adrian, J. et al. Transcriptome dynamics of the stomatal lineage: birth, amplification and termination of a self-renewing population. Developmental cell 33, 107–118, 10.1016/j.devcel.2015.01.025 (2015).
21. Jiao, Y. & Meyerowitz, E. M. Cell-type specific analysis of translating RNAs in developing flowers reveals new levels of control. Molecular systems biology 6, 419, 10.1038/msb.2010.76 (2010).
22. Klepikova, A. V., Logacheva, M. D., Dmitriev, S. E. & Penin, A. A. RNA-seq analysis of an apical meristem time series reveals a critical point in Arabidopsis thaliana flower initiation. BMC genomics 16, 466, 10.1186/s12864-015-1688-9 (2015).
23. Bullard, J. H., Purdom, E., Hansen, K. D. & Dudoit, S. Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments. BMC bioinformatics 11, 94, 10.1186/1471-2105-11-94 (2010).

Acknowledgements

This study was supported by Academia Sinica, Taiwan (AS-102-SS-A13 and Innovative Translational Agricultural Research Program).

Author information

Authors and Affiliations

Biotechnology Center, National Chung-Hsing University, Taichung, 40227, Taiwan
Chun-Ping Yu & Wen-Hsiung Li
Biodiversity Research Center, Academia Sinica, Taipei, 115, Taiwan
Jinn-Jy Lin & Wen-Hsiung Li
Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, 115, Taipei, Taiwan
Jinn-Jy Lin
Institute of Molecular and Cellular Biology, National Tsing Hua University, Hsinchu, 300, Taiwan
Jinn-Jy Lin
Department of Ecology and Evolution, University of Chicago, Chicago, 60637, USA
Wen-Hsiung Li

Contributions

C.-P.Y. and W.-H.L. designed the research; C.-P.Y. and J.-J.L. performed the data analysis; C.-P.Y. and W.-H.L. wrote the paper. All authors read and approved the final manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Dataset 1

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Received: 11 January 2016
Accepted: 11 April 2016