Skip to main content

An analysis and prediction of nucleosome positioning based on information content

Abstract

Nucleosome positioning plays a key role in the regulation of many biological processes. In this study, the statistical difference of information content was investigated in nucleosome and linker DNA regions across eukaryotic organisms. By analyzing the information redundancy, D k , in Saccharomyces cerevisiae, Drosophila melanogaster, and Caenorhabditis elegans genomes, the short-range dominance of nucleotide correlation in nucleosome and linker DNA regions was confirmed. Significant difference of the D k value between the nucleosome and linker DNA regions was also found. The underlying reason for many successful oligonucleotide-based predictions of nucleosome positioning in eukaryotic model organisms may be attributed to the short-range dominance of nucleotide correlation in the nucleosome and linker DNA regions. When applying power spectrum analysis to the nucleosome and linker DNA regions, some obvious differences in sequence periodic signals were observed. The parameter F k was introduced to describe particular base correlation. Furthermore, the support vector machine combining F k was used to classify nucleosome and linker DNA regions in Homo sapiens, Oryzias latipes, C. elegans, Candida albicans, and S. cerevisiae. Independent test demonstrated that a good performance can be achieved by using this algorithm. This result further revealed that base correlation information has an important role in nucleosome positioning.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2
Fig. 3

Abbreviations

MNase-chip:

Micrococcal nuclease coupled with microarrays

MNase-Seq:

Micrococcal nuclease coupled with sequencing techniques

SVM:

Support vector machine

FFT:

Fast Fourier transform

auROC:

Area under the ROC curve

References

  • Abbott DW, Ivanova VS, Wang X, Bonner WM, Ausio J (2001) Characterization of the stability and folding of H2A.Z chromatin particles: implications for transcriptional activation. J Biol Chem 276:41945–41949

    PubMed  Article  CAS  Google Scholar 

  • Albert I, Mavrich TN, Tomsho LP et al (2007) Translational and rotational settings of H2A.Z nucleosomes across the Saccharomyces cerevisiae genome. Nature 446:572–576

    PubMed  Article  CAS  Google Scholar 

  • Arya G, Maitra A, Grigoryev SA (2010) A structural perspective on the where, how, why, and what of nucleosome positioning. J Biomol Struct Dyn 27:803–820

    PubMed  Article  CAS  Google Scholar 

  • Bauer M, Schuster SM, Sayood K (2008) The average mutual information profile as a genomic signature. BMC Bioinforma 9:1–11

    Article  Google Scholar 

  • Brogaard K, Xi LQ, Wang JP, Widom J (2012) A map of nucleosome positions in yeast at base-pair resolution. Nature 486:496–501

    PubMed  CAS  Google Scholar 

  • Chang CC, Lin CJ (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2:1–27

    Article  Google Scholar 

  • Chen KF, Wang L, Yang M et al (2010) Sequence signature of nucleosome positioning in Caenorhabditis elegans. Genomics Proteomics Bioinform 8:90–102

    Google Scholar 

  • Cherry JM, Hong EL, Amundsen C et al (2012) Saccharomyces Genome Database: the genomics resource of budding yeast. Nucleic Acids Res 40:D700–D705

    PubMed  Article  CAS  Google Scholar 

  • Chodavarapu RK, Feng SH, Bernatavichute YV et al (2010) Relationship between nucleosome positioning and DNA methylation. Nature 466:388–392

    PubMed  Article  CAS  Google Scholar 

  • Collings CK, Fernandez AG, Pitschka CG, Hawkins TB, Anderson JN (2010) Oligonucleotide sequence motifs as nucleosome positioning signals. PLoS One 5:e10933

    PubMed  Article  Google Scholar 

  • Dreszer TR, Karolchik D, Zweig AS et al (2012) The UCSC Genome Browser database: extensions and updates 2011. Nucleic Acids Res 40:D918–D923

    PubMed  Article  CAS  Google Scholar 

  • Fukushima A, Ikemura T, Kinouchi M et al (2002) Periodicity in prokaryotic and eukaryotic genomes identified by power spectrum analysis. Gene 300:203–211

    PubMed  Article  CAS  Google Scholar 

  • Grosse I, Buldyrev SY, Stanley HE (2000a) Average mutual information of coding and noncoding DNA. Pac Symp Biocomput 5:611–620

    Google Scholar 

  • Grosse I, Herzel H, Buldyrev SV, Stanley HE (2000b) Species independence of mutual information in coding and noncoding DNA. Phys Rev E 61:5624–5629

    Article  CAS  Google Scholar 

  • Gupta S, Dennis J, Thurman RE, Kingston R, Stamatoyannopoulos JA, Noble WS (2008) Predicting human nucleosome occupancy from primary sequence. PLoS Comput Biol 4:e1000134

    PubMed  Article  Google Scholar 

  • Ioshikhes IP, Albert I, Zanton SJ, Pugh BF (2006) Nucleosome positions predicted through comparative genomics. Nat Genet 38:1210–1215

    PubMed  Article  CAS  Google Scholar 

  • Johnson SM, Tan FJ, McCullough HL, Riordan DP, Fire AZ (2006) Flexibility and constraint in the nucleosome core landscape of Caenorhabditis elegans chromatin. Genome Res 16:1505–1516

    PubMed  Article  CAS  Google Scholar 

  • Kaplan N, Moore I, Fondufe-Mittendorf Y et al (2010) Nucleosome sequence preferences influence in vivo nucleosome organization. Nat Struct Mol Biol 17:918–920

    PubMed  Article  CAS  Google Scholar 

  • Kaplan N, Moore IK, Fondufe-Mittendorf Y et al (2009) The DNA-encoded nucleosome organization of a eukaryotic genome. Nature 458:362–366

    PubMed  Article  CAS  Google Scholar 

  • Kunkel GR, Martinson HG (1981) Nucleosomes will not form on double-stranded RNA or over poly(dA)-poly(dT) tracts in recombinant DNA. Nucleic Acids Res 9:6869–6888

    PubMed  Article  CAS  Google Scholar 

  • Leach TJ, Mazzeo M, Chotkowski HL, Madigan JP, Wotring MG, Glaser RL (2000) Histone H2A.Z is widely but nonrandomly distributed in chromosomes of Drosophila melanogaster. J Biol Chem 275:23267–23272

    PubMed  Article  CAS  Google Scholar 

  • Lee W, Tillo D, Bray N et al (2007) A high-resolution atlas of nucleosome occupancy in yeast. Nat Genet 39:1235–1244

    PubMed  Article  CAS  Google Scholar 

  • Luo LF (1998) Statistical correlation of nucleotides in a DNA sequence. Phys Rev E 58:861–871

    Article  CAS  Google Scholar 

  • Luo LF, Tsai L, Zhou YM (1988) Informational parameters of nucleic acid and molecular evolution. J Theor Biol 130:351–361

    PubMed  Article  CAS  Google Scholar 

  • Mavrich TN, Ioshikhes IP, Venters BJ et al (2008a) A barrier nucleosome model for statistical positioning of nucleosome throughout the yeast genome. Genome Res 18:1073–1083

    PubMed  Article  CAS  Google Scholar 

  • Mavrich TN, Jiang CZ, Ioshikhes IP et al (2008b) Nucleosome organization in the Drosophila genome. Nature 453:358–364

    PubMed  Article  CAS  Google Scholar 

  • McQuilton P, St Pierre SE, Thurmond J, the FlyBase Consortium (2012) FlyBase 101—the basics of navigating FlyBase. Nucleic Acids Res 40:D706–D714

    PubMed  Article  CAS  Google Scholar 

  • Peckham HE, Thurman RE, Fu YT et al (2007) Nucleosome positioning signals in genomic DNA. Genome Res 17:1170–1177

    PubMed  Article  CAS  Google Scholar 

  • Raisner RM, Hartley PD, Meneghini MD et al (2005) Histone variant H2A.Z marks the 5′ ends of both active and inactive genes in euchromatin. Cell 123:233–248

    PubMed  Article  CAS  Google Scholar 

  • Rapoport AE, Frenkel ZM, Trifonov EN (2011) Nucleosome positioning pattern derived from oligonucleotide compositions of genomic sequences. J Biomol Struct Dyn 28:567–574

    PubMed  Article  CAS  Google Scholar 

  • Reynolds SM, Bilmes JA, Noble WS (2010) Learning a weighted sequence model of the nucleosome core and linker yields more accurate predictions in Saccharomyces cerevisiae and Homo sapiens. PLoS Comput Biol 6:e1000834

    PubMed  Article  Google Scholar 

  • Richmond TJ, Davey CA (2003) The structure of DNA in the nucleosome core. Nature 423:145–150

    PubMed  Article  CAS  Google Scholar 

  • Satchwell SC, Drew HR, Travers AA (1986) Sequence periodicities in chicken nucleosome core DNA. J Mol Biol 191:659–675

    PubMed  Article  CAS  Google Scholar 

  • Schones DE, Cui KR, Cuddapah S et al (2008) Dynamic regulation of nucleosome positioning in the human genome. Cell 132:887–898

    PubMed  Article  CAS  Google Scholar 

  • Segal E, Fondufe-Mittendorf Y, Chen LY et al (2006) A genomic code for nucleosome positioning. Nature 442:772–778

    PubMed  Article  CAS  Google Scholar 

  • Su JZ, Zhang Y, Lv J et al (2010) CpG_MI: a novel approach for identifying functional CpG islands in mammalian genomes. Nucleic Acids Res 38:e6

    PubMed  Article  Google Scholar 

  • Swamy KBS, Chu WY, Wang CY, Tsai HK, Wang D (2011) Evidence of association between nucleosome occupancy and the evolution of transcription factor binding sites in yeast. BMC Evol Biol 11:1–10

    Article  Google Scholar 

  • Tanaka Y, Nakai K (2009) An assessment of prediction algorithms for nucleosome positioning. Genome Inform 23:169–178

    PubMed  Article  CAS  Google Scholar 

  • Tanaka Y, Yamashita R, Suzuki Y, Nakai K (2010) Effects of Alu elements on global nucleosome positioning in the human genomes. BMC Genomics 11:309

    PubMed  Article  Google Scholar 

  • Tillo D, Hughes TR (2009) G+C content dominates intrinsic nucleosome occupancy. BMC Bioinform 10:442

    Article  Google Scholar 

  • Trifonov EN (2010) Base pair stacking in nucleosome DNA and bendability sequence pattern. J Theor Biol 263:337–339

    PubMed  Article  CAS  Google Scholar 

  • Valouev A, Ichikawa J, Tonthat T et al (2008) A high-resolution, nucleosome position map of C. elegans reveals lack of universal sequence-dictated positioning. Genome Res 18:1051–1063

    PubMed  Article  CAS  Google Scholar 

  • Warnecke T, Batada NN, Hurst LD (2008) The impact of the nucleosome code on protein-coding sequence evolution in yeast. PLoS Genet 4:e1000250

    PubMed  Article  Google Scholar 

  • Xing YQ, Zhao XJ, Cai L (2011) Prediction of nucleosome occupancy in Saccharomyces cerevisiae using position-correlation scoring function. Genomics 98:359–366

    PubMed  Article  CAS  Google Scholar 

  • Yuan GC, Liu JS (2008) Genomic sequence is highly predictive of local nucleosome depletion. PLoS Comput Biol 4:e13

    PubMed  Article  Google Scholar 

  • Zhang Y, Moqtaderi Z, Rattner BP et al (2009) Intrinsic histone–DNA interactions are not the major determinant of nucleosome positions in vivo. Nat Struct Mol Biol 16:847–852

    PubMed  Article  CAS  Google Scholar 

  • Zhang Z, Zhang YS, Chen W, Gutman I, Li Y (2012) Prediction of nucleosome positioning using the dinucleotide absolute frequency of DNA fragment. Match-Commun Math Comput Chem 68:639–650

    CAS  Google Scholar 

Download references

Acknowledgments

We would like to acknowledge HongYu Zhao and Xiangjun Cui for many helpful discussions. This work was supported by the Natural Science Foundation of China (61072129, 61271448, 61102162) and the Natural Science Foundation of Inner Mongolia (2011MS0504).

Author information

Affiliations

Authors

Corresponding author

Correspondence to Lu Cai.

Additional information

Responsible Editor: Tatsuo Fukagawa.

Electronic supplementary material

Below is the link to the electronic supplementary material.

ESM 1

(DOCX 18 kb)

ESM 2

(DOCX 112 kb)

ESM 3

(DOCX 221 kb)

ESM 4

(DOCX 62 kb)

ESM 5

(DOCX 73 kb)

ESM 6

(DOCX 96 kb)

ESM 7

(DOCX 301 kb)

ESM 8

(DOCX 725 kb)

ESM 9

(DOCX 798 kb)

ESM 10

(DOCX 689 kb)

ESM 11

(DOCX 782 kb)

ESM 12

(DOCX 569 kb)

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Xing, Yq., Liu, Gq., Zhao, Xj. et al. An analysis and prediction of nucleosome positioning based on information content. Chromosome Res 21, 63–74 (2013). https://doi.org/10.1007/s10577-013-9338-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10577-013-9338-z

Keywords

  • Nucleosome positioning
  • Information content
  • Power spectrum analysis
  • Support vector machine