Sequence periodic pattern of HERV LTRs: A matrix simulation algorithm
- 84 Downloads
Abstract
Flanking regulatory long terminal repeats (LTRs) in Human endogenous retrovirus (HERV) is a kind of typical DNA repeat that is widespread in the human genome. Currently, many algorithms have been developed to detect the latent periodicity of a wide range of DNA repeats. However, no such attempt was made for HERV LTRs. The present study focused on the investigation of the possible sequence periodic patterns in the HERV LTRs and their regulatory mechanisms. We calculated the sequence periods of 5′, 3′ and combined LTRs in HERVs with our devised matrix simulation algorithm. It is interesting that 5′ and 3′ LTRs have the same period of 7, and combined LTRs have a period of 9. These results indicated that HERV LTRs have predominant periodic patterns. Based on the obtained sequence periodicity, we constructed periodic consensus sequences of 5′, 3′ and combined LTRs. As to 5′ and 3′ LTRs with the same period – 7, we manually scanned the nucleotide bases in the corresponding positions of their periodic consensus sequences, and found some positions have the nucleotide base unchanged, such as the 1st, 5th and 7th positions. These conservative nucleotide base positions represent critical binding sites of regulatory LTRs, and may be indicative of conserved regulatory mechanisms in LRT-participating regulatory networks.
Keywords
DNA repeats HERV periodicity periodic consensus sequenceNotes
Acknowledgements
This work was supported the Natural Science Foundation of Heilongjiang Province (Grant Nos. ZJG0501 and GB03C602-4).
References
- Arora R, Sethares WA and Bucklew JA 2008 Latent periodicities in genome sequences. IEEE J. Select. Topic. Signal Process. 2 332–342CrossRefGoogle Scholar
- Baldi P, Brunak S, Chauvin Y, Engelbrecht J and Krogh A 1995 Periodic sequence patterns in human exons. Proc. Int. Conf. Intell. Syst. Mol. Biol. 3 30–38PubMedGoogle Scholar
- Bannert N and Kurth R 2004 Retroelements and the human genome: new perspectives on an old relation. Proc. Natl. Acad. Sci. USA 101 14572–14579PubMedCrossRefGoogle Scholar
- Benson G 1999 Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27 573–580PubMedCrossRefGoogle Scholar
- Chechetkin VR and Lobzin VV 1998 Nucleosome units and hidden periodicities in DNA sequences. J. Biomol. Struct. Dyn. 15 937–947PubMedGoogle Scholar
- Coward E and Drablos F 1998 Detecting periodic patterns in biological sequences. Bioinformatics 14 498–507PubMedCrossRefGoogle Scholar
- Dodin G, Vandergheynst P, Levoir P, Cordier C and Marcourt L 2000 Fourier and wavelet transform analysis, a tool for visualizing regular patterns in DNA sequences. J. Theor. Biol. 206 323–326PubMedCrossRefGoogle Scholar
- Domanskii AN, Akopov SB, Lebedev Iu B, Nikolaev LG and Sverdlov ED 2002 [Enhancer activity of solitary long terminal repeat of the human endogenous retrovirus of the HERV-K family]. Bioorg. Khim. 28 341–345PubMedGoogle Scholar
- Epps J 2009 A hybrid technique for the periodicity characterization of genomic sequence data. EURASIP J. Bioinform. Syst. Biol. 2009 924601Google Scholar
- Eskesen ST, Eskesen FN, Kinghorn B and Ruvinsky A 2004 Periodicity of DNA in exons. BMC Mol. Biol. 5 12PubMedCrossRefGoogle Scholar
- Hertz GZ and Stormo GD 1999 Identifying DNA and protein patterns with statistically significant alignments of multiple sequences. Bioinformatics 15 563–577PubMedCrossRefGoogle Scholar
- Huh JW, Kim DS, Kang DW, Ha HS, Ahn K, Noh YN, Min DS, Chang KT and Kim HS 2008 Transcriptional regulation of GSDML gene by antisense-oriented HERV-H LTR element. Arch. Virol. 153 1201–1205PubMedCrossRefGoogle Scholar
- Illingworth CJ, Parkes KE, Snell CR, Mullineaux PM and Reynolds CA 2008 Criteria for confirming sequence periodicity identified by Fourier transform analysis: application to GCR2, a candidate plant GPCR? Biophys. Chem. 133 28–35PubMedCrossRefGoogle Scholar
- Jackson JH, George R and Herring PA 2000 Vectors of Shannon information from Fourier signals characterizing base periodicity in genes and genomes. Biochem. Biophys. Res. Commun. 268 289–292PubMedCrossRefGoogle Scholar
- Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O and Walichiewicz J 2005 Repbase Update, a database of eukaryotic repetitive elements. Cytogenet. Genome Res. 110 462–467PubMedCrossRefGoogle Scholar
- Kumar L, Futschik M and Herzel H 2006 DNA motifs and sequence periodicities. In Silico Biol. 6 71–78PubMedGoogle Scholar
- Kwan BY, Kwan JY and Kwan HK 2011 Spectral classification of short numerical exon and intron sequences. BMC Bioinformatics 12 1–2CrossRefGoogle Scholar
- Li WH, Gu Z, Wang H and Nekrutenko A 2001 Evolutionary analyses of the human genome. Nature 409 847–849PubMedCrossRefGoogle Scholar
- Lobzin VV and Chechetkin VR 2000 [Order and correlations in genomic DNA sequences. The spectral approach]. Uspekhi Fizicheskikh Nauk 170 57–81CrossRefGoogle Scholar
- Makalowski W 2000 Genomic scrap yard: how genomes utilize all that junk. Gene 259 61–67PubMedCrossRefGoogle Scholar
- Mamedov IZ, Amosov AL, Fisunov G and Lebedev Iu B 2008 [A new database on polymorphic retroelements in human genome (PRED)]. Mol. Biol. (Mosk) 42 721–727CrossRefGoogle Scholar
- Popa A and McDowell JJ 2010 The effect of Hamming distances in a computational model of selection by consequences. Behav. Process. 84 428–434CrossRefGoogle Scholar
- Romanish MT, Lock WM, van de Lagemaat LN, Dunn CA and Mager DL 2007 Repeated recruitment of LTR retrotransposons as promoters by the anti-apoptotic locus NAIP during mammalian evolution. PLoS Genet. 3 e10PubMedCrossRefGoogle Scholar
- Ryan FP 2011 Human endogenous retroviruses in multiple sclerosis: Potential for Novel neuro-pharmacological research. Curr. Neuropharmacol. 9 360–369PubMedCrossRefGoogle Scholar
- Salem AH, Ray DA and Batzer MA 2005 Identity by descent and DNA sequence variation of human SINE and LINE elements. Cytogenet. Genome Res. 108 63–72PubMedCrossRefGoogle Scholar
- Shepelev V and Fedorov A 2006 Advances in the exon-intron database (EID). Brief Bioinform. 7 178–185PubMedCrossRefGoogle Scholar
- Silverman BD and Linsker R 1986 A measure of DNA periodicity. J. Theor. Biol. 118 295–300PubMedCrossRefGoogle Scholar
- Singer MF 1990 SINE and LINE nomenclature. Trend. Genet. 6 204CrossRefGoogle Scholar