Abstract
Although gene-free areas compose the great majority of eukaryotic genomes, a significant fraction of genes overlaps, i.e., unique nucleotide sequences are part of more than one transcription unit. In this work, the evolutionary history and origin of a same-strand gene overlap is dissected through the analysis of COG8 (component of oligomeric Golgi complex 8) and PDF (peptide deformylase). Comparative genomic surveys reveal that the relative locations of these two genes have been changing over the last 445 million years from distinct chromosomal locations in fish to overlapping in rodents and primates, indicating that the overlap between these genes precedes their divergence. The overlap between the two genes was initiated by the gain of a novel splice donor site between the COG8 stop codon and PDF initiation codon. Splicing is accomplished by the use of the PDF acceptor, leading COG8 to share the 3′end with PDF. In primates, loss of the ancestral polyadenylation signal for COG8 makes the overlap between COG8 and PDF mandatory, while in mouse and rat concurrent overlapping and non-overlapping Cog8 transcripts exist. Altogether, we demonstrate that the origin, evolution and preservation of the COG8/PDF same-strand overlap follow similar mechanistic steps as those documented for antisense overlaps where gain and/or loss of splice sites and polyadenylation signals seems to drive the process.
Similar content being viewed by others
Abbreviations
- COG8 :
-
Conserved oligomeric Golgi complex subunit 8
- PDF :
-
Peptide deformylase
- EST:
-
Expressed sequence tag
- UTR:
-
Untranslated region
References
Boi S, Solda G, Tenchini ML (2004) Shedding light on the dark side of the genome: overlapping genes in higher eukaryotes. Curr Genomics 5:509–524
Boore JL (1999) Animal mitochondrial genomes. Nucleic Acids Res 27:1767–1780
Dahary D, Elroy-Stein O, Sorek R (2005) Naturally occurring antisense: transcriptional leakage or real overlap? Genome Res 15:364–368
Dan I, Watanabe NM, Kajikawa E, Ishida T, Pandey A, Kusumi A (2002) Overlapping of MINK and CHRNE gene loci in the course of mammalian evolution. Nucleic Acids Res 30:2906–2910
Deininger PL, Moran JV, Batzer MA, Kazazian HH Jr (2003) Mobile elements and mammalian genome evolution. Curr Opin Genet Dev 13:651–658
Drummond AJ, Ashton B, Buxton S, Cheung M, Cooper A, Heled J, Kearse M, Moir R, Stones-Havas S, Sturrock S, Thierer T, Wilson A (2010) Geneious v5.0.4. http://www.geneious.com
Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32:1792–1797
Escobar-Alvarez S, Goldgur Y, Yang G, Ouerfelli O, Li Y, Scheinberg DA (2009) Structure and activity of human mitochondrial peptide deformylase, a novel cancer target. J Mol Biol 387:1211–1228
Guindon S, Gascuel O (2003) A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol 52:696–704
Hubbard TJP, Aken BL, Ayling S, Ballester B, Beal K, Bragin E, Brent S, Chen Y, Clapham P, Clarke L, Coates G, Fairley S, Fitzgerald S, Fernandez-Banet J, Gordon L, Graf S, Haider S, Hammond M, Holland R, Howe K, Jenkinson A, Johnson N, Kahari A, Keefe D, Keenan S, Kinsella R, Kokocinski F, Kulesha E, Lawson D, Longden I, Megy K, Meidl P, Overduin B, Parker A, Pritchard B, Rios D, Schuster M, Slater G, Smedley D, Spooner W, Spudich G, Trevanion S, Vilella A, Vogel J, White S, Wilder S, Zadissa A, Birney E, Cunningham F, Curwen V, Durbin R, Fernandez-Suarez XM, Herrero J, Kasprzyk A, Proctor G, Smith J, Searle S, Flicek P (2009) Ensembl 2009. Nucleic Acids Res 37:D690–D697
Hughes AL, Westover K, da Silva J, O’Connor DH, Watkins DI (2001) Simultaneous positive and purifying selection on overlapping reading frames of the tat and vpr genes of simian immunodeficiency virus. J Virol 75:7966–7972
Kapranov P, Willingham AT, Gingeras TR (2007) Genome-wide transcription and the implications for genomic organization. Nat Rev Genet 8:413–423
Kim D-S, Cho C-Y, Huh J-W, Kim H-S, Cho H-G (2009) EVOG: a database for evolutionary analysis of overlapping genes. Nucleic Acids Res 37:D698–D702
Koonin EV (2005) Orthologs, paralogs, and evolutionary genomics. Annu Rev Genet 39:309–338
Luo Y, Fu C, Zhang D-Y, Lin K (2006) Overlapping genes as rare genomic markers: the phylogeny of gamma-Proteobacteria as a case study. Trends Genet 22:593–596
Lutz CS (2008) Alternative polyadenylation: a twist on mRNA 3′ end formation. ACS Chem Biol 3:609–617
Makalowska I (2008) Comparative analysis of an unusual gene arrangement in the human chromosome 1. Gene 423:172–179
Makalowska I, Lin C-F, Makalowski W (2005) Overlapping genes in vertebrate genomes. Comput Biol Chem 29:1–12
Makalowska I, Lin C-F, Hernandez K (2007) Birth and death of gene overlaps in vertebrates. BMC Evol Biol 7:193
Mazumder B, Seshadri V, Fox PL (2003) Translational control by the 3′-UTR: the ends specify the means. Trends Biochem Sci 28:91–98
Miller RH, Kaneko S, Chung CT, Girones R, Purcell RH (1989) Compact organization of the hepatitis B virus genome. Hepatology 9:322–327
Millevoi S, Vagner S (2009) Molecular mechanisms of eukaryotic pre-mRNA 3′ end processing regulation. Nucleic Acids Res 38:2757–2774
Mizokami M, Orito E, Ohba K, Ikeo K, Lau JY, Gojobori T (1997) Constrained evolution with respect to gene overlap of hepatitis B virus. J Mol Evol 44 (Suppl 1):S83–90
Pavesi A (2006) Origin and evolution of overlapping genes in the family Microviridae. J Gen Virol 87:1013–1017
Reichert A, Rothbauer U, Morl M (1998) Processing and editing of overlapping tRNAs in human mitochondria. J Biol Chem 273:31977–31984
Sabath N, Graur D, Landan G (2008) Same-strand overlapping genes in bacteria: compositional determinants of phase bias. Biol Direct 3:36
Sabath N, Price N, Graur D (2009) A potentially novel overlapping gene in the genomes of Israeli acute paralysis virus and its relatives. Virol J 6:144
Sakharkar KR, Sakharkar MK, Verma C, Chow VTK (2005) Comparative study of overlapping genes in bacteria, with special reference to Rickettsia prowazekii and Rickettsia conorii. Int J Syst Evol Microbiol 55:1205–1209
Sanna C, Li W-H, Zhang L (2008) Overlapping genes in the human and mouse genomes. BMC Genomics 9:169
Serero A, Giglione C, Sardini A, Martinez-Sanz J, Meinnel T (2003) An unusual peptide deformylase features in the human mitochondrial N-terminal methionine excision pathway. J Biol Chem 278:52953–52963
Shintani S, O’hUigin C, Toyosawa S, Michalova V, Klein J (1999) Origin of gene overlap: the case of TCP1 and ACAT2. Genetics 152:743–754
Solda G, Suyama M, Pelucchi P, Boi S, Guffanti A, Rizzi E, Bork P, Tenchini M, Ciccarelli F (2008) Non-random retention of protein-coding overlapping genes in Metazoa. BMC Genomics 9:174
Ungar D, Oka T, Brittle EE, Vasile E, Lupashin VV, Chatterton JE, Heuser JE, Krieger M, Waters MG (2002) Characterization of a mammalian Golgi-localized protein complex, COG, that is required for normal Golgi morphology and function. J Cell Biol 157:405–415
Ware SM, El-Hassan N, Kahler SG, Zhang Q, Ma YW, Miller E, Wong B, Spicer RL, Craigen WJ, Kozel BA, Grange DK, Wong LJ (2009) Infantile cardiomyopathy caused by a mutation in the overlapping region of mitochondrial ATPase 6 and 8 genes. J Med Genet 46:308–314
Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, Church DM, DiCuccio M, Edgar R, Federhen S, Geer LY, Kapustin Y, Khovayko O, Landsman D, Lipman DJ, Madden TL, Maglott DR, Ostell J, Miller V, Pruitt KD, Schuler GD, Sequeira E, Sherry ST, Sirotkin K, Souvorov A, Starchenko G, Tatusov RL, Tatusova TA, Wagner L, Yaschenko E (2007) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 35:D5–D12
Williams BAP, Slamovits CH, Patron NJ, Fast NM, Keeling PJ (2005) A high frequency of overlapping gene expression in compacted eukaryotic genomes. In: Proceedings of the National Academy of Sciences of the United States of America 102:10936–10941
Zhao J, Hyman L, Moore C (1999) Formation of mRNA 3′ ends in eukaryotes: mechanism, regulation, and interrelationships with other steps in mRNA synthesis. Microbiol Mol Biol Rev 63:405–445
Zhou C, Blumberg B (2003) Overlapping gene structure of human VLCAD and DLG4. Gene 305:161–166
Acknowledgments
This work was supported by the Fundação para a Ciencia e a Tecnologia (FCT) grants SFRH/BD/44264/2008 to I.P.-C., SFRH/BD/23657/2005 to R.Q., Ciência 2008-ICAAM and research project POCTI/CBO/48218/2002 to L.T.C. and by POPH-QREN-Tipologia 4.2-Scientific employment promotion funded by the European Social Fund and by national funds from the Ministry of Science, Technology and Higher Education (MCTES) to L.A., and partially by research project FCOMP-01-0124-FEDER-007167 sponsored by FCT, COMPETE program and by FEDER funds. The Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP) is an Associate Laboratory of the Portuguese MCTES and is partially supported by FCT.
Conflict of interest
The authors declare that no conflict of interest exists.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Pereira-Castro, I., Quental, R., da Costa, L.T. et al. Successful COG8 and PDF overlap is mediated by alterations in splicing and polyadenylation signals. Hum Genet 131, 265–274 (2012). https://doi.org/10.1007/s00439-011-1075-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00439-011-1075-9