Abstract
Old and young duplicate genes have been reported in some organisms. However, little is known about the properties of old and young duplicate genes in Arachis. Here, we have identified old and young duplicate genes in Arachis duranensis, and analyzed the evolution, gene complexity, gene expression pattern, and functional divergence between old and young duplicate genes. Our results showed different evolutionary, gene complexity and gene expression patterns, as well as differing correlations between old and young duplicate genes. Gene ontology results showed that old duplicate genes play a crucial role in lipid and amino acid biosynthesis and the oxidation–reduction process and that young duplicate genes are preferentially involved in photosynthesis and response to biotic stimulus. Transcriptome data sets revealed that most old and young duplicate genes had asymmetric function, and only a few duplicate genes exhibited symmetric function under drought and nematode stress. We found that old duplicate genes are preferentially involved in lipid and amino acid metabolism and response to abiotic stress, while young duplicate genes are likely to participate in photosynthesis and response to biotic stress. This work provides a better understanding of the evolution and functional divergence of old and young duplicate genes in A. duranensis.
Similar content being viewed by others
References
Altschul S, Madden T, Schäffer A, Zhang J, Zhang Z, Miller W et al (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25(17):3389–3402. https://doi.org/10.1093/nar/25.17.3389
Anders S, Huber W (2010) Differential expression analysis for sequence count data. Genome Biol 11:106–110. https://doi.org/10.1186/gb-2010-11-10-r106
Anderson DE, Anderson D, Goudie A, Parker A (2013) Global environments through the quaternary: exploring environmental change. Oxford University Press, Oxford
Arendsee ZW, Li L, Wurtele ES (2014) Coming of age: orphan genes in plants. Trends Plant Sci 19(11):698–708. https://doi.org/10.1016/j.tplants.2014.07.003
Banerjee S, Chakraborty S (2017) Protein intrinsic disorder negatively associates with gene age in different eukaryotic lineages. Mol BioSyst 13:2044–2055. https://doi.org/10.1039/c7mb00230k
Bertioli DJ, Seijo G, Freitas FO, Valls JFM, Leal-Bertioli SCM, Moretzsohn MC (2011) An overview of peanut and its wild relatives. Plant Genet Resour Charact Util 9(1):134–149. https://doi.org/10.1017/S1479262110000444
Bertioli DJ, Cannon SB, Froenicke L, Huang G, Farmer AD, Cannon EKS et al (2016) The genome sequences of Arachis duranensis and Arachis ipaensis, the diploid ancestors of cultivated peanut. Nat Genet 48(4):438–446. https://doi.org/10.1038/ng.3517
Brasileiro ACM, Morgante CV, Araujo ACG, Leal-Bertioli SCM, Silva AK, Martins ACQ et al (2015) Transcriptome profiling of wild Arachis from water-limited environments uncovers drought tolerance candidate genes. Plant Mol Biol Rep 33:1876–1892. https://doi.org/10.1007/s11105-015-0882-x
Capra JA, Pollard KS, Singh M (2010) Novel genes exhibit distinct patterns of function acquisition and network integration. Genome Biol 11:R127. https://doi.org/10.1186/gb-2010-11-12-r127
Cartelle C, Hartwig WC (1996) A new extinct primate among the Pleistocene megafauna of Bahia. Brazil. Proc Natl Acad Sci USA 93(13):6405–6409. https://doi.org/10.1073/pnas.93.13.6405
Chen S, Zhang YE, Long M (2010) New genes in Drosophila quickly become essential. Science 330(6011):1682–1685. https://doi.org/10.1126/science.1196380
Chen WH, Trachana K, Lercher MJ, Bork P (2012) Younger genes are less likely to be essential than older genes, and duplicates are less likely to be essential than singletons of the same age. Mol Biol Evol 29:1703–1706. https://doi.org/10.1093/molbev/mss014
Clevenger J, Chu Y, Scheffler B, Ozias-Akins P (2016) A developmental transcriptome map for allotetraploid Arachis hypogaea. Front Plant Sci 7:1446. https://doi.org/10.3389/fpls.2016.01446
Conant GC, Wolfe KH (2008) Turning a hobby into a job: how duplicated genes find new functions. Nat Rev Genet 9:938–950. https://doi.org/10.1038/nrg2482
Cui X, Lv Y, Chen M, Nikoloski Z, Twell D, Zhang D (2015) Young genes out of the male: an insight from evolutionary age analysis of the pollen transcriptome. Mol Plant 8:935–945. https://doi.org/10.1016/j.molp.2014.12.008
Dash S, Cannon EKS, Kalberer SR, Farmer AD, Cannon SB (2016) PeanutBase and other bioinformatic resources for peanut. In: Stalker HT, Wilson RF (eds) Peanuts genetics, processing, and utilization. AOCS Press, Urbana, pp 241-252. https://doi.org/10.1016/b978-1-63067-038-2.00008-3
De Bodt S, Maere S, Van de Peer Y (2005) Genome duplication and the origin of angiosperms. Trends Ecol Evol 20(11):591–597. https://doi.org/10.1016/j.tree.2005.07.008
deMenocal PB (2001) Cultural responses to climate change during the late Holocene. Science 292(5517):667–673
Gibbard P, Kolfschoten TV (2004) The pleistocene and holocene epochs. In: Gradstein FM, Ogg JG, Smith AG (eds) A geologic time scale 2004. Cambridge University Press, Cambridge, pp 441–452
Gossmann TI, Saleh D, Schmid MW, Spence MA, Schmid KJ (2016) Transcriptomes of plant gametophytes have a higher proportion of rapidly evolving and young genes than sporophytes. Mol Biol Evol 33(7):1669–1678. https://doi.org/10.1093/molbev/msw044
Guimarães PM, Guimaraes LA, Morgante CV, Silva OB Jr, Araujo ACG, Martins ACQ et al (2015) Root transcriptome analysis of wild peanut reveals candidate genes for nematode resistance. PLoS One 10(10):e0140937. https://doi.org/10.1371/journal.pone.0140937
Gupta AK (2004) Origin of agriculture and domestication of plants and animals linked to early Holocene climate amelioration. Curr Sci Bangalore 87:54–59
Hanada K, Zou C, Lehti-Shiu MD, Shinozaki K, Shiu SH (2008) Importance of lineage-specific expansion of plant tandem duplicates in the adaptive response to environmental stimuli. Plant Physiol 148:993–1003. https://doi.org/10.1104/pp.108.122457
Hanada K, Tezuka A, Nozawa M, Suzuki Y, Sugano S, Nagano AJ et al (2018) Functional divergence of duplicate genes several million years after gene duplication in Arabidopsis. DNA Res 25(3):327–339. https://doi.org/10.1093/dnares/dsy005
Hughes PD, Woodward JC, Gibbard PL (2006) Quaternary glacial history of the Mediterranean mountains. Prog Phys Geogr Earth Environ 30:334–364
Ingvarsson PK (2007) Gene expression and protein length influence codon usage and rates of sequence evolution in Populus tremula. Mol Biol Evol 24(3):836–844. https://doi.org/10.1093/molbev/msl212
Jones P, Binns D, Chang HY, Fraser M, Li W, McAnulla C et al (2014) InterProScan 5: genome-scale protein function classification. Bioinformatics 30(9):1236–1240. https://doi.org/10.1093/bioinformatics/btu031
Kaessmann H (2010) Origins, evolution, and phenotypic impact of new genes. Genome Res 20:1313–1326. https://doi.org/10.1101/gr.101386.109
Katoh K, Standley DM (2013) MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30(4):772–780. https://doi.org/10.1093/molbev/mst010
Kochert G, Stalker H, Gimenes M, Galgaro M, Lopes C, Moore K (1996) RFLP and cytogenetic evidence on the origin and evolution of allotetraploid domesticated peanut, Arachis hypogaea (Leguminosae). Am J Bot 83(10):1282–1291
Leister D (2004) Tandem and segmental gene duplication and recombination in the evolution of plant disease resistance genes. Trends Genet 20(3):116–122
Li B, Dewey CN (2011) RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform 12:323. https://doi.org/10.1186/1471-2105-12-323
Long M, Betran E, Thornton K, Wang W (2003) The origin of new genes: glimpses from the young and old. Nat Rev Genet 4:865–875. https://doi.org/10.1038/nrg1204
Long M, VanKuren NW, Chen S, Vibranovski MD (2013) New gene evolution: little did we know. Annu Rev Genet 47:325–351. https://doi.org/10.1146/annurev-genet-111212-133301
Maere S, De Bodt S, Raes J, Casneuf T, Van Montagu M, Kuiper M et al (2005) Modeling gene and genome duplications in eukaryotes. Proc Natl Acad Sci USA 102(15):5454–5459. https://doi.org/10.1073/pnas.0501102102
Mayewski PA, Rohling EE, Stager JC, Karlén W, Maasch KA, Meeker LD et al (2004) Holocene climate variability. Quat Res 62:243–255
Mészáros B, Erdős G, Dosztányi Z (2018) IUPred2A: context-dependent prediction of protein disorder as a function of redox state and protein binding. Nucleic Acids Res 46(W1):W329–W337. https://doi.org/10.1093/nar/gky384
Nei M, Gu X, Sitnikova T (1997) Evolution by the birth-and-death process in multigene families of the vertebrate immune system. Proc Natl Acad Sci USA 94(15):7799–7806
Ohno S (1970) Evolution by gene duplication. Springer, New York
Ota T, Nei M (1994) Divergent evolution and evolution by the birth-and-death process in the immunoglobulin VH gene family. Mol Biol Evol 11(3):469–482. https://doi.org/10.1093/oxfordjournals.molbev.a040127
Panchy N, Lehti-Shiu M, Shiu SH (2016) Evolution of gene duplication in plants. Plant Physiol 171(4):2294–2316. https://doi.org/10.1104/pp.16.00523
Pujos F, Salas R (2004) A new species of Megatherium (Mammalia: Xenarthra: Megatheriidae) from the pleistocene of sacaco and tres ventanas, Peru. Palaeontology 47(3):579–604
Ramos M, Fleming G, Chu Y, Akiyama Y, Gallo M, Ozias-Akins P (2006) Chromosomal and phylogenetic context for conglutin genes in Arachis based on genomic sequence. Mol Genet Genom 275(6):578–592. https://doi.org/10.1007/s00438-006-0114-z
Seijo J, Lavia G, Fernandez A, Krapovickas A, Ducasse D, Moscone E (2004) Physical mapping of the 5S and 18S-25S rRNA genes by FISH as evidence that Arachis duranensis and A. ipaënsis are the wild diploid progenitors of A. hypogaea (Leguminosae). Am J Bot 91:1294–1303. https://doi.org/10.3732/ajb.91.9.1294
Seijo G, Lavia GI, Fernandez A, Krapovickas A, Ducasse DA, Bertioli DJ et al (2007) Genomic relationships between the cultivated peanut (Arachis hypogaea, Leguminosae) and its close relatives revealed by double GISH. Am J Bot 94(12):1963–1971. https://doi.org/10.3732/ajb.94.12.1963
Soltis PS, Marchant DB, Van de Peer Y, Soltis DE (2015) Polyploidy and genome evolution in plants. Curr Opin Genet Dev 35:119–125. https://doi.org/10.1016/j.gde.2015.11.003
Song H, Gao H, Liu J, Tian P, Nan Z (2017a) Comprehensive analysis of correlations among codon usage bias, gene expression, and substitution rate in Arachis duranensis and Arachis ipaënsis orthologs. Sci Rep 7:14853. https://doi.org/10.1038/s41598-017-13981-1
Song H, Zhang Q, Tian P, Nan Z (2017b) Differential evolutionary patterns and expression levels between sex-specific and somatic tissue-specific genes in peanut. Sci Rep 7:9016. https://doi.org/10.1038/s41598-017-09905-8
Song H, Sun J, Yang G (2018) Comparative analysis of selection mode reveals different evolutionary rate and expression pattern in Arachis duranensis and Arachis ipaënsis duplicated genes. Plant Mol Biol 98:349–361. https://doi.org/10.1007/s11103-018-0784-z
Suyama M, Torrents D, Bork P (2006) PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res 34(suppl 2):609–612. https://doi.org/10.1093/nar/gkl315
Van de Peer Y, Mizrachi E, Marchal K (2017) The evolutionary significance of polyploidy. Nat Rev Genet 18:411–424. https://doi.org/10.1038/nrg.2017.26
Vishnoi A, Kryazhimskiy S, Bazykin GA, Hannenhalli S, Plotkin JB (2010) Young proteins experience more variable selection pressures than old proteins. Genome Res 20:1574–1581. https://doi.org/10.1101/gr.109595.110
Vuilleumier BS (1971) Pleistocene changes in the fauna and flora of South America. Science 173(3999):771–780
Wang J, Tao F, Marowsky NC, Fan C (2016) Evolutionary fates and dynamic functionalization of young duplicate genes in Arabidopsis genomes. Plant Physiol 172:427–440. https://doi.org/10.1104/pp.16.01177
Wei W, Jin YT, Du MZ, Wang J, Rao N, Guo FB (2016) Genomic complexity places less restrictions on the evolution of young coexpression networks than protein–protein interactions. Mol Bio Evol 8(8):2624–2631. https://doi.org/10.1093/gbe/evw198
Whittle CA, Extavour CG (2015) Codon and amino acid usage are shaped by selection across divergent model organisms of the Pancrustacea. G3-Gene Genom Genet 5(11):2307–2321. https://doi.org/10.1534/g3.115.021402
Wilson BA, Foy SG, Neme R, Masel J (2017) Young genes are highly disordered as predicted by the preadaptation hypothesis of de novo gene birth. Nat Ecol Evol 1:0146. https://doi.org/10.1038/s41559-017-0146
Wolf YI, Novichkov PS, Karev GP, Koonin EV, Lipman DJ (2009) The universal distribution of evolutionary rates of genes and distinct characteristics of eukaryotic genes of different apparent ages. Proc Natl Acad Sci USA 106:7273–7280. https://doi.org/10.1073/pnas.0901808106
Yang Z (2007) PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol 24(8):1586–1591. https://doi.org/10.1093/molbev/msm088
Yin H, Ma L, Wang G, Li M, Zhang Z (2016) Old genes experience stronger translational selection than young genes. Gene 590:29–34. https://doi.org/10.1016/j.gene.2016.05.041
Zhang JZ (2003) Evolution by gene duplication: an update. Trends Ecol Evol 18(6):292–298. https://doi.org/10.1016/s0169-5347(03)00033-8
Zou C, Lehti-Shiu MD, Thomashow M, Shiu SH (2009) Evolution of stress-regulated gene expression in duplicate genes of Arabidopsis thaliana. PLoS Genet 5(7):e1000581. https://doi.org/10.1371/journal.pgen.1000581
Funding
This study was supported by the Forage Industrial Innovation Team, Shandong Modern Agricultural Industrial and Technical System (SDAIT-23-01), China Agriculture Research System (CARS-34) and Natural Science Foundation of Shandong Province, China (ZR2019QC017).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethics approval and consent to participate
The authors declare that this study complies with the current laws of the country in which the experiments were performed. This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Communicated by Stefan Hohmann.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
438_2019_1574_MOESM1_ESM.tif
Fig. S1 Correlation analysis of gene-expression level, gene-expression breadth, gene complexity, and substitution rate between old and young duplicate genes in A. duranensis. The figure was constructed using the gplots package in R. (TIFF 1660 kb)
438_2019_1574_MOESM2_ESM.tif
Fig. S2 Comparisons of the number of gene ontology terms between old and young duplicate genes in A. duranensis. Gene ontology (GO) of A. duranensis sequences has been released. We extracted the GO terms of each old and young duplicate genes based on the sequencing name. The figure was constructed using the ggpubr package in R. (TIFF 573 kb)
438_2019_1574_MOESM3_ESM.tif
Fig. S3 Old and young duplicate genes distributed on 10 chromosomes in A. duranensis. A. Old duplicate gene locations on the 10 chromosomes. B. Young duplicate gene locations on the 10 chromosomes. A red line indicates duplicate gene pairs in a chromosome. Grey line indicates duplicate gene pairs in a different chromosome. The chromosomal location of A. duranensis genes has been documented on https://peanutbase.org/gbrowse_aradu1.0. We extracted the chromosomal location of each old and young duplicate gene based on the sequencing name. The figure was constructed using Circos 9.0. (TIFF 1275 kb)
438_2019_1574_MOESM4_ESM.xlsx
Table S1 Young and old duplicate genes in A. duranensis. Ks: synonymous substitution rate per synonymous site; Ka: nonsynonymous substitution rate per nonsynonymous site; Ka/Ks: nonsynonymous to synonymous substitution ratio (XLSX 118 kb)
438_2019_1574_MOESM5_ESM.xlsx
Table S2 Specific gene ontology terms for cell components between old and young duplicate genes in A. duranensis. Gene ontology (GO) of A. duranensis sequences has been released. We extracted the GO terms of each old and young duplicate gene based on the sequencing name. (XLSX 12 kb)
438_2019_1574_MOESM6_ESM.xlsx
Table S3 Specific gene ontology terms for molecular functions between old and young duplicate genes in A. duranensis. Gene ontology (GO) of A. duranensis sequences has been released. We extracted the GO terms of each old and young duplicate gene based on the sequencing name (XLSX 13 kb)
438_2019_1574_MOESM7_ESM.xlsx
Table S4 Specific gene ontology terms for biological processes between old and young duplicate genes in A. duranensis. Gene ontology (GO) of A. duranensis sequences has been released. We extracted the GO terms of each old and young duplicate gene based on the sequencing name (XLSX 13 kb)
438_2019_1574_MOESM8_ESM.xlsx
Table S5 Common gene ontology terms for cell components between old and young duplicate genes in A. duranensis. Gene ontology (GO) of A. duranensis sequences has been released. We extracted the GO terms of each old and young duplicate genes based on the sequencing name (XLSX 8 kb)
438_2019_1574_MOESM9_ESM.xlsx
Table S6 Common gene ontology terms for molecular functions between old and young duplicate genes in A. duranensis. Gene ontology (GO) of A. duranensis sequences has been released. We extracted the GO terms of each old and young duplicate genes based on the sequencing name (XLSX 10 kb)
438_2019_1574_MOESM10_ESM.xlsx
Table S7 Common gene ontology terms for biological processes between old and young duplicate genes in A. duranensis. Gene ontology (GO) of A. duranensis sequences has been released. We extracted the GO terms of each old and young duplicate genes based on the sequencing name (XLSX 8 kb)
Rights and permissions
About this article
Cite this article
Song, H., Sun, J. & Yang, G. Old and young duplicate genes reveal different responses to environmental changes in Arachis duranensis. Mol Genet Genomics 294, 1199–1209 (2019). https://doi.org/10.1007/s00438-019-01574-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00438-019-01574-8