Abstract
Dipteryx alata Vog. (Fabaceae) is a tree species native to the Brazilian Cerrado with economic potential owing to its uses in food, forage, medicine, recovery of degraded areas, landscaping and timber extraction. The objective of the present study was to sequence, assemble and annotate the chloroplast genome (cp genome) of D. alata and to compare it with the cp genomes of other Fabaceae species. The cp genome is 158,647 bp in length and exhibits a quadripartite structure, with a pair of inverted repeats (IRs: 24,948 bp each) separated by the large single copy (LSC: 88,769 bp) and small single copy (SSC: 19,982 bp) regions. It contains 125 genes, of which 109 are unique, including 76 protein-coding genes (CDS), 29 transfer RNA genes (tRNA) and four ribosomal RNA genes (rRNA). Comparison of the D. alata cp genome with those of other Fabaceae indicated similar gene content, although gene losses and rearrangements were identified. The genes located in the IR regions were the most conserved, with an average nucleotide diversity (Pi) of 0.03, followed by the SSC (Pi = 0.08) and LSC (Pi = 0.09) regions. Some non-coding regions exhibited relatively high sequence divergence. The cp genome also contains 131 simple sequence repeats (SSRs), of which 121 are located in intergenic regions and ten in protein-coding regions. The most frequent SSR motifs were A/T and AT/TA. The complete cp genome sequence of D. alata reported in this paper is a valuable addition to the scarce genomic resources available for this Brazilian Cerrado species. This work presents the first complete plastome of a species belonging to the ADA clade, within the Papilionoideae subfamily, and contributes to studies of phylogeny and plastome evolution in the Fabaceae family. In addition, it provides new genetic information on plastid sequences that is useful for designing conservation and breeding strategies. Chloroplast sequences will be useful in phylogenetic studies, population genetics, phylogeography and molecular systematics.
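The nucleotide diversity (Pi) values quoted above summarize how much aligned sequences differ per site. As a minimal sketch of that statistic, and not the authors' actual pipeline, the Python function below computes the average number of pairwise differences per site for a set of aligned sequences. The function name `nucleotide_diversity` and the toy sequences are hypothetical and serve only as illustration; columns containing gaps or ambiguous bases are skipped, which is one common convention among several.

```python
from itertools import combinations


def nucleotide_diversity(aligned_seqs):
    """Average pairwise differences per site (Pi) for aligned sequences.

    Hypothetical helper for illustration: columns with gaps or
    ambiguous bases in any sequence are excluded from the count.
    """
    seqs = [s.upper() for s in aligned_seqs]
    if len(seqs) < 2:
        raise ValueError("need at least two aligned sequences")
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to the same length")

    # Keep only columns where every sequence has an unambiguous base.
    valid_cols = [i for i in range(length)
                  if all(s[i] in "ACGT" for s in seqs)]
    if not valid_cols:
        raise ValueError("no comparable sites in the alignment")

    # Count mismatches over every pair of sequences.
    pairs = list(combinations(seqs, 2))
    total_diffs = sum(
        sum(1 for i in valid_cols if a[i] != b[i]) for a, b in pairs
    )
    return total_diffs / (len(pairs) * len(valid_cols))


# Toy alignment (made-up data, not from the study).
toy = ["ACGTACGTAC", "ACGTACGTAT", "ACGAACGTAC"]
pi = nucleotide_diversity(toy)
print(f"Pi = {pi:.4f}")
```

In practice such a function would be applied per gene or per sliding window along a multi-species alignment, so that regions such as IR, SSC and LSC can be compared by their average Pi.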
Acknowledgements
Our research was supported by Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES), Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) and Fundação de Amparo à Pesquisa do Estado de Goiás (FAPEG), through the project “Núcleo de Excelência em Recursos Genéticos Vegetais do Cerrado (CERGEN)”-GECER (PRONEX/FAPEG/CNPq#CP07-2012) and the research network GENPAC (“Geographical Genetics and Regional Planning for Natural Resources in Brazilian Cerrado”) from CNPq/Fapeg/PRO-CENTRO-OESTE nº 31/2010, GENPAC 02-Proc. 563839/2010-4 and 201110267000125. M.P.C.T. and T.N.S. have been continuously supported by productivity fellowships from “Conselho Nacional de Desenvolvimento Científico e Tecnológico” (CNPq), and A.M.A. has been supported by doctorate and postdoctoral fellowships from “Coordenação de Aperfeiçoamento de Pessoal de Nível Superior” (Capes), which we gratefully acknowledge. The current research is being developed in the context of the National Institutes for Science and Technology (INCT) in Ecology, Evolution and Biodiversity Conservation, supported by MCTIC/CNPq (Proc. 465610/2014-5) and FAPEG.
Contributions
Telles and Antunes designed the research project. Telles collected the samples. Antunes extracted the genomic DNA. Telles and Antunes constructed the DNA libraries and performed the genome sequencing. Antunes, Novaes and Coelho conducted the bioinformatics analyses and generated the genome assembly and annotations. Antunes and Targueta conducted the phylogenetic analysis. Antunes uploaded the raw read data, genome assembly and annotation to the NCBI databases. Antunes wrote the paper. Telles, Soares, Targueta, Novaes and Coelho reviewed the paper. All authors read and approved the final manuscript.
Cite this article
Antunes, A.M., Soares, T.N., Targueta, C.P. et al. The chloroplast genome sequence of Dipteryx alata Vog. (Fabaceae: Papilionoideae): genomic features and comparative analysis with other legume genomes. Braz. J. Bot 43, 271–282 (2020). https://doi.org/10.1007/s40415-020-00599-3
DOI: https://doi.org/10.1007/s40415-020-00599-3