Assembly, annotation and analysis of the chloroplast genome of the Algarrobo tree Neltuma pallida (subfamily: Caesalpinioideae)

Caycho, Esteban; La Torre, Renato; Orjeda, Gisella

doi:10.1186/s12870-023-04581-5

Assembly, annotation and analysis of the chloroplast genome of the Algarrobo tree Neltuma pallida (subfamily: Caesalpinioideae)

Research
Open access
Published: 16 November 2023

Volume 23, article number 570, (2023)
Cite this article

Download PDF

You have full access to this open access article

BMC Plant Biology Aims and scope Submit manuscript

Assembly, annotation and analysis of the chloroplast genome of the Algarrobo tree Neltuma pallida (subfamily: Caesalpinioideae)

Download PDF

Esteban Caycho¹,
Renato La Torre¹ &
Gisella Orjeda¹

1430 Accesses
1 Citation
Explore all metrics

Abstract

Background

Neltuma pallida is a tree that grows in arid soils in northwestern Peru. As a predominant species of the Equatorial Dry Forest ecoregion, it holds significant economic and ecological value for both people and environment. Despite this, the species is severely threatened and there is a lack of genetic and genomic research, hindering the proposal of evidence-based conservation strategies.

Results

In this work, we conducted the assembly, annotation, analysis and comparison of the chloroplast genome of a N. pallida specimen with those of related species. The assembled chloroplast genome has a length of 162,381 bp with a typical quadripartite structure (LSC-IRA-SSC-IRB). The calculated GC content was 35.97%. However, this is variable between regions, with a higher GC content observed in the IRs. A total of 132 genes were annotated, of which 19 were duplicates and 22 contained at least one intron in their sequence. A substantial number of repetitive sequences of different types were identified in the assembled genome, predominantly tandem repeats (> 300). In particular, 142 microsatellites (SSR) markers were identified. The phylogenetic reconstruction showed that N. pallida grouped with the other Neltuma species and with Prosopis cineraria. The analysis of sequence divergence between the chloroplast genome sequences of N. pallida, N. juliflora, P. farcta and Strombocarpa tamarugo revealed a high degree of similarity.

Conclusions

The N. pallida chloroplast genome was found to be similar to those of closely related species. With a size of 162,831 bp, it had the classical chloroplast quadripartite structure and GC content of 35.97%. Most of the 132 identified genes were protein-coding genes. Additionally, over 800 repetitive sequences were identified, including 142 SSR markers. In the phylogenetic analysis, N. pallida grouped with other Neltuma spp. and P. cineraria. Furthermore, N. pallida chloroplast was highly conserved when compared with genomes of closely related species. These findings can be of great potential for further diversity studies and genetic improvement of N. pallida.

View this article's peer review reports

Comprehensive analysis of complete chloroplast genome and phylogenetic aspects of ten Ficus species

Article Open access 23 May 2022

Comparative genomics of 11 complete chloroplast genomes of Senecioneae (Asteraceae) species: DNA barcodes and phylogenetics

Article Open access 22 August 2019

Comparative analysis of complete Artemisia subgenus Seriphidium (Asteraceae: Anthemideae) chloroplast genomes: insights into structural divergence and phylogenetic relationships

Article Open access 10 March 2023

Background

The Algarrobo tree Neltuma pallida (Humb. & Bonpl. ex Willd.) Hughes & Lewis is a tree of 8 to 20 m in height, with small grayish-green leaves, uninodal axillary spines and yellow pod-shaped fruits [1, 2]. This species belongs to the genus Neltuma (Subfamily: Caesalpinioideae), which includes up to 43 potential species that are arboreal or shrubby, possess uninodal axillary spines, and are distributed in dry tropical and arid regions of America [2]. Neltuma pallida (Fig. 1), specifically, is native to arid regions of Colombia, Ecuador and Peru [1]. In Peru, Algarrobo trees grow in the Equatorial Dry Forest (3.45% of the country’s total area), an ecoregion located in the northern coastal regions such as La Libertad, Lambayeque, Piura and Tumbes [3,4,5,6], being one of the predominant species there.

Algarrobo trees hold significant economic and ecological importance in the South American countries, where they thrive [7, 8]. The species is known as “multipurpose”, offering a range of benefits to local communities [9]. The most common uses of Algarrobo trees are for fuel, medicine, cattle feed, or construction material [10]. Moreover, Algarrobo trees play a crucial role in the maintenance of their environment [11]. They provide a physical barrier to prevent wind erosion, contribute to soil fertility, maintain a microclimate and facilitate bioremediation [12, 13].

Despite the economic and ecological importance of Neltuma spp., their populations in Peru, Argentina, Chile, Venezuela, and the Chaco region (which includes some departments of Bolivia, Brazil, Argentina, and Paraguay) are experiencing a severe decline [14,15,16,17,18]. Regarding Peru, a report by the Peruvian National Forest and Wildlife Inventory [19] shows that as much as 40% of N. pallida trees are standing dead and 9% can be found as stumps. Of the 51% that remain alive, 27% display mild to severe damage. The true cause of the high mortality rate of the Algarrobo tree population is still uncertain and likely complex. Some hypotheses contemplate abiotic factors such as climate change [20] and drought [21] in the northern coast of Peru. Other hypotheses include biotic factors, such as the increasing presence of phytophagous and sucking pest insects [17, 22], now thought to be an effect of the decrease of natural biological controllers due to an ecological imbalance after recent ENSO events, or viruses of the Closteroviridae family [6, 23].

The literature shows few genetic studies on N. pallida, and genomic studies have not yet been carried out. In 2022, the National Institute of Agrarian Innovation (INIA) of Peru, together with the National University of Frontera (Piura - Peru), started a plant breeding project to improve the Algarrobo in Piura, a department in northern Peru [Arbizu, pers. comm]. Also, a project for molecular delimitation of Neltuma species has recently begun in our laboratory [Rivas M. pers. comm].

Some other works that have been published about the species focus on knowing its phenotypic variability, especially its characteristics of agronomic importance [24,25,26]. On the other hand, there are some studies from more than a decade ago that have sought to characterize the species at the genetic level [6]. These use classical molecular markers [27, 28], study the ploidy of the species [29,30,31], or analyze a single nucleotide sequence [32]. The lack of information on such an important and currently endangered species is detrimental to the development of comprehensive conservation and improvement strategies. Thus, the decline of Algarrobo tree populations continues to affect its highly fragile ecosystems, such as the deserts and dry forests of many South American countries. This is expected to lead to a decrease in endemism and biodiversity, as well as harsher living conditions.

A first step in the development of genomic studies in plants is the assembly and annotation of the chloroplast genome. This represents a faster and simpler task than sequencing and analyzing the nuclear genome due to its size and level of complexity [33, 34]. Chloroplast genome sizes range from 120 to 160 Kb, most commonly with 100 to 130 genes [35, 36]. In addition, genomic data is valuable for diversity studies [37, 38], phylogenetic analyses [39, 40], genetic improvement [41, 42], and genetic engineering of the species and closely related species [43, 44].

The objective of this work is to assemble, annotate and analyze the chloroplast genome of Neltuma pallida, and to compare it with other chloroplast genomes of close species. In this study, we discovered the chloroplast genome content of the Algarrobo tree N. pallida (subfamily: Caesalpinioideae) through its assembly, annotation and structural analysis. We also performed a comparative analysis using the sequenced chloroplast genomes of other species Prosopis sensu lato (s.l.), now the genera Neltuma, Strombocarpa and Prosopis sensu stricto (s.s.), and made a phylogenetic reconstruction to identify the relationships of N. pallida within the clade.

Results

Assembly and annotation of the chloroplast genome of Neltuma pallida

We assembled the chloroplast genome and obtained a graphical file of the assembly and the genome sequence. The graphical file was used to examine the structure and sequencing depth of the assembled genome. The assembled N. pallida chloroplast genome (Genbank: OR178743) had a length of 162,381 bp and the classical quadripartite structure (Fig. S1): a long single copy sequence (LSC) of 91,805 bp (~ 56.54% of the genome), a short single copy sequence (SSC) of 18,748 bp (~ 11.55%), and two inverted repeat (IR) regions of 25,914 bp (~ 31.91%) (Table 1).

Table 1 Summary of chloroplast genome features of N. pallida and related species

Full size table

An analysis of the assembled genome structure was performed by examining the nucleotide composition of each region (Table S1). The GC content (GC%) of the whole genome was 35.97%. LSC and SSC had lower GC content: 33.26 and 30.46%, respectively. The IRs presented a higher GC% than the other regions and the whole genome with 42.77%. Among the coding regions, rRNA coding regions (located in the IRs) had the highest GC% with 55.41%, followed by tRNA coding regions with 53.10%. The lowest GC content was found in the protein coding regions with 37.45%.

The annotation of the assembled genome was done using the chloroplast genome of N. juliflora as reference. A total of 132 genes were found in the genome (19 duplicated genes), consisting of 85 protein-coding genes, 39 tRNA-coding genes and 8 rRNA-coding genes. The 85 protein-coding genes correspond to 78,834 bp, the 39 tRNA-coding genes to 2938 bp, and the 8 rRNA-coding genes to 9052 bp (Table 2).

Table 2 Genes annotated in the N. pallida chloroplast genome grouped by category and functional group

Full size table

The annotated genes included the following (Fig. 2): 12 small ribosomal proteins (rps), 8 large ribosomal proteins (rpl), 4 DNA-dependent RNA polymerases (rpo), 4 different rRNA coding genes (rrn), 31 different tRNA coding genes (trn), 5 photosystem I proteins (psa), 15 photosystem II proteins (psb), 11 NADH dehydrogenase proteins (ndh), 6 cytochrome b6/f complex proteins (pet), 6 ATP synthase complex proteins (atp), the major subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase (rbcL), maturase K (matK), the proteolytic subunit of ATP-dependent Clp protease (clpP), membrane envelope protein (cemA), beta subunit of acetyl-CoA carboxylase (accD), cytochrome C biogenesis protein (ccsA), 4 hypothetical proteins of unknown function (ycf), and translation initiation factor 1 (infA). This makes a total of 113 different genes, 18 of which present introns (16 have one intron and 2 have two introns) (Table 3).

Table 3 Genes containing introns within the chloroplast genome of N. pallida and the length of their respective exons

Full size table

Functional annotation was conducted on the protein-coding genes sequences, to identify metabolic pathways and processes. Based on information obtained from the KEGG database, these genes belong to four primary classes (metabolism, genetic information processing, cellular processes and organ systems) (Table S2). Metabolism is the class with the highest number of genes, with energy metabolism being particularly prominent. According to the UniProt database, most of the genes are involved in biological processes, photosynthesis, and generation of metabolite precursors and energy (Fig. 3). Additionally, a large number of genes relate to cellular components and the chloroplast.

Codon usage analysis

Codon frequency (Table S3) and the Relative Synonymous Codon Usage (RSCU) were calculated for the whole exome (protein-coding regions). The most frequently used codon in the genome was AAU, which codes for Isoleucine (n = 1144), followed by AAA, which codes for Lysine (n = 1069). At the other extreme, the least used codons were the three types of STOP codons UGA (n = 17), UAG (n = 18) and UAA (n = 50). The less frequent amino acid-coding codons were UGC, coding for Cysteine (n = 85), and CGC, coding for Arginine (n = 103). Analyzing the RSCU values, it could be seen that, for each amino acid, half of the codons were used with a higher relative frequency than the other half. Also, the preferred codons were A or T/U ending codons (Fig. 4).

Identification of repetitive sequences

We searched for repetitive sequences in the genome. The tandem repeats were identified as microsatellites (SSRs) and tandem repeats in general. A total of 142 SSRs were identified (Fig. 5A), with the most abundant repeats being mononucleotide repeats (n = 78), mostly A/T mononucleotides (Fig. 5B). For the remaining SSR (di- to hexanucleotide repeats), between 4 and 18 repeats were found. Tandem repeats larger than hexanucleotides were also identified, with 164 repeats found (Fig. 5C). Thus, a total of 306 tandem repeats were found.

The rest of the repeats were palindromes, direct repeats and inverted repeats. In the genome, 33 palindromes, 250 direct repeats and 240 inverted repeats were found, most of them having between 15 and 29 nucleotides (Fig. 5C).

Phylogenomic relationships of Neltuma pallida

A phylogeny of the subfamily Caesalpinioideae was constructed using the chloroplast genomes of 30 previously published species to determine the position of N. pallida in the clade. We found that the genus Neltuma was not recovered as a monophyletic group because N. pallida was grouped with the other species of Neltuma spp. and P. cineraria (Fig. 6). Beside this, the genus Neltuma is grouped with the other genera of Prosopis s.l. used in this analysis: Prosopis s.s. and Strombocarpa. Prosopis s.l. also shows closeness to Cylicodiscus gabunensis and the Dichrostachys clade.

Sequence divergence analysis

Divergence between the sequences of N. juliflora, P. farcta, S. tamarugo and A. ligulata with N. pallida was calculated (Fig. 7). The lowest divergence was observed with N. juliflora, while the highest divergence was observed with A. ligulata. When looking at the genomic regions, it was found that the most conserved regions are the IRs, regardless of the species to which they are compared. The opposite was determined for the single copy regions (LSC and SSC), where the greatest divergence between sequences was found.

As expected, non-coding regions show the highest divergence, highlighting some intergenic regions such as trnK-rps16, trnS-trnG, trnC-petN, psbZ-trnG, trnT-trnL, rbcL-accD, accD-ycf4, rps8-rpl14 and rps3-rps19 in LSC or ccsA-ndhD and rps15-ycf1 in SSC. Also, some intronic regions show a higher degree of divergence such as in trnK, rps16, atpF, rpoC1, rpl2, rpl16 and clpP in LSC or petB, petD and ndhA in SSC.

In the coding regions the divergence was smaller, especially when compared to other Prosopis s.l. species. The genes with the greatest difference in their coding sequences are rpoC2, rpoB, accD, cemA, petA, rpl20, ycf2, ndhF and ycf1. There are also genes that show divergence only in one of the species of this group, this is the case of atpA, psbC, psaA and psbB in P. farcta.

Genetic distance of coding sequences analysis

Genetic distance was determined using the sequences of 74 chloroplast genes taken pairwise with the p-distance algorithm. For this purpose, the genome of N. pallida was compared as a reference with those of N. juliflora, P. farcta, S. tamarugo and A. ligulata (Fig. 8). As expected, the highest average distance was found between N. pallida and A. ligulata (0.0200), while the lowest was found between N. pallida and N. juliflora (0.0007).

The 10 genes with the largest genetic distance to N. pallida sequences were clpP (0.0754), ycf1 (0.0231), rpl20 (0.0188), accD (0.0187), psbT (0.0139), petN (0.0139), psbK (0.0134), rps3 (0.0133), rbcL (0.0131) and rps18 (0.0122).

Boundary between regions

The expansion and contraction of the LSC, IRB, SSC, and IRA of the N. pallida chloroplast genome were analyzed by examining the distance between their boundaries with their nearest genes (Fig. 9), and comparing these distances with related species.

In all Prosopis s.l. species, the genes closest to the boundary between the LSC and IRB regions (called JLB) were rps19 and rpl2. In all cases, the rps19 gene overlaps with both regions, and rpl2 falls within the IRB region. When comparing the distances of rpl2 to JLB between Neltuma spp. and Prosopis s.l. species, a contraction (12 bp) in IRB and an expansion in LSC are found.

Moving to the next boundary, between IRB and SSC (called JSB), in all Prosopis s.l. species, the closest genes are ycf1 and ndhF. When comparing N. pallida with the rest of the Prosopis s.l. species, we found a 17 bp contraction in IRB and a 20–112 bp contraction in SSC.

At the boundary between SSC and IRA (called JSA), the closest genes for Prosopis s.l. species were the other copy of ycf1 and trnN. When comparing the distance between this boundary and the closest genes, it was found that in the N. pallida SSC, there was a contraction of 6 bp with respect to the distance found in N. juliflora and an expansion of 17 bp with respect to the distance found in P. farcta and S. tamarugo. On the other hand, in N. pallida IRA, a contraction of 16 bp was observed with respect to the rest of the Prosopis s.l. species.

Finally, near the boundary between IRA and LSC (called JLA), the rps19 pseudogene and the trnH gene were found in Prosopis s.l. species. In the IRAs of Prosopis s.l., no differences were found between species, starting from the rps19 pseudogene in the IRA and reaching the boundary with LSC. The opposite case was observed in the LSC, when comparing N. pallida with the rest of the Prosopis s.l. species, a contraction of 12 and 182 bp was found with respect to P. farcta and N. juliflora, respectively. With respect to S. tamarugo, an expansion of only 1 bp difference was found.

Different distances were observed between boundaries and their nearest genes, these differences were small among species within Prosopis s.l. We also analyzed the case of A. ligulata and found two differences. First, the boundaries between regions in the chloroplast genome of A. ligulata were at completely different positions than in the Prosopis s.l. species used in this analysis. Second, the genes closest to the boundaries were different.

Discussion

The total genomic DNA of N. pallida was sequenced using Illumina technology, and its chloroplast genome sequences were extracted and assembled with GetOrganelle. This genome was annotated and compared with chloroplast genomes of other species of Prosopis s.l. The assembled chloroplast genome of N. pallida was 162,381 bp (162.4 Kb) long, with a classical circular quadripartite structure: two inverted repeats (IRA and IRB), a short single-copy region (SSC) and a long single-copy region (LSC) (Fig. 2; Table 1). The same quadripartite structure and similar chloroplast genome sizes have been reported in species of the genera Prosopis, Neltuma and Strombocarpa (161.5–163.7 kb) [45,46,47]. The same has been found in other mimosoid species [48, 49] and legumes [45]. In general, chloroplast genomic regions conserve similar sizes among closely related species, as in Prosopis s.l. However, some mimosoid species present regions of different length. This is the case for species of the Inga clade and Albizia spp. [50, 51], which exhibit expansions in their IRs and reductions in their SSCs, generating longer genomes. Our results agree with some studies mentioning that differences in chloroplast genome size are explained by variations in the length of single copy regions, LSC and SSC [46, 52]. Comparing the size differences between IRs and single copy regions in Prosopis s.l., it was found that differences between LSCs and SSCs were always larger than those between IRs.

In the chloroplast genome of N. pallida, 132 genes were identified and the 19 genes located in the IRs were duplicated (Fig. 2, Table 2). The genes in the IRs were also duplicated in the chloroplast genomes of P. cineraria and N. juliflora [46]. Duplication of genes in the IRs is common as it has been observed in other mimosoid species, including those with longer IRs. The difference with these species is that their IRs contain a larger number of genes [50, 51]. The same 19 duplicated genes have also been identified as duplicates in other legumes that are not mimosoids [53]. The duplication of the IRs genes in the chloroplast is common regardless of the length of the region and the phylogeny.

Gene prediction annotation identified 15 genes with one intron and 3 with two introns (clpP, rps12 and ycf3) (Table 3). Similarly, to other chloroplast genomes, rps12 was unevenly distributed in LSC (first exon) and IRB (second and third exons) [54, 55]. Additionally, in the chloroplast genome of N. pallida we identified genes that were absent in related species. For instance, we found trnG-GCC, which is not present in P. cineraria or in N. juliflora chloroplast genomes, while it is in N. glandulosa [46]. The gene coding for the transcription initiator factor, infA, was also annotated in N. pallida, as it presents an open reading frame (ORF). This ORF is also present in the chloroplast genomes of Prosopis, Strombocarpa or Neltuma, but has not been annotated in those genomes. This gene is either absent or present only as a pseudogene in many legume species [53, 56, 57]. The ORF of this gene has been identified in various chloroplast genomes of other legumes, such as Albizia julibrissin and Lespedeza spp. [51, 58]. It has been proposed that infA is a gene that has been lost multiple times from chloroplast genomes during the evolutionary history of angiosperms, and has subsequently been transferred to the nuclear genome of plants [59]. Additionally, there is the interesting case of a rps19 segment (103 bp) which is located in IRA as a pseudogene. This rps19 pseudogene has been discovered in all the other species of Prosopis s.l., with incomplete ORFs [46]. The same pseudogene has been found in non-legumes species, such as Cerasus humilis (family: Rosaceae) [60] and Garcinia paucinervis (family: Clusiaceae) [61].

The GC content (GC%) of the whole chloroplast genome of N. pallida was determined to be 35.97% (Table 1). Comparable values have been observed in the chloroplast genomes of P. cineraria and N. juliflora [46]. These values are typical for most sequenced chloroplast genomes, with GC content around 36.2%, except for certain taxonomic groups like Selaginella spp. (family Selaginellaceae), with 54.8% [62, 63]. Lower GC% values were found in single copy regions (33.26 and 30.46% for LSC and SSC, respectively), but GC% was higher in IRs (42.77%). Previous studies have suggested that this increase in GC% value in IRs is due to the presence of rRNA and tRNA genes [46, 64, 65]. Our results support this presumption as the GC% of rRNA and tRNA genes was calculated to be higher than 53%. It is noteworthy that the high GC% found in IRs aids in their preservation, as it has been shown to decelerate the rate of nucleotide substitution [66].

Regarding codon usage in protein-coding genes, a clear preference was observed for those ending in A or T/U, having RSCU values higher than 1. This pattern was observed in Albizia julibrissin [51] and Stryphnodendron adstringens [48], both species from the subfamily Caesalpinioideae, and in more distant species such as Salix floderusii [67] or Chrysosplenium spp. [68]. Also, Duan et al. [69] found a preference for the use of codons ending in A or T/U. They determined that there is a selective pressure favoring the use of these codons. This evolutionary pressure has only been detected in the third position of the codon, but not in the rest of the chloroplast genome. This positive selective pressure facilitates the expression efficiency and conservation of highly important genes [70], which may explain the observed conservation in the third position of codons.

We identified 142 microsatellites (SSRs) in the N. pallida chloroplast genome, with mononucleotide repeats being the most abundant. This is similar to the findings of Asaf et al. [46] in the chloroplast genomes of P. cineraria and N. juliflora, although they found a lower number of SSRs and did not find penta- or hexanucleotides in those genomes. We also identified more repeats of all types compared to other chloroplast genomes of Prosopis, Strombocarpa or Neltuma. This could be attributed to the use of distinct tools to detect repetitive sequences. As noted by Das & Ghosh [71] dedicated software programs for the identification of repetitive sequences commonly produce varying outcomes based on the algorithm utilized. Despite the challenges that arise from algorithmic disparity, it is still feasible to draw general comparisons, such as the higher number of repetitive mononucleotides among SSRs in the chloroplast genomes of distant species [39, 72, 73]. There is a need to establish a standardized methodology for the identification of repetitive sequences in the chloroplast genomes. This will allow for more reliable comparisons. Repetitive sequences analysis within chloroplast genomes is of high importance as they serve to study genetic diversity and phylogeny through molecular markers development, as emphasized in previous work [72, 74]. Likewise, chloroplast genomes exhibit considerable variation in their repetitive sequences’ number of tandem repeats, which may be useful for detecting population-level polymorphisms [75].

The chloroplast genome sequence is a powerful tool frequently utilized to elucidate phylogenetic relationships [76, 77]. However, cases of discordance between chloroplast and nuclear phylogenies can also be found [78, 79]. This is why information from both sources should be used concurrently when possible. In this work, N. pallida formed a subclade with N. juliflora, N. glandulosa and P. cineraria. The subclade with these four species, formerly included in the Prosopis genus, was reported by Asaf et al. [46], as a monophyletic group. This differs completely with Hughes et al. [2] whose recent results support that Prosopis s.l. is polyphyletic. Their research included about 1000 nuclear genes sequenced by Ringelberg et al. [80] in combination with morphological characters. The author suggested separating Prosopis s.l. into 4 different genera, placing N. pallida, N. juliflora and N. glandulosa in the genus Neltuma. On the other hand, P. cineraria and P. farcta, remained as species of the genus Prosopis s.s. P. cineraria and P. farcta originated and can be found in western Asia [81,82,83], while N. pallida, N. juliflora and N. glandulosa are native to the Americas [2, 7]. Our outcomes diverged from Hughes et al.’s [2] research since we found that P. cineraria is distanced from P. farcta in our phylogenetic tree (Fig. 6). Literature reports further cases of incongruent phylogeny between nuclear and chloroplast information [84,85,86,87,88,89]. In all these cases, incomplete lineage delimitation or introgression/hybridization was found to have influenced the discordances. Both options are feasible in certain Caesalpinioideae species, such as Prosopis cineraria. There is also evidence of interspecific and intergeneric hybridization in Prosopis s.l. [90, 91]. Nevertheless, due to the lack of reference chloroplast genome sequences for other Caesalpinioideae species, they could not be included in our analysis, which hinders the draw of conclusion.

When analyzing the divergence between chloroplast genome sequences among species of Prosopis s.l., a high level general of conservation was observed. However, it was also found: first, that non-coding regions, including intergenic sequences and introns, display the lowest similarity; second, that LSC and SSC exhibit the lowest identity, as has been seen across different taxonomic groups [92,93,94]; and lastly, that IRs are highly conserved, which aligns with the importance of the biological functions they serve in terms of genomic stabilization and gene conservation [95]. Thus, a single mutation in IRs may cause structural and sequence changes at many other sites in chloroplast genomes [96,97,98]. Another interesting point when comparing chloroplast sequences of Prosopis s.l. is that variations were only detected in 9 genes. A similar finding was made by Asaf et al. [46] in their comparison of N. juliflora and P. cineraria, with N. glandulosa, showing divergence in 5 of these 9 genes. This may indicate that this group’s chloroplast genomes are particularly susceptible to mutations in these genes. Despite the observed variability, the genetic distances were minimal. The clpP gene exhibited the most significant variability (0.0754). This gene, which encodes a subunit of the ATP-dependent chloroplast protease, has been reported to show a high rate of amino acid substitution, which is associated with protein structure variability [99]. The second gene with the highest genetic distance was ycf1 (0.0231). Some studies highlight that ycf1, in combination with the intergenic region between it and the ndhF gene, can be applied as a barcode for land plants [100, 101], indicating its high diversity level.

Boundary shift analysis among chloroplast genome regions indicates that N. pallida differs from other Prosopis s.l. species in the positions of all boundaries. Yet, these differences were minor, with a maximum displacement of a few hundred bases at the beginning of LSC. Asaf et al. [46] reported similar results when evaluating other Prosopis s.l. species. The shifting of boundaries between regions is a primary factor to the variation in chloroplast genome size. This has been observed when comparing closely related species in different groups, and it can be explained by the expansion and contraction of chloroplast regions [102,103,104]. Nonetheless, these variations are typically minor among related species, resulting in similar lengths of their chloroplast genomes and regions.

We successfully sequenced, assembled and annotated the chloroplast genome of N. pallida in this study. Our results allowed us to make comparisons with other species of Prosopis s.l., revealing a high degree of similarity with some differences at the structural and genetic level. We also used the sequence produced along with other published chloroplast sequences to perform a phylogenomic analysis that showed that N. pallida grouped with the other Neltuma species and with P. cineraria. Finally, divergence comparisons with other chloroplast genomes of Neltuma and Prosopis s.l. showed that within the group exists a high level of sequence identity. Nevertheless, certain divergent sequences and genes that could be interesting for the development of molecular markers. The data generated by this research can aid in the development of new lines of research that enhances the understanding of the diversity and preservation of this species in a more effective manner .

Conclusions

The Neltuma pallida chloroplast genome closely resembles those of closely related species. It has a size of 162,381 bp with a classical quadripartite structure and a GC content of 35.97%. The genome contains 132 genes, comprising 85 protein-coding genes, 8 rRNA-coding genes and 39 tRNA-coding genes. The codon usage analysis of the 85 protein-coding genes showed that isoleucine and lysine were the two most prevalent amino acids, and there was a clear preference for codons that have A or T/U in their third position. Also, the repetitive sequence identification enabled us to discover 142 SSR with potential as population-level markers.

Phylogenetic reconstruction revealed that N. pallida grouped together with the other species of the genus Neltuma and with P. cineraria. Additionally, the comparison of the N. pallida chloroplast genome with others from close species exhibited a high degree of similarity, particularly in coding regions. These findings can be useful for further diversity or genetic improvement studies in N. pallida.

Methods

Plant material

Young leaves of N. pallida were collected from an adult Algarrobo tree situated in the Bosque de Pómac Historic Sanctuary, Lambayeque Department, Peru (6°26′39.4″ S 79°48′16.6.6″ W). The collection was carried out under the authorization granted by “Resolución Jefatural de Santuario Histórico Bosque de Pómac N° 003-2020-SERNANP-JEF”. The collected samples were transported in paper envelopes for DNA extraction. The species was identified by PhD (c) Marinoli Rivas from the Laboratorio de Gimnospermas y Monocotiledoneas of the Museo de Historia Natural – UNMSM, using the descriptions made by Burkart [1] and Hughes et al. [2]. The voucher is available at the Museo de Historia Natural - UNMSM herbarium (USM N° 335,439) in Lima, Peru.

DNA extraction and sequencing

For DNA extraction, we used 0.2 g of ground sample in 2% CTAB buffer and followed the protocol published by Doyle [105] with minor modifications. The extracted DNA was purified with a DNAse-free RNAse A treatment at 37 °C for 1 hr. DNA quality and concentration were assessed with Nanodrop™ One C (Thermo Scientific, Massachusetts, USA) and Qubit™ 4 (Invitrogen, Massachusetts, USA), respectively. Additionally, sample integrity was also verified by 1% agarose gel electrophoresis.

DNA sequencing was performed using Illumina methodology by contracting the services of an external laboratory. The TruSeq DNA PCR-Free kit (Illumina Inc., California, USA) was used to construct the sequencing library.

The quality of the generated reads was assessed with FastQC v0.11.9 (https://github.com/s-andrews/FastQC) and then filtered with Trimmomatic v0.39 (https://github.com/usadellab/Trimmomatic) [106].

De novo assembly of Neltuma pallida chloroplast genome

To obtain the sequence of the N. pallida chloroplast genome (Genbank: OR178743), a de novo assembly was performed with GetOrganelle v1.7.6.1 (https://github.com/Kinggerm/GetOrganelle) [107]. This is a toolkit that combines Bowtie2 [108], BLAST [109], SPAdes [110], and Python libraries to identify sequences specific to the chloroplast DNA and assemble the chloroplast genome. GetOrganelle parameters were tuned to use the filtered data in the previous step (forward and reverse reads), and to conduct 1000 rounds of extension iterations. Also, we set up GetOrganelle to search sequences corresponding to plant plastids. The other options were left as default. The obtained GFA file was visualized with Bandage v0.9.0 (https://rrwick.github.io/Bandage/) [111] to explore the chloroplast genome structure. The average coverage of the final genome assembly was 130X.

Genome annotation of Neltuma pallida chloroplast genome

Genome annotation by prediction was performed using GeSeq v2.03 [112] in the Chlorobox web server (https://chlorobox.mpimp-golm.mpg.de/), with N. juliflora (Genbank: NC049132) taken as reference. Chloroplast Inverted Repeats (IR), rps12 interspersed gene, protein-coding sequences, transfer RNAs (tRNAs), and ribosomal RNAs (rRNAs) were all annotated. For proteins and RNAs, 25 and 85% identity were set as thresholds for annotation, respectively. Furthermore, tRNAscan-SE v2.0.7 [113], found on the same server, was used as a secondary tRNA annotator. Additionally, the external annotator Chloë v0.1.0 (https://github.com/ian-small/chloe), which is also found on the web server, was utilized. A manual curation was performed to finish the annotation for each gene, comparing the genes with their homologues found in other chloroplast genomes of species of the genera Neltuma, Prosopis and Strombocarpa.

The sequences of the annotated protein-coding genes were blasted in the KEGG (https://www.kegg.jp/) and UniProt (https://www.uniprot.org/) databases. In this way we sought to perform functional annotation.

Codon usage of protein-coding sequences

This analysis was solely performed on protein-coding sequences. Codon usage was analyzed by calculating codon frequency and the Relative Synonym Codon Usage (RSCU) values. If the RSCU was greater than 1, this codon was considered to be used more frequently, whereas if the RSCU was less than 1, the opposite was considered to be true. DAMBE5 v7.3.2 (http://dambe.bio.uottawa.ca/index.aspx) [114] was used to perform this analysis.

Identification of repetitive sequences in the chloroplast genome

Several programs were employed for the identification of repetitive sequences. The identification of tandem repeats, both microsatellite repeats (SSR, Short Sequence Repeat) and long sequences was performed with Phobos v3.3.12 (https://www.ruhr-uni-bochum.de/ecoevo/cm/cm_phobos.htm) [115]. The threshold for determining repeats was a minimum of 10 repeats for mononucleotides, 8 repeats for dinucleotides, 4 repeats for trinucleotides and tetranucleotides, and 3 repeats for pentanucleotides and hexanucleotides.

For the other repetitive sequences: palindromes (P), forward (F), and reverse (R), the RepEX web server (http://bioserver2.physics.iisc.ac.in/RepEx/index.html) [116] was used. Additionally, IUPACpal was used (https://sourceforge.net/projects/iupacpal/) [117] for palindromic sequences, and Vmatch v2.3.1 (http://www.vmatch.de/) [118] for forward repeats. For all of these repeats, a minimum length of 15 bp was considered, and 90% identity with respect to their template as threshold.

Phylogenomic relationships of Neltuma pallida

The N. pallida chloroplast genome was aligned with previously published chloroplast genomes of 30 species of the Caesalpinioideae subfamily (Table S7). Sequences were retrieved from the NCBI database (https://www.ncbi.nlm.nih.gov/) and aligned using the MAFFT web server (https://mafft.cbrc.jp/alignment/software/) [119] with default options.

Phylogenetic inference was done with two types of algorithms: Bayesian Inference (BI), in Beast2 v2.7.3 (https://www.beast2.org/) [120], and Maximum Likelihood (ML), in RAxML-HPC2 v8.2.12 [121]. For BI, the GTR + I + G substitution model (Yang 1994) with a Markov Chain Monte Carlo of 1,500,000 steps was used, removing the initial 10% of these. For ML, the GTR + I + G model with 1000 Bootstrap replicates was used. We used jModelTest 2.1.10 v20160303 (https://github.com/ddarriba/jmodeltest2) [122] to determine the substitution models.

Sequence divergence of chloroplast genomes

The Vista web server (https://genome.lbl.gov/vista/index.shtml) [123], with the mVista function in Shuffle-LAGAN mode was utilized for sequence divergence analysis. This mode enables the detection of rearrangements within genomes. The chloroplast genome sequence of N. pallida was used as a reference along with its annotation. The comparison was made with the chloroplast genomes of N. juliflora, P. farcta (Genbank: NC061227), S. tamarugo (Genbank: NC060845) and A. ligulata (Genbank: NC026134).

Genetic distance analysis of coding sequences

Sequences of 74 chloroplast genes from N. pallida, N. juliflora, P. farcta, S. tamarugo and A. ligulata (Genbank: NC026134) were aligned using ClustalW [124] in Bioedit v7.2.6 [125]. The generated alignments were used to calculate the genetic distance of the sequences of all species from N. pallida. The genetic distance was calculated using the p-distance algorithm with Mega X v10.1.8 (https://www.megasoftware.net/) [126].

Boundary shift analysis between chloroplast genome regions

A comparison was made of the position of boundaries between single copy regions (LSC and SSC) and inverted regions (IRA and IRB) in 5 species of the subfamily Caesalpinioideae: N. pallida, N. juliflora, P. farcta, S. tamarugo and A. ligulata. These boundaries were called JLB (LSC-IRB boundary), JSB (IRB-SSC), JSA (SSC-IRA) and JLA (IRA-LSC). For the positions of the boundaries, the distance between the boundaries and the genes closest to or over the boundaries was calculated.

Availability of data and materials

The data supporting the findings of this study is freely available in GenBank on the NCBI website (https://www.ncbi.nlm.nih.gov/) using the accession number OR178743, which corresponds to the Neltuma pallida chloroplast genome sequence. The reads that were used to assemble the chloroplast genome in this study were deposited at the NCBI Sequence Read Archive (SRA) under accession SRR25007997.

Abbreviations

BI:: Bayesian Inference
F:: Forward repeat
IR:: Inverted Repeat
JLA:: Boundary between LSC and IRA
JLB:: Boundary between LSC and IRB
JSA:: Boundary between SSC and IRA
JSB:: Boundary between SSC and IRB
LSC:: Large Single Copy
ML:: Maximum Likelihood
ORF:: Open Reading Frame
P:: Palindrome repeat
R:: Reverse repeat
rRNA:: Ribosomal RNA
RSCU:: Relative Synonymous Codon Usage
SSC:: Small Single Copy
SSR:: Simple Sequence Repeat or Microsatellite
tRNA:: Transfer RNA

References

Burkart A. A monograph of the genus Prosopis (leguminosae Subfam. Mimosoideae). J Arnold Arbor. 1976;57(4):450–525.
Article Google Scholar
Hughes CE, Ringelberg JJ, Lewis GP, Catalano SA. Disintegration of the genus Prosopis L. (Leguminosae, Caesalpinioideae, mimosoid clade). Phyto Keys. 2022;205:147–89.
PubMed PubMed Central Google Scholar
Padrón E, Navarro-Cerrillo RM. Aboveground biomass in Prosopis pallida (Humb. and Bonpl. ex Willd.) H. B. K. ecosystems using Landsat 7 ETM+ images. Rev Chil Hist Nat. 2007;80(1)
Organismo de Supervisión de los Recursos Forestales y de Fauna Silvestre. Aprovechamiento forestal maderable en bosques secos en el norte del Perú. Lima Peru: Presidencia del Consejo de Ministros; 2018. https://www.osinfor.gob.pe/publicaciones/aprovechamiento-forestal-maderable-en-bosques-secos-en-el-norte-del-peru/. Accessed 2 Jan 2023.
Cruzado Jacinto L, Chávez-Villavicencio CL, Charcape-Ravelo JM. Uso y selección de las partes aéreas del Algarrobo Prosopis pallida (Fabaceae) por reptiles, aves y mamíferos en Sechura (Piura – Perú). Rev Peru Biol. 2019;26(1):081–6.
Article Google Scholar
Forestal SN, y de Fauna Silvestre. Avances Sobre la Investigación de “Algarrobo” Prosopis (Fabaceae) en la Costa Norte del Perú. Lima Peru: Ministerio de Desarrollo Agrario y Riego; 2021. https://www.gob.pe/institucion/serfor/informes-publicaciones/1977079-avances-sobre-la-investigacion-de-algarrobo-en-la-costa-norte-del-peru. Accessed 2 Jan 2023
Google Scholar
Barros AS. El género Prosopis, valioso recurso forestal de las zonas áridas y semiáridas de América. Asia y Africa Cienc Investig For. 2010;16(1):91–127.
Google Scholar
Moreno MC. Torres LM del R, Campos CM. Nuevos aportes al uso de Prosopis flexuosa en el centro oeste de Argentina y su interpretación en el marco general de la ecorregión del monte. 2018;16(3):18–35.
Google Scholar
Mostacero León J, editor. Plantas medicinales del Perú: taxonomía, ecogeografía, fenología y etnobotánica. 1st ed. Asamblea Nacional de Rectores.: Lima, Peru; 2011.
Google Scholar
Depenthal J, Yoder LSM. Community use and knowledge of Algarrobo (Prosopis pallida) and implications for Peruvian dry Forest conservation. Rev Cienc Ambient. 2017;52(1):49.
Article Google Scholar
Salazar Zarzosa P, Mendieta-Leiva G, Navarro-Cerrillo RM, Cruz G, Grados N, Villar R. An ecological overview of Prosopis pallida, one of the most adapted dryland species to extreme climate events. J Arid Environ. 2021;193:104576.
Article Google Scholar
Mokgalaka-Matlala NS, Flores-Tavizón E, Castillo-Michel H, Peralta-Videa JR, Gardea-Torresdey JL. Arsenic tolerance in mesquite (Prosopis sp.): low molecular weight thiols synthesis and glutathione activity in response to arsenic. Plant Physiol Biochem. 2009;47(9):822–6.
Article CAS PubMed Google Scholar
Santos-Jallath J, Castro-Rodríguez A, Huezo-Casillas J, Torres-Bustillos L. Arsenic and heavy metals in native plants at tailings impoundments in Queretaro. Mexico Phys Chem Earth Parts ABC. 2012;37–39:10–7.
Article Google Scholar
D’Aubeterre R, Principal J, Barrios C, Graterol Z. Insectos plaga en vainas de Acacia tortuosa y Prosopis juliflora para consumo animal en las zonas semiáridas del estado Lara, Venezuela. Zootec Trop. 2012;30(2):147–53.
Google Scholar
Calderón Jiles GB. Prosopis tamarugo Phil. un árbol endémico del desierto de Atacama. Propuesta de niveles umbrales de profundidad freática para su conservación [Tesis para optar al título de Ingeniero en Recursos Naturales Renovables]. Santiago Chile: Universidad de Chile; 2016. https://repositorio.uchile.cl/handle/2250/151714
Google Scholar
Iturre MC, Araujo PA, Rueda MP, Rueda CV, Pece MG. Reclutamiento y mortalidad de las principales especies arbóreas del Chaco Semiárido. Argentina Quebracho - Rev Cienc For. 2020;28(2):134–44.
Google Scholar
Whaley OQ, Borda C, Moat J, Wilkinson T, Bravo Sánchez A, Gagné RJ, et al. Ecología y diagnóstico de Enallodiplosis discordis (Diptera: Cecidomyiidae): un nuevo defoliador ferozcon repercusiones directas en la pérdida del bosqueseco de Prosopis y los medios de vida en Perú. Rev Peru Biol. 2020;27(4):451–82.
Article Google Scholar
García J, Alvárez P, Izzo M, Zurita-Bianchini C, Sarmiento M. Establecimiento y reposición de pérdidas en plantaciones de Prosopis alba en Santiago del Estero. Argentina Revista Forestal del Perú. 2022;37(1):41–53.
Article Google Scholar
Forestal SN, y de Fauna Silvestre. Informe del Inventario Nacional Forestal y de Fauna Silvestre - Primer Panel. Lima Peru: Ministerio de Desarrollo Agrario y Riego; 2019. https://www.serfor.gob.pe/portal/wp-content/uploads/2020/03/INFORME-DEL-INFFS-PANEL-1.pdf. Accessed 2 Jan 2023
Google Scholar
Ramos Jauregui LYR. Estimación del Efecto del Cambio Climático en la Precipitación en la Costa Norte del Perú Usando Simulaciones de Modelos Climáticos Globales [Tesis para optar al título de Ingeniero Meteorólogo]. Lima Peru: Universidad Nacional Agraria La Molina; 2014. http://met.igp.gob.pe/publicaciones/2014/tesis_final_yakelyn_ramos.pdf
Google Scholar
Autoridad Nacional del Agua. Evaluación de la veda en el acuífero de Motupe y margen derecha del río La Leche. Ministerio de Desarrollo Agrario y Riego. 2016; https://www.ana.gob.pe/sites/default/files/normatividad/files/estudio_evaluacion_de_la_veda_acuifero_motupe.pdf. Accessed 16 Jan 2023
Juárez NG, Grados QN, Cruz AG. Insectos asociados a Prosopis pallida (Humb.& Bonpl.ex.Wild.) en el campus de la Universidad de Piura, Perú. Zonas Áridas. 2016;16(1):28.
Article Google Scholar
Inca’ Biotec. Servicio de Análisis Molecular de Agentes Patógenos del Algarrobo para Determinar Causas de Mortalidad. Tumbes Peru; 2019.
Alban L, Matorel M, Romero J, Grados N, Cruz G, Felker P. Cloning of elite, multipurpose trees of the Prosopis juliflora/pallida complex in Piura. Peru Agrofor Syst. 2002;54(3):173–82.
Article Google Scholar
Balcazar Huamán MA. Variabilidad de rasgos funcionales de 7 poblaciones de Algarrobo (Prosopis pallida H.B.K) bajo condiciones controladas [Tesis para optar al título de Ingeniero Agrónomo]. Piura Peru: Universidad Nacional de Piura; 2017. http://repositorio.unp.edu.pe/handle/UNP/1253
Google Scholar
Chinguel-Laban DO, Charcape-Ravelo JM. Biomasa y secuestro de carbono radicular, en bosque natural de “Algarrobo” en el campus de la Universidad de Piura (UDEP). Rev Investig Agroproducción Sustentable. 2019;3(3):53–66.
Article Google Scholar
Landeras G, Alfonso M, Pasiecznik NM, Harris PJC, Ramírez L. Identification of Prosopis juliflora and Prosopis pallida accessions using molecular markers. Biodivers Conserv. 2006;15(5):1829–44.
Article Google Scholar
Palacios RA, Burghardt AD, Frías-Hernández JT, Olalde-Portugal V, Grados N, Alban L, et al. Comparative study (AFLP and morphology) of three species of Prosopis of the Section Algarobia: P. juliflora, P. pallida, and P. limensis. Evidence for resolution of the “P. pallida–P. juliflora complex”. Plant Syst Evol. 2012;298(1):165–71.
Article Google Scholar
Bukhari YM, Nuclear DNA. Amounts in Acacia and Prosopis (Mimosaceae) and their evolutionary implications. Hereditas. 2004;126(1):45–51.
Article Google Scholar
Harris P. Differentiation of Prosopis juliflora (Sw.) DC. and P. pallida (H. & B. ex. Willd.) H.B.K. Using foliar characters and ploidy. For Ecol Manag. 2003;180(1–3):153–64.
Article Google Scholar
Trenchard LJ, Harris PJC, Smith SJ, Pasiecznik NM. A review of ploidy in the genus Prosopis (Leguminosae). Bot J Linn Soc. 2008;156(3):425–38.
Article Google Scholar
Bessega C, Vilardi JC, Saidman BO. Genetic relationships among American species of the genus Prosopis (Mimosoideae, Leguminosae) inferred from ITS sequences: evidence for long-distance dispersal. J Biogeogr. 2006;33(11):1905–15.
Article Google Scholar
Palmer JD. Comparative organization of chloroplast genomes. Annu Rev Genet. 1985;19(1):325–54.
Article CAS PubMed Google Scholar
De Las RJ, Lozano JJ, Ortiz AR. Comparative analysis of chloroplast genomes: functional annotation, genome-based phylogeny, and deduced evolutionary patterns. Genome Res. 2002;12(4):567–83.
Article Google Scholar
Sugiura M, Hirose T, Sugita M. Evolution and mechanism of translation in chloroplasts. Annu Rev Genet. 1998;32(1):437–59.
Article CAS PubMed Google Scholar
Turudić A, Liber Z, Grdiša M, Jakše J, Varga F, Šatović Z. Towards the well-tempered chloroplast DNA sequences. Plants. 2021;10(7):1360.
Article PubMed PubMed Central Google Scholar
Ahrens CW, Supple MA, Aitken NC, Cantrill DJ, Borevitz JO, James EA. Genomic diversity guides conservation strategies among rare terrestrial orchid species when taxonomy remains uncertain. Ann Bot. 2017;119(8):1267–77.
Article PubMed PubMed Central Google Scholar
Supple MA, Shapiro B. Conservation of biodiversity in the genomics era. Genome Biol. 2018;19(1):131.
Article PubMed PubMed Central Google Scholar
Li QJ, Su N, Zhang L, Tong RC, Zhang XH, Wang JR, et al. Chloroplast genomes elucidate diversity, phylogeny, and taxonomy of Pulsatilla (Ranunculaceae). Sci Rep. 2020;10(1):19781.
Article CAS PubMed PubMed Central Google Scholar
Shang C, Li E, Yu Z, Lian M, Chen Z, Liu K, et al. Chloroplast genomic resources and genetic divergence of endangered species Bretschneidera sinensis (Bretschneideraceae). Front Ecol Evol. 2022;10:873100.
Article Google Scholar
Bell DA, Robinson ZL, Funk WC, Fitzpatrick SW, Allendorf FW, Tallmon DA, et al. The exciting potential and remaining uncertainties of genetic rescue. Trends Ecol Evol. 2019;34(12):1070–9.
Article PubMed Google Scholar
Onley IR, Moseby KE, Austin JJ. Genomic approaches for conservation Management in Australia under climate change. Life. 2021;11(7):653.
Article CAS PubMed PubMed Central Google Scholar
Daniell H, Lin CS, Yu M, Chang WJ. Chloroplast genomes: diversity, evolution, and applications in genetic engineering. Genome Biol. 2016;17(1):134.
Article PubMed PubMed Central Google Scholar
Dobrogojski J, Adamiec M, Luciński R. The chloroplast genome: a review. Acta Physiol Plant. 2020;42(6):98.
Article CAS Google Scholar
Schwarz EN, Ruhlman TA, Sabir JSM, Hajrah NH, Alharbi NS, Al-Malki AL, et al. Plastid genome sequences of legumes reveal parallel inversions and multiple losses of rps16 in papilionoids. J Syst Evol. 2015;53(5):458–68.
Article Google Scholar
Asaf S, Khan AL, Khan A, Al-Harrasi A. Unraveling the chloroplast genomes of two Prosopis species to identify its genomic information, comparative analyses and phylogenetic relationship. Int J Mol Sci. 2020;21(9):3280.
Article CAS PubMed PubMed Central Google Scholar
Contreras Díaz R, van den Brink L, Navarrete Fuentes MJ, Arias Aburto M, Contreras Díaz R, van den Brink L, et al. Caracterización del genoma completo del cloroplasto de Prosopis tamarugo (Prosopis, Leguminosae), una especie arbórea endémica en peligro del Desierto de Atacama. Bosque Valdivia. 2021;42(3):365–70.
Article Google Scholar
Souza UJB de, Nunes R, Targueta CP, Diniz-Filho JAF, Telles MP de C. The complete chloroplast genome of Stryphnodendron adstringens (Leguminosae - Caesalpinioideae): comparative analysis with related Mimosoid species. Sci Rep. 2019;9(1):14206
Thi NPA, Khang DT, Khoa NT, Le VM, Do HDK. The complete chloroplast genome of Mimosa pigra L. (Fabaceae), a notorious invasive plant Mitochondrial DNA Part B. 2022;7(6):1063–5.
Article PubMed Google Scholar
Asaf S, Khan A, Khan AL, Al-Harrasi A, Al-Rawahi A. Complete chloroplast genomes of Vachellia nilotica and Senegalia senegal: comparative genomics and Phylogenomic placement in a new generic system. PLoS One. 2019;14(11):e0225469.
Article CAS PubMed PubMed Central Google Scholar
Zhang J, Huang H, Qu C, Meng X, Meng F, Yao X, et al. Comprehensive analysis of chloroplast genome of Albizia julibrissin Durazz. (Leguminosae sp.). Planta. 2021;255(1):26.
Article PubMed Google Scholar
Su Q, Liu L, Zhao M, Zhang C, Zhang D, Li Y, et al. The complete chloroplast genomes of seventeen Aegilops tauschii: genome comparative analysis and phylogenetic inference. PeerJ. 2020;8:e8678.
Article PubMed PubMed Central Google Scholar
Liu W, Kong H, Zhou J, Fritsch PW, Hao G, Gong W. Complete chloroplast genome of Cercis chuniana (Fabaceae) with structural and genetic comparison to six species in Caesalpinioideae. Int J Mol Sci. 2018;19(5):1286.
Article PubMed PubMed Central Google Scholar
Hildebrand M, Hallick RB, Passavant CW, Bourque DP. Trans-splicing in chloroplasts: the rps 12 loci of Nicotiana tabacum. Proc Natl Acad Sci U S A. 1988;85(2):372–6.
Article CAS PubMed PubMed Central Google Scholar
Liu S, Wang Z, Wang H, Su Y, Wang T. Patterns and rates of plastid rps12 gene evolution inferred in a phylogenetic context using Plastomic data of ferns. Sci Rep. 2020;10(1):9394.
Article CAS PubMed PubMed Central Google Scholar
Lei W, Ni D, Wang Y, Shao J, Wang X, Yang D, et al. Intraspecific and heteroplasmic variations, gene losses and inversions in the chloroplast genome of Astragalus membranaceus. Sci Rep. 2016;6(1):21669.
Article CAS PubMed PubMed Central Google Scholar
Li C, Zhao Y, Xu Z, Yang G, Peng J, Peng X. Initial characterization of the chloroplast genome of Vicia sepium, an important wild resource plant, and related inferences about its evolution. Front Genet. 2020;11:73.
Article CAS PubMed PubMed Central Google Scholar
Somaratne Y, Guan DL, Wang WQ, Zhao L, Xu SQ. The complete chloroplast genomes of two Lespedeza species: insights into codon usage Bias, RNA editing sites, and phylogenetic relationships in Desmodieae (Fabaceae: Papilionoideae). Plants. 2020;9(1):51.
Article CAS Google Scholar
Millen RS, Olmstead RG, Adams KL, Palmer JD, Lao NT, Heggie L, et al. Many parallel losses of infA from chloroplast DNA during angiosperm evolution with multiple independent transfers to the nucleus. Plant Cell. 2001;13(3):645–58.
Article CAS PubMed PubMed Central Google Scholar
Mu X, Wang P, Du J, Gao YG, Zhang J. The chloroplast genome of Cerasus humilis: genomic characterization and phylogenetic analysis. PLoS One. 2018;13(4):e0196473.
Article PubMed PubMed Central Google Scholar
Wang Y, Zhao B, Lu Z, Shi Y, Li J. The complete chloroplast genome provides insight into the polymorphism and adaptive evolution of Garcinia paucinervis. Biotechnol Biotechnol Equip. 2021;35(1):377–91.
Article CAS Google Scholar
Smith DR. Unparalleled GC content in the plastid DNA of Selaginella. Plant Mol Biol. 2009;71(6):627–39.
Article CAS PubMed Google Scholar
Jiao Y, Guo H. Chapter nine-prehistory of the angiosperms: characterization of the ancient genomes. In: Paterson AH, editor. Advances in botanical research. London UK: Academic Press; 2014. p. 223–45.
Google Scholar
Talat F, Wang K. Comparative bioinformatics analysis of the chloroplast genomes of a wild diploid Gossypium and two cultivated Allotetraploid species. Iran J Biotechnol. 2015;13(3):47–56.
Article PubMed PubMed Central Google Scholar
Guo S, Liao X, Chen S, Liao B, Guo Y, Cheng R, et al. A comparative analysis of the chloroplast genomes of four Polygonum medicinal plants. Front Genet. 2022;13:764534.
Article CAS PubMed PubMed Central Google Scholar
Li FW, Kuo LY, Pryer KM, Rothfels CJ. Genes Translocated into the plastid inverted repeat show decelerated substitution rates and elevated GC content. Genome Biol Evol. 2016;8(8):2452–8.
Article PubMed PubMed Central Google Scholar
Ren W, Jiang Z, Zhang M, Kong L, Zhang H, Liu Y, et al. The chloroplast genome of Salix floderusii and characterization of chloroplast regulatory elements. Front Plant Sci. 2022;13:987443.
Article PubMed PubMed Central Google Scholar
Wu Z, Liao R, Yang T, Dong X, Lan D, Qin R, et al. Analysis of six chloroplast genomes provides insight into the evolution of Chrysosplenium (Saxifragaceae). BMC Genomics. 2020;21(1):621.
Article CAS PubMed PubMed Central Google Scholar
Duan H, Zhang Q, Wang C, Li F, Tian F, Lu Y, et al. Analysis of codon usage patterns of the chloroplast genome in Delphinium grandiflorum L. reveals a preference for AT-ending codons as a result of major selection constraints. PeerJ. 2021;9:e10787.
Article PubMed PubMed Central Google Scholar
Guan DL, Ma LB, Khan MS, Zhang XX, Xu SQ, Xie JY. Analysis of codon usage patterns in Hirudinaria manillensis reveals a preference for GC-ending codons caused by dominant selection constraints. BMC Genomics. 2018;19(1):542.
Article PubMed PubMed Central Google Scholar
Das G, Ghosh I. Benchmarking tools for DNA repeat identification in diverse genomes. bioRxiv. 2021; https://doi.org/10.1101/2021.09.10.459798.
Shukla N, Kuntal H, Shanker A, Sharma SN. Mining and analysis of simple sequence repeats in the chloroplast genomes of genus Vigna. Biotechnol Res Innov. 2018;2(1):9–18.
Article Google Scholar
Asaf S, Khan AL, Khan MA, Shahzad R, Lubna KSM, et al. Complete chloroplast genome sequence and comparative analysis of loblolly pine (Pinus taeda L.) with related species. PLoS One. 2018;13(3):e0192966.
Article PubMed PubMed Central Google Scholar
Ping J, Feng P, Li J, Zhang R, Su Y, Wang T. Molecular evolution and SSRs analysis based on the chloroplast genome of Callitropsis funebris. Ecol Evol. 2021;11(9):4786–802.
Article PubMed PubMed Central Google Scholar
Vieira LD, Faoro H, Rogalski M, Fraga HP, Cardoso RL, de Souza EM, et al. The complete chloroplast genome sequence of Podocarpus lambertii: genome structure, evolutionary aspects, gene content and SSR detection. PLoS One. 2014;9(3):e90618.
Article PubMed PubMed Central Google Scholar
Xu J, Shen X, Liao B, Xu J, Hou D. Comparing and phylogenetic analysis chloroplast genome of three Achyranthes species. Sci Rep. 2020;10(1):10818.
Article CAS PubMed PubMed Central Google Scholar
Namgung J, Do HDK, Kim C, Choi HJ, Kim JH. Complete chloroplast genomes shed light on phylogenetic relationships, divergence time, and biogeography of Allioideae (Amaryllidaceae). Sci Rep. 2021;11(1):3262.
Article CAS PubMed PubMed Central Google Scholar
Rose JP, Toledo CAP, Lemmon EM, Lemmon AR, Sytsma KJ. Out of sight, out of mind: widespread Nuclear and plastid-Nuclear discordance in the flowering plant genus Polemonium (Polemoniaceae) suggests widespread historical gene flow despite limited Nuclear signal. Syst Biol. 2021;70(1):162–80.
Article CAS PubMed Google Scholar
Terra V, Ringelberg JJ, Maslin B, Koenen EJM, Ebinger J, Seigler D, et al. Dilemmas in generic delimitation of Senegalia and allies (Caesalpinioideae, mimosoid clade): how to reconcile phylogenomic evidence with morphology and taxonomy? Phyto Keys. 2022;205:261–78.
PubMed Central Google Scholar
Ringelberg JJ, Koenen EJM, Iganci JR, de Queiroz LP, Murphy DJ, Gaudeul M, et al. Phylogenomic analysis of 997 nuclear genes reveals the need for extensive generic re-delimitation in Caesalpinioideae (Leguminosae). Phyto Keys. 2022;205:3–58.
PubMed PubMed Central Google Scholar
Pasiecznik N, Harris P, Smith S. Identifying tropical Prosopis species: a field guide. Coventry UK: International Research Department; 2004.
Google Scholar
El-ahmir S hoob, Mahklouf M, Azzu Y. The first record of a medicinal plant species Prosopis farcta (banks & sol). J.F. Macbr (Fabaceae; Mimosoideae) from Libya. 2021;8:39–43.
Baibout M, Corcket E, Kothari SL, Fievet V. Ecosystem services provided by Prosopis cineraria (L.) Druce in the drylands of southern and Western Asia. Bot Lett. 2022;169(1):30–42.
Article Google Scholar
Soltis DE, Kuzoff RK. Discordance between Nuclear and chloroplast phylogenies in the Heuchera group (saxifragaceae). Evolution. 1995;49(4):727–42.
Article PubMed Google Scholar
Van Raamsdonk L, Smiech MP, Sandbrink JM. Introgression explains incongruence between nuclear and chloroplast DNA-based phylogenies in Allium section Cepa. Bot J Linn Soc. 1997;123(2):91–108.
Article Google Scholar
Roelofs D, Bachmann K. Comparison of chloroplast and nuclear phylogeny in the autogamous annualMicroseris douglasii (Asteraceae: Lactuceae). Plant Syst Evol. 1997;204(1):49–63.
Article Google Scholar
Nishimoto Y, Ohnishi O, Hasegawa M. Topological incongruence between nuclear and chloroplast DNA trees suggesting hybridization in the urophyllum group of the genus Fagopyrum (Polygonaceae). Genes Genet Syst. 2003;78(2):139–53.
Article CAS PubMed Google Scholar
Renoult JP, Kjellberg F, Grout C, Santoni S, Khadari B. Cyto-nuclear discordance in the phylogeny of Ficus section Galoglychia and host shifts in plant-pollinator associations. BMC Evol Biol. 2009;9(1):248.
Article PubMed PubMed Central Google Scholar
Yu WB, Huang PH, Li DZ, Wang H. Incongruence between Nuclear and chloroplast DNA phylogenies in Pedicularis section Cyathophora (Orobanchaceae). PLoS One. 2013;8(9):e74828.
Article CAS PubMed PubMed Central Google Scholar
Vega MV, Hernández P. Molecular evidence for natural interspecific hybridization in Prosopis. Agrofor Syst. 2005;64(3):197–202.
Article Google Scholar
Fontana ML, Pérez VR, Luna CV. Características evolutivas en Prosopis spp.: citogenética, genética e hibridaciones. Rodriguésia. 2018;69(2):409–21.
Article Google Scholar
Nguyen HQ, Nguyen TNL, Doan TN, Nguyen TTN, Phạm MH, Le TL, et al. Complete chloroplast genome of novel Adinandra megaphylla Hu species: molecular structure, comparative and phylogenetic analysis. Sci Rep. 2021;11(1):11731.
Article CAS PubMed PubMed Central Google Scholar
Yang L, Abduraimov O, Tojibaev K, Shomurodov K, Zhang YM, Li WJ. Analysis of complete chloroplast genome sequences and insight into the phylogenetic relationships of Ferula L. BMC Genomics. 2022;23(1):643.
Article CAS PubMed PubMed Central Google Scholar
Fan ZF, Ma CL. Comparative chloroplast genome and phylogenetic analyses of Chinese Polyspora. Sci Rep. 2022;12(1):15984.
Article CAS PubMed PubMed Central Google Scholar
Turudić A, Liber Z, Grdiša M, Jakše J, Varga F, Šatović Z. Chloroplast genome annotation tools: prolegomena to the identification of inverted repeats. Int J Mol Sci. 2022;23(18):10804.
Article PubMed PubMed Central Google Scholar
Palmer JD, Thompson WF. Chloroplast DNA rearrangements are more frequent when a large inverted repeat sequence is lost. Cell. 1982;29(2):537–50.
Article CAS PubMed Google Scholar
Sloan DB, Alverson AJ, Wu M, Palmer JD, Taylor DR. Recent acceleration of plastid sequence and structural evolution coincides with extreme mitochondrial divergence in the angiosperm genus Silene. Genome Biol Evol. 2012;4(3):294–306.
Article PubMed PubMed Central Google Scholar
Turmel M, Otis C, Lemieux C. Divergent copies of the large inverted repeat in the chloroplast genomes of ulvophycean green algae. Sci Rep. 2017;7(1):994.
Article PubMed PubMed Central Google Scholar
Williams AM, Friso G, van Wijk KJ, Sloan DB. Extreme variation in rates of evolution in the plastid Clp protease complex. Plant J. 2019;98(2):243–59.
Article CAS PubMed Google Scholar
Dong W, Xu C, Li C, Sun J, Zuo Y, Shi S, et al. ycf1, the most promising plastid DNA barcode of land plants. Sci Rep. 2015;5(1):8348.
Article CAS PubMed PubMed Central Google Scholar
Amar MH. ycf1-ndhF genes, the most promising plastid genomic barcode, sheds light on phylogeny at low taxonomic levels in Prunus persica. J Genet Eng Biotechnol. 2020;18(1):42.
Article PubMed PubMed Central Google Scholar
Asaf S, Waqas M, Khan AL, Khan MA, Kang SM, Imran QM, et al. The complete chloroplast genome of wild Rice (Oryza minuta) and its comparison to related species. Front Plant Sci. 2017;8:304.
Article PubMed PubMed Central Google Scholar
Zhang F, Wang T, Shu X, Wang N, Zhuang W, Wang Z. Complete Chloroplast Genomes and Comparative Analyses of L. chinensis, L. anhuiensis, and L. aurea (Amaryllidaceae). Int J Mol Sci. 2020;21(16):5729.
Article CAS PubMed PubMed Central Google Scholar
Zhang XF, Landis JB, Wang HX, Zhu ZX, Wang HF. Comparative analysis of chloroplast genome structure and molecular dating in Myrtales. BMC Plant Biol. 2021;21(1):219.
Article CAS PubMed PubMed Central Google Scholar
Doyle J. DNA protocols for plants. In: Hewitt GM, Johnston AWB, Young JPW, editors. Molecular Techniques in Taxonomy. Berlin Alemania: Springer; 1991. p. 283–93.
Chapter Google Scholar
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinform. 2014;30(15):2114–20.
Article CAS Google Scholar
Jin JJ, Yu WB, Yang JB, Song Y, de Pamphilis CW, Yi TS, et al. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 2020;21(1):241.
Article PubMed PubMed Central Google Scholar
Langmead B, Salzberg SL. Fast gapped-read alignment with bowtie 2. Nat Methods. 2012;9(4):357–9.
Article CAS PubMed PubMed Central Google Scholar
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: architecture and applications. BMC Bioinform. 2009;10(1):421.
Article Google Scholar
Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19(5):455–77.
Article CAS PubMed PubMed Central Google Scholar
Wick RR, Schultz MB, Zobel J, Holt KE. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics. 2015;31(20):3350–2.
Article CAS PubMed PubMed Central Google Scholar
Tillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, et al. GeSeq – versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 2017;45(W1):W6–11.
Article CAS PubMed PubMed Central Google Scholar
Chan PP, Lowe TM. tRNAscan-SE: searching for tRNA genes in genomic sequences. Methods Mol Biol Clifton NJ. 2019;1962:1–14.
Article CAS Google Scholar
Xia X. DAMBE5: a comprehensive software package for data analysis in molecular biology and evolution. Mol Biol Evol. 2013;30(7):1720–8.
Article CAS PubMed PubMed Central Google Scholar
Mayer C. Phobos: highly accurate search for perfect and imperfect tandem repeats in complete genomes by Christoph Mayer. Ruhr Universität Bochum. 2007; https://www.ruhr-uni-bochum.de/spezzoo/cm/cm_phobos.htm. Accessed 15 Oct 2022
Michael D, Gurusaran M, Santhosh R, Hussain MK, Satheesh SN, Suhan S, et al. RepEx: a web server to extract sequence repeats from protein and DNA sequences. Comput Biol Chem. 2019;78:424–30.
Article CAS PubMed Google Scholar
Alamro H, Alzamel M, Iliopoulos CS, Pissis SP, Watts S. IUPACpal: efficient identification of inverted repeats in IUPAC-encoded DNA sequences. BMC Bioinform. 2021;22(1):51.
Article CAS Google Scholar
Kurtz S. The Vmatch large scale sequence analysis software a manual. University of Hamburg; 2010. Available from: http://www.vmatch.de/virtman.pdf
Google Scholar
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80.
Article CAS PubMed PubMed Central Google Scholar
Bouckaert R, Heled J, Kühnert D, Vaughan T, Wu CH, Xie D, et al. BEAST 2: a software platform for Bayesian evolutionary analysis. PLoS Comput Biol. 2014;10(4):e1003537.
Article PubMed PubMed Central Google Scholar
Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–3.
Article CAS PubMed PubMed Central Google Scholar
Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nat Methods. 2012;9(8):772.
Article CAS PubMed PubMed Central Google Scholar
Frazer KA, Pachter L, Poliakov A, Rubin EM, Dubchak I. VISTA: computational tools for comparative genomics. Nucleic Acids Res. 2004;32:W273–9.
Article CAS PubMed PubMed Central Google Scholar
Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22(22):4673–80.
Article CAS PubMed PubMed Central Google Scholar
Hall TA. BioEdit: a user-friendly biological sequence alignment editor and analysis program for windows 95/98/NT. Nucleic Acids Symp Ser. 1999;41:95–8.
CAS Google Scholar
Kumar S, Stecher G, Li M, Knyaz C, Tamura K, MEGA X: Molecular evolutionary genetics analysis across computing platforms. Battistuzzi FU, editor. Mol Biol Evol. 2018;35(6):1547–1549.

Download references

Acknowledgements

We would like to thank the chief of the Santuario Histórico Bosque de Pómac (SHBP), Eng. Sirley Bernabé and the SHBP rangers, in particular William Zeña for helping us with transportation, sample collection and his extensive knowledge of the forest. We would also like to thank Edgar Neyra and Marinoli Rivas for their great help.

Funding

The work was funded financed by the Research and Graduate Studies Office of the UNMSM within the framework of the research project: “Sequencing of the genome of the Algarrobo tree (Prosopis pallida) and identification of genes for resistance to biotic factors using bioinformatics tools” through the Research Projects Program for Research Groups [RECTORAL RESOLUTION No. 01686-R-20, Code: B20100311].

Author information

Authors and Affiliations

Laboratory of Genomics and Bioinformatics for Biodiversity, Faculty of Biological Sciences, Universidad Nacional Mayor de San Marcos, 15081, Lima, Peru
Esteban Caycho, Renato La Torre & Gisella Orjeda

Authors

Esteban Caycho
View author publications
You can also search for this author in PubMed Google Scholar
Renato La Torre
View author publications
You can also search for this author in PubMed Google Scholar
Gisella Orjeda
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

G.O. conceived the study and acquired the funding. E.C. and R.L. performed the DNA extraction and the bioinformatic analysis. All authors participated in the manuscript writing and approved its final version.

Corresponding author

Correspondence to Gisella Orjeda.

Ethics declarations

Ethics approval and consent to participate

The authors confirm that all methods comply with local and national regulations. The authors have collected plant material under the research permit N° 003–2020-SERNANP-JEF and the certificate of origin for biological samples N° 001–2021-SERNANP-SHBP from the “Servicio Nacional de Áreas Naturales Protegidas por el Estado” (SERNANP).

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Additional file 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Caycho, E., La Torre, R. & Orjeda, G. Assembly, annotation and analysis of the chloroplast genome of the Algarrobo tree Neltuma pallida (subfamily: Caesalpinioideae). BMC Plant Biol 23, 570 (2023). https://doi.org/10.1186/s12870-023-04581-5

Download citation

Received: 15 July 2023
Accepted: 03 November 2023
Published: 16 November 2023
DOI: https://doi.org/10.1186/s12870-023-04581-5

Assembly, annotation and analysis of the chloroplast genome of the Algarrobo tree Neltuma pallida (subfamily: Caesalpinioideae)

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Comprehensive analysis of complete chloroplast genome and phylogenetic aspects of ten Ficus species

Comparative genomics of 11 complete chloroplast genomes of Senecioneae (Asteraceae) species: DNA barcodes and phylogenetics

Comparative analysis of complete Artemisia subgenus Seriphidium (Asteraceae: Anthemideae) chloroplast genomes: insights into structural divergence and phylogenetic relationships

Background

Results

Assembly and annotation of the chloroplast genome of Neltuma pallida

Codon usage analysis

Identification of repetitive sequences

Phylogenomic relationships of Neltuma pallida

Sequence divergence analysis

Genetic distance of coding sequences analysis

Boundary between regions

Discussion

Conclusions

Methods

Plant material

DNA extraction and sequencing

De novo assembly of Neltuma pallida chloroplast genome

Genome annotation of Neltuma pallida chloroplast genome

Codon usage of protein-coding sequences

Identification of repetitive sequences in the chloroplast genome

Phylogenomic relationships of Neltuma pallida

Sequence divergence of chloroplast genomes

Genetic distance analysis of coding sequences

Boundary shift analysis between chloroplast genome regions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Supplementary Information

Additional file 1.

Additional file 2.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation