Comparative genomics of 11 complete chloroplast genomes of Senecioneae (Asteraceae) species: DNA barcodes and phylogenetics
- 759 Downloads
Majority of the species within Senecioneae are classified in Senecio, making it the tribe’s largest genus. Certain intergeneric relationships within the tribe are vaguely defined, with the genus Senecio being partly linked to this ambiguity. Infrageneric relationships within Senecio remain largely unknown and consequently, the genus has undergone continuous expansion and contraction over the recent past due to addition and removal of taxa. Dendrosenecio, an endemic genus in Africa, is one of its segregate genera. To heighten the understanding of species divergence and phylogeny within the tribe, the complete chloroplast genomes of the first five Senecio and six Dendrosenecio species were sequenced and analyzed in this study.
The entire length of the complete chloroplast genomes was ~ 150 kb and ~ 151 kb in Dendrosenecio and Senecio respectively. Characterization of the 11 chloroplast genomes revealed a significant degree of similarity particularly in their organization, gene content, repetitive sequence composition and patterns of codon usage. The chloroplast genomes encoded an equal number of unique genes out of which 80 were protein-coding genes, 30 transfer ribonucleic acid, and four ribosomal ribonucleic acid genes. Based on comparative sequence analyses, the level of divergence was lower in Dendrosenecio. A total of 331 and 340 microsatellites were detected in Senecio and Dendrosenecio, respectively. Out of which, 25 and five chloroplast microsatellites (cpSSR) were identified as potentially valuable molecular markers. Also, through whole chloroplast genome comparisons and DNA polymorphism tests, ten divergent hotspots were identified. Potential primers were designed creating genomic tools to further molecular studies within the tribe. Intergeneric relationships within the tribe were firmly resolved using genome-scale dataset in partitioned and unpartitioned schemes. Two main clades, corresponding to two subtribes within the Senecioneae, were formed with the genus Ligularia forming a single clade while the other had Dendrosenecio, Pericallis, Senecio and Jacobaea. A sister relationship was revealed between Dendrosenecio and Pericallis whereas Senecio, and Jacobaea were closely placed in a different clade.
Besides emphasizing on the potential of chloroplast genome data in resolving intergeneric relationships within Senecioneae, this study provides genomic resources to facilitate species identification and phylogenetic reconstructions within the respective genera.
KeywordsSenecio Dendrosenecio Endemic Chloroplast genome Codon usage Microsatellites DNA barcodes Phylogenomics
internal transcribed spacer
small single copy
large single copy
simple sequence repeat
relative synonymous codon usage
codon usage bias
effective number of codons
Senecioneae, the largest tribe in the family Asteraceae, has over 160 genera with more than 3000 species, and new genera continue to be added (Chen et al. 2011; Nordenstam et al. 2009). The tribe is prominent for its size, and rich morphological and ecological diversity. It is mostly dominated by annual and perennial herbs, while the rest constitute shrubs, vines, trees, and epiphytes. It has a near cosmopolitan distribution, with southern Africa being one of its key diversity hotspot zones (Pelser et al. 2007). Majority of the species in the tribe are placed in Senecio L., making it one of the largest genera of angiosperms, with over 1250 species (Nordenstam et al. 2009). Senecio is characteristically diverse in morphology, life-history, growth forms, and thus, it has been markedly linked to the incongruous phylogenetic relationships within the tribe (Pelser et al. 2007). Its members are generally distinguished by style branches truncate with short sweeping hairs, separated stigmatic lines and sometimes with a median hair pencil, and with ecaudate anther bases and balusterform filament collar (Nordenstam 2007; Pelser et al. 2007).
Over the years, the genus has been under constant re-evaluation and reclassification, and until now, comprehensive infrageneric relationships are yet to be established. Consequently, numerous species have in the past been segregated as new genera mostly based on morphological, anatomical, and chromosomal variations (Jeffrey and Chen 1984; Jeffrey et al. 1977). One of such segregate genera is Dendrosenecio (Hauman ex Hedb.) B. Nord., upgraded by Nordenstam (1978) to constitute the Afromontane pachycaul taxa. Dendrosenecio was initially classified in Senecio based on the striking similarities in floral characters. It is therefore not surprising that the elevation of Dendrosenecio was at first controversial (Jeffrey et al. 1977) as the genus exhibited substantial morphological resemblances to other African perennials of Senecio. Despite these remarkable morphological similarities, amplified fragment length polymorphism analysis revealed considerable divergence between Senecio and Dendrosenecio (Knox and Palmer 1995b). Afterwards, internal transcribed spacer (ITS) data identified Oresbia Cron & B. Nord. as the closely related genus to Dendrosenecio (Pelser et al. 2007).
Majority of the segregated groups are now accepted on the basis of molecular data obtained from markers such as ITS (Pelser et al. 2007). However, it is evident that more valuable diagnostic molecular sequences are needed to further understand the generic and intergeneric relationships in Senecioneae. The large number of species, considerable variation in species life-history and over-dependence on morphological characters, the majority of which overlap, have been pointed out as the causes of the systematic conflict observed within Senecio. Similar to Senecio, infrageneric relationships within Dendrosenecio are still debatable, especially in relation to specific and subspecific classifications. Species of Dendrosenecio exhibit ‘mosaic of morphological variation’ arising from divergence and convergence as they dispersed to various geographical regions with similar habitat conditions (Knox 2005; Mabberley 1973). Besides, frequent hybridization events between species within each genus have been evidenced resulting in allopolyploid species (Hedberg 1957; Hegarty et al. 2012; Milton 2009). It is therefore imperative that more molecular markers and divergent regions are identified to facilitate species identification, speciation and adaptive evolution studies on species of Senecio and Dendrosenecio.
Partial plastid markers, species-specific or universal, have in the past decades been used to resolve phylogenetic relationships and species delimitations. This inclination is progressively being substituted by the use of plastid genome-scale data, resulting in improved phylogenetic resolutions and detailed evolutionary information about species at all taxonomic levels. Typically chloroplast DNA is uniparentally, maternally in angiosperms and paternally in gymnosperms, inherited and exhibits homologous recombination (Marechal and Brisson 2010). This attribute can greatly benefit studies on taxa that are affected by hybridization, introgression and convergent evolution. Additionally, chloroplast genomes are justifiably conserved in terms of gene composition and arrangement permitting comparative genomics even at the generic level. However, they harbour key variations e.g., in the inverted repeat (IR) size and positioning of the IR junctions even among close relatives (Downie and Jansen 2015), and in specific lineages, massive rearrangements, gene duplications, loss or gain have been observed e.g. in Campanulaceae (Knox 2014). These variations provide sufficient unique attributes to reconstruct phylogenetic relationships with strong statistical support, and to investigate the origin and evolutionary patterns of plastids (Pouchon et al. 2018; Tonti-Filippini et al. 2017) through comparative genomics.
To date, only eight chloroplast genomes have been sequenced and reported from three genera in the Senecioneae tribe (Doorduin et al. 2011; Lee et al. 2016; Wang et al. 2019). In this study, the first five and six chloroplast genomes in Senecio and Dendrosenecio respectively were sequenced and analysed. The objectives were: to generate, characterize and analyse the complete chloroplast genomes of 11 species of Senecioneae; to identify highly variable regions that could be of phylogenetic utility within the tribe through comparative analyses and; to investigate the potential of chloroplast phylogenomics in resolving phylogenetic relationships among the species of Senecioneae, with key interest on the phylogenesis of Senecio and Dendrosenecio.
Plant material and genome sequencing
Characteristics of complete chloroplast genomes of 14 species of the tribe Senecioneae (Asteraceae)
General genome characteristic
Herbarium accession numbers
GenBank accession number
Genome size (bp) [GC%]
LSC length (bp)
SSC length (bp)
IR length (bp)
PCGs (duplicated in the IR)
Trnas (duplicated in the IR)
Rrna (duplicated in the IR)
Dendrosenecio johnstonii (Oliv.) B. Nord.
D. meruensis (Cotton & Blakelock) E.B. Knox
D. elgonensis subsp. elgonensis (T.C.E.Fr.) E.B. Knox
D. keniodendron (R.E. Fr. & T.C.E.Fr) B. Nord.
D. battiscombei (R.E. Fr. & T.C.E.Fr) E.B. Knox
D. brassiciformis (R.E.Fr. & T.C.E.Fr) Mabb.
Senecio moorei R.E.Fr.
S. keniophytum R.E.Fr.
S. purtschelleri Engl.
S. schweinfurthii O.Hoffm.
S. roseiflorus R.E.Fr.
Jacobea vulgaris Gaertn.
Ligularia fischeri (Ladeb.) Turcz.
Pericallis hybrida (Regel) B.Nord
Genome assembly and annotation
The raw data were filtered and trimmed using Fastp software using the default settings (Chen et al. 2018); all low-quality reads were discarded. The de novo assembly of the filtered reads, into complete chloroplast genomes, was performed using NOVOPlasty (Dierckxsens et al. 2017) with default seed and K-mer = 31–39. The contigs were then mapped to the chloroplast genomes of Jacobaea vulgaris Gaertn. (NC_015543; Doorduin et al. 2011) and Pericallis hybrida (Regel) B. Nord. (NC_031898; Wang et al. 2019) using Geneious Prime 2019 (Biomatters Ltd., Auckland, New Zealand; https://www.geneious.com). Basic local alignment search tool ver. 2.2.18+ (Camacho et al. 2009) was used to ascertain the positions of the single copies and the inverted repeat regions by self-blasting the assembled sequences.
GeSeq (Tillich et al. 2017) was used to annotate each of the chloroplast genomes using the complete chloroplast genome sequences of Jacobaea vulgaris and Pericallis hybrida as references. Where necessary, manual corrections were performed in Geneious Prime 2019 (Biomatters Ltd., Auckland, New Zealand), to rectify the start and stop codons of the protein-coding genes (PCGs), based on the annotations of J. vulgaris and P. hybrida. A circular genome map for each species was generated using OGDraw v1.2 (Lohse et al. 2007). All annotated genome sequences were submitted to the GenBank under the accession numbers listed in Table 1.
Codon usage and microsatellite repeats identification
The level of codon usage bias was determined by analysing the Relative Synonymous Codon Usage (RSCU; Sharp and Li 1987), Effective Number of codon (ENc; Wright 1990) and the Codon Biased Index (CBI; Morton 1993) for all PCGs, in DnaSP 6.10 (Rozas et al. 2017). The frequency of amino acid was also considered. The MicroSAtellite Identification tool Perl Script (MiSa; Thiel et al. 2003), was used to mine for SSRs with the parameters set at 10 for mononucleotides, 5 for dinucleotides, 4 for trinucleotides and 3 for tetra-, penta- and hexa-nucleotides.
Genome comparative analyses and divergence hotspot identification
The available chloroplast genomes of Asteraceae species have been shown to harbour no major differences in their sizes, gene content and arrangement. The whole genome size, GC percentage, LSC, SSC, IR and number of gene in each of the 11 species, were therefore compared to three other species of Senecioneae. Preliminary comparative analyses among the species within each genus revealed highly conserved sequences with > 99% pairwise identity and > 98% identical sites. Consequently, one chloroplast genome sequence was randomly picked per genus to conduct further comparative studies against other chloroplast genomes within the tribe. The expansion/contraction of the IR regions was assessed by comparing the positions of SC/IR junctions and their adjacent genes using IRscope (Amiryousefi et al. 2018).
Further, to outline any significant sequence divergence spots and genome rearrangements, the chloroplast genomes were aligned and plotted in MAUVE (Darling et al. 2004), with Nicotiana tabacum L. (NC_001879; Shinozaki et al. 1986) being used as an external reference genome. Nucleotide diversity (Pi) in the non-coding regions (> 200 bp) of the five species of Senecioneae was analysed in DnaSP v.6.10 (Rozas et al. 2017). Potential primers for ten sites with the highest Pi values were designed using Primer3 (Untergasser et al. 2012) using default settings.
A total of 75 species, representing 49 genera from 11 tribes of Asteraceae, were downloaded from the NCBI (Additional file 1: Table S1) for phylogenetic analyses. Besides, data for Adenophora divaricata Franch. & Sav. and A. stricta Miq. (Cheon et al. 2017) were downloaded and used as outgroups in this analysis. Sequences of 70 common PCGs were extracted from the 77 species. Each gene was separately aligned using MUSCLE (Edgar 2004) and then concatenated in Geneious Prime 2019 (Biomatters Ltd., Auckland, New Zealand). Phylogenetic reconstructions were carried out using Maximum Likelihood (ML) and Bayesian Inference (BI) methods. Each method was used twice in independent analyses based on unpartitioned and partitioned data. Before the ML analysis using unpartitioned data, the best-fit DNA substitution model was determined using ModelFinder (Kalyaanamoorthy et al. 2017) as implemented in IQ-TREE version 1.5.4 (Nguyen et al. 2015). Maximum Likelihood (ML) analysis was conducted using IQ-TREE with a bootstrap analysis of 5000 replications under the GTR + F + R6 nucleotide substitution model. MrBayes v3.2.6 (Ronquist et al. 2012) was used to implement the BI analyses based on the unpartitioned data set, using four independent Markov Chain Monte Carlo runs with three heated and one cold chain. The chains were run for 2 × 106 generations with sampling from the cold chain run after every 103 generations. The analysis was stopped after the average standard deviation of split frequencies as calculated by Mr. Bayes was below 0.01, an indication that convergence had been attained. The first 25% of all generations were excluded, and a consensus phylogenetic tree was obtained based on majority rule from the remaining trees. Branch support was indicated by posterior probability (PP) values. The data set was then partitioned by categorizing the nucleotides in each gene based on the position (first, second, or third) they occupy in a codon. The best partitioning scheme and substitution models were calculated using PartitionFinder2 (Lanfear et al. 2017). The generated phylogenetic trees were visualised and formatted in Interactive tree of life (iTOL) v3 (Letunic and Bork 2016).
Chloroplast genome organization and content
List of genes identified in the studied chloroplast genomes of 11 species of Senecioneae
trnA-UGC*, trnfM-CAU, trnI-GAU*, trnM-CAU, trnR-ACG, trnS-UGA, trnC-GCA, trnG-GC, trnK-UUU*, trnN-GUU, trnW-CCA, trnT-GGU, trnD-GUC, trnV-UAC*, trnL-CAA, trnY-GUA, trnR-UCU, trnT-UGU, trnE-UUC, trnH-GUG, trnL-UAA*, trnP-UGG, trnS-GCU, trnV-GAC, trnF-GAA, trnI-CAU, trnL-UAG, trnQ-UUG, trnS-GGA, trnG-UCC*
Small ribosomal units
Large ribosomal units
RNA polymerase sub-units
rpoA, rpoB, rpoC1*, rpoC2
Translation initiation facto
ndhA*, ndhB*, ndhC, ndhDb, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK
psaA, psaB, psaC, psaI, psaJ, ycf3**, ycf4
Cytochrome b/f complex
petA, petB*, petD*, petG, petL, petN
atpA, atpB, atpE, atpF*, atpH, atpI
Large subunit of rubisco
Envelope membrane protein
Component of TIC complex
c-Type cytochrome synthesis
Hypothetical genes reading frames
rrn4.5, rrn5, rrn16, rrn23
Codon usage and microsatellite sequences
Details of 25 potentially polymorphic microsatellite repeats in five species of Senecio
Details of potentially polymorphic microsatellite repeats in six species of Dendrosenecio
D. elgonensis subsp. elgonensis
Genome comparative analyses
Details of ten primers that target the most divergent regions in the chloroplast genomes of Senecioneae species
Expected product size
Genome structure and content
It is typical of higher plants to possess a chloroplast genome that has a quadripartite structure and a relatively well-conserved gene content and arrangement (Ravi et al. 2008; Wicke et al. 2011; Yurina et al. 2017). However, minor to major variability in chloroplast genome structure has been observed in specific plant lineages (Guisinger et al. 2011). The current upsurge in the amount of chloroplast genome-scale data has played a crucial role in the enhancement of knowledge on the evolution and organization of chloroplast genomes. Both Senecio and Dendrosenecio are conspicuous genera in the tribe Senecioneae, the former for being the largest genus in the tribe and the latter for exhibiting an atypical growth habit. In this study, the first complete chloroplast genome sequences of five Senecio and six Dendrosenecio species were generated. The genome structures were comparable to those of higher plants each exhibiting four compartments; a pair of inverted repeats and a pair of single copies of unequal length. The overall chloroplast genome size of between 150 and 151 kb, was comparable to the majority of chloroplast genomes of species within the Asteraceae family (Salih et al. 2017). The GC content of species within each of the two genera was identical 37.2% in Senecio and 37.5% in Dendrosenecio (Table 1, Fig. 1). Usually, chloroplast genomes exhibit a high AT-GC content ratio (Ravi et al. 2008), which is a crucial factor in genome organization and stability (Niu et al. 2017).
Chloroplasts have undergone enormous changes since they evolved from cyanobacteria in over a billion years ago (Timmis et al. 2004). One of the notable transformations is the reduction in size, which is primarily accredited to considerable gene transfer from the chloroplast genome to the nuclear genome (Bock and Timmis 2008; Martin et al. 1998). Generally, the chloroplast genomes of higher plants currently encode 70–90 protein-coding genes, which is approximately 2% of the total PCGs found in cyanobacterium Synechocystis (Eckardt 2006). The number of unique genes encoded in the available chloroplast genomes of Asteraceae varies slightly between 110 and 115 (Dempewolf et al. 2010; Doorduin et al. 2011; Kumar et al. 2009; Lu et al. 2016; Walker et al. 2014). The 11 species reported here encoded an equal number of 114 unique genes, with 80 genes encoding for protein, similar to the chloroplast genomes of Taraxacum F.H. Wigg. (Asteraceae; Salih et al. 2017). The exon of some specific genes has been subjected to interruptions by either a single or several introns. In such cases, the entire sequence, containing both the exon and intron(s), is transcribed into a forerunner RNA and later the introns are detached for accurate production of a proper transcript (Eckardt 2007; Plant and Gray 1988). The rps12 gene is distinctively placed, with its 5′-end being positioned in the LSC region and the 3′-end is duplicated in the IR section, in a way similar to that of most other angiosperms including Nicotiana tabacum (Hildebrand et al. 1988). Seventeen genes, in each of the studied species, had either a single or several introns (Table 2). Excluding rps12 gene whose intron is exceptionally large, trnK-UUU had the largest single intron. Two genes, ycf3 and clpP, had two introns within their exons this arrangement has also been observed in other Asteraceae species including Artemisia annua L. (Shen et al. 2017). However, in Ageratina adenophora (Spreng.) R.M. King. & H. Rob., the rpoC1 gene has two introns (Nie et al. 2012), which is considered rare among other Asteraceae species.
Codon usage and repetitive sequences
Relative synonymous codon usage values of less than one indicate that the codons are less frequent, > 1 represents those that are more frequently used, whereas = 1 shows no bias (Uddin 2017). An identical trend in the manner in which the amino acids were encoded was discovered among the 11 species (Fig. 2, Additional file 2: Table S2). The usage frequency of leucine was higher than the rest, while cysteine had the least frequency which is congruent to most Asteraceae species e.g., (Salih et al. 2017; Shen et al. 2017). It was also observed that usage of synonymous codons was generally biased in favour of those ending with A/U bases. Consequently, some codons were over- (> 1.6) or under- (< 0.6) represented. In particular, only trnL-UAA, trnS-UCU, trnT-ACU, trnA-GCU, trnY-UAU, trnR-AGA, and stop codon UAA were over-represented, whereas the majority were under-represented (Additional file 2: Table S2). Methionine (AUG) and tryptophan (UGG) were uniformly used (RSCU = 1). Other indices of non-uniformity in codon usage include the Effective Number of codons (ENc), which ranges from 20 (one codon per amino acid) to 61 (equal use of synonymous codons; Wright 1990) and Codon Bias Index (CBI) which ranges from 0, no bias to 1 equal usage of all the synonymous codon (Morton 1993). The values for both ENc (49.76 to 51.49) and CBI (0.308 to 0.356) were insignificantly different among the species of Dendrosenecio, but similar to most species in Asteraceae (Nie et al. 2014). The common start codon for the protein coding genes is AUG (M) however, three genes, psbL, rps19, and ndhD deviated from the norm, and they had ACG, GUG, and GUG respectively.
Microsatellite repeats are abundantly distributed in the genome (Tautz and Renz 1984), and they display a high level of polymorphism, placing them among the most preferred genetic markers for genetic investigations. A total of 340 and 331 SSRs were discovered in six species of Dendrosenecio and five Senecio species. The majority were mononucleotides, followed by dinucleotides and tetranucleotides (Additional file 3: Table S3). Mononucleotides, usually A/T repeat types, are abundantly present in chloroplast genomes of Asteraceae species, e.g., Jacobaea vulgaris (Doorduin et al. 2011) Artemisia annua (Shen et al. 2017) and other families e.g., Paeonia ostii T. Hong & Z. X. Zhang (Paeoniaceae; Guo et al. 2018). In Dendrosenecio, a majority of the SSRs were located on homologous regions and except for five, the rest lacked any variations in terms of length and motif. On the contrary, 25 microsatellites in Senecio exhibited slight variations based on the same criteria. Being considered polymorphic, the identified microsatellites are therefore potential molecular markers for use in further studies within the respective genus.
Chloroplast genome comparison
There were no remarkable structural rearrangements among the taxa of these two genera. The chloroplast genomes are highly conserved with an identical structure and an equal number of genes, an indication that this could be the case in chloroplast genomes of most species of these genera. Comparative analyses against representatives of three other genera of Senecioneae revealed a similar trend of the well-preserved structure and organization. The inverted repeat region is present in a majority of angiosperms chloroplast genomes. Initially the IR was reported to serve as a whole-genome stabilizer by reducing recombination between the two SC regions; however, these reports lacked support as more chloroplast genomes revealed significant rearrangements even with both copies of the IR present (Jansen and Ruhlman 2012). Comparative analyses between plants of different plant lineages revealed that inverted repeats could contract/expand up to a few hundred base pairs even among closely related species (Goulding et al. 1996). In this study, the comparison of the IR/SC junctions showed a slight expansion of the IR in all except in Senecio moorei and Pericallis hybrida. In the novel chloroplast genomes, the same genes were found adjacent to the junctions, and only slight length variations were recorded in P. hybrida and S. moorei. In P. hybrida the LSC/IRb junction contracted into the rpl2 gene, whereas it extended into the rps19 in all the other analysed species. Two pseudogenes (ψrps19 and ψycf1) of varying length were generated in the IRa region, as a result of the expansion of the IR into the exons of rps19 and ycf1 genes respectively (Fig. 4). This pattern of expansion of the IR, and the introduction of partial copies of genes with non-coding abilities represents a familiar phenomenon in majority of Asteraceae species (Wang et al. 2015), and besides being a source of DNA barcodes, it can offer insights into the evolutionary processes of plastid genomes.
The entire sequences of the 11 chloroplast genomes generated here, lack any striking inversions or rearrangements and therefore were outlined as a single locally collinear block in our analyses. However, certain regions harboured divergent sites the majority of which were in the non-coding regions. Among the few sites with significant deviations include trnH-(GUG)-psbA, ndhD-ccsA, trnT(UGU)-trnL(UAA), ndhI-ndhG, and trnL-UAG-rpl32. Other regions including trnL(UAG)-rpl32 and the exons of ndhF and ycf1 were within a conserved block, but they had significant divergent points. These findings were supported by results obtained from the DNA polymorphism test based on genus representatives, as same regions were noted to have high nucleotide variability (Pi). Some of these regions have previously been reported in chloroplast genomes of other species (Salih et al. 2017; Wu et al. 2018) and used in phylogenetic studies of numerous taxa including Senecio (Kandziora et al. 2016). Non-coding regions in chloroplast genomes have shown high potential for use as molecular markers for phylogenetic studies at low taxonomic levels in Angiosperms (Shaw et al. 2005). Therefore, the regions identified herein are prospective sources of highly informative markers for phylogenetic utility in elucidating intergeneric relationships within the tribe. Subsequently, ten potential markers were developed, allowing for specific amplification of each of the ten most polymorphic sites.
We compared the Senecioneae chloroplast genomes against Nicotiana tabacum and detected the two inversions reported to be shared by all clades of Asteraceae family, apart from species of the Barnadesioideae subfamily (Kim et al. 2005). Six conserved gene blocks were identified among the chloroplast genomes indicating the most conserved regions of the genomes. The SSC region in Ligularia fischeri was differently oriented in relation to the other Asteraceae species. This re-inversion is considered an ordinary phenomenon among chloroplast genomes of higher plants, and it is not a product of any evolutionary event, as single-copy regions exist in two equimolar states (Palmer 1983; Walker et al. 2015).
The rapid increase in the amount of complete chloroplast genome sequences during the past decade, provided essential data to elucidate further and resolve phylogenetic relationships among species. Consequently, in a move towards chloroplast phylogenomics, clarification of phylogenetic relationships at higher and lower taxonomic levels have been achieved (Lu et al. 2015; Ma et al. 2014; Wu et al. 2013). The tribe Senecioneae is often subdivided into three subtribes; Senecioninae, Tussilagininae, and Blennospermatinae (Chen et al. 2011). In this study, the multi-gene analysis resulted in a phylogenetic tree whose branches were strongly supported. The five genera of Senecioneae formed two distinct clades that corresponded to two of the three subtribes of Senecioneae including Senecioninae (Dendrosenecio, Senecio, Jacobaea and Pericallis) and Tussilagininae (Ligularia; Fig. 7, Additional file 5: Figure S2). The 11 newly generated species were well placed within the Senecioneae tribe by both ML and BI phylogenetic methods under partitioned and unpartitioned data schemes. The afro-alpine species of Senecio are classified in at least five clades of Senecio (Kandziora et al. 2016). The present phylogenetic study recovered a monophyletic group with two sub-clades which split S. keniophytum and S. roseiflorus from S. purtschelleri, S. moorei and S. schweinfurthii. The relationships among the species of Senecio was identical in all data schemes differing only in support of the S. moorei and S. schweinfurthii relationship (94% and 0.3 PP; Fig. 7). The genus Jacobaea, just like Dendrosenecio, was previously classified in Senecio under section Jacobaea (Pelser et al. 2002) but later segregated from Senecio based on new insights from molecular phylogeny (Pelser et al. 2006). A sister relationship between Senecio and Jacobaea was highlighted in this study.
Previously, genetic relationship within Dendrosenecio species was shown to be strongly correlated to geographic distance (Knox and Palmer 1995a) as geographically close species were genetically more related than distantly located species. In this study two clades were formed within a monophyletic group of Dendrosenecio, one contained D. johnstonii and D. meruensis, both from Tanzanian mountains. The other clade had species from Kenyan mountains; D. keniodendron, and D. battiscombei (Mt. Kenya), D. elgonensis subsp. elgonensis (Mt. Elgon) and D. brassiciformis, which was sampled from the Aberdare ranges (Fig. 7a). However, the sister relationships among the Kenyan species was conflicting between the ML and the BI phylogenetic reconstructions. In both ML trees a clear distinction is established concerning geographical (different mountains) and altitudinal (same mountain) variations (Fig. 7a; Additional file 5: Figure S2a), though this correlation is missing in the BI tree (Fig. 7b; Additional file 5: Figure S2b). Therefore, there is need to carry out further analyses, including more species of Dendrosenecio from all habitats in order to make a comprehensive conclusion. A majority of the intergeneric relationships defined here were significantly supported and congruent to most of the previous studies based on a few DNA fragments, including the position of Parastrephia quadrangularis (Meyen) Cabrera within the species of Diplostephium Kunth (Vargas et al. 2017). Therefore, this study strongly underscored the potential of chloroplast genome-scale data in outlining both inter- and intra-generic phylogenetic relationships within the tribe Senecioneae species and in the family at large. Nonetheless, interspecific relationships were weakly supported and therefore, further comprehensive studies that include more taxon sampling are necessary to enhance our understanding of the evolutionary histories of both Senecio and Dendrosenecio.
Dendrosenecio is a segregate genus to Senecio. Despite exhibiting some striking morphologic similarities, a few differences existed based on which the two genera were separated. Initially, controversies arose over the segregation although a consensus was later arrived at. Amplified fragment length polymorphism data distinctly separated the two genera, affirming the earlier decisions. However, lack of or limited molecular resources have impeded further studies on the respective genera. This study generated the first complete chloroplast genome sequences in each genus. Chloroplast genomes in both genera are highly similar in structure, gene composition and synteny, but they significantly differ in size. A chloroplast genome multi-gene dataset revealed three strongly supported clades within the tribe Senecioneae, markedly splitting Dendrosenecio from Senecio. Ten primers, targeting the ten highly divergent regions in the chloroplast genomes of Senecioneae species, were designed. Also, 25 polymorphic cpSSR in Senecio and five in Dendrosenecio were identified. The ten divergent hotspots could offer the much-needed DNA barcodes for species identification and phylogenetic reconstructions within the tribe, while the cpSSRs provides potential markers for future population-level research in each respective genus.
We thank Elizabeth M. Kamande for her support in fieldwork, Josphat K. Saina for his assistance in data analyses, and Justus M. Mulinge for proofreading the manuscript.
QF, GH, and JC conceived and designed the experiment. AWG, SA, GH, and JC conducted fieldwork. AWG and SA performed the experiments. AG, SA, and ZL analysed the data. AG wrote the manuscript. ZL assisted in revising the manuscript. All authors read and approved the final manuscript.
This work was funded by Sino Africa Joint Research Center (Nos. Y323771W07 and SAJC201322) and the CAS-TWAS President’s Fellowship Program for developing countries.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
- Beentje HJ, Jeffrey C, Hind DJN (2005) Compositae (Part 3). In: Beentje HJ, Ghazanfar SA (eds) Flora of tropical east Africa. Royal Botanic Gardens, KEW, London, pp 547–869Google Scholar
- Chen YL et al (2011) Senecioneae. In: Wu ZY, Raven PH, Hong DY (eds) Flora of China, vol 20–21. Missouri Botanical Garden Press, Beijing, pp 371–544Google Scholar
- Dempewolf H et al (2010) Establishing genomic tools and resources for Guizotia abyssinica (L.f.) Cass.—the development of a library of expressed sequence tags, microsatellite loci, and the sequencing of its chloroplast genome. Mol Ecol Resour. 10:1048–1058. https://doi.org/10.1111/j.1755-0998.2010.02859.x CrossRefPubMedGoogle Scholar
- Doorduin L, Gravendeel B, Lammers Y, Ariyurek Y, Chin-A-Woeng T, Vrieling K (2011) The complete chloroplast genome of 17 individuals of pest species Jacobaea vulgaris: SNPs, microsatellites and barcoding markers for population and phylogenetic studies. DNA Res 18:93–105. https://doi.org/10.1093/dnares/dsr002 CrossRefPubMedPubMedCentralGoogle Scholar
- Downie SR, Jansen RK (2015) A comparative analysis of whole plastid genomes from the Apiales: expansion and contraction of the inverted repeat, mitochondrial to plastid transfer of DNA, and identification of highly divergent noncoding regions. Syst Bot 40:336–351. https://doi.org/10.1600/036364415X686620 CrossRefGoogle Scholar
- Doyle JJ (1987) A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull Bot Soc Am 19:11–15Google Scholar
- Hedberg O (1957) Afroalpine vascular plants: a taxonomic revision. Symb Bot Upsal 15:1–411Google Scholar
- Knox EB (2005) Dendrosenecio. In: Beentje HJ, Jeffrey C, Hind DJN (eds) Flora of Tropical East Africa, Compositae (part 3). Kew, Royal Botanic Gardens, LondonGoogle Scholar
- Kumar S, Hahn FM, McMahan CM, Cornish K, Whalen MC (2009) Comparative analysis of the complete sequence of the plastid genome of Parthenium argentatum and identification of DNA barcodes to differentiate Parthenium species and lines. BMC Plant Biol 9:131. https://doi.org/10.1186/1471-2229-9-131 CrossRefPubMedPubMedCentralGoogle Scholar
- Milton JJ (2009) Phylogenetic analyses and taxonomic studies of Senecioninae: southern African Senecio section Senecio. University of St Andrews, St AndrewsGoogle Scholar
- Nordenstam B (1978) Taxonomic studies in the tribe Senecioneae (Compositae). Opera Bot Lund 44:1–84Google Scholar
- Nordenstam B (2007) Tribe Senecioneae. In: Kadereit JW, Jeffrey C, Kubitzki K (eds) The families and genera of vascular plants, vol 8. Springer, Berlin, pp 208–241Google Scholar
- Nordenstam B, Pelser PB, Kadereit JW, Watson LE (2009) Senecioneae. In: Funk VA, Susanna A, Stuessy TF, Bayer RJ (eds) Systematics, evolution, and biogeography of composiate. International Association for Plant Taxonomy, ViennaGoogle Scholar
- Pelser PB, Veldkamp JF, van der Meijden R (2006) New combinations in Jacobaea Mill. (Asteraceae–Senecioneae). Compos Newsl 44:1–11Google Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.