Introduction

Toona plants have magnificent material, straight surface, and radiance, turning into the predominant furnishings and inside adornment wood, known as "Chinese mahogany", which is greatly esteemed by individuals [1,2,3,4,5]. T.sinensis and T.ciliata have the most noteworthy application in the Toona. Toona sinensis (A. Juss) Roem is a unique species of vegetable in China, its young shoots and leaves are crisp and juicy, fragrant and unique in flavor, it is a traditional and valuable woody vegetable that our people like to eat, and is also a local product for foreign trade export [6, 7]. T.sinensis is not only an excellent vegetable but also a natural green nutritious food of medicinal and food origin. It has somewhat high happiness of flavone and other pharmacologically dynamic mixtures [8, 9]. T.ciliata is a Grade II safeguarded plant, an important timber tree, and a therapeutic plant that has acquired broad consideration as of late [10,11,12,13]. The roots, stems, and leaves of T.ciliata can be utilized as medication and have successful restorative properties [14,15,16]. The monetary worth of this variety is quite high, and it is generally utilized and has extraordinary potential for advancement and utilization [17,18,19,20].

Mitochondria are organelles in higher plant cells with a semi-autonomous genetic system that provides the majority of the energy required for cellular and other life activities [21,22,23,24]. Mitochondria are particularly important in the study of the origin and evolution of living things. Mitochondrial DNA (mtDNA) is a genetic material found outside the nucleus that is normally a double-stranded circular molecule with a covalent closure [25,26,27]. Advanced plants have the largest mitochondria of any known higher organism species, ranging from 200 to 2400 kb [28,29,30].

Plant Mt genomes have been increasingly studied and more and more Mt genomes have been sequenced in recent years, which is very important for studying the diversity of biological phenotypes, functional diversity, as well as species evolution. This is critical for understanding biological phenological diversity, functional diversity, and the emergence of new functions during species evolution [31, 32].

Even though Toona plants have a long history of cultivation in China, most studies have been limited to chemical pathology, physiology, biochemistry, introduction, and breeding, with little research done on its origin, taxonomy, cytogenetics, and so on. There are still some issues with Toona classification, such as interspecific hybridization, that need to be addressed [33, 34]. Furthermore, Toona plants have a geographically dispersed distribution in China, resulting in a scarcity of natural forests and susceptibility to natural and anthropogenic breakage, T.ciliata has now been classified as an endangered species, listed as a Class II key protected wild plant in China, and included in the Reference List of Major Cultivated Precious Tree Species in China and [35,36,37,38].

Subsequently, this review, given Mt near genomic examination through trend-setting innovations, for example, sub-atomic sequencing of Mt DNA, makes it conceivable to concentrate on Toona further top to bottom according to a minuscule viewpoint notwithstanding plainly visible morphological characterization and makes the preservation of excellent hereditary assets of the imperiled species T.ciliata, determination and reproducing of good species, and advancement and usage with significant hypothetical and functional importance.

Materials and methods

Plant material, DNA extraction, and library construction

While T.sinensis was acquired from Pingxiang, Guangxi, T.ciliata was obtained from Baoshan, Yunnan (Longitude: 106.75 E, Latitude: 22.12 N.) Before this investigation, both species completed seedling trials and were found to be suitable for cultivation in Guangzhou, Guangdong. (Note: Professor Xiaoyang Chen and Teacher WeiZhou conducted a detailed identification of the plant material. The seed trial forest is situated near the South China Agricultural University's teaching and research facility in Guangzhou, China, at N23°16′ and E113°37′.)

High-quality total DNA is the primary prerequisite for obtaining the whole Mt genome sequence. Fresh leaves of T.sinensis and T.ciliata were taken and whole genome DNA was extracted by the CTAB [39] method. high-quality genome DNA was extracted and quality checked for purity, concentration, and integrity using Nanodrop [40], 1% (w/v) agarose gel electrophoresis. DNA samples that passed the electrophoresis test were randomly broken into fragments of approximately 350 bp in length using a Covaris ultrasonic fragmentation machine [41]. After processing, the DNA fragments were subjected to end repair, A-tail addition, sequencing junction addition, purification, PCR amplification, and other steps to complete the entire library preparation. After the library was constructed, the initial quantification was performed using Qubit 3.0, and the library was diluted to 2 ng/ul. The insert size (insert size) of the library was then detected using Agilent 2100 [42, 43]. After the inserts met the expectation, the effective concentration of the library was accurately quantified by Q-PCR [44] to ensure the quality of the library. After the libraries passed the test, they were sent to Guangzhou Ruike Gene Technology Co.

Sequencing, assembly, and annotation

Qualified DNA libraries were sequenced using the Illumina HiSeq 4000 High-throughput Sequencing Platform. Once the sequencing was completed, the sequenced data were spliced into the Mt genome. The reads with low sequencing quality (< 40 bp in length) were filtered by Trimmomatic [45], the overlapping reads were filtered out by Blast to obtain Clean Data, and the sequencing data were analyzed by 15-mer using K-mer software to obtain high-quality reads. Assembly was performed using SOAP denovo [46] assembly software. The preliminary assembly results were optimized and holes were filled using krskgf and gapclose [47] software to obtain the specific assembly results.

The complete Mt genome sequence was annotated utilizing CPGAVAS [48] software together with DOGMA [49] software. Comparison analysis of blast on the proximal edge, followed by manual correction(Specific results of T.sinensis and T.ciliata annotations can be detailed in  Additional file 7 and 8: Appendices G and H, respectively). Transfer RNA (transfer RNA, tRNA) genes were identified along with manual correction employing tRNAscan-SE [50] software. The BLAST [51] search method was performed to align [52,53,54] and validate [55] the information sites such as gene boundaries, intron, exon, and coding regions.

The annotated genome sequences were submitted to NCBI according to the requirements, resulting in the definitive accession numbers T.sinensis (GenBank: OM574631.1) and T.ciliata (GenBank: OM574630.1).

Superior Mt genome analysis

Structure and composition

Mitochondria were mapped by OGDRAW vl.2 (Organellar Genome DRAW) [56] online website (https://chlorobox.mpimp-golm.mpg.de/OGDraw.html). The circular structure of the genome sequence was mapped. The base content of the Mt genome was calculated using Editseq [57] software to obtain the ratio of A, T, C, G, and GC content respectively.

Frequency of codon usage

Considering the formula mentioned in Sharp PM literature [58], the utilization of relative equivalent codon use (RSCU) was examined utilizing CodonW [59] software.

Simple sequence repeats

Simple Sequence Repeats (SSRs) of the Mt genome of T. sinensis and T. ciliata were analyzed using MISA [60] software, with the tandem repeat unit length and a minimum number of repeats set to > 10 for single nucleotide repeats, > 6 for dinucleotide repeats, and > 5 for trinucleotide, tetranucleotide, pentanucleotide, and hexanucleotide repeats. The minimum distance between SSRs was set to 100 bp.

Chloroplast and Mt genomes

The chloroplast genome sequences of T.sinensis (GenBank: OK572965) and T. ciliata (GenBank: OK572964.1) on NCBI were uploaded by our group before completion. Match them with the mitochondrial genome for Blastn (https://blast.ncbi.nlm.nih.gov/Blast.cgi) to find out the migrating gene sequences. Regions with similarity greater than 90% and comparison lengths greater than 50 bp were screened as migration sequences.

Phylogenetic tree analysis

Species (The specific Mt information etc. of the tree species can be detailed in a file called taxonomy in Additional file 4: Appendices D) with complete Mt genome arrangements and explanations in direct relation to the objective species were downloaded from NCBI for phylogenetic tree development. For more details on structural trees, the ML construction tree method is described in the folder titled "Description of the structure tree" in Additional file 4: Appendices D, while the Bayesian construction tree method is detailed in Additional file 5: Appendices E.

Results

Genome features

The total Mt genome length of T.ciliata was 683,000 bp, the composition of bases was A (27.31%), T (27.29%), C (22.56%), and G (22.85%), and the C + G content was 45.40%. The size of the T. sinensis Mt genome was 638,482 bp, and its base makeup was A (27.35%), T (27.09%), C (22.79%), and G (22.76%), with a C + G content of 45.56%. All of them have a circumferential Mt genome construction, where their longest gene is the rrn26 gene in the transfer RNA, measuring 3116 bp (Table 1, Fig. 1).

Table 1 Results of mt DNA genome sequence analysis of two plants
Fig. 1
figure 1

A map of the Mt genome of the Toona. A T.sinensis, B Toona ciliate. Reverse transcription is indicated by genes outside of the circles, and clockwise transcription is indicated by genes inside the circles. The two IR regions are represented by the thick black line on the outside circle. The GC content is represented by the inner nucleus' dark gray graph

Functional gene

Gene encoding protein

T.ciliata encodes 71 genes while T.sinensis encodes 72 genes. Protein-coding genes of both Toona plants are consistent in frequency, types, and measurements (Additional file 1: Appendices A), whereas the predominant divergence is in tRNAs, with T.ciliata encoding 33 tRNAs and T. sinensis encoding 34 (Additional file 2: Appendices B).

Employing NCBI-BLAST analysis, 38 genes encoding proteins were obtained on the Mt genomes of both T.ciliata and T. sinensis. We categorized the protein-encoding genes into the following eight categories according to their gene functions (Table 2): including Complex I genes ( nad1, nad2, nad3, nad4, nad4L, nad5, nad6, nad7, and nad9) involved in the synthesis of NADH deaminase subunits; Complex II genes ( sdh3 and sdh4) participated in the synthesis of cytochrome b precursor subunits; Complex III gene (cob) implicated in the synthesis of the cytochrome C oxidase subunit; Complex IV genes (cox1, cox2, and cox3); Complex V genes (atp1, atp4, atp6, atp8, and atp9), associated with the synthesis of ATP synthase subunits; Cytochrome c biosynthetic genes (ccmB, ccmC, ccmFC and ccmFN) engaged in the synthesis of cytochrome C synthase subunits; Ribosome protein genes synthesized by ribosome protein synthesis genes (rps1, rps3, rps4, rps10, rps12, rpl2, rpl5, rpl10 and rpl16); The ribosomal RNA genes (rrn5, rrn18 and rrn26) as well as the matR gene (encoding a maturation-like enzyme) and the mttB gene (encoding a transporter).

Table 2 Number and proportion of each type of SSR in T.sinensis and T.ciliata

Gene encoding tRNA

Utilizing tRNAscan-SE, 33 and 34 genes encoding transfer RNAs were identified separately on the Mt genomes of T.ciliata and T. sinensis.

In the T.ciliata, a total of 33 tRNAs encode 20 amino acids ranging from 66 bp-88 bp in length. five of these tRNASers, Leucine(LeU), Glycine(Gly), Gly, Cysteine(Cys), Argnine(Arg), and Lysine(Lys) each have two tRNAs encoding, Met and Pro are distributed with three tRNAs encoding, and the remaining amino acids all have one tRNA Editor. In contrast to T.ciliata, T. sinensis has 34 tRNAs encoding 20 amino acids, ranging from 63–167 bp in length. compared to T.ciliata, T. sinensis has 2 fewer tRNAs encoding Cys and Arg, but 3 more tRNAs encoding Proline(Pro), Isoleucine(ILe), and Glutamic acid(Glu) (Additional file 2: Appendices B).

Codon Usage bias

RSCU (Relative Synonymous Codon Usage) is a relative synonymous codon usage measure, indicating the proportion of a given synonymous codon usage among all synonymous codons. The Mt genomes of T.ciliata and T. sinensis have a codon usage bias for all amino acids except for the Tryptophane (Trp) of only one codon, TGG.

The codon TTT was the most frequently accessed codon in the Mt protein-coding genes of T.ciliata and T. sinensis, with the second commonest codon being ATT and the third being TTC. The termination codon TAG was the least frequently addressed codon, being exclusively indexed on six and five occasions respectively (Additional file 3: Appendices C).

SSRs

A total of 94 simple sequence repeats were detected in the mitochondrial genome of T.sinensis while 87 were detected in T. ciliata (Annex E). The distribution of each type of SSRs can be observed from the statistical results (Table 2), where T. sinensis mono-, di-, tri-, and hexa-nucleotides had 75, 14, 6, and 1, respectively. No tetra-nucleotides were detected and penta-nucleotides. T.ciliata, on the other hand, had 70, 13, 6, and 1, respectively. However, T.ciliata also detected 1 tetra-nucleotides (CGA).

The major repeat types of SSRs are single nucleotide repeats, with the number of A/T in the relevant single nucleotide repeats being much larger than the number of G/C (Table 3). The proportion of A/T on polynucleotide repeats is also greater than the proportion of G/C, judging from the data in Additional file 6: Appendices F. It is consistent with the results of their codon preferences.

Table 3 Distribution of the number of single nucleotide repeats

Genome alignment and migration sequence

Even though T. sinensis and T.ciliata mitochondrial genomes are up to four times longer than those of chloroplasts, they only have half as many protein-coding genes, making up less than one-fifth of the total length, whereas the proportion of protein-coding in chloroplasts is around 50% of the total length (Table 4). There were no introns found in the chloroplast genome of T. sinensis or T. ciliata, 21 introns were released in the mitochondrial genome, and the rRNA numbers of the two tree species were very congruent in both genomes. T. sinensis was larger than T.ciliata in the mitochondrial genome but had one more tRNA, which may be related to the exchange of genetic material in nuclear genes or cytoplasm. Both T. sinensis and T. ciliata had 37 numbers in the chloroplast genome.

Table 4 Comparison of chloroplast and mitochondrial genomes of T. sinensis and T. ciliata

We discovered that T. ciliata had 11 migratory sequences and T. sinensis had 10 when we compared the chloroplast and mitochondrial genome sequences under the screening criteria of areas with similarity greater than 90% and comparison length greater than 50 bp(Table 5). The largest of these migratory sequence segments measured 4124 bp. Comparatively, we discovered that T.sinensis While only one sequence fragment of T.ciliata was consistent, with a variation of 1–8 bp, we discovered that only three sequence fragments of T. sinensis and mitochondria were consistent in size, with the others varying by 1–3 bp. The mitochondrial genome's recombination and gene rearrangement were linked to variations in sequence, which may indicate that after migratory integration, these fragments may have undergone separate replication and recombination within the mitochondrial genome recombination.

Table 5 Gene sequences of T. sinensis and T. ciliata mitochondrial genomes derived from the chloroplast genome

Phylogeny analysis

Aiming to ascertain the evolutionary status of T.ciliata and T. sinensis in the plant system, we downloaded the mtDNA sequences of the same ORDER relatives that have published their mtDNA sequences on NCBI. The two approaches of amino acid construction tree and DNA sequence construction tree are described in Additional file 3: Appendices C. Six Anacardiaceae species, five Sapindaceae species, three Rutaceae species, two Nitrariaceae species, and two Meliaceae species, T.ciliata (OM574630) and T.sinensis (OM574631), for a total of 18 tree species. The outgroup for the Mt genome was Morus notabilis (NC 041177.1), and an evolutionary tree was constructed using the maximum likelihood method using the software MEGA 11. Bayesian tree (BI) and maximum likelihood method (ML) to create phylogenetic tree topology are similar, only the support at a few branches varies(Only the values between the large branches clustered into Anacardiaceae and Nitrariaceae had large divergences, where the ML tree had a support of 69, while the BI tree was 99). In this research, the tree with the maximum likelihood tree is selected, detailed in Fig. 2, while the result regarding the BI development tree is detailed in Fig. 3. In the likelihood ML phylogenetic tree (Fig. 2), a total of 15 nodes were formed, nine of which had 100 percent support, except for the large branch of Anacardiaceae and Nitrariaceae, which had 69 percent support, and Xanthoceras sorbifolium (MK333231.1) and Sapindus mukorossi (MT806100.1), which formed a minor branch with only 56 percent support, but all the other nodes had no less than 93% support.

Fig. 2
figure 2

Phylogenetic trees constructed based on the ML method for 19 related plants

Fig. 3
figure 3

Phylogenetic trees constructed based on the BI method for 19 related plants

Discussion

Mitochondrial Structure and Genetic Information

In terms of GC content, gene content, and genetic codon usage preference, functional gene and the numerical Mt genomes of T.ciliata and T. sinensis were well conserved. Moreover, they are similar in the results of codon preference and RSCU values. These encoded genes are mainly concerned with the synthesis of ATP synthase subunits, cytochrome C synthesis, and ribosomal protein synthesis. This provides the theoretical conditions for the exploration of the mechanisms and pathways of metabolite synthesis reactions including respiration and other related metabolites between the two species.

Genome comparison and genetic sequence migration

The tRNA from the chloroplast or nucleus will be involved in the transport of amino acids to reach the number of amino acids required for life. They encode the same 20 amino acids, but T. sinensis has one more tRNA than T.ciliata. Gene transfer in cells occurs between different organelles, including chloroplasts, mitochondria, and the nucleus [61, 62]. The vast majority of sequences in the mitochondrial genome that migrate from the chloroplast genome are currently considered "dead on arrival", except tRNAs [63]. Expression of chloroplast-derived tRNA genes in the mitochondrial genome has been shown to exist [64, 65]. For the extra tRNA in the T.sinensis mitochondria, there is no relationship with the transfer of tRNAs from the chloroplast to the T. sinensis mitochondria, which should be associated with the nuclear genome. However, regarding specific gene exchange, it is required to develop in-depth research on gene communication between the nuclear genome and organelles.

Repeat sequence

Simple sequence repeats (SSRs), also known as microsatellites, are widely distributed on the mitochondrial genome [66]. Due to their high polymorphism and co-dominance, microsatellites are often used as molecular markers to assist in breeding [67] construction of genetic linkage maps and gene mapping, etc. [68].

In contrast, there are some significant variations in SSRs between T.sinensis and T.ciliata, for example, T.ciliata has one more tetra-nucleotides (CGA), whether this has an evolutionary link to the two plants. It will provide a point for the subsequent screening of genetic molecular markers. In addition, further research on homologous recombination mediated by repeated sequences, Toona kinship, and genetic distance will be conducted.

Systematic evolution

T.ciliata and T.sinensis have similar morphological characteristics and cultivate in similar environments, so the traditional morphological taxonomy considers the two plants to be cloplantssely related [69, 70]. In the phylogenetic tree, the target tree species T.ciliata and T.sinensis, belonging to the Meliaceae, clustered into a narrow branch with 100% support. This unifies with the results of traditional morphological taxonomy.

Since plants in the Sapindaceae are more susceptible to geographical location and their genetic variation, the evolutionary distance and genetic variation of plants within the Sapindaceae vary widely [71, 72]. Flora of China records that Meliaceae, Rutaceae, Anacardiaceae, Sapindaceae, and Nitrariacea are natural taxon.

Taxonomists such as Rendle, Hutchinson, and others, who have organ morphological classification, have concluded that Meliaceae and Rutaceae are closely related, but for the classification of the degree of affinity between them, most of them are distinguished from plant physiology and morphology, less from the molecular level of genes [73, 74]. The establishment of the ML evolutionary tree provides a preliminary evolutionary relationship between Meliaceae and Rutaceae at the Mt genome level, but there are limitations because the published Mt genome sequences of plants are still quantitatively insufficient to represent the family level.

Conclusion

The completion of the Mt genome sequencing of T.ciliata and T. sinensis has enriched the Mt genome library of Toona. and is important for investigating interspecific species relationships and researching the genetics and evolution of Toona.

The Mt genomes are predominantly maternally inherited and do not originate in the recombinant genome, therefore, they may have dissimilar evolutionary mechanisms and might reflect different evolutionary information. Further research on gene recombination, locus analysis, etc. can theoretically be supported by the identified moving sequence fragments. Phylogenetic tree building also further illustrates that the simulation of Mt genomic evolutionary tree outcomes is moderately compatible with the traditional classification. The Mt genome can be acclaimed as a molecular marker for the investigative assessment of phylogenetic relationships among species and the genetic structure of populations.

Regarding T.ciliata and T. sinensis, it is of great value for the data on their energy metabolism, growth and development, and hybrid breeding.