Genetic variation architecture of mitochondrial genome reveals the differentiation in Korean landrace and weedy rice

Tong, Wei; He, Qiang; Park, Yong-Jin

doi:10.1038/srep43327

Genetic variation architecture of mitochondrial genome reveals the differentiation in Korean landrace and weedy rice

Article
Open access
Published: 03 March 2017

Volume 7, article number 43327, (2017)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Genetic variation architecture of mitochondrial genome reveals the differentiation in Korean landrace and weedy rice

Download PDF

Wei Tong^1,2^na1,
Qiang He¹^na1 &
Yong-Jin Park^1,3

2923 Accesses
Explore all metrics

Abstract

Mitochondrial genome variations have been detected despite the overall conservation of this gene content, which has been valuable for plant population genetics and evolutionary studies. Here, we describe mitochondrial variation architecture and our performance of a phylogenetic dissection of Korean landrace and weedy rice. A total of 4,717 variations across the mitochondrial genome were identified adjunct with 10 wild rice. Genetic diversity assessment revealed that wild rice has higher nucleotide diversity than landrace and/or weedy, and landrace rice has higher diversity than weedy rice. Genetic distance was suggestive of a high level of breeding between landrace and weedy rice, and the landrace showing a closer association with wild rice than weedy rice. Population structure and principal component analyses showed no obvious difference in the genetic backgrounds of landrace and weedy rice in mitochondrial genome level. Phylogenetic, population split, and haplotype network evaluations were suggestive of independent origins of the indica and japonica varieties. The origin of weedy rice is supposed to be more likely from cultivated rice rather than from wild rice in mitochondrial genome level.

Does long-term harvesting impact genetic diversity and population genetic structure? A study of Indian gooseberry (Phyllanthus emblica) in the Central Western Ghats region in India

Article 20 June 2024

Genetic diversity and population structure of Canna edulis accessions in Vietnam revealed by ISSR markers

Article 09 May 2023

The domestication and breeding history of Castanea crenata Siebold et Zucc. estimated by direction of gene flow and approximate Bayesian computation

Article 28 September 2023

Introduction

The mitochondrial (mt) genome plays an essential role in cell metabolism. Plant mt genomes vary greatly in size, from ~200 kb to 2 Mb mostly, and are substantially larger than mt genomes of other eukaryotes (https://www.ncbi.nlm.nih.gov/genome/browse/?report=5#)^1,2. Some specialized plant (like Silene conica) even has a genome size over 10 Mb³. Physical mapping and sequencing of some of the small mt genomes show that their structures are shaped by active recombination, gene transfer to the nucleus, and other forces that remain unclear⁴. Structural analyses revealed high frequencies of intra- and intermolecular recombination, which generated a structurally dynamic assemblage of genome configurations^5,6. This dynamic organization of the plant mt genome provides a powerful model for the study of genome structure and evolution. These genomes exhibit an intriguing mixture of conservative (slowest rates of nucleotide substitution)^7,8 and dynamic evolutionary patterns. Some previous studies^9,10,11,12 also suggested that for evolutionary studies it is not necessary to assemble whole organelle genomes but just exploring the variations.

The complete chloroplast and mt genomes of rice are available^2,13,14,15, and comparative analysis showed that the gene order and essential gene content are highly conserved for most chloroplast genomes. In contrast, mt-encoded genes are highly conserved, but their gene order, genome structure, and genome size are highly variable among plant species^2,16. In rice, the intersubspecific polymorphism rate for mitochondrial genome is 0.02% for SNPs (single nucleotide polymorphisms) and 0.006% for indels (insertions and deletions) and the intravarietal polymorphism rates among mitochondrial genomes are about 1.3% for SNPs and 1.1% for indels, respectively. Some intravarietal polymorphisms are fixed in indica or japonica, which can be used as specific markers in distinguishing the two subspecies². Whole-organelle genome sequencing, especially for the chloroplast and mt genomes, has been applied recently as a potential barcode¹⁷ that can assist in overcoming the previous process of collecting data over generations. Furthermore, due to recombination in the nucleus, data may lead to the construction of unreliable phylogenies; organelles are structurally stable, non-recombinant, and haploid, and thus offer certain advantages in phylogenetic reconstruction.

Asian cultivated rice (Oryza sativa L.) is generally thought to have been domesticated from O. rufipogon several thousands of years ago^18,19,20,21. However, some debate regarding the origin of cultivated rice has emerged over the past several years, centering on whether the two major rice cultivars, O. sativa ssp. indica and japonica, were derived from a single ancestor or were domesticated independently at different locations^{10,19,22,23,24,25}. As a consequence of adaptation to different habitats, extensive genotypic and phenotypic diversity exists within O. sativa L., resulting in about 120,000 different accessions²⁰. These accessions range from traditional rice landraces preserved by indigenous farmers to the commercially bred cultivars developed during the green revolution. Landraces are local varieties of a domesticated plant species that were adapted to their natural and cultural environments. Rice landraces are lineages developed by farmers through artificial selection during the long-term domestication process. Each landrace has particular properties or characteristics, such as early maturity, adaptation to particular soil types, resistance or tolerance to biotic and abiotic stresses, and properties related to the expected end usage of the grains. Exploring the genetic basis of these diverse varieties will provide important insight for the breeding of elite varieties for sustainable agriculture.

Weedy rice (Oryza sativa f. spontanea Rosh.), harbors characteristics of undomesticated Oryza species, including seed dispersal mechanisms and seed dormancy. It also possesses traits of domesticated rice, such as rapid growth, and resembles domesticated rice during the seedling stage, which promotes its invasiveness in the agroecosystem^26,27. The origin of weedy rice has long been discussed, which has led to several hypotheses, including the ongoing selection and adaptation of wild rice^28,29, hybridization between cultivated rice and its progenitor type³⁰, hybridization between indica-japonica²⁶, ongoing and multidirectional hybridization between weedy rice and cultivated types, and among weedy types³¹. Simple sequence repeats based analyses of the genetic diversity of weedy rice populations from China suggested that weedy rice most likely originated from local indica or japonica varieties^27,32. In Korea, weedy rice varieties have been collected from farmers’ fields, and their regional distribution and genetics have been characterized extensively^33,34. As weedy rice is a member of the Oryza genus, gaining an understanding of the genetic background of problematic weedy species by examining the underlying genomic information is important.

In the present study, a collection of 60 Korean landrace and weedy rice, and 10 wild rice varieties, including O. sativa L. ssp. indica and japonica, O. rufipogon, and O. nivara, were selected to investigate the rice mt genome architecture. The mt genome of O. sativa L. ssp. japonica (Nipponbare, Genbank: NC_011033) was chosen as the reference for mapping variations in the collection. Mt genome variations in the germplasm were mined and subjected to comparative analyses among different groups (wild and the others; indica, japonica, and wild; as well as landrace and weedy). The diversity and population structure of these accessions were also examined at the mt genome level. Phylogenetic analyses were performed using the maximum likelihood (ML) and Bayesian inference (BI) methods, together with population split and haplotype network analyses, which could reveal phylogenetic relationships among landrace and weedy rice, and possibly the origin of weedy rice. This report provides a case study for the rice mt genome developed from whole genome resequencing, and the data generated here could be applied to further analyses of rice mt evolution and genetics.

Results

Mt variations across the genome

A number of variations in the rice mt genome were observed in the current study. In total, 4,717 variations, including 4,507 SNPs and 210 indels, were identified across the mt genomes of landrace and weedy rice accessions, along with wild rice. However, after removing missing values and heterozygotes, only 264 high-quality (HQ) variations remained (Table 1). Excluding wild rice, we identified 3,960 variations across the landrace and weedy rice accessions, with 203 HQ sites (Supplementary Tables 1 and 2). In total, 3041 variations (76.8% of total) among the 60 rice accessions located in the intergenic regions, while only 23.2% variations are in the gene region (Supplementary Table 1). When we assigned these variations to different groups, excluding the total SNPs, no significant difference was observed in the distribution of variations between landrace and weedy rice (Table 1). The overall distribution of variations in the total collection and different groups was also targeted based on the reference genome (Fig. 1), which suggested that the variations occurred with a region-dependent preferential. Few variations were observed in several large regions of the genome, some of which contained no variation (Supplementary Table 3).

Table 1 Summary and subgroup distribution of the total variations (SNPs and Indels) detected in 60 Korean origin landrace and weedy rice along with 10 wild rice using the mitochondrial genome of Oryza sativa japonica as reference.

Full size table

**Figure 1: Overall variation (SNPs and Indels) distribution across the mt genome.**

Variation architecture at the mt genome level

The nucleotide diversity (pi) of the mt genome in the whole collection and different subgroups (landrace, weedy and wild rice, and indica and japonica) was calculated (Fig. 2). In the whole collection, pi ranged from 0.0176 to 8.23E-06 with 1 kb slide window among whole mt genome. Most of the pi values were lower than 0.005 (Fig. 2a, Supplementary Table 4), while some regions with extremely high diversity were also identified. In most regions of the mt genome, landrace rice showed slightly greater diversity than weedy rice (Fig. 2b, Supplementary Table 5). Wild rice showed higher diversity than landrace and weedy rice at most mt genome regions(Fig. 2c, Supplementary Table 6). By sorting the pi values for O. rufipogon, O. nivara, and O. japonica and O. indica, we found that O. rufipogon had the greatest diversity and japonica had the least diversity among the four groups (Fig. 2d, Supplementary Table 7). The average pi of the indica and japonica types was greater in landrace rice than in weedy rice (Fig. 2e, Supplementary Table 8). These results were also suggestive of greater diversity in indica than in japonica. The genetic distance (Fst) among the different groups was also calculated, which revealed a very high level of breeding (low Fst value) between landrace and weedy rice, and a closer association between landrace rice and wild rice (0.1109) than weedy rice and wild rice (0.1228) (Fig. 2f). When we isolated the wild rice into O. rufipogon and O. nivara, and the landrace and weedy rice into indica and japonica (Fig. 2g), the weedy_indica was more distant with O. nivara than landrace_indica, indicating that weedy_indica had much lower level of breeding with O. nivara than the landrace_indica. Similar results were observed among the O. rufipogon and japonica types (landrace and weedy), illustrating that the landrace_japonica was much closer with O. rufipogon than weedy_japonica. However, the breeding level between weedy_japonica and landrace_japonica was much higher than between weedy_indica and landrace_indica. In addition, overall Fst values between O. rufipogon and japonica type were lower than those between O. nivara and the indica type in landrace and weedy rice.

**Figure 2: Mitochondrial genome nucleotide diversity (pi) and genetic distance (*Fst*).**

Admixed population structure of landrace and weedy rice

A neighbor-joined phylogenetic tree was constructed in PHYLIP with 1,000 replicates based on the mt genome SNPs, and the consensus tree is shown in Fig. 3a. As illustrated, the japonica, indica, and wild types fell into three subpopulations (POP1, POP2, and POP3, respectively). However, the landrace and weedy indica types were mixed, as were the landrace and weedy japonica types. This admixture was also observed in the population structure estimation with increasing K (number of populations) values from 2 to 6 (Fig. 3b). When the K value increased to 6, the clear separation of indica and japonica types was preserved; however, no obvious clustering of landrace_indica and weedy_indica be detected in indica group, similar to landrace_japonica and weedy_japonica in japonica group. PCAs (principal component analysis) of the population with and without wild rice were also performed to compare the grouping of the different groups, which revealed that the indica, japonica, and wild types could be grouped (Fig. 3c). However, the landrace or weedy indica and japonica types were mixed together (Fig. 3d), which was consistent with the results from the neighbor-join phylogenetic and population structure analyses.

**Figure 3: Population structure and principal component analysis of the collection.**

Phylogenetic relationships in landrace and weedy rice

Phylogenetic analysis of the whole collection was performed using a Bayesian MCMC search with MrBayes 3.2.5 software using the best-fit model K80 (Fig. 4a). In parallel, a ML iterative model-based method (with best-fit model SYM) with a bootstrap of 1,000 replicates to assess the reliability of the phylogeny reconstructed using PhyML was also conducted (Fig. 4b). A tanglegram was then constructed for the two trees generated using the two methods, which showed complete consistency with three major clusters: indica, japonica, and wild types. Although the tree topology structure generated by the two methods differed, the phylogenetic relationships of these accessions were similar. Wild rice was located in the middle with indica and japonica in the two opposite sides, suggesting that indica and japonica may have different origins. In addition, we found no clear separation of landrace and weedy rice within the indica and japonica types, which is indicative of their sophisticated genetic backgrounds.

**Figure 4: A tanglegram phylogenetic analysis using trees from ML and BI methods to compare the differences from two methods.**

Haplotype network and population splits based on the mt genome

Given the observed complex genetic relationship of landrace and weedy rice, and to support our results, we conducted a haplotype network analysis and population split test in the current population. The samples were divided into six groups: O. rufipogon, O. nivara, landrace_japonica, landrace_indica, weedy_japonica, and weedy_indica. The haplotype network revealed 11 haplotypes in the whole collection, dominated by two common haplotypes that were detected in the majority of landrace and weedy rice (Fig. 5a). Excluding the three wild types with haplotypes similar to the indica type, other wild rice accessions, including two indica suspected accessions, shared the other minor haplotypes. Generally, haplotype 1 was represented primarily by the indica type, whereas haplotype 2 was detected primarily in the japonica type. Analysis of population splits among the six groups showed that the indica and japonica types were separated by wild rice, which also indicating the different origin of indica and japonica (Fig. 5b).

**Figure 5: Haplotype network and population splits of the mitochondrial genome suggested by genome wide variations.**

Discussion and Conclusions

In this report, we mined mt variation in Korean landrace and weedy rice accessions together with 10 wild rice varieties (O. nivara and O. rufipogon). Generally, different mt genome variations were detected among different groups and the wild rice is more polymorphic than indica and japonica (Table 1). Landrace rice has more variations than weedy rice, indicates that landrace rice is more diverse than weedy rice. We also carried out an intersection of different groups, which revealed the wild, indica, and japonica types shared a huge number of variations (1,068, 22.6% of the total; Supplementary Fig. 1). Similar distributions could be observed in two other groups, with 1,543 variations (32.7% of the total) shared among wild, landrace, and weedy rice (Supplementary Fig. 1); and 1,317 variations (27.9% of the total) shared in indica or japonica varieties of landrace or weedy rice (Supplementary Fig. 1). Especially among indica and japonica types, specific variations were more common in landrace than in weedy rice. These results also support that the landrace rice is more diverse than weedy rice.

Evaluation of diversity and genetic distance of landrace and weedy rice revealed that weedy rice has slightly less nucleotide diversity than landrace rice, and that the landrace indica and japonica types have greater nucleotide diversity than weedy rice (Fig. 2). It is same with our previous result by using nuclear genome of Korea landrace and weedy rice³⁵. We found greater genetic distance from wild rice in Korean weedy rice than in landrace rice, with less between landrace and weedy types. The distance between landrace_indica and weedy_indica types was much greater than that between landrace_japonica and weedy_japonica types, which suggested that japonica is less diverse than indica. In addition, genetic distance evaluation suggested that weedy rice was far from wild rice but close to landrace rice, indicated the low breeding level between weedy rice and wild rice.

Population structure analysis suggested that landrace and weedy rice in Korea have mixed genetic backgrounds and cannot be separated in mt genome level (Fig. 3). Not only weedy but landrace rice has admixed population composition, as some varieties covered two or three subpopulation components (Fig. 3b). However, precious little wild rice background was shared in weedy rice accessions, foreboding the distant background of them. PCA of the whole collection indicated that the indica and japonica types showed admixture in landrace and weedy rice accessions (Fig. 3c,d). It supposed to be that the Korean weedy rice may not from the hybridization between wild and cultivated rice. In common, cultivated rice is also hardly to be hybridized with wild rice (O. rufipogon or O. nivara) in Korea, since there’s no wild rice in Korean farmers’ field in most cases. Another interesting event we found in the population structure was that most landrace or weedy rice only shared the background within landrace_japonica or landrace_indica, which means the indica or japonica of weedy rice may from independent cultivated rice (indica or japonica, respectively). We then conclude that Korean weedy rice not from the wild rice but from the cultivated rice itself.

Phylogenetic analyses using two different methods with different nucleotide substitution models revealed a same phylogenetic structure of all accessions, although the overall tree topology structure was very different (Fig. 4). Three groups (wild rice, indica, and japonica types) were well illustrated and separated. However, consistent with previous results, landrace and weedy rice could not be clearly seperated both in indica and japonica subgroups. This supported the complicated genetic background of landrace and weedy rice in mt genome level. As we know, the genome transfer is easily happened between mt genome and nuclear genome^14,36,37. Genetic differentiation of nuclear and mt genomes in indica, japonica and wild rice exists and may infers the different transfer patterns^36,38. Though we cannot totally avoid all interference from the transfer between mt and nuclear genome, we tried to minimize the bias by removing the most easily transferred region in our study. A set of 965 SNPs, which located at mt highly possible non-transfer region (these regions were identified from the Rice Genome Annotation Project, http://rice.plantbiology.msu.edu/annotation_pseudo_organellar.shtml) were isolated and applied for further evaluation. In addition, the same reference information for the SNPs mining and non-transfer regions characterization were employed, which would also reduce the bias. Population structure and phylogenetic analysis revealed that the results were similar by using all dataset and non-transfer dataset (Figs 3 and 4, Supplementary Fig. 2). Three subpopulations, indica, japonica and wild rice can be clustered into the same pattern between the two datasets (K = 5 with non-transfer SNPs and K = 6 with whole SNPs). The similar result by using whole SNP set and non-transfer SNPs indicated that the transfer pattern has limited impact in current study. This also suggested that the variations based method for phylogenetic or evolution studies is feasible, and using all variations in mt genome is also suit for such analysis with high decent accuracy.

Furthermore, we generated a haplotype network of the whole collection, which was dominated by two common haplotypes, including primarily the indica type (haplotype 1) and the japonica type (haplotype 2), respectively. Haplotype 1 harbored 15 accessions, and haplotype 2 covered 46 japonica accessions (Fig. 5). Haplotypes of landrace and weedy rice were contained within the two major haplotypes, and no additional minor haplotype was found in landrace or weedy rice. These results suggest that Korean landrace and weedy rice do not have unique background with each other at the mt genome level, which hold only the haplotypes distributed between indica and japonica.

Current report suggests that mt genome–based analyses can be applied in genetic diversity studies, as well as in population genetics and phylogenetic analyses. Outcomes from the rice mt genomes reveal and support the independent origin of O. sativa L. and also suggest that Korean weedy rice are more likely to be originated from cultivated rice rather than wild rice. Korean landrace and weedy rice have complicated genetic background and different genetic architecture in mt genome. According to a cytoplasmic-genetic male sterility genes study, the weedy rice in different regions most likely originated from local cultivated rice (indica or japonica) and the hybrid rice probably has been involved in the evolution of some weedy rice accessions³⁹. These results are also consistent with the outcomes from our report. A lack of relationship between weedy and wild rice was characterized in areas where O. rufipogon is still present⁴⁰, which also supports the conclusion from current report. It indicates that using mitochondrial genome or CMS (cytoplasmic male sterility) genes are a good method for replenishing the evidence from nuclear genome. Apart from nuclear genome–based genomic and evolution studies, we believe this study of the mt genome will increase our understanding of the genomics and evolution of Korea rice.

Methods

Samples and whole-genome resequencing

A core set containing 137 rice accessions with diverse types (landrace, weedy, bred) previously generated from worldwide varieties collected from the National Genebank of the Rural Development Administration (RDA-Genebank, Republic of Korea) using the program PowerCore^41,42 was selected for whole genome resequencing⁴³, in which 60 landrace and weedy type rice accessions were isolated for current mt genome analysis (Supplementary Table 9). In addition, 10 wild rice (Oryza rufipogon and Oryza nivara) from the resequencing set developed by Xu, et al.⁴⁴ were also combined in the present study (Supplementary Table 9). Raw data of the 10 wild rice accessions were downloaded from the European Nucleotide Archive (http://www.ebi.ac.uk/ena) under accession numbers SRA023116.

For the 60 landrace and weedy rice accessions, young leaves from a single plant were sampled and stored at –80 °C prior to genomic DNA extraction using the DNeasy Plant Mini Kit (Qiagen). Qualified DNA was used for whole-genome resequencing of the collected rice varieties, with an average coverage of approximately 9× on the Illumina HiSeq 2000 Sequencing Systems Platform (Illumina Inc.).

Data manipulation and variations

Resequencing raw data of all the accessions were trimmed using Sickle v1.2⁴⁵ to remove low-quality reads followed by alignment using BWA v0.6.2⁴⁶ to the Nipponbare mt genome sequence (Genbank: NC_011033). A Sequence Alignment/Map (SAM) file was created during the mapping and converted to a binary SAM (BAM) file with sorting. Removal of duplicates and addition of read group IDs were performed using Picard Tools v1.88 (http://picard.sourceforge.net/). Final realignment and identification of variation were performed using GATK software v3⁴⁷. The raw variant call format file (VCF format) of all accessions are available as Supplementary Dataset 1. Some scripts and commands used in the software were presented in Supplementary Dataset 2. Statistical analyses were performed to summarize the number and distribution of SNPs and indels based on the HapMap (Haplotype Map) file generated from the VCF file.

Mt genome variations architecture

Statistics evaluation of mt genome nucleotide diversity (π), population genetic distance (Fst), and Tajima’s D in the whole collection and different groups were conducted using VCFtools⁴⁸ across a sliding window 1000 bp in length with a 500-bp step size. Assessments of the calculation were also conducted in different groups to compare the groups divergence.

Population structure analysis

To estimate individual admixture assuming different numbers of clusters, the population structure was investigated using the program FRAPPE (Frequentist Estimation of Individual Ancestry Proportion), which allow for estimation of founding allele frequencies and individual admixture using maximum likelihood estimates⁴⁹. We increased the coancestry clusters spanning from 2 to 6 and ran analysis with 20,000 iterations. With an increasing K value range from 2 to 6, we could investigate the individual ancestry events in different clusters. A neighbor-join phylogenetic tree of the landrace, weedy and wild rice was constructed using PHYLIP package (Phylogeny Inference Package v3.695, http://evolution.genetics.washington.edu/phylip.html) with 1000 replicates. Principal component analysis (PCA) was conducted using TASSEL 5⁵⁰ based on the variations that will provide evidence and complement the population structure analysis.

Mitochondria-based phylogenetic, haplotype network and population splits

ML and BI methods were applied to construct phylogenetic trees for the current collection. Briefly, appropriate nucleotide substitution models were assessed using jModeltest 2.1.7^51,52. To perform phylogenetic analyses, indels were excluded to eliminate potential errors, as the software does not process indels well. A ML tree was conducted using PhyML 3.0⁵³ complemented by the best nucleotide substitution model GTR+G selected by the hierarchical LRT (Hierarchical Likelihood Ratio Test)⁵⁴, the Akaike Information Criterion (AIC)⁵⁵ and Bayesian Information Criterion (BIC)⁵⁶ with 1000 bootstrap replicates. A Bayesian tree was constructed using MrBayes 3.2.5⁵⁷ implemented with a Bayesian MCMC search, with two parallel runs of 2 million generations and four chains each. Best-fit model GTR+G were selected according to the Bayesian Information Criterion (BIC)⁵⁶ and compared with the tree generated by the ML method. The phylogenetic tree was displayed and modified using Figtree v1.4.2 (http://tree.bio.ed.ac.uk/software/figtree/). The consensus tree of the bootstrap in the ML method was integrated using Phylip software. Tanglegram for two trees was implemented in Dendroscope⁵⁸ using a Neighbor Net-based heuristic, which is one good way to visualize similarities and differences between two phylogenetic trees side by side connected with lines between taxa that correspond to each other.

A statistical model for estimating the historical relationships among populations, using a graph representation that allows both population splits and migration events was conducted using TreeMix v1.12⁵⁹. In this model, by using genome-wide allele frequency data and a Gaussian approximation to genetic drift, the structure of the graph that showing that relationships between sampled populations and their ancestral populations was inferred. Haplotype network was conducted using PopART (Population Analysis with Reticulate Trees, http://popart.otago.ac.nz) according to the groups in an Integer Neighbor-Joining method.

Additional Information

How to cite this article: Tong, W. et al. Genetic variation architecture of mitochondrial genome reveals the differentiation in Korean landrace and weedy rice. Sci. Rep. 7, 43327; doi: 10.1038/srep43327 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Marechal, A. & Brisson, N. Recombination and the maintenance of plant organelle genome stability. New Phytol. 186, 299–317, doi: 10.1111/j.1469-8137.2010.03195.x (2010).
Article CAS PubMed Google Scholar
Tian, X., Zheng, J., Hu, S. & Yu, J. The rice mitochondrial genomes and their variations. Plant Physiol. 140, 401–410 (2006).
Article CAS Google Scholar
Sloan, D. B. et al. Rapid evolution of enormous, multichromosomal genomes in flowering plant mitochondria with exceptionally high mutation rates. PLoS Biol. 10, e1001241, doi: 10.1371/journal.pbio.1001241 (2012).
Article CAS PubMed PubMed Central Google Scholar
Woloszynska, M. Heteroplasmy and stoichiometric complexity of plant mitochondrial genomes–though this be madness, yet there’s method in’t. J Exp Bot. 61, 657–671, doi: 10.1093/jxb/erp361 (2010).
Article CAS PubMed Google Scholar
Ogihara, Y. et al. Structural dynamics of cereal mitochondrial genomes as revealed by complete nucleotide sequencing of the wheat mitochondrial genome. Nucleic Acids Res. 33, 6235–6250 (2005).
Article CAS Google Scholar
Alverson, A. J. et al. Insights into the evolution of mitochondrial genome size from complete sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae). Mol Biol Evol. msq029 (2010).
Drouin, G., Daoud, H. & Xia, J. Relative rates of synonymous substitutions in the mitochondrial, chloroplast and nuclear genomes of seed plants. Mol Phylogenet Evol. 49, 827–831, doi: 10.1016/j.ympev.2008.09.009 (2008).
Article CAS PubMed Google Scholar
Wolfe, K. H., Li, W. H. & Sharp, P. M. Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs. Proc. Natl. Acad. Sci. USA 84, 9054–9058 (1987).
Article CAS ADS Google Scholar
McPherson, H. et al. Capturing chloroplast variation for molecular ecology studies: a simple next generation sequencing approach applied to a rainforest tree. BMC Ecol. 13, 8, doi: 10.1186/1472-6785-13-8 (2013).
Article CAS PubMed PubMed Central Google Scholar
Tong, W. et al. A chloroplast variation map generated using whole genome re‐sequencing of Korean landrace rice reveals phylogenetic relationships among Oryza sativa subspecies. Biol J Linn Soc. (2015).
Wu, J. et al. Sequencing of chloroplast genome using whole cellular DNA and solexa sequencing technology. Front Plant Sci. 3, 243, doi: 10.3389/fpls.2012.00243 (2012).
Article CAS PubMed PubMed Central Google Scholar
Tong, W., Kim, T. S. & Park, Y. J. Rice Chloroplast Genome Variation Architecture and Phylogenetic Dissection in Diverse Oryza Species Assessed by Whole-Genome Resequencing. Rice (N Y). 9, 57, doi: 10.1186/s12284-016-0129-y (2016).
Article Google Scholar
Hiratsuka, J. et al. The complete sequence of the rice (Oryza sativa) chloroplast genome: intermolecular recombination between distinct tRNA genes accounts for a major plastid DNA inversion during the evolution of the cereals. Molecular and General Genetics MGG. 217, 185–194 (1989).
Article CAS Google Scholar
Notsu, Y. et al. The complete sequence of the rice (Oryza sativa L.) mitochondrial genome: frequent DNA sequence acquisition and loss during the evolution of flowering plants. Mol. Genet. Genomics. 268, 434–445, doi: 10.1007/s00438-002-0767-1 (2002).
Article CAS PubMed Google Scholar
Tang, J. et al. A comparison of rice chloroplast genomes. Plant Physiol. 135, 412–420, doi: 10.1104/pp.103.031245 (2004).
Article CAS PubMed PubMed Central Google Scholar
Gray, M. W., Burger, G. & Lang, B. F. Mitochondrial evolution. Science. 283, 1476–1481 (1999).
Article CAS ADS Google Scholar
Nock, C. J. et al. Chloroplast genome sequences from total DNA for plant identification. Plant Biotechnol. J. 9, 328–333, doi: 10.1111/j.1467-7652.2010.00558.x (2011).
Article CAS PubMed Google Scholar
Cheng, C. et al. Polyphyletic origin of cultivated rice: based on the interspersion pattern of SINEs. Mol Biol Evol. 20, 67–75 (2003).
Article CAS Google Scholar
Huang, X. et al. A map of rice genome variation reveals the origin of cultivated rice. Nature. 490, 497–501, doi: 10.1038/nature11532 (2012).
Article CAS PubMed ADS Google Scholar
Khush, G. S. Origin, dispersal, cultivation and variation of rice. Plant Mol Biol. 35, 25–34 (1997).
Article CAS Google Scholar
Oka, H. I. Origin of cultivated rice. (Japan Scientific Societies Press; Elsevier; Exclusive sales rights for the USA and Canada, Elsevier Science Pub. Co., 1988).
Kawakami, S. et al. Genetic variation in the chloroplast genome suggests multiple domestication of cultivated Asian rice (Oryza sativa L.). Genome. 50, 180–187, doi: 10.1139/g06-139 (2007).
Article CAS PubMed Google Scholar
Li, C., Zhou, A. & Sang, T. Rice domestication by reducing shattering. Science. 311, 1936–1939, doi: 10.1126/science.1123604 (2006).
Article CAS PubMed ADS Google Scholar
Molina, J. et al. Molecular evidence for a single evolutionary origin of domesticated rice. Proc Natl Acad Sci USA 108, 8351–8356, doi: 10.1073/pnas.1104686108 (2011).
Article PubMed ADS Google Scholar
Zhang, L. B. et al. Selection on grain shattering genes and rates of rice domestication. New Phytol. 184, 708–720, doi: 10.1111/j.1469-8137.2009.02984.x (2009).
Article CAS PubMed Google Scholar
Qiu, J. et al. Genome re-sequencing suggested a weedy rice origin from domesticated indica-japonica hybridization: a case study from southern China. Planta. 240, 1353–1363, doi: 10.1007/s00425-014-2159-2 (2014).
Article CAS PubMed Google Scholar
Sun, J. et al. Introgression and selection shaping the genome and adaptive loci of weedy rice in northern China. New Phytol. 197, 290–299, doi: 10.1111/nph.12012 (2013).
Article CAS PubMed Google Scholar
De Wet, J. & Harlan, J. R. Weeds and domesticates: evolution in the man-made habitat. Econ. Bot. 29, 99–108 (1975).
Article Google Scholar
Harlan, J. Crops and man, Madison, WI. American Soc. Agronomy. (1992).
Tang, L. & Morishima, H. Genetic characteristics and origin of weedy rice. Paper on origin and dissemination of cultivated rice in China. 1, 211–215 (1996).
Google Scholar
Londo, J. P. & Schaal, B. A. Origins and population genetics of weedy red rice in the USA. Mol Ecol. 16, 4523–4535, doi: 10.1111/j.1365-294X.2007.03489.x (2007).
Article CAS PubMed Google Scholar
Cao, Q. et al. Genetic diversity and origin of weedy rice (Oryza sativa f. spontanea) populations found in north-eastern China revealed by simple sequence repeat (SSR) markers. Ann. Bot. 98, 1241–1252 (2006).
Article CAS Google Scholar
Chung, J. & Park, Y. Population structure analysis reveals the maintenance of isolated sub‐populations of weedy rice. Weed Res. 50, 606–620 (2010).
Article Google Scholar
Suh, H.-S. & Heu, M.-H. Collection and evaluation of Korean red rices I. Regional distribution and seed characteristics. Korean Journal of Crop Science. 37, 425–430 (1992).
Google Scholar
He, Q., Kim, K. W. & Park, Y. J. Population genomics identifies the origin and signatures of selection of Korean weedy rice. Plant Biotechnol. J., doi: 10.1111/pbi.12630 (2016).
Bentolila, S. & Stefanov, S. A reevaluation of rice mitochondrial evolution based on the complete sequence of male-fertile and male-sterile mitochondrial genomes. Plant Physiol. 158, 996–1017, doi: 10.1104/pp.111.190231 (2012).
Article CAS PubMed Google Scholar
Huang, C. Y., Grunheit, N., Ahmadinejad, N., Timmis, J. N. & Martin, W. Mutational decay and age of chloroplast and mitochondrial genomes transferred recently to angiosperm nuclear chromosomes. Plant Physiol. 138, 1723–1733, doi: 10.1104/pp.105.060327 (2005).
Article CAS PubMed PubMed Central Google Scholar
Sun, Q., Wang, K., Yoshimura, A. & Doi, K. Genetic differentiation for nuclear, mitochondrial and chloroplast genomes in common wild rice (Oryza rufipogon Griff.) and cultivated rice (Oryza sativa L.). Theor Appl Genet. 104, 1335–1345, doi: 10.1007/s00122-002-0878-4 (2002).
Article CAS PubMed Google Scholar
Zhang, J. et al. Cytoplasmic-genetic male sterility gene provides direct evidence for some hybrid rice recently evolving into weedy rice. Sci. Rep. 5, 10591, doi: 10.1038/srep10591 (2015).
Article PubMed PubMed Central ADS Google Scholar
Zhang, L., Dai, W., Wu, C., Song, X. & Qiang, S. Genetic diversity and origin of Japonica- and Indica-like rice biotypes of weedy rice in the Guangdong and Liaoning provinces of China. Genet. Resour. Crop Evol. 59, 399–410, doi: 10.1007/s10722-011-9690-9 (2012).
Article CAS Google Scholar
Kim, K. W. et al. PowerCore: a program applying the advanced M strategy with a heuristic search for establishing core sets. Bioinformatics. 23, 2155–2162, doi: 10.1093/bioinformatics/btm313 (2007).
Article CAS PubMed Google Scholar
Zhao, W. et al. Development of an allele-mining set in rice using a heuristic algorithm and SSR genotype data with least redundancy for the post-genomic era. Mol. Breed. 26, 639–651 (2010).
Article Google Scholar
Kim, T.-S. et al. Genome-wide resequencing of KRICE_CORE reveals their potential for future breeding, as well as functional and evolutionary studies in the post-genomic era. BMC Genomics. 17, 1 (2016).
Article CAS Google Scholar
Xu, X. et al. Resequencing 50 accessions of cultivated and wild rice yields markers for identifying agronomically important genes. Nat. Biotechnol. 30, 105–111, doi: 10.1038/nbt.2050 (2012).
Article CAS Google Scholar
Joshi, N. & Fass, J. Sickle: A sliding-window, adaptive, quality-based trimming tool for FastQ files (Version 1.33) [Software]. Available athttps://github.com/najoshi/sickle (2011).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 25, 1754–1760, doi: 10.1093/bioinformatics/btp324 (2009).
Article CAS PubMed PubMed Central Google Scholar
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303, doi: 10.1101/gr.107524.110 (2010).
Article CAS PubMed PubMed Central Google Scholar
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics. 27, 2156–2158, doi: 10.1093/bioinformatics/btr330 (2011).
Article CAS PubMed PubMed Central Google Scholar
Tang, H., Peng, J., Wang, P. & Risch, N. J. Estimation of individual admixture: analytical and study design considerations. Genet. Epidemiol. 28, 289–301, doi: 10.1002/gepi.20064 (2005).
Article PubMed Google Scholar
Bradbury, P. J. et al. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 23, 2633–2635, doi: 10.1093/bioinformatics/btm308 (2007).
Article CAS PubMed Google Scholar
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. jModelTest 2: more models, new heuristics and parallel computing. Nat. Methods. 9, 772, doi: 10.1038/nmeth.2109 (2012).
Article CAS PubMed PubMed Central Google Scholar
Guindon, S. & Gascuel, O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 52, 696–704 (2003).
Article Google Scholar
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321, doi: 10.1093/sysbio/syq010 (2010).
Article CAS PubMed Google Scholar
Felsenstein, J. Phylogenies from molecular sequences: inference and reliability. Annu Rev Genet. 22, 521–565, doi: 10.1146/annurev.ge.22.120188.002513 (1988).
Article CAS PubMed Google Scholar
Akaike, H. A new look at the statistical model identification. Automatic Control, IEEE Transactions on. 19, 716–723 (1974).
Article MathSciNet ADS Google Scholar
Schwarz, G. Estimating the dimension of a model. The annals of statistics. 6, 461–464 (1978).
Article MathSciNet ADS Google Scholar
Ronquist, F. et al. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst. Biol. 61, 539–542, doi: 10.1093/sysbio/sys029 (2012).
Article PubMed PubMed Central Google Scholar
Huson, D. H. & Scornavacca, C. Dendroscope 3: an interactive tool for rooted phylogenetic trees and networks. Syst Biol. 61, 1061–1067, doi: 10.1093/sysbio/sys062 (2012).
Article PubMed Google Scholar
Pickrell, J. K. & Pritchard, J. K. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet. 8, e1002967, doi: 10.1371/journal.pgen.1002967 (2012).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was carried out with the support of “Cooperative Research Program for Agriculture Science & Technology Development (Project No. PJ01116101)” Rural Development Administration, Republic of Korea. This research was supported by Bio-industry Technology Development Program(115078-2), Ministry of Agriculture, Food and Rural Affairs.

Author information

Wei Tong and Qiang He: These authors contributed equally to this work.

Authors and Affiliations

Department of Plant Resources, College of Industrial Science, Kongju National University, Yesan, 32439, Republic of Korea
Wei Tong, Qiang He & Yong-Jin Park
State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, 230036, Peoples’, Republic of China
Wei Tong
Center for crop genetic resource and breeding (CCGRB), Kongju National University, Cheonan, 31080, Republic of Korea
Yong-Jin Park

Authors

Wei Tong
View author publications
You can also search for this author in PubMed Google Scholar
Qiang He
View author publications
You can also search for this author in PubMed Google Scholar
Yong-Jin Park
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.P. led and conceived the idea and designed the experiments. W.T. and Q.H. performed the experiments and analyzed the data. Y.P., W.T. and Q.H., edit and revised the manuscript. All authors have read and approved the manuscript.

Corresponding author

Correspondence to Yong-Jin Park.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information (PDF 669 kb)

Supplementary Table 1 (XLS 344 kb)

Supplementary Table 2 (XLS 39 kb)

Supplementary Table 4 (XLS 43 kb)

Supplementary Table 5 (XLS 62 kb)

Supplementary Table 6 (XLS 63 kb)

Supplementary Table 7 (XLS 63 kb)

Supplementary Table 8 (XLS 79 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Tong, W., He, Q. & Park, YJ. Genetic variation architecture of mitochondrial genome reveals the differentiation in Korean landrace and weedy rice. Sci Rep 7, 43327 (2017). https://doi.org/10.1038/srep43327

Download citation

Received: 27 June 2016
Accepted: 24 January 2017
Published: 03 March 2017
DOI: https://doi.org/10.1038/srep43327
Springer Nature Limited

Genetic variation architecture of mitochondrial genome reveals the differentiation in Korean landrace and weedy rice

Abstract

Similar content being viewed by others

Introduction

Results

Mt variations across the genome

Variation architecture at the mt genome level

Admixed population structure of landrace and weedy rice

Phylogenetic relationships in landrace and weedy rice

Haplotype network and population splits based on the mt genome

Discussion and Conclusions

Methods

Samples and whole-genome resequencing

Data manipulation and variations

Mt genome variations architecture

Population structure analysis

Mitochondria-based phylogenetic, haplotype network and population splits

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation