Abstract
Plants of the genus Taraxacum are distributed worldwide and include both rubber-producing and non-rubber-producing species. However, the genomic basis of natural rubber (NR) biosynthesis still requires further investigation. Here, we present high-quality genome assemblies of the rubber-producing T. kok-saghyz TK1151 and the non-rubber-producing T. mongolicum TM5. Comparative analyses uncovered a large number of genetic variations, including inversions, translocations, and presence/absence variations, as well as considerable protein divergence between the two species. Two gene duplication events were found in the two Taraxacum species: a shared ancestral whole-genome triplication and a subsequent round of gene amplification. In both the TK1151 and TM5 genomes, we identified the genes for each step of the NR biosynthesis pathway and found that the SRPP and CPT gene families have expanded more markedly in TK1151 than in TM5. This study has wide-ranging implications for understanding the mechanism of NR biosynthesis and for the genetic improvement of NR-producing crops.
Data availability
Raw sequencing data for genome assembly and annotation have been deposited in the Genome Sequence Archive (GSA) database of the BIG Data Center under accession numbers PRJCA005187 and PRJCA000437. The final genome assemblies of TK1151 and TM5 have been deposited in the Genome Warehouse (GWH) database of the BIG Data Center under accession numbers GWHBCHF00000000 and GWHBCHG00000000, respectively.
Acknowledgements
This work was supported by the Strategic Priority Research Program of Chinese Academy of Sciences (XDA24030504 and XDA24040305), the National Natural Science Foundation of China (32000144), and the Key Research Program of the Chinese Academy of Sciences (ZDRW-ZS-2018-2). We thank Ms. Fengqin Dong (Institute of Botany, Chinese Academy of Sciences) and Mr. Yanbao Tian (National Centre for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences) for assistance in transmission electron microscopy analysis.
Ethics declarations
Compliance and ethics: The author(s) declare that they have no conflict of interest.
Cite this article
Lin, T., Xu, X., Du, H. et al. Extensive sequence divergence between the reference genomes of Taraxacum kok-saghyz and Taraxacum mongolicum. Sci. China Life Sci. 65, 515–528 (2022). https://doi.org/10.1007/s11427-021-2033-2
DOI: https://doi.org/10.1007/s11427-021-2033-2