Abstract
Plants are the most important sources of food for humans, as well as supplying many ingredients that are of great importance for human health. Developing an understanding of the functional components of plant metabolism has attracted considerable attention. The rapid development of liquid chromatography and gas chromatography, coupled with mass spectrometry, has allowed the detection and characterization of many thousands of metabolites of plant origin. Nowadays, elucidating the detailed biosynthesis and degradation pathways of these metabolites represents a major bottleneck in our understanding. Recently, the decreased cost of genome and transcriptome sequencing rendered it possible to identify the genes involving in metabolic pathways. Here, we review the recent research which integrates metabolomic with different omics methods, to comprehensively identify structural and regulatory genes of the primary and secondary metabolic pathways. Finally, we discuss other novel methods that can accelerate the process of identification of metabolic pathways and, ultimately, identify metabolite function(s).
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Metabolites are fundamental components of the plant cell. These compounds play vital roles not only in plants as energy suppliers, signaling regulators, and enzyme cofactors, but also for human health, given that they supply carbohydrates, fatty acids, proteins, vitamins and minerals, alongside bioactive secondary metabolites (Canarini et al. 2019; Zaynab et al. 2019). Based on former estimates, the plant kingdom produces some 1 million metabolites for different functions (Alseekh and Fernie 2018). Moreover, these metabolites are the most important bio-markers, reflecting the exact physiological status of the plant (Afendi et al. 2013; Fang et al. 2019). Among these metabolites, some of them, such as sugars, organic acids, and amino acids directly participate in plant basic metabolism and maintain fundamental biological processes, including functional macromolecule biosynthesis. Such metabolites are regarded as primary metabolites. Moreover, other metabolites, such as polyphenols, terpenoids, and alkaloids, function as important regulators, not only of different plant growth and development processes but also against the biotic and abiotic stresses and further exhibit high bioactivities, thereby allowing them to augment defense to human inflammation, cardiovascular diseases, and cancer. Such plant metabolites are named as secondary metabolites or specialized metabolites (Tiwari and Rana 2015; Tohge et al. 2014; Verpoorte and Memelink 2002).
Considering the diversity and importance of metabolites, an important issue in metabolomic research is the development of a high-throughput and exact identification and qualification method. Recently, owing to the rapid innovation of liquid/gas chromatography and high-sensitive mass spectrometry, global analyses, conducted across different plant species, have identified many thousands of metabolites (Obata et al. 2020; Peng et al. 2017). To advance knowledge concerning metabolites, improving our understanding of metabolic pathways of biosynthesis and catabolism has recently attracted considerable attention. Indeed, many advances have been achieved by incorporating large-scale genome and transcriptome analyses (Li et al. 2020; Zhu et al. 2022a).
In this review, we summarize the importance of metabolites and the profiling methods to study the different kinds of metabolites. We also stress the importance of research that integrates the combination of metabolomic, genomic and transcriptomic datasets. We then discuss other efficient strategies to better decode the primary and secondary pathways of various crops.
Importance of the plant metabolome and research strategies employed for its evaluation
As mentioned above, metabolites can be divided into two major classes, primary metabolites and secondary metabolites (Wang et al. 2022a); however, the distinction between these is, at times, blurred (Erb and Kliebenstein 2020; Fàbregas and Fernie 2021). Both classes play important roles in plant development, response to stress, and reproduction. However, the profiling methods used to obtain information on their levels are slightly different.
Primary metabolites
As the most well-studied primary metabolites, organic acids and sugars are the key components of the tricarboxylic acid cycle (TCA cycle) and glycolysis pathways (Krebs 1970). These pathways not only provide the main energy source, and some components used for the generation of signaling molecules, but also produce different substrates for the synthesis of other important primary metabolites, such as amino acids and fatty acids (Sweetlove et al. 2010; Zhao et al. 2021).
As an example, exogenously applied sugars can increase the fresh weight, vitamin C, soluble protein, and sugar contents of pea sprouts (Tan et al. 2022). The availability of cellular sugars reflects the plant’s carbon nutrient status, which then induces the functions of nutrition sensing, through such pathways as the hexokinase glucose sensor, the trehalose 6-phosphate signal, and the Target of Rapamycin kinase pathway, to regulate plant growth and development processes (Smeekens et al. 2010).
In addition, 2-oxo-glutarate is an important intermediate in the TCA cycle but can also be converted to glutamate that can then be transferred to other amino acids (García-Gutiérrez et al. 2018; Liepman and Olsen 2003). Moreover, isotope labeling experiments have demonstrated that pyruvate, the key component of glycolysis, is the immediate precursor for the synthesis of alanine (Kennedy and Laetsch 1974). The accumulation of some primary metabolites, such as proline, can significantly enhance plant tolerance against the abiotic stresses resulting from the global climatic change (Ghosh et al. 2022). Besides sugars, organic acids and amino acids, fatty acids, and their derived lipids, are predominant components of both the plasma membrane and photosynthetic membranes (Li et al. 2016). Similarly, fatty acid metabolism also affects both cuticular wax biosynthesis and pollen fertility, which are important processes for plant adaption to a terrestrial environment and propagation (Millar et al. 1999; Wang et al. 2022b; Zhang et al. 2021a).
Secondary metabolites
Based on their molecular structures and biological functions, secondary metabolites can be divided into various classes, such as terpenoids, polyphenols, alkaloids, non-ribosomal polypeptides, and enzyme cofactors (Tohge et al. 2014; Verpoorte and Memelink 2002). Given that their sessile character has considerably limited the communication of plants with other plants and animals, some metabolites of the terpenoids, such as terpenes, are volatile and play important roles in the communication between plants with pollinators, seed dispersers, and signal transduction between different plants (Dudareva et al. 2013). Another example is the representative C40 terpenoids; these carotenoids play vital roles in plant photosynthesis, photoprotection, and development, and are important in the synthesis of pigments associated with fruit appearance quality (Nisar et al. 2015).
Polyphenols are the most studied secondary metabolites, and are derived from phenylalanine, via the shikimate/phenylpropanoid pathway. After condensation with malonyl-CoA, and modifications including methylation and glucuronidation, phenylalanine can be transferred to representative polyphenols. such as quercetin 3-O-rutinoside (rutin) and anthocyanin. Among the polyphenols, flavonoids are the most studied class and generally contain two aromatic rings. The different modifications, on these aromatic rings, led to the super diversity of the flavonoids, which have been estimated to contain over 6000 compounds (Górniak et al. 2019). Recently, scientists reported that, in Arabidopsis flowers, a high accumulation of a class of phenylacylated-flavonols (saiginols) was attributed to protecting the Brassicaceae flower from damage caused by the UV light in sunshine (Tohge et al. 2016) (Fig. 1). As important cereals of the human diet, the three grass crops, maize, rice, and wheat, accumulate some special flavonoids, such as glycosylated flavones. These metabolites also function as UV protectors in these crops (Peng et al. 2017) (Fig. 1).
Besides their importance in food, flavonoids also present in many human beverages, such as tea, and play important roles for human health. For example, as a main bioactive ingredient in green tea, epigallocatechin-3-gallate (EGCG), can repress the infection of influenza virus and other representative viruses, such as HCV and HIV-1, and also affect human-pathogenic yeast strains by affecting the folic acid metabolism of bacteria and fungi (Steinmann et al. 2013). In addition, it is well known that the representative flavonoids, the anthocyanins, play vital roles in human protection against a broad range of diseases (Ciumărnean et al. 2020). To develop tomato fruit with enhanced anthocyanin levels, scientists generated transgenic plants expressing two transcription factors from snapdragon (Del and Ros1), thereby enhancing, threefold, the hydrophilic antioxidant capacity in the fruit; feeding these transgenic tomato fruit to cancer-susceptible mice significantly extend their life span (Butelli et al. 2008).
Alkaloids contain at least one nitrogen atom in a heterocyclic ring and exhibit alkali-like properties. Steroidal glycoalkaloids (SGAs) are special alkaloids accumulating in different plant organs, such as leaves, roots, flowers, fruit and tubers of the Solanaceae family (Alseekh et al. 2020). For example, α-tomatine is the predominantly accumulated SGA in immature tomato fruit and is toxic for insects, fungi, and humans, thereby making it an efficient protection mechanism to reduce unripen fruit loss. Subsequently, during ripening, α-tomatine can be detoxified, via a series of hydroxylation and modification reactions, to produce the human health-promoting chemicals, the esculeosides (Alseekh et al. 2015; Fujiwara et al. 2007; Itkin et al. 2011, 2013; Szymanski et al. 2020) (Fig. 2).
Metabolite evaluation methods
Given the importance of different metabolites, the research community has played great attention to the methods used in their characterization; here, gas chromatography–mass spectrometry (GC–MS) and liquid chromatograph-mass spectrometry (LC–MS) are the widely used methods. As some primary metabolites are small molecular weight components, which can be easily volatilized after derivatization, gas chromatography-mass spectrometry (GC–MS) is widely used to analyze the primary metabolite contents (Obata and Fernie 2012). The extracted and dried metabolite mixture is first derivatized, by methoxyamine-hydrochloride/pyridine and N-methyl-N-(trimethylsilyl) trifluoracet-amide, and then analyzed by GC–MS (Salem et al. 2016). To determine the exact metabolite of each fraction, the small fragments are detected by time-of-flight (TOF)-MS, which can quickly and precisely scan the fragment m/z information. With the help of the established metabolite datasets, such as the Golm metabolic databases (GMD) (Kind et al. 2009; Kopka et al. 2005), efficient annotation and analysis of the relative metabolite content can be analyzed, based on the retention time, mass spectra and peak area information.
The reverse phase column exhibits the highly efficient separation capacity for similar-structured and a broad range of metabolites. Their coupling to LC–MS systems has been employed to identify thousands of fatty acids-derived lipids, carotenoids, polyphenols and alkaloid metabolites (Rupasinghe and Roessner 2018; Salem et al. 2016). The dry-extracted mixture is resuspended, using a suitable solution, such as butanol–methanol mixture or 50% methanol. The resuspended solutions are then directly loaded onto the LC–MS for separation in a liquid phase and electrospray ionization (ESI) prior to detection by a high-sensitive MS, such as an Exactive high-resolution Orbitrap-type MS.
Owing to the variation of LC pump pressure and mass spectra characteristics, it is impractical to build a universal library containing the retention time and mass spectra information for metabolite annotation derived from different labs. In general, based on machine characteristics, an individual-specific LC–MS reference library is established by a lab, through integrating the retention times and mass spectra information for the standard components, useful mass spectra datasets (such as Lipid Maps, http://www.lipidmaps.org/index.html and ReSpect database, http://spectra.psc.riken.jp/menta.cgi/respect/search/fragment) and information in the public domain (Luzarowska et al. 2020; Wang et al. 2019).
In summary, the highly sensitive and accurate metabolite identification method, based on the GC/LC–MS, allows construction of a solid foundation for the study of primary and secondary metabolism, whilst a comprehensive definition of the key genes and regulators of these pathways generally needs more information from both genomic and transcriptomic analyses.
Integration of genome and metabolome to decode genes involved in the primary and secondary metabolism
The one gene–one enzyme hypothesis of George Wells Beadle indicated that the individual gene-encoded enzymes play important roles in the biosynthesis and catabolism of metabolites (Horowitz 2014), whilst not entirely applicable in plants, due to the commonality of multiple isoforms of the same enzyme, there remains some value in the combined comprehensive analysis of the association of genome and metabolome variations, which we anticipate will still accelerate elucidation of the metabolic pathways. During evolution and the domestication of crops, variations in genes that encode for enzymes may change their amino acid sequence and directly affect enzyme activity. Therefore, to decode the important genes involved in primary and secondary metabolism, scientists traditionally carried out cDNA cloning using secondary metabolite-rich tissue or tissues collected from crosses derived from two parents with extreme phenotypes, and then conducted map-based cloning analysis of the biparental populations (Shi et al. 2020; Wang et al. 2008). Based on the analysis of amino acid content and quantitative trait locus (QTL) mapping of 190 recombinant inbred lines (RILs), some 80 individual QTL were identified for content of 19 amino acids, with a relatively strong QTL cluster, comprised of 19 individual QTL, being detected on chromosome 1 (Wang et al. 2008).
Although many QTL have been identified using RILs, such a population, which segregates concurrently for many QTL dispersed throughout the genome, may cause huge variances in the following statistical analysis and, thereby, reduce the effects of one another, to limit the genetic resolution of these quantitative traits (Zamir 2001). To resolve this limitation, introgression lines (ILs), which cover the entire genome and each line segregates for a single region, have been used for QTL mapping. In such analyses, the phenotype variations are associated with the introduced-targeted segment, which assists in focusing on the analysis of QTL and genes located in the introduced-targeted segment, and this can significantly improve the efficiency of QTL identification (Eshed and Zamir 1995; Schauer et al. 2006).
As an example, total soluble content of tomato fruit is a complex phenotype and the analysis, among a population of 76 segmental ILs of wild species Solanum pennellii into the cultivated tomato (S. lycopersicum M82), identified a flower- and fruit-specific invertase (LIN5), located in the QTL-Brix9-2-5 (Fridman et al. 2004). In addition, the same population of ILs was used to identify 338 putative mQTL for flavonoids and steroidal glycoalkaloids in the tomato seed (Alseekh et al. 2020).
The above-mentioned populations were based on the genetic resources from two genotypes; however, during the evolution and domestication of crops, hundreds, or even thousands of genotypes of each crop have been created, which contain more abundant genetic and metabolic variations for identification of metabolism-related genes. In addition, the development of next generation sequence methods reduced the cost and made it practical to carry out genome sequencing of hundreds, or even thousands, of genotypes. Therefore, recently, many association studies have been performed to explore metabolic and natural population genetic variations (Chen et al. 2014). As there are millions of variations, within the genome of a population, this complicates the identification of the exact association between a specific metabolite and a specific genomic region. To overcome this problem, a genome-wide association study (GWAS) can be used to explore the connection between a plant metabolite and genetic variation (Mountjoy et al. 2021; Ozaki et al. 2002).
Rice, as one of the most important crops for human food, supplies not only carbohydrates but also secondary metabolites. Using a metabolic genome-wide association study (mGWAS), based on 840 metabolites and some 6.4 million single-nucleotide polymorphisms (SNPs) for a 529 diverse rice accessions panel, identified 2947 lead SNPs that were associated with 598 metabolites, with five candidate genes being further validated with functions in secondary metabolite pathways (Chen et al. 2014).
In addition to SNPs and small indels, structural variations (SVs), such as a big insertion, deletion, copy number variation and chromosomal change, also play important roles in plant metabolism (Hollox et al. 2022; Voichek and Weigel 2020). However, the short sequence length of the first- and second-generation sequencing method significantly limited the identification of the SVs in a population genome. In recent years, the development of a third-generation sequencing method has significantly increased the sequencing read lengths to over 10,000 bp, and has become the ideal method for genome SV detection (Lee et al. 2016). Based on this method, Alonge et al. (2020) identified more than 200,000 SVs for 100 diverse wild and domesticated tomato accessions and identified several associations between SVs with important fruit quality, such as fruit smoky volatile content and fruit size.
Integration of transcriptome and metabolome to decode genes involved in primary and secondary metabolism
Earlier studies indicated that genomic variation located in a coding region could change the encoded protein amino acid sequence and, thereby, affect protein function. Moreover, other variations located within the promoter or an intergenic region can also significantly affect gene expression and by this manner lead to variation in metabolite abundance across a population (Ye et al. 2017; Zhu et al. 2022a). With the development of low-cost second-generation sequencing methods, a vast torrent of transcriptome and metabolome data have been analyzed to decode the key genes involved in primary and secondary metabolism (Karlova et al. 2011; Zhu et al. 2017, 2020).
As the biosynthesis and catabolism of metabolites results from the ordered sequence of a series of enzymes and their associated regulators, the expression of these genes may exhibit a similar trend with each other and also with the metabolite content (Omranian et al. 2015). Therefore, a comprehensively integrated analysis of metabolome and transcriptome can be carried out to detect the high correlation between an unknown gene expression with the levels of metabolites, and the tightly co-expressed unknown genes with the known function enzyme genes. These correlation analyses can indicate that an unknown gene may be involved in the interesting metabolite pathway, following a method known as the “guilt-by-association” approach (Li et al. 2020; Yonekura-Sakakibara et al. 2008). One example of this approach was the evaluation of the flavonol pathway of wild and cultivated tomato (Tohge et al. 2020). This study increased the number of metabolites recognized in this pathway in tomato from 22 to 44. Similarly, the spatio-temporal metabolome and transcriptome data of 20 major tomato tissues and growth stages were recently integrated to build the MicroTom Metabolic Network and identified several novel transcription factors, such as SlERF.G3-like, SlbHLH114, regulating the biosynthesis of flavonoids and steroidal glycoalkaloids (Li et al. 2020).
Population scale transcriptome data can provide insight into the expression of genes in different genotypes, and can be integrated with genome variations to reflect the effect of genetic variance on a gene expression phenotype; this is the so-called eQTL (Zhu et al. 2018, 2022b). Recently, Wen et al. (2018) analyzed mGWAS and eQTL in four independent tissues derived from different maize accessions. Based on these large-scale data, 36 loci were identified, and four genes were validated to be involved in trehalose, aspartate, and aromatic amino acid pathways. In addition, the mGWAS method was employed to identify CsANR, CsF3′5’H and CsMYB5 as important tea genes involved in biosynthesis of catechins (Zhang et al. 2020) (Fig. 1).
New methods to decode genes involved in primary and secondary metabolism
Recently, with transcriptome and metabolome technology development, single cell and spatial transcriptomics methods have been established with the potential to provide new insights into metabolism, at a more precise level, for both crop and non-crop plant species (Xia et al. 2022; Zhang et al. 2021b, c). Under the regulation of transcriptional, post-transcriptional, or feedback regulation, metabolite accumulation also exhibits different patterns within various cell types within the same tissue. With the developments and advances in MS, optical spectroscopy, and the fluorescence biosensors, it is slowly becoming practical to simultaneously measure hundreds of metabolites in a single cell (Zenobi 2013).
The mass spectrometry imaging methods, such as secondary ion mass spectrometry and matrix-assisted laser desorption/ionization, are widely used technologies to obtain spatially resolved metabolome information for samples (de Souza et al. 2020; Seydel 2021). Recently, in human cells, based on the gas cluster ion beam secondary ion mass spectrometry (GCIB-SIMS) method, Pareek et al. (2020) successfully visualized the in situ three-dimensional sub-micrometer chemical imaging of de novo purine biosynthesis and identified the enzyme interaction structure, of the purinosome, which can channel the pathway and, thereby, increase the pathway flux yielding purine biosynthetic efficiency. Moreover, ginsenosides are the main bioactive metabolites of popular traditional Chinese medicines, Panax ginseng. Based on matrix-assisted laser desorption/ionization time-of-flight mass spectrometry imaging (MALDI-TOF-MSI), Bai et al. (2016) used a novel approach to identify five different localization types of ginsenosides. Given that different ginsenosides have varied pharmacological effects and can reflect the ages of the ginseng root, this method can provides important cues for the component-specific extraction and the illustration of bioactive metabolites biosynthesis regulation of pharmacological research of herbs.
Conclusions
In this review, we have summarized the importance of metabolite functions and profiling methods for metabolites alongside the combined analysis of genome and metabolome, or transcriptome and metabolome, or the combination of all three omics, to decode the key genes and regulators of plant primary and secondary pathways (Fig. 3). Although these studies have demonstrated the power of the integrated omics analysis, these strategies are based on accessibility to a large number of natural populations and the inherent genetic variation. They can, therefore, not be applied to some non-crop plant species, such as certain medicinal plants, or even crop species such as banana for which the level of genetic variance is not available in natural populations to facilitate mGWAS analysis. Moreover, using omics methods remains technically difficult for illustrating the regulatory mechanism of metabolite synthesis within special cell types. However, this limitation is beginning to be addressed by the integration of newly developed technologies, such as spatial- transcriptomics and metabolomics and, given the large interest in this research frontier, it seems appropriate to anticipate that this limitation will soon be addressed. Irrespective of these restrictions, the integration of different multiomics data can remarkably accelerate the process towards a complete understanding of the pathway structure and regulation of primary and secondary metabolism in various species.
Data availability
Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.
References
Afendi FM et al (2013) Data mining methods for omics and knowledge of crude medicinal plants toward big data biology. Comput Struct Biotechnol J 4:e201301010. https://doi.org/10.5936/csbj.201301010
Alonge M et al (2020) Major impacts of widespread structural variation on gene expression and crop improvement in tomato. Cell 182:145-161 e123. https://doi.org/10.1016/j.cell.2020.05.021
Alseekh S, Fernie AR (2018) Metabolomics 20 years on: what have we learned and what hurdles remain? Plant J 94:933–942. https://doi.org/10.1111/tpj.13950
Alseekh S et al (2015) Identification and mode of inheritance of quantitative trait loci for secondary metabolite abundance in tomato. Plant Cell 27:485–512. https://doi.org/10.1105/tpc.114.132266
Alseekh S et al (2020) Quantitative trait loci analysis of seed-specialized metabolites reveals seed-specific flavonols and differential regulation of glycoalkaloid content in tomato. Plant J 103:2007–2024. https://doi.org/10.1111/tpj.14879
Bai H et al (2016) Localization of ginsenosides in Panax ginseng with different age by matrix-assisted laser-desorption/ionization time-of-flight mass spectrometry imaging. J Chromatogr B 1026:263–271. https://doi.org/10.1016/j.jchromb.2015.09.024
Butelli E et al (2008) Enrichment of tomato fruit with health-promoting anthocyanins by expression of select transcription factors. Nat Biotechnol 26:1301–1308. https://doi.org/10.1038/nbt.1506
Canarini A, Kaiser C, Merchant A, Richter A, Wanek W (2019) Root exudation of primary metabolites: mechanisms and their roles in plant responses to environmental stimuli. Front Plant Sci. https://doi.org/10.3389/fpls.2019.00157
Chen W et al (2014) Genome-wide association analyses provide genetic and biochemical insights into natural variation in rice metabolism. Nat Genet 46:714–721. https://doi.org/10.1038/ng.3007
Ciumărnean L et al (2020) The effects of flavonoids in cardiovascular diseases. Molecules. https://doi.org/10.3390/molecules25184320
de Souza LP, Borghi M, Fernie A (2020) Plant single-cell metabolomics—challenges and perspectives. Int J Mol Sci 21. https://doi.org/10.3390/ijms21238987
Dudareva N, Klempien A, Muhlemann JK, Kaplan I (2013) Biosynthesis, function and metabolic engineering of plant volatile organic compounds. New Phytol 198:16–32. https://doi.org/10.1111/nph.12145
Erb M, Kliebenstein DJ (2020) Plant secondary metabolites as defenses, regulators, and primary metabolites: the blurred functional trichotomy. Plant Physiol 184:39–52. https://doi.org/10.1104/pp.20.00433
Eshed Y, Zamir D (1995) An introgression line population of Lycopersicon pennellii in the cultivated tomato enables the identification and fine mapping of yield-associated QTL. Genetics 141:1147–1162. https://doi.org/10.1093/genetics/141.3.1147
Fàbregas N, Fernie AR (2021) The interface of central metabolism with hormone signaling in plants. Current Biology: CB 31:R1535-r1548. https://doi.org/10.1016/j.cub.2021.09.070
Fang C, Fernie AR, Luo J (2019) Exploring the diversity of plant metabolism. Trends Plant Sci 24:83–98. https://doi.org/10.1016/j.tplants.2018.09.006
Fridman E, Carrari F, Liu YS, Fernie AR, Zamir D (2004) Zooming in on a quantitative trait for tomato yield using interspecific introgressions. Science 305:1786–1789. https://doi.org/10.1126/science.1101666
Fujiwara Y et al (2007) Esculeogenin A, a new tomato sapogenol, ameliorates hyperlipidemia and atherosclerosis in ApoE-deficient mice by inhibiting ACAT. Arterioscler Thromb Vasc Biol 27:2400–2406. https://doi.org/10.1161/atvbaha.107.147405
García-Gutiérrez Á, Cánovas FM, Ávila C (2018) Glutamate synthases from conifers: gene structure and phylogenetic studies. BMC Genom 19:65. https://doi.org/10.1186/s12864-018-4454-y
Ghosh UK, Islam MN, Siddiqui MN, Cao X, Khan MAR (2022) Proline, a multifaceted signalling molecule in plant responses to abiotic stress: understanding the physiological mechanisms. Plant Biol 24:227–239. https://doi.org/10.1111/plb.13363
Górniak I, Bartoszewski R, Króliczewski J (2019) Comprehensive review of antimicrobial activities of plant flavonoids. Phytochem Rev 18:241–272. https://doi.org/10.1007/s11101-018-9591-z
Hollox EJ, Zuccherato LW, Tucci S (2022) Genome structural variation in human evolution. Trends Genet 38:45–58. https://doi.org/10.1016/j.tig.2021.06.015
Itkin M et al (2011) GLYCOALKALOID METABOLISM1 is required for steroidal alkaloid glycosylation and prevention of phytotoxicity in tomato. Plant Cell 23:4507–4525. https://doi.org/10.1105/tpc.111.088732
Itkin M et al (2013) Biosynthesis of antinutritional alkaloids in solanaceous crops is mediated by clustered genes. Science 341:175–179. https://doi.org/10.1126/science.1240230
Karlova R et al (2011) Transcriptome and metabolite profiling show that APETALA2a is a major regulator of tomato fruit ripening. Plant Cell 23:923–941. https://doi.org/10.1105/tpc.110.081273
Kennedy RA, Laetsch WM (1974) Formation of 14C-labeled alanine from pyruvate during short term photosynthesis in a C4 plant. Plant Physiol 54:608–611. https://doi.org/10.1104/pp.54.4.608
Kind T, Wohlgemuth G, Lee DY, Lu Y, Palazoglu M, Shahbaz S, Fiehn O (2009) FiehnLib: mass spectral and retention index libraries for metabolomics based on quadrupole and time-of-flight gas chromatography/mass spectrometry. Anal Chem 81:10038–10048. https://doi.org/10.1021/ac9019522
Kopka J et al (2005) GMD@ CSB. DB: the Golm metabolome database. Bioinformatics 21:1635–1638. https://doi.org/10.1093/bioinformatics/bti236
Krebs H (1970) The history of the tricarboxylic acid cycle. Perspect Biol Med 14:154–172. https://doi.org/10.1353/pbm.1970.0001
Lee H et al (2016) Third-generation sequencing and the future of genomics. BioRxiv. https://doi.org/10.1101/048603
Li N, Xu C, Li-Beisson Y, Philippar K (2016) Fatty acid and lipid transport in plant cells. Trends Plant Sci 21:145–158. https://doi.org/10.1016/j.tplants.2015.10.011
Li Y et al (2020) MicroTom metabolic network: rewiring tomato metabolic regulatory network throughout the growth cycle. Mol Plant 13:1203–1218. https://doi.org/10.1016/j.molp.2020.06.005
Li J et al (2022) Biofortified tomatoes provide a new route to vitamin D sufficiency. Nature Plants 8:611–616. https://doi.org/10.1038/s41477-022-01154-6
Liepman AH, Olsen LJ (2003) Alanine aminotransferase homologs catalyze the glutamate:glyoxylate aminotransferase reaction in peroxisomes of arabidopsis. Plant Physiol 131:215–227. https://doi.org/10.1104/pp.011460
Luzarowska U et al (2020) Hello darkness, my old friend: 3-ketoacyl-coenzyme A synthase4 is a branch point in the regulation of triacylglycerol synthesis in arabidopsis by re-channeling fatty acids under carbon starvation. BioRxiv. https://doi.org/10.1101/2020.07.27.223388
Millar AA, Clemens S, Zachgo S, Giblin EM, Taylor DC, Kunst L (1999) CUT1, an arabidopsis gene required for cuticular wax biosynthesis and pollen fertility, encodes a very-long-chain fatty acid condensing enzyme. Plant Cell 11:825–838. https://doi.org/10.1105/tpc.11.5.825
Mountjoy E et al (2021) An open approach to systematically prioritize causal variants and genes at all published human GWAS trait-associated loci. Nat Genet 53:1527–1533. https://doi.org/10.1038/s41588-021-00945-5
Nisar N, Li L, Lu S, Khin Nay C, Pogson Barry J (2015) Carotenoid metabolism in Plants. Mol Plant 8:68–82. https://doi.org/10.1016/j.molp.2014.12.007
Obata T, Fernie AR (2012) The use of metabolomics to dissect plant responses to abiotic stresses. Cell Mol Life Sci 69:3225–3243. https://doi.org/10.1007/s00018-012-1091-5
Obata T et al (2020) Metabolic profiles of six African cultivars of cassava (Manihot esculenta Crantz) highlight bottlenecks of root yield. Plant J 102:1202–1219. https://doi.org/10.1111/tpj.14693
Omranian N et al (2015) Differential metabolic and coexpression networks of plant metabolism. Trends Plant Sci 20:266–268. https://doi.org/10.1016/j.tplants.2015.02.002
Ozaki K et al (2002) Functional SNPs in the lymphotoxin-α gene that are associated with susceptibility to myocardial infarction. Nat Genet 32:650–654. https://doi.org/10.1038/ng1047
Pareek V, Tian H, Winograd N, Benkovic SJ (2020) Metabolomics and mass spectrometry imaging reveal channeled de novo purine synthesis in cells. Science 368:283–290. https://doi.org/10.1126/science.aaz6465
Peng M et al (2017) Differentially evolved glucosyltransferases determine natural variation of rice flavone accumulation and UV-tolerance. Nat Commun 8:1–12. https://doi.org/10.1038/s41467-017-02168-x
Rupasinghe TW, Roessner U (2018) Extraction of plant lipids for LC-MS-based untargeted plant lipidomics. Plant Metabolomics. Springer, Berlin, pp 125–135. https://doi.org/10.1007/978-1-4939-7819-9_9
Salem MA, Jüppner J, Bajdzienko K, Giavalisco P (2016) Protocol: a fast, comprehensive and reproducible one-step extraction method for the rapid preparation of polar and semi-polar metabolites, lipids, proteins, starch and cell wall polymers from a single sample. Plant Methods 12:1–15. https://doi.org/10.1186/s13007-016-0146-2
Schauer N et al (2006) Comprehensive metabolic profiling and phenotyping of interspecific introgression lines for tomato improvement. Nat Biotechnol 24:447–454. https://doi.org/10.1038/nbt1192
Seydel C (2021) Single-cell metabolomics hits its stride. Nat Methods 18:1452–1456. https://doi.org/10.1038/s41592-021-01333-x
Shi T et al (2020) Metabolomics analysis and metabolite-agronomic trait associations using kernels of wheat (Triticum aestivum) recombinant inbred lines. Plant J 103:279–292. https://doi.org/10.1111/tpj.14727
Smeekens S, Ma J, Hanson J, Rolland F (2010) Sugar signals and molecular networks controlling plant growth. Curr Opin Plant Biol 13:273–278. https://doi.org/10.1016/j.pbi.2009.12.002
Steinmann J, Buer J, Pietschmann T, Steinmann E (2013) Anti-infective properties of epigallocatechin-3-gallate (EGCG), a component of green tea. Br J Pharmacol 168:1059–1073. https://doi.org/10.1111/bph.12009
Sweetlove LJ, Beard KFM, Nunes-Nesi A, Fernie AR, Ratcliffe RG (2010) Not just a circle: flux modes in the plant TCA cycle. Trends Plant Sci 15:462–470. https://doi.org/10.1016/j.tplants.2010.05.006
Szymanski J et al (2020) Analysis of wild tomato introgression lines elucidates the genetic basis of transcriptome and metabolome variation underlying fruit traits and pathogen response. Nat Genet 52:1111–1121. https://doi.org/10.1038/s41588-020-0690-6
Szymański J et al (2020) Analysis of wild tomato introgression lines elucidates the genetic basis of transcriptome and metabolome variation underlying fruit traits and pathogen response. Nat Genet 52:1111–1121. https://doi.org/10.1038/s41588-020-0690-6
Tan C, Zhang L, Duan X, Chai X, Huang R, Kang Y, Yang X (2022) Effects of exogenous sucrose and selenium on plant growth, quality, and sugar metabolism of pea sprouts. J Sci Food Agr 102:2855–2863. https://doi.org/10.1002/jsfa.11626
Tiwari R, Rana C (2015) Plant secondary metabolites: a review. Int J Eng Res Gen Sci 3:661–670
Tohge T, Alseekh S, Fernie AR (2014) On the regulation and function of secondary metabolism during fruit development and ripening. J Exp Bot 65:4599–4611. https://doi.org/10.1093/jxb/ert443
Tohge T et al (2016) Characterization of a recently evolved flavonol-phenylacyltransferase gene provides signatures of natural light selection in Brassicaceae. Nat Commun 7:12399. https://doi.org/10.1038/ncomms12399
Tohge T et al (2020) Exploiting natural variation in tomato to define pathway structure and metabolic regulation of fruit polyphenolics in the lycopersicum complex. Mol Plant 13:1027–1046. https://doi.org/10.1016/j.molp.2020.04.004
Verpoorte R, Memelink J (2002) Engineering secondary metabolite production in plants. Curr Opin Biotechnol 13:181–187. https://doi.org/10.1016/S0958-1669(02)00308-7
Voichek Y, Weigel D (2020) Identifying genetic variants underlying phenotypic variation in plants without complete genomes. Nat Genet 52:534–540. https://doi.org/10.1038/s41588-020-0612-7
Wang L et al (2008) The QTL controlling amino acid content in grains of rice (Oryza sativa) are co-localized with the regions involved in the amino acid metabolism pathway. Mol Breeding 21:127–137. https://doi.org/10.1007/s11032-007-9141-7
Wang S, Alseekh S, Fernie AR, Luo J (2019) The structure and function of major plant metabolite modifications. Mol Plant 12:899–919. https://doi.org/10.1016/j.molp.2019.06.001
Wang S, Li Y, He L, Yang J, Fernie AR, Luo J (2022a) Natural variance at the interface of plant primary and specialized metabolism. Curr Opin Plant Biol 67:102201. https://doi.org/10.1016/j.pbi.2022.102201
Wang Y et al (2022b) Function and transcriptional regulation of CsKCS20 in the elongation of very-long-chain fatty acids and wax biosynthesis in Citrus sinensis flavedo. Hort Res 9:uhab027. https://doi.org/10.1093/hr/uhab027
Wen W et al (2018) An integrated multi-layered analysis of the metabolic networks of different tissues uncovers key genetic components of primary metabolism in maize. Plant J 93:1116–1128. https://doi.org/10.1111/tpj.13835
Yonekura-Sakakibara K et al (2008) Comprehensive flavonol profiling and transcriptome coexpression analysis leading to decoding gene–metabolite correlations in Arabidopsis. Plant Cell 20:2160–2176. https://doi.org/10.1105/tpc.108.058040
Xia K et al (2022) The single-cell stereo-seq reveals region-specific cell subtypes and transcriptome profiling in Arabidopsis leaves. Dev Cell 57:1299–1310.e1294. https://doi.org/10.1016/j.devcel.2022.04.011
Ye J et al (2017) An InDel in the promoter of Al-ACTIVATED MALATE TRANSPORTER9 selected during tomato domestication determines fruit malate contents and aluminum tolerance. Plant Cell 29:2249–2268. https://doi.org/10.1105/tpc.17.00211
Zamir D (2001) Improving plant breeding with exotic genetic libraries. Nat Rev Genet 2:983–989. https://doi.org/10.1038/35103590
Zaynab M, Fatima M, Sharif Y, Zafar MH, Ali H, Khan KA (2019) Role of primary metabolites in plant defense against pathogens. Microbial Pathog 137:103728. https://doi.org/10.1016/j.micpath.2019.103728
Zenobi R (2013) Single-cell metabolomics: analytical and biological perspectives. Science 342:1243259. https://doi.org/10.1126/science.1243259
Zhang W et al (2020) Genome assembly of wild tea tree DASZ reveals pedigree and selection history of tea varieties. Nat Commun 11:3719. https://doi.org/10.1038/s41467-020-17498-6
Zhang S et al (2021a) ZmMs25 encoding a plastid-localized fatty acyl reductase is critical for anther and pollen development in maize. J Exp Bot 72:4298–4318. https://doi.org/10.1093/jxb/erab142
Zhang T-Q, Chen Y, Liu Y, Lin W-H, Wang J-W (2021b) Single-cell transcriptome atlas and chromatin accessibility landscape reveal differentiation trajectories in the rice root. Nat Commun 12:2053. https://doi.org/10.1038/s41467-021-22352-4
Zhang T-Q, Chen Y, Wang J-W (2021c) A single-cell analysis of the arabidopsis vegetative shoot apex. Dev Cell 56:1056–1074.e1058. https://doi.org/10.1016/j.devcel.2021.02.021
Zhao H, Ni S, Cai S, Zhang G (2021) Comprehensive dissection of primary metabolites in response to diverse abiotic stress in barley at seedling stage. Plant Physiol Biochem 161:54–64. https://doi.org/10.1016/j.plaphy.2021.01.048
Zhu F et al (2017) An R2R3-MYB transcription factor represses the transformation of α- and β-branch carotenoids by negatively regulating expression of CrBCH2 and CrNCED5 in flavedo of Citrus reticulate. New Phytol 216:178–192. https://doi.org/10.1111/nph.14684
Zhu G et al (2018) Rewiring of the fruit metabolome in tomato breeding. Cell 172:249-261.e212. https://doi.org/10.1016/j.cell.2017.12.019
Zhu F et al (2020) A NAC transcription factor and its interaction protein hinder abscisic acid biosynthesis by synergistically repressing NCED5 in Citrus reticulata. J Exp Bot 71:3613–3625. https://doi.org/10.1093/jxb/eraa118
Zhu F et al (2022a) Genome-wide association of the metabolic shifts underpinning dark-induced senescence in Arabidopsis. Plant Cell 34:557–578. https://doi.org/10.1093/plcell/koab251
Zhu F et al (2022b) A comparative transcriptomics and eQTL approach identifies SlWD40 as a tomato fruit ripening regulator. Plant Physiol. https://doi.org/10.1093/plphys/kiac200
Acknowledgements
Feng Zhu and Yunjiang Cheng were supported by the Major Special Projects and Key R&D Projects in Yunnan Province (202102AE090020 and 202102AE090054) and Hubei Hongshan Laboratory. Saleh Alseekh and Alisdair R. Fernie acknowledge funding of the PlantaSYST project by the European Union’s Horizon 2020 Research and Innovation Programme (SGA‐CSA no. 664621 and no. 739582 under FPA no. 664620). Moreover, we apologize to researchers whose works are not properly reviewed here because of space limitation.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
FZ and ARF: wrote the paper. SA: drew the figures. WW and YC revised the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhu, F., Wen, W., Cheng, Y. et al. Integrating multiomics data accelerates elucidation of plant primary and secondary metabolic pathways. aBIOTECH 4, 47–56 (2023). https://doi.org/10.1007/s42994-022-00091-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42994-022-00091-4