Abstract
Key message
Long intergenic non-coding RNA (lincRNA), cis-acting expression quantitative trait locus (cis-eQTL), maize, regulatory evolution.
Abstract
The law of genetic variation during domestication explains the evolutionary mechanism and provides a theoretical basis for improving existing varieties of maize. Previous studies focused on exploiting regulatory variations controlling the expression of protein-coding genes rather than of non-protein-coding genes. Here, we examined the genetic and evolutionary features of long non-coding RNAs from intergenic regions (long intergenic non-coding RNAs, lincRNAs) using population-scale transcriptome data and identified 1168 lincRNAs with cis-acting expression quantitative trait loci (cis-eQTLs). We found that lincRNAs are more likely to be regulated by cis-eQTLs, which exert stronger effects than the protein-coding genes. During maize domestication and improvement, upregulated alleles of lincRNAs, which originated from both standing variation and new mutation, accumulate more frequently and show larger effect sizes than the coding genes. A stronger signature of genetic differentiation was observed in their regulatory regions compared to those of randomly sampled lincRNAs. In addition, we found that cis-regulatory differentiation of lincRNAs is related to the sequence conservation of lincRNA transcripts. Non-conserved lincRNAs more tend to gain upregulated alleles and show a stronger relationship with selected traits than conserved lincRNAs between maize and its wild relatives. Our findings in maize improve the understanding of cis-regulatory variation in lincRNA genes during domestication and improvement and provide an effective approach for prioritizing candidates for further investigation.
Similar content being viewed by others
Data availability
All data generated or analyzed during this study are included in this published article and its supplementary information files.
Change history
23 March 2023
A Correction to this paper has been published: https://doi.org/10.1007/s00122-023-04323-z
References
Akey JM (2009) Constructing genomic maps of positive selection in humans: where do we go from here? Genome Res 19:711–722. https://doi.org/10.1101/gr.086652.108
Anders S, Pyl PT, Huber W (2015) HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31:166–169. https://doi.org/10.1093/bioinformatics/btu638
Bailey TL, Grant CE (2021) SEA: Simple enrichment analysis of motifs. bioRxivdoi: https://doi.org/10.1101/2021.08.23.457422
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc: Ser B (methodol) 57:289–300. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. https://doi.org/10.1093/bioinformatics/btu170
Browning BL, Browning SR (2016) Genotype imputation with millions of reference samples. Am J Hum Genet 98:116–126. https://doi.org/10.1016/j.ajhg.2015.11.020
Bukowski R, Guo X, Lu Y et al (2018) Construction of the third-generation Zea mays haplotype map. GigaScience 7:1–12. https://doi.org/10.1093/gigascience/gix134
Chen Q, Li W, Tan L et al (2021a) Harnessing knowledge from maize and rice domestication for new crop breeding. Mol Plant 14:9–26. https://doi.org/10.1016/j.molp.2020.12.006
Chen Q, Liu K, Yu R et al (2021b) From dark matter to star insight into the regulation mechanisms of plant functional long non-coding RNAs. Front Plant Sci 12:650926. https://doi.org/10.3389/fpls.2021.650926
Compton SA, Tolun G, Kamath-Loeb AS et al (2008) The werner syndrome protein binds replication fork and holliday junction DNAs as an oligomer. J Biol Chem 283:24478–24483. https://doi.org/10.1074/jbc.M803370200
Cubillos FA, Coustham V, Loudet O (2012) Lessons from eQTL mapping studies: non-coding regions and their role behind natural phenotypic variation in plants. Curr Opin Plant Biol 15:192–198. https://doi.org/10.1016/j.pbi.2012.01.005
Díaz-Valenzuela E, Sawers RH, Cibrián-Jaramillo A (2020) Cis- and trans-regulatory variations in the domestication of the chili pepper fruit. Mol Biol Evol 37:1593–1603. https://doi.org/10.1093/molbev/msaa027
Endelman JB (2011) Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome 4:250–255. https://doi.org/10.3835/plantgenome2011.08.0024
Flint-Garcia SA, Bodnar AL, Scott MP (2009) Wide variability in kernel composition, seed characteristics, and zein profiles among diverse maize inbreds, landraces, and teosinte. Theor Appl Genet 119:1129–1142. https://doi.org/10.1007/s00122-009-1115-1
Fu J, Cheng Y, Linghu J et al (2013) RNA sequencing reveals the complex regulatory network in the maize kernel. Nat Commun 4:2832. https://doi.org/10.1038/ncomms3832
Fu J, Leng P, Wang G et al (2022) The promise of eQTL studies in dissecting crop genetic basis and evolution. Annu Plant Rev Online 5:1–32. https://doi.org/10.1002/9781119312994.apr0775
Goede OMd, Nachun DC, Ferraro NM et al (2021) Population-scale tissue transcriptomics maps long non-coding RNAs to complex disease. Cell 184:2633-2648.e2619. https://doi.org/10.1016/j.cell.2021.03.050
Golicz AA, Bhalla PL, Singh MB (2018a) lncRNAs in plant and animal sexual reproduction. Trends Plant Sci 23:195–205. https://doi.org/10.1016/j.tplants.2017.12.009
Golicz AA, Singh MB, Bhalla PL (2018b) The long intergenic noncoding RNA (lincRNA) landscape of the soybean genome. Plant Physiol 176:2133–2147. https://doi.org/10.1104/pp.17.01657
Guo G, Liu X, Sun F et al (2018) Wheat miR9678 affects seed germination by generating phased siRNAs and modulating abscisic acid/gibberellin signaling. Plant Cell 30:796–814. https://doi.org/10.1105/tpc.17.00842
Han L, Mu Z, Luo Z et al (2019) New lncRNA annotation reveals extensive functional divergence of the transcriptome in maize. J Integr Plant Biol 61:394–405. https://doi.org/10.1111/jipb.12708
Huang C, Sun H, Xu D et al (2018) ZmCCT9 enhances maize adaptation to higher latitudes. Proc Natl Acad Sci USA 115:E334–E341. https://doi.org/10.1073/pnas.1718058115
Hufford MB, Xu X, van Heerwaarden J et al (2012) Comparative population genomics of maize domestication and improvement. Nat Genet 44:808–811. https://doi.org/10.1038/ng.2309
Ihaka R, Gentleman R (1996) R: a language for data analysis and graphics. J Comput Graph Stat 5:299–314. https://doi.org/10.1080/10618600.1996.10474713
Jin J, Tian F, Yang D-C et al (2017) PlantTFDB 4.0: toward a central hub for transcription factors and regulatory interactions in plants. Nucleic Acids Res 45:D1040–D1045. https://doi.org/10.1093/nar/gkw982
Ke L, Zhou Z, Xu X-W et al (2019) Evolutionary dynamics of lincRNA transcription in nine citrus species. Plant J 98:912–927. https://doi.org/10.1111/tpj.14279
Kim D, Langmead B, Salzberg SL (2015) HISAT: a fast spliced aligner with low memory requirements. Nat Methods 12:357–360. https://doi.org/10.1038/nmeth.3317
Kim-Hellmuth S, Bechheim M, Pütz B et al (2017) Genetic regulatory effects modified by immune activation contribute to autoimmune disease associations. Nat Commun 8:266. https://doi.org/10.1038/s41467-017-00366-1
Kindgren P, Ard R, Ivanov M et al (2018) Transcriptional read-through of the long non-coding RNA SVALKA governs plant cold acclimation. Nat Commun 9:4561. https://doi.org/10.1038/s41467-018-07010-6
Koenig D, Jiménez-Gómez JM, Kimura S et al (2013) Comparative transcriptomics reveals patterns of selection in domesticated and wild tomato. Proc Natl Acad Sci USA 110:E2655–E2662. https://doi.org/10.1073/pnas.1309606110
Kutter C, Watt S, Stefflova K et al (2012) Rapid turnover of long noncoding RNAs and the evolution of gene expression. PLoS Genet 8:e1002841. https://doi.org/10.1371/journal.pgen.1002841
Lemmon ZH, Bukowski R, Sun Q et al (2014) The role of cis regulatory evolution in maize domestication. PLoS Genet 10:e1004745. https://doi.org/10.1371/journal.pgen.1004745
Li H, Handsaker B, Wysoker A et al (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079. https://doi.org/10.1093/bioinformatics/btp352
Li Q, Yang X, Xu S et al (2012) Genome-wide association studies identified three independent polymorphisms associated with α-tocopherol content in maize kernels. PLoS ONE 7:e36807. https://doi.org/10.1371/journal.pone.0036807
Li H, Peng Z, Yang X et al (2013) Genome-wide association study dissects the genetic architecture of oil biosynthesis in maize kernels. Nat Genet 45:43. https://doi.org/10.1038/ng.2484
Li C, Guan H, Jing X et al (2022) Genomic insights into historical improvement of heterotic groups during modern hybrid maize breeding. Nature Plants 8:750–763. https://doi.org/10.1038/s41477-022-01190-2
Liang Y, Liu H-J, Yan J et al (2021) Natural variation in crops: realized understanding, continuing promise. Annu Rev Plant Biol 72:357–385. https://doi.org/10.1146/annurev-arplant-080720-090632
Liu J, Jung C, Xu J et al (2012) Genome-wide analysis uncovers regulation of long intergenic noncoding RNAs in Arabidopsis. Plant Cell 24:4333–4345. https://doi.org/10.1105/tpc.112.102855
Liu S, Gao Y, Canela-Xandri O et al (2022) A multi-tissue atlas of regulatory variants in cattle. Nat Genet. https://doi.org/10.1038/s41588-022-01153-5
Livak KJ, Schmittgen TD (2001) Analysis of relative gene expression data using real-time quantitative PCR and the 2−ΔΔCT method. Methods 25:402–408. https://doi.org/10.1006/meth.2001.1262
Love MI, Huber W, Anders S (2014) Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15:550. https://doi.org/10.1186/s13059-014-0550-8
Managadze D, Rogozin IB, Chernikova D et al (2011) Negative correlation between expression level and evolutionary rate of long intergenic noncoding RNAs. Genome Biol Evol 3:1390–1404. https://doi.org/10.1093/gbe/evr116
McKenna A, Hanna M, Banks E et al (2010) The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20:1297–1303. https://doi.org/10.1101/gr.107524.110
Melé M, Ferreira PG, Reverter F et al (2015) The human transcriptome across tissues and individuals. Science 348:660–665. https://doi.org/10.1126/science.aaa0355
Melé M, Mattioli K, Mallard W et al (2017) Chromatin environment, transcriptional regulation, and splicing distinguish lincRNAs and mRNAs. Genome Res 27:27–37. https://doi.org/10.1101/gr.214205.116
Nédélec Y, Sanz J, Baharian G et al (2016) Genetic ancestry and natural selection drive population differences in immune responses to pathogens. Cell 167:657-669.e621. https://doi.org/10.1016/j.cell.2016.09.025
Ongen H, Buil A, Brown AA et al (2016) Fast and efficient QTL mapper for thousands of molecular phenotypes. Bioinformatics 32:1479–1485. https://doi.org/10.1093/bioinformatics/btv722
Palazzo AF, Koonin EV (2020) Functional long non-coding RNAs evolve from junk transcripts. Cell 183:1151–1161. https://doi.org/10.1016/j.cell.2020.09.047
Pang J, Fu J, Zong N et al (2019) Kernel size-related genes revealed by an integrated eQTL analysis during early maize kernel development. Plant J 98:19–32. https://doi.org/10.1111/tpj.14193
Pickrell JK, Marioni JC, Pai AA et al (2010) Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature 464:768–772. https://doi.org/10.1038/nature08872
Ponjavic J, Ponting CP, Lunter G (2007) Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs. Genome Res 17:556–565. https://doi.org/10.1101/gr.6036807
Popadin K, Gutierrez-Arcelus M, Dermitzakis ET et al (2013) Genetic and epigenetic regulation of human lincRNA gene expression. Am J Hum Genet 93:1015–1026. https://doi.org/10.1016/j.ajhg.2013.10.022
Qi S, Akter S, Li S (2019) Identification of novel lincRNA and co-Expression network analysis using RNA-sequencing data in plants. In: Chekanova JA, Wang HLV (eds) Plant long non-coding RNAs: methods and protocols. Springer, New York, pp 207–221
Salvi S, Sponza G, Morgante M et al (2007) Conserved noncoding genomic sequences associated with a flowering-time quantitative trait locus in maize. Proc Natl Acad Sci USA 104:11376–11381. https://doi.org/10.1073/pnas.0704145104
Seymour DK, Chae E, Grimm DG et al (2016) Genetic architecture of nonadditive inheritance in Arabidopsis thaliana hybrids. Proc Natl Acad Sci USA 113:E7317–E7326. https://doi.org/10.1073/pnas.1615268113
Statello L (2021) Gene regulation by long non-coding RNAs and its biological functions. Nat Rev Mol Cell Biol 22:96. https://doi.org/10.1038/s41580-020-00315-9
Studer A, Zhao Q, Ross-Ibarra J et al (2011) Identification of a functional transposon insertion in the maize domestication gene tb1. Nat Genet 43:1160–1163. https://doi.org/10.1038/ng.942
Swanson-Wagner R, Briskine R, Schaefer R et al (2012) Reshaping of the maize transcriptome by domestication. Proc Natl Acad Sci USA 109:11878–11883. https://doi.org/10.1073/pnas.1201961109
Swinnen G, Goossens A, Pauwels L (2016) Lessons from domestication: targeting cis-regulatory elements for crop improvement. Trends Plant Sci 21:506–515. https://doi.org/10.1016/j.tplants.2016.01.014
Ulitsky I (2016) Evolution to the rescue: using comparative genomics to understand long non-coding RNAs. Nat Rev Genet 17:601–614. https://doi.org/10.1038/nrg.2016.85
Umans BD, Battle A, Gilad Y (2021) Where are the disease-associated eQTLs? Trends Genet 37:109–124. https://doi.org/10.1016/j.tig.2020.08.009
Van der Auwera GA, Carneiro MO, Hartl C et al (2013) From FastQ data to high-confidence variant calls: the genome analysis toolkit best practices pipeline. Curr Protoc Bioinform. https://doi.org/10.1002/0471250953.bi1110s43
Wang H, Niu Q-W, Wu H-W et al (2015) Analysis of non-coding transcriptome in rice and maize uncovers roles of conserved lncRNAs associated with agriculture traits. Plant J 84:404–416. https://doi.org/10.1111/tpj.13018
Wang X, Wang H, Liu S et al (2016) Genetic variation in ZmVPP1 contributes to drought tolerance in maize seedlings. Nat Genet 48:1233. https://doi.org/10.1038/ng.3636
Wang X, Chen Q, Wu Y et al (2018) Genome-wide analysis of transcriptional variability in a large maize-teosinte population. Mol Plant 11:443–459. https://doi.org/10.1016/j.molp.2017.12.011
Yang X, Gao S, Xu S et al (2011) Characterization of a global germplasm collection and its potential utilization for analysis of complex quantitative traits in maize. Mol Breed 28:511–526. https://doi.org/10.1007/s11032-010-9500-7
Yang N, Lu Y, Yang X et al (2014) Genome wide association studies using a new nonparametric model reveal the genetic architecture of 17 agronomic traits in an enlarged maize association panel. PLoS Genet 10:e1004573. https://doi.org/10.1371/journal.pgen.1004573
Yang N, Liu J, Gao Q et al (2019) Genome assembly of a tropical maize inbred line provides insights into structural variation and crop improvement. Nat Genet 51:1052–1059. https://doi.org/10.1038/s41588-019-0427-6
Zhao H, Sun Z, Wang J et al (2014) CrossMap: a versatile tool for coordinate conversion between genome assemblies. Bioinformatics 30:1006–1007. https://doi.org/10.1093/bioinformatics/btt730
Zheng XM, Chen J, Pang HB et al (2019) Genome-wide analyses reveal the role of noncoding variation in complex traits during rice domestication. Sci Adv 5:eaax3619. https://doi.org/10.1126/sciadv.aax3619
Zhou X, Stephens M (2012) Genome-wide efficient mixed-model analysis for association studies. Nat Genet 44:821–824. https://doi.org/10.1038/ng.2310
Zhou Y-F, Zhang Y-C, Sun Y-M et al (2021) The parent-of-origin lncRNA MISSEN regulates rice endosperm development. Nat Commun 12:6525. https://doi.org/10.1038/s41467-021-26795-7
Acknowledgements
This work was supported by the National Natural Science Foundation of China (31571683), the Modern Agro-Industry Technology Research System of Maize (CARS-02-13), the Yazhou Bay Seed Lab (JBGS+B21HJ0221), and the Agricultural Science and Technology Innovation Program of CAAS. We would like to thank Lingjie Yin (ICS-CAAS bioinformatics group) for providing computing support. We thank Editage (www.editage.cn) for English language editing.
Funding
This work was supported by the National Natural Science Foundation of China (31571683), the Modern Agro-Industry Technology Research System of Maize (CARS-02–13), and the Agricultural Science and Technology Innovation Program of CAAS.
Author information
Authors and Affiliations
Contributions
Conceptualization was done by JF, JL, and Y-XL; methodology was done by JF and XZ; formal analysis was done by J-WL, S-HZ, JZ, CH, and XW; validation was done by J-WL, YC, ZW, and SZ; investigation was done by J-WL, JF, XZ, and RG; resources were done by LL, GW, and JW; writing—original draft preparation were done by J-WL; writing—review and editing were done by JF, XZ, YX, and RG; visualization was done by J-WL and S-HZ; project administration was done by JF; funding acquisition was done by JF. All authors have read and agreed to the published version of the manuscript.
Corresponding authors
Ethics declarations
Conflict of interest
The authors have no relevant financial or non-financial interests to disclose.
Additional information
Communicated by Thomas Lubberstedt.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Lu, J., Zhen, S., Zhang, J. et al. Combined population transcriptomic and genomic analysis reveals cis-regulatory differentiation of non-coding RNAs in maize. Theor Appl Genet 136, 16 (2023). https://doi.org/10.1007/s00122-023-04293-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00122-023-04293-2