Strand-specific RNA-seq based identification and functional prediction of drought-responsive lncRNAs in cassava
Long noncoding RNAs (lncRNAs) have emerged as crucial regulators of abiotic stress responses. However, the mechanisms by which lncRNAs contribute to drought tolerance remain largely unknown in cassava, an important tropical and sub-tropical root crop with remarkable drought tolerance.
In this study, a total of 833 high-confidence lncRNAs, including 652 intergenic and 181 anti-sense lncRNAs, were identified in cassava leaves and root using strand-specific RNA-seq technology, of which 124 were drought-responsive. A trans-regulatory co-expression network revealed that lncRNAs exhibited tissue-specific expression patterns and were associated with different functions in distinct tissues: e.g., cell-related metabolism, cell wall, and RNA regulation of transcription in folded leaf (FL); degradation of major carbohydrate (CHO) metabolism, Calvin cycle and light reaction, light signaling, and tetrapyrrole synthesis in full expanded leaf (FEL); synthesis of major CHO metabolism, nitrogen metabolism, photosynthesis, and redox in bottom leaf (BL); and hormone metabolism, secondary metabolism, calcium signaling, and abiotic stress in root (RT). In addition, 27 lncRNA-mRNA pairs potentially involved in cis-acting regulation were identified, and these lncRNAs were predicted to regulate the expression of their neighboring genes mainly through hormone metabolism, RNA regulation of transcription, and receptor kinase signaling. Furthermore, 11 lncRNAs were identified as putative target mimics of known cassava miRNAs. Finally, five drought-responsive lncRNAs and 13 co-expressed genes involved in trans-acting, cis-acting, or target mimic regulation were selected and confirmed by qRT-PCR.
These findings provide a comprehensive view of cassava lncRNAs in response to drought stress, which will enable in-depth functional analysis in the future.
Keywords: Cassava, lncRNA, PEG treatment, Tissue-specific expression, ssRNA-Seq
Abbreviations
CPAT: Coding-Potential Assessment Tool
CPC: Coding Potential Calculator
FEL: Full expanded leaf
FPKM: Fragments per kilobase per million mapped reads
HMMs: Hidden Markov models
lincRNAs: Long intergenic noncoding RNAs
lncNATs: Long noncoding natural antisense transcripts
lncRNAs: Long noncoding RNAs
Pfam: The protein families database
Long noncoding RNAs (lncRNAs) are usually defined as non-protein-coding transcripts > 200 bp in length. According to their genomic origins and their locations relative to nearby protein-coding genes, lncRNAs are classified into long intergenic noncoding RNAs (lincRNAs), long intronic noncoding RNAs, and long noncoding natural antisense transcripts (lncNATs). lncRNAs were previously regarded as transcriptional noise because of their low expression levels, but emerging evidence has demonstrated that they play crucial roles in many plant developmental processes, including vernalization, reproduction, and photomorphogenesis [2, 3, 4]. In particular, lncRNAs are now considered important regulatory components in the response to abiotic stresses. For example, the Arabidopsis thaliana lncRNA DRIR, a novel positive regulator of the plant response to drought and salt stress, is involved in ABA signaling, water transport, and other stress-relief processes, and plants over-expressing npc536 displayed enhanced root growth under salt stress compared with wild-type plants.
In plants, lncRNAs can respond to stresses by acting either in cis or in trans through diverse mechanisms, including sequence complementarity or homology with RNAs or DNAs, promoter activity modification by nucleosome repositioning, and epigenetic regulation by DNA methylation and histone modification [1, 7, 8]. Owing to the complexity of lncRNA regulation, only a few lncRNAs have been functionally characterized in plants to date, although lncRNAs have attracted increasing attention in recent years. Recently, target mimicry was identified as a regulatory mechanism by which lncRNAs block the interactions between miRNAs and their targets. For example, the Arabidopsis phosphate starvation-induced lncRNA IPS1, which acts as a target mimic for miR399, can bind and sequester miR399 and thereby reduce miR399-mediated cleavage of PHO2, a gene involved in phosphate uptake. Similarly, several lncRNAs act as target mimics for tomato (Solanum lycopersicum) miRNAs involved in infection by tomato yellow leaf curl virus.
With the rapid development of high-throughput sequencing technologies, numerous lncRNAs have been identified under drought conditions in many species by transcriptome re-assembly. In maize (Zea mays L.), a total of 664 drought-responsive lncRNAs were identified, of which 126 were highly similar to known maize lncRNAs while the remaining 538 transcripts were novel lncRNAs. In Populus trichocarpa, 2542 lncRNA candidates were identified, 504 of which were drought-responsive. In cotton (Gossypium hirsutum L.), a total of 10,820 high-confidence lncRNAs were found under drought and control conditions, of which 9989 were lincRNAs, 153 were intronic lncRNAs, and 678 were anti-sense lncRNAs. However, comprehensive surveys of drought-responsive lncRNAs are still lacking, especially in tropical crops such as cassava.
Cassava (Manihot esculenta Crantz) is an important cash crop for many farmers in tropical and sub-tropical regions, and it provides staple food for over 750 million people around the world. Because of its starch-rich tuberous root, cassava is regarded as a major source of starch, bio-fuel, and animal feed. Cassava is generally tolerant to drought; however, severe drought stress greatly depresses its growth and development and ultimately reduces its economic yield. In the past decades, much progress has been made in the identification and functional characterization of cassava genes and proteins that respond to drought stress [16, 17, 18, 19]. However, very few studies concerning lncRNAs have been performed, and the mechanisms by which lncRNAs contribute to cassava drought tolerance remain largely unknown and need further exploration.
In this study, a strand-specific RNA-seq (ssRNA-seq) approach was applied to investigate genome-wide transcriptome changes in cassava leaves (at different developmental stages) and root under polyethylene glycol (PEG)-simulated drought conditions. Subsequently, drought-responsive lncRNAs were systematically identified, and their basic characteristics, expression patterns, and putative functions were analyzed and predicted. These findings will expand our knowledge of lncRNAs participating in the drought response in cassava and enable in-depth functional analysis of lncRNAs in the future.
Drought responses and ssRNA-seq of cassava
Compared with the control (0 h), leaves of cassava seedlings were badly wilted after 24 h of PEG-simulated drought stress (Additional file 1: Figure S1). Similar phenotypes were observed in our previous study, which also demonstrated that physiological traits such as peroxidase activity, proline content, and soluble protein content were significantly altered, and that the expression levels of thousands of genes changed dramatically after 3 and 24 h of 20% PEG treatment. As an extension of that work, similar PEG treatments were performed in this study, but here we focused on the systematic identification and functional characterization of drought-responsive lncRNAs through an ssRNA-seq approach.
Identification and characterization of lncRNA in cassava
In total, 111,585 transcripts were obtained after transcriptome reconstruction from all ssRNA-seq data using the Cufflinks pipeline. Subsequently, several filtering steps were applied to identify high-confidence lncRNAs (Fig. 1). First, transcripts that overlapped with known protein-coding genes on the same strand were removed. In this step, a total of 92,759 (~ 83%) transcripts, which overlapped with 33,033 protein-coding genes representing all annotated genes of the cassava genome, were filtered out. Second, transcripts with fewer than two exons or shorter than 200 bp were removed, leaving 2761 transcripts. Third, transcripts with FPKM > 1 in fewer than two samples were removed to ensure that the remaining transcripts were expressed. Next, transcripts with coding potential, as evaluated by Coding Potential Calculator (CPC), Coding-Non-Coding Index (CNCI), and the protein families database (Pfam), were removed. Finally, a total of 833 transcripts were retained and classified into 652 intergenic and 181 anti-sense lncRNAs according to their genomic locations.
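The filtering cascade described in this paragraph can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the pipeline actually used in the study: the `Transcript` record, its field names, and the pre-computed `has_coding_potential` flag (standing in for the combined CPC, CNCI, and Pfam calls) are hypothetical, and the exon/length criterion is read as "remove a transcript if either condition fails".

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Transcript:
    # Hypothetical record; field names are illustrative, not from the study.
    tid: str
    n_exons: int
    length: int                         # spliced length in bp
    fpkm: List[float]                   # one FPKM value per sample
    overlaps_coding_same_strand: bool   # overlaps an annotated coding gene
    has_coding_potential: bool          # assumed combined CPC/CNCI/Pfam call


def is_candidate_lncrna(t: Transcript,
                        min_exons: int = 2,
                        min_len: int = 200,
                        min_fpkm: float = 1.0,
                        min_samples: int = 2) -> bool:
    """Apply the filters in the order given in the text."""
    # Step 1: drop transcripts overlapping coding genes on the same strand.
    if t.overlaps_coding_same_strand:
        return False
    # Step 2: drop single-exon or short transcripts.
    if t.n_exons < min_exons or t.length < min_len:
        return False
    # Step 3: require FPKM > 1 in at least two samples.
    expressed = sum(1 for v in t.fpkm if v > min_fpkm)
    if expressed < min_samples:
        return False
    # Step 4: drop transcripts flagged as having coding potential.
    return not t.has_coding_potential
```

For instance, a two-exon, 350 bp transcript with FPKM values of 1.5, 2.0, and 0.2 that neither overlaps a coding gene nor shows coding potential would be retained, whereas the same transcript with a single exon would be discarded.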
Identification of differentially expressed (DE) lncRNAs
To explore the transcriptional changes of lncRNAs affected by PEG treatment, DE lncRNAs were identified by pair-wise comparison of samples collected at different time-points within the same tissue, respectively.
In total, 124 DE lncRNAs were identified in response to PEG treatment. Most of them were identified exclusively in one tissue, namely FL (31), FEL (26), BL (27), or RT (19), whereas 21 were identified in at least two tissues (Fig. 3c). No lncRNA was identified in all four tissues. These results indicate that lncRNAs tend to function in a tissue-specific manner.
Functional characterization of DE lncRNAs in trans-regulation
To explore the potential functions of drought-responsive lncRNAs, a total of 124 DE lncRNAs, together with 5187 DE genes, were selected and subjected to co-expression analysis to identify trans-regulatory networks of lncRNAs. Subsequently, functional enrichment analysis was performed for the genes of each group (co-expressed module), respectively, and then the enriched functions could be used to predict the functions of lncRNAs that were co-expressed with these DE genes.
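As an illustration of the co-expression idea, the sketch below computes a Pearson correlation between a lncRNA and a gene expression profile and converts it into an unsigned soft-threshold adjacency, |r|^β, which is the form used by WGCNA (the package named in the Methods). The power β = 6 and the example profiles are assumptions chosen for illustration; the text does not report the value used.

```python
import math
from typing import Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("profiles must have the same length (>= 2)")
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return sxy / math.sqrt(sxx * syy)


def soft_threshold_adjacency(x: Sequence[float],
                             y: Sequence[float],
                             beta: float = 6) -> float:
    """Unsigned adjacency |r|**beta; beta = 6 is an assumed example value."""
    return abs(pearson(x, y)) ** beta


# Hypothetical FPKM profiles across four samples.
lncrna_profile = [1.0, 2.0, 3.0, 4.0]
gene_profile = [2.0, 4.0, 6.0, 8.0]
adjacency = soft_threshold_adjacency(lncrna_profile, gene_profile)
```

Because the adjacency is unsigned, strongly anti-correlated pairs receive the same weight as strongly correlated ones; raising |r| to a power suppresses weak correlations relative to strong ones.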
The genes/lncRNAs in groups M5 and M6 were highly expressed in FEL (Fig. 4a). The expression of the former group was greatly suppressed, while that of the latter group was dramatically induced at 24 h of PEG treatment. Group M5 contained 5 lncRNAs, and its genes were significantly enriched in lipid metabolism, degradation of major carbohydrate (CHO) metabolism, Calvin cycle and light reaction, secondary metabolism, light signaling, and tetrapyrrole synthesis. The genes in group M6 were significantly enriched in amino acid synthesis, lipid metabolism, secondary metabolism of wax, and abiotic stress, but this group contained no lncRNAs (Fig. 4b).
The genes/lncRNAs in groups M7 and M8 were highly expressed in BL (Fig. 4a). Similar to groups M5 and M6, expression of the former group decreased greatly whereas that of the latter group increased significantly at 24 h of PEG treatment. Group M7 contained 13 lncRNAs, and its enriched categories included synthesis of major CHO metabolism, trehalose metabolism, nitrogen (N)-metabolism, photosynthesis, redox, secondary metabolism of flavonoids, and light signaling. Group M8 contained only one lncRNA and was significantly enriched in protein assembly and cofactor ligation (Fig. 4b).
The genes/lncRNAs in groups M9 to M11 were highly expressed in RT (Fig. 4a). In group M9, which contained only one lncRNA, expression clearly decreased from 0 h to 24 h of PEG treatment. The enriched categories in this group included hormone metabolism (e.g., gibberellin and jasmonate), mitochondrial electron transport/ATP synthesis, calcium signaling, and receptor kinase signaling. Groups M10 and M11 contained 4 and 9 lncRNAs, respectively. The genes/lncRNAs in group M10 were greatly induced at 3 h but suppressed at 24 h, and they were significantly enriched in abscisic acid (ABA), degradation of major CHO metabolism, secondary metabolism, and abiotic stress. In contrast, the genes/lncRNAs in group M11 were dramatically induced at 24 h after PEG treatment, and they were significantly enriched in raffinose metabolism, protein folding, and abiotic stress (Fig. 4b).
Taken together, these results revealed that the genes/lncRNAs were expressed in a tissue-specific manner in response to PEG treatment in cassava, and suggested that they tend to function differently in distinct tissues: e.g., cell-related metabolism, cell wall, and RNA regulation of transcription in FL; degradation of major CHO metabolism, Calvin cycle and light reaction, light signaling, and tetrapyrrole synthesis in FEL; synthesis of major CHO metabolism, N-metabolism, photosynthesis, and redox in BL; and hormone metabolism, secondary metabolism, calcium signaling, and abiotic stress in RT.
Functional characterization of DE lncRNAs in cis-regulation
To further explore the potential functions of drought-responsive DE lncRNAs, protein-coding genes located within 10 kb/100 kb upstream and downstream of these lncRNAs were selected and subjected to co-expression analysis. The lncRNA-mRNA pairs that were both highly correlated and closely located were considered candidates for a cis-acting regulatory relationship.
Together, these results suggested that these DE lncRNAs, which might act as regulators in cis-acting in response to PEG treatment, regulated the expression of their neighboring genes mainly through hormone metabolism, RNA regulation of transcription, and signaling of receptor kinase.
Functional prediction of lncRNAs acting as miRNA target mimics
lncRNAs have been demonstrated to function through miRNAs in transcriptional, post-transcriptional, and epigenetic gene regulation. It is therefore of great importance to investigate the crosstalk between lncRNAs and miRNAs by identifying lncRNAs that act as target mimics of known miRNAs in cassava.
In total, 11 lncRNAs were identified as target mimics of known miRNAs, such as miR156, miR164, miR169, and miR172 (Additional file 3: Table S2). miR156 is stress-induced and targets SPL genes (e.g., SPL9) in plant development and abiotic stress tolerance [21, 22]. As a target mimic of miR156k, TCONS_00068353 was greatly suppressed in FL, and consistently, a homolog of SPL9 (Manes.09G032800) exhibited an expression trend similar to that of TCONS_00068353 under PEG treatment (Fig. 5d and Additional file 3: Table S2). miR172 participates in the response to water deficit and salt stress by regulating the expression of AP2-like transcription factors, and its expression is promoted by SPL genes. Further studies revealed that the SPL/miR156 module can interact with the AP2/miR172 unit in barley. Notably, in our study, TCONS_00068353 was also predicted to bind miR172c, in coordination with the decreased expression of the miR172-targeted AP2-like gene (Manes.05G184000) in FL under PEG treatment. In addition, TCONS_00068353/Manes.09G032800 and TCONS_00068353/Manes.05G184000 showed similar expression patterns in response to PEG treatment, supporting an interaction between the SPL/miR156 module and the AP2/miR172 unit.
Besides the miRNA-mRNA interactions consistent with previous reports, some different and currently unknown interactions were identified. For example, MYC2 and CSD2 are known targets of miR169 and miR398, respectively, but they were predicted as targets of miR164a and miR171g in our study, in accordance with their expression patterns being similar to those of TCONS_00068353 and TCONS_00072359 (Fig. 5e and Additional file 3: Table S2), which acted as target mimics of miR164a and miR171g, respectively.
Together, these results strongly suggested that lncRNAs might function through miRNAs in the response to drought stress in cassava.
Validation of lncRNAs and genes by qRT-PCR
lncRNA is a key player in cassava drought stress
lncRNAs have been well demonstrated to play essential roles in the drought stress response in many plants, including Arabidopsis, rice (Oryza sativa), maize, cotton, foxtail millet (Setaria italica), and Populus. In contrast, very few lncRNAs have been comprehensively identified in tropical species, especially in cassava, a tropical plant with outstanding tolerance to drought stress. In this study, a total of 833 lncRNAs, including 652 intergenic lncRNAs and 181 anti-sense lncRNAs, were identified in cassava using an ssRNA-seq strategy. This number was far lower than that identified in cotton and Populus [12, 13], but higher than that identified in foxtail millet and maize [11, 28]. It was also 1.2-fold higher than that identified recently in cassava, even though stricter criteria were applied in our study. Together, these results suggest that the number of lncRNAs identified by sequencing may depend largely on the species, sequencing depth, and the criteria used for lncRNA identification.
Similar to the characteristics reported previously, the majority of cassava lncRNAs contained 2–3 exons; however, the median length was much shorter in our case. In addition, intergenic and anti-sense lncRNAs appeared to be preferentially located on certain chromosomes (Fig. 2a). Beyond these basic characteristics, we also compared our lncRNAs with those identified previously and found that only 57 (~ 6.5%) lncRNAs were shared; thus, the remaining 776 can be regarded as novel cassava lncRNAs identified in our study. Further inspection revealed that ~ 28.5% (221/776) of these lncRNAs were not expressed (FPKM < 1) in either FL or FEL samples, which might partly explain why they were not identified previously.
lncRNAs have been reported to exhibit organ-specific or tissue-specific expression patterns in regulating responses to abiotic stresses such as drought [11, 29]. In our study, 124 DE lncRNAs were found, and most (~ 83%) of them were identified in only one tissue (Fig. 3c). Further analysis revealed that 46 lncRNAs, together with thousands of co-expressed DE genes, were clustered into 11 groups with diverse expression patterns across the time-points of PEG treatment in four tissues (Fig. 4a). Consistent with results previously reported in other plants [11, 29], our findings strongly indicate that cassava lncRNAs are expressed tissue-specifically under drought conditions and that they might play different roles in distinct tissues, as revealed by functional enrichment analysis (Fig. 4b).
Functional prediction of cassava lncRNAs in response to drought stress
Emerging evidence has demonstrated that lncRNAs can act in trans to regulate the expression of multiple genes located throughout the genome [20, 30]. Therefore, in this study, a co-expression network analysis was performed to predict the functions of lncRNAs according to the functional enrichment of co-expressed DE genes. Consistent with our previous study, genes involved in cell cycle and cell organization, cell wall, Calvin cycle and light reaction, major CHO metabolism, secondary metabolism, receptor kinase signaling, hormone metabolism (such as ABA and GA), and abiotic stress were significantly enriched. Notably, this result was obtained consistently from two independent RNA-seq experiments, strongly suggesting that lncRNAs are involved in the same functions as their co-expressed genes under drought stress in cassava. Comparable functions of lncRNAs have also been reported in other species. In cotton, Lu et al. concluded that lncRNAs were likely to be involved in hormone signal transduction, carbon fixation in photosynthesis, secondary metabolism, and RNA transport in response to drought stress; in Arabidopsis, lncRNA DRIR was significantly activated by drought and salt stress, and it participated in regulating the expression of genes involved in ABA signaling, water transport, and transcription; in cassava, Li et al. found that lncRNAs were mainly associated with hormone signal transduction, starch and sucrose metabolism, and secondary metabolic pathways, and suggested that transcriptional regulation of gene expression might be one of the principal roles of lncRNAs in response to drought and/or cold stresses.
In addition, lncRNAs can act in cis to regulate the expression of their neighboring genes. Specifically, in maize, lncRNA Vgt1 influenced the expression of ZmRap2, which is located as far as ~ 70 kb downstream of Vgt1. In this study, a total of 27 lncRNA-mRNA pairs involved in cis-acting regulation were identified. The adjacent genes influenced by these lncRNAs were mainly involved in hormone metabolism, RNA regulation of transcription, and receptor kinase signaling. For example, TCONS_00060863 was located 2447 bp downstream of Manes.10G067700, which encodes an ABA 8'-hydroxylase involved in ABA catabolism (Fig. 5a), and TCONS_00040721 was located 6652 bp upstream of Manes.06G036900, which encodes an AP2/EREBP transcription factor (Fig. 5b). The expression levels of these lncRNA-mRNA pairs were further verified by qRT-PCR (Additional file 4: Table S3).
Networks of lncRNAs, miRNAs, and mRNAs involved in drought response of cassava
ABA is a key hormone involved in plant responses to biotic and abiotic stress. To investigate the changes in ABA levels in different tissues and at different time-points of drought treatment, ABA contents were determined in our samples. The results showed that ABA levels increased significantly in BL and RT at 3 h and 24 h, whereas they were almost unchanged in FL and FEL during the drought treatment (Additional file 6: Figure S2), suggesting that ABA functions mainly in BL and RT of cassava under drought stress. Accordingly, HAB1, a negative regulator of ABA signaling, was greatly suppressed in both BL and RT under PEG treatment. By contrast, NCED9, a key gene involved in ABA biosynthesis, was greatly suppressed in BL and RT upon PEG treatment despite the elevated ABA levels, indicating possible negative feedback regulation of ABA biosynthesis as previously described. In this work, genes related to ABA pathways were significantly enriched in group G10 and their expression levels were dramatically altered in root, consistent with our previous study. Among them, Manes.08G030300 encodes ABA DEFICIENT 2 (ABA2), which is involved in the conversion of xanthoxin to ABA-aldehyde during ABA biosynthesis, and it was a key hub gene with the most connections to other genes in this group. To explore the ABA-related networks in response to drought, this gene and its most connected genes were selected and visualized. As shown in Fig. 7b, this network included an ABA transporter, PDR12 (Manes.04G105800), which is a homolog of AtPDR12 that is necessary for timely responses to ABA under drought and is involved in ABA-regulated lateral root development, and another ABA2 gene (Manes.08G034900) directly related to ABA pathways. Notably, these genes were co-expressed with TCONS_00060863, which was also found to regulate in cis CYP707A1 (Manes.10G067700), encoding an ABA 8'-hydroxylase involved in ABA catabolism (Fig. 5a), strongly indicating that TCONS_00060863 is a key lncRNA involved in the ABA signaling pathway under drought conditions. In addition, a homolog of AtTRE1, which is greatly induced by ABA treatment and involved in drought stress tolerance, was also included. Compared with wild-type plants, AtTRE1 over-expressing lines showed enhanced root growth on trehalose-containing medium, indicating a possible role in root development under drought stress. WRKY transcription factors (TFs) are key components in ABA signaling, of which WRKY75 is well characterized in the phosphate (Pi) stress response and root development, and it can activate several Pi starvation-induced genes encoding phosphatases, Mt4/TPS1-like genes, and high-affinity Pi transporters. Recently, WRKY75 has also been identified as a novel component of the gibberellin (GA)-mediated signaling pathway. Interestingly, a homolog of WRKY75, together with GA1, which is involved in GA biosynthesis, and PHT1;7, which is related to Pi transport and specifically induced in Pi-deprived roots, were included in this network (Additional file 7: Table S5), suggesting that WRKY75 might also be involved in ABA signaling under drought stress in cassava. Consistently, CiWRKY75, which showed the highest sequence similarity to AtWRKY75 among the Arabidopsis WRKY family, was significantly induced by salt and ABA treatment in Caragana intermedia. In addition, several genes related to oxidation-reduction, e.g., CYP71B34 and CYP71B35, were also included. Together, these results revealed a complex network of ABA signaling in the drought response of cassava, and suggested that this network may contribute to root development under stress conditions, as indicated by a few functionally well-characterized genes such as PDR12, TRE1, and WRKY75 [36, 37, 39].
In this study, a large number of drought-responsive lncRNAs were systematically identified in cassava leaves and root, their basic characterizations were investigated, and their potential functions were predicted via trans-acting, cis-acting, and miRNA target mimics. These findings provide a comprehensive view of cassava lncRNAs in response to drought stress and expand our knowledge of lncRNAs in the signaling regulatory networks under drought condition, which will enable in-depth functional analysis in the future.
Plant materials and treatments
This experiment was conducted as previously described: stems of the cassava variety Ku50 were cut into segments ~ 15 cm in length with two to three buds and planted vertically in pots (height × bottom diameter × upper diameter = 18.8 cm × 14.8 cm × 18.5 cm) with soil and vermiculite (1:1) in the glass house of the Chinese Academy of Tropical Agricultural Sciences, Haikou, China. Forty-five days later, uniform cassava seedlings were chosen and subjected to drought stress simulated with a 20% PEG 6000 solution according to our previous study. Leaves at different developmental stages, including folded leaf (FL), full expanded leaf (FEL), and bottom leaf (BL), as well as root (RT), were collected at 0, 3, and 24 h after PEG treatment and frozen immediately in liquid nitrogen. Each sample was pooled from five plants, with three replicates. Subsequently, two replicates of these samples were chosen for ssRNA-seq instead of the regular RNA-seq used previously. For each sample, ABA content was determined using plant enzyme-linked immunosorbent assay (ELISA) kits (MeiLian Biotechnology, Shanghai, China) with three replicates.
RNA extraction, library construction and sequencing
The total RNA extraction, transcriptome libraries preparation, and ssRNA-seq sequencing were conducted by the Annoroad Gene Technology Corporation (Beijing, China). Briefly, the integrity and quality of total RNA were examined by a Nanodrop ND-2000 spectrophotometer (Thermo Scientific Inc., USA) and an Agilent 2100 Bioanalyzer (Agilent, USA). The RNA-seq libraries were constructed using Illumina TruSeq™ RNA sample prep Kit (Illumina, San Diego, CA, USA) with Ribo-Zero Magnetic kit for rRNA depletion according to the manufacturer’s instructions, and subsequently sequenced on the Illumina Hiseq 4000 platform with 150 bp paired-end reads.
Identification of lncRNAs
After trimming the adaptor sequences and removing low-quality reads, clean reads were obtained and mapped to the cassava reference genome using TopHat 2.0 with the ‘--library-type fr-firststrand’ parameter. Subsequently, the Cufflinks pipeline was employed to assemble reads into transcripts, and the assembled transcripts found in at least two samples were chosen for further analysis. Expression levels were calculated as fragments per kilobase per million mapped reads (FPKM). For identification of lncRNAs, a three-step pipeline was adopted: 1) transcripts that overlapped with known protein-coding genes on the same strand, or that had a length < 200 bp, an exon number < 2, an ORF length > 300, or a minimal read coverage < 3, were removed; 2) transcripts with coding potential were removed based on evaluation by Coding Potential Calculator (CPC), Coding-Potential Assessment Tool (CPAT), and Coding-Non-Coding Index (CNCI); 3) transcripts with known protein domains were also excluded according to Pfam hidden Markov models (HMMs). The remaining transcripts were considered reliable lncRNAs. Differentially expressed (DE) lncRNAs were identified by pair-wise comparison with a false discovery rate < 0.05 and |log2 fold-change| > 1.
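For reference, the FPKM unit and the DE cutoffs stated above can be written out as a short Python sketch. The FPKM expression is the standard definition of the unit; the function names and the example numbers are hypothetical, and the sketch does not reproduce the statistical test that produced the false discovery rates in the study.

```python
def fpkm(fragments: float,
         transcript_length_bp: float,
         total_mapped_fragments: float) -> float:
    """Fragments per kilobase of transcript per million mapped fragments."""
    if transcript_length_bp <= 0 or total_mapped_fragments <= 0:
        raise ValueError("length and library size must be positive")
    # 1e9 = 1e3 (bp per kilobase) * 1e6 (fragments per million)
    return fragments * 1e9 / (transcript_length_bp * total_mapped_fragments)


def is_differentially_expressed(log2_fold_change: float,
                                fdr: float,
                                fdr_cutoff: float = 0.05,
                                lfc_cutoff: float = 1.0) -> bool:
    """Cutoffs from the text: FDR < 0.05 and |log2 fold-change| > 1."""
    return fdr < fdr_cutoff and abs(log2_fold_change) > lfc_cutoff


# Hypothetical example: 500 fragments on a 2 kb transcript in a library
# of 25 million mapped fragments.
example_fpkm = fpkm(500, 2000, 25_000_000)
```

A |log2 fold-change| > 1 corresponds to more than a two-fold change in either direction, so a transcript that halves in abundance qualifies in the same way as one that doubles, provided the FDR criterion is also met.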
Prediction of lncRNA target and functional enrichment
For identification of target genes in trans-regulation, DE lncRNAs, together with DE genes, were subjected to the standard WGCNA procedure. The lncRNAs and genes within the same group (module) had similar expression patterns and were considered potentially trans-regulated. To predict the function of lncRNAs in trans-regulation, cassava loci were functionally annotated and classified into hierarchical categories based on MapMan, and significantly over-represented functional categories were determined by Fisher’s exact test as previously reported [18, 51].
For identification of target genes in cis-regulation, protein-coding genes located within 10 kb/100 kb upstream and downstream of lncRNAs were selected and also subjected to co-expression analysis. The lncRNA-mRNA pairs that were both co-expressed and closely located were considered to be in cis-acting regulatory relationships.
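The over-representation test mentioned above can be illustrated with a one-sided Fisher's exact test, computed here as the upper tail of the hypergeometric distribution. This is a minimal sketch: the function name and the example counts are hypothetical, and multiple-testing correction across categories is not shown.

```python
from math import comb


def enrichment_p_value(total_genes: int,
                       category_genes: int,
                       module_genes: int,
                       module_in_category: int) -> float:
    """One-sided Fisher's exact test for over-representation.

    Returns P(X >= module_in_category), where X counts category members
    in a random draw of `module_genes` genes from `total_genes` genes,
    of which `category_genes` belong to the category.
    """
    N, K, n, x = total_genes, category_genes, module_genes, module_in_category
    if not (0 <= K <= N and 0 <= n <= N and 0 <= x <= min(K, n)):
        raise ValueError("inconsistent counts")
    denom = comb(N, n)
    # math.comb returns 0 when the draw is impossible, so the sum is safe.
    tail = sum(comb(K, k) * comb(N - K, n - k)
               for k in range(x, min(K, n) + 1))
    return tail / denom


# Hypothetical toy example: 10 annotated genes, 5 in the category,
# a module of 5 genes, all 5 of which fall in the category.
p = enrichment_p_value(10, 5, 5, 5)
```

In the toy example only one of the 252 possible five-gene draws contains all five category members, so the p-value is 1/252.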
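The neighbor search behind the cis analysis can be sketched as a simple interval-distance calculation. The `Feature` record, the coordinates, and the function names below are hypothetical; strand is ignored, so upstream and downstream neighbors are treated alike, which is an assumption rather than a statement of how the study handled orientation.

```python
from typing import List, NamedTuple, Optional, Tuple


class Feature(NamedTuple):
    name: str
    chrom: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


def gap_bp(a: Feature, b: Feature) -> Optional[int]:
    """Bases separating two features; 0 if they overlap or abut,
    None if they lie on different chromosomes."""
    if a.chrom != b.chrom:
        return None
    if b.start > a.end:
        return b.start - a.end - 1
    if a.start > b.end:
        return a.start - b.end - 1
    return 0


def neighbours_within(lnc: Feature,
                      genes: List[Feature],
                      window_bp: int = 10_000) -> List[Tuple[str, int]]:
    """Genes within `window_bp` of the lncRNA on either side, nearest first."""
    hits = []
    for g in genes:
        d = gap_bp(lnc, g)
        if d is not None and d <= window_bp:
            hits.append((g.name, d))
    return sorted(hits, key=lambda h: h[1])


# Hypothetical coordinates for illustration only.
lnc = Feature("lncA", "chr1", 10_000, 12_000)
genes = [
    Feature("geneA", "chr1", 14_448, 16_000),
    Feature("geneB", "chr1", 60_000, 61_000),
    Feature("geneC", "chr2", 11_000, 13_000),
]
near = neighbours_within(lnc, genes, window_bp=10_000)
far = neighbours_within(lnc, genes, window_bp=100_000)
```

Candidate pairs returned by such a search would then be tested for co-expression, so that only pairs that are both close and correlated are retained.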
Prediction of lncRNAs acting as miRNA target mimics
Target mimics were predicted by submitting all of the discovered DE lncRNAs and the cassava miRNAs (miRBase Release 22, March 2018) to psRNATarget, with fewer than four mismatches and G/U pairs allowed within the lncRNA-miRNA pairing regions, according to the principles established by Wu et al.
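To make the pairing criterion concrete, the sketch below counts mismatches and G:U wobble pairs in an ungapped, antiparallel miRNA-target duplex. It is a deliberate simplification and not the psRNATarget scoring scheme, which also handles gaps and bulges; the function name and the toy sequences are hypothetical, and how mismatches and G:U pairs are combined against the cutoff of four is left to the reader because the text does not specify it.

```python
from typing import Tuple

WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
WOBBLE = {("G", "U"), ("U", "G")}


def count_imperfect_pairs(mirna: str, target_site: str) -> Tuple[int, int]:
    """Return (mismatches, gu_pairs) for an ungapped duplex.

    Both sequences are given 5'->3' and must have equal length; the
    target site is read in reverse so that position 1 of the miRNA
    faces the 3' end of the site. DNA input is accepted (T -> U).
    """
    m = mirna.upper().replace("T", "U")
    t = target_site.upper().replace("T", "U")
    if len(m) != len(t):
        raise ValueError("sequences must have the same length")
    mismatches = gu_pairs = 0
    for pair in zip(m, reversed(t)):
        if pair in WATSON_CRICK:
            continue
        if pair in WOBBLE:
            gu_pairs += 1
        else:
            mismatches += 1
    return mismatches, gu_pairs


# Toy sequences for illustration only (not real miRNAs).
perfect = count_imperfect_pairs("AUGC", "GCAU")
one_mismatch = count_imperfect_pairs("AUGC", "GCAC")
two_wobbles = count_imperfect_pairs("GGGG", "UUCC")
```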
Quantitative RT-PCR (qRT-PCR) analysis
Total RNA was isolated from each sample using RNAiso reagent (OMEGA), and first-strand cDNA was synthesized using the PrimeScript™ RT reagent Kit with gDNA Eraser (TaKaRa, Dalian, China). To validate the ssRNA-seq results, a total of five DE lncRNAs, together with 13 co-expressed genes, were selected and confirmed by qRT-PCR with the primers listed in Additional file 4: Table S3. qRT-PCR was performed using SYBR Premix Ex Taq™ (TaKaRa, Dalian, China) on a Stratagene Mx3000P machine (Stratagene, CA, USA) under the following conditions: 30 s at 95 °C, followed by 40 cycles of 10 s at 95 °C and 30 s at 60 °C. A thermal denaturing step was then run to generate melt curves and verify amplification specificity. The cassava actin gene was used as the endogenous control. Each sample was measured in triplicate, and the relative expression levels were calculated using the 2^(−ΔΔCt) method.
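The 2^(−ΔΔCt) calculation named above can be written out as follows. The function name and the Ct values in the example are hypothetical; the reference gene corresponds to the actin control mentioned in the text, and the formula assumes roughly 100% amplification efficiency for both primer pairs.

```python
def relative_expression(ct_target_treated: float,
                        ct_ref_treated: float,
                        ct_target_control: float,
                        ct_ref_control: float) -> float:
    """Fold change of a target in treated vs. control by 2^(-ddCt)."""
    # Normalise the target to the reference gene within each condition.
    delta_ct_treated = ct_target_treated - ct_ref_treated
    delta_ct_control = ct_target_control - ct_ref_control
    # Compare the treated sample with the control sample.
    delta_delta_ct = delta_ct_treated - delta_ct_control
    return 2 ** (-delta_delta_ct)


# Hypothetical Ct values: the target amplifies two cycles earlier after
# treatment while the reference gene is unchanged.
fold_change = relative_expression(22.0, 18.0, 24.0, 18.0)
```

With these values ΔΔCt is −2, which corresponds to a four-fold induction relative to the control.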
We thank the Annoroad Gene Technology Corporation (Beijing, China) for the ssRNA-seq sequencing in this study.
This research was funded by the National Natural Science Foundation of P. R. China (31600198), the Supporting Scheme for Returned Overseas Chinese Students’ Entrepreneurial Start-Ups (Innovation sub-project) from MOHRSS to Z.D., the Central Public-Interest Scientific Institution Basal Research Fund for Innovative Research Team Program of CATAS (17CXTD-28, 1630052017017), and the Central Public-Interest Scientific Institution Basal Research Fund for Chinese Academy of Tropical Agricultural Sciences (1630052016005, 1630052016006, 1630052017021).
Availability of data and materials
The datasets generated and analyzed in our study are available in the NCBI Sequence Read Archive under the accession number of SRP162280.
JZ, WH, and ZD conceived the idea and designed the experiments. ZD, WT, LF, CW, and YY performed the experiments. ZD, WY, YL, and LF analyzed the data. ZD wrote the draft. JZ, GL, WH, and ZD finalized the manuscript. All authors read and approved the final manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 7. Wang J, Meng X, Dobrovolskaya OB, Orlov YL, Chen M. Non-coding RNAs and their roles in stress response in plants. Genom Proteom Bioinf. 2017;15(5):301–12.
- 14. Utsumi Y, Tanaka M, Morosawa T, Kurotani A, Yoshida T, Mochida K, Matsui A, Umemura Y, Ishitani M, Shinozaki K, et al. Transcriptome analysis using a high-density oligomicroarray under drought stress in various genotypes of cassava: an important tropical crop. DNA Res. 2012;19(4):335–45.
- 37. Van Houtte H, Vandesteene L, Lopez-Galvis L, Lemmens L, Kissel E, Carpentier S, Feil R, Avonce N, Beeckman T, Lunn JE, et al. Overexpression of the trehalase gene AtTRE1 leads to increased drought stress tolerance in Arabidopsis and is involved in abscisic acid-induced stomatal closure. Plant Physiol. 2013;161(3):1158–71.
- 41. Mudge SR, Rae AL, Diatloff E, Smith FW. Expression analysis suggests novel roles for members of the Pht1 family of phosphate transporters in Arabidopsis. Plant J. 2010;31(3):341–53.
- 42. Wan Y, Mao M, Wan D, Liu J, Wang G, Guojing LI, Wang R. Caragana intermedia WRKY75 altered Arabidopsis thaliana tolerance to salt stress and ABA. Acta Botan Boreali-Occiden Sin. 2018;38(1):17–25.
Open Access: This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.