Introduction

Alfalfa (Medicago sativa L.) is an important forage legume species with global adaptation, high forage quality and the capacity for harvesting biomass multiple times during the growing season. Alfalfa is an autotetraploid (2n = 4x = 32), perennial outcrossing species with high levels of genetic diversity in cultivated and non-cultivated populations. Besides its use as a forage, alfalfa also has potential crop for biofuel production [1]. Alfalfa has the capacity for symbiotic nitrogen fixation and can also contribute to reduce soil erosion [2, 3].

Endogenous non-coding RNAs of approximately 21–22 nucleotides represent plant miRNAs that silence gene expression by binding to complementary sequences of its target mRNA at the post-transcriptional level. Such targeting results in mRNA cleavage and degradation or repression of translation, with the former being more prevalent in plants [4,5,6,7]. The miRNA analyses in different plant species highlight the important regulatory roles of miRNAs in multiple organs (roots, stems, leaves and flowers), differentiation and development, leaf polarity, transition from juvenile to vegetative stages and vegetative to reproductive phases, and regulation of plant responses to biotic and abiotic stresses [8,9,10].

Several investigations have shown that plant miRNAs can be classified into conserved and novel lineage- or species-specific miRNAs. Conserved miRNAs and their corresponding target genes are commonly found in all or most angiosperms, with some also being described in gymnosperms as well as primitive land plants such as ferns [11, 12]. However, miRNA analysis in several legumes including M. truncatula, soybean (Glycine max L), chickpea (Cicer arietinum L.), common bean (Phaseolus vulgaris), and Lotus japonicus indicate the presence of miRNAs that seem to be specific to certain legumes and there could have important gene regulatory roles [13,14,15,16,17,18,19]. Although recent attempts were made to report miRNAs from alfalfa (M. sativa) [20,21,22], these do not include the discovery of novel miRNAs, and most importantly, the miRNA target genes have not been confirmed in this legume species. Understanding miRNAs and their target gene regulation in various tissues can provide further insights into the miRNA target networks operating in a tissue-specific manner in alfalfa.

In order to identify conserved miRNAs as well as novel miRNAs from alfalfa, we constructed and sequenced small RNA libraries from whole clonally propagated plants, roots and shoots of three alfalfa genotypes (Altet-4, NECS-141 and NF08ALF06). The sequenced reads were mapped to known miRNAs in M. truncatula, deposited in the miRBase to identify and annotate the miRNAs in alfalfa. Degradome libraries were constructed and sequenced from these three genotypes to characterize the miRNA gene targets.

Materials and methods

Plant materials and growth conditions

Three alfalfa genotypes NECS-141, Altet-4 and NF08ALF06 were evaluated in this study. NECS-141 is the genotype being used to sequence the tetraploid alfalfa genome [23]. Altet-4 is an aluminum tolerant genotype used to develop a mapping population [24]. NF08ALF06 is a commercial breeding line with good agronomic performance (Forage Genetics International). The three alfalfa genotypes (NECS-141, Altet-4 and NF08ALF06) were clonally propagated and grown in tissue culture. After 13 d of growth in rooting media, these were transferred to medium at pH 7 for 96 h as previously described [25]. The rooting media contains 0.55 g/L Murashige & Skoog Basal Medium with Vitamins (PhytoTechnology #M519), 1 ml Plant Preservative Mixture, PPM (PhytoTechnology), adjust the pH to 5.8, and add 12 g/L Gelzan. The plants were placed in a Conviron growth chamber (24 °C, 18 h /6 h day/night cycle, 100 μmol light intensity) for root development and growth. An additional 20 clonally propagated plants of these genotypes were grown in a Conviron growth chamber as previously described and used to evaluate the tissue-specific expression of the miRNAs. Tissue samples were harvested and immediately flash frozen in liquid nitrogen and stored at − 80 °C.

Small RNA library construction and sequencing

Total RNA was isolated from the whole plants, roots and shoots of three alfalfa genotypes using TRIzol ® Reagent (Invitrogen), according to the manufacturer’s instructions. The quality of total RNA was monitored on 1% agarose gel and their concentrations were measured using Nanodrop spectrophotometer. Small RNA libraries were generated as described previously [26] by following the protocol described for the Illumina Truseq® Small RNA Preparation kit (Illumina, San Diego, USA). Briefly, 1 μg total RNA per sample was ligated sequentially with 5′ and 3’ RNA adaptors. The ligated products were converted into cDNAs and then amplified using PCR. The amplified products were sequenced using an Illumina Hiseq® Analyzer.

Identification of conserved and novel miRNAs

The raw sequencing reads were processed as follows: adaptor sequences were trimmed off from the raw reads to obtain small RNAs. These reads were then mapped to ribosomal RNA (rRNA), transfer RNA (tRNA), small nuclear RNAs (snRNA), and the aligned and mapped reads were not used for further analysis. The remaining reads were aligned to miRBase v 20 [27] to identify miRNAs in M. sativa. The reads with 100% sequence identity were designated as conserved miRNA homologs. To identify novel miRNAs, the presence of the miRNA-star (miRNA*) sequences coupled with the predictable hairpin-like structure for the precursor sequences were used.

Degradome library construction and analyses

Degradome libraries from the alfalfa genotypes NECS-141, Altet-4 and NF08ALF06 were constructed as previously described to identify potential target mRNAs [28]. Briefly, the cleaved 5′ monophosphate containing polyadenylated mRNA fragments were ligated to an RNA oligo-nucleotide adapter containing MmeI recognition site at its 3′ end. The ligated products were converted into cDNA using reverse transcriptase and the product was amplified using only 5 PCR cycles. The PCR product was eluted, digested with MmeI restriction enzyme and then ligated to a double-stranded DNA adapter. The ligated product was again purified and amplified using 15 cycles of PCR. The final PCR product was sequenced. The reads were processed for quality and then aligned to the transcriptome assembly of M. truncatula to identify potential miRNA targets using the SeqTar pipeline [29].

Results and discussion

The analyses of small RNA libraries

High-throughput sequencing has been used to identify miRNAs and their target mRNAs in plants [15, 30, 31]. To catalogue conserved and novel miRNAs in alfalfa, a total of eight small RNA libraries from the whole plants, roots and shoots of Altet-4, NECS-141 and NF08ALF06 genotypes were constructed and sequenced. After removal of the adapter sequences and low-quality reads, the total reads ranging between 11 to 42 million, and unique reads ranging between 1.8 to 8.5 million reads from these nine libraries were obtained (Table 1). However, the quality of the small RNA library generated from the shoots of NF08ALF06 did not meet the threshold criteria, therefore only NECS-141 and Altet-4 were used for the miRNA analyses of shoot tissues.

Table 1 The mapping of total and unique reads obtained from different small RNA libraries

Quantification of miRNA abundances between the genotypes and tissues was preceded by normalizing the expression levels of miRNA families to reads per ten million (RPTM). The normalized miRNA family read frequencies ranged between 1 to 552,267 RPTM for the whole plants, between 1 to 134,679 RPTM for the root samples, and 1 to 165,310 RPTM for the shoot samples (Table 2). The range of miRNA read frequencies varied slightly between the three genotypes. As expected, the most conserved miRNAs appeared to be the most abundantly expressed in all tissues and genotypes, with the exception of miR169, miR393, miR395 and miR172 which exhibited low abundances. Specifically, miR172 levels in roots and shoots of the three genotypes were extremely low and in most cases was below 20 RPTM (Table 2). The miRNA families with the lowest expression levels, and in some cases as low as 1 RPTM, were largely represented by the non-conserved miRNAs or miRNAs that have been reported exclusively from M. truncatula (miRBase) that include miR2601, miR2674, miR5207, miR5241, miR5243, miR5244, miR5255, miR5257, miR5269, miR5282, miR5289, miR5294, miR5296, miR5299, miR5561, miR5744, and miR7701 (Table 2). miR5207 is the only miRNA that was also reported from Gossypium raimondii (miRBase). The majority of the miRNA families identified are 21 nt long, although some cases including miR2601 and miR2603 were represented by 22 nucleotides. Further, a total of 23 miRNA families included between miR5267 to miR5299 were 24 nt long. The fact that these small RNAs were initially identified in M. truncatula (miRBase), and could be identified in several independent small RNA libraries from three different alfalfa genotypes (Table 2), suggests that these sequences and their associated processing are conserved between alfalfa and its close relative M. truncatula. However, their extremely low abundances coupled with their longer read lengths could also indicate that these may be 24-nt long siRNAs. Additional studies are needed to assess the precise nature of these small RNAs, i.e., miRNAs or siRNAs.

Table 2 Identified miRNA families and their frequencies (reads per ten million [RPTM]) in whole plants, roots and shoots of three alfalfa genotypes (miRNA-stars were marked in bold)

MicroRNA profiles in alfalfa plants, roots and shoots

A total of 100 known miRNA families were identified from the small RNA libraries of the three alfalfa genotypes (Table 2). Of these, 21 families were represented by the highly conserved miRNAs, whereas the remaining 79 families could be considered as Medicago-specific miRNA families. The identification of these 79 miRNA families in alfalfa was based on their expression in M. truncatula (miRbase), therefore, these are conserved at least between M. truncatula and alfalfa.

Among the highly conserved miRNA families, miR166 was the most highly expressed family in seven of the eight samples that were surveyed in this study. The only exception to this trend was observed in the shoots of NECS-141 in which the miR2118 family was the most abundant followed by the miR166 family. The miRNA families, miR396 and miR2118 represents the second and third most abundantly expressed in the whole plants, while miR159 and miR396 were the second and third most highly expressed miRNAs in roots. Several additional miRNA families including miR398, miR160, miR168, miR319, miR408, miR1510 and miR2643 were also highly expressed but miR169, miR171, miR393, miR397 and miR395 were expressed at relatively very low levels (Table 2). On the other hand, miR159, miR156, miR319, miR398 miR1507 and miR1510 were highly expressed but miR164, miR169, miR172, miR393, miR397, miR399 and miR482 were expressed at very low levels in roots of these genotypes. Interestingly, miR160 was not sequenced from the roots of three alfalfa genotypes.

Overall, the conserved miRNA families such as the miR156, miR159, miR166, miR168, miR319, miR396, miR398 and miR408 were more highly expressed in the plants, roots and shoots of all three alfalfa genotypes. Among the legume-specific families, miR1507, miR1510, miR2118, miR2592, miR2643, miR5213, miR5232, miR5558 and miR7696 (Table 2) were also abundant in all tissues of alfalfa genotypes. Conversely, some conserved miRNA families represented by miR169 and miR393 recorded very low abundances in all samples. Other notable differences between roots and shoots include relatively low expression levels of miR160, miR167, and miR408 in roots compared to the shoots of alfalfa genotypes (Table 2).

Several miRNA families including miR482, miR1507, miR2118, miR4416 are conserved in M. truncatula, soybean, chickpea (miRBase). These miRNA families are known to regulate NBS-LRR genes that are involved in pathogen resistance. The miRNA-guided cleavage on the NBS-LRR genes initiates the generation of phasiRNAs [16, 18, 32]. In alfalfa, miR482, miR1507 and miR2118 were detected in all three tissues (Table 2), but not miR4416. Both miR2118 and miR1507 families were more abundantly expressed in all tissues and genotypes compared with miR482 family. Remarkably, miR2118 was the top most highly expressed miRNA family in shoots of NECS-141. By contrast, miR2118 levels were very low in roots of three alfalfa genotypes. On the other hand, miR1507 family displayed approximately similar levels in three tissues of alfalfa genotypes.

The miRNA-star sequences corresponding to the 12 of the 21 highly conserved miRNA families were also recovered from almost all libraries (Table 2). Additionally, miRNA-stars for the miR1510, miR4414, miR5208, and miR7696 were also detected. Furthermore, the miRNA-star expression levels for miR156, miR166 and miR168 were very high (Table 2). Intriguingly, like miR168, miR168 star levels differed greatly between different tissue. In shoots of NECS-141, miR168 star levels were slightly more than that of miR168, while both in whole plants and roots, the star levels were approximately half of the levels of miR168.

miRNA diversity in alfalfa compared with other legumes

Several miRNA families are specifically reported from the leguminous plants such as the M. truncatula, Glycine max, Lotus japonicus, Phaseolus vulgaris, Cicer arietinum, Vigna unguiculata and Acacia auriculiformis [14, 16, 18, 19, 32, 33]. These lineage-specific miRNAs include miR1507, miR1508, miR1509, miR1510, miR1512, miR1514, miR1520, miR1521, miR2118, miR2086, miR2109, miR2199, miR4414, miR5213, miR5232, and miR5234 among others (miRBase). The majority of these were reported from M. truncatula and soybean, since these legume species have been the subject of multiple studies exploring small RNAs. Most of these legume-specific miRNAs were also identified in alfalfa and a few of them including miR1507, miR1510, miR2118, miR2592, miR2643, miR5211, miR5213, miR5214, miR5232, miR5239, miR5277, miR5558, and miR7696 were specifically highly expressed in all three genotypes (Table 2).

Identification of novel miRNAs from alfalfa

The sequencing of the small RNAs from multiple tissues of three different alfalfa genotypes would allow us to identify the novel miRNAs more confidently. Novel miRNA identification was dependent on sequencing of the miRNA complementary strand (miRNA-star) coupled with the predictable fold back structure for the primary miRNA transcript. Because a stable assembly of the tetraploid alfalfa genome sequence is not available, the small RNAs were mapped to the M. truncatula genome. Mapping of the small RNAs from the three alfalfa genotypes onto the M. truncatula genome enabled the identification of novel miRNAs more confidently because they have been sequenced from M. sativa and mapped on to the M. truncatula, suggesting their conservation between M. sativa and M. truncatula. Moreover, the novel miRNA identification in this study is more robust as it includes sequencing of these small RNAs from three different genotypes. We have identified a total of 17 novel miRNAs which have been sequenced from all of the three genotypes (Table 3 and Fig. 1). Among these, t50582913 was highly expressed followed by t50063038. In roots, t50582913 was highly expressed in NECS-141 and Altet-4 but not in NF08ALF06. In shoots, t50063038 was highly expressed followed by the t50582913 and t51235783.

Table 3 Identified novel miRNAs based on sequencing both 5′ and 3′ reads and the most abundant ones that is marked in bold denotes potential novel miRNA based on their greater abundances
Fig. 1
figure 1

The predicted foldback structures using the novel miRNA precursor sequences. a The fold-back structures for six novel miRNAs. b The distribution of small RNA reads on the precursors of the novel miRNAs depicted in Fig. 1a

Identification of miRNA targets in alfalfa

Although the alfalfa is one of the important legumes agronomically, the genome sequencing and annotations are not available so far. Due to this, studies have utilized the well-studied and closely related M. truncatula genome annotations as a model for alfalfa studies. The nucleotide identity for some genes was greater than 97% between M. sativa and M. truncatula [34]). Thus, using M. truncatula transcript annotations can facilitate identification of miRNA targets in alfalfa. We used SeqTar algorithm (Zheng et al., 2012) to identify miRNA targets by allowing a maximum of 4 mismatches between miRNAs and their potential target transcripts.

Previous studies have revealed that conserved miRNAs are strongly associated with the regulation of genes that encode transcription factors [35]. These transcription factors in turn regulate key developmental processes and pathways in plants. Degradome sequencing has been very effective in identifying plant miRNA targets. Besides identifying the conserved targets, this approach can also identify non-conserved targets for the conserved miRNAs [28, 36, 37]. Degradome sequencing was used in this study to identify the cleaved mRNA fragments corresponding to the miRNA recognition sites in all three alfalfa genotypes. Approximately 30 million degradome reads were obtained from the transcripts of each of the alfalfa genotypes (Table 4) and these reads were analysed using SeqTar program. In total, we have identified 69 targets for 31 miRNA families that included 16 highly conserved families (Table 5). With respect to the conserved miRNAs, 33 targets for 16 conserved miRNA families were identified (Table 5). The known targets for miR162, miR165/166, miR398 and miR399 families were not identified in this study. Although miR165/166 family is the most abundantly expressed as scored from their read frequencies in almost all libraries but the cleaved fragments from the HD-Zip target transcripts were not recovered from degradome libraries of alfalfa genotypes.

Table 4 Mapping of the reads obtained from the degradome libraries
Table 5 miRNA targets identified in the degradome libraries generated from three alfalfa genotypes. #Mis. is number of mismatches on the miRNA complementary site; Valid reads is Reads corresponding to the expected cleavage site; Total reads is Total reads mapped to the cDNA of the gene; Percent is Percent reads at the expected cleavage site

The identified miRNA targets in all three genotypes include mainly transcription factors. Specifically, five members of the squamosa promoter-binding-like protein (SPL) targeted by the miR156 family, five members of the auxin response factors targeted by both miR160 and miR167 families, five members of the apetala2 (AP2)-domain containing transcription factors, four members of the growth-regulating factor (GRFs) family targeted by miR396, two members of the TCP family transcription factors targeted by miR319, and, a NAC domain-containing transcription factor-like protein (NAC) targeted by miR164 [35]. Additionally, transcripts encoding Argonaute targeted by miR168, laccase targeted by miR397, and three plantacyanin containing proteins targeted by miR408 were also identified. Although evidence indicates that that miR398 targets Cu/Zn superoxide dismutases and a copper chaperone for the superoxide dismutases (CCS) in plants [28, 38] these relationships were not apparent in the data from this study. On the other hand, we have identified three potentially non-conserved targets (plastocyanin, protein disulphide isomerase and a hypothetical protein) for miR398 in three alfalfa genotypes. In addition to the GRFs, our analyses revealed potential non-conserved targets for miR396 including TNP1, Ulp1 protease and hypothetical proteins (Table 5).

The analyses of legume-specific miRNAs and their targets have revealed an interesting miRNA: target networks between the miRNAs and the NBS-LRR genes [14, 16, 18, 32]. In this study, we identified NBS-LRR disease resistance genes as targets for four different miRNA families including miR482, miR1507, miR1510 and miR5213 in alfalfa (Table 5).

Degradome analyses has also been utilized to identify potential targets for several non-conserved miRNAs or miRNAs that are present only in closely related species such as the M. truncatula. To increase the confidence in identification of targets for the non-conserved miRNAs that are usually expressed at low abundances and the cleavage frequencies on those targets are relatively low, we considered as ‘targets’ only those for which the cleavages were detected at least in two of the three alfalfa genotypes. The transcripts for Medtr6g053240.1 (F-box protein interaction domain protein) had a cleavage frequency of approximately 75% and were targeted by the miR2643 in NF08ALF06 genotype. Additionally, two other F-box protein interaction domain protein genes were also identified as targets for miR2643 in alfalfa genotypes (Table 5). These results suggest that the F-box protein interaction domain protein family are regulated by this potential legume-specific miRNA. Another notable observation is that 6 different genes identified as potential targets for miR7696, and the cleavage frequency of a particular target gene (hypothetical protein, Medtr3g112250.1) was more abundant in all three alfalfa genotypes (Table 5).

Because some of the miRNA-stars are also highly expressed, we scrutinized the degradome reads for potential cleavages on the transcripts that are complementary to the miRNA-stars. This analysis has identified potential targets for at least four conserved miRNAs. Specifically, miR156-star targets a heat shock transcription factor, miR164-star targets a protein transporter Sec61 subunit alpha-like protein, miR167-star targets a GRAS family transcription factor, and, miR482-star targets an auxin response factor 1 in alfalfa (Table 5).

The confirmed targets of conserved miRNAs are known to regulate diverse developmental processes in the lifecycle of plants. For example, the SPL transcription factors (target of miR156) which regulate the transition from juvenile to adult phase of the life cycle in land plants [39]. Auxin receptors (TIR1 proteins) and ARFs targeted by miR393 and miR160, miR167, are components of the auxin signalling pathway that regulates several aspects of plant growth and development. The roles of NAC factors (targeted by miR164) include shoot meristem initiation and later root formation in Arabidopsis [40, 41]. Similarly, TCP family transcription factors have several different roles including regulating leaf morphogenesis [42, 43]. In Arabidopsis, seven out of nine GRFs are known targets for miR396 [44], and we have identified four GRFs as targets for miR396 in alfalfa (Table 5). By interacting with its coactivators called GRF-interacting factors (GIFs), this regulatory network (miR396-GRFs-GIFs) regulate leaf size, leaf growth and senescence in Arabidopsis [44]. The known targets for miR397 include laccase, which is involved in oxidative polymerization of lignin in plants [45]. Similarly, miR408 is targeting a family of plantacyanins, which could function in shuttling electron-transfer between proteins [46, 47].

The miR398 family is known to target CSDs and a copper chaperone for superoxide dismutase (CCS) genes in plants [28, 38]. In this study, we have identified plastocyanin-domain like proteins (plastocyanin is an essential electron carrier which shuttles the electrons between cytochrome b6f and PS I) represents a novel target for miR398. Protein disulphide isomerase (PDI) is a member of a family of dithiol/disulfide oxidoreductases, the thioredoxin superfamily, which functions in the formation of disulphide linkage between the cysteine residues for proper protein folding [48]. Our degradome analyses confirms that PDI represents a novel target for miR398 in alfalfa (Table 5). The other confirmed miRNA target transcripts include Leucine rich repeat resistance (LRR) proteins (TIR-NBS-LRR and CC-NBS-LRR) that play important roles in plant pathogen recognition and activation of plant innate immune responses [14, 16, 18, 32]. Yet another interesting target include the F-box protein interaction domain proteins that are regulated by miR2643, one of the very abundantly expressed miRNA in alfalfa.

Conclusions

The analyses of small RNA libraries from the whole plants, shoots and roots resulted in the identification of 100 miRNA families that included highly conserved miRNAs as well as miRNAs that are at least conserved between M. truncatula and alfalfa. The conserved miRNA profiles share some similarities and a few differences between genotypes and types of tissues (roots and shoots). The tissue-specific profiles were used to identify miRNAs that are highly abundant as well as those miRNAs that are expressed at low levels. Additionally, 17 novel miRNAs with varying levels of expression were also identified in alfalfa. The present study also reports identification of 69 targets for 31 miRNA families. In addition to the conserved targets for conserved miRNAs, a few non-conserved targets such as the PDI for miR398 were confirmed. Similarly, miR2643 is targeting three transcripts encoding F-box protein interaction domain containing proteins in alfalfa. In summary, the results from this study have increased our understanding of miRNAs and miRNA-mediated gene regulation in alfalfa that could result in potential tangible targets for practical applications in alfalfa and related legume species to increase biomass yield and address abiotic and biotic limitations to agricultural productivity.