A putative role for amino acid permeases in sink-source communication of barley tissues uncovered by RNA-seq
- First Online:
- Cite this article as:
- Kohl, S., Hollmann, J., Blattner, F.R. et al. BMC Plant Biol (2012) 12: 154. doi:10.1186/1471-2229-12-154
The majority of nitrogen accumulating in cereal grains originates from proteins remobilised from vegetative organs. However, interactions between grain filling and remobilisation are poorly understood. We used transcriptome large-scale pyrosequencing of flag leaves, glumes and developing grains to identify cysteine peptidase and N transporter genes playing a role in remobilisation and accumulation of nitrogen in barley.
Combination of already known and newly derived sequence information reduced redundancy, increased contig length and identified new members of cysteine peptidase and N transporter gene families. The dataset for N transporter genes was aligned with N transporter amino acid sequences of rice and Arabidopsis derived from Aramemnon database. 57 AAT, 45 NRT1/PTR and 22 OPT unigenes identified by this approach cluster to defined subgroups in the respective phylogenetic trees, among them 25 AAT, 8 NRT1/PTR and 5 OPT full-length sequences. Besides, 59 unigenes encoding cysteine peptidases were identified and subdivided into different families of the papain cysteine peptidase clade. Expression profiling of full-length AAT genes highlighted amino acid permeases as the group showing highest transcriptional activity. HvAAP2 and HvAAP6 are highly expressed in vegetative organs whereas HvAAP3 is grain-specific. Sequence similarities cluster HvAAP2 and the putative transporter HvAAP6 together with Arabidopsis transporters, which are involved in long-distance transfer of amino acids. HvAAP3 is closely related to AtAAP1 and AtAAP8 playing a role in supplying N to developing seeds. An important role in amino acid re-translocation can be considered for HvLHT1 and HvLHT2 which are specifically expressed in glumes and flag leaves, respectively. PCA and K-means clustering of AAT transcript data revealed coordinate developmental stages in flag leaves, glumes and grains. Phloem-specific metabolic compounds are proposed that might signal high grain demands for N to distantly located plant organs.
The approach identified cysteine peptidases and specific N transporters of the AAT family as obviously relevant for grain filling and thus, grain yield and quality in barley. Up to now, information is based only on transcript data. To make it relevant for application, the role of identified candidates in sink-source communication has to be analysed in more detail.
KeywordsBarley Vegetative organs Developing grains N remobilisation N accumulation RNA-seq Cysteine peptidases N transporter genes Source-sink communication
Amino acid permeases
Amino acid transporters
Days after flowering
Endosperm transfer cells
Assembly 35 of HarvEST:Barley v1.83 
Principle component analysis
In crop plants more than 70% of seed nitrogen is remobilised and translocated from vegetative tissues such as stems and senescing leaves . Remobilisation of N follows different time courses, and contributions of various organs and tissues to N economy of developing seeds differ . In cereals, flag leaves and glumes maintain their metabolic activity longer than other vegetative tissues, and their contribution to the final grain yield is high .
Up to 75% of reduced nitrogen in photosynthetically active leaf cells is located in the chloroplasts. Ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) represents the major fraction of chloroplast nitrogen . Before nitrogen is exported to the phloem, Rubisco must be degraded to peptides and amino acids . Gene expression analysis in wheat and barley identified several cysteine protease genes with enhanced transcript levels during leaf senescence [6, 7, 8]. Certain C1A-type (papain-type) cysteine proteases and possibly also S10-type serine carboxypeptidases are involved in bulk degradation of stromal proteins during leaf senescence . Both types of proteases are potentially synthesised at the endoplasmatic reticulum and channelled by the secretory pathway, which suggests routing to the lytic vacuolar compartment such as small senescence-associated vacuoles . High expression and strong upregulation of genes encoding papain-like cysteine peptidases suggests an important role for especially those family members in naturally senescing barley leaves between 7 and 21 DAF .
During senescence, cellular proteins are degraded into peptides and amino acids. Efficient partitioning of amino acids or peptides within the plant requires active transporters to transfer N compounds across cellular membranes . For plants with fully sequenced genomes (e.g. Arabidopsis and rice), the Aramemnon database [11, 12] provides annotation and further information for the complete collection of putative N transporter genes, whereas to date only four sequences for putative barley N transporters are listed. Based on sequence similarity, amino acid transporters were grouped into members of the ATF (amino acid transporter family) and the APC (amino acid-polyamine-choline) families. The ATF family can be further divided into AAPs (general amino acid permeases), LHTs (lysine-histidine transporters), proline transporters (ProTs) as well as into transporters with substrates like γ-aminobutyric acid (GATs), aromatic and neutral amino acids (ANTs) and indole-3-acetic acid (AUXs) [10, 13, 14]. Subdivision of the APC family reveals transporters for cationic (CATs) and L-type amino acids (LATs), as well as the GABA permeases (GAP). Overall 63 (Arabidopsis) and 80 (rice) candidates cluster into these groups.
Peptide transport in plants is accomplished by two gene families, the oligopeptide transporters (OPTs) transporting tetra- and pentapeptides and transporters for di- and tripeptides belonging to the nitrate/peptide transporter family (NRT1/PTR) . In Arabidopsis and rice, 53 and 81 members belong to this group, while 9 and 8 transporters are annotated as OPTs. Whereas a relatively high number of Arabidopsis amino acid and NRT1/PTR transporters are functionally characterized (for reviews see [10, 13]) the information for monocots, especially for barley is scarce. The best characterised monocot peptide transporter is HvPTR1 localized in the scutellum of barley grains and responsible for mobilisation of peptides from endosperm into germinating embryos [15, 16]. OsPTR6 was shown to transport Gly-His-Gly . From the OPT family, only one monocot sequence (OsGT1) has been functionally characterised so far .
Numerous transporters contributing to iron trafficking in plants are described and were functionally characterised for grasses. This is due to the fact that grasses evolved a distinct mechanism to acquire Fe from the soil best described as ‘chelation’ strategy . Strong Fe chelators called phytosiderophores (PS), are synthesised by the plant and secreted into the rhizosphere, where they bind Fe(III). The Fe(III)-PS complex is than taken up by Fe(III)–PS uptake proteins [20, 21] called Yellow Stripe-Like (YSL) transporters. Several YSL transporters have been identified and characterised (see for instance [22, 23, 24, 25]), among them the barley transporters HvYSL5 , HvYSL2  and HvYS1 . The role of YSL transporters in remobilisation and grain filling is unclear yet. YSL transporters are distantly related to the OPT family . In Arabidopsis and rice, 8 and 18 sequences belong to the YSL group.
Although numerous plant amino acid and peptide transporters have been identified and some of them functionally characterised, it is difficult to determine which are the most important for plant N recycling on both the source and the sink side. For barley, this situation is even more complicated as the genomic sequence is only partially assembled . Furthermore, only 0.06% (264 ESTs) from 444,652 barley ESTs in assembly 35 of HarvEST:Barley v1.83 (H35, ) represent sequences expressed in glumes and those 33,376 ESTs (7.47%) derived from leaf cDNA libraries are not representative for remobilising flag leaves. Sequence information from EST collections might also be reduced for membrane-associated compounds because of high instability of respective E. coli clones.
Next generation sequencing (NGS) technologies offer new opportunities to analyse plants without fully sequenced genomes. Transcriptome large-scale parallel pyrosequencing was addressed to flag leaves, glumes and developing grains in order to analyse remobilisation and import of N compounds immediately before and after seed set. Data evaluation was focussed on two specific groups of genes responsible for remobilisation and accumulation of nitrogen, cysteine peptidases and N transporters. Combination of publicly available and RNA-seq data reduced redundancy, increased length of gene-specific contigs and identified new members within the respective gene families. Members of the AAT gene family were over-represented in the set of RNA-seq N transporter sequences. Sequence alignment allowed to reconstruct 25 full-length AAT genes. Based on temporal expression profiling of these genes we hypothesise that establishment of high N-sink strength in developing grains is perceived in flag leaf and glumes, the tissues in close proximity to developing seeds. We postulate that metabolites communicate the increasing sink strength to the remobilising tissues by modulating transcript amounts as shown here for amino acid permeases. Thus, AAT gene activities might be involved in source/sink communication in barley. In addition, fluctuating transcript abundances of AAT genes especially in flag leaves might reflect tissue-specific regulation of sink/source transition.
RNA-seq and sequence assembly
mRNA was prepared from barley flag leaves, glumes and caryopses collected at different stages of grain development. Equal amounts of RNA were combined from each stage at 2 day intervals, from 4 days before anthesis up to 24 days after flowering (DAF) for flag leaves and glumes, between anthesis and 24 DAF for caryopses.
Output of large scale RNA-seq and sequence assembly
Ø Read length
Ø Contig size
Annotation of RNA-seq contigs and singletons
No. of contigs
Total no hits
No. of singletons
Total no hits
Percentages of total contig hits are comparable between glumes and grains (88.73% and 88.56%) but are higher for flag leaves (90.34%). BLAST searches against the different databases revealed 14.1% (FL), 11.3% (GL) and 10.1% (G) of new contig sequences as not functionally described in H35. For the total no hit category, results for flag leaves are different from those of glumes and grains (9.7%, 11.3% and 11.4%, respectively). Percentages of total-hit singletons are comparable for flag leaves and grains (51.1% and 50.8%) but higher for glumes (69.5%). This result coincides with the observed low number of glumes singletons (Table 1) and indicates lower complexity of the glumes transcriptome compared to flag leaves and grains.
The sum of contigs and singletons annotated from flag leaves (33,743 + 38,119) yields 71,862 expressed genes, which is clearly higher than the number of unigenes from H35 (50,938). This may reflect high redundancy of the flag leaf dataset, which is also obvious for grains (62,467 annotated sequences) but not for glumes (40,616 annotated sequences). This can be explained by the fact that only RNA-seq reads were used for the assembly process. Obviously, some of the RNA-seq singletons and/or contigs represent the same gene, but do not overlap and thus increase the numbers within the total hit category. The contribution of contigs should be lower than that of singletons.
Tissue-specificity of contigs
There are two main findings: (i) flag leaf and grain transcriptomes are highly similar in all depicted categories and (ii) the glumes transcriptome differs from flag leaves and grains, but is comparable to H35. Categories “trans-membrane transporter activity”, “substrate-specific transport activity” and to lesser extent “transferase-activity” and “nucleotide binding” are enriched in glumes. Especially for “ion binding”, “nucleic acid binding”, “lyase and ligase activities”, “oxidoreductase activity” and “cofactor binding”, the glumes transcriptome is more similar to that of H35 than to those of flag leaves and grains.
RNA-seq gained new sequence information for N transporters and cysteine peptidases
RNA-seq contigs of N transporters and cysteine peptidases cluster with annotated homologs from Arabidopsis and rice and enlarge available full-length sequences
Papain-like cysteine peptidases
For all further analyses, sequences derived from the combination of pyrosequencing and H35 as well as unique pyrosequences were considered.
Putative papain type cysteine peptidases in different plant species
The new dataset was aligned with all N transporter amino acid sequences of rice and Arabidopsis derived from Aramemnon  and phylogenetic trees were constructed.
From 71 RNA-seq contigs of the barley NRT1/PTR family only 45 cluster into the four subfamilies defined by Tsay et al. , while 26 sequences form a separate branch (data not shown). As BLAST analysis showed no obvious differences between these 26 and the other 45 candidates in sequence similarity to the clustering rice sequences (data not shown), a unique group in barley seems unlikely and these sequences were omitted from the tree. This deviant behaviour might be explained by the overall heterogeneity within this group or the limited sequence information of these outliers (average length of 184 aa compared to 305 aa for sequences that clustered).
Additional contribution from sequence analysis of barley full-length cDNAs
Comparison of N transporter and cysteine peptidases (CPEP) sequence information from full-length cDNA* and RNA-seq data
Found in H35
Both approaches identified additional and so far unknown sequences. Matsumoto et al.  identified 36 novel putative N transporter and CPEP genes, 36 novel genes were detected by pyrosequencing. In summary, sequence information of expressed barley N transporters and cysteine peptidases comprises 78 AAT, 71 NRT1/PTR, 29 OPT and 120 CPEP unigenes (Table 4). All RNA-seq unigenes can be considered as expressed in flag leaves, glumes or grains and are barley-specific as checked against the barley genome sequence .
Expression profiling of AAT genes revealed coordinate distinct developmental stages in flag leaves, glumes and grains
Overview of sequence information on N transporter genes
N transporter group
AATgenes with high expression in flag leaves, glumes and developing grains*
To visualise relationships between the three tissues, principle component analysis (PCA) was applied to the tissue-specific qRT-PCR results. Then, K-means clustering was used to identify developmental stages that might be related to each other. The results of the two-step procedure are depicted in the lower panels of Figure 9. In each panel, coloured areas represent related stages. For all three organs, a group including stages 8 and 10 DAF was identified (violet areas). Besides, developmental stages representing the late phase of grain development form separate groups (areas coloured in yellow). K-means clusters coloured in green represent stages of early development. They are highly dispersed, especially between flag leaves and glumes (lower panels of Figure 9A, B).
The majority of N accumulating in cereal grains originates from proteins remobilised from vegetative organs, but interactions of grain filling and remobilisation are only poorly understood. Here we used large-scale transcriptome pyrosequencing of flag leaves, glumes and developing grains to identify putative cysteine peptidases and transporters of amino acids, peptides and oligopeptides involved in N remobilisation and retranslocation into developing grains. This approach suggests that distinct amino acid transporters might be important in sink-source communication between remobilising organs and accumulating grains.
RNA-seq revealed the specific character of the glumes transcriptome
The read numbers gained by transcriptome sequencing are similar with about 0.5 million for each organ with higher values for flag leaves (Table 1). Also average length and number of reads per contig are comparable between the three organs. In glumes lower contig numbers but higher read numbers per contig were found and furthermore the number of singletons in the glume transcriptome is only one third compared to flag leaves and grains. This suggests either higher specificity or lower complexity of the glumes transcriptome.
Comparison of the three transcript sets as visualised in Figure 1 revealed high similarity between flag leaf and grain contigs (34.8% of identical sequences). On the other hand, only about 5% of glumes sequences are identical with those in flag leaves or grains, pointing again to either higher specificity or lower complexity of the glumes transcriptome.
Another argument underlining the specific character of the glumes transcriptome comes from annotation of organ-specific RNA-seq contigs and its comparison to H35 based on gene ontology terms (Figure 2). These results suggest a different function of the glumes transcriptome compared to the two other tissues especially regarding transport activity. Potential functions of the glumes transcriptome are more similar to these of H35 than to those of flag leaves and grains. H35 contigs consist of ESTs derived from several different tissues. Thus, functional annotation of the glumes transcriptome points to expression patterns representing an average of many tissues. This indicates that annotated functions in glumes are less tissue-specific, whereas transcriptomes of flag leaves and grains seem to be tissue-specific and functional annotation indicates similarity between the two organs.
In summary, comparisons between the transcriptomes of flag leaves, glumes and grains indicates that gene expression in glumes is less tissue-specific and might be characterised by higher activity of a lower number of genes. The glumes transcriptome seems to be different from those of flag leaves and grains whereas the latter two organs reveal functional similarity to each other.
Glumes might function as mediator between remobilising vegetative tissues and accumulating grains
Relative to grains, both flag leaves and glumes are source organs. Nitrogen mobilisation during grain filling and the role of flag leaves and glumes have been studied predominantly in wheat [3, 42, 43]. These studies revealed different cellular organisation and distribution of glumes compared to leaves of the same developmental stage . Glumes have more sclerenchyma cells, which serve as a supporting structure for the grain. Compared to flag leaves, glumes contain less green tissue and, consequently, fewer chloroplasts and less Rubisco [44, 45].
During grain development, a decline in the content of soluble proteins is detected in both flag leaves and glumes but patterns of remobilisation differ. Protein content in flag leaves remains constant up to anthesis and declines when grains develop. Glumes continue to accumulate protein until 5 DAF before remobilisation starts. The different initiation time of remobilisation suggests that glumes act as a transient sink for N derived from flag leaves and senescing vegetative organs. These studies indicate that glumes are supplying nitrogen to the grains during later developmental stages .
Glumes contain high percentages of Gln, Pro, Lys, Arg and His . Considerable high contents of Gln, Lys, Arg and His also occur in the nucellar projection (NP) compared to endosperm transfer cells (ETC) at the beginning of grain filling . The NP/ETC complex represents the transfer path between maternal and filial grain tissues and also functions as a metabolic interface to precondition amino acid supply to the developing endosperm. In NP cells, gene expression of different cytosolic isoforms of Gln synthetase (GS) could be involved in re-assimilation of ammonia from protein breakdown and production of N transport compounds . Such a function has been suggested also for GS present in glumes .
In summary, flag leaves and glumes obviously function differently, at least during early grain development when sink strength of the endosperm is still low. Analogies can be observed between glumes and supplying maternal grain tissues. This suggests that the glumes metabolism is adjusted to the changing demands of developing grains and points to a putative function of the glumes as mediator between (remobilising) vegetative tissues and (accumulating) grains.
RNA-seq identified a set of putative cysteine peptidase and N transporter genes possibly involved in remobilising and accumulating of nitrogen
RNA-seq provided new sequence information for cysteine peptidases and N transporter genes compared to H35 (Figure 4), reduced redundancy and increased unigene length within H35 data (Figure 5B). With respect to full-length cDNAs published by Matsumoto et al. , 36 contigs were identified as unique in the collection of pyrosequences (Table 4). While these new sequences might be involved in degradation and retranslocation of N compounds during grain development those cDNAs present only in the H35 or Matsumoto collections should be less relevant for such functions.
Papain-like cysteine peptidases play an important role in naturally senescing barley tissues [8, 9], especially between 7 and 21 days post anthesis . Although several genes encoding cysteine proteases are upregulated during senescence [6, 8, 48, 49, 50], direct evidence for the implication of specific members from this class of proteases in protein degradation is lacking.
Combination of H35 and RNA-seq sequence information identified a set of 59 unigenes that encode Papain-like cysteine peptidases (Table 4). This set represents 21 full-length sequences and 38 unigenes that belong to an unknown number of genes. Some of these sequences might belong to the same gene but cannot be aligned (redundancy problem). These candidates can be considered as active between anthesis and DAF 24 in at least one of the three tissues. In comparison, the total number of cysteine peptidase genes from the same peptidase family of rice and Arabidopsis is high (88 candidate genes in both species) pointing to a certain degree of specificity of the newly assembled contigs for remobilisation. In the C01 and C85 families most unigenes (26 and 17), as well as most new full-length sequences (4 and 2) were found (Table 4), predestining their members as promising candidates. Analysis of tissue-specificity and localisation of the respective gene products to defined cellular compartments remain to be done.
Amino acid permeases seem to be predominant in N retranslocation and grain filling
AAPs seem to be predominant in N retranslocation and grain filling. This conclusion was derived from over-representation of this gene family in the set of RNA-seq derived N transporter sequences (Table 5) and from its very strong expression in both, source and sink tissues (upper panel of Figure 9, Table 6). Two of the highly expressed putative AAP genes (HvAAP2, HvAAP6) are active only in the source tissues flag leaves and glumes, two others (HvAAP4 and HvAAP7) are expressed in source as well as sink tissues. Among the putative transporter genes listed in Table 6, HvAAP3 is specific for grains. HvAAP3 but also HvAAP4 show high sequence similarity to AtAAP1 and AtAAP8 (Additional file 1: Figure S1). The two Arabidopsis transporters play a role in supplying developing seeds with nitrogen [51, 52]. HvAAP2 and HvAAP6 are closely related to AtAAP2 and AtAAP5 (Additional file 1: Figure S1). AtAAP5 is expressed in mature leaves, stems and flowers and involved in long-distance transfer of amino acids, especially glutamine, the predominant amino acid found in the phloem . Promoter-reporter gene fusions showed that AtAAP2 is expressed in vascular tissues of stems and siliques. Furthermore, AtAAP2 expression is tightly associated with phloem strands that connect to fruits. Thus, AtAAP2 seems to be an excellent candidate for xylem-phloem transfer along this path [54, 55], a role that might also be assumed for HvAAP2 and HvAAP6. HvAAP7 is a member of a separated branch of the AAT tree harbouring only uncharacterised rice and barley sequences (Additional file 1: Figure S1). Because of its high expression in flag leaves and glumes during grain filling, HvAAP7 can be considered as being an interesting candidate for functional studies in barley.
Besides members of the AAP family, the two putative transporters HvLHT1 and HvLHT2 seem to be specifically important for N retranslocation in flag leaves (HvLHT2) and glumes (HvLHT1). At the sequence level, the two proteins are closely related to each other and to the functionally characterised OsHT1 . Because of its high expression and strong tissues-specificity (see upper panels of Figure 9), HvLHT1 might be an excellent candidate to elucidate the specific role of glumes for N supply to the developing grains. Two members of the ANT gene family (HvANT3, HvANT4, Table 6) are highly expressed in developing grains (Figure 9, upper right panel). Because only one member of the large ANT family is functionally characterised so far (AtANT1, ), any hint to possible functions of HvANT3 and HvANT4 in grain filling is missing.
A putative role for amino acid transporters in sink-source communication
Seed sink strength for N, which means the ability of the grain to attract and import N compounds, is due to high storage protein synthesis and high demand and/or intensity of active uptake via membrane-localised transporters . Recent work in our lab demonstrated that increasing sink strength due to overexpression of an amino acid transporter in legume seeds increases amino acid supply, total seed N and protein content [59, 60].
In barley grains, highest expression of storage protein genes occurs between 10 and 12 DAF . Storage protein accumulation starts two days later in aleurone and starchy endosperm cells . Simultaneously, when high N sink strength is initiated a set of AAT genes is transcriptionally activated in grains (Figure 9C, upper panel). Remarkably, expression of these genes is low between 8 and 14 DAF, but higher during early development. PCA and K-means clustering of AAT gene expression data, clearly separate three groups of data points belonging to stages 4 and 6 DAF, 8 to 14 DAF and 16 to 24 DAF (lower panel of Figure 9C). These groups have been assigned to pre-storage, intermediate (transition) and storage phases of barley grain development, respectively. This staging of grain development has been deduced from transcript profiling of 12,000 grain-expressed unigenes. Data evaluation justified the intermediate phase between 6 and 10 DAF . Considering expression profiles of AAT genes alone, the intermediate phase would start two days later and would be prolonged to 14 DAF (lower panel of Figure 9C). This reflects the interval between beginning starch accumulation in the differentiated caryopsis centre (6 DAF, ) and high storage protein synthesis in the peripheral parts of the grain. Such difference in the beginning of the transition phase reflects delayed beginning of protein accumulation compared to starch biosynthesis, and also reveals the internal gradient of caryopsis differentiation.
The highly expressed members of the AAT gene family in flag leaves and glumes differ from those in filling grains (upper panels of Figure 9), but phases of grain development are also reflected in the supplying organs flag leaves and glumes (lower panels of Figure 9). In grains and glumes, transition phase and grain filling include the same stages (8 to 14 DAF and 16 to 24 DAF, respectively). This supports the hypothesis that glumes adjust metabolism according to the specific demands of the grains. Remarkably, expression of AAT genes in flag leaves is elevated four days earlier than in glumes and grains (upper panels of Figure 9). Thus, flag leaves seem to respond to the expected demand for amino acids before sink strength is established in grains and respectively, the transition phase starts two days earlier. Striking differences are visible between flag leaves and glumes during pre-anthesis and early grain development (−4 to 6 DAF). This strengthens the assumption that glumes function in a distinctive way compared to flag leaves, at least during early grain development.
Increasing N demand can generate long-distance signals within the plant . Possibly certain N compounds or amino acids could be translocated through the phloem and its fluctuating levels might signal the nitrogen status of the plant. Especially glutamate has been suggested to function as an evolutionary conserved long-distance signal in plants as well as in animals . Cytokinins can also be involved in signalling the N status of the plant [66, 67]. The phloem might be important in delivering signals to distantly located plant organs. In this way, high grain demands for N might decrease assimilate levels in the phloem which could generate signals for remobilisation in the source.
We hypothesise that in such a way phases of grain developmen could be perceived in the ear-near tissues flag leaf and glumes. This would suggest development-specific signalling which mediates sink-source communication during grain development and which also might regulate AAT gene expression. Tissue-specific regulation of sink/source transition can also play a role as observed from fluctuating transcript abundances of AAT genes especially in flag leaves. Overall, such hypothetical relationship in sink-source communication has been derived from expression profiles of a collection of genes which transcripts are over-represented in a specific set of pyrosequences which demonstrates the power of this approach.
Analysis of the overall dataset showed, that flag leaves and glumes obviously have different functions during early grain development when endosperm sink strength is low. Analogies in gene expression observed between glumes and the supplying maternal tissues indicate that glumes function as mediators between remobilising vegetative tissues and accumulating grains. Combination of already known and newly derived sequence information reduced redundancy, increased contig length and identified new members of cysteine peptidase and N transporter gene families. Participation of the respective gene products in either N remobilisation or accumulation can be expected. Amino acid permeases (AAPs), a sub-group of the AAT family of N transporters seem to be predominant in N retranslocation and grain filling. In phylogenetic trees, putative HvAAP genes which are highly expressed in remobilising tissues cluster together with functionally characterised Arabidopsis transporters responsible for long-distance transport of amino acids. In contrast, grain-specific AAPs are most similar to Arabidopsis transporters active in developing embryos. Based on expression profiling of AAT genes and subsequent statistical data analysis we hypothesise that high grain demands for N might decrease assimilate levels in the phloem which could generate signals for remobilisation in the source. Our future scientific work will be focussed on identification of metabolic/hormonal phloem components, which signal the grain N status to the plant.
Overall, cysteine peptidase and N transporter sequences as identified in this study might be of high interest for applied research because of their obvious role in N partitioning for grain filling. Up to now, information is based only on transcript data. For application of this knowledge in development of new breeding strategies, the specific role of individual candidates (for instance specific cysteine peptidases, LHT, AAP and ANT genes) in N remobilisation and accumulation has to be clarified.
Plant growth and RNA preparation
Barley (Hordeum vulgare L.) plants of cv. Barke were grown in pots with Substrat2 (Klasmann-Deilmann GmbH, Germany) and fertilized with 10 g Osmocote (Scotts Ind BV, Netherlands) and 0.2% solution of Hakaphos red (Compo GmbH & Co KG, Germany) at 3 leaf and at heading stage, respectively. The plants grew in the greenhouse at 18°C with 16 h of light. Developmental stages for barley grains were determined as described by Weschke et al. . Flag leaves, glume fractions (including palea, lemma and awn) and grain tissues were collected based on grain developmental stages. For flag leaves and glumes, samples were collected in two day intervals starting from 4 days before anthesis until 24 days after flowering (DAF). Grains from 0, 4, 8, 10, 12, 14 DAF were manually dissected into maternal and filial parts and whole caryopses were sampled at 16, 20 and 24 DAF. Total RNA was isolated separately from each tissue at different stages using Purescript RNA isolation kit (Biozym, Hamburg, Germany). To prepare RNA samples for transcriptome sequencing, equal amounts of RNA from all stages were united for each tissue to achieve 27 μg of RNA for each organ. Pyrosequencing of the three libraries using the Roche/454 GS-FLX Titanium technology was done by GATC Biotech (Konstanz, Germany).
Generated raw reads are accessible at EMBL/EBI, European Nucleotide Archive (ENA Project ERP001286, ). All reads were adaptor and quality trimmed using SeqClean . Clustering and assembling was done separately for each library using the TGICL pipeline . The pipeline uses megablast  for pre-clustering and CAP3  for sequence assembly. The overlap settings for assembly were 95% identity and 35 bp overlap (all other parameters were set to default). The best BLASTn  hit of individual reads against all contigs was used to determine read numbers per contig. To cross-check these results, a second assembly has been generated using Newbler , (data not shown).
Comparisons of assembled sequences to public databases and each other were done by BLAST similarity searches  with different E-values. For stepwise blast, a perl script  using several BioPerl packages  was written. To determine Gene Ontology (GO) terms, sequences were analysed using Blast2GO [34, 35].
To obtain in silico expression levels numbers of all single reads matching one contig in the BLASTn query (E-value >1E-20) and derived from the same tissue were summarized.
Identification of N transporter sequences
BLASTn of all contigs from the tissue-specific assemblies was done against N transporter collections (AAT, OPT, NTR1/PTR) selected from H35 . Setting blast E-value to <1E-10 we considered sequences beyond that cut-off as putative candidates. Candidates were assembled with H35 sequences using the standard algorithm of Lasergene 8 (DNAStar Inc, Madison, USA). Newly created contigs and remaining RNA-seq singletons were used for further analysis.
Confirmation of the amino acid sequences was done by BLASTp against Aramemnon  and comparison to rice homologs. Identities between 30-94% (AAT), 59-92% (OPT) and 51-94% (NRT1/PTR) were observed. To exclude contaminations, a BLASTn of these sequences was done against the available barley genomic sequence  and verified the barley specificity with identities between 97 and 100% on nucleotide level.
Basic alignments for tree construction and corresponding sequence distance matrices were calculated using the ClustalW algorithm with Blosum protein weight matrix in Lasergene 8 (DNAStar Inc, Madison, USA) with HvPT1  included as outgroup. For each dataset (AAT, OPT, and NRT1/PTR) mean pairwise distances between sequences were calculated and clustered with the neighbor-joining algorithm in PAUP* . Bootstrap support values were calculated by 1000 bootstrap re-samples for each dataset. Phylogenetic trees were visualised with FigTree v1.3.1 . The EMBL accessions of RNA-seq N transporter sequences and IPK previously unpublished sequences (HvAAP1, HvAAP2, HvPTR2, HvPTR3, HvPTR6) are summarised in Additional file 8: Table S3, the additionally used EST-sequences are available at H35 .
Identification of cysteine peptidase sequences
Cysteine peptidases were identified by BLASTn searches (E-value <1E-10) against known cysteine peptidases sequences from H35 and by BLASTx (E-value <1E-10) searches against cysteine peptidases from Arabidopsis thaliana and Oryza sativa[80, 81]. Candidates from the three tissue libraries were assembled with H35 sequences using the standard algorithm of Lasergene 8 (DNAStar Inc, Madison, USA). The created contigs and remaining RNA-seq singletons were used for further analysis. For annotation and classification of the cysteine peptidases the translated amino acid sequences were compared by BLASTp to those of known cysteine peptidases from barley and homologs from Arabidopsis and rice in the MEROPS database . To exclude contaminations, sequences were blasted against the available barley genomic sequence  and were verified with identities between 93 and 100% at nucleotide level.
Plant material was collected in two day steps (flag leaf −4 – 24 DAF, glumes −4 – 24 DAF, filial grain tissue 4 – 24 DAF) and homogenised at −80°C. Total RNA was extracted using Spektrum Plant Total RNA Kit (Sigma Aldrich, Steinheim, Germany) and treated with RNase-free TURBO™ DNase (Ambion, Life Technologies, Darmstadt, Germany). cDNA was synthesized from 2 μg of total RNA with Superscript™ III (Invitrogen, Life Technologies, Darmstadt, Germany) using poly(dT) and random hexamer primers according to the manufacturer’s instructions. 1 μg diluted cDNA (1:32) was used for qRT-PCR with gene-specific primers (Additional file 9: Table S1). Real time PCR was performed using ABI Prism 7900HT Sequence Detection System and Power SybrGreen PCR Mastermix reagent; data was analysed with SDS 2.2.1 Software (all Applied Biosystems, Darmstadt, Germany). Determination of a suitable reference gene, test of PCR efficiencies and determination of CT values were done according to Radchuk et al. . The CT values were determined for three biological replicates, with three technical replicates for each value, and normalized against actin expression (ΔCT). The arithmetic averages of the ΔCT values were calculated and 2-ΔCT values were used for clustering and visualization of data with Multiple Experiment Viewer v4.7 .
PCA and K-means clustering
The entire set of qRT-PCR data was subjected to principle component analysis (PCA) and analysed using the J-Express software 2011 . Thereby, the first axis was placed in the direction of the largest variance component, the second orthogonal axis in direction of the second largest variance. As about 60% of the total variance is represented along the first axis, and more than 12% along the second, coordinates on these two axes which together represent nearly two third of the total variance are plotted in Figure 9. To center the given data set and to define number of clusters representative for each tissue K-means clustering  was performed using OriginPro 8.1 .
We are grateful to Ruslana Radchuk for support in qRT-PCR analysis and to Angela Stegmann, Elsa Fessel and Gabriele Einert for excellent technical assistance. This work was supported by Deutsche Forschungsgemeinschaft (FOR948, WE 1641/13-1).
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.