Introduction

Water is an essential solvent for cell biochemical reactions and is indispensable for life. Extreme dehydration reduces cell turgor and adversely affects cellular metabolic processes. Prolonged water deficits, such as imposed by severe droughts, result in leaf wilting and ultimately ends in plant death. While the majority of plants are very sensitive to water loss and capitulate under drought stress, several plant species have genetic adaptations ensuring their survival in marginal lands and extreme environments with limited water. There is intense research interest in understanding the molecular responses of plants to drought stress.

Upon sensing soil water deficits, plants activate transcriptional changes enabling them to deploy mechanisms for conserving water, metabolic reprogramming for adaptation to drought stress, and redirection of growth patterns to follow moisture gradients. The signalling events underpinning the adaptive responses to drought are complex and involve abscisic acid (ABA)-dependent and ABA-independent pathways. Dehydration triggers the biosynthesis of ABA1, which regulates plant water balance and osmotic stress tolerance via control of stomatal aperture2 and activation of stress tolerance genes3. ABA binds to its soluble receptor complex, pyrabactin resistance1/PYR1-Like/regulatory component of PYR1/PYRL/RCAR ABA receptors4,5. Receptor binding inhibits protein phosphatase 2C activity4,5,6,7,8,9,10,11, triggering autophosphorylation of SnRK2 kinases12,13,14, which in turn phosphorylate numerous substrates and activate multiple pathways including guard cell closure and drought stress-adaptive gene expression15.

A conserved ABA-responsive element in the gene promoter is an essential cis-acting element for regulating ABA-inducible gene expression16. MYB and MYC recognition sites are additional cis-acting elements identified in the promoters of some ABA-regulated genes17. Activation of ABA-dependent pathways in transgenic Arabidopsis by constitutive overexpression of the transcription factors ABF2, MYC2, or MYB2, leads to improved tolerance to drought/osmotic stress18,19. ABA-independent signalling pathways also operate in activation of stress-responsive genes during drought. Neither the primary receptors involved nor the signalling components that lead to drought-induced gene expression via ABA-independent pathways are known. However, the responsive genes possess a conserved cis-acting element in the promoter sequence known as the dehydration-responsive element (DRE)20. DRE-binding Protein 2A (DREB2A) specifically binds the DRE sequence21 to activate Arabidopsis gene expression in response to drought, high salinity, and heat-shock stress21,22. Constitutive activation of the ABA-independent pathways by overexpression of DREB2A confers increased drought tolerance in Arabidopsis21.

Transcriptomic changes driven by drought-induced signalling reprogram the proteome and cellular metabolism. The functional significance of most of the proteins is not fully understood. However, some of these have a role in signal transduction and activation of further gene expression, while others clearly support the adaptive response strategy to re-establish cellular homeostasis and survival under drought stress. The classes of proteins deployed during plant adaptation to drought were reviewed by Shinozaki and Yamaguchi-Shinozaki3. They include aquaporins for water movement across membranes and enzymes for the biosynthesis of osmolyte sugars, proline, and glycine-betaine, which are important for osmotic rebalancing. Cellular detoxification enzymes, such as ascorbate peroxidase, glutathione-S-transferase, catalase, and superoxide dismutase prevent oxidative damage, while protection of membranes and macromolecules is maintained by chaperones, messenger RNA-binding proteins, late embryogenesis abundant proteins, and similar proteins. The adaptive reprogramming of the transcriptome and proteome is supported by increased protein turnover facilitated by enzymes and proteins, such as ubiquitin, Clp protease, and thiol proteases. Transgenic plants overexpressing some of these genes acquire drought tolerance23, indicating that the gene products really function in stress tolerance.

Most of the research into plant molecular responses to drought has been conducted using drought-sensitive model species, such as Arabidopsis thaliana. Sorghum (Sorghum bicolor L. Moench), a naturally drought tolerant cereal24 with high genetic diversity, is a good model system for studying drought stress-adaptive responses25, especially with a view to identify novel genes that could be used to generate drought tolerant crops. The sorghum genome has been sequenced26 and some transcriptomic27 and proteomic28 analysis of leaf responses to osmotic stress and drought have been reported. We have a longstanding interest in understanding how the extracellular matrix proteome changes during stress-adaptive responses29,30. Our hypothesis is that the extracellular matrix is a repository of signal molecules used for cell-cell communications during stress adaptation, and analysis of this compartment may lead to identification of signal-regulatory proteins with a pivotal role in drought tolerance. Here, we used a sorghum cell suspension culture system to identify differentially expressed proteins in the extracellular matrix during osmotic stress and show that selected targets are differentially expressed in drought-tolerant and sensitive sorghum lines during drought stress.

Results

Identification of sorghum cell suspension culture ECM proteins

We designed experiments to isolate fractions enriched for secreted proteins in the soluble phase of the sorghum extracellular matrix (ECM). Our goal was to identify these proteins and analyse their response to osmotic stress. We used sorghum cell suspension cultures as a source of easily extractable soluble ECM proteins from the culture growth medium. Basing on preliminary data obtained from the growth curve, we used exponential phase 8-day-old cultures for stress treatments. Sorghum cell cultures were treated with 400 mM sorbitol31 and cells harvested every 24 h until 72 h for RNA extraction. We analysed expression profiles of sorghum homologues of Arabidopsis drought marker genes, ERD1 and DREB2A, to monitor the osmotic stress response and establish the optimal time for harvesting cells for protein extraction. We identified sorghum homologues of Arabidopsis ERD1 and DREB2A, which we named ERD1-1 (SORBI_3004G162400), ERD1-2 (SORBI_3006G065100), DREB2A-1 (SORBI_3009G101400), and DREB2A-2 (SORBI_3003G058200). With the exception of DREB2A-2, all the genes were activated by sorbitol treatment, with expression peaking at 48 h (Fig. 1). Therefore, in subsequent experiments, 48 h was selected as the time after sorbitol addition to harvest cell cultures for protein extraction. Use of 4 biological replicates for both sorbitol treatments and controls ensured that proteins with highly reproducible responses were identified.

Figure 1
figure 1

Activation of sorghum ERD1 and DREB2A expression in response to sorbitol. Sorghum cell suspension cultures were treated with sorbitol and cells harvested at the indicated time-points. Gene expression was analysed using qRT-PCR. Error bars represent means ± S.D. (n = 3).

Cell cultures were treated with sorbitol and secreted proteins were isolated from the culture medium by simple filtration of the cell culture and acetone precipitation of the filtrate. ECM protein samples from control and osmotic stressed cultures were then digested with trypsin, labelled with iTRAQ, fractionated by liquid chromatography, and analysed using tandem mass spectrometry. Only proteins with at least 2 sequenced peptides, each with a statistical confidence threshold ≥ 95%, were considered positively identified. A total of 179 different proteins were positively identified in the ECM fractions of sorghum cell cultures. The full mass spectrometry data of these proteins is provided in Supplementary Dataset (Table S2). This dataset represents a snapshot of the sorghum cell culture secretome at 10 days post-subculturing. Some of the 179 proteins have functional annotations in the protein database derived from sequence identity, which include peroxidases, alpha-galactosidases, alpha-mannosidase, endoglucanases, purple acid phosphatase, malate dehydrogenase and xyloglucan endotransglucosylase. Other proteins are annotated as uncharacterized proteins since database annotation is still incomplete. All the functionally annotated and uncharacterized proteins identified here will require experimental validation of protein function. Apart from the sorghum specific proteins, we also identified trypsin and human keratin proteins, which are known contaminants in proteomic analysis. These contaminants serve as defacto positive controls and their identification in interrogating extensive protein databases indicates that protein identification was specific.

Differentially expressed ECM proteins in response to osmotic stress

For quantitative analysis of osmotic stress-related protein expression, a minimum threshold of 2-fold change in protein abundance at a significance level of p ≤ 0.05 was applied to filter the dataset. This resulted in a total of 92 proteins that were differentially expressed in response to sorbitol-induced osmotic stress (Table 1). With the exception of one down-regulated protein, the rest were up-regulated, indicating that sorbitol triggered an overall increase in protein secretion. Next we used the SignalP tool to analyse the protein sequences for the presence of a signal peptide, which targets proteins to the secretory pathway. A predicted N-terminal signal peptide was identified in 54 of these proteins (Table 1), indicating that they are secreted via the classical secretory pathway requiring a leader sequence. The remaining proteins were predicted not to have an N-terminal signal peptide (Table 1). Bioinformatic analysis of the primary sequences was used to detect putative functional domains in the differentially expressed proteins, which were then assigned to specific protein families (Table 1). There were 18 proteins assigned to glycosyl-hydrolases/glycosidases, 5 to cell wall modifying enzymes, 12 to proteases, 27 to redox proteins, and 30 proteins were left unclassified.

Table 1 List of sorghum secreted proteins that are responsive to sorbitol-induced osmotic stress.

Analysis of sorbitol-induced gene expression

The observed increase in the amount of secreted proteins may be a result of increased expression of the genes encoding these proteins or increased translation of the corresponding mRNA. To investigate if osmotic stress transcriptionally regulated some of these candidates, we used qRT-PCR analysis on randomly selected 12 genes from the top 30 proteins of differentially expressed proteins that had been ranked in descending order of the fold-change magnitude (Supplementary Dataset - Table S3). Sorghum cell cultures were treated with sorbitol and samples for RNA extraction harvested 0, 2, 4, 6 and 24 h later. We focused on early transcriptional responses, which precede changes at the protein level analysed 48 h after sorbitol addition. With the exception of SORBI_3002G417800, whose expression did not respond to osmotic stress at any time-point, all the other 11 genes investigated responded significantly to sorbitol at least at one time-point (Fig. 2). However, for Sb0246s002010 and SORBI_3005G132400 the significant response within the first 24 h was transcriptional repression. For the other genes, there was either an initial suppression of gene expression at the early time-points followed by activation (e.g., SORBI_3007G172100), or gene activation without any suppression (e.g., SORBI_3002G302000) (Fig. 2). Taken together, these results show that increased protein secretion into the ECM observed in this study could be driven by transcriptional regulation, post-transcriptional regulation, or regulated at both transcription and translation levels, depending on the specific proteins. Moreover, the different expression profiles across the sampled 12 genes suggest that there is complex coordination of the gene network governing the proteome response to osmotic stress.

Figure 2
figure 2

Sorbitol-induced gene expression. Sorghum cell suspension cultures were treated with sorbitol and cells harvested at the indicated time-points for qRT-PCR analysis. Error bars represent means ± S.D. (n = 3). One, two and three asterisks indicate statistically significant differences between control and sorbitol treatment means at each time-point, p ≤ 0.05, 0.01, and 0.001, respectively.

Analysis of drought-induced gene expression in sorghum plants

Six of the 12 genes analysed by qRT-PCR were activated ≥2-fold in response to sorbitol treatment of sorghum cell suspension cultures (Fig. 2). We then investigated if activation of these 6 genes (S0RBI_3001G342600, SORBI_3007G172100, SORBI_3002G302000, SORBI_3004G142800, SORBI_3002G315800 and SORBI_3009G190800) in the in vitro cell culture system is recapitulated in sorghum plants exposed to drought stress. We selected two sorghum varieties with contrasting drought response phenotypes; the drought-tolerant SA 1441 and “drought-sensitive” ICSB 338. After a period of growth with optimal soil water content, the plants were exposed to drought stress by withholding water for 11 days. Across all the 6 genes, there was a significant difference in drought-induced expression in root tissues of the two sorghum varieties (Fig. 3A,B). Expression of SORBI_3007G172100, SORBI_3002G302000 and SORBI_3009G190800 increased in response to drought, with up-regulation in the drought-sensitive ICSB 338 variety being significantly greater than the tolerant SA 1441 variety (Fig. 3A). Conversely, SORBI_3001G342600, SORBI_3004G142800 and SORBI_3002G315800 were significantly suppressed in the drought-sensitive ICSB 338 while remaining largely unchanged in the drought tolerant variety SA 1441 (Fig. 3B).

Figure 3
figure 3

Drought stress-induced gene expression in sorghum roots. Drought-tolerant SA 1441 and drought-sensitive ICSB 338 sorghum plants were exposed to drought for 11 days and gene expression analysed by qRT-PCR. The control plants were not exposed to drought and had a gene expression value set at 1-fold. Error bars represent means ± S.D. (n = 5). One and three asterisks indicate statistically significant differences between the SA 1441 and ICSB 338 means, p ≤ 0.05 and 0.001, respectively.

In leaf tissues, expression of all 6 genes was up-regulated in the drought-tolerant variety SA 1441 (Fig. 4). Thus, at least within this 6 gene selection, SA 1441 recruited all genes in leaf tissues responding to drought, while only half of them responded in the roots. In contrast, ICSB 338 had very marginal or no response across all genes in leaves, while the roots had a very robust upregulation of 3 genes and suppression of the other 3 genes. Collectively, these results demonstrate that candidates selected from our protein dataset are differentially expressed in sorghum lines with contrasting drought responses.

Figure 4
figure 4

Drought stress-induced gene expression in sorghum leaves. Drought-tolerant SA 1441 and drought-sensitive ICSB 338 sorghum plants were exposed to drought for 11 days and gene expression analysed by qRT-PCR. The control plants were not exposed to drought and had a gene expression value set at 1-fold. Error bars represent means ± S.D. (n = 5). One and two asterisks indicate statistically significant differences between the SA 1441 and ICSB 338 means, p ≤ 0.05 and 0.01, respectively.

Discussion

Drought stress triggers remarkable physiological responses and growth perturbations, which significantly diminish plant biomass and seed yield. These responses are underpinned by changes in gene expression, which are governed by poorly understood signalling processes. As sorghum is a crop that thrives under drought, it is an attractive model crop for gene discovery and studying the mechanisms driving adaptation to drought. Here we used a sorghum cell suspension culture system to obtain fractions enriched for ECM proteins. The ECM is a functional space in which secreted proteins, carbohydrates and other metabolites play a pivotal role in cell growth, cell-cell communication, and responses to changes in environmental factors. A cell culture system is scalable for production of high amounts of secretory molecules for analysis. Moreover, cell cultures are a useful in vitro system, which has been instrumental in key plant science discoveries, such as discovery of the roles of oxidative cross-linking of the cell wall32 or of ROS and nitric oxide33,34 in plant pathogen interactions.

We made three key observations relating to the ECM and sorghum adaptive responses to drought stress. First, there was an overall increase in protein secretion when cells were exposed to osmotic stress. Secretion of over 50% of the soluble ECM proteins identified in this study was upregulated by ≥2-fold. Similarly, an increase in protein secretion was observed in chickpea cell cultures responding to polyethylene glycol treatment35. Previous studies have demonstrated that increased protein secretion is essential for mounting a defensive response to pathogen attack36,37. Because most pathogens invade the ECM space, secretion of a cocktail of antimicrobial proteins is essential in terminating the attack. The surge in protein secretion in response to osmotic stress appears to suggest a key role for the ECM in drought adaptive responses. This might be important, particularly in switching metabolism from optimal growth to stress adaptation. Upon sensing soil water deficits, shoot growth is suppressed and resources are funnelled towards root growth in pursuit of the receding ground water. Programmed cell death may be invoked to kill off root meristems to break apical dominance38 as a strategy to redirect root growth away from water-depleted zones towards available water gradients. The changes in protein expression observed here constitute part of the gene network underpinning these physiological and morphological changes. Proteins are part of the molecular cargo exported into the plant ECM to build the cell wall infrastructure, decorate the external face of the plasma membrane with receptor complexes, and regulate cell division and differentiation30,39. The heightened protein secretion triggered by osmotic stress could play a crucial role in mediating the changes in growth and cellular physiology associated with drought.

The second key finding relates to identification of specific differentially expressed ECM proteins. These fell into four broad functional categories, namely glycosyl-hydrolases/glycosidases, cell wall modifying enzymes, proteases, and redox proteins. Glycosyl-hydrolases/glycosidases are known carbohydrate metabolising enzymes and have diverse substrate specificity40,41. In this study, we identified 18 hydrolases from different families, indicating the wide spectrum of substrate specificity and mechanisms of action. Although the precise role of these enzymes in osmotic stress response is not clear, carbohydrates are important biomolecules, which have structural42 and signalling40 functions. Interestingly, none of these glycosyl hydrolases/glucosidases identified in the present secretome study were reported in a sorghum drought study, which focused on the leaf proteome28. However, glycosyl-hydrolases/glycosidases have also been identified in secretome studies of Arabidopsis responding to both pathogen attack43 and nutritional phosphate deficiency44. A computational functional annotation study attempted to assign putative functions to the 238 uncharacterised sorghum glycoside hydrolases, with stress response functions being ascribed to these enzymes45.

There were 5 cell wall modifying proteins that responded to osmotic stress, which included putative expansin-like and fascilin-like protein families (Table 1). Expansins are known extracellular proteins involved in remodelling cell walls by facilitating cell wall relaxation and extension46; while fascilin domain containing proteins may be involved in cell adhesion processes47. Expansins have been identified in rice secretome studies exposed to rice blast fungus and elicitor48, while a fascilin-like arabinogalactan protein was identified in Arabidopsis secretome following pathogen infection43. Our study indicates that the role for these proteins span several types of plant stress.

Of particular note was the increased secretion of proteases and redox proteins. The identified proteases are putative members of the peptidase, serine carboxypeptidase, aspartic peptidase, gamma-glutamyl-transpeptidase and peptidase subtilisin-related protein families. Proteolytic cleavage of proteins and peptides could be useful in regulating enzyme activity49 and post-translational activation of peptide signals via cleavage of inhibitory domains of pro-peptides50,51. Deployment of these signal regulatory proteins could play critical roles during stress adaptation. Proteolysis could also function in the control of protein turnover, which becomes critical during stress response52. These enzymes have also been identified in previous secretome studies31,44. Several redox proteins, including peroxidases and thioredoxin had increased secretion after imposition of osmotic stress. Peroxidases are important in cell wall lignification53, but are also part of a large protein network that controls the homeostasis of ROS. At low concentration, ROS serve a signalling role54,55, but function in cell death activation at high concentration55,56. Thioredoxin is a molecular switch used for regulating enzyme activity via reducing disulphide bridges linking cycteine residues57,58. Overall, our results indicate that ECM protein networks could play very wide-ranging functions in drought stress adaptive responses.

The third key observation we made was that genes encoding selected candidate proteins are differentially expressed between drought-tolerant and “drought-sensitive” sorghum varieties exposed to drought. We found that selected genes are transcriptionally regulated by sorbitol-induced osmotic stress in the in vitro cell suspension culture system. Analysis of these genes in sensitive versus drought-tolerant sorghum varieties exposed to drought revealed significant differences in expression profiles. Drought activation of gene expression in the sensitive sorghum line was limited to 3 genes mainly in root tissues, with very modest changes in leaves (Figs 3 and 4). Although activation of the same 3 genes in roots of the drought-tolerant SA 1441 was lower than in ICSB 338, the latter also activated all six genes in leaves (Fig. 4). This may indicate that successful drought tolerance requires adaptive gene expression in both subterranean and aerial plant organs. Future genetic experiments could provide functional data of single or multiple genes in adaptive responses to drought. Collectively, these expression profiles indicate two things. First, that the ECM could provide targets for use in enhancing drought tolerance in crops. Because the response of the sorghum varieties to drought differs from each other, then genes/proteins differing in their response to drought between the two varieties could be potential key regulators of drought adaptation. Second, that datasets of differentially expressed ECM proteins under osmotic stress may provide biomarkers that could be used in breeding programmes to rapidly identify drought-tolerant and sensitive varieties.

In conclusion, the ECM is replete with proteins involved in cell growth control, cell communication and cell signalling during responses to environmental stress. A wide range of plant species and experimental systems has been used to study the ECM proteins, including sorghum. This study extends the number of proteins identified in the sorghum ECM from 14 proteins29 to 179 proteins (Supplementary Dataset - Table S2). A large proportion of these (∼72%) possess a predicted signal peptide (Supplementary Dataset - Table S2), which targets them to the secretory pathway59, while the remainder do not possess a signal peptide. This raises the concern of whether the apparent increase in secretion of some of these proteins actually arises from sorbitol-induced cell death and release of intracellular proteins. However, we discount this possibility on the basis of three observations. First, all the proteins identified with increased abundance after sorbitol treatment were also identified in the stress-free control cell cultures. Their secretion in exponentially growing viable control cultures makes cell death an unlikely cause. Secondly, if cell death was responsible, we would have expected to identify many abundant cytosolic house-keeping proteins appearing in sorbitol samples only and absent from control samples. This was not the case. Finally, sorbitol at the concentrations used and time-scale of treatment causes cells to lose water and shrink, with no reduction in cell viability (data not shown). Proteins without a signal peptide identified in our ECM fractions add to the growing number of animal and plant proteins, which are secreted into the ECM via alternative mechanisms not requiring the signal peptide60,61. For example, a leaderless CaRRP1 protein has been confirmed to be a bona fide ECM protein using a YFP-tagged recombinant version of the protein35. Increased secretion of both signal peptide-containing and leaderless proteins is a strong indication that the ECM protein network is part of the molecular machinery deployed when sorghum encounters deficits in soil water content. Importantly, differential expression of some of the target proteins between drought-tolerant and drought-sensitive sorghum varieties implicates the candidates in mediating drought tolerance, though genetic experiments will be required to definitively confirm this.

Methods

Plant material and growth conditions

Seeds of white sorghum previously used for the generation of cell suspension cultures62 were obtained from Professor Bongani Ndimba (Agricultural Research Council (ARC), South Africa). In this study, white sorghum callus and cell suspension cultures were initiated and maintained on Murashige and Skoog Minimal Organics medium under dark conditions as described previously62. The cell cultures were sub-cultured every two weeks and used for sorbitol-induced osmotic stress treatments 8 days post sub-culture. Drought-tolerant (SA 1441) and drought-sensitive (ICSB 338) sorghum varieties were obtained from the ARC-Grain Crop Institute, Potchefstroom, South Africa. Sorghum seeds were sown in potted soil and grown at 25–30 °C under a 16 h-photoperiod. Plants were grown in square pots with a volume of 216 cm3 filled with Levington F2 + Sand compost (ICL Ltd., Ipswich, UK). All plants were well watered until they reached the V3 stage (3 fully expanded leaves with the fourth one emerging) before imposing drought stress by cessation of watering.

Osmotic and drought stress treatments

Eight days after subculturing, sorghum cell suspension cultures were exposed to osmotic stress by treating with 400 mM sorbitol. Control cell cultures were spiked with an equivalent volume of sterile distilled water for the same duration. A time-course sorbitol treatment experiment of sorghum cell suspension cultures was carried over a 72 h period, and expression analysis of drought marker genes ERD1 (early responsive to dehydration 1) and DREB2A was analysed at 0, 24, 48 and 72 h in order to establish the most appropriate time for proteome analysis. For protein analysis, 4 biological replicates of 30 mL each were treated with sorbitol and harvested 48 h later. For RNA analysis, 3 biological replicates of 10 mL each were treated and harvested 0, 2, 4, 6, and 24 h later. For drought stress treatments, well-watered plants at the V3 stage were divided into two groups; control and drought stressed plants. The control plants were watered throughout the experiment as necessary, while water was withheld for 11 days from the drought-stressed plants. Five biological replicates were generated for each group. For leaf samples, each biological replicate was a pool of 3 leaves, each coming from an independent plant. For root samples, a biological replicate consisted of roots pooled from 2 plants. The leaf or root material was snap-frozen in liquid nitrogen and stored at −80 °C for use in RNA extraction.

RNA extraction and analysis

Total RNA was extracted from cell cultures, root and leaf samples using RNeasy Plant Kits (Qiagen, Manchester, UK) according to the manufacturer’s instructions. First strand complementary DNA (cDNA) synthesis was performed using 1.5 µg total RNA template and oligo-(dT)15 using the GoScriptTM Reverse Transcription System (Promega, Southampton, UK) according to the manufacturer’s instructions. Quantitative real-time PCR (qRT-PCR) was performed on the Rotar-Gene 3000 (Corbett Research, Sydney, Australia) using the SensiFAST™ SYBR® No-ROX kit (Bioline, London, UK). The reaction consisted of 10 µl SensiFAST reagent, 0.4 µM each of the forward and reverse primers, and 5 µl of 8-fold dilution cDNA in a final volume of 20 µl. The thermal cycling conditions were as follows: denaturation at 95 °C for 3 min followed by 40 cycles of 95 °C for 10 sec, annealing at 56 °C for 15 sec and extension at 72 °C for 25 sec. All reactions were carried out on 3 technical replicates for each of the biological replicate. Data analysis was carried out using the REST2009 version 2.0.13 software (Qiagen) using Sb03g038910 as a constitutive reference control gene, whose expression does not alter in response to drought stress27. The primer sequences of all genes used are listed in supplementary Table S1.

Protein sample preparation and iTRAQ Labelling

Control and sorbitol-treated cell cultures were filtered through 2 layers of Miracloth to separate the cells from the growth medium. Secreted proteins were isolated from the growth medium by acetone precipitation as described previously63 and solubilised in a solution containing 9 M urea, 2 M thiourea and 4% (w/v) CHAPS. There were 4 biological replicates of controls and the same for sorbitol treatments. Labelling of protein samples with iTRAQ tags was performed as described previously63 with minor modifications. Briefly, for each sample, 50 μg of protein were reduced with tris(2-carboxyethylphosphine) (TCEP) and alkylated with methyl-methane-thiol-sulfonate (MMTS). Thereafter, protein samples were digested at 37 °C for ~16 h using a 1:10 (w/w) trypsin to protein sample ratio, vacuum-dried, re-suspended in triethylammonium bicarbonate buffer (pH 8.5), and labelled with an 8-plex iTRAQ reagent kit (Applied Biosystems Sciex, Foster City, USA) according to the manufacturer’s instructions.

Peptides of the 4 control replicates were labelled with 113, 114, 115, and 116 iTRAQ tags, while sorbitol-treated samples were labelled with 117, 118, 119, and 121 tags. All eight samples were pooled to make one composite sample, which was then vacuum-dried and re-suspended in 3.8 mL of buffer A (10 mM K2HPO4/25% acetonitrile, pH 3.0). Thereafter, the sample was separated into 50 fractions on a PolySULFOETHYL A strong cation exchange column (Poly LC Inc. 200 × 2.1 mm, 5 μm) at 300 nL/min on an Ettan LC (GE Healthcare, Pittsburgh, USA) HPLC system. Peptide separation was performed using a biphasic gradient of: 0–150 mM KCl over 11.25 column volumes and 150–500 mM KCl in buffer A over 3.25 column volumes. A total of 50 fractions were collected over the gradient, and reduced to 30 by pooling those with low peptide concentration. The 30 fractions were dried down and re-suspended in 90 μL of 2% acetonitrile/0.1% formic acid. Aliquots of 20 μL from each fraction were analysed by LC-MS/MS using a QStar Pulsar i mass spectrometer (MDS-Sciex/Applied Biosystems).

Mass spectra data analysis

Mass spectra data were analysed as described previously63, with minor modifications. Briefly, ProteinPilot software 4.5 (Beta) Revision 1656 Paragon algorithm build 1654 (ABSciex) was used for data analysis against the UniProt database sequences for S. bicolor (downloaded in October 2013, 58756 entries) plus 162 known contaminants from proteomic experiments. A minimum score threshold of 2.0 (99% confidence) was set for protein identification and all proteins identified on the basis of a single peptide were filtered out of the dataset, resulting in a total of 179 unique proteins.

For quantitative analysis of the differentially expressed proteins, the abundance of each protein in all samples was calculated as a ratio to the 113-tagged control sample. Averages of the ratios for each protein across the four replicates were calculated. The fold-change in protein expression was denoted by the ratio of control to sorbitol-treated samples. For the down-regulated proteins, the osmotic stressed average was the numerator and the control was the denominator, with a negative sign denoting down-regulation. A probability value for the comparison of the control average to sorbitol average was obtained from the Student’s t-test at 95% confidence.

Bioinformatic analysis

The presence of an N-terminal signal peptide on all identified proteins was predicted using SignalP 4.164. The InterPro65 and Superfamily66 databases were used for protein sequence analysis to identify protein functional domains used for assignments to relevant protein families.

Data availability statement

The datasets generated and/or analysed during the current study are available from the corresponding author on request.