Introduction

The seed of a plant is a structurally and physiologically organised entity; the embryo develops into the plant, the endosperm is enriched with storage compounds to nurture the embryo during primordial growth stages, and a seed coat protects the seed from adverse conditions. Seed formation begins after a double fertilization event followed by embryogenesis with three phases of cell divisions, a maturation phase in which seed storage compounds accumulate, differentiation of the embryo and acquisition of desiccation tolerance. The embryo and endosperm differ in ploidy level and coordinately interact to regulate plant development (Chaudhury et al. 2001). Apart from providing nutrients during seed germination, the endosperm also sustains a high osmotic potential and provides mechanical strength during embryo growth (Lopes and Larkins 1993) and plant growth substances such as cytokines promote cytokinesis and signalling molecules for embryogenesis (Hong et al. 1996; van Hengel et al. 1998).

Seeds occupy immense economic and social relevance to humankind and animals (Shewry and Casey 1999). Majority of the proteins and oil consumed by humans and animals are derived from seeds. Givotia moluccana (L.) Sreem. (white catamaran tree; Euphorbiaceae) is an economically important tree with softwood that is used extensively for making toys, boxes and fancy articles. The genus Givotia is represented by four species that are confined to India, Sri Lanka, Madagascar and Eastern Equatorial Africa (Sreemadhavan 1975). The seeds of G. moluccana have been reported to be rich in oil used to lubricate fine machinery. In addition, its endosperm extracts have been used to improve the digestion of children by Palliyar tribes of Tamilnadu (Geetha and Vijayalakshmi 2013). The endospermic seeds of G. moluccana have a pronounced dormancy of 1–1.5 years with low natural regeneration. The molecular mechanisms underlying this seed dormancy have been unexplored, but understanding the mechanism may lead to improvements in propagation and regeneration. Proteomic analysis is a powerful tool for understanding such complex biological processes as seed dormancy and germination.

Two-dimensional electrophoresis (2-DE), followed by identification of particular spots through matrix-assisted laser desorption ionization–time of flight mass spectrometry (MALDI–TOF–MS/MS) is a powerful technique for obtaining an overview of the differentially expressed proteins in numerous plant parts and also at different stages of plant development (Lin et al. 2003; Cánovas et al. 2004). Studies of proteome differences between embryo and endosperm in tomato, Arabidopsis, rice, wheat, maize and barley have shown changes in abundance of storage proteins, housekeeping enzymes, defense-related enzymes and proteins involved in metabolic processes (Runeberg-Roos et al. 1994; Gallardo et al. 2001; Méchin et al. 2004; Komatsu 2005; Sheoran et al. 2005; Bønsager et al. 2007; He et al. 2015). Liu et al. (2009) reported that significant groups of differentially abundant proteins in the embryo and endosperm of Jatropha curcas were associated with metabolism and disease/defence. The seed morphology of Givotia shares similarities with those of the most-studied species of Euphorbiaceae, Jatropha and Ricinus, but despite the economic importance of Givotia species, there are no reports on the proteomes in their seeds. The present study is thus aimed at analyzing the proteome of embryo and endosperm to discern differentially abundant proteins and understand their function.

Materials and methods

Plant material

Mature fruits of G. moluccana were collected from the Forest Research Station, Mulugu, Telangana during January–February 2016 and dried in the sun until the moisture content was lower than 10%.

Seed size and mass

The seeds were removed manually from the dried fruits with the help of secateurs, and seed coats were broken with the help of a bench-vice tool. The seed coat, embryo and endosperm were separated manually using sterile forceps and weighed. Embryo and endosperm from freshly harvested dried fruits (< 10% moisture content) were frozen in liquid nitrogen and stored at − 80 °C until further use.

Protein extraction

Proteins were extracted from embryos and endosperms separately using the protocol described of Shen et al. (2003). About 100 mg each of embryo and endosperm were ground into fine powder in liquid nitrogen with 100 mg of polyvinylpolypyrrolidone and transferred into microcentrifuge tubes containing pre-chilled extraction buffer (50 mM Trizma base pH 7.5, 250 mM sucrose, 10 mM EDTA, 1 mM phenylmethanesulfonyl fluoride (PMSF), 1 mM dithiothreitol (DTT) and 1% Triton X-100). The homogenate was stirred at 4 °C for 10 min and centrifuged at 13,000×g for 30 min at 4 °C. The supernatant was collected into a fresh tube and mixed with equal volumes of 30% trichloroacetic acid (v/v) and incubated at − 20 °C overnight, then centrifuged at 12,000×g for 15 min at 4 °C. The pellet was washed thrice with pre-chilled 100% acetone containing 6.5 mM DTT, then centrifuged. Finally, the pellet was vacuum-dried for 3 min at 30 °C and allowed to dissolve overnight in sample buffer containing 7 M urea, 2 M thiourea, 4% w/v 3-[(3-cholamidopropyl) dimethylammonio]-1-propanesulfonate hydrate (CHAPS), 2% w/v ampholytes and 6.5 mM DTT. Protein concentration was determined using the amido-black method with BSA as the standard (Henkel and Bieger 1994).

2-D electrophoresis and image analysis

2-DE was performed according to Rani et al. (2015). In brief, proteins were separated at 20 °C in the first dimension (GE Healthcare Ettan IPGphor 3 Isoelectric focusing system) by loading 500 µg of protein sample on 18 cm immobilized pH gradient (IPG) strips with linear pH gradient of 4–7 (GE Healthcare). Before SDS-PAGE in the second dimension, the strips were equilibrated for 20 min with equilibration buffers containing 6 M urea, 30% v/v glycerol, 2% w/v SDS, 50 mM Tris-HCl pH 8.8 with 2% w/v DTT, followed by 2.5% (w/v) iodoacetamide. After equilibration, the IPG strip was separated on 12% SDS-PAGE in EttanDalt6 vertical electrophoresis system (GE Healthcare) at 20 °C at 150 V and stained with Coomassie blue G-250.

Image and data analyses

The 2-DE of embryo and endosperm protein samples was performed with three biological replicates, and 2-DE gel images were obtained by scanning the gels with a Magic Image scanner (GE Healthcare). Differentially expressed spots in 2-DE gels were analysed through ImageMaster 2D platinum software version 7.0 (GE Healthcare). Triplicate 2-DE gel images of embryo and endosperm were aligned, and spots were detected automatically. Match set classes were created, and gels were further aligned by selecting four spots as landmarks. Normalization of spots was done by considering percentage of the volume of each spot relative to the total volume of all spots in the gel. The relative change in protein abundance between embryo and endosperm was determined by comparison with embryo gels. The protein spots that differed by ≥ 1.5-fold between embryo and endosperm were considered as more abundant and ≤ 0.6 as less abundant. This arbitrary cut-off was found to be statistically significant in proteome studies (Rani and Podile 2014). The differentially expressed protein spots were selected for further analysis after Student’s t test (p ≤ 0.05).

Protein identification

Spots of interest were excised from the stained 2-DE gels and digested by sequence-grade trypsin (Promega, Madison, WI, USA) as described by Rani and Podile (2014). After trypsin digestion, peptides were analyzed by MALDI-TOF mass spectrometry using a BrukerAutoflex III Smartbeam (Bruker Daltonics, Bremen, Germany). The obtained data were used in a search of the National Center for Biotechnology Information (NCBI) or Swiss-Prot databases using Mascot software (http://www.matrixscience.com) and BioTools 3.1 software (BrukerDaltonics, Bremen, Germany). The taxonomic category was set to Viridiplantae (green plants), and other search parameters included fixed modification of carbamidomethyl (C), variable change of oxidation (M), enzyme trypsin, peptide tolerance of 100 ppm and MS/MS tolerance of 0.4 Da. The protein identity was accepted only if the MASCOT probability was at a significant threshold level (p ≤ 0.05), and at least two peptides matched.

Results

Seed size and mass

Seeds of G. moluccana are oval with a diameter of 11–16 mm and comprise embryo with two thin cotyledons that are embedded in the large endosperm (Fig. 1). The dry mass of a single seed varied from 0.6 to 1.1 g. The seeds constituted 17.7% of the fruit mass. It is composed of the seed coat, embryo and endosperm, which constituted 62.8%, 2.3% and 34.9% of the total seed mass, respectively (Fig. 2). The seed coat comprises two layers: a hard brown episperm with a strongly attached thin, white endopleura.

Fig. 1
figure 1

Morphology of different parts of Givotia moluccana fruit. Scale bar = 1 cm

Fig. 2
figure 2

Mean dry mass (± SD) of different parts of Givotia fruit. Three independent experiments were done, each with three replicates

Embryo and endosperm proteome profiles

The total soluble proteins of the embryo and endosperm were profiled by 2-DE gels. Scatter plots drawn between three independent gels of the embryo and endosperm proteome revealed similar spot pattern in each experiment (Fig. 3). Coomassie blue staining followed by imaging of the gel by Image scanner revealed 492 spots in the embryo and 432 spots in the endosperm (Fig. 4). The protein spots visualized on the 2-DE gels were found in the range of 4–7 pH. Based on the protein distribution, molecular mass and pI range, the spotting pattern of embryo and endosperm proteins on the 2-DE maps had many similarities; a few protein spots differed in abundance (Fig. 5). The protein distribution between the embryo and endosperm was compared by grouping the spots horizontally (0.5 pI was considered as 1 unit, and pI ranged from 4 to 7; Fig. 6) and vertically (using known protein molecular mass markers; Fig. 7). Both histograms had high similarities in proteome distribution patterns for the embryo and endosperm (Figs. 6, 7). Most of the proteins from the embryo and endosperm were distributed similarly in the pI range from 4 to 6, comprising 70% of the total proteins (relative quantity). The protein spots of the embryo were found to be 8% less at pI of 6–6.5 and 3% higher in the pI range from 6.5 to 7 compared to endosperm proteome map (Fig. 6). Both proteomes were in the molecular mass range starting from 200 kDa to less than 30 kDa (Fig. 7) and revealed many similarities in the distribution pattern of protein spots between these tissues with respect to molecular mass. The majority of the proteins (80%) were in the range of molecular weight from 100 to 25 kDa as calculated by ImageMaster software. However, certain protein spots differed between the endosperm and embryo (Fig. 7).

Fig. 3
figure 3

Scatter plots of proteins that matched from the two 2-DE gels of (a) embryo and (b) endosperm. Plots were constructed using ImageMaster 2D Platinum software. The x-axis indicates protein abundance from one 2-DE map, and the y-axis indicates abundance from the other

Fig. 4
figure 4

2-DE maps of (a) embryo and (b) endosperm. Circles indicate significantly abundant in embryo (A) and endosperm (B) or specific to embryo (C) and endosperm (D)

Fig. 5
figure 5

Enlarged area showing differences in abundance of 16 representative protein spots in (a) embryo and (b) endosperm

Fig. 6
figure 6

Distribution of protein abundance in embryo and endosperm with respect to pI

Fig. 7
figure 7

Distribution of protein abundance in embryo and endosperm with respect to molecular mass

With the 2-DE proteome map of the embryo as a reference, the endosperm proteome was compared. About 368 proteins (66.2%) were commonly present in both tissues, whereas 124 protein spots were found only in the embryo and 64 protein spots were explicitly present in the endosperm. Based on the relative spot volume, 44 proteins were significantly (p ≤ 0.05) more abundant in the embryo than in the endosperm.

Protein identification

Of the 21 major differentially abundant protein spots analyzed by MALDI-TOF–MS/MS, eight spots (11, 74, 125, 128, 134, 135, 147 and 180) were significantly higher in relative volume in the endosperm than in the embryo (Figs. 5, 8) and eight (spots 27, 72, 105, 119, 132, 271, 300 and 302) were higher in the embryo. Three protein spots (493, 512 and 514) were endosperm-specific, and 2 (414 and 427) were embryo-specific (Table 1).

Fig. 8
figure 8

Mean relative spot volume (± SD) of the abundant proteins in the embryo and endosperm

Table 1 Analysis of embryo and endosperm spots with differential protein abundance using MALDI TOF–MS/MS

The embryo-abundant protein spots were functionally classified into six classes: metabolism and energy (20%), defense and stress (20%), protein destination and storage (20%), cell growth and division (20%), and unclassified proteins (20%) (Fig. 9a). Endosperm-abundant proteins were also classified into six classes: metabolism and energy (9.1%), defense and stress (9.1%), protein destination and storage (36.3%), cell growth and division (18.2%), predicted and unknown proteins (18.2%), and unclassified proteins (9.1%) (Fig. 9b).

Fig. 9
figure 9

Classification of identified proteins in (a) embryo and (b) endosperm according to their notated functions

Discussion

The small embryo of G. moluccana has two thin cotyledons and is embedded in an enlarged and bulky endosperm. The 2-DE protein profiles of the embryo and endosperm showed high similarities in terms of distribution of proteins spots. A significant similarity in the distribution of protein spots and protein types with variations in protein abundance between the mature embryo and endosperm was also reported for J. curcas and tomato (Sheoran et al. 2005; Liu et al. 2009). In angiosperms, the embryo and endosperm develop after distinct fertilization events and differ in function, morphology and ploidy. The endosperm results from the fusion of two polar nuclei (2n) with a second sperm nucleus (n) and is triploid (3n), whereas the embryo is diploid (2n) formed by the fusion of the sperm nucleus (n) with the egg (n) (Lopes and Larkins 1993). Our results indicated that the proteomes of embryo and endosperm were highly comparable in terms of distribution of protein spots though the structures differ in ploidy and morphology.

However, 44 protein spots were differentially abundant between the embryo and endosperm. Around 368 proteins matched between the two, greater than 60% using the embryo as the reference. Unmatched proteins constituted approximately 34% of the total; 124 proteins were specific to the embryo, and 64 were specific to the endosperm. These unique protein spots suggest different functions in the embryo and endosperm.

Proteins related to metabolism and energy

Chloroplast and photosynthesis are important for germination and subsequent vigour of seedlings (Allorent et al. 2015). The purpose of seed photosynthesis is fascinating even though seeds are considered to be sink organs which acquire with storage molecules such as carbohydrates, proteins, mRNAs and other nutrients from maternal tissues (Hua et al. 2012). In the present study, cytochrome f (spot 27A) and tRNA (Ile)-lysidine synthase (spot 271A) (TilS), involved in photosynthesis and amino acid synthesis, were more abundant in the embryo than in the endosperm. Cytochrome f, an intrinsic membrane-bound protein that is part of the cytochrome b6f complex in the photosynthetic electron transfer chain, is engaged in the transmission of electrons from the Rieske Fe-S protein to plastocyanin in the thylakoid lumen (Gray 1992). Allorent et al. (2015) reported that Arabidopsis thaliana seeds lacking cytochrome b6f had poor germination. tRNA (Ile)-lysidine synthase (TilS) is involved in changing the specificity of tRNA from met to Ile. TilS binds to the tRNA ending with a purine and modifies cytidine in the first position to lysidine. Metabolic activities of cells are reactivated after the onset of germination, and the abundance of the proteins related to photosynthesis and amino acid biosynthesis is an indication that seeds, upon receiving signals for germination, can activate photosynthetic machinery to generate energy needed for plant growth.

The maturation phase in oil-seed plants is exemplified by the differentiation of a photosynthetic stage in which photosynthesis may play a direct role in lipid storage accumulation (Eastmond et al. 1996; Goffman et al. 2005). Hydroxyacyl-ACP dehydratase (spot 493D) (HAD), related to mitochondrial fatty acid synthesis, was specifically present in the endosperm and participates in the synthesis of unsaturated fatty acids and involved in the dehydration of long and short chain hydroxyl acyl-ACPS (Kastaniotis et al. 2004). HAD also plays a key role in mitochondrial function and morphology in yeast and crucial for promoting photorespiration in plants (Guan et al. 2017).

Defence- and stress-related proteins

Two protein spots identified as phenylalanine ammonia lyase (PAL) (spot 72A) and glutathione S-transferase DHAR2-like (GST) (spot 119a) were abundant in the embryo compared to the endosperm. On the other hand, isoflavone reductase-like protein (IFR) (spot 147B) was abundant in the endosperm. PAL catalyzes the first rate-limiting step in the phenylpropanoid pathway and protects the plant from several biotic and abiotic stresses in plants. The products of the phenylpropanoid pathway have antioxidative properties, thereby reducing the oxidative damage from reactive oxygen species (ROS). Seeds are exposed to ROS from seed development and germination (El-Maarouf-Bouteau and Bailly 2008). GST protects the cells from oxidative damage, dehydration, herbicides and toxic substances. They are involved in fusion of non-ribosomally synthesized reduced glutathione to various toxic molecules and hydrophobic electrophiles. DHAR-like GSTs are unique and hold a cysteine molecule in the active site, catalyze the reduction of dehydroascorbate to ascorbate (Dixon et al. 2002). IFRs are a major group of phytoalexins derived from the phenylpropanoid pathway involved in isoflavonoid biosynthesis and help protect against phytopathogens (Kim et al. 2008). IFR levels increase in Ginkgo biloba seeds exposed to UV-B, and after treatment with abscisic acid (ABA) and salicylic acid (SA) (Cheng et al. 2013). The high abundance of stress-related proteins in the embryo and endosperm of Givotia indicates the potential ability to withstand biotic and abiotic factors like high temperature, desiccation tolerance along with regulating the plant growth.

Protein destination and storage

Many seeds can withstand harsh environmental conditions and produce a well-developed plant in suitable conditions. The evolution of flexible mechanisms allows the seeds to protect themselves from stress; failure of this process can lead to death of the new seedling. One such mechanism protects proteins from dysfunction due to environmental stress. Chaperones hold the responsibility of preserving the proteins in their functionally active conformations. In seeds in the present study, chaperonin CPN60-2 (spot 74B) and aspartic proteinase A1 (AP-spot 125B and 134B) were more abundant in the endosperm while HSP70-HSP-90 organising protein 3-like (HOP3) (302A) was abundant in the embryo. These proteins are involved in protein assembly, folding and stability, and proteolysis (Lindquist and Craig 1988). Chaperonin CPN60-2, a mitochondrial chaperonin HSP60 that functions in an ATP-dependent manner, acts in the import and assembly of mitochondrial proteins (Wang et al. 2004). Under abiotic stress, CPN60 promotes refolding and assembly of misfolded or unfolded proteins in the mitochondrial matrix and the proper development of chloroplasts and subsequently embryo and seedling growth (Apuya et al. 2001).

HOP3 is a molecular cochaperone that protects the plant from both the biotic and abiotic stresses. Most of the pathogenesis-related proteins and the proteins associated with heat stress such as HSP70 cognates induce binding proteins in the endoplasmic reticulum, which guides them in proper synthesis, folding, assembly and transport (Yángüez et al. 2013). In this study, APs were abundant in the endosperm. A1 APs are the most-studied plant proteinases involved in proteolysis and protein processing in various plant organs, and are believed to play a role in plant senescence, stress responses, programmed cell death (PCD) and reproduction (Simoes and Faro 2004). Proteolysis and mobilization of storage proteins, globulins and gliadins are induced during rice and wheat seed germination. Chen and Foolad (1997) hypothesized that APs might be involved in the synthesis of new proteins from nucellar cell death proteins for embryo and endosperm development as part of the nucellar cells degeneration during PCD in barley after pollination.

The embryo-specific ATP-dependent zinc metalloprotease FtsH (spot 414C) is an ATP-dependent protease associated with diverse cellular activities (AAA) that control the quality of integral membrane proteins. They are similar to bacterial FtsHs and are localized in chloroplast and mitochondria of plants. FtsHs proteases are highly conserved, consisting of three functional domains in a single polypeptide chain where the N-terminal membrane-spanning helices are exposed to ATPase of AAA superfamily and zinc-binding domain toward the stroma (Leonhard et al. 1999). Single or double mutant analysis of FtsH proteases revealed their importance in thylakoid biogenesis (Zaltsman et al. 2005). AtFtsH 1 is believed to be involved in the assembly and maintenance of PSII complexes by proteolysis of damaged D1 proteins due to photo-oxidation. A mutation in AtFtsH 4 also resulted in abnormalities in the structure of mitochondria (Rigas et al. 2009), and the functional protein is required for the dynamic assembly and stability of complex I (Kolodziejczak et al. 2007).

Cell division and growth

Embryo spots 132A, 105A and endosperm spots 135B and 180B are expected to play an essential role in cell division and growth of the plant. Gibberellin 3-beta-dioxygenase 2–2 (GA20ox) (spot 132A) play an important role in hormonal biosynthesis and root hair formation. Gibberellins (GAs) are involved in diverse functions such as seed formation, germination, stem elongation, development of leaf and trichomes, pollen maturation and induction of flowering (Daviere and Achard 2013). Chemically, they are diterpenoids, and their synthesis occurs in three phases in different compartments of the cell. The first phase occurs in plastids with the cyclization of geranylgeranyldiphosphate, followed by formation of ent-kaurene takes place. The second phase, oxidization of ent-kaurene to GA12, occurs in the endoplasmic reticulum, and the final phase in cytosol involves the formation of bioactive GAs by the action of 2-oxoglutarate-dependent dioxygenases, GA 20-oxidase (GA20ox) and GA3-oxidase (GA3ox) (Yamaguchi 2008). Gibberellin 3-beta-dioxygenase 2–2 converts the inactive GA9 and GA20 into bioactive GA1 and GA4, which are active in the regulation of shoot elongation and in floral induction in short-day conditions (Eriksson et al. 2006).

CASP-like protein 2U10 (spot 105A) was found in the embryo and CASP-like protein 2D1 (spot 135B) in the endosperm. CASP-like proteins (CASPLs) are membrane-bound proteins that are important in the regulation of cell wall component deposition in the root endodermis (Enstone et al. 2002), thereby they are crucial for root formation (Roppolo et al. 2014). CASPLs also prevent the lateral diffusion of solutes by regulating the apoplastic barrier among inter- and intraorganismal apoplasms. Though the endosperm may not be involved in the root hair formation during seed germination, the presence of CASPLs in the endosperm is unclear and unexplored. Actin (spot 180B), highly conserved and abundant in eukaryotes, is expressed throughout the life of plants and involved in developmental processes including cell division and polarity, cell wall deposition and cell elongation (Meagher et al. 2000).

Conclusions

The proteome analysis of G. moluccana showed marked differences in the abundance of 44 protein spots between embryo and endosperm, and 21 were selected and identified through MALDI–TOF–MS/MS. The identified proteins are involved in cell division, root formation, hormone biosynthesis, photosynthesis, chloroplast biogenesis, fatty acid, amino acid synthesis including cellular redox homeostasis and stress responses. This study provides a foundation for further studies related to seed development and germination of this important tree species.