Background

In Egypt, sheep and goats are important domestic animals reared either under desert conditions or Nile valley. Sheep and goats contribute about 6% and 4.5% of the total red meat produced, respectively (Statistics of Ministry of Agriculture, 2018). Identification of molecular characterization of genes underlying livestock productive traits may allow applying advanced biotechnology techniques to improve animal productivity. Growth hormone (GH) controls the body growth rate, milk production, reproduction as well as carbohydrate, lipid, and protein metabolism. The previous study demonstrated that the GH mRNA expression was significantly greater in all tissues examined in fertile Lezhi goat does than in infertile Tibetan goat does. This indicated that the GH may play an important role in the development of preovulatory follicles and the number of ovulatory follicles and finally, the ovulation rate in goat does [1]. The genetically selected sheep for low fatness have higher blood concentration of growth hormone than the other sheep [2].

The Assaf sheep and Boer goat breeds are widely used in Egypt for genetic improvement whether using artificial insemination or natural mating. Assaf sheep was produced by crossbreeding between Palestinian Awassi and German East Friesian sheep. It is a dual-purpose breed, raised for both milk and meat; however, it is used primarily for milk production [3]. It is worth mentioning that Assaf sheep are widespread in Spain, Portugal, and Middle East countries, but they are limited spread in Egypt. Assaf sheep adult body weight is about 110 kg for rams and about 80 kg for ewes. The growth rate of lambs is about 350 g/day from birth to 4 months. Age at first lambing is 13 months, and fertility ranged from 75 to 85%. Litter size is 1.4, and the average milk yield is 359 L in 220 days of lactation, with milk of 7.2% fat and 5.5% protein [4].

South Africa is the origin of Boer goat [5], and it is specialized for meat production. Boer goat is well adapted for tropical and semitropical conditions, high fertility rates, and resistance for the disease [6]. Boer goat is one of the most meat breeds spread in the world due to fast growth rate and excellent carcass qualities, where the average litter size is 1.9 kids/doe, the litter birth weight is 6.0 kg, and the average kid birth weight is 3.2 kg [7]. The present investigation aims to study the genetic variations of isolated growth hormone cDNA sequences from Assaf sheep and Boer goat breeds that used for the genetic improvement of small ruminant local breeds in Egypt.

Methods

Isolation of growth hormone and cDNA synthesis

The Boer goat was imported from South Africa by Egyptian Ministry of the Agriculture, while the Assaf sheep was obtained from Sinai and both have reared at National Research Centre farm from 2013, Noubaria, El-Beheira Governorate. For each breed, two pure males at third generation were slaughtered to obtain their pituitary gland tissues for GH mRNA isolation. The total RNA was isolated from the pituitary gland tissues of Assaf and Boer breeds using Biozol kit (Bioflux®) and reverse transcribed to the cDNA using First Strand cDNA Synthesis Kit (Fermentas, Canada).

Polymerase chain reaction (PCR)

The efficiency of reverse transcription has been assessed using PCR. As_GH primer was published on NCBI (Ac: EU935861.1, 671 bp), As_GH-F 5′-GCTCACCAGCTATGATGGCTG-3′ and As_GH-R 5′-TGGCAACTAGAAGGCGCAGCT-3′, while Bo_GH primer was designed based on the National Center for Biotechnology Information (NCBI) database (Ac: X07035, 785 bp) using Primer-Blast tool available at https://www.ncbi.nlm.nih.gov/tools/pr imer-blast/. The Bo_GH primer sequence was Bo_GH-F 5′-CCGCGGAGGGTCCTGCTGACAGCTC-3′ and Bo_GH-R 5′-GAGCTCTTGATGCAATTTCCTCGC-3′. The PCR condition of As_GH-cDNA was denaturation at 94 °C for 1 min, annealing at 60 °C for 2 min, extension at 72 °C for 3 min (35 cycles), and a final extension at 72 °C for 10 min, while Bo_GH-cDNA was denaturation at 94 °C for 1 min, annealing at 62 °C for 2 min, extension at 72 °C for 3 min (35 cycles), and a final extension at 72 °C for 10 min.

Construction of GH cloning vector

Growth hormone cDNA sequences of Assaf sheep and Boer goat were ligated into pTZ57R/T cloning vector according to the InsTAclone cloning kit procedures (#K1213, Fermentas, USA) as follows: 3 μl of pTZ vector, 3 μl of both As_GH/Bo_GH cDNA, 6 μl of 10× ligase buffer, 1.5 μl T4 DNA ligase, and 16.5 μl water in total 30 μl reaction mixture, then incubated for ligation at 22 °C/1 h. The constructions of As_GH and Bo_GH-PTZ were transformed into DH10B cells for proliferation and extraction. The constructed As_GH and Bo_GH-PTZ vectors were sequenced for GH cDNA sequences. The efficiency of transformation was determined using previous primers and PCR conditions.

Bioinformatics analysis of As_GH and Bo_GH cDNA sequences

The consensus sequence of As_GH and Bo_GH cDNA was created using the Bioedit 7.2.5 software [8]. The open reading frame (ORF) of As_GH and Bo_GH cDNA sequences was predicted using https://web.expasy.org/translate/ [9]. The cleavage site was predicted using SignalP 4.1 Server tool http://www.cbs.dtu.dk/services/SignalP/ [10]. Predicted As_GH and Bo_GH protein feature annotations have been illustrated using Protter program version 1.0 that available at http://wlab.ethz.ch/protter/# [11].

The cysteine state and disulfide bonds of As_GH and Bo_GH cDNA sequences were carried out using an available tool at http://clavius.bc.edu/~clotelab/DiANNA/ [12]. The physical characteristics of the As_GH and Bo_GH proteins were calculated using the EMBOSS Pepstats tool which is available at https://www.ebi.ac.uk/Tools/seqstats/emboss_pepstats/. The transcription promoters have been predicted using http://www.fruitfly.org/seq_tools/promoter .html [13].

The conserved domains of As_GH and Bo_GH and As_GH cDNA sequences were detected using conserved domains tool http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi [14] The conserved domain multiple alignments were achieved using https://www.st-va.ncbi.nlm.nih.gov/tools/cobalt/cobalt.cgi?CMD=Web. The protein motifs of As_GH and Bo_GH cDNA were predicted using the search motif library tool (https://www.genome.jp/tools/motif/). The protein motifs multiple alignments were achieved using http://meme-suite.org/tools/mast or https://prosite.expasy.org/scanprosite/. Pairwise and multiple alignments for SNPs prediction between As_GH and Bo_GH cDNA sequences and between the GenBank database achieved using the Bioedit 7.2.5 software [8]. The multiple alignments were achieved for As_GH and Bo_GH protein sequences vs. GenBank database using the Bioedit 7.2.5 software [8].

Results

Growth hormone cDNA isolation and subcloning

The growth hormone cDNA sequence was isolated from the pituitary gland of Assaf sheep and Boer goat. The molecular weight of the As_GH cDNA sequence was 665 bp and was 774 bp for Bo_GH cDNA (Figs. 1 and 2, respectively). Polymerase chain reaction was used to confirm the successful ligation of As_GH and Bo_GH cDNA sequences into the construction of As_GH-TZ57R/T vector and Bo_GH-TZ57R/T vector (Figs. 3 and 4, respectively). The complete sequences of As_GH and Bo_GH were registered in the GenBank database under accession number MH128986 for the As_GH sequence and MG744290 for the Bo_GH sequence.

Fig. 1
figure 1

Gel electrophoresis bands of As_GH cDNA

Fig. 2
figure 2

Gel electrophoresis bands of Bo_GH cDNA

Fig. 3
figure 3

PCR electrophoresis of As_GH-TZ57R/T vector

Fig. 4
figure 4

PCR electrophoresis of Bo_GH-TZ57R/T vector

Growth hormone cDNA sequence annotation

Assaf and Boar growth hormone SNPs

Pairwise alignment of As_GH and Bo_GH cDNA sequence

Pairwise alignment between As_GH and Bo_GH cDNA growth hormone sequences (Fig. 5) indicated that there were four synonymous alleles substitutions (small letter) which yield a new codon that encodes the same amino acids. In Assaf sheep, TCC for serine (Ser, S251), CCA for proline (Pro, P263), GAT for aspartic acid (Asp, D542), and GAT for aspartic acid (Asp, D545) versus in Boer goat, TCt for serine (Ser, S263), CCg for proline (Pro, P275), GAc for aspartic acid (Asp, D554), and GAc for aspartic acid (Asp, D557), respectively. Assaf sheep has a unique three SNPs (A637G638G639) that encodes for arginine amino acid (Arg, R194); this insertion mutation (AAG) has been absent in the growth hormone cDNA sequences of Boer goat in the current study and all breeds in GenBank database.

Fig. 5
figure 5

Pairwise alignment of Assaf_GH and Boer_GH cDNA sequences with their predicted protein residues

Multiple alignments of As_GH sequence vs. GenBank sheep database

Multiple alignments between growth hormone sequence of Assaf sheep and GenBank database (Fig. 6) showed thirty-two synonymous alleles substitutions (small letter) which yield new codons that encode for same amino acids and twelve synonymous alleles substitutions (small letter) which yield new codons that encode for different amino acids (Table 1).

Fig. 6
figure 6

Multiple alignments of Assaf_GH cDNA sequences with GenBank sheep database

Table 1 Multiple alignment of As_GH cDNA sequence with GenBank sheep database

Also, the unique three SNPs (A637G638G639) of the Assaf growth hormone have been confirmed in the result of the multiple alignments comparing with of all breeds in the GenBank database. Growth hormone cDNA sequences of Ossimi sheep have one start codon (ATG) and three unique SNPs (A12A13G14) encodes for lysine (Lys2, K) vs. Alanine (Ala3, A) in all GenBank database. There are two alleles substitutions in Ossimi and Afghani sheep which yield a new codon that encodes for different amino acid; for Ossimi sheep, Gg472c473 encodes for glycine (Gly, G), and Ca628g629 encodes for glutamine (Gln, Q) for Afghani sheep, respectively compared to GTT for valine (Val, V) and CGC for arginine (Arg, R) in the GenBank database and in Afghani sheep, Gg389c390 for glycine (Gly, G) compared to GTT for valine (Val, V) in the GenBank database.

Multiple alignments of Bo_GH sequence vs. GenBank goat database

Multiple alignments between growth hormone sequence of Boer goat and GenBank goat database (Fig. 7) showed nine synonymous alleles substitutions (small letter) which yield for new codons that encode for same amino acids: Gln, Ser, Thr, Asn, Cys, Leu, Arg, Glu, and His residues (Table 2). On the other hand, there are six synonymous alleles substitutions (small letter) which yield for new codons that encode for different amino acids: Ala to Thr, Pro to Leu, Gly to Val, Gly to Ser, Phe to Leu, and Arg to Cys. The Bo_GH cDNA sequence has no SNPs compared to the growth hormone sequence at GenBank goat database (Table 2).

Fig. 7
figure 7

Multiple alignments of Boer_GH cDNA sequences with GenBank goat database

Table 2 Multiple alignment of Bo_GH cDNA sequence with GenBank goat database

Pakistani Kamori, Tharri, and Beetal goat shared with the same SNPs encode for the same amino acids: g43CC for Ala15, CAt588 for His197, c607GG for Arg203, TGc207for Cys207, and TGt645 for Cys215 vs. A63CC for Thr15, CAC608 for His169, A627GG for Arg203, TGT665 for Cys207, and TGC665 for Cys215 compared to the GenBank goat database, respectively.

Single nucleotide polymorphisms of A/G was observed in Afghani sheep (a25GC that encoded for Ser5) and Indian Beetal goat (a28GC that encoded for Ser10) vs. G111GC that encoded for Gly35 in GenBank sheep database and GGC in GenBank goat database, respectively. In Pakistani Beetal goat has G/A SNP, g358GC (Gly120) compared to A78GC (Ser120) in GenBank goat database.

The prediction of growth hormone promoter

The As_GH and Bo_GH cDNA sequences and the predicted promoters were shown in Fig. 8. The As_GH promoter sequence was spanned from 404 to 454 bp (50 bp length) with a 0.74 prediction score. The sequence included the ATG start codon CCTCGGACCGTGTCTATGAGAAGCTGAAGGACCTGGAGGAAGGCATCCT. While the promoter sequence of the Bo_GH cDNA sequence was GGACCGTGTCTATGAGAAGCTGAAGGACCTGGAGGAAGGCATCCTGGCCC (50 bp length), contained start codon (ATG) and spanned from 416 to 466 bp with prediction score 0.74. The identity score between both promoters was 100% irrespective of position difference. The promoter sequence in As_GH and Bo_GH sequence was completely identical and has the same length irrespective of position difference.

Fig. 8
figure 8

Pairwise alignment of As_GH and Bo_GH cDNA sequences and their promoter sequence

Protein structure annotation

Signal peptides sequence

The signal peptides ranged from 1 to 27 residues for both As_GH and Bo_GH protein sequences with 100% similarity sequence peptides. The predicted cleavage site was located between 26 and 27 amino acids in both Bo_GH and As_GH protein (Fig. 9).

Fig. 9
figure 9

The predicted cleavage site of signal peptide in As_GH and Bo_GH protein sequences

Conserved domain and motifs

The conserved domain of As_GH sequence protein was matched with two hits, growth hormone-like superfamily that interval from 31 to 216 aa and somatotropin like that interval 36–216 aa. Also, Bo_GH sequence protein has two hits, growth hormone-like superfamily that interval from 31 to 215 aa and somatotropin like that interval from 36–215 aa (Figs. 10 and 11, respectively). The multiple alignments of conserved domain of the As_GH protein sequence and GenBank sheep database (Fig. 12) showed that the conserved domain of all breeds was matched with growth hormone-like superfamily hit that started from protein methionine (Met31), and there were six different substitution amino acids in four breeds: Ser34 and Gly125 in Afghani, Gly155 and Gln206 in Ossimi, Gly153 in Tibetan, and His138 and Arg173 in Latti sheep compared to Gly, Val, Val, Arg, Glu, Tyr, and Thr, respectively in the GenBank sheep breeds. The As_GH sequence protein has a unique amino acid Arg194, and it is absent in all GenBank sheep database.

Fig. 10
figure 10

Conserved domain of As_GH protein residues

Fig. 11
figure 11

Conserved domain of Bo_GH protein residues

Fig. 12
figure 12

Multiple alignment of conserved domain of As_GH protein residues with GenBank database

The multiple alignments of the conserved domain of the Boer goat and GenBank goat database (Fig. 13) showed that the conserved domain of all breeds was shared with somatolactin (SL) and somatolactin-like protein hit that ranged from Ala27 and included the previous two hits growth hormone-like superfamily and somatotropin like hit. There were six different substitution amino acids in five breeds: Gly120, 156 in Pakistani Beetal, Leu123 in Lezhi, Gly156 in Kamori, Ser10 and Gly131 in Indian Beetal, and Pro181 in Thari goat compared to Ser, Val, Phe, Val, Gly, and Val in the GenBank sheep breeds, respectively. There was no different substitution in the Bo_GH protein sequence compared to the GenBank goat database.

Fig. 13
figure 13

Multiple alignment of conserved domain of Bo_GH protein residues with GenBank database

The multiple alignments of the As_GH protein sequence with the GenBank database showed that the As_GH protein sequence has one motif signature, Somatotropin_1 (CFSETIPAPT GKNEAQQKSDLELLRISLLLIQSW) from 79 to 112 aa (Fig. 14), while the multiple alignments of the Bo_GH protein sequence with the GenBank database showed two motifs, Somatotropin_1 (191 related sequences) and Somatotropin_2 (195 related sequences). In the present study, the Bo_GH sequence protein had two common motifs signature (Fig. 15): Somatotropin_1 (CFSETIPAPTGKNEAQQKSDLELLRISLLLIQSW) from 79 to 112 aa and Somatotropin_2 (CFRKDLHKTETYLRVMKC) from 190 to 207 aa.

Fig. 14
figure 14

Motif patterns of As_GH protein sequence compared with GenBank database

Fig. 15
figure 15

Motif patterns of Bo_GH protein sequence compared with GenBank database

Cysteine bridge and disulfide bonds

Five conserved cysteines (Cys) residues were detected in both As_GH and Bo_GH protein sequences at the following positions: Cys18, Cys79, Cys190, Cys208, and Cys216 in As_GH protein sequence and at Cys18, Cys79, Cys190, Cys207, and Cys215 in Bo_GH protein sequence. Two sulfide bonds were predicted between Cys79, Cys190 and between Cys208 and Cys216 aa in As_GH protein and between Cys79, Cys190 and between Cys207 and Cys215 in Bo_GH protein (Figs. 16 and 17, respectively). The comprehensive summary annotation of As_GH and Bo_GH protein residues was shown in Table 3.

Fig. 16
figure 16

The Assaf sheep GH_cDNA protein features and disulfide bounds

Fig. 17
figure 17

The Boer goat GH_cDNA protein features and disulfide bounds

Table 3 Comprehensive summary of As_GH and Bo_GH annotations

Protein alignments of As_GH and Bo_GH predicted protein sequence

Pairwise alignments

The pairwise alignments of As_GH and Bo_GH predicted proteins (Fig. 18) indicated that the homology percentage of growth hormone amino acids in both breeds reached 99.5%. As_GH protein sequence has a unique residue (arginine, R194) that was absented in the Bo_GH protein sequence.

Fig. 18
figure 18

Pairwise alignments of As_GH and Bo_GH protein sequences

Multiple alignments of growth hormone protein residues

The multiple alignments of As_GH protein sequences vs. GenBank protein database (Fig. 19) showed eleven modified residues in growth hormone protein sequence in five sheep breeds: Assaf, Ossimi, Afghani, Latti, and Tibetan. The unique residue of Arg194 in As_GH protein sequence was absented in all growth hormone protein sequences of sheep and goat breeds at the GenBank database. All breeds in the present study and GenBank database started with two start codons (MM) except Egyptian Ossimi sheep that have one start codon. Ossimi sheep have five modified amino acids’ substitution: Lys2, Thr5, Ala14 (signal peptide), Gly155, and Gln207 vs. Ala3, Pro6, Thr15 (signal peptide), Val156, and Arg209 in the GenBank database, respectively. In Afghani and Tibetan sheep have two modifies amino acids substitution: Ser5, Gly126 and Gly154, Arg175 vs. Gly35, Val156 and Glu154, Thr176, respectively compared to the GenBank database. The sheep GenBank database has dominant SNP at AC526A that encoded for Thr173 compared to Ag518A encoded for Arg173 in Chines Tibetan sheep.

Fig. 19
figure 19

Multiple alignment of As_GH protein sequence vs. GH protein sequence at GenBank sheep database

The multiple alignments of Bo_GH protein sequences vs. GenBank growth hormone protein database (Fig. 20) showed twelve modified amino acids in five goat breeds: Tharri, Kamori, Pakistani Beetal, India Beetal, and Lezhi. Pakistani goat breeds (Tharri, Kamori, and Beetal) were shared in the same amino acid substitution of alanine at the same position Ala15 vs. Threonine (Thr15), respectively in Bo_GH and GenBank growth hormone protein sequences of goat breeds. The Pakistani Kamori and Beetal goat breed have the same amino acid substitution of glycine at the same position (Gly156), while, was at a different position (Gly131) in Indian Beetal goat vs. valine (Val156), respectively compared to growth hormone protein sequence of Boer and GenBank goat breeds. The multiple alignments showed that Kamori goat has Arg18 substitution vs. Cys18; Indian Beetal has Met1 and Ser10 substitution vs. Gly26 and Gly35; Pakistani Beetal has Gly120 substitution vs. Ser120; Tharri goat has Pro181 substitution vs. Leu181; Lazhi goat has Leu123 substitution vs. Pro123 compared to GenBank database.

Fig. 20
figure 20

Multiple alignment of Bo_GH protein sequence vs. GH protein sequence at GenBank goat database

Discussion

Growth hormone cDNA subcloning in TZ57R/T vector

The electrophoretic pattern of As_GH-TZ57R/T vector and Bo_GH-TZ57R/T constructions were confirmed the successful ligation of As_GH and Bo_GH cDNA sequences into the construction of TZ57R/T cloning vector (Figs. 3 and 4, respectively).

Growth hormone cDNA sequence annotation

Assaf and Boar growth hormone SNPs

Single nucleotide polymorphisms (SNPs) in different candidate genes and their association with animal performance have investigated in different animal species: cattle [15,16,17], sheep [18, 19], and goats [20]. The growth hormone gene SNPs have widely used for the genetic marker as an aid to genetic selection in farm animals: cattle [17, 21,22,23], sheep [24], and goat [25, 26].

Pairwise alignment of As_GH and Bo_GH cDNA sequence

Assaf sheep has a unique three SNPs (A637G638G639) that encodes for arginine amino acid (Arg, R194); this insertion mutation (AAG) has been absent in the growth hormone cDNA sequences of Boer goat in the current study and all breeds in GenBank database. This SNPs (AAG) may be used to develop a genetic marker for Assaf sheep breed.

Multiple alignments of As_GH sequence vs. GenBank sheep database

Also, the unique three SNPs (A637G638G639) of the Assaf growth hormone have been confirmed in the result of the multiple alignments comparing with of all breeds in the GenBank database. Ossimi sheep have three unique SNPs (A12A13G14) encodes for lysine (Lys2, K) vs. alanine (Ala3, A) in all GenBank database. Due to the difference of physicochemical properties of Lys and Ala residues [27], therefore substitution of Ala with Lys in Ossimi sheep may have an adverse effect on the function of growth hormone and thereby, affect animal performance.

Ossimi, Afghani, and Kazakh sheep are the most breeds have SNPs compared to the Genbank database that may be due to the interfering of random crossbreeding in each breed. There are two alleles substitutions in Ossimi and Afghani sheep which yield a new codon that encodes for different amino acid; for Ossimi sheep, Gg472c473 encodes for glycine (Gly, G), and Ca628g629 encodes for glutamine (Gln, Q) for Afghani sheep, respectively compared to GTT for valine (Val, V) and CGC for arginine (Arg, R) in the GenBank database and in Afghani sheep, Gg389c390 for glycine (Gly, G) compared to GTT for valine (Val, V) in the GenBank database. Both glycine and glutamine did not have the same physicochemical properties of valine and arginine; thereby, substitution between them may harm the protein function [27].

Multiple alignments of Bo_GH sequence vs. GenBank goat database

Pakistani Kamori, Tharri, Beetal, and Indian Beetal goat are the most breeds have SNPs compared to the GenBank goat database. Most goat breeds in Pakistani or in India are the multipurpose type used for meat and milk production or sometimes also for hair [28]. This may explain the variety of SNPs in the growth hormone cDNA sequence in these breeds due to the random crossbreeding in each breed.

Pakistani Kamori, Tharri, and Beetal goat shared with the same SNPs encode for the same amino acids. These single nucleotide polymorphisms can be used to develop genetic markers for each breed. Single nucleotide polymorphisms of A/G was observed in Afghani sheep (A) and Indian Beetal goat vs. G in GenBank sheep and goat database. In Pakistani Beetal goat has G/A SNP; g358GC (Gly120) compared to A78GC (Ser120) in GenBank goat database. This substitution of Gly with Ser is difficult to predict, and modeling studies suggest that it would not be great [18]. The multiple alignments of the growth hormone of sheep and goats in the present results are in agreement with the results reported previously, where there were two genotypes, G/G nucleotide and A/G nucleotide, in sheep and goat [29, 30]. This A/G genotype is associated with high growth traits like birth chest girth, weaning weight, large litter sizes, and good body conformation. It may be due to the presence of both glycine and serine residues in the heterozygous animals, and these residues are interconvertible; when the body needs any of them and not available from the feed, it uses serine to produce glycine and vice versa. The serine residue is involving in the metabolic processes that burn glucose and fatty acids for energy, besides that used to make creatine which combines with water to “pump up” muscle mass.

Moreover, glycine is required for the synthesis of protein, construction of DNA, as well as RNA and synthesis of bile acids and other amino acids in the body, and it helps in retarding degeneration of muscles [31]. Therefore, production improvements can be achieved using these growth hormone SNPs through the selection of animals that have AG genotype and enter them in breeding programs of Egyptian sheep and goat as a way to increase their productivity. In Egyptian sheep and goat, SNP of g55G and a55G with two genotypes (G/G and A/G) has detected in the growth hormone sequence of Egyptian sheep and goat. The GG and AG genotype frequencies were 35.56 and 64.44% in Barki sheep, 19.23 and 80.77% in Rahmani sheep, and 76.67 and 23.33% in Ossimi sheep, respectively. In goats, the GG and AG genotype frequencies were 0 and 100% for Baladi, 13.33 and 86.67% for Barki, and 23.53 and 76.47% for Zaraibi, respectively [32].

In the present study, the multiple alignments showed that SNP of C/T was observed in both sheep and goat; TCc248 (Ser80) in Ossimi sheep and TCc254 (Ser81) in China sheep vs. TCC251 (Ser81) in all GenBank sheep database, while TCc246 and TCc168 for Ser81 in Indian Tibetan, Lazhi, and Beetal goats vs. TCT263 for Ser81 in for all GenBank goat database. The present results are confirmed previously by [33], where the heterozygote counterparts for C1763T and A1780G SNPs in GH gene sequence exhibited heavy body weights (p < 0.05) in Indian Osmanabadi and Sangamneri goat breeds.

The prediction of growth hormone promoter

The promoter controls and regulates the first step of gene expression, so, is the most important regulatory sequence in the gene [34]. The promoter sequence in As_GH and Bo_GH sequence was completely identical sequence and has the same length irrespective of position difference.

Protein structure annotation

Signal peptides sequence

The signal peptides are unique sequence and usually ranged from 16 to 30 residues extended in the N-terminal of newly synthesized secretory and involved in the transport of the protein to or via cell membranes and targeting to the endoplasmic reticulum (ER) membrane for translation initiation [35, 36]. The signal peptides are discarded during protein transportation via cell membrane by a specific peptidase [37]; they are consisting of tripartite structure (1) region of hydrophilic residues, (2) region of hydrophobic residues, and (3) cleavage site with signal peptidase (SPase) [37].

Conserved domain and motifs

In the conserved domain of sheep breeds, residues of Ser, Gly, and Glu have the same physicochemical properties; therefore, the substitution between them may not affect the protein function [27]. Likewise, in the conserved domain of the goat breeds, the residues of Ser, Gly, and Leu, Val have the same physicochemical properties. Therefore, the substitution between them may not affect the protein function [27]. Other residues substitutions in both conserved domain may affect protein function due to the difference in physicochemical properties [27].

The GH protein sequence is strongly conserved in most mammals, but there are differences in the biological and receptor-binding properties due to the species-specificity of receptor-binding [38]. A protein domain is a conserved and distinct part of molecular evolution, usually related to specific molecular functions of such protein folding and can function independently of the rest of the protein. Detection of the significant protein conserved domains is often required for basic cellular function, stability, or reproduction [39]. Detection of the conserved domain on the growth hormone sequence of Assaf sheep and Boer goat may be indicated that the isolation and sequencing processes of growth hormone are achieved correctly.

The protein motifs are consecutive and conserved amino acids sequence of protein families (called motifs signatures) and can often be used as a prediction tool for protein function [40]. Therefore, the Bo_GH sequence protein and all sheep and goat breeds in GenBank database had two common motifs signature: Somatotropin_1 (CFSETIPAPTGKNEAQQKSDLELLRI SLLLIQSW) and Somatotropin_2 (CFRKDLHKTETYLRVMKC). Any change in these consecutive protein residues causes an inability to predict the motif signatures. Interestingly, the As_GH protein sequence had only one motif signature, Somatotropin_1 (CFSETIPAPTGKNEAQQKSDLELLRISLLLIQSW), and the second motif signature (Somatotropin_2) is unpredictable. Three novel distinct nucleotides (AAG) that encode for arginine (R194) in protein sequence were observed inside the consecutive sequence of the second motif signature_2 (CFRKRDLHKTETYLRVMKC). Therefore, due to the presence of this insertion mutation makes it undetectable. There are not available growth hormone cDNA sequences for Assaf sheep in the GenBank database; therefore, it could not confirm that this mutation is specific for this breed, or this is due to individual mutation. Further studies are needed to confirm that these SNPs are stable mutations in Assaf sheep breed or are transiently mutation.

Cysteine bridge and disulfide bonds

The disulfide bonds play a crucial role in the folding and stability of most extracellular secreted proteins [41], protection of protein integrity from the extracellular milieu oxidants and proteolytic enzymes, thereby increase their half-life of the protein [42]. Through the oxidative folding process, four from five cysteine residues are establishing two disulfide bonds between the thiol groups of cysteine residues to stabilize the folded form of a protein [43]. Also, in the intracellular environment, the sulfhydryl side chain of cysteines is excellent for binding to metals, such as zinc [44].

Protein alignments of As_GH and Bo_GH predicted protein sequence

Pairwise alignments

As_GH protein sequence has a unique residue (arginine, R194) that was absented in the Bo_GH protein sequence. Arginine (Arg) is a polar and positively charged amino acid; it prefers to be on the surface of the protein and frequently involved in salt-bridges where they pair with a negatively charged residue (aspartate or glutamate) to create stabilizing hydrogen bonds. Therefore, the presence of arginine (insertion mutation) in the As_GH sequence may be important for increasing the protein stability [45] and also may be involved in the growth hormone-receptor binding [46].

Multiple alignments of growth hormone protein residues

Gene families arise by gene duplication and natural selection. The multiple alignments are an essential study to understand the evolutionary event between species [47]. In most mammals, the GH sequence is strongly conserved, but differences in the biological and receptor-binding properties are due to the species-specificity of receptor-binding [38]. Due to the difference of physicochemical properties between Lys and Ala [27], the substitution of Lys2 residue in the GH signal peptide of Ossimi sheep vs. Ala3 residue in the GenBank database may be causing a reduction in GH secretion in Ossimi sheep. Although there is a difference in the properties of Pro6 residue (small residue) and Thr5 residue (polar residue), they shared in to facilitate the intracellular signal transduction of the proteins, so they can be substituted without negative effect on protein function [9, 48, 49].

The sheep GenBank database has dominant SNP at AC526A that encoded for Thr173 compared to Ag518A encoded for Arg173 in Chines Tibetan sheep. The SNP of C in the growth hormone sequence showed a positive association with the growth rate [50]. Therefore, substation C to G may have a negative effect on the growth rate of Chines Tibetan sheep.

The substitution of Val156 with Gly156 and Gly35 with Ser5 in Afghani sheep breed may be not affecting GH function because the Gly and Val residues are small size and hydrophobic residues, and Gly and Ser residues are tiny and small residues; hence, the substitution is functional [49]. Likewise, the substitution of Glu154 residue that found in all breeds with Gly154 in Tibetan sheep breed may be causing a reduction in growth hormone function due to the differences between them in physiochemical properties [48], where the Gly is a small and hydrophobic residue, while the Glu is a polar and negative charged residue [51]. The Tyr168 residue is presented in the growth hormone protein sequence of GenBank sheep breeds vs. His168 residue in Pakistani Latti sheep breed. Tyrosine and histidine are polar amino acids (neutral), hence may the Tyr residue can be substituted with His residue without the adverse effect of GH protein function [48].

Pakistani goat breeds (Tharri, Kamori, and Beetal) were shared in the same residue substitution of Ala15 vs. Thr15 in GenBank goat breeds database; this substitution may be not affecting the GH protein function because both residues are small size [49], and this may be used this mutation as a genetic marker for these goat breeds. The substitution of Gly156 Pakistani Kamori and Beetal goat breeds with Val156 in GenBank goat breeds database may be acceptable where the Gly and Val residues are small sizes and hydrophobic residues; hence, the substitution is functional [49, 51]. The leucine and proline are very non-reactive residues and rarely directly involved in protein active or binding sites. Leucine can be substituted by other hydrophobic, particularly aliphatic, residues. Since the leucine and proline are small sizes and aliphatic residues, so leucine can be substituted by proline residue in Lazhi and Tharri goat [48, 51].

Conclusion

It was concluded that the GH sequences of Assaf and Boer goat are highly conserved and the homogeny in the codon region (99.5%). The Assaf sheep has a unique three SNPs (A637A638G639) that encodes for arginine (Arg194) that absented in growth hormone cDNA sequences of Boer goat in the current study and GenBank database breeds. Since there were no available records of GH cDNA sequences of Egyptian sheep or goats in the GenBank database, it was not possible to confirm that these SNPs that were reviewed in the current study are a distinctive characteristic for Egyptian breeds. Therefore, further studies are needed to analyses the genetic variations of growth hormone gene in different sheep and goat breeds in Egypt as well as documenting the relationship between these variations and the productive performance of animals.