Introduction

In recent years, high‐throughput techniques such as serial analysis of gene expression (SAGE), expressed sequence tag (EST), microarray and RNA-seq have been widely employed to study the gene functions and understand the transcriptional regulations in humans, animals as well as plants1,2,3,4,5. However, the high throughput expression data requires validation using real-time quantitative polymerase chain reaction (qPCR). The qPCR technique because of its dynamic range, scalability, sensitivity, and reproducibility has always been considered as precise technique to estimate the relative abundance of mRNA transcripts in any cell types6,7,8,9. However, in order to perform appropriate gene expression analysis, it has become mandatory to select stable reference genes (RGs) that can normalize provide accurate and reliable qPCR results for each and every experimental condition10. Earlier, most of the studies included single traditionally used housekeeping genes such as GAPDH or ACTB which might not provide accurate normalization of expression data. Later on, it has been strongly advocated that panel of two or more RGs should be employed for normalizing the expression data of target genes11,12. Further, the choice of appropriate RGs for individual experimentation dealing with varied biological resources was also realized for achieving the reproducibility and correct inferences13. Since last one decade or so, there has been significant increase in use of RT-qPCR technique to perform gene expression studies in different livestock and poultry species. Unfortunately, many of the studies didn’t follow the correct procedures to normalize the gene expression data13 Therefore, concerns have been raised by researchers across the world to ensure accuracy, consistency and reliability of RT-qPCR data by employing proper normalization methods.

Identification of panel of RGs has become the most popular approach to normalize the qPCR-based gene expression data as evident from numerous publications across mouse14,15,16, human5,17 plants18,19 and livestock species20,21,22,23,24. Lack of appropriate RGs can greatly compromise the reliability of qPCR due to technical variations or errors arises during sample preparation, like quality and starting amount of RNA, efficiency of reverse transcription, efficiency of PCR and errors during pipetting25. All these technical variations will affect both the target genes as well as selected panel of RGs. Therefore, it’s important to normalize the gene expression data by identifying suitable RGs or internal control genes (ICGs) in order to obtain an accurate and reliable gene expression data. Identification and validation of appropriate RGs has thus become an essential component in any gene expression studies wherein RGs are exposed to the same experimental conditions as target genes26.

Ladakh, one of the world’s highest inhabited region (3500–5500 m above sea level),  home to several unique native animal genetic resources such as cattle, yak, goat, sheep, donkeys, horses and double hump camel. The economy of local people is mainly dependent on these livestock species. The native cattle known as “Ladakhi cattle” (Bos indicus) is a unique germplasm that provides 2.5–4.5 kg of milk and has excellent adaptation potential to high altitude hypobaric hypoxia stress27,28. The Ladakhi yak (Bos grunniens) is also very well adapted and major source of milk and milk products for the local people. The Ladakhi goat (Capra hircus) or world famous pashmina goat is mainly reared for meat, milk, fiber (Pashmina and Mohair), hide and skin. Ladakhi donkey (Equus asinus) and Zanskari ponies (Equus caballus) are yet another important animal genetic resources that serves as an important pack animal for the local people and Indian army. Another unique species; double hump camel (Camelus bactrianus) is quite popular amongst tourists especially for safari in world famous cold desert stretch of Nubra valley region of Ladakh. Each of these species has developed effective mechanism to survive at high altitude and low oxygen condition. Under such adverse climatic conditions, the survival and performance of exotic breeds is not a viable option. It only allows the well adapted animal genetic resources to thrive and perform. Therefore, understanding transcriptome signatures and identifying genes highly abundant across all these species will provide strong clue on molecular mechanism operating at transcriptional level in response to abiotic hypoxia stress across these species. By making such advancements, not only these resources will be characterized and documented but will also help to understand these unique animals production attribute in a better way for future exploitation and overall improvement. As a step forward, the present study was designed to identify and select panel of stably expressed RGs for future transcriptional studies in each of the six livestock species of Ladakh.

In recent past, numerous studies have been conducted in similar lines to identify panel of appropriate RGs in several livestock species such as cattle22,29 buffaloes20,21, yak30, pig31, goat32,33 sheep33,34, horse35 etc. These studies have represented wide array of environmental or experimental conditions such as responses to external stimuli (heat stress, endurance, exercise), physiological or developmental stages, lactation cycle, cellular response etc.20,21,22,28,30,35,36,37. It is now evident that set of RGs that perform well in one particular condition or species may not work well in other experimental conditions or other species. Therefore, in the present study, an effort was made to evaluate and identify panel of appropriate RGs in peripheral blood mononuclear cells (PBMCs) of six livestock species well adapted to high altitude region of Leh-Ladakh viz., Ladakhi cattle, Ladakhi yak, Ladakhi donkey, Changthangi goat, Zanskar ponies and double hump camel. All these livestock species are native of Leh and Ladakh and have been naturally selected not only to sustain but perform and reproduce well under high altitude hypoxia stressful conditions. The 10 candidate RGs that were evaluated in the present study were; glyceraldehyde 3-phosphate dehydrogenase (GAPDH), beta-Actin (ACTB), ubiquitously expressed transcript (UXT), ribosomal protein S15A (RPS15A), beta 2-microglobulin (B2M), ribosomal protein L-4 (RPL4), ribosomal proteinS18 (RPS18), ribosomal protein S9 (RPS9), ribosomal protein S23 (RPS23), hydroxymethylbilane synthase (HMBS), hypoxanthine phosphoribosyl transferase (HPRT1).

Results

Specificity, expression abundance and coefficient of variation of individual RGs

In the present study, an effort was made to identify the appropriate RGs in all the major livestock species that are native of Leh-Ladakh region viz., Ladakhi cattle (LAC), Ladakhi yak (LAY), Ladakhi donkey (LAD), Changthangi goat (CHG), Double hump camel (DHC), Zanskar ponies (ZAP). The specificity of each primer pair was confirmed by the specific amplification checked in agarose gel and presence of single peak in melt curve analysis. The correlation coefficient (R2) and amplification efficiency (E) for individual primer pair in each of the six livestock species are given in Table 1. The expression abundance of individual RGs in each species is shown in Box Whisker plot (Fig. 1a–f). The Ct values of individual RGs ranged from RPS23 (13.94) to HPRT1 (30.18) in LAC; RPS23 (14.37) to RPS15 (33.82) in LAY; RPS23 (13.86) to RPS15 (35.47) in LAD; RPS15 (16.05) to RPS23 (34.90) in DHC; RPS15 (13.63) to RPS23 (34.92) in CHG; RPS15 (16.06) to RPS23 (36.06) in ZAP (Table 2).

Table 1 Gene symbol, accession number, primer sequence, melting temperature (Ta), amplicon size, slope, PCR efficiency and R2 of RGs for each evaluated RG.
Figure 1
figure 1

Expression level of individual candidate RGs in LAC (a), LAY (b), LAD (c), CHG (d), DHC (e) and ZAP (f). The data is presented as quantification cycle (Ct) values of each gene in the box-whisker diagram. The median is shown as a line across the box while whiskers indicate maximum and minimum values.

Table 2 The average raw Ct values of individual RGs in different species.

Expression stability analysis of RGs in each livestock species

Ladakhi Cattle (LAC)

The geNorm analysis ranked candidate reference genes as per their mean expression stability value (M value) which was below the threshold value of 1.5 for all the 10 RGs. The ranking order based on M value were EEF1A1 = RPL4 > RPS23 > RPS9 > UXT > B2M > GAPDH > RPS15 > HPRT1 > ACTB (Fig. 2a). The M value ranged from 0.147 (EEF1A1) to 0.689 (ACTB). The lower M value indicates higher expression stability while higher M value indicates lower expression stability. On the basis of M value, EEF1A1 = RPL4 RG pair was most stable expressed while ACTB was least stable. Another parameter that was evaluated by geNorm was the pairwise variation Vn/n + 1 in order to calculate the optimal number of RGs to be required for normalization. The pairwise variation (V) score of all the RGs were below 0.15 (Fig. 2b) which is an ideal pairwise recommended score11. Therefore, as per V value, combination of two RGs could be suggested to normalize the qPCR data in PBMCs of Ladakhi cattle.

Figure 2
figure 2

geNorm analysis for ranking of genes based on average expression stability measure (M value), Pair-wise variation (Vn/Vn + 1) between the normalization factors NFn and NFn + 1 to determine the optimal number of reference genes and Normfinder analysis in LAC (ac), LAY (df), LAD (gi), CHG (jl), DHC (mo) and ZAP (pr respectively).

In Normfinder analysis as well, the ranking stability of individual RGs were decided by the lower values indicating higher stability. In LAC, Normfinder analysis resulted in same panel of stable RGs (EEF1A1, RPL4, UXT, RPS23) as identified in geNorm analysis. On the other hand, ACTB, HPRT1, RPS15 RGs were identified as least stable. The ranking order from most to least stable RGs was as follows: EEF1A1 > RPL4 > UXT > RPS23 > RPS9 > B2M > GAPDH > RPS15 > HPRT1 > ACTB (Fig. 2c, Table 3).

Table 3 Overall ranking of best suitable RGs across different species.

The gene expression variation for 10 candidate RGs was also calculated using BestKeeper algorithm. In BestKeeper analysis, raw Ct values were used to evaluate stability of individual RGs based on their SD and CV values. The lower value indicates higher expression stability; however, the SD > 1 value indicates the reference gene is unstable and cannot be used for normalization. The RPS15 and GAPDH genes having lowest SD values of 0.123, 0.163 indicated expression stability. This was followed by B2M, EEF1A1, RPL4, UXT, RPS23, RPS9, HPRT1 and ACTB with SD values 0.391, 0.468, 0.481, 0.491, 0.503, 0.642, 0.712 and 0.812, respectively (Table 4). The ACTB gene on the other hand was least stable gene with highest SD value. Additionally, the inter-gene relationship for 10 RGs pairs was also estimated. Strong correlation coefficients (r) were observed for RPL4/EEF1A1 (0.980), EEF1A1/RPS9 (0.971), RPS23/RPS9 (0.966), RPS23/EEF1A1 (0.962), RPL4/RPS23 (0.961), RPL4/ RPS9 (0.922), UXT/ RPS9 (0.898) (Table 5). This analysis provided strong evidence that these pair of genes have similar expression pattern across the animals. Further, BestKeeper index was calculated for each gene and the correlation between each candidate RGs and BestKeeper was estimated. The relationship between RGs and BestKeeper was described in terms of Pearson correlation coefficient (r), coefficient of determination (r2) and the p value. The p < 0.05 was obtained for all genes indicating a significant contribution of all genes towards the index. Though the EEF1A1 (0.978) and RPS9 (0.973) showed high correlation values but their high fold change makes these genes as unreliable reference genes. The statistically significant SD and correlation shown by the RGs from with BestKeeper algorithm appeared to be consistent with their evaluation assessed by geNorm and Normfinder.

Table 4 Analysis of parameters based quantitative cycling points (CP) for 10 candidate RGs in LAC.
Table 5 Analysis of repeated pair-wise correlation amongst genes in LAC with BestKeeper index.

Additionally, RefFinder based analysis was carried out that ranks the stability order of RGs in a more refined way by taking into consideration geNorm, Normfinder, BestKeeper and delta Ct algorithms. The stability order and ranking of the RGs as per RefFinder were; EEF1A1 (1.41) RPL4 (2.11), RPS23 (3.98), UXT (4.36), RPS15 (4.76), B2M (5.05), GAPDH (5.12), RPS9 (5.32), HPRT1 (9), ACTB (10).

Ladakhi Yak (LAY)

The M value for all the 10 genes in geNorm analysis were found to be within acceptable range in LAY. The ranking order of RGs was GAPDH = RPS9 > ACTB > RPS23 > HPRT1 > UXT > B2M > EEF1A1 > RPL4 > RPS15 (Fig. 2d). GAPDH and RPS9 showed higher gene expression stability with M value of 0.223 followed by ACTB, RPS23 and HPRT1 with M value of 0.386, 0.507, 0.595 respectively (Table 3). On the other hand, RPS15, RPL4 and EEF1A1 were least stable with higher M values of 1.242, 1.121 and 0.992, respectively. The pair wise variation analysis showed V4/5 combination with least V value (0.132) followed by V3/4 (0.148) and V5/V6 (0.150) combinations (Fig. 2e). Since all these V values were well within the acceptable range (recommended cut-off value 0.15), therefore use of panel of 3 RGs (GAPDH, RPS9 and ACTB) is likely to provide most accurate normalization in Ladakhi yak samples. The Normfinder analysis also identified same set of RGs in LAY samples with highest stability; GAPDH (0.112), RPS9 (0.112) and ACTB (0.202) albeit slight change in their ranking order; GAPDH > RPS9 > ACTB > RPS23 > HPRT1 > UXT > EEF1A1 > B2M > RPS15 > RPL4 (Fig. 2f, Table 3). Similar to geNorm, RPL4 (1.538) and RPS15 (1.523) were found to be least stable RGs.

In BestKeeper analysis, UXT was found to be most stable with minimum SD value (0.180) followed by HPRT1, RPL4, GAPDH, ACTB, RPS9, RPS23, B2M, EEF1A1, RPS15 with the SD values of 0.240, 0.500, 0.600, 0.610, 0.680, 0.840, 1.350, 1.590, 1.750, respectively (Table 6). Additionally, high correlation coefficient was observed for RPS9/GAPDH (r = 0.973), B2M/ACTB (r = 0.942), EEF1A1/GAPDH (r = 0.931), EEF1A1/RPS9 (r = 0.923), RPS23/ACTB (r = 0.914) and B2M/EEF1A1 (r = 0.909) pair combinations. The best correlation between RGs and BestKeeper was observed for GAPDH (r = 0.967), RPS9 (r = 0.965), EEF1A1 (r = 0.960), ACTB (r = 0.929), B2M (r = 0.927) (Table 7). The high correlation values for these genes indicated their reliability as RGs, The GAPDH, RPS9 and ACTB were termed as best RGs on the basis of highest correlation value and less SD.

Table 6 Analysis of parameters based quantitative cycling points (CP) for 10 candidate RGs in LAY.
Table 7 Analysis of repeated pair-wise correlation amongst genes in LAY with BestKeeper index.

The RefFinder analysis also identified GAPDH, RPS9 and ACTB to be most stable RGs while RPS15, B2M and EEF1A1 were the least stable RGS in LAY. In the present investigation, all four methods geNorm, Normfinder BestKeeper and RefFinder have demonstrated that GAPDH, RPS9 and ACTB are the most stable RGs in PBMCs of LAY.

Ladakhi Donkey (LAD)

In Ladakhi donkey as well, the geNorm analysis showed mean expression stability values of 10 RGs within the acceptable range and varied from 0.250 (HPRT1 = B2M) to 1.405 (RPS9) (Table 3). The stability ranking of RGs was: HPRT1 = B2M > RPS23 > ACTB > EEF1A1 > GAPDH > UXT > RPL4 > RPS15 > RPS9 (Fig. 2 g). The B2M and HPRT1 RGs showed highest expression stability with lowest M value while RPS9 and RPS15 RGs showed least expression stability with highest M value. Based on pair-wise variation analysis (V value), V3/4 combination (B2M HPRT1 and RPS23) with V value of 0.142 was found to provide the most accurate normalization in Ladakhi donkey (Fig. 2 h). In Normfinder analysis as well; HPRT1 (0.123), B2M (0.324) and ACTB (0.518) were most stable with lowest values (Fig. 2i). On the other hand, the RPS9 (1.912), RPS15 (1.377) and RPL4 (1.366) RGs were least stable.

The BestKeeper analysis showed ACTB gene to be most stable with the lowest crossing point SD value of 0.295. This was followed by HPRT1, RPS23 and B2M RGs with SD value of 0.311, 0.318 and 0.472, respectively. On the other hand, RPS9 with highest crossing point SD value of 1.635 was found to be the least stable (Table 8). In addition, the inter-gene relation for 10 RGs pairs was also estimated. B2M/GAPDH (r = 1.0), HPRT1/B2M (r = 0.985), HPRT1/GAPDH (r = 0.985), B2M/EEF1A1 (r = 0.855) and EEF1A1/GAPDH (r = 0.854) showed the strong correlation coefficients (Table 9). The highly correlated RGs were combined into BestKeeper index and the correlation between each candidate RGs and BestKeeper was estimated. The relationship between RG and BestKeeper was described in terms of Pearson correlation coefficient (r), coefficient of determination correlation between BestKeeper and RGs was observed for HPRT1 (r = 0.942) and GAPDH (r = 0.941) followed by B2M (0.940) and EEF1A1 (0.927) genes. The statistically significant correlation shown by RGs (HPRT1, B2M) with the BestKeeper index appeared to be consistent with their evaluation as assessed by geNorm and Normfinder. RefFinder was another tool, were evaluating and identified RGs from comprehensive data set. HPRT1, B2M and ACTB were most stable and RPS9, RPS15 and GAPDH were least stable genes identified by RefFinder in LAD.

Table 8 Analysis of parameters based quantitative cycling points (CP) for 10 candidate RGs in LAD.
Table 9 Analysis of repeated pair-wise correlation amongst genes in LAD with BestKeeper index.

Chanthangi Goat (CHG)

The geNorm analysis of all the 10 candidate RGs in Changthangi goat exhibited mean expression stability (M) values well below 1.5 (Table 3). The stability ranking RGs were in the following order; RPS9 = HPRT > ACTB > RPS23 > EEF1A1 > UXT > GAPDH > RPL4 > RPS15 > B2M (Fig. 2j). The RPS9 and HPRT were most stable with lowest M value of 0.378 while RPS15 and B2M had maximum expression variability and highest M values of 1.474 and 1.721 respectively.

Further, the pair-wise variation analysis provided within the acceptable limit on sequential addition of another gene to the two most stably expressed genes, viz., B2M and HPRT1 the pair-wise combination V2/3 gave the acceptable V value of 0.143 (< 0.15) suggesting that the geometric mean between RPS9, HPRT1 and ACTB is optimal for data normalization in Changthangi goat (Fig. 2k). Similar to geNorm, Normfinder also identified RPS9 (0.310), RPS23 (0.477), ACTB (0.486) and HPRT1 (0.740) as most stable and B2M (2.517) and RPS15 (2.015) as least stably expressed genes (Fig. 2l, Table 3). There was good agreement between geNorm and Normfinder outcome, albeit slight variation was observed in the ranking of RGs. The BestKeeper algorithm showed consistent expression levels for all the RGs. RPS9 (0.176), exhibited low SD and 0.422 correlation coefficients in BestKeeper analysis, pointing towards their expression stability (Table 10). Additionally, RPS9/ACTB (r = 0.974), B2M/RPS23 (r = 0.801), B2M/EEF1A1 (r = 0.739), and RPS23/GAPDH (r = 0.712) showed the strong correlation coefficients (Table 11). B2M (0.847) showed the high correlation value but they showed the high fold change thus their reliability as a RGs is not applicable. RefFinder were identified the overall ranking of the gene. The ranking of genes was RPS9 (1), HPRT1 (2.38), ACTB (2.71), RPS23 (3.13), UXT (5.48), GAPDH (5.96), EEF1A1 (6.44), RPL4 (8), RPS15 (9), B2M (10). In the present investigation, all four algorithmic methods geNorm Normfinder, BestKeeper and RefFinder have demonstrated that RPS9, HPRT1 and ACTB are the most stable RGs in CHG.

Table 10 Analysis of parameters based quantitative cycling points (CP) for 10 candidate RGs in CHG.
Table 11 Analysis of repeated pair-wise correlation amongst genes in CHG with BestKeeper index.

Double hump Camel (DHC)

The geNorm analysis of 10 RGs showed M values ranging from 0.600 to 1.352 in double hump camel (Table 3). The M values for all the RGs were within the acceptable limit of < 1.5. On the basis of relative expression stability and stepwise exclusion, the ranking order of RGs was: B2M = RPS9 > HPRT1 > ACTB > RPS23 > EEF1A1 > UXT > GAPDH > RPL4 > RPS15 (Fig. 2m). The expression of RPS9 and B2M RGs with lowest M values of 0.600 were found to be most stable while RPL4 and RPS15 RGs with highest M values of 1.180 and 1.352, respectively were found to be least stable RGs in DHC. Based on pair-wise combination, the V values for V3/4, V5/6 and V6/7 and were close to the threshold value of 0.15. Therefore, the combination of V3/4 with ACTB, HPRT1 and B2M RGs should provide the accurate normalization of qPCR data in DHC (Fig. 2n).

In Normfinder analysis, the RGs were ranked as follows: HPRT1 > ACTB > B2M > RPS9 > UXT > EEF1A1 > RPS23 > GAPDH > RPL4 > RPS15 (Fig. 2o). The HPRT1 (0.295), ACTB (0.418), B2M (0.420), RPS9 (0.586) were four most stable RGs as per stability values.

In BestKeeper analysis, HPRT1 gene with the lowest crossing point SD value of 0.203 was found to be most stable. This was followed by EEF1A1, ACTB and RPS23 genes with SD values of 0.294, 0.377, and 0.462, respectively (Table12). On the other hand, RPS15, RPL4 and GAPDH RGs with high crossing point SD values of 1.61, 1.54, 0.86 respectively were found to be least stable. Strong correlation was observed in inter gene relationship of the RGs RPL4/UXT (r = 0.908), HPRT1/UXT (r = 0.884) and HPRT1/B2M (r = 0.755) (Table 13). The relationship between RGs and BestKeeper was described in terms of Pearson correlation coefficient (r), coefficient of determination correlation between BestKeeper and RGs was observed for HPRT1 (r = 0.797) and B2M (r = 0.809) followed by UXT, RPS9 and ACTB gene.

Table 12 Analysis of parameters based quantitative cycling points (CP) for 10 candidate RGs in DHC.
Table 13 Analysis of repeated pair-wise correlation amongst genes in DHC with BestKeeper index.

RefFinder based overall analysis resulted in stability ranking of RGs as; HPRT1 (1.32) > ACTB (2.63) >  B2M (2.71) >  RPS9 (2.99) > EEF1A1 (4.56) > RPS23 (5.60) > UXT (5.92) > GAPDH (8.00) > RPL4 (9.00) > RPS15 (10.00). Overall, HPRT1, B2M and ACTB were identified as the most appropriate RGs in high altitude adapted DHC using all four algorithms.

Zanskar Horses (ZAP)

The M values calculated using geNorm analysis for all the RGs in Zanskar ponies are shown in Table 3. Except, B2M and GAPDH RGs, the M values for all other RGs were within the acceptable limit of < 1.5. The M value for all the RGs in ZAP ranged from 0.135 to 1.721. The ranking order of RGs was as follows; RPS9 = RPL4 > RPS23 > EEF1A1 > UXT > RPS15 > HPRT1 > ACTB > B2M > GAPDH (Fig. 2p). The two most stable RGs with lowest M value were RPS9 and RPL4 (0.135) while GAPDH and B2M were the least stable RGs with M value of 1.712 and 1.394 respectively. Further, the V values for V2/3, V3/4, V4/5 and V5/6 were within the threshold limit of 0.15. Based on geNorm analysis, the geometric mean of RPS9, RPL4 and RPS23 RGs is likely to provide accurate normalization of gene expression data in ZAP (Fig. 2q).

In Normfinder analysis ranking of genes in high altitude ZAP from most stable to least stable was as follows: EEF1A1 (0.631), UXT (0.684), RPS9 (0.749), RPS23 (0.759), RPL4 (0.763), RPS15 (1.022), ACTB (1.36), HPRT1 (1.727), B2M (1.945), GAPDH (2.795) (Fig. 2r).

From BestKeeper algorthim, UXT gene revealed minimum SD value of 0.341 with smallest variation, followed by RPS23, RPS9, RPL4, EEF1A1, RPS15, ACTB, HPRT1, B2M and GAPDH with the SD value 0.434, 0.465, 0.469, 0.498, 0.816, 1.211, 1.226, 1.914, 2.099 respectively (Table 14). The best correlation between RGs and BestKeeper was observed for B2M (r = 0.947) and ACTB (r = 0.662) (Table 15). The high correlation values for these genes indicated their reliability as RGs.

Table 14 Analysis of parameters based quantitative cycling points (CP) for 10 candidate RGs in ZAP.
Table 15 Analysis of repeated pair-wise correlation amongst genes in ZAP with BestKeeper index.

In RefFinder analysis, RPS9, RPL4 and UXT were overall most stable while GAPDH, B2M and HPRT1 were the least stable. Based on all the methods; geNorm Normfinder, BestKeeper and RefFinder RPS9, RPL4 and UXT were observed to be most stable RGs in ZAP.

Discussion

These days, identification of appropriate RGs is a fundamental part of gene expression studies. It has been suggested in many reports10,26,38 that there are no panel of RGs that can be used universally for normalization of gene expression data. Several studies have been highlighted the importance of proper RGs for normalization of target genes29,30,39. Although, qPCR is a sensitive and efficient technique to quantify the expression profile of genes in different experimental conditions, there are several inevitable variations including mRNA quality and expression variability, identification of appropriate normalization factors becomes obligatory for accurate quantization of target genes expression profile. It becomes more imperative in comparative expression studies between different experimental conditions. To the best of our knowledge no such study has been reported in livestock species that are adapted to high altitude regions. In our study, a total of 10 candidate RGs that belonged to basic cellular processes from different functional categories were evaluated for their expression stability across high altitude adapted animals like Ladakhi Cattle, Ladakhi Yak, Double hump Camel, Ladakhi Donkey, Chanthangi Goat, Zanskar Horses. The panel of stable RGs in each livestock species were; EEF1A1, RPL4, RPS23 (Ladakhi cattle); GAPDH, RPS9, ACTB (Ladakhi yak); B2M, HPRT1, RPS23, ACTB (Ladakhi donkey); RPS9, HPRT1 ACTB (Changthangi goat); HPRT1, ACTB, B2M and RPS23 (Double hump camel); RPS9, RPL4, UXT (Zanskari ponies). For different species, panel of RGs identified in PBMCs were different. Similarly, several other studies conducted by our group has also reported different panel of RGs for different experimental condition; viz., RPS9 and RPS15 were identified as stably expressed RGs in PBMCs of Sahiwal cows and Murrah buffaloes under heat stress conditions22. Similarly, both genes were also recognized as stable RGs in mammary gland of dairy cows across different stages of lactation23. Beta-2M, RPS23, RPL4 and EEF1A1 as most trustworthy RGs in heat stressed mammary explants and mammary epithelial cells of buffaloes21,40. RPL4, EEF1A1, ACTB and GAPDH genes were found to be most stable genes in milk derived mammary epithelial cells in Sahiwal cows during different lactation stages41.

In similar lines, other groups have also identified suitable RGs in different cell types or experimental conditions in livestock species. For example; UCHL5, RPLP0 and TBP were identified as stable reference genes in whole blood samples from healthy and leukemia-virus infected cattle42. Interestingly, their studies, have identified ACTB and GAPDH, the two most commonly used RGs as the least stable genes. In another study, PPIA and RPLP0 were identified as most appropriate RGs in milk somatic cells while YWHAZ was identified as most stable RG in frozen whole blood of goats infected with caprine arthritis encephalitis virus43. The panel of RGs were also identified in PBMCs of goat infected with peste des petits ruminants virus44. They found that GAPDH and 18S rRNA were most stable while ACTB was least reliable in PPRV infected PBMCs of goat. Similarly, identification of stable RGs for transcriptomic studies in bulls for meat quality trait45 and muscles46 were also reported. Tanushree et al.47 identified another panel of RGs; GAPDH, RPS15 and HPRT for normalization of qPCR data in in-vitro fertilized and cloned embryos of riverine buffaloes. These studies clearly emphasized the fact that there is a need to identify panel of stably expressed genes for individual cell types, species and experimental conditions so as to achieve accurate normalization and consistency in RT-qPCR results. In the present investigation, species wise most stable RGs were identified using geNorm, Normfinder, BestKeeper and RefFinder analysis which could be quite useful in normalization of expression data in PBMC of different species adapted to high altitude environments, substantiating the importance of RGs for particular experimental conditions30. To the best of our knowledge, this is the first systematic attempt to identify panel of RGs across different species types adapted to high altitude hypoxia conditions.

Conclusion

Use of reference genes or internal control genes (ICGs) or housekeeping (HKGs) genes with constant expression level between samples in response to experimental treatment or physiological state, are now considered as effective method for normalization of transcriptional data to account for the experimental variations11. In the present study, species wise panel of RGs were identified such as ACTB, RPS15 in Ladakhi cattle; GAPDH, RPS9 in Ladakhi yak; B2M, HPRT1 in Ladakhi donkey; HPRT1, RPS9 in Changthangi goat; B2M, HPRT1 in Double hump camel and RPS9, RPL4 in Zanskar ponies. To the best of our knowledge, this is the first systematic attempt to identify panel of reference genes across different livestock species adapted to high altitude region Leh-Ladakh. The data presented here could be used as a resource to select most suitable reference for accurate normalization of transcriptional data during all future studies resembling the experimental conditions highlighted in this study.

Methods

Livestock species, sampling and PBMCs isolation

The sampling was done in accordance with the guidelines and regulations of Institutional Animal Ethics Committee (IAEC). All the detailed related to animals experiment as per the ARRIVE guideline. All the procedures were approved by the animal ethics committee of ICAR-NBAGR. For this study, about 7–8 ml of blood was collected from jugular veins using EDTA vacutainer tubes from 32 individuals representing 6 native livestock species that are native of Leh-Ladakh region of India and well adapted to cold arid hypoxia conditions. For sampling, 6 young females of around 1–2 years age of each of Ladakhi cattle (LAC), Ladakhi yak (LAY) and 5 each of Ladakhi donkey (LAD), Changthangi goat (CHG), Zanskar ponies (ZAP) and Double hump camel (DHC) were randomly selected from the breeding tract of these populations. The geographical coordinates of sampling site were latitude-34° 9′ 9.3168″ N, and longitude 77° 34′ 37.3764″ E. The blood samples blood samples were transported to the laboratory for further processing and isolation of peripheral blood mononuclear cells (PBMCs). The PBMCs were isolated within 2–3 h of blood sample collection. The density gradient centrifugation procedure adopted for purification of PBMCs has been described in one of our previous publication28. The entire workflow of the experiment was showed in Fig. 3.

Figure 3
figure 3

The entire workflow of the qPCR experiment conducted in PBMCs of different species adapted to cold arid hypoxia environment. Maps was made using Mapchart (https://www.mapchart.net/india.html).

Purification of total RNA and cDNA synthesis

For isolation of total RNA, the purified PBMCs were suspended in 1.0 ml Trizol reagent (Thermo Fisher Scientific, USA). After homogenization, the standard protocol based on chloroform and isopropanol extraction was followed to isolate the total RNA. The total RNA was further purified by employing silica-membrane RNeasy spin columns (Qiagen, Germany) along with on column digestion by DNase 1 enzyme (Qiagen, Germany) to remove traces of genomic DNA. The concentration and purity of extracted was measured using Nano view plus (Biohrome Spectros, USA). The integrity of each RNA sample was also confirmed by presence of 28S and 18S ribosomal bands on 1.5% agarose gel.

cDNA synthesis and real time quantitative PCR (RT-qPCR)

The first strand cDNA synthesis was carried out using RevertAid First Strand cDNA Synthesis Kit (Thermo Fisher Scientific, CA, USA). First strand cDNA was synthesized using 200 ng of purified RNA, oligo-dT (18) primer, dNTP mix, random primers, RiboLock™ RNase inhibitor, M-MuLV reverse transcriptase supplied with RevertAid First Strand cDNA Synthesis kit (Thermo Scientific, CA, USA). The reaction for cDNA synthesis was set up using the program: 25 °C for 5 min, 50 °C for 60 min, and 70 °C for 15 min. The cDNA sample was diluted 1:4 (v:v) with DNase⁄RNase free water. Before subjecting for qPCR reactions, each of the cDNA samples was amplified using GAPDH in a semi-quantitative PCR. This step was done to ensure the quality of all the 32 first strand cDNA synthesized from PBMCs of 6 livestock species. The amplified products were checked on 2.5% agarose gel to ensure specific amplification. A total of 10 potential candidate RGs viz., GAPDH, ACTB, RPS9, RPS15, RPS23, B2M, EEF1A1, RPL4, UXT and HPRT1 were evaluated in this study. The purpose of evaluating the stability ranking of these 10 RGs was to provide most appropriate panel of RGs in each of six livestock species of Leh-Ladakh so that any future transcriptional data could be normalized accurately. All relevant details like gene name, primer sequences, melting temperature etc. are tabulated in Table 1.

The qPCR reactions were performed in a final volume of 10 μL containing 4 μL diluted cDNA combined with 6 μL of master mix composed of 5 μL Maxima SYBR Green/ROX qPCR master mix (2X) (Fermentas Thermo, USA), 0.4 μL each of 10 μM forward and reverse primers, and 0.2 μL DNase/RNase free water. All the reactions were performed in duplicate along with six-point standard curve along with non-template control with following amplification conditions; 2 min at 50 °C, 10 min at 95 °C, 40 cycles of 15 s at 95 °C (denaturation) and 1 min at 60 °C (annealing + extension) in a Step one plus real time PCR instrument (ABI, California). For standard curve of each primer pair, fivefold serial dilution was made using pooled cDNA samples. The qPCR expression data for each gene was extracted in the form of crossing points and data was subjected for subsequent analysis.

Identification of reference genes and statistical analysis

In order to evaluate the expression stability of RGs in individual species, 10 candidate genes viz., GAPDH, ACTB, RPS9, RPS15, RPS23, B2M, EEF1A1, RPL4, UXT and HPRT1 from different functional categories were selected. four independent statistical approaches viz. geNorm11, Normfinder48, BestKeeper49 and RefFinder50 were used to identify most stable RGs.

The geNorm software measure the expression stability as M value which is based on overall pairwise comparison among the reference genes. The M value is inversely correlated to gene expression stability and ranks the RGs accordingly. In addition, pair wise variation analysis (V values) was also carried out using geNorm software to select optimal number of RGs to be used for normalization of target gene data. Normfinder algorithm determined the optimal RGs and the combination of two genes for a two-gene normalization factor with its corresponding stability value. The BestKeeper analysis is based on pairwise comparisons of raw cycle threshold (Ct), values of each gene. The result of BestKeeper analysis is displayed as standard deviation (SD) and coefficient of variance (CV). BestKeeper software calculated the descriptive statistics of every candidate gene and excludes the genes having standard deviation (SD) greater than 1, lower the standard deviation more is the stability of genes.

The data was analysed by direct comparing the Ct values in geNorm and Normfinder. The relative Ct values based on comparative Ct-method were the input data for geNorm and Normfinder11,51 wherein, the average Ct value of each duplicate reaction was converted to relative quantity data [transformed using comparative Ct method as Efficiency (minimum Ct − sample Ct)] with the highest expression level set to 1. As input for BestKeeper analysis, the average Ct value of each duplicate reaction was used directly (without conversion to relative quantity).

Animal ethics

All the experimental procedure was done in accordance with the guidelines and regulations of Institutional Animal Ethics Committee (IAEC), ICAR-National bureau of animal genetic resources (ICAR-NBAGR), Karnal, Haryana, India.