Background

Real-time quantitative PCR (qPCR) is an efficient method for quantification of mRNA transcription levels due to its high sensitivity, reproducibility and large dynamic range; in addition, real-time qPCR is fast, easy to use and provides simultaneous measurement of gene expression in many different samples for a limited number of genes. One of the critical steps in comparing transcription profiles is accurate normalisation, therefore, a number of variables should be controlled such as amount of starting material, enzymatic efficiencies, differences in transcriptional activity and presence of inhibitors in different sample materials. One way to standardize samples is to quantify the starting material i.e. number of cells or quantity of RNA. However, this does not take the RNA quality and enzymatic efficiencies into account. To control these variables, which are not a result of the experimental design, normalisation using reference genes are routinely used e.g. [13]. The accurate quantification of reference genes allows normalisation of differences in amount of the amplified cDNA in individual samples. The normalisation adjusts for differences in amount and quality of starting material and differences in RNA preparation and cDNA synthesis, since the reference gene is exposed to the same preparation steps as the gene of interest. Recently a set of reference genes (ACTB, TBP and topoisomerase (DNA) II beta (TOP2B)) have proven suitable for normalisation of qPCR data of backfat and longissimus dorsi muscle in the pig [4].

In the present study the expression stability and expression level of nine potential reference genes have been compared in 17 different pig tissues. This has enabled us to assess the suitability of the genes for normalisation of mRNA across several tissues leading to the identification of the best reference genes for studies of high and low abundance transcripts, respectively. The suitable reference genes from this study can be used for normalisation of qPCR data in several tissues. Thus, the study gives useful information of importance for a broad range of different functional studies.

Results

Candidate reference genes

Nine housekeeping genes were selected from commonly used reference genes [57]. Gene symbols, their full names, functions and pig EST accession numbers are listed in Table 1. Genes with different functions were chosen in order to avoid genes belonging to the same biological pathways that may be co-regulated. The porcine sequences of the genes were obtained by fasta search with the human cDNA sequence for each gene against a porcine EST database[8].

Table 1 Selected candidate reference genes

QPCR efficiency and variability

Standard curves were generated using relative concentration vs. the threshold cycle (Ct). The linear correlation coefficient (R2) of all the nine genes ranged from 0.998 to 1.000. Based on the slopes of the standard curves, the amplification efficiencies of the standards ranged from 93%~103%, (derived from the formula E = 10 1/-slope -1). This calculation method results in efficiencies higher than 100% which is an overestimate of the "real efficiency" [9]. The not normalised Ct values of all the nine genes in all the samples were within 13.5 to 30.5 cycles, covered by the range of the standard curves.

Expression level and stability of putative reference genes in various tissues

The range of expression stability calculated by geNorm [6] in the genes analysed was (from the most stable to the least stable): ACTB/RPL4, TBP, HPRT1, HMBS, YWHAZ, SDHA, B2M and GAPDH (Figure 1). The M values for ACTB, RPL4, TBP, HPRT1, HMBS, YWHAZ and SDHA were lower than 1.5, and therefore it was concluded that these genes have comparable stability in different pig tissues. For calculation of normalization factor (NF) geNorm suggested the five most stable reference genes. The NF was calculated based on the geometric mean of the Ct values. Normalised Ct values are shown in Table 2. After normalization against the NF, the ranking of the relative expression levels was (from high to low): B2M, RPL4, ACTB, GAPDH, YWHAZ, SDHA, HPRT1, TBP and HMBS (Figure 2). Furthermore, HMBS, and GAPDH showed tissue-specific regulation, i.e. HMBS is clearly up-regulated in bone marrow and GAPDH is clearly up-regulated in muscle tissue.

Figure 1
figure 1

Gene expression stability of candidate reference genes. Gene expression stability of candidate reference genes in different pig tissues analyzed by the geNorm program. Threshold for eliminating a gene as unstable was M ≥ 1.5. The respective genes and gene names are: beta-actin (ACTB), beta-2-microglobulin (B2M), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), hydroxymethylbilane synthase (HMBS), ribosomal protein L4 (RPL4), succinate dehydrogenase complex subunit A (SDHA), TATA box binding protein (TPB) and tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein zeta polypeptide (YWHAZ).

Figure 2
figure 2

The relative expression level of the reference candidate genes in pig tissues. The relative expression level of the reference candidate genes in pig tissues. HMBS was the lowest expressed gene and B2M was the highest expressed gene. The respective genes and gene names are: beta-actin (ACTB), beta-2-microglobulin (B2M), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), hydroxymethylbilane synthase (HMBS), ribosomal protein L4 (RPL4), succinate dehydrogenase complex subunit A (SDHA), TATA box binding protein (TPB) and tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein zeta polypeptide (YWHAZ).

Table 2 RNA transcription levels (normalised Ct values) of the candidate reference genes. Values in parentheses shows the ranking of genes in the specific tissue calculated by GeNorm. Values in bold have an M-value above 1.5.

Discussion

For an exact comparison of mRNA transcription in different samples or tissues it is crucial to choose the appropriate reference gene. The optimal reference gene should be constantly transcribed in all types of cells at any time in cell cycle and differentiation. Moreover the transcription of such a gene should not be regulated by internal or external influences, at least not more than the general variation in RNA synthesis. The reference gene used for normalisation of gene expression in real-time qPCR studies should also pass through the same steps of analysis as the gene to be quantified. However, such a perfect reference gene does probably not exist. The stability in expression of often used reference genes such as GAPDH and ACTB has been shown to vary considerably and are consequently unsuitable as reference genes for normalisation of gene expression analysis in some cases [5, 1012]. Also the low expressed reference gene TBP is highly regulated when comparing normal and tumour tissues [13].

Numerous studies have been carried out in order to evaluate reference genes in specific tissues in several species. The majority of these studies are directed towards specific tissues [1, 3, 7, 14]. They clearly demonstrate that it is very difficult to find a 'universal' reference gene having stable expression in all cell types and tissues, and in particular to find reference genes that remain stable between samples taken at different time points under different experimental conditions. The first priority, however, is to identify genes with stable expression preferably across cell types since many real-time qPCR studies are performed on cDNA isolated from tissues with a mixed cell population.

There have been limited experiments of reference gene selection for use in other livestock and production animal species. Bogaert et al. [15] proposed ubiquitin (UBB), ACTB and B2M as reference genes for normalisation of qPCR data for normal equine skin. In cattle studies, De Ketelaere et al. [16] selected SDHA, YWHAZ, and 18S rRNA as being the most stable genes for accurate normalisation of qPCR of bovine polymorphonuclear leukocytes and Goossens et al. [2] found YWHAZ, GAPDH and SDHA to be the most stable reference genes across different preimplantation embryonic stages. Common for the bovine studies are the target of the studies, which are single cell populations. Garcia-Crespo et al. [17] compared expression of six potential reference genes in sheep tissues and found GAPDH with the lowest variation among the panel of six tissues, whereas ACTB and YWHAZ showed the worst score in variability. This is not in agreement with our findings but can be a result of the tissue composition. At present three studies have examined reference genes in pig i.e. Foss et al. [18] describes the use of GAPDH, ACTB and HPRT in immune cells and tissues using northern blot and PCR. In this study GAPDH was shown to be more stable than ACTB. Erkens et al [4] validated mRNA expression stability of 10 reference genes in porcine backfat and longissimus dorsi muscle. The most stable reference genes suitable for normalisation of qPCR data of backfat and longissimus dorsi muscle in the pig was ACTB, TBP and TOP2B. Kuijk et al. [19] investigated transcription levels of seven frequently used reference genes (inclusive B2M, ACTB, GAPDH) at different stages of early porcine embryonic development. Our study is the first detailed study of the stability and level of pig reference genes across a large number of tissues. Our findings confirm studies demonstrating tissue specific regulation of some of the commonly used housekeeping genes. Furthermore, it provides recommendations for choice of reference genes in studies where high and low abundant transcripts are under investigation. In agreement with the findings of Erkens et al. [4] both TBP and ACTB are suitable reference genes. However, our data show that ACTB is most relevant for high abundant transcripts, while TPB is most relevant for low abundant transcripts. It is clear from both Erkens et al. [4] and our study that GAPDH is quite unstable and not suitable as a reference gene.

Conclusion

Our study demonstrates that expression stability varies greatly between genes. Seven of the nine genes investigated (i.e. ACTB, RPL4, TBP, HPRT1, HMBS, YWHAZ, SDHA) were found to have a high stability across tissues. Two of the genes investigated were shown to have tissue-specific regulation, i.e. HMBS is clearly up-regulated in bone marrow and GAPDH is clearly up-regulated in muscle. Based on both expression stability and expression level, our data suggest that ACTB and RPL4 are good reference genes for high abundant transcripts while TPB and HPRT1 are good reference genes for low abundant transcripts in expression studies across different pig tissues.

Methods

Candidate reference genes and primer design

Nine housekeeping genes were selected from commonly used reference genes [57]. The porcine sequences of the genes were obtained by fasta search with the human cDNA sequence for each gene against a porcine EST database[8]. The consensus sequences were used for comparison with genomic porcine sequences, if available, to reveal the exon-intron structure. When no genomic porcine sequence was available the gene structure was obtained by comparison of the pig sequence with human and mouse genomic sequence assuming that the exon-exon boundaries are conserved between human, mouse and pig.

All the primers were designed by Primer 3 software [20]. For information on primer sequences see Table 3. Primers spanning at least one intron were chosen to minimize inaccuracies due to genomic DNA contamination. If the genes had known pseudogenes, primers were chosen according to the alignment results between the genes and the pseudogenes, so the primers were unique to the genes and different from the pseudogenes. The secondary structure of the amplicons was analyzed by Mfold using the criteria -3<dG<0 [21] to optimize the PCR efficiency. Primers and amplicons were in silico verified with BLASTN for specificity and the size of the PCR products was confirmed with gel electrophoresis.

Table 3 Primers, PCR conditions and qPCR efficiency

RNA extraction

Seventeen different porcine tissues were collected from three young female siblings. Total RNA was extracted using different protocols depending of the tissue: TRIreagent® (Molecular Research Centre, inc.) for liver, kidney, thymus, RNeasy lipid kit (Qiagen) for adipose (subcutaneous), cortex cerebri, cerebellum, hippocampus, lymph nodules (jejunal), RNeasy Fibrous Kit (Qiagen) for muscle (longissimus dorsi), heart (muscle), skin (dermis and epidermis) and RNeasy kit (Qiagen) for pancreas, bone marrow, bladder, lung, stomach (mucosal membranes), small intestine (mucosal membranes) according to each manufacturer protocol. Contaminating DNA was degraded by treating each sample with RQ1 RNase-free DNase (Promega) according to the instructions manual, followed by a spin-column purification (Qiagen RNeasy). The total RNA was quantified by optical density and the quality was evaluated by gel electrophoresis. Intact rRNA subunit of 28S and 18S were observed on the gel indicating minimal degradation of the RNA.

cDNA synthesis

One μg of total RNA was reverse transcribed at 42°C using Improm-II™ reverse trancriptase (Promega) and Oligo(dT) according to the manufacturers recommendations. Prior to use in qPCR cDNA was diluted 1:8 with H2O.

Quantitative PCR with SYBR green

For each transcript a standard curve was constructed using the purified PCR product generated for each specific primer pair. Single reactions were prepared for each cDNA along with each serial of dilution using the Brilliant® SYBR® Green Master Mix (Stratagene). Each PCR reaction also included a reverse transcription negative control (without reverse transcriptase) to confirm the absence of genomic DNA, a non template negative control to check for primer-dimer and a porcine genomic DNA control to verify no specific amplification with the primers. Each reaction consisted of 20 μl containing 2 μl of cDNA and 5 pmol of each primer. The real time qPCR was run on MX3000p (Stratagene). The cycling conditions were 1 cycle of denaturation at 95°C/10 min, followed by 40 three-segment cycles of amplification (95°C/30 sec, 58°C–63°C (gene depending, see table 2)/1 min, 72°C/30 sec) where the fluorescence was automatically measured during PCR and one three-segment cycle of product melting (95°C/1 min, 55°C/30 sec, 95°C/30 sec). The baseline adjustment method of the Mx3000 (Stratagene) software was used to determine the Ct in each reaction. A melting curve was constructed for each primer pair to verify the presence of one gene-specific peak and the absence of primer dimmer. All samples were amplified in duplicates and the mean was used for further analysis.

Analysis of expression stability

The gene expression levels were measured by real-time qPCR, and the expression stabilities were evaluated by the M value of geNorm [6]. The M value for each reference gene is the average pairwise variation for that gene with all the other tested control genes. Stepwise exclusion of the gene with the highest M value allows ranking of the tested genes according to their expression stability.