Evaluation of reference genes for transcript analyses in Komagataella phaffii (Pichia pastoris)

Besleaga, Mihail; Vignolle, Gabriel A.; Kopp, Julian; Spadiut, Oliver; Mach, Robert L.; Mach-Aigner, Astrid R.; Zimmermann, Christian

doi:10.1186/s40694-023-00154-1

Evaluation of reference genes for transcript analyses in Komagataella phaffii (Pichia pastoris)

Methodology
Open access
Published: 29 March 2023

Volume 10, article number 7, (2023)
Cite this article

Download PDF

You have full access to this open access article

Fungal Biology and Biotechnology Aims and scope Submit manuscript

Evaluation of reference genes for transcript analyses in Komagataella phaffii (Pichia pastoris)

Download PDF

Mihail Besleaga¹^na1,
Gabriel A. Vignolle^1,2^na1,
Julian Kopp¹,
Oliver Spadiut¹,
Robert L. Mach¹,
Astrid R. Mach-Aigner¹ &
…
Christian Zimmermann¹

2244 Accesses
2 Citations
2 Altmetric
Explore all metrics

Abstract

Background

The yeast Komagataella phaffii (Pichia pastoris) is routinely used for heterologous protein expression and is suggested as a model organism for yeast. Despite its importance and application potential, no reference gene for transcript analysis via RT-qPCR assays has been evaluated to date. In this study, we searched publicly available RNASeq data for stably expressed genes to find potential reference genes for relative transcript analysis by RT-qPCR in K. phaffii. To evaluate the applicability of these genes, we used a diverse set of samples from three different strains and a broad range of cultivation conditions. The transcript levels of 9 genes were measured and compared using commonly applied bioinformatic tools.

Results

We could demonstrate that the often-used reference gene ACT1 is not very stably expressed and could identify two genes with outstandingly low transcript level fluctuations. Consequently, we suggest the two genes, RSC1, and TAF10 to be simultaneously used as reference genes in transcript analyses by RT-qPCR in K. phaffii in future RT-qPCR assays.

Conclusion

The usage of ACT1 as a reference gene in RT-qPCR analysis might lead to distorted results due to the instability of its transcript levels. In this study, we evaluated the transcript levels of several genes and found RSC1 and TAF10 to be extremely stable. Using these genes holds the promise for reliable RT-qPCR results.

Selection and Validation of Reference Genes for Gene Expression Studies in Codonopsis pilosula Based on Transcriptome Sequence Data

Article Open access 28 January 2020

Selection and evaluation of reference genes for RT-qPCR expression studies on Burkholderia tropica strain Ppe8, a sugarcane-associated diazotrophic bacterium grown with different carbon sources or sugarcane juice

Article 17 August 2016

Selection and validation of reference genes for RT-qPCR indicates that juice of sugarcane varieties modulate the expression of C metabolism genes in the endophytic diazotrophic Herbaspirillum rubrisubalbicans strain HCC103

Article 10 July 2017

Background

The ascomycetous yeast Komagataella phaffii (previously Pichia pastoris [1]) is routinely used for heterologous protein expression due to its fast growth rates and cheap cultivation conditions—compared to mammalian cell cultures, and its capability to secrete high levels of recombinant proteins [2]. Moreover, a large genetic toolbox is available for K. phaffi making it an interesting workhorse for the production of recombinant proteins in academia as well as the biopharmaceutical industry [3]. K. phaffi was also suggested to be used as a model organism for yeasts as it has undergone a slower evolution than Saccharomyces cerevisiae and retained more characteristics of ancient yeasts and other metazoan organisms [4]. Consequently, different aspects of general biology and especially the protein production and secretion pathway have been studied over the years (reviewed in [3,4,5,6,7,8]). In many of these studies, reverse transcription quantitative PCR (RT-qPCR) analyses are used to determine the transcript levels of native and heterologous genes [9,10,11,12,13,14,15,16,17,18,19,20].

RT-qPCR is a routinely used method for the quantification of individual transcripts in biological samples. In brief, first, the RNA is extracted and then reverse transcribed yielding cDNA. This cDNA is then used as template in a qPCR reaction. The design and choice of primers determine which sequence is to be amplified and therefore quantified. Generally, qPCR assays can be used to calculate the absolute number of targets in a sample, when a set of serial standard dilutions is used to infer a standard curve. However, RT-qPCR assays are mostly performed to compare the abundance of certain transcripts among two or more samples. By an appropriate choice of samples, we can evaluate the influences of different stimuli, cultivation time, developmental state, and many other factors on the transcript abundance of the chosen genes of interest.

Currently, different mathematical methods are used to calculate the relative transcript levels and compare different samples. The commonly used “Delta–delta method” was first described in the Applied Biosystems User Bulletin No. 2 (P/N 4303859). It presumes an identical and perfect efficiency of target and reference genes. In contrast, the model of Pfaffl takes different efficiencies into account [21]. The ‘efficiency calibrated mathematical method for the relative expression ratio in real-time PCR’ was partially published in an internal magazine of Roche (Biochemica No. 2 2001). According to Pfaffl [21], the Roche model is mathematically hard to follow but delivers identical results as Pfaffl’s method. The standard curve based method of Larionov et al. does not need to take efficiencies into account, as it is based on dilution standards [22]. Notably, all these calculation methods rely on the usage of a reference value to compare different samples. Current state-of-the-art is the usage of internal reference values i.e., reference genes. These are genes with constant transcript levels in all samples. Therefore, their abundance in a sample is a representation of the sample itself. The more reference genes transcript is present, the more RNA was isolated and successfully transcribed. Normalization to reference genes aims to mitigate or even nullify differences in the relative abundance of the different types of RNA (rRNA, tRNA, mRNA, etc.) or in the efficiency of the reverse transcription among the different samples.

Based on this central role for the calculation of relative transcript levels, the choice of reference genes is of utmost importance and must be carefully validated. This is stressed in the widely accepted ‘Minimum Information for Publication of Quantitative Real-Time PCR Experiments’ (MIQE) guidelines [23] and was previously discussed and demonstrated in other fungi such as S. cerevisiae [24], Aspergillus spp. [25, 26], and Trichoderma reesei [27]. The MIQE guidelines further state that at least two reference genes should be used for accurate normalization [23]. In K. phaffi, ACT1 (encoding for the cytoskeleton-monomer, actin) is often used as reference gene in RT-qPCR assays [9, 10, 12, 14,15,16,17,18,19,20]. To our knowledge, this gene is used in the K. phaffi community out of habit and due to the lack of validated reference genes.

In this study, we assessed several publicly available RNA-Seq data sets and ranked all annotated genes according to their variation among the samples. We manually curated this list and chose eight stably expressed genes. To test and validate these genes and ACT1, we compiled a test set of 35 independent samples. These samples cover a broad range of typical and atypical cultivation conditions and stresses and originate from three different K. phaffi strains. The transcript levels of the eight potential reference genes and ACT1 were measured in all samples and evaluated with routinely used tools (i.e., the comparative Delta Ct method [28], BestKeeper [29], Normfinder [30], Genorm [31], and RefFinder [32]).

Results

Identification of stably expressed genes by the assessment of RNA-Seq data

To identify potential reference genes in K. phaffii, we searched for stably expressed genes in publicly available RNASeq data (Table 1). The used data sets cover a variety of different culturing conditions and sampling timepoints of the strain GS115. We compared the data sets “Glycerol”, “Methanol”, “Glycerol and Methanol” and “YNBE” to “Glucose” each and rigorously filtered (robust base mean coverage and low log2 Fold Change, details in the Materials and Methods section) for genes with stable expression in each of those four comparisons (Additional file 1: Table S1). Next, we narrowed the gene list down to those genes that appeared in each comparison, giving us a final set of 38 genes (Additional file 2: Table S2).

Table 1 RNA-Seq datasets used in this study

Full size table

Comparison and evaluation of refences genes

Next, we manually picked 8 genes to be experimentally tested for stable expression in K. phaffii (Table 2). We decided to exclude genes without homologs in S. cerevisiae and without gene description. Further, we omitted all genes whose products are potentially somehow involved in or influenced by mechanisms of protein expression or secretion (e.g., ER-residing proteins). Hence, we focused on genes whose products are involved in nuclear processes (e.g., ribosome synthesis, epigenetics, transcription). We also included the currently often used reference gene, ACT1 in our evaluation experiment, although ACT1 was not identified as stably expressed (Additional file 2: Table S2).

Table 2 Potential reference genes to be empirically evaluated

Full size table

To test the applicability of the potential reference genes we compiled a sample set from three K. phaffii strains covering a broad range of cultivation conditions, such as different carbon sources, cultivation methods, cultivation stage and different stress types (Table 3). The samples were designed to represent typical cultivation conditions in K. phaffii experiments. Thus, we reason that genes found to be expressed stably in these samples might be suitable reference genes for future transcript analyses. Notably, no biological replicates of the cultivation conditions were performed, as we did not aim to determine the concrete transcript levels and performance of the genes in each sample. In contrast, our goal was to create a sample set with a large diversity to generate data on the overall applicability and robustness of the genes.

Table 3 Samples used for reference gene evaluation

Full size table

The samples were subjected to RT-qPCR analyses measuring the transcript levels of the potential reference genes. The obtained Ct values (Additional file 3: Table S3) were entered into the RefFinder [32] online tool at http://blooge.cn/RefFinder/. This tool integrates the four commonly used tools, the comparative Delta Ct method [28], BestKeeper [29], Normfinder [30] and Genorm [31] and calculates and comprehensive value based on the results of the four individual results (Additional file 3: Table S3 and Fig. 1). Out of the nine tested genes, RSC1 and TAF10 had the lowest variability and thus the most stable transcript levels. In contrast, ACT1 had a substantially higher “comprehensive gene stability” value (Fig. 1 and Additional file 3: Table S3) and was thus less stably expressed in the tested samples.

Discussion

The choice of a reference gene is an essential and crucial part of a transcript analysis via RT-qPCR. The abundance of a reference gene’s transcript shall represent the overall amount of isolated RNA and reverse transcribed cDNA. Consequently, using genes with fluctuating transcript levels is detrimental and prevents a reliable transcript analysis. In this study, we filtered publicly available RNA Seq data set from K. phaffii GS115 for genes with a low overall log2 Fold Change in different data set comparison and manually revised the gene list to obtain potential reference genes. We then tested and evaluated the applicability of these genes experimentally by creating a diverse set of samples from three K. phaffii strains under a broad range of cultivation conditions. The transcript levels of the genes were then compared and evaluated using five broadly used tools for reference gene evaluation.

The obtained results demonstrate that the often-used ACT1 gene is not as stably expressed as other genes (Fig. 1) in our tested samples (Table 3). The stability value (calculated with geNORM finder) of ACT1 of 0.656 lies above the suggested cut-off of 0.5 according to [36]. Further, the average standard deviation (calculated with BestKeeper) of ACT1 lies with 1.029 just above the suggested cut-off of 1.0 according to [29]. This is in accordance with previous findings in S. cerevisiae [24], where ACT1 was also shown to be unsuitable as reference gene. Consequently, we strongly discourage the usage of this gene as reference gene in future RT-qPCR assays in K. phaffii.

The transcript levels of the genes RSC1 and TAF10 exhibited generally low log2 Fold Changes during the comparison of the RNASeq data (Additional file 1: Table S1) and had the lowest “comprehensive gene stability” in our experimental gene evaluation assay. RSC1 is part of the RSC chromatin remodeling complex, while TAF10 is a subunit of both the TFIID complex and the SAGA complex. Both complexes are involved in the gene transcription by the RNA Polymerase II [37]. Based on their biological roles, it comes to no surprise that RSC1 and TAF10 are expressed stably. We consider both genes as suitable reference genes and recommend the simultaneous usage (according to the MIQE guidelines [23]) for relative transcript analyses in K. phaffii.

Materials and methods

Assessment of RNA-Seq data

To search for genes with stable transcript levels, we used 11 publicly available RNA-Seq data sets from K. phaffii GS115. These datasets were derived from three studies and encompass 9 different sampling conditions (Table 1). The RNA-Seq datasets were downloaded from the Sequence Read Archive (SRA) [38] of the National Center for Biotechnology Information (NCBI), with a total of 67.429 G bases and 20.597 Gb. We grouped the different data sets into five sample sets based on the added carbon source (“Glucose”, “Glycerol”, “Glycerol and Methanol”, “Methanol”, “YNBE”) according to the description in the original studies (Table 1). The raw reads were inspected using FastQC v0.11.5 and then analyzed and quality trimmed using Trimmomatic [39]. We extracted a reference transcriptome using gffread v0.12.7 [40] from the reference genome of K. phaffii GS115 [41, 42]. Next, we used salmon 1.4.0 [43] to create a salmon index on the reference transcriptome and quantified each of the datasets, including the –gcBias flag to account for the effects of sample specific biases such as fragment-level GC bias. The quantification results were imported into the R environment and analyzed with the DESeq2 package [44], the tximport package [45], and the Bioconductor package [46].

Next, we compared the following sample sets using DEseq2: glycerol vs. glucose, glycerol and methanol vs. glucose, methanol vs. glucose and YNBE vs. glucose. For each comparison we calculated the Log fold change shrinkage (LFC) the expressed genes. These were then filtered for genes with a base mean coverage of > 500 and < 3000, to avoid false positives and potential sequencing artifacts, and a log2 Fold Change of < 0.05 and > -0.05 to screen for genes with low changes of the transcript levels within the different data sets (same carbon source, different time points). The obtained gene lists were then compared and filtered for genes appearing in each comparison to obtain a list of genes with stable transcript levels in different carbon sources compared to “Glucose”.

Fungal strains and cultivation conditions

The K. phaffi strains CBS 7435, GS115 and BSYBG11 (constitutively expressing a recombinant protein) were pre-cultivated in Yeast nitrogen base media (YNBM) for 20 h at 30 °C, 230 rpm in a shaking incubator (Multitron, Infors HT, Basel, Switzerland). The YNBM consisted of potassium phosphate buffer (pH 6.0), 0.1 M; Yeast Nitrogen Base w/o Amino acids and Ammonia Sulfate (BD Difco, Difco Laboratories Incorporated, part of BD, Franklin Lakes, NJ, USA), 13.4 g/L; (NH₄)₂SO₄, 10 g/L; biotin, 400 mg/L; glucose, 20 g/L [47]. For GS115, Histidine was added to cultivation resulting in a final concentration of 20 mg/L. Once the complete substrate in the initial culture was consumed (at line-determined via Cedex measurements, as described previously [48]), the cultures were used as inoculum (representing 10% of the final volume) for further cultivations. The pre-cultures were used to inoculate shake flasks containing specific medium and conditions (Table 3, samples 2–29). Samples were taken after 1 h or 8 h. Sample 1 was taken from the pre-culture of CBS 7435 directly after glycerol depletion, whereas sample 30 was taken 8 h after glycerol depletion from the pre-culture of BSYBG11.

The low salt lysogenic broth (LB) medium consisted of tryptone, 10.0 g/L; NaCl, 5.0 g/L; yeast extract, 5.0 g/L. The BMGY media consisted of yeast extract, 10.0 g/L; peptone, 20.0 g/L; potassium phosphate buffer (pH 6.0), 0.1 M; Yeast Nitrogen Base w/o Amino acids and Ammonia Sulfate (Difco), 13.4 g/L; biotin, 400 mg/L; glycerol, 10.0 g/L. The DSMZ medium consisted of yeast extract, 3.0 g/L; malt extract, 3.0 g/L; peptone from soybeans, 5.0 g/L; glucose, 10.0 g/L. Basal salt medium (BSM) consists of 85% (v/v) phosphoric acid, 26.7 mL/L; CaSO₄*2H₂O, 1.17 g/L; K₂SO₄, 18.2 g/L; MgSO₄*7H₂O, 14.9 g/L; KOH, 4.13 g/L; glycerol, 20 g/L supplied with trace elements [49].

The strain BSYBG11 constitutively expressing a recombinant protein was also cultivated in a Minifors 2 bioreactor system (max. working volume: 2 L; Infors HT, Bottmingen, Switzerland). The cultivation offgas flow was analyzed online using offgas sensors—IR for CO₂ and ZrO₂ based for O₂ (Blue Sens Gas analytics, Herten, Germany). Process control and feeding was performed using EVE software (Infors HT, Bottmingen, Switzerland). The pH was monitored using a pH-sensor EasyFerm Plus (Hamilton, Reno, NV, USA). During the cultivations pH was kept constant at 5.0 and was controlled with base addition only (12.5% NH₄OH), while acid (10% H₃PO₄) was added manually, if necessary. Temperature was kept constant at 30 °C. Aeration was carried out using a mixture of pressurized air and pure oxygen at 2 vvm to keep dissolved oxygen (dO₂) always higher than 30%. The dissolved oxygen was monitored using fluorescence dissolved oxygen electrode Visiferm DO (Hamilton, Reno, NV, USA). At the end of the batch phase, represented by a sudden drop in CO₂ signal and a parallel increase in the dO₂, methanol pulse (0.5% v/v, supplied with basal media trace element stock solution) was added to the bioreactors for metabolism adaptation (Table 3, samples 32–33). After 24 h from adaptation pulse, fed-batch cultivation started and lasted 42 h. Mixed-feed (80 g/L methanol mixed with 400 g/L glycerol) was used for samples 32 and 33 (Table 3) while for samples 34 and 35, (Table 3) a derepressed feeding strategy was applied by setting feeding rate at a limiting level.

RNA extraction

Approx. 0.1 g of yeast cells were resuspended in 1 ml RNAzol RT (Sigma-Aldrich) and lyzed using a Fast-Prep-24 (MP Biomedicals, Santa Ana, CA, USA) with 0.5 g of glass beads (1 mm diameter) twice at 6 m/s for 30 s. Samples were incubated at room temperature for 5 min and then centrifuged at 12,000 g for 5 min. 650 µl of the supernatant were mixed with 650 µl ethanol and RNA isolated using the Direct-zol RNA Miniprep Kit (Zymo Research, Irvine, CA, USA) according to the manufacturer’s instructions. This Kit includes a DNAse treatment step. The concentration and purity were measured using a NanoDrop ONE (Thermo Fisher Scientific, Waltham, MA, USA).

RT-qPCR assays

500 ng of isolated total RNA was reverse transcribed using the LunaScript RT SuperMix (NEB) according to the manufacturer’s instructions. The resulting cDNA was diluted 1:50 and 2 µl were used as template in a 15 µl reaction using the Luna Universal qPCR Master Mix (NEB) according to the manufacturer’s instructions. Used primers are listed in (Additional file 4: Table S4). All reactions were performed in technical duplicates on a Rotor-Gene Q system (Qiagen, Hilden, Germany).

Availability of data and materials

All data generated or analyzed during this study are included in this published article and its supplementary information files.

Abbreviations

BMGY:: Buffered glycerol-complex medium
BSM:: Basal salt medium
cDNA:: Copy DNA
DMSZ:: Deutsche Sammlung von Mikroorganismen und Zellkulturen
ER:: Endoplasmic reticulum
LB:: Lysogenic broth, or Luria broth
MIQE:: Minimum information for publication of quantitative real-time PCR experiments
mRNA:: Messenger RNA
NCBI:: National Center for Biotechnology Information
RNase MRP:: RNAse for mitochondrial RNA processing
RNASeq:: RNA sequencing
rNRA:: Ribosomal RNA
RSC:: Remodeling the structure of chromatin
RT-qPCR:: Reverse transcription quantitative PCR
SAGA:: Spt-Ada-Gcn5 acetyltransferase
SRA:: Sequence read archive
SWI/SNF:: SWItch/sucrose non-fermentable
TFIID:: Transcription factor II D
TFIIIC:: Transcription factor III C
tRNA:: Transfer RNA
YNBE:: Yeast nitrogen base ethanol
YNBM:: Yeast nitrogen base media

References

Kurtzman CP. Biotechnological strains of Komagataella (Pichia) pastoris are Komagataella phaffii as determined from multigene sequence analysis. J Ind Microbiol Biotechnol. 2009;36(11):1435–8.
Article CAS PubMed Google Scholar
Freyre FM, Vázquez JE, Ayala M, Canaán-Haden L, Bell H, Rodríguez I, et al. Very high expression of an anti-carcinoembryonic antigen single chain Fv antibody fragment in the yeast Pichia pastoris. J Biotechnol. 2000;76(2–3):157–63.
Article CAS PubMed Google Scholar
Fischer JE, Glieder A. Current advances in engineering tools for Pichia pastoris. Curr Opin Biotechnol. 2019;59:175–81.
Article CAS PubMed Google Scholar
Bernauer L, Radkohl A, Lehmayer LGK, Emmerstorfer-Augustin A. Komagataella phaffii as emerging model organism in fundamental research. Front Microbiol. 2021. https://doi.org/10.3389/fmicb.2020.607028.
Article PubMed PubMed Central Google Scholar
Bustos C, Quezada J, Veas R, Altamirano C, Braun-Galleani S, Fickers P, et al. Advances in cell engineering of the Komagataella phaffii platform for recombinant protein production. Metabolites. 2022;12(4):346.
Article CAS PubMed PubMed Central Google Scholar
Peña DA, Gasser B, Zanghellini J, Steiger MG, Mattanovich D. Metabolic engineering of Pichia pastoris. Metab Eng. 2018;50:2–15.
Article PubMed Google Scholar
Raschmanová H, Weninger A, Knejzlík Z, Melzoch K, Kovar K. Engineering of the unfolded protein response pathway in Pichia pastoris: enhancing production of secreted recombinant proteins. Appl Microbiol Biotechnol. 2021;105(11):4397–414.
Article PubMed PubMed Central Google Scholar
Liu C, Gong J-S, Su C, Li H, Li H, Rao Z-M, et al. Pathway engineering facilitates efficient protein expression in Pichia pastoris. Appl Microbiol Biotechnol. 2022;106(18):5893–912.
Article CAS PubMed Google Scholar
Guerfal M, Ryckaert S, Jacobs PP, Ameloot P, Van Craenenbroeck K, Derycke R, et al. The HAC1 gene from Pichia pastoris: characterization and effect of its overexpression on the production of secreted, surface displayed and membrane proteins. Microb Cell Fact. 2010;9(1):49.
Article PubMed PubMed Central Google Scholar
Sjöblom M, Lindberg L, Holgersson J, Rova U. Secretion and expression dynamics of a GFP-tagged mucin-type fusion protein in high cell density Pichia pastoris bioreactor cultivations. Adv Biosci Biotechnol. 2012;3(3):238–48.
Article Google Scholar
Sha C, Yu X-W, Li F, Xu Y. Impact of gene dosage on the production of lipase from Rhizopus chinensis CCTCC M201021 in Pichia pastoris. Appl Biochem Biotechnol. 2013;169(4):1160–72.
Article CAS PubMed Google Scholar
Zhu T, Hang H, Chu J, Zhuang Y, Zhang S, Guo M. Transcriptional investigation of the effect of mixed feeding to identify the main cellular stresses on recombinant Pichia pastoris. J Ind Microbiol Biotechnol. 2013;40(2):183–9.
Article CAS PubMed Google Scholar
He J, Ma X, Zhang F, Li L, Deng J, Xue W, et al. New strategy for expression of recombinant hydroxylated human collagen α1 (III) chains in Pichia pastoris GS 115. Biotechnol Appl Biochem. 2015;62(3):293–9.
Article CAS PubMed Google Scholar
Yang H, Zhai C, Yu X, Li Z, Tang W, Liu Y, et al. High-level expression of Proteinase K from Tritirachium album Limber in Pichia pastoris using multi-copy expression strains. Protein Expr Purif. 2016;122:38–44.
Article CAS PubMed Google Scholar
Aw R, McKay PF, Shattock RJ, Polizzi KM. Expressing anti-HIV VRC01 antibody using the murine IgG1 secretion signal in Pichia pastoris. AMB Express. 2017;7(1):70.
Article PubMed PubMed Central Google Scholar
Tredwell GD, Aw R, Edwards-Jones B, Leak DJ, Bundy JG. Rapid screening of cellular stress responses in recombinant Pichia pastoris strains using metabolite profiling. J Ind Microbiol Biotechnol. 2017;44(3):413–7.
Article CAS PubMed Google Scholar
Ito Y, Terai G, Ishigami M, Hashiba N, Nakamura Y, Bamba T, et al. Exchange of endogenous and heterogeneous yeast terminators in Pichia pastoris to tune mRNA stability and gene expression. Nucleic Acids Res. 2020;48(22):13000–12.
Article CAS PubMed PubMed Central Google Scholar
Hou C, Yang Y, Xing Y, Zhan C, Liu G, Liu X, et al. Targeted editing of transcriptional activator MXR1 on the Pichia pastoris genome using CRISPR/Cas9 technology. Yeast. 2020;37(4):305–12.
Article CAS PubMed Google Scholar
Lin N-X, He R-Z, Xu Y, Yu X-W. Oxidative stress tolerance contributes to heterologous protein production in Pichia pastoris. Biotechnol Biofuels. 2021;14(1):160.
Article CAS PubMed PubMed Central Google Scholar
Dou W, Zhu Q, Zhang M, Jia Z, Guan W. Screening and evaluation of the strong endogenous promoters in Pichia pastoris. Microb Cell Fact. 2021;20(1):156.
Article CAS PubMed PubMed Central Google Scholar
Pfaffl MW. A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res. 2001;29(9): e45.
Article CAS PubMed PubMed Central Google Scholar
Larionov A, Krause A, Miller W. A standard curve based method for relative real time PCR data processing. BMC Bioinformatics. 2005;6(1):62.
Article PubMed PubMed Central Google Scholar
Bustin SA, Benes V, Garson JA, Hellemans J, Huggett J, Kubista M, et al. The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments. Clin Chem. 2009;55(4):611–22.
Article CAS PubMed Google Scholar
Teste M-A, Duquenne M, François JM, Parrou J-L. Validation of reference genes for quantitative expression analysis by real-time RT-PCR in Saccharomyces cerevisiae. BMC Mol Biol. 2009;10(1):99.
Article PubMed PubMed Central Google Scholar
Archer M, Xu J. Current practices for reference gene selection in RT-qPCR of Aspergillus: outlook and recommendations for the future. Genes. 2021;12(7):960.
Article CAS PubMed PubMed Central Google Scholar
Suleman E, Somai BM. Validation of hisH4 and cox5 reference genes for RT-qPCR analysis of gene expression in Aspergillus flavus under aflatoxin conducive and non-conducive conditions. Microbiol Res. 2012;167(8):487–92.
Article CAS PubMed Google Scholar
Steiger MG, Mach RL, Mach-Aigner AR. An accurate normalization strategy for RT-qPCR in Hypocrea jecorina (Trichoderma reesei). J Biotechnol. 2010;145(1):30–7.
Article CAS PubMed Google Scholar
Silver N, Best S, Jiang J, Thein SL. Selection of housekeeping genes for gene expression studies in human reticulocytes using real-time PCR. BMC Mol Biol. 2006;7:33.
Article PubMed PubMed Central Google Scholar
Pfaffl MW, Tichopad A, Prgomet C, Neuvians TP. Determination of stable housekeeping genes, differentially regulated target genes and sample integrity: BestKeeper–Excel-based tool using pair-wise correlations. Biotechnol Lett. 2004;26(6):509–15.
Article CAS PubMed Google Scholar
Andersen CL, Jensen JL, Ørntoft TF. Normalization of real-time quantitative reverse transcription-PCR data: a model-based variance estimation approach to identify genes suited for normalization, applied to bladder and colon cancer data sets. Can Res. 2004;64(15):5245–50.
Article CAS Google Scholar
Vandesompele J, De Preter K, Pattyn F, Poppe B, Van Roy N, De Paepe A, et al. Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biol. 2002. https://doi.org/10.1186/gb-2002-3-7-research0034.
Article PubMed PubMed Central Google Scholar
Xie F, Xiao P, Chen D, Xu L, Zhang B. miRDeepFinder: a miRNA analysis tool for deep sequencing of plant small RNAs. Plant Mol Biol. 2012. https://doi.org/10.1007/s11103-012-9885-2.
Article PubMed Google Scholar
Zhang C, Ma Y, Miao H, Tang X, Xu B, Wu Q, et al. Transcriptomic analysis of Pichia pastoris (Komagataella phaffii) GS115 during heterologous protein production using a high-cell-density fed-batch cultivation strategy. Front Microbiol. 2020;11:463.
Article PubMed PubMed Central Google Scholar
Brady JR, Whittaker CA, Tan MC, Kristensen DL 2nd, Ma D, Dalvie NC, et al. Comparative genome-scale analysis of Pichia pastoris variants informs selection of an optimal base strain. Biotechnol Bioeng. 2020;117(2):543–55.
Article CAS PubMed Google Scholar
Gupta A, Krishna Rao K, Sahu U, Rangarajan PN. Characterization of the transactivation and nuclear localization functions of Pichia pastoris zinc finger transcription factor Mxr1p. J Biol Chem. 2021;297(4): 101247.
Article CAS PubMed PubMed Central Google Scholar
Hellemans J, Mortier G, De Paepe A, Speleman F, Vandesompele J. qBase relative quantification framework and software for management and automated analysis of real-time quantitative PCR data. Genome Biol. 2007;8(2):R19.
Article PubMed PubMed Central Google Scholar
Lee TI, Causton HC, Holstege FC, Shen WC, Hannett N, Jennings EG, et al. Redundant roles for the TFIID and SAGA complexes in global transcription. Nature. 2000;405(6787):701–4.
Article CAS PubMed Google Scholar
Leinonen R, Sugawara H, Shumway M. The sequence read archive. Nucleic Acids Res. 2011;39:D19-21.
Article CAS PubMed Google Scholar
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20.
Article CAS PubMed PubMed Central Google Scholar
Pertea G, Pertea M. GFF utilities: GffRead and GffCompare. F1000Res. 2020. https://doi.org/10.12688/f1000research.23297.2.
Article PubMed PubMed Central Google Scholar
Sturmberger L, Chappell T, Geier M, Krainer F, Day KJ, Vide U, et al. Refined Pichia pastoris reference genome sequence. J Biotechnol. 2016;235:121–31.
Article CAS PubMed PubMed Central Google Scholar
Valli M, Tatto NE, Peymann A, Gruber C, Landes N, Ekker H, et al. Curation of the genome annotation of Pichia pastoris (Komagataella phaffii) CBS7435 from gene level to protein function. FEMS Yeast Res. 2016. https://doi.org/10.1093/femsyr/fow051.
Article PubMed Google Scholar
Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C. Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods. 2017;14(4):417–9.
Article CAS PubMed PubMed Central Google Scholar
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550.
Article PubMed PubMed Central Google Scholar
Soneson C, Love MI, Robinson MD. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Res. 2015;4:1521.
Article PubMed Google Scholar
Zhu A, Ibrahim JG, Love MI. Heavy-tailed prior distributions for sequence count data: removing the noise and preserving large differences. Bioinformatics. 2019;35(12):2084–92.
Article CAS PubMed Google Scholar
Dietzsch C, Spadiut O, Herwig C. A dynamic method based on the specific substrate uptake rate to set up a feeding strategy for Pichia pastoris. Microb Cell Fact. 2011;10(1):14.
Article CAS PubMed PubMed Central Google Scholar
Gundinger T, Kittler S, Kubicek S, Kopp J, Spadiut O. Recombinant protein production in E. coli using the phoA expression system. Fermentation. 2022;8(4):181.
Article CAS Google Scholar
Spadiut O, Dietzsch C, Herwig C. Determination of a dynamic feeding strategy for recombinant Pichia pastoris strains. Methods Mol Biol (Clifton, NJ). 2014;1152:185–94.
Article CAS Google Scholar

Download references

Acknowledgements

We thank Matthias Steiger (TU Wien) for providing the K. phaffi strain CBS 7534. We thank Anton Glieder (TU Graz) for providing the protein-expressing strain BSYBG11.

Funding

Open access funding provided by Austrian Science Fund (FWF). This research was funded by the Austrian Science Fund (FWF, https://www.fwf.ac.at/), grant number P 35642 and the Austrian Research Promotion Agency (FFG, https://www.ffg.at/), grant number 880555. The funders had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Mihail Besleaga and Gabriel A. Vignolle contributed equally to this work

Authors and Affiliations

Institute of Chemical, Environmental and Bioscience Engineering, TU Wien, Gumpendorfer Strasse 1a, 1060, Wien, Austria
Mihail Besleaga, Gabriel A. Vignolle, Julian Kopp, Oliver Spadiut, Robert L. Mach, Astrid R. Mach-Aigner & Christian Zimmermann
Center for Health and Bioresources, Competence Unit Molecular Diagnostics, AIT Austrian Institute of Technology GmbH, 1210, Vienna, Austria
Gabriel A. Vignolle

Authors

Mihail Besleaga
View author publications
You can also search for this author in PubMed Google Scholar
Gabriel A. Vignolle
View author publications
You can also search for this author in PubMed Google Scholar
Julian Kopp
View author publications
You can also search for this author in PubMed Google Scholar
Oliver Spadiut
View author publications
You can also search for this author in PubMed Google Scholar
Robert L. Mach
View author publications
You can also search for this author in PubMed Google Scholar
Astrid R. Mach-Aigner
View author publications
You can also search for this author in PubMed Google Scholar
Christian Zimmermann
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

MB performed the cultivation experiments and co-drafted this manuscript, GAV performed the RNA-Seq data assessment and co-drafted this manuscript, JK was involved in the design of this study, co-supervised the cultivation experiments, and co-drafted this manuscript, OS provided resources for this study and co-supervised the cultivation experiments. RLM and AMA provided resources for this study, CZ co-designed this study, performed the RT-qPCR experiments, and co-drafted this manuscript. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Christian Zimmermann.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Genes with low log2FoldChanges in the different data set comparisons.

Additional file 2: Table S2.

Genes with consistent low log2FoldChanges in all data set comparisons.

Additional file 3: Table S3.

Ct_values_evaluation.

Additional file 4: Table S4.

Primers used in this study.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Besleaga, M., Vignolle, G.A., Kopp, J. et al. Evaluation of reference genes for transcript analyses in Komagataella phaffii (Pichia pastoris). Fungal Biol Biotechnol 10, 7 (2023). https://doi.org/10.1186/s40694-023-00154-1

Download citation

Received: 14 February 2023
Accepted: 10 March 2023
Published: 29 March 2023
DOI: https://doi.org/10.1186/s40694-023-00154-1

Evaluation of reference genes for transcript analyses in Komagataella phaffii (Pichia pastoris)

Abstract

Background

Results

Conclusion

Similar content being viewed by others

Selection and Validation of Reference Genes for Gene Expression Studies in Codonopsis pilosula Based on Transcriptome Sequence Data

Selection and evaluation of reference genes for RT-qPCR expression studies on Burkholderia tropica strain Ppe8, a sugarcane-associated diazotrophic bacterium grown with different carbon sources or sugarcane juice

Selection and validation of reference genes for RT-qPCR indicates that juice of sugarcane varieties modulate the expression of C metabolism genes in the endophytic diazotrophic Herbaspirillum rubrisubalbicans strain HCC103

Background

Results

Identification of stably expressed genes by the assessment of RNA-Seq data

Comparison and evaluation of refences genes

Discussion

Materials and methods

Assessment of RNA-Seq data

Fungal strains and cultivation conditions

RNA extraction

RT-qPCR assays

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1: Table S1.

Additional file 2: Table S2.

Additional file 3: Table S3.

Additional file 4: Table S4.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation