Introduction

Late blight, caused by the oomycetes Phytophthora infestans, is the most serious disease of potato. This pathogen is a highly variable and severely damages potato crops, and therefore its management is a challenging task. During the 1960s–1980s, race-specific resistance genes were deployed in potato breeding using the hexaploid wild species Solanum demissum1. However, over the decades, the R genes were defeated due to the emergence of new P. infestans strains. Hence, there is a need to identify new resistance sources in wild species background, of which many of them are yet to be characterized at transcriptome level.

The genus Solanum is a rich source of genetic diversity, containing over 200 wild species2. Several wild species have been identified in potato, which confer late blight resistance such as S. pinnatisectum, S. cardiophyllum, S. bulbocastanum, S. stoloniferum, S. bulbocastanum, S. demissum, S. polytrichon, and S. microdontum3. Many useful genes derived from wild species cannot be transferred through conventional breeding due to sexual barriers caused by differences in the ploidy number and the endosperm balance number4. Hence, somatic hybridization has been deployed via protoplast fusion to overcome crossing barriers5. For example, we developed interspecific potato somatic hybrids through protiplast fusion for late blight resistance namely P7 (Solanum tuberosum + S. pinnatisectum)6, and Crd6 (S. tuberosum + S. cardiophyllum)7. However, these somatic hybrids have not yet characterized at transcriptome level for late blight resistance genes.

With the advancement in the post-genomics era and availability of the potato genome8, it is now feasible to analyse genes at the whole genome level in potato. Many reports are available on whole transcriptome sequencing for late blight resistance in potato9,10,11. Transcriptomics studies have been performed on host–pathogen interaction to discover gene networks regulating late blight resistance in potato12. In addition, many candidate genes have been identified in potato for multiple traits such as late blight, potato virus Y and bacterial wilt causing pathogens interaction13, wild species S. pinnatisectum conferring late blight resistance14, contrasting potato foliage and tuber defense response15, and scion grafted to potato rootstock for improving late blight resistance16.

In this study, we aimed to identify genes associated with late blight resistance by transcriptome sequencing in interspecific potato somatic hybrids and varieties after artificial inoculation. Differentially expressed genes (DEGs), heat map, Venn diagram, scatter plot, volcano plots, Gene Ontology (GO) characterization, the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways analysis were investigated. Selected genes likely to be involved in the resistance mechanism were characterized at the sequence level and motifs were discovered. A few selected DEGs were confirmed by real-time quantitative polymerase chain reaction (RT-qPCR) analysis. Our study sheds light on genes involved in late blight resistance in potato that can be used for its management through breeding and biotechnological interventions.

Results

Late blight resistance assay

Five potato genotypes were tested twice for late blight resistance by artificial inoculation of P. infestans under controlled conditions (Fig. 1). Based on the mean data of two years, we identified highly resistant genotypes such as somatic hybrids P7 (Area Under Disease Progressive Curve, AUPDC: 32.89) and Crd6 (AUPDC: 2.75), and potato variety cv. Kufri Girdhari (AUPDC: 1.12). Whereas, a popular potato cv. Kufri Jyoti (AUPDC: 167.54) was susceptible compared to highly susceptible cv. Kufri Bahar (control) (AUPDC: 241.33) (Table 1). Resistant and susceptible genotypes are shown in Fig. 1.

Figure 1
figure 1

Late blight resistance assay of potato somatic hybrids and varieties under contorlled conditions by artificial inoculation with P. infestans: (a) highly resistant (P7, Crd6, and Kufri Girdhari), (b) susceptible (Kufri Jyoti and Kufri Bahar). Late blight infected leaves (black lesions) in susceptoble genotypes (Kufri Jyoti and Kufri Bahar) are shown with yellow arrow.

Table 1 Late blight resistant test of potato somatic hybrids and varieties by artificial inoculation of P. infestans under controlled conditions.

Transcriptome data generation

Total RNA-sequencing of five samples was performed in duplicates based on 2 × 150 bp chemistry of Illumina NextSeq500 platform. High quality paired-end reads (QV > 25) were generated for all genotypes in both replicates (R1/R2) such as P7 (10./8.72 Gb), Crd6 (8.88/8.21 Gb), Kufri Girdhari (11.13/9.51 Gb), Kufri Jyoti (9.92/10.23 Gb), and Kufri Bahar (10.77/9.62 Gb). The reference mapping of the quality reads showed good mapping results with the potato genome sequence data for all samples in bot replicates (R1/R2) namely P7 (75.3%/78.3%), Crd6 (72.7%/70.2%), Kufri Girdhari (85.8%/81.56%), Kufri Jyoti (82.9%/85.32%) and Kufri Bahar (75.3%/81.21%).

Identification of differeltially expressed genes (DEGs)

DEGs were identified in P7, Crd6, Kufri Girdhari, and Kufri Jyoti versus Kufri Bahar (control). The complete lists of significant DEGs are provided in Supplementary Excel datasets # 14, and summarized in Table 2. Significant DEGs were identified based on the statistical significance (p ≤ 0.05) for up-regulated genes (≥ 2 log2 fold change (FC)) and down-regulated genes (≤ − 2 log2 FC). Table 3 provides a summary of top 10 DEGs of each up-regulated and down-regulated in the genotypes. Heat maps of DEGs in selected samples are shown in Fig. 2 (P7), and Fig. 3 (Kufri Girdhari, KG). Some of the highly up-regulated genes (> 3 Log2 FC) were over-represented in most samples were disease resistance protein, transcription factor (MYB, AP2-EREBP), leucine-rich repeat receptor kinase, CC-NBS-LRR/NBS-LRR resistance protein, fructose-bisphosphate aldolase, Sn-2 protein, zinc finger protein, late blight resistance protein homolog R1B-23 so on. Similarly, the predominant down-regulated genes (< − 3 Log2 FC) were C2H2-type zinc finger protein, BRASSINOSTEROID INSENSITIVE 1-associated receptor kinase 1, zinc finger protein, ethylene-responsive transcription factor 4, and protein kinase domain containing protein.

Table 2 Summary of differentially expressed genes (DEGs) by RNA-sequencing in potato genotypes after artificial inoculation with P. infestans for late blight resistance.
Table 3 Selected differentially expressed genes (p < 0.05) in the leaf tissues of potato genotypes after artificial inoculation with P. infestans by RNA-seq analysis.
Figure 2
figure 2

Heat maps of top 50 differentially expressed genes (p < 0.05) for late blight resistance in potato somatic hybrid P7 versus Kufri Bahar (KB, control) by RNA-seq. In heat map, each horizontal line refers to a gene. Relatively up-regulated genes are shown in red colour, whereas down-regulated genes are shown in green colour.

Figure 3
figure 3

Heat maps of top 50 differentially expressed genes (p < 0.05) in potato variety Kufri Girdhari (KG) versus Kufri Bahar (KB, control) by RNA-seq after artificial inoculation of P. infestans under controlled conditions. In heat map, each horizontal line refers to a gene. Relatively up-regulated genes are shown in red colour, whereas down-regulated genes are shown in green colour.

Venn diagram analysis and gene expression marker development

Venn diagram analysis showed common genes in four genotypes such as P7, Crd6, Kufri Girdhari and Kufri Jyoti (Fig. 4). In which, P7 and Crd6 shared only 39 common up-regulated genes and 65 down-regulated genes. On the other hand, Kufri Girdhari and Kufri Jyoti shared 17 up-regulated and 7 down-regulated genes. A total of eight genes were found common among the resistant genotypes (P7, Crd6, and Kufri Girdhari). These genes were fructose-bisphosphate aldolase (PGSC0003DMG400022263), flavonoid glucoyltransferase UGT73E2 (PGSC0003DMG400017119), carbonyl reductase (PGSC0003DMG400000021), gamma aminobutyrate transaminase isoform2 (PGSC0003DMG400024281), chloroplast ferredoxin I (PGSC0003DMG400011950), conserved gene of unknown function (PGSC0003DMG400005633), ferric-chelate reductase (PGSC0003DMG401018223), and glyceraldehyde-3-phosphate dehydrogenase B subunit (PGSC0003DMG400029406). Of which gene expression markers (RT-qPCR) were developed based on higher gene expression in the resistant genotypes for four genes namely fructose-bisphosphate aldolase (PGSC0003DMG400022263), gamma aminobutyrate transaminase isoform2 (PGSC0003DMG400024281), carbonyl reductase (PGSC0003DMG400000021) and glyceraldehyde-3-phosphate dehydrogenase B subunit (PGSC0003DMG400029406) (Supplementary Table S1). RT-qPCR gene expression patterns were mathched with our RNA-seq results. Thus, these genes could be used in future for the identification of resistant genotypes based on RT-qPCR analysis.

Figure 4
figure 4

Venn diagrams showing common genes (up-regulated and down-regulated DEGs) in potato genotypes P7, Crd6, Kufri Jyoti and Kufri Girdhari.

GO annotation and KEGG pathways analysis

All DEGs were functionally characterized by GO terms where molecular function showed the highest gene counts (65,645) followed by biological process (54,476) and cellular component (48,301) (Supplementary Table S2). The WEGO plots are depicted in Fig. 5 for up-regulated and down-regulated genes in P7 and Kufri Girdhari (Supplementary Excel datasets # 58). The GO annotations of genes in CRd6 and Kufri Jyoti are shown in Supplementary Fig. S1. Overall, a few GO terms were found predominantly such as catalytic activity, binding, metabolic process, cellular process, cell, and membrane. All DEGs were classified into 24 KEGG functional pathways categories, which included KEGG annotated gene counts such as P7 (5448), Crd6 (5428), Kufri Jyoti (5414) and Kufri Girdhari (5478) (Fig. 6) (Supplementary Tables S3 and S4) and detailed in Supplementary Excel datasets (#912). In all combinations, maximum KEGG annotated gene counts were found for signal transduction than other pathways like translation, carbohydrate metabolism, folding sorting and degradation, amino acid metabolism, energy metabolism, lipid metabolism and transport, catabolism, cell growth and death, and environmental adaptation. Scatter plot and volcano plot analysis showed up-regulated and down-regulated genes are shown in Supplementary Figs. S2 and S3.

Figure 5
figure 5

Gene Ontology (GO) characterization for cellular component, molceular fucntion, and biological process of up-regulated and down-regulated DEGs in P7 and Kufri Girdhari.

Figure 6
figure 6

KEGG pathways classification of the annotated genes counts in potato genotypes (P7, Crd6, Kufri Jyoti and Kufri Girdhari). X-axis indicates gene counts and y-axis indicates KEGG pathways groups.

Phylogeny tree and conserved motif analysis in selected genes

Multiple sequence alignments of 38 amino acid sequences belonging to 18 selected genes were performed (Supplementary Table S5). These selected genes belonged to disease resistance like bacterial spot disease resistance protein 4, disease resistance proteins, disease resistance protein RPM1, late blight resistance protein homolog R1B-23, CC-NBS-LRR resistance proteins, LRR family proteins, NBS-LRR resistance proteins, cytochrome P450, MYB and AP2-EREBP TFs. Subsequently, the aligned amino acid sequences were used for phylogeny analysis using the MEGA software. The Neighbor-Joining tree was constructed, which clearly distinguished the 38 amino acid sequences representing 18 genes into five major clusters (I–V) (Fig. 7). Cluster I contained a total of 10 sequences consisting of CC-NBS-LRR resistance proteins (8) and NBS-LRR resistance proteins (2). Cluster II included 8 seqeunces such as transcription factors (2), cytochrome P450 (1), MYB TF (2), AP2-EREBP TF (1), and LRR family proteins (2). On the other hand, cluster III contained 6 sequences such as CC-NBS-LRR resistance protein (1), resistance gene (1), bacterial spot disease resistance protein 4 (4). Interestingly, cluster IV has only one sequence of disease resistance protein RPM1 (DMG400029405; DMP400051212). Lastly, cluter V possessed total 13 amino acid sequences including late blight resistance protein homolog R1B-23 (3), disease resistance proteins (10). Thus, phylogeny anaysis indicated genetic relationship among the selected resistance genes, TFs and CC-NBS-LRR/NBS-LRR/LRR proteins conferring late blight resistance in potato genotypes. Furthermore, these amino acid sequences were scanned using InterProScan database, and observed common family/domain particularly disease resistance protein, plants (IPR044974) in all resistance genes and CC-NBS-LRR proteins. Whereas, other TF revealed different domains like MYB domain (IPR017930), LRR domain (IPR032675) and AP2/ERF domain (IPR044808). Sequences were also analysed and three conserved motifs alongwith motif locations were predicted using the MEME software (Fig. 8).

Figure 7
figure 7

A cluster analysis based on the Neighbor-Joining method derived from the bootstrap consensus tree inferred from 100 replicates using the MEGA software showing the relationship among the 38 amino acid sequence of 18 genes involved in conferring late blight resistance in potato genotypes.

Figure 8
figure 8

Conserved motifs analysis using the MEME (version 5.5.3) software in 38 amino acid sequences of 18 genes involved in late blight resistance in potato genotypes. (a) Sequence logo of three conserved motifs, (b) Motif location and gene name (in short) with peptide IDs (prefix with ‘PGSC0003’).

Validation of selected genes by RT-qPCR analysis

Selected eight genes (2 genes from each sample of both up-regulated and down-regulated) were validated by RT-qPCR analysis using Kufri Bahar as highly susceptible control. RT-qPCR analysis was performed for one of each gene from up-regulated and down-regulated DEGs, such as disease resistance protein and C2H2-type zinc finger protein for P7; gamma aminobutyrate transaminase isoform 2, and AP2/ERF domain-containing transcription factor for Crd6; bacterial spot disease resistance protein 4 and leucine-rich repeat protein in Kufri Girdhari; and disease resistance protein and protein kinase domain-containing protein in Kufri Jyoti. RT-qPCR results are consistent with the RNA-seq results with minor variation in gene expression values (Supplementary Table S6).

Discussion

Transcriptome profiling for late blight resistance in potato

We provide an overview of global gene expression profiles for genes associated with late blight resistance in potato by total RNA sequencing. This study included highly resistant interspecific potato somatic hybrids P7 (S. pinnatisectum-originated) and Crd6 (S. cardiophyllum-originated) and highly resistant potato variety Kufri Girdhari, and popular but susceptible variety Kufri Jyoti compared with highly suscetible Kufri Bahar (control). Genes were analysed after artificial inoculation of P. infestans at 96 h post-inoculation stage. Our findings are in consistent with previous results on late blight resistance assays in potato on both cultivated and wild species, including our interspecific somatic hybrids6,7. Whole transcriptome analysis has identified genes in contrasting resistant and susceptible genotypes9,10. Our RNA-seq results were also in accordance with earlier findings on gene identification through microarray technology in potato cv. Kufri Girdhari17. This study provides an overview of molecular signatures of somatic hybrids (P7 and Crd6) and varieties (Kufri Jyoti and Kufri Girdhari) suggesting genetic make-up consisting of a network of resistance genes, TFs and stress related genes. Our study sheds light on improving understanding of the induced genes associated with late blight resistance in potatoes.

Disease resistance genes play major role in late blight resistance in potato

Disease resistance proteins play an important role in late blight resistance in potato. Resistance (R) proteins in plants mediate the recognition of specific pathogen-derived factors called avirulence (Avr) proteins. Upon Avr perception, R proteins initiate defence responses that limit further pathogen progress. These responses often result in macroscopically visible cell death, referred to as the hypersensitive response. To illustrate, highly up-regulated genes in resistant genotypes were bacterial spot disease resistance protein 4, CC-NBS-LRR resistance protein, disease resistance protein RPM1, and Sn-2 protein. Earlier, we observed similar group of genes such as disease resistance genes through microarray technology in potato conferring late blight resistance17,18.

Plant defense response consists of a multitude of reactions after pathogen infection. The majority of R proteins are NBS-LRR proteins containing a central nucleotide-binding and hydrolysing domain (NB-ARC) and a C-terminal leucine-rich repeat (LRR) domain. Many R genes like putative disease resistance protein over- or under-expressed during host pathogen interaction. It is known that defence-related R gene as well as signal molecules are induced at 72 h post-inoculation, which results in metabolic changes in plants19. In this study, 96-h post inoculated samples were analysed due to delay in reaction response between resistant and susceptible genotypes, which might be due to different P. infestns strains, controlled environmental conditions and genotype response. Recently, Duan and co-workers12 identified higher induction of susceptibility genes such as SWEET after P. infestans inoculation on potato. Our results provide a valuable resource for understanding the interactions between P. infestans and potato at the 96-h post-inoculation stage.

The Indian potato variety Kufri Jyoti was developed in the year 1960s possessing late blight resistance due to the presence of R genes derived from the hexaploid wild species S. demissum. But due to the evolution of new strains of P. infestans, resistance has broken down in this variety and it is susceptible now. Another newly released potato variety Kufri Girdhari was developed in the year 2008, which contains high resistance to late blight. This is might be due to the presence of several resistance genes as observed in this study, such as CC-NBS-LRR type resistance protein (PGSC0003DMG400008596), disease resistance protein RPM1 (PGSC0003DMG400029405), disease resistance protein (PGSC0003DMG400018464), MYB transcription factor MYB139 (PGSC0003DMG402010883) and late blight resistance protein homolog R1B-23 (PGSC0003DMG400025545). Probably these genes provide late blight resistance in Kufri Girdhari against P. infestans infection.

Transcription factors (TFs) regulates gene expression to provide late blight resistance

TFs are one of the key regulators in plant metabolism. TFs constitute an important part of gene networks and signalling pathways in biotic/abiotic stress response. Indeed, the regulation of defense gene expression is largely governed by specific transcription factors. In this study, highly up-regulated TFs genes in resistant genotypes were MYB, MYB139, AP2-EREBP, and C2H2L TFs, thylene-responsive transcription factor 1B and many other genes. Highly down-regulated TFs were C2H2-type zinc finger protein, AP2/ERF, TSRF1, WRKY, R2R3, leucine-rich repeat protein and others. Previous study witnesses that genes with putative functions of transcription-related, such as heat shock protein transcription factor, zinc finger ring-box protein-like and NAC domain-containing protein NAC22, and MYB44 TF are induced after P. infestans inoculation19. Our findings are accordance with the previous findings showing that NAC domain protein is a family of plant-specific transcription factors involved in plant development and disease resistance20.

The WRKY and MYB TFs have been demonstrated to play key roles in plant responses to stresses, particularly P. infestans infection21,22. The zinc finger proteins play a pivotal role in the regulation of plant defense mechanisms against P. infestans infection. Hypersensitivity plays a major role in the induction of disease resistance pathway, and acts as a downstream signalling pathway for enhancing the systemic resistance in crop plants. This suggests the importance of the transcription elements in activating the defense system during host–pathogen interactions. Our results showed that many genes in potato leaves up-regulated after P. infestans infection. This finding provides an overview of the underlying mechanisms related to the modulation and regulation of pathways in response to Phytophthora interactions. Consistent with this, previous studies manifested the induced expression of the responsive genes in the late blight resistance associated TFs such as WRKY, ERF, MAPK, bHLH Myc-type TF and NBS-LRR family genes10. We also observed differential regulation of MYB, MYB139, AP2-EREBP, C2H2L, and CC-LRR-NBS genes. Thus, our study highlights the role of TFs and other DNA binding proteins in defense response to P. infestans in potato.

Stress-resonsive genes, protein kinases and phytohormones control late blight resistance

Stress-responsive genes play an important role in imparting disease resistance in potato. Some of the stress-responsive genes and kinases in resistant genotypes were cytochrome P450 hydroxylase, serine-threonine protein kinase plant-type, and leucine-rich repeat receptor kinase. The role of cytochrome P450 has been deciphered for enhancing plant resistance via jasmonic acid and ethylene signaling pathways in soybean. Duan and co-workers12 identified up-regulation of lignin-forming anionic peroxidase genes that may participate in ethylene-induced defense response against P. infestans. The protein kinases mainly calcium-dependent protein kinases (CDPKs) and leucine-rich repeat receptor-like protein kinases (LRR-RKs) determines key functions in pathogen recognition in lentil23. In the line of earlier research findings10,11,12, our results confirmed the involvement of signal transduction and stress-responsive genes in activating and maintaining a defense response to P. infestans in potato.

The activation of signaling pathways is mostly regulated by salicylic acid, jasmonic acid, ethylene, which induce gene expression and defense response genes against P. infestans inoculation in plants24. We observed phytohormones related genes in resistant genotypes such as gibberellin regulated protein, BRASSINOSTEROID INSENSITIVE 1-associated receptor kinase 1, ethylene-responsive proteinase inhibitor 1, and ethylene-responsive transcription factor. Our findings were suppoted by Yang et al.9 indicating gene expression profiling under exogenous ethylene application in late blight resistant potato genotype SD20. Consequently, they identified multiple signaling pathways including ethylene, salicylic acid, jasmonic acid, abscisic acid, auxin, cytokinin and gibberellin involved in SD20. It has been proven that ethylene-induced gene expression profiling provides insights into the ethylene signaling transduction pathway and its potential mechanisms in disease defense systems in potato. Moreover, the role of photosynthesis is well-known in plant growth and development. Researchers have described that most of the genes associated with photosynthesis pathways were down-regulated upon P. infestans inoculation leading to hypersensitive response and leaf lesion25. Thus, the stress-responsive genes, protein kinases and phytohormones do play crucial roles in conferring late blight resistance in potato.

Conclusion

Our study provides a landscape of transcriptome profiling in potato of diverse genetic backgrounds including interspecific somatic hybrid and common potato varieties. This illuminated the role of disease resistance, TFs, stress-responsive genes, and phytohormones genes imparting late blight resistance upon P. infestans infection under controlle conditions. We showed that key regulators of late blight resistance are disease resistance genes, TFs, stress-responsive genes and protein kinases. Further, genes were validated by RT-qPCR analysis. Interestingly, gene expression markers were developed for four genes viz., fructose-bisphosphate aldolase (PGSC0003DMG400022263), gamma aminobutyrate transaminase isoform2 (PGSC0003DMG400024281), carbonyl reductase (PGSC0003DMG400000021) and glyceraldehyde-3-phosphate dehydrogenase B subunit (PGSC0003DMG400029406), which showed gene expression in resistant genotypes (P7, Crd6 and Kufri Girdhari) that could be utilized for screenign purpose. However, the functional characterization of these candidate genes would be required in future interventions through transgenics or genome editing technologies. Collectively, our study provides information on genes and regulatory elements involved in late blight resistance in potato and paves a path for its management in future.

Methods

Plant materials

In this study, two interspecific potato somatic hybrids namely P7 (S. tuberosum + S. pinnatisectum)6, and Crd6 (S. tuberosum + S. cardiophyllum)7; and three common potato varieties namely Kufri Girdhari, Kufri Jyoti, and Kufri Bahar (control) were used. The genotypes like P7, Crd6 and Kufri Girdhari are highly resistant to late blight, whereas Kufri Jyoti is susceptible and Kufri Bahar (control) is highly susceptible to late blight disease. These genotypes were available at our institute in the Division of Crop Improvement, Indian Council of Agricultural Research—Central Potato Research Insitute, Shimla, Himachal Pradesh, India. Disease-free in vitro plants were maintained by sub-culturing leafy nodes on the Murashige and Skoog (MS) medium26 at pH 5.8 supplemented with sucrose (20 g/L) and solidified with gelrite (2 g/L) and cultures were grown at 20 °C under a 16-h photoperiod (light intensity 50–60 µmol/m2/s) as described by Sarkar et al.6.

Late blight resistance assay

In vitro plants were grown in the earthen pots (20 × 25 cm2) with three replications containing a sterile mixture of soil/FYM-based compost (1:1, v/v) under a glass-house during the summer season under Shimla hills (31.10° N, 7.17° E, 2200 m above mean sea level) at the institute following standard cultural practices. All five genotypes were artificially inoculated with the P. infestans isolate HP09/40 (A2 mating type: 1.2.3.4.5.6.7.8.9.10.11) under controlled chamber (18 ± 2 °C temperature, and 80–90% relative humidity). The inoculum was prepared on highly susceptible potato variety Kufri Bahar. The zoospore concentration (5 × 104 sporangia/ml) was adjusted using a hemocytometer. Fifty days old plants were artificially inoculated by mist spray of the pathogen. Late blight symptoms (%) based on leaf and stem lesions were recorded on the leaves and stems after 3, 5 and 7 days of infection. The AUDPC (%.day) was calculated based on the percent disease infestation27. Resistant/susceptible genotypes were classified based on the AUDPC value as highly resistant (HR ≤ 50), resistant (R = 50–100), moderately resistant (MR = 100–150) and susceptible (S ≥ 150]28. Leaf tissues were collected at 96 h post-inoculation when clear symptoms appeared on the susceptible plants. At this stage, resistant and susceptible plants were clearly distinguished by late blight symptoms. Leaf samples were collected from all five genotypes, snap-frozen in liquid nitrogen, and stored at − 80 °C until further use. The leaf tissues (three biological replicates) were processed for transcriptome sequencing (two technical replicates).

Transcriptome sequencing and reference mapping

Transcriptome analysis was carried out following our earlier protocols29. Briefly, total RNA was isolated from the leaf tissues of five samples using a modified CTAB and lithium chloride method30. The isolated RNA was checked for quality on 1% denaturing RNA agarose gel, and quantified by spectrophotometrically using NanoDrop (ThermoFisher Scientific, Wilmington, Delaware USA). The paired-end sequencing libraries were prepared using Illumina TruSeq Stranded mRNA sample prep kit following the manufacturer’s instructions (Illumina, San Diego, CA, USA). The PCR enriched libraries were analyzed on 4200 Tape Station system using high sensitivity D1000 Screen tape as per the manufacturer’s instructions (Agilent Technologies, Santa Clara, CA, USA). The PE illumina libraries were sequenced using Illumina NextSeq500 platform. The raw data was processed using Trimmomatic v0.38 to obtain high-quality reads (QV > 25). The high quality reads were mapped to the reference potato genome8 using TopHat v2.1.1 software with default parameters31.

Differential gene expression analysis

The transcriptome data were assembled and DEGs were identified using the cufflinks (v2.2.1) and cuffdiff (version 2.2.1) softwares32. DEGs were analyzed in genotypes such as P7, Crd6, Kufri Jyoti, and Kufri Girdhari versus Kufri Bahar (control). Kufri Bahar was used as a control in all DEGs combinations. Log2 fold change (FC) values greater than zero were considered up-regulated (≥ 2.00) whereas less than zero were down-regulated (≤ − 2.00) along with a p value threshold of 0.05 for statistically significant results. An average linkage hierarchical cluster analysis was performed with the top 50 DEGs using the Multiple experiments Viewer (MeV v4.9.0)33. Common genes were identified in DEGs using the Venny 2.1 tool34, which were further used for common gene expression marker development based on RT-qPCR analysis in resistant genotypes (Supplementary Table S1). Eurofins Genomics proprietary R scripts were used to depict scatter plots and volcano plots following detailed procedures described elsewhere29.

Gene annotation analysis

The GO annotations of the DEGs were obtained from the Ensembl Plants database for Solanum tuberosum. The information on gene counts was assigned to three main GO domains (biological process, cellular component, and molecular function). The bar plots depicting the GO distribution were prepared through the WEGO portal (http://wego.genomics.org.cn/cgi-bin/wego/index.pl)35. The functional annotations of the DEGs were carried out against the curated KEGG GENES database using KAAS (KEGG Automatic Annotation Server (http://www.genome.jp/kegg/ko.html)36.

Sequence diveristy and conserved motif analysis in selected genes

A total of 38 amino acid sequences of 18 selected potential genes involved in late blight resistance in potato genotypes were downloaded from the potato genome sequencing consortium8 and analysed for sequence diversity and conserved motifs search analysis (Supplementary Table S5). Gene sequences were downloaded from the potato genome sequencing consortium. The sequences were aligned by multiple sequence alignment using BioEdit version 7.2.5 with default parameters37. Further, a phylogeny tree was constructed using the Molecular Evolutionary Genetics Analysis 6 (MEGA6) software38 based on the Neighbor-Joining method derived from the bootstrap consensus tree inferred from 100 replicates showing the relationship in the selected genes. All the peptide sequences were scanned using the protein function analysis tool InterProScan of EMBL-EBI (https://www.ebi.ac.uk/interpro/)39. The Multiple Expectation Maximization for Motif Elicitation (MEME version 4.9.1)40 was used to detect conserved motifs in the selected genes.

Validation of selected genes through RT-qPCR analysis

Eight selected DEGs were validated through RT-qPCR analysis following our earlier protocols29. The RT-qPCR primers were designed from the coding sequences of the potato genome with the IDT PrimerQuest Tool (https://eu.idtdna.com/Primerquest/Home/Index) (Supplementary Table S6). The same leaf tissues, as used in transcriptome analysis, were used for RT-qPCR analysis. RT-qPCR analysis was executed using Power SYBR Green PCR Master Mix in ABI PRISM HT7900 (Applied Biosystems Warrington, UK) following temperature profile at 50 °C for 2 min; 95 °C for 10 min; and 40 cycles of 95 °C for 15 s, 60 °C for 1 min, and 72 °C for 30 s with an internal standard potato ubiquitin-ribosomal protein gene (ubi3; L22576).

Accordance statement

This manuscript comply with the ‘IUCN Policy Statement on Research Involving Species at Risk of Extinction’ and the ‘Convention on the Trade in Endangered Species of Wild Fauna and Flora’ for Experimental research and field studies on plants (either cultivated or wild), including the collection of plant material, must comply with relevant institutional, national, and international guidelines and legislation.