Whole Genome Resequencing of Capsicum baccatum and Capsicum annuum to Discover Single Nucleotide Polymorphism Related to Powdery Mildew Resistance

Ahn, Yul-Kyun; Manivannan, Abinaya; Karna, Sandeep; Jun, Tae-Hwan; Yang, Eun-Young; Choi, Sena; Kim, Jin-Hee; Kim, Do-Sun; Lee, Eun-Su

doi:10.1038/s41598-018-23279-5

Article
Open access
Published: 26 March 2018

Volume 8, article number 5188, (2018)
Scientific Reports

Abstract

This study reports the genome-wide identification of single-nucleotide polymorphism (SNP) markers related to powdery mildew (PM) resistance in two pepper lines. Capsicum baccatum (PRH1, a PM-resistant line) and Capsicum annuum (Saengryeg, a PM-susceptible line) were resequenced to develop SNP markers. A total of 6,213,009 and 6,840,889 SNPs were discovered for PRH1 and Saengryeg, respectively. The majority of the SNPs were classified as homozygous, particularly in the resistant line. Moreover, the SNPs were differentially distributed among the chromosomes in both the resistant and susceptible lines. In total, 4,887,031 polymorphic SNP loci were identified between the two lines, and 306,871 high-resolution melting (HRM) marker primer sets were designed. To understand the SNPs associated with genes involved in disease resistance and stress-associated processes, a chromosome-wise gene ontology analysis was performed. The results revealed that SNPs related to disease resistance genes were predominantly distributed on chromosome 4. In addition, 6,281 SNPs associated with 46 resistance genes were identified. Of the two lines, PRH1 had the larger number of polymorphic SNPs related to NBS-LRR genes. The SNP markers were validated using an HRM assay in 45 F₄ individuals and correlated with the phenotypic disease index.

Introduction

Chili pepper is an economically important horticultural crop in the Solanaceae family, which also includes potato, tomato, eggplant, petunia and tobacco. The Solanaceae family includes more than 3,000 species with similar chromosome numbers (n = 12) but markedly different genome sizes. Peppers have been used as a vegetable, condiment, spice, medicine, coloring agent and source of vitamins^1,2,3. The most commonly cultivated pepper species are Capsicum annuum, Capsicum frutescens, Capsicum chinense, Capsicum pubescens, and Capsicum baccatum^4,5. Although pepper has considerable economic value, fungi, bacteria and viruses cause heavy losses in pepper fruit production. Powdery mildew (PM), caused by Leveillula taurica, is the most common devastating fungal disease in pepper. In an agricultural setting, this disease can be controlled using agrochemicals or genetically resistant lines. Selecting good PM-resistant varieties through traditional breeding can require more than 10 years. Hence, molecular marker-assisted breeding is the current plant breeding method of choice, and the most frequently used markers include single-nucleotide polymorphisms (SNPs). DNA-based molecular markers are employed in plant breeding for genetic diversity and genome association analyses^6,7,8,9. Over the last decade, major innovations in sequencing technologies and bioinformatics have been achieved, prompting a transition from classical conservation genetics to conservation genomics^10,11,12,13. Rapid innovations in genome sequencing platforms, such as next generation sequencing (NGS), provide numerous opportunities for transcriptome assembly, functional annotation of genes, and identification of molecular markers^14,15. New software tools for NGS data enable the cost-effective identification, confirmation, and evaluation of genetic markers on a large scale.

SNPs have been accepted as potential selection markers in genome-wide studies given the high density of markers near loci of interest⁶. NGS technologies have been used to identify genome-wide SNPs in several crops, such as bean¹⁶, barley¹⁷, cassava¹⁸, cabbage¹⁹, grape²⁰ and maize²¹. In pepper, several thousand genetic markers, especially SNPs, have been discovered^{22,23,24,25,26,27,28}. Recently, Kim et al.²⁹ sequenced and assembled the pepper genome (Capsicum annuum cv. CM334), with a genome size of 3.48 Gb. This reference genome provides the opportunity to improve quality, cultivation, and disease resistance in Capsicum species. The aim of this research is to discover SNP variants for future marker-assisted breeding studies related to PM resistance, using Capsicum annuum cv. CM334 as a reference for data mining. Thus, in the present study, two pepper lines, Capsicum baccatum (PRH1, a PM-resistant line) and Capsicum annuum (Saengryeg, a PM-susceptible line), were resequenced on the Illumina HiSeq 4000 platform, and SNPs were identified genome-wide.

Results

Genome sequencing, pre-processing and alignment of reads to the reference genome

A summary of the sequencing, sequence pre-processing, and read alignment is presented in Table 1. In total, 130,370,103 and 118,588,231 raw read pairs were obtained for PRH1 and Saengryeg, respectively, with an average read length of 151 bp. This corresponded to 19.69 and 17.91 Gb of paired-end raw reads for the two lines, or total genome coverages of ≈11.31× and ≈10.29× of the reference genome. The Solexa QA (v.1.13) package was used to generate high-quality clean reads. Raw reads were assessed for quality, and unusable portions were discarded. After the removal of adaptor sequences and of ambiguous and low-quality reads (Q value <20), a total of 97,216,537 and 88,964,871 reads remained for the PM-resistant and PM-susceptible lines, with ≈5.61× and ≈5.17× genome coverage, respectively. After the removal of non-specific reads, the remaining reads were mapped to the reference genome. A total of 194,523,074 and 177,929,742 clean, high-quality reads for PRH1 and Saengryeg, respectively, were aligned against the reference genome, of which 88,448,386 (45.47%) and 1,080,500,795 (39.24%) reads, respectively, were mapped.
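
The coverage figures above can be checked with simple arithmetic. The following is a minimal sketch, not the authors' pipeline: it assumes that fold coverage is computed as sequenced bases divided by the 3.48 Gb reference size quoted in the Introduction, and that both mates of each read pair are counted. Under those assumptions the reported raw coverages are reproduced, while the reported Gb totals match the read-pair count multiplied by 151 bp once. The function name `fold_coverage` is hypothetical.

```python
def fold_coverage(read_pairs, read_length_bp, genome_size_bp, mates_per_pair=2):
    """Sequenced bases divided by genome size (fold coverage).

    mates_per_pair=2 is an assumption: it counts both mates of each pair.
    """
    total_bases = read_pairs * mates_per_pair * read_length_bp
    return total_bases / genome_size_bp


GENOME_SIZE_BP = 3.48e9  # CM334 reference size quoted in the Introduction
READ_LENGTH_BP = 151     # average raw read length reported in the text

raw_read_pairs = {"PRH1": 130_370_103, "Saengryeg": 118_588_231}

for line, pairs in raw_read_pairs.items():
    gb_counted_once = pairs * READ_LENGTH_BP / 1e9
    coverage = fold_coverage(pairs, READ_LENGTH_BP, GENOME_SIZE_BP)
    print(f"{line}: {gb_counted_once:.2f} Gb (pairs x 151 bp), {coverage:.2f}x coverage")
```

The post-trimming coverages are not reproduced exactly by the same formula, because the mean trimmed read length (101 bp) is only an average.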

Table 1 Summary of sequencing, sequence pre-processing and alignment of reads to the reference genome.

Identification and distribution of SNP markers

Genome-wide SNPs were identified using an improved BWA-SAMtools workflow. The high-quality filtered reads of PRH1 and Saengryeg were mapped to the reference genome. A total of 6,213,009 and 6,840,889 SNPs were identified for PRH1 and Saengryeg, respectively. Based on the ratio of SNP-supporting reads to mapped reads, SNPs were classified into homozygous, heterozygous and other types. Among the SNPs identified in PRH1, 88.59% were homozygous, 3.65% heterozygous, and 7.76% of other types. Likewise, in Saengryeg, 95.04% were homozygous, 1.91% heterozygous, and 3.05% of other types. The low percentage of heterozygous SNPs in both lines was due to the relatively low sequencing depth and the stringent SNP-calling criteria. Capsicum has 12 chromosomes, and SNPs were detected on all of them; however, the number of SNPs differed among chromosomes 1 to 12 in the two lines (Fig. 1). In Saengryeg, the greatest number of homozygous SNPs was found on chromosome 10 (1,096,754), whereas in PRH1 chromosome 1 carried the maximum number of homozygous SNPs (601,032). Similarly, chromosome 1 carried the highest number of heterozygous SNPs in PRH1 (23,932), and chromosome 12 carried the maximum number of heterozygous SNPs in Saengryeg (15,942). The smallest number of SNPs was found on chromosome 8 in both lines. The detailed dataset for the chromosomal distribution of SNPs is listed in Table 2.

Table 2 Distribution of SNPs in the chromosomes of PRH1 and Saengryeg.

Annotation of SNPs based on their position in the pepper genome

The SNPs were classified into two main categories (intergenic or genic) according to their position in the pepper genome sequence. Genic SNPs were further sub-classified as intronic or coding DNA sequence (CDS) SNPs. A total of 6,213,009 and 6,804,889 genome-wide SNPs were discovered for PRH1 and Saengryeg, respectively. Of these, 5,781,951 (93.06%) and 6,695,385 (93.39%) were intergenic SNPs in PRH1 and Saengryeg, respectively. These SNPs were further classified into homozygous, heterozygous and other types according to the ratio of SNP-supporting reads to mapped reads. In the intergenic region, 82.28% and 93.58% of SNPs were of the homozygous type in PRH1 and Saengryeg, respectively. Within genic regions, the number of SNPs in introns was greater than that in CDSs. Most of the SNPs were located in intergenic regions and were classified as homozygous (Table 3). All the identified SNPs were analyzed for polymorphisms between PRH1 and Saengryeg. A total of 15,941,182 SNP loci were identified with respect to the reference genome. Of these, 4,887,031 polymorphic and 469,978 non-polymorphic loci were identified between PRH1 and Saengryeg. The genomic distribution of polymorphic SNP markers is presented in Fig. 2. High-resolution melting (HRM) marker primers were designed to target SNPs that discriminate between the two lines. Among the polymorphic SNPs, 4,164,456 HRM candidates were identified, and 597,434 primer sets were selected. A total of 306,871 HRM primer markers were recommended for further breeding purposes (Supplementary file S1). These HRM primer sets can potentially discriminate between the two lines.
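
The primer-selection step relies on the design constraints stated in the Methods (primer length 18–24 bp, GC content 20–80%, product size 80–600 bp). As an illustration only, a candidate primer pair could be screened against those ranges as sketched below. This is not the authors' in-house script or Primer3 itself; the function names and the example sequences are hypothetical, and the melting-temperature constraint (55–65 °C) is deliberately not modelled because it depends on the thermodynamic model used.

```python
def gc_percent(primer):
    """Percentage of G and C bases in a primer sequence."""
    primer = primer.upper()
    return 100.0 * (primer.count("G") + primer.count("C")) / len(primer)


def passes_primer_constraints(forward, reverse, product_size_bp):
    """Check a primer pair against the length, GC and product-size ranges
    given in the Methods. Tm (55-65 degrees C) is NOT checked here."""
    for primer in (forward, reverse):
        if not 18 <= len(primer) <= 24:        # primer length 18-24 bp
            return False
        if not 20.0 <= gc_percent(primer) <= 80.0:  # GC content 20-80%
            return False
    return 80 <= product_size_bp <= 600        # amplicon size 80-600 bp


# Hypothetical example sequences, not primers from the study.
fwd = "ATGCATGCATGCATGCATGC"   # 20 nt, 50% GC
rev = "GGATCCATTGCAAGCTTGCA"   # 20 nt, 50% GC
print(passes_primer_constraints(fwd, rev, product_size_bp=120))
```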

Table 3 Summary of SNP classification by genome structure.

Chromosome-wise characterization of polymorphic SNPs

To gain deeper insight into the SNPs associated with genes involved in disease resistance and stress tolerance, chromosome-wise functional annotation of polymorphic SNPs was performed. The distribution of SNP markers was analyzed on each chromosome, and the genes carrying the most polymorphic SNPs were functionally characterized. Overall, the majority of these genes were involved in carbohydrate metabolism, transcription regulation, ion binding, nucleotide binding, protein transport, fatty acid metabolism, receptors, photosynthesis, post-translational modifications, stress response, regulatory elements, proteolysis, secondary metabolism, biosynthesis, disease resistance, and other functions. However, the predominant functional category differed from chromosome to chromosome (Fig. 3). For instance, on chromosome 1, SNPs were most abundant in genes involved in carbohydrate metabolism, followed by transport-related genes. Genes related to transcription regulation carried numerous polymorphic SNPs on chromosomes 2 and 8. On chromosome 3, genes associated with post-translational modifications carried more polymorphic SNPs. Likewise, disease resistance genes with many polymorphic SNPs dominated chromosome 4. Moreover, nucleotide/ion binding and ion transport genes with polymorphic SNPs were identified on chromosomes 5, 6, 7, 9, and 10. Genes involved in biosynthesis carried a large number of SNPs on chromosomes 11 and 12.

Identification of polymorphic SNP markers associated with pathogen resistance genes

In total, 6,281 SNPs associated with 46 pathogen resistance genes containing the nucleotide binding site-leucine rich repeat (NBS-LRR) motif were identified in the introns and coding regions of the genes (Supplementary file S2). The occurrence of SNPs related to NBS-LRR genes on each chromosome is shown in Fig. 4. The maximum number of SNPs was found on chromosome 4, whereas the smallest number was observed on chromosome 8. Moreover, the PM-resistant line PRH1 carried a greater number of NBS-LRR-linked SNPs than the susceptible line Saengryeg. Overall, the higher number of SNPs associated with NBS-LRR resistance genes could play a vital role in conferring PM resistance.

Phenotypic evaluation for PM resistance and validation of SNP markers

To assess the disease resistance indexes, the parental types and the F₄ population were co-cultivated with the powdery mildew pathogen. The degree of infection observed in the plants was scored on a scale of 1–5, from PM resistant to susceptible (Supplementary Table file 3). The parents of the F₄ population exhibited contrasting degrees of resistance to PM. The C. baccatum line (PRH1) displayed high resistance, with a score of 1, whereas the C. annuum line (Saengryeg) had a score of 5. Among the 45 individuals in the F₄ population, 11 had a resistance score of 1, 22 plants showed a moderate disease resistance level of 3, and 12 plants displayed severe disease with an index of 5. Further, to validate the identified SNP markers, an HRM assay was performed on both parental types along with the 45 progenies of the F₄ population. Among the 36 HRM primers employed, 19 primers clearly distinguished the resistant and susceptible progenies in the F₄ population. The HRM primers employed in this study are listed in Table 4. Representative HRM melt curves obtained for the parents with the heterozygous SNP variations G/A and C/A are illustrated in Fig. 5. Moreover, heterozygous SNPs were prominent in the population studied. Thus, the current HRM platform provided a suitable approach for validating SNP markers in the population.
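
The phenotype counts and primer figures in this paragraph can be summarized with a short tally. The sketch below only restates the numbers reported above (11, 22 and 12 plants with scores 1, 3 and 5; 19 informative primers out of 36) as proportions; the variable names are hypothetical, and no statistical test from the study is implied.

```python
from collections import Counter

# Disease scores reported for the F4 population: 11 plants scored 1,
# 22 plants scored 3 and 12 plants scored 5.
scores = [1] * 11 + [3] * 22 + [5] * 12

counts = Counter(scores)
total_plants = sum(counts.values())

# Share of the population in each score class, as a percentage.
shares = {score: round(100 * n / total_plants, 1) for score, n in sorted(counts.items())}

# Fraction of tested HRM primers that separated resistant from susceptible progeny.
informative_primers, tested_primers = 19, 36
informative_share = round(100 * informative_primers / tested_primers, 1)

print(total_plants, shares, informative_share)
```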

Table 4 List of HRM primers designed for genotyping polymorphic genic SNPs from each chromosome.

Discussion

In traditional breeding approaches, trait mapping requires genotyping every individual in a population, which is expensive, labor-intensive and time-consuming. Moreover, low levels of variation or polymorphism pose a major challenge during molecular marker discovery. To address these difficulties, next generation sequencing (NGS) strategies have been widely applied in genomics-based breeding of important agricultural and horticultural crops. Recent advancements in NGS technology have facilitated the routine use of high-throughput, low-cost markers in plant breeding programs. New software tools enable the discovery, validation, and assessment of genetic markers on a large scale. Among different marker systems, SNPs are the most important and attractive DNA-based molecular markers used for genetic diversity and genome association analyses and for comparative genetics in plant breeding^6,7,8,9. SNP markers are highly polymorphic, co-dominant, precise, reproducible, high-throughput, economical and informative²⁸. Moreover, the discovery of genome-wide SNPs aids marker-assisted selection, particularly for the identification of traits associated with disease resistance. In this study, two pepper lines with contrasting powdery mildew (PM) tolerance, PRH1 (PM resistant) and Saengryeg (PM susceptible), were subjected to whole-genome resequencing to identify SNP markers associated with powdery mildew resistance. The available whole genome sequence of Capsicum annuum cv. CM334 was used as the reference genome to enable comparison between the C. annuum and C. baccatum lines used in this study.

In the current study, interspecific breeding of sexually incompatible pepper species was performed because of their valuable traits. For instance, C. baccatum is well known for fruit quality, disease resistance, and high contents of valuable secondary metabolites²⁹. Therefore, interspecific breeding of peppers can yield progenies with high fruit quality and disease resistance. The C. baccatum line used in this study displayed resistance to powdery mildew and anthracnose diseases. Hence, whole genome re-sequencing (WGRS)-based discovery of SNPs in these divergent pepper lines could improve the understanding of SNPs associated with disease resistance. The resequencing and SNP discovery resulted in the identification of 6,213,009 SNPs for PRH1 and 6,840,889 SNPs for Saengryeg. The number of SNPs identified in the present study was higher than that discovered by Nimmakayala et al. in C. annuum and C. baccatum varieties using a genotyping-by-sequencing approach³⁰. That report described the collective identification of 36,621 potential SNP markers linked to various genomic regions in C. annuum and C. baccatum that can be used for genome-wide association studies in pepper varieties³⁰. Moreover, the SNPs identified in the present study were mostly of the homozygous type, at 88.59% and 95.04% for PRH1 and Saengryeg, respectively. This suggests that the reference genome sequence may have been generated from homozygous loci. Further, the chromosomal distribution of SNPs in the pepper genome revealed that 10.92% of homozygous SNPs were located on chromosome 1 in PRH1, and 16.96% of homozygous SNPs were located on chromosome 10 in Saengryeg.

In addition, the distribution of SNPs within the pepper genome showed a higher percentage of SNPs in intergenic regions than in genic regions. Likewise, more SNPs were identified in intronic regions than in CDSs. Similar results were also reported in tomato by Kim et al.³¹. Furthermore, the location of an SNP matters: SNPs located within genes are the ones most likely to be implicated in phenotypic traits. Such SNPs are expected to be useful for marker-assisted selection because they can be considered functional markers. A total of 15,941,182 SNP loci were detected between Saengryeg and PRH1. Of them, 30.63% were polymorphic loci. Potential polymorphic homozygous SNPs were filtered to discover line-specific markers in both pepper lines. HRM analysis is a precise, cost-effective and efficient tool for detecting sequence variations such as SNPs³². This technique has been successfully used to identify SNPs for genotype discovery, genetic mapping and mutation scanning^33,34,35,36. Among the discovered homozygous polymorphic SNPs, 597,434 HRM marker primers were identified that can potentially discriminate between the two lines. Of them, 306,871 HRM primers were recommended for further experimental research related to PM-based melting patterns and amplification efficacy.

The numerous polymorphic SNPs identified in genic regions were functionally annotated on each chromosome to gain deeper insight into the SNPs associated with genes involved in disease resistance. A comparative genetics study of resistance genes in the Solanaceae family has shed light on potential loci on different chromosomes linked with disease resistance³⁷. The key R genes associated with disease resistance were conserved among related species such as pepper, tomato, and potato³⁷. The current results revealed that each chromosome carried several SNPs associated with genes involved in vital metabolic processes. However, chromosome 4 carried a larger set of SNPs associated with disease resistance than the other chromosomes. According to Grube et al.³⁷, the disease resistance gene loci located on chromosome 4 of pepper could confer resistance against fungal pathogens. Correspondingly, chromosome 4 could play a vital role in harboring the genes required for disease resistance in pepper. Moreover, chromosomes 5–10 carried SNPs related to genes involved in ion and metal binding. Ion/metal binding genes are indispensable, particularly under stress conditions in pepper plants. The uptake and transport of nutrients and water from the environment into the plant is a complex and important process for maintaining the physiological functioning of plants under stress. Hence, the SNPs related to these genes could serve as valuable markers under stress.

Furthermore, a higher number of polymorphic SNPs associated with disease resistance genes such as NBS-LRR genes was also identified on chromosome 4. Of the two lines, the resistant PRH1 carried more polymorphic SNPs related to NBS-LRR genes. In plants, NBS-LRR proteins form a large family encoded by resistance genes and are involved in the recognition of pathogens³⁸. Several reports have suggested the importance of NBS-LRR proteins in resistance against numerous plant diseases, including powdery mildew^39,40,41. In the present study, polymorphic SNPs were identified in genes encoding LRR receptor-like serine/threonine-protein kinase, F-box/LRR, TIR-NBS-LRR resistance protein, CC-NBS-LRR resistance protein, and TIR1-like protein, among others. Hence, the identification of SNPs associated with disease resistance genes could improve screening in the molecular breeding of pepper for powdery mildew resistance.

The identified SNPs were validated using HRM primers in the parents and the F₄ population derived from the C. annuum and C. baccatum lines. The HRM primers were selected from all the chromosomes and evaluated in the parents and the population. Among the tested primers, 19 were able to distinguish individuals in the population, and the results were correlated with the phenotypic disease evaluation score of each individual. Overall, the polymorphic SNPs discovered in this study can be used to identify powdery mildew resistant and susceptible cultivars in pepper breeding. In future, the present investigation will be extended to evaluate larger populations with a larger number of HRM primers corresponding to important SNPs associated with powdery mildew resistance in pepper.

In summary, the present study reports the discovery of numerous SNP markers with potential applications in population genetics, molecular breeding, linkage mapping, comparative genomics, and gene-based association studies. For the first time, polymorphic SNPs were discovered between C. annuum and C. baccatum pepper lines that differ in powdery mildew resistance. The SNP information obtained from the current WGRS approach can be used for genomics-assisted breeding of Capsicum with powdery mildew resistance.

Methods

Isolation of genomic DNA from pepper plants

Young leaves of PRH1 and Saengryeg were used for genomic DNA isolation. Briefly, 300 mg of leaves were ground into a fine powder in liquid nitrogen. High-quality DNA was extracted using the cetyltrimethylammonium bromide (CTAB) extraction method⁴². Powdered samples were mixed with CTAB buffer and incubated at 65 °C for 10 minutes. The mixtures were cooled to room temperature, and chloroform was then added. The chloroform-sample mixtures were mixed thoroughly and centrifuged at 13,000 rpm for 5 minutes at 4 °C. The supernatant was transferred into a new tube, and an equal volume of absolute ethanol was added. The solution was centrifuged at 13,000 rpm for 5 minutes at 4 °C, and the supernatant was discarded. Then, 70% ethanol was added to the sample, which was centrifuged again at 13,000 rpm for 5 minutes at 4 °C. The supernatant was discarded once more, and the precipitated DNA pellets were dried at room temperature. The DNA pellets were then used as starting material for purification with the Sigma GenElute plant DNA isolation kit (G2N70, Sigma). DNA quality was assessed by electrophoresis on a 1% agarose gel. DNA concentration and purity were estimated using a GE Healthcare Bio-Science NanoVue; acceptable samples showed a single absorbance peak at 260 nm, a 260/280 absorbance ratio of 1.8 to 2.0, and no evidence of substantial band shearing or contamination (either RNA or polysaccharide).

DNA library construction and massively parallel sequencing

Purified whole genomic DNA was randomly sheared using a Covaris S2 (Covaris, Woburn, MA) to yield DNA fragments in the target range of 400 to 500 bp, and average molecular sizes were assessed using an Agilent Bioanalyzer 2100 (Agilent Technologies, Palo Alto, CA). Subsequently, the resulting overhangs were converted to blunt ends using a TruSeq DNA Sample Preparation Kit v2 (Illumina, CA, USA), followed by a clean-up protocol using AMPure XP Beads (Beckman Coulter Genomics, Danvers, MA). To enhance ligation between the fragmented DNA and the index adapters and to avoid self-ligation, the 3′ ends were adenylated. After adenylation, the index adapters were ligated to the fragmented genomic DNA, and the ligated products were purified using AMPure XP Beads. The ligated products were size-selected on a 2% agarose gel, followed by gel elution and column purification. The selected adapter-ligated DNA fragments were enriched by PCR using adapter-specific primers. The DNA was then re-isolated, and the average molecular sizes of the libraries were evaluated using the Agilent Bioanalyzer 2100 (Agilent Technologies, Palo Alto, CA) to confirm a sharp peak in the expected 500–600 bp range. Each library was loaded on the HiSeq 4000 platform, and high-throughput sequencing was performed so that each sample met a 10-fold average sequencing depth.

Preprocessing

After sequencing, the raw reads were trimmed using the Solexa QA v.1.13 package (Cox et al., 2010). Base quality commonly drops toward either end of Illumina reads; therefore, both ends of each read were trimmed where the Phred quality score dropped below Q = 20 (a 0.01 probability of error). In addition, all 5′ and 3′ stretches of ambiguous ‘N’ nucleotides were clipped. Trimming resulted in reads with a mean length of 101 bp across all samples, and a minimum length of 25 bp was applied during sequence trimming. These data were used for downstream analysis. The reference genome sequence of Capsicum annuum cv. CM334 was downloaded from the Sol Genomics Network (SGN) at http://www.sgn.cornell.edu/.
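
The trimming rule described above can be illustrated with a simplified sketch. Under the standard Phred definition, Q = −10·log10(p), so Q = 20 corresponds to an error probability of 0.01. The code below is not the Solexa QA algorithm: it is a plain end-trimming routine, written under the assumption that bases are removed from each end while they are below the quality threshold or are ambiguous 'N' calls, and that reads shorter than 25 bp after trimming are discarded. The function names are hypothetical.

```python
def phred_to_error_probability(q):
    """Convert a Phred quality score to a base-call error probability."""
    return 10 ** (-q / 10)


def trim_read(seq, quals, min_q=20, min_len=25):
    """Trim low-quality or ambiguous bases from both ends of a read.

    seq   : read sequence as a string
    quals : per-base Phred scores (same length as seq)
    Returns (trimmed_seq, trimmed_quals), or None if the trimmed read
    is shorter than min_len.
    """
    start, end = 0, len(seq)
    # Advance from the 5' end past low-quality or 'N' bases.
    while start < end and (quals[start] < min_q or seq[start] in "Nn"):
        start += 1
    # Retreat from the 3' end past low-quality or 'N' bases.
    while end > start and (quals[end - 1] < min_q or seq[end - 1] in "Nn"):
        end -= 1
    if end - start < min_len:
        return None
    return seq[start:end], quals[start:end]


# Hypothetical read: two leading N calls and two low-quality 3' bases.
read = "NN" + "A" * 30 + "CC"
qualities = [30, 30] + [30] * 30 + [10, 10]
print(trim_read(read, qualities))
```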

Alignment, detection, and annotation of SNPs

To align the reads to the pepper reference genome, the Burrows-Wheeler Aligner (BWA 0.6.1-r104) program⁴³ was applied. The BWA default values for mapping were used, except for seed length (−l) = 30, maximum differences in the seed (−k) = 1, number of threads (−t) = 16, maximum number of gap extensions (−e) = 50, mismatch penalty (−M) = 6, gap open penalty (−O) = 15, and gap extension penalty (−E) = 8. Mapped reads were extracted from the resulting BAM file using SAMtools 0.1.16⁴⁴ for further analyses. High mapping quality ensures reliable (unique) mapping of the reads, which is important for variant calling. Using the varFilter command, SNPs were called only for variable positions with a minimal mapping quality (−Q) of 30. The minimum and maximum read depths were set to 3 and 100, respectively. An in-house script considering biallelic loci was used to select significant sites among the called SNP positions³¹. Depending on the ratio of SNP reads to mapped reads, variants were classified into three categories: homozygous SNPs (more than 90%), heterozygous SNPs (more than 40% and less than 60%), and other SNPs for the remaining types. Polymorphic SNPs between the two samples that had sufficient flanking sequence on both sides of the SNP site and no structural variation adjacent to the site were selected for primer design. To design primers flanking the SNP, an in-house script and Primer3 (v2.3.5) software were used⁴⁵. The parameters used for primer design were as follows: primer length 18–24 bp, with 20 bp as the optimum; primer GC content 20–80%, with 50% as the optimum; primer Tm 55–65 °C, with 60 °C as the optimum; and a product size range of 80–600 bp. After the designed primers were mapped to the genome sequence, only the primers that aligned were selected as candidates for SNP markers.
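
The classification rule in this section can be written out explicitly. The following is a minimal sketch, assuming the thresholds stated above (read depth between 3 and 100; more than 90% SNP reads for homozygous; more than 40% and less than 60% for heterozygous; everything else "other"). It is not the authors' in-house script, and the function name `classify_snp` is hypothetical.

```python
def classify_snp(snp_reads, mapped_reads, min_depth=3, max_depth=100):
    """Classify a variant site from its read counts.

    snp_reads    : number of reads supporting the SNP allele
    mapped_reads : total number of reads mapped at the site
    Returns "homozygous", "heterozygous" or "other", or None when the
    site falls outside the accepted read-depth range.
    """
    if not min_depth <= mapped_reads <= max_depth:
        return None  # depth filter: site is not considered
    ratio = snp_reads / mapped_reads
    if ratio > 0.90:
        return "homozygous"
    if 0.40 < ratio < 0.60:
        return "heterozygous"
    return "other"


# Hypothetical read counts for four sites.
for snp, depth in [(10, 10), (5, 10), (7, 10), (2, 2)]:
    print(snp, depth, classify_snp(snp, depth))
```

With depth limited to 3–100 reads and a narrow 40–60% window for heterozygous calls, many true heterozygous sites at low depth fall into the "other" class, which is consistent with the low heterozygous percentages noted in the Results.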

Functional annotation of genic SNPs

The functional annotations of polymorphic SNPs were determined using information from the Gene Ontology Consortium (www.geneontology.org) and Gene Ontology (UniProt) (www.uniprot.org/help/gene_ontology). The number of SNPs associated with each gene was counted manually.

Genotyping of SNPs using high resolution melt assay (HRM)

For SNP validation, HRM primers were designed from each chromosome, evaluated in 45 F₄ individuals, and compared with the parental lines. The HRM analyses were performed in a total reaction volume of 20 μl containing 2 μl of DNA extract (200 ng), 1× SsoFast EvaGreen Supermix (Bio-Rad Laboratories, Hercules, CA, USA), and 200 nM each of forward and reverse primers. The reactions were performed in a CFX96 real-time fluorometric thermal cycler (Bio-Rad Laboratories, Hercules, CA, USA) with the following program: 98 °C for 2 min, then 45 cycles of 98 °C for 5 s and 60 °C for 10 s. The peaks obtained were normalized and analyzed for differences in the melt curve.

Physiological disease resistance evaluation

The HRM results were correlated with the physiological evaluation of disease resistance. The parental lines used for powdery mildew infection in this study, C. annuum TF68 and C. baccatum ARI, are close relatives of Saengryeg and PRH1, respectively. The parents and the F₄ population were maintained in a polyvinyl house alongside disease-infected plants under normal daylight conditions, with night/day set temperatures of 27/15 °C and 60–70% RH. The experiment was performed in triplicate in a randomized block design. Disease severity was assessed on a 1–5 scale (1, resistant; 3, moderate; 5, sensitive) after two weeks.

Acknowledgements

This research work was supported by the Cooperative Research Program for Agriculture Science and Technology Development [Project No. PJ012671022018], Rural Development Administration, Republic of Korea.

Author information

Authors and Affiliations

Department of Vegetable Crops, Korea National College of Agriculture and Fisheries, Jeonju, 54874, Republic of Korea
Yul-Kyun Ahn
Vegetable Research Division, National Institute of Horticultural and Herbal Science, Rural Development Administration, Jeonju, 55365, Republic of Korea
Abinaya Manivannan, Sandeep Karna, Eun-Young Yang, Sena Choi, Jin-Hee Kim, Do-Sun Kim & Eun-Su Lee
Department of Plant Bioscience, Pusan National University, Busan, 46241, Republic of Korea
Tae-Hwan Jun

Contributions

Y.K.A., S.K., T.H.J. and E.Y.Y. designed and conceived the experiments. S.K., S.C., J.H.K., A.M. and E.S.L. performed the experiments. Y.K.A., A.M., S.K. and D.S.K. analyzed the data. A.M. and S.K. wrote the paper. Y.K.A. proofread and finalized the manuscript.

Corresponding author

Correspondence to Yul-Kyun Ahn.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information.

Supplementary Dataset 2

Supplementary Dataset 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Ahn, YK., Manivannan, A., Karna, S. et al. Whole Genome Resequencing of Capsicum baccatum and Capsicum annuum to Discover Single Nucleotide Polymorphism Related to Powdery Mildew Resistance. Sci Rep 8, 5188 (2018). https://doi.org/10.1038/s41598-018-23279-5

Received: 29 June 2017
Accepted: 06 March 2018
Published: 26 March 2018
DOI: https://doi.org/10.1038/s41598-018-23279-5
Springer Nature Limited
