Abstract
Genome-wide single-nucleotide polymorphisms (SNPs) are highly useful in unraveling genetic insights and are essential to accelerate selections for genetic improvement in tobacco. The discovery of genome-wide SNPs in tobacco is very complex due to its high level of repetitive genome and polyploidy. At present, publicly available genomic data on SNPs are very limited, which warrants the need for high-throughput SNPs for application in tobacco breeding. In this research paper, we describe our efforts on SNP discovery by whole genome resequencing of 18 flue-cured Virginia (FCV) tobacco genotypes and annotation of SNPs in the tobacco genome. A large amount of data of about 225 GB per genotype was generated, with an average read depth of 50× using paired-end next-generation sequencing (NGS) with the HiSeq 2500 platform. The discovery of a large number of SNPs and indels was attempted to assist mapping and, thus, the selection processes to develop superior tobacco breeding lines. Discovered SNPs, their functional annotation, mapping to the reference genome, and their relative positioning in the linkage group are discussed in this paper.





Similar content being viewed by others
References
Andrews S (2017) FastQC. A quality control tool for high throughput sequence data. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc
Bindler G, Plieske J, Bakaher N, Gunduz I, Ivanov N, Van der Hoeven R, Ganal M, Donini P (2011) A high density genetic map of tobacco (Nicotiana tabacum L.) obtained from large scale microsatellite marker development. Theor Appl Genet 123(2):219–230. https://doi.org/10.1007/s00122-011-1578-8
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120
Clevenger JP, Ozias-Akins P (2015) SWEEP: a tool for filtering high-quality SNPs in polyploid crops. G3 (Bethesda) 5(9):1797–1803. https://doi.org/10.1534/g3.115.019703
Ganesh CT, Saiprasad GVS, Mohan Raju B, Sheshshayee MS, Udayakumar M (2014) Genetic structure in FCV tobacco population as assessed by multi-locus genotyping using SSR markers. Adv Crop Sci Tech 2:127. https://doi.org/10.4172/2329-8863.1000127
Garrison E, Marth G (2012) Haplotype-based variant detection from short-read sequencing. arXiv:1207.3907
Gong D, Huang L, Xu X, Wang C, Ren M, Wang C, Chen M (2016) Construction of a high-density SNP genetic map in flue-cured tobacco based on SLAF-seq. Mol Breed 36(7):100. https://doi.org/10.1007/s11032-016-0514-7
Krzywinski MI, Schein JE, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA (2009) Circos: an information aesthetic for comparative genomics. Genome Res 19:1639–1645
Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9:357–359
Lee YR, Yoon JB, Lee J (2016) A SNP-based genetic linkage map of Capsicum baccatum and its comparison to the Capsicum annuum reference physical map. Mol Breed 36(5):61. https://doi.org/10.1007/s11032-016-0485-8
Leitch IJ, Hanson L, Lim KY, Kovarik A, Chase MW, Clarkson JJ, Leitch AR (2008) The ups and downs of genome size evolution in polyploid species of Nicotiana (Solanaceae). Ann Bot 101:805–814
Sarala K, Rao RV (2008) Genetic diversity in Indian FCV and burley tobacco cultivars. J Genet 87(2):159–163
Tarasov A, Vilella AJ, Cuppen E, Nijman IJ, Prins P (2015) Sambamba: fast processing of NGS alignment formats. Bioinformatics 31:2032–2034. https://doi.org/10.1093/bioinformatics/btv098
Tong Z, Xiao B, Jiao F, Fang D, Zeng J, Wu X, Chen X, Yang J, Li Y (2016) Large-scale development of SSR markers in tobacco and construction of a linkage map in flue-cured tobacco. Breed Sci 66(3):381–390. https://doi.org/10.1270/jsbbs.15129
Wang DG, Fan JB, Siao CJ et al (1998) Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. Science 280(5366):1077–1082
Wang D, Chen X, Zhang Z, Liu D, Song G, Kong X, Geng S, Yang J, Wang B, Wu L, Li A, Mao L (2015) A MADS-box gene NtSVP regulates pedicel elongation by directly suppressing a KNAT1-like KNOX gene NtBPL in tobacco (Nicotiana tabacum L.). J Exp Bot 66(20):6233–6244. https://doi.org/10.1093/jxb/erv332
Xiao B, Tan Y, Long N, Chen X, Tong Z, Dong Y, Li Y (2015) SNP-based genetic linkage map of tobacco (Nicotiana tabacum L.) using next-generation RAD sequencing. J Biol Res 22:11. https://doi.org/10.1186/s40709-015-0034-3
Acknowledgements
The authors duly acknowledge the efforts of Nucleome Informatics Pvt Ltd., Hyderabad, for performing NGS of 12 genotypes and bioinformatics analysis. The authors also thank ITC’s IT division at LSTC and at ITC InfoTech for their timely support. The authors sincerely thank Dr. CC Lakshmanan, Chief Scientist, and Mr. Chittaranjan Dar, member of CMC & Head of Quality Assurance, LSTC, CPO & EHS, for their respective roles in mentoring, support, and inspiration.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Thimmegowda, G.C., Ramadoss, S.K., Kaikala, V. et al. Whole genome resequencing of tobacco (Nicotiana tabacum L.) genotypes and high-throughput SNP discovery. Mol Breeding 38, 121 (2018). https://doi.org/10.1007/s11032-018-0876-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11032-018-0876-0


