Reliable identification of large numbers of candidate SNPs from public EST data

Buetow, Kenneth H.; Edmonson, Michael N.; Cassidy, Anna B.

doi:10.1038/6851

Reliable identification of large numbers of candidate SNPs from public EST data

Letter
Published: March 1999

Volume 21, pages 323–325, (1999)
Cite this article

From

View current issue Submit your manuscript

Kenneth H. Buetow¹,
Michael N. Edmonson² &
Anna B. Cassidy²

327 Accesses
208 Citations
6 Altmetric
Explore all metrics

Abstract

High-resolution genetic analysis of the human genome promises to provide insight into common disease susceptibility. To perform such analysis will require a collection of high-throughput, high-density analysis reagents. We have developed a polymorphism detection system that uses public-domain sequence data. This detection system is called the single nucleotide polymorphism pipeline (SNPpipeline). The analytic core of the SNPpipeline is composed of three components: PHRED, PHRAP and DEMIGLACE. PHRED and PHRAP are components of a sequence analysis suite developed to perform the semi-automated analysis required for large-scale genomes^1,2 (provided courtesy of P. Green). Using these informatics tools, which examine redundant raw expressed sequence tag (EST) data, we have identified more than 3,000 candidate single-nucleotide polymorphisms (SNPs). Empiric validation studies of a set of 192 candidates indicate that 82% identify variation in a sample of ten Centre d'Etudes Polymorphism Humain (CEPH) individuals. Our results suggest that existing sequence resources may serve as a valuable source for identifying genetic variation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

**Figure 1: RFLP confirmation of candidate SNP in UNIGENE set Hs.54515.**

**Figure 2: Candidate SNP within UNIGENE set Hs.83816.**

References

Ewing, B., Hillier, L., Wendl, M.C. & Green, P. Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 8, 175–185 ( 1998).
Article CAS Google Scholar
Ewing, B. & Green, P. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 8, 186–194 (1998).
Article CAS Google Scholar
Schuler, G.D. Pieces of the puzzle: expressed sequence tags and the catalog of human genes. J. Mol. Med. 75, 694–698 (1997).
Article CAS Google Scholar
Hillier, L. et al. Generation and analysis of 280,000 human expressed sequence tags. Genome Res. 6, 807– 828 (1996).
Article CAS Google Scholar
Wang, D.G. et al. Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. Science 280, 1077–1082 (1998).
Article CAS Google Scholar
Murray, J.C. et al. A comprehensive human linkage map with centimorgan density. Cooperative Human Linkage Center (CHLC). Science 265 , 2049–2054 (1994).
Article CAS Google Scholar
Jin, L. & Nei, M. Limitations of the evolutionary parsimony method of phylogenetic analysis. Mol. Biol. Evol. 7 , 82–102 (1990).
CAS Google Scholar
Sokal, R.R. & Sneath, P.H.A. Principles of Numerical Taxonomy (W.H. Freeman, San Francisco, 1963).
Google Scholar

Download references

Author information

Authors and Affiliations

Laboratory of Population Genetics, NCI, NIH, Bethesda, 20892, Maryland, USA
Kenneth H. Buetow
Fox Chase Cancer Center, Philadelphia, 19111, Pennsylvania, USA
Michael N. Edmonson & Anna B. Cassidy

Authors

Kenneth H. Buetow
View author publications
You can also search for this author in PubMed Google Scholar
Michael N. Edmonson
View author publications
You can also search for this author in PubMed Google Scholar
Anna B. Cassidy
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kenneth H. Buetow.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Buetow, K., Edmonson, M. & Cassidy, A. Reliable identification of large numbers of candidate SNPs from public EST data. Nat Genet 21, 323–325 (1999). https://doi.org/10.1038/6851

Download citation

Received: 17 November 1998
Accepted: 27 January 1999
Issue Date: March 1999
DOI: https://doi.org/10.1038/6851
Springer Nature America, Inc.

This article is cited by

Role and Present Status of Biotechnology in Augmenting Poultry Productivity in India
- C. Paswan
- T. K. Bhattacharya
- P. Guru Vishnu
Proceedings of the National Academy of Sciences, India Section B: Biological Sciences (2014)
Identification of candidate genes involved in the biosynthesis of carotenoids in Brassica rapa
- Parameswari Paul
- Vignesh Dhandapani
- Yong Pyo Lim
Horticulture, Environment, and Biotechnology (2014)
Mining of gene-based SNPs from publicly available ESTs and their conversion to cost-effective genotyping assay in sorghum [Sorghum bicolor (L.) Moench]
- Yemane Girma
- Dadakhalandar Doddamani
- Gurusiddesh Hiremath
Journal of Crop Science and Biotechnology (2014)
Genomic profile of the plants with pharmaceutical value
- Saikat Gantait
- Sandip Debnath
- Md. Nasim Ali
3 Biotech (2014)
Identification of single nucleotide polymorphisms from the transcriptome of an organism with a whole genome duplication
- Kris A Christensen
- Joseph P Brunelli
- Gary H Thorgaard
BMC Bioinformatics (2013)

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Reliable identification of large numbers of candidate SNPs from public EST data

From

Abstract

Access this article

Similar content being viewed by others

Next-Generation Sequencing: Advantages, Disadvantages, and Future

Overview of Statistical Methods for Genome-Wide Association Studies (GWAS)

Bioinformatics: new tools and applications in life science and personalized medicine

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

This article is cited by

Role and Present Status of Biotechnology in Augmenting Poultry Productivity in India

Identification of candidate genes involved in the biosynthesis of carotenoids in Brassica rapa

Mining of gene-based SNPs from publicly available ESTs and their conversion to cost-effective genotyping assay in sorghum [Sorghum bicolor (L.) Moench]

Genomic profile of the plants with pharmaceutical value

Identification of single nucleotide polymorphisms from the transcriptome of an organism with a whole genome duplication

Navigation

Reliable identification of large numbers of candidate SNPs from public EST data

Abstract

Access this article

Similar content being viewed by others

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation