The National Center for Biotechnology Information has created the dbGaP public repository for individual-level phenotype, exposure, genotype and sequence data and the associations between them. dbGaP assigns stable, unique identifiers to studies and subsets of information from those studies, including documents, individual phenotypic variables, tables of trait data, sets of genotype data, computed phenotype-genotype associations, and groups of study subjects who have given similar consents for use of their data.
References
The GAIN Collaborative Research Group. Nat. Genet. 39, 1045–1051 (2007).
Lowrance, W.W. & Collins, F.S. ETHICS: identifiability in genomic research. Science 317, 600–602 (2007).
Harold, E.R. XML Bible 2nd edn. (Hungry Minds, Indianapolis, Indiana, USA, 2001).
Bray, T., Paoli, J., Sperberg-McQueen, C.M., Maler, E. & Yergeau, F. Extensible Markup Language (XML) 1.0 4th edn. (World Wide Web Consortium (W3C), 2006) <http://www.w3.org/TR/REC-xml/>.
Clayton, D.G. et al. Nat. Genet. 37, 1243–1246 (2005).
Devlin, B. & Roeder, K. Biometrics 55, 997–1004 (1999).
Acknowledgements
Thanks to T. Manolio, J. Coleman, F Collins and C. O'Donnell for useful comments and discussion. This research was supported by the Intramural Research Program of the US National Institutes of Health, National Library of Medicine.
Author information
Authors and Affiliations
Corresponding author
Supplementary information
Supplementary Text and Figures
Supplementary Figure 1 and Supplementary Note (PDF 140 kb)
Rights and permissions
About this article
Cite this article
Mailman, M., Feolo, M., Jin, Y. et al. The NCBI dbGaP database of genotypes and phenotypes. Nat Genet 39, 1181–1186 (2007). https://doi.org/10.1038/ng1007-1181
Issue Date:
DOI: https://doi.org/10.1038/ng1007-1181
- Springer Nature America, Inc.
This article is cited by
-
AKR1C2 genetic variants mediate tobacco carcinogens metabolism involving bladder cancer susceptibility
Archives of Toxicology (2024)
-
dbGaPCheckup: pre-submission checks of dbGaP-formatted subject phenotype files
BMC Bioinformatics (2023)
-
Unappreciated subcontinental admixture in Europeans and European Americans and implications for genetic epidemiology studies
Nature Communications (2023)
-
Linking complex disease and exposure data—insights from an environmental and occupational health study
Journal of Exposure Science & Environmental Epidemiology (2023)
-
The influence of the topographic location of geographic atrophy on vision-related quality of life in nonexudative age-related macular degeneration
Graefe's Archive for Clinical and Experimental Ophthalmology (2023)