Genomes can be compared at different levels of divergence, either between species or within species. Within species genomes can be compared between different subpopulations, such as human subpopulations from different continents. Investigating the genomic differences between different human subpopulations is important when studying complex diseases that are affected by many genetic variants, as the variants involved can differ between populations. The 1000 Genomes Project collected genome-scale variation data for 2504 human individuals from 26 different populations, enabling a systematic comparison of variation between human subpopulations. In this chapter, we present step-by-step a basic protocol for the identification of population-specific variants employing the 1000 Genomes data. These variants are subsequently further investigated for those that affect the proteome or RNA splice sites, to investigate potentially biologically relevant differences between the populations.
Comparative genomics Population variation Human genomics Single-nucleotide polymorphisms
This is a preview of subscription content, log in to check access.
Springer Nature is developing a new tool to find and evaluate Protocols. Learn more
Sabeti PC, Reich DE, Higgins JM et al (2002) Detecting recent positive selection in the human genome from haplotype structure. Nature 419:832–837CrossRefPubMedGoogle Scholar
Xue Y, Zhang X, Huang N et al (2009) Population differentiation as an indicator of recent positive selection in humans: an empirical evaluation. Genetics 183:1065–1077CrossRefPubMedPubMedCentralGoogle Scholar
Blanco E, Parra G, Guigó R (2007) Using geneid to identify genes. Curr Protoc Bioinformatics Chapter 4:Unit 4.3Google Scholar
Speir ML, Zweig AS, Rosenbloom KR et al (2016) The UCSC genome browser database: 2016 update. Nucleic Acids Res 44:D717–D725CrossRefPubMedGoogle Scholar