Abstract
Characterizing the genetic structure of worldwide populations is important for understanding human history and is essential to the design and analysis of genetic epidemiological studies. In this study, we examined genetic structure and distant relatedness and their effect on the extent of linkage disequilibrium (LD) and homozygosity in the founder population of Quebec (Canada). In the French Canadian founder population, such analysis can be performed using both genomic and genealogical data. We investigated genetic differences, extent of LD, and homozygosity in 140 individuals from seven sub-populations of Quebec characterized by different demographic histories reflecting complex founder events. Genetic findings from genome-wide single nucleotide polymorphism data were correlated with genealogical information on each of these sub-populations. Our genomic data showed significant population structure and relatedness present in the contemporary Quebec population, also reflected in LD and homozygosity levels. Our extended genealogical data corroborated these findings and indicated that this structure is consistent with the settlement patterns involving several founder events. This provides an independent and complementary validation of genomic-based studies of population structure. Combined genomic and genealogical data in the Quebec founder population provide insights into the effects of the interplay of two important sources of bias in genetic epidemiological studies, unrecognized genetic structure and cryptic relatedness.
Similar content being viewed by others
References
Barrett JC, Fry B, Maller J, Daly MJ (2005) Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 21(2):263–265
Bherer C, Labuda D, Roy-Gagnon MH, Houde L, Tremblay M, Vézina H (2010) Admixed ancestry and stratification of Quebec regional populations. Am J Phys Anthropol (in press). doi:10.1002/ajpa.21424
Boerwinkle E, Chakraborty R, Sing CF (1986) The use of measured genotype information in the analysis of quantitative phenotypes in man. I. Models and analytical methods. Ann Hum Genet 50(Pt 2):181–194
Bouchard G, Vezina H (2009) Projet BALSAC—Rapport annuel 2008–2009. Université du Québec à Chicoutimi. http://www.uqac.ca/balsac/pdf/ra0708.pdf
Cann HM, de Toma C, Cazes L, Legrand M-F, Morel V, Piouffre L, Bodmer J, Bodmer WF, Bonne-Tamir B, Cambon-Thomsen A, Chen Z, Chu J, Carcassi C, Contu L, Du R, Excoffier L, Friedlaender JS, Groot H, Gurwitz D, Herrera RJ, Huang X, Kidd J, Kidd KK, Langaney A, Lin AA, Mehdi SQ, Parham P, Piazza A, Pistillo MP, Qian Y, Shu Q, Xu J, Zhu S, Weber JL, Greely HT, Feldman MW, Thomas G, Dausset J, Cavalli-Sforza LL (2002) A human genome diversity cell line panel. Science 296(5566):261–262
Charbonneau H, Desjardins B, Légaré J, Denis H (2000) The population of the St-Lawrence Valley, 1608–1760. In: Haines MR, Steckel RH (eds) A population history of North America. Cambridge University Press, New York, pp 99–142
Colonna V, Nutile T, Ferrucci RR, Fardella G, Aversano M, Barbujani G, Ciullo M (2009) Comparing population structure as inferred from genealogical versus genetic information. Eur J Hum Genet 17(12):1635–1641
Desjardins B (1998) Le Registre de la population du Québec ancien. Ann Demogr Hist 2:215–226
Desjardins M, Frenette Y, Bélanger J, Hétu B (1999) Histoire de la Gaspésie. Les Presses de l’Université Laval, Sainte-Foy
Devlin BB, Roeder KK (1999) Genomic control for association studies. Biometrics 55(4):997–1004
Excoffier L, Laval G, Schneider S (2005) Arlequin (version 3.0): an integrated software package for population genetics data analysis. Evol Bioinform Online 1:47–50
Freedman ML, Reich D, Penney KL, McDonald GJ, Mignault AA, Patterson N, Gabriel SB, Topol EJ, Smoller JW, Pato CN, Pato MT, Petryshen TL, Kolonel LN, Lander ES, Sklar P, Henderson B, Hirschhorn JN, Altshuler D (2004) Assessing the impact of population stratification on genetic association studies. Nat Genet 36(4):388–393
Frenette P (1996) Histoire de la Côte-Nord. Collection Les Régions du Québec. Institut québécois de recherche sur la culture et Presses de l’Université Laval
Gagnon A, Heyer E (2001) Fragmentation of the Quebec population genetic pool (Canada): evidence from the genetic contribution of founders per region in the 17th and 18th centuries. Am J Phys Anthropol 114(1):30–41
Heath SC, Gut IG, Brennan P, McKay JD, Bencko V, Fabianova E, Foretova L, Georges M, Janout V, Kabesch M, Krokan HE, Elvestad MB, Lissowska J, Mates D, Rudnai P, Skorpen F, Schreiber S, Soria JM, Syvanen AC, Meneton P, Hercberg S, Galan P, Szeszenia-Dabrowska N, Zaridze D, Genin E, Cardon LR, Lathrop M (2008) Investigation of the fine structure of European populations with applications to disease association studies. Eur J Hum Genet 16(12):1413–1429
Institut de la statistique du Québec, Gouvernement du Québec (2010). http://www.stat.gouv.qc.ca/
Jakkula E, Rehnström K, Varilo T, Pietiläinen OPH, Paunio T, Pedersen NL, deFaire U, Järvelin M-R, Saharinen J, Freimer N, Ripatti S, Purcell S, Collins A, Daly MJ, Palotie A, Peltonen L (2008) The genome-wide patterns of variation expose significant substructure in a founder population. Am J Hum Genet 83(6):787–794
Kang HM, Sul JH, Service SK, Zaitlen NA, Kong SY, Freimer NB, Sabatti C, Eskin E (2010) Variance component model to account for sample structure in genome-wide association studies. Nat Genet 42(4):348–354
Karigl G (1981) A recursive algorithm for the calculation of identity coefficients. Ann Hum Genet 45(Pt 3):299–305
Labuda M, Labuda D, Korab-Laskowska M, Cole DE, Zietkiewicz E, Weissenbach J, Popowska E, Pronicka E, Root AW, Glorieux FH (1996) Linkage disequilibrium analysis in young populations: pseudo-vitamin D-deficiency rickets and the founder effect in French Canadians. Am J Hum Genet 59(3):633–643
Lange K, Sinsheimer JS, Sobel E (2005) Association testing with Mendel. Genet Epidemiol 29(1):36–50
Marchini J, Cardon LR, Phillips MS, Donnelly P (2004) The effects of human population structure on large genetic association studies. Nat Genet 36(5):512–517
McQuillan R, Leutenegger AL, Abdel-Rahman R, Franklin CS, Pericic M, Barac-Lauc L, Smolej-Narancic N, Janicijevic B, Polasek O, Tenesa A, Macleod AK, Farrington SM, Rudan P, Hayward C, Vitart V, Rudan I, Wild SH, Dunlop MG, Wright AF, Campbell H, Wilson JF (2008) Runs of homozygosity in European populations. Am J Hum Genet 83(3):359–372
McVean G (2009) A genealogical interpretation of principal components analysis. PLoS Genet 5(10):e1000686
Moreau C, Vezina H, Yotova V, Hamon R, de Knijff P, Sinnett D, Labuda D (2009) Genetic heterogeneity in regional populations of Quebec—parental lineages in the Gaspe Peninsula. Am J Phys Anthropol 139(4):512–522
Moreau C, Vezina H, Jomphe M, Lavoie EM, Roy-Gagnon MH, Labuda D (2010) When genetics and genealogies tell different stories—maternal lineages in Gaspesia. Ann Hum Genet (in press). doi:10.1111/j.1469-1809.2010.00617.x
Ober C, Abney M, McPeek MS (2001) The genetic dissection of complex traits in a founder population. Am J Hum Genet 69(5):1068–1079
Patterson N, Price AL, Reich D (2006) Population structure and eigenanalysis. PLoS Genet 2(12):e190
Poon AH, Laprise C, Lemire M, Montpetit A, Sinnett D, Schurr E, Hudson TJ (2004) Association of vitamin D receptor genetic variants with susceptibility to asthma and atopy. Am J Respir Crit Care Med 170(9):967–973
Pouyez C, Lavoie Y (1983) Les Saguenayens. Introduction à l’histoire des populations du Saguenay. Presses de l’Université du Québec
Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38(8):904–909
Price AL, Helgason A, Palsson S, Stefansson H, St. Clair D, Andreassen OA, Reich D, Kong A, Stefansson K (2009) The impact of divergence time on the nature of population structure: an example from Iceland. PLoS Genet 5(6):e1000505
Price AL, Zaitlen NA, Reich D, Patterson N (2010) New approaches to population stratification in genome-wide association studies. Nat Rev Genet 11(7):459–463
Pritchard JK, Stephens M, Rosenberg NA, Donnelly P (2000) Association mapping in structured populations. Am J Hum Genet 67(1):170–181
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, Maller J, Sklar P, de Bakker PIW, Daly MJ, Sham PC (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81(3):559–575
R Development Core Team (2010) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna. http://www.R-project.org
Reich D, Thangaraj K, Patterson N, Price AL, Singh L (2009) Reconstructing Indian population history. Nature 461(7263):489–494
Reynolds J, Weir BS, Cockerham CC (1983) Estimation of the coancestry coefficient: basis for a short-term genetic distance. Genetics 105(3):767–779
Scriver CR (2001) Human genetics: lessons from Quebec populations. Annu Rev Genomics Hum Genet 2(1):69–101
Seldin MF, Shigeta R, Villoslada P, Selmi C, Tuomilehto J, Silva G, Belmont JW, Klareskog L, Gregersen PK (2006) European population substructure: clustering of northern and southern populations. PLoS Genet 2(9):1339
Slatkin M (1995) A measure of population subdivision based on microsatellite allele frequencies. Genetics 139(1):457–462
The International HapMap Consortium (2007) A second generation human haplotype map of over 3.1 million SNPs. Nature 449(7164):851–861
Vézina H, Tremblay M, Desjardins B, Houde L (2005) Origines et contributions génétiques des fondatrices et des fondateurs de la population québécoise. Cah Que Demogr 34(2):235–258
Voight BF, Pritchard JK (2005) Confounding from cryptic relatedness in case-control association studies. PLoS Genet 1(3):e32
Wigginton JE, Cutler DJ, Abecasis GR (2005) A note on exact tests of Hardy-Weinberg equilibrium. Am J Hum Genet 76(5):887–883
Yamaguchi-Kabata Y, Nakazono K, Takahashi A, Saito S, Hosono N, Kubo M, Nakamura Y, Kamatani N (2008) Japanese population structure, based on SNP genotypes from 7003 individuals compared to other ethnic groups: effects on population-based association studies. Am J Hum Genet 83(4):445–456
Yotova V, Labuda D, Zietkiewicz E, Gehl D, Lovell A, Lefebvre JF, Bourgeois S, Lemieux-Blanchard E, Labuda M, Vezina H, Houde L, Tremblay M, Toupance B, Heyer E, Hudson TJ, Laberge C (2005) Anatomy of a founder effect: myotonic dystrophy in Northeastern Quebec. Hum Genet 117(2–3):177–187
Zapala MA, Schork NJ (2006) Multivariate regression analysis of distance matrices for testing associations between gene expression patterns and related variables. Proc Natl Acad Sci USA 103(51):19430–19435
Zhang Z, Ersoz E, Lai C-Q, Todhunter RJ, Tiwari HK, Gore MA, Bradbury PJ, Yu J, Arnett DK, Ordovas JM, Buckler ES (2010) Mixed linear model approach adapted for genome-wide association studies. Nat Genet 42(4):355–360
Acknowledgments
We are grateful to all participants who generously shared their DNA and information required to reconstruct their genealogies and to Laurent Richard from the Historical Geography Laboratory at Laval University (QC, Canada) for cartography work. Support of the Réseau de Médecine Génétique Appliquée (RMGA) of the Fonds de la Recherche en Santé du Québec (FRSQ) as well as of the Canadian Institutes of Health Research (CIHR; to DL and HV) is gratefully acknowledged. CB was supported by a studentship from the Fondation de l’Hôpital Sainte-Justine and the Fondation des Étoiles and is now scholar from the FRSQ.
Author information
Authors and Affiliations
Corresponding authors
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Roy-Gagnon, MH., Moreau, C., Bherer, C. et al. Genomic and genealogical investigation of the French Canadian founder population structure. Hum Genet 129, 521–531 (2011). https://doi.org/10.1007/s00439-010-0945-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00439-010-0945-x