Linkage disequilibrium and haplotype block structure in a composite beef cattle breed

Mokry, Fabiana Barichello; Buzanskas, Marcos Eli; de Alvarenga Mudadu, Maurício; do Amaral Grossi, Daniela; Higa, Roberto Hiroshi; Ventura, Ricardo Vieira; de Lima, Andressa Oliveira; Sargolzaei, Mehdi; Conceição Meirelles, Sarah Laguna; Schenkel, Flávio Schramm; da Silva, Marcos Vinicius Gualberto Barbosa; Méo Niciura, Simone Cristina; de Alencar, Maurício Mello; Munari, Danísio Prado; de Almeida Regitano, Luciana Correia

doi:10.1186/1471-2164-15-S7-S6

Linkage disequilibrium and haplotype block structure in a composite beef cattle breed

Research
Open access
Published: 27 October 2014

Volume 15, article number S6, (2014)
Cite this article

Download PDF

You have full access to this open access article

BMC Genomics Aims and scope Submit manuscript

Linkage disequilibrium and haplotype block structure in a composite beef cattle breed

Download PDF

Fabiana Barichello Mokry¹,
Marcos Eli Buzanskas²,
Maurício de Alvarenga Mudadu³,
Daniela do Amaral Grossi⁴,
Roberto Hiroshi Higa⁵,
Ricardo Vieira Ventura⁴,
Andressa Oliveira de Lima¹,
Mehdi Sargolzaei⁴,
Sarah Laguna Conceição Meirelles⁶,
Flávio Schramm Schenkel⁴,
Marcos Vinicius Gualberto Barbosa da Silva⁷,
Simone Cristina Méo Niciura³,
Maurício Mello de Alencar³,
Danísio Prado Munari² &
…
Luciana Correia de Almeida Regitano³

4059 Accesses
47 Citations
Explore all metrics

Abstract

Background

The development of linkage disequilibrium (LD) maps and the characterization of haplotype block structure at the population level are useful parameters for guiding genome wide association (GWA) studies, and for understanding the nature of non-linear association between phenotypes and genes. The elucidation of haplotype block structure can reduce the information of several single nucleotide polymorphisms (SNP) into the information of a haplotype block, reducing the number of SNPs in a coherent way for consideration in GWA and genomic selection studies.

Results

The maximum average LD, measured by r² varied between 0.33 to 0.40 at a distance of < 2.5 kb, and the minimum average values of r² varied between 0.05 to 0.07 at distances ranging from 400 to 500 kb, clearly showing that the average r² reduced with the increase in SNP pair distances. The persistence of LD phase showed higher values at shorter genomic distances, decreasing with the increase in physical distance, varying from 0.96 at a distance of < 2.5 kb to 0.66 at a distance from 400 to 500 kb. A total of 78% of all SNPs were clustered into haplotype blocks, covering 1,57 Mb of the total autosomal genome size.

Conclusions

This study presented the first high density linkage disequilibrium map and haplotype block structure for a composite beef cattle population, and indicates that the high density SNP panel over 700 k can be used for genomic selection implementation and GWA studies for Canchim beef cattle.

Long-range linkage disequilibrium in French beef cattle breeds

Article Open access 23 July 2021

Linkage disequilibrium, persistence of phase and effective population size estimates in Hereford and Braford cattle

Article Open access 01 February 2016

Genome wide linkage disequilibrium and genetic structure in Sicilian dairy sheep breeds

Article Open access 10 October 2014

Background

With the advance in high-throughput single nucleotide polymorphism (SNP) detection and genotyping technologies, genome-wide association (GWA) and genomic selection (GS) studies in livestock have become of great interest. Nonetheless, both types of studies rely on the extent of linkage disequilibrium between markers across the genome. Linkage disequilibrium (LD) can be defined as the non-random relationship of alleles between two loci within a population. The development of LD maps and the characterization of haplotype block structure at the population level are useful parameters for guiding GWA, candidate gene and candidate region studies [1], as well as for understanding the nature of non-linear association between phenotypes and genes. LD in a population can be affected by some factors, such as population structure, migration, selection, genetic drift, mutations and recombination rates [2].

A variety of studies can be found regarding linkage disequilibrium in cattle populations. The first genome-wide LD study was conducted in a Dutch black-and-white dairy cattle population based on 284 microsatellite markers [3]. From there on, many other studies have been performed and confirmed extensive LD in cattle [4–8], being followed by a second generation of LD maps developed with 30 k or more SNPs spanning the entire bovine genome [9–12]. Lastly, a recent high density LD study on Nellore using 700 k SNPs has been published [13], which can be considered the third generation of LD maps, concluding that the estimated LD for SNPs within a physical genomic distance of 30 kb corroborates the use of the 700 k SNPs panel for genomic selection implementation in Nellore cattle.

Haplotype block studies are not as common as LD map studies in cattle. The elucidation of haplotype block structure can bring important considerations for GWA and GS studies, such as the possibility of selecting a set of SNPs with the prospect of reducing the information of several SNPs into the information of a haplotype block, reducing the number of SNPs in a coherent way [14], and optimizing the design and analysis of GWA. Haplotype blocks can also be used for detection of genomic regions under selection during evolution, and identification of signatures of recent positive selection [15].

With the new releases of commercial high density SNP panels with over 700 k SNPs, it is possible to build high definition LD maps and haplotype blocks. In this study, we used the Illumina BovineHD BeadChip to investigate LD and persistence of LD phase patterns, as well as the haplotype block structure for Canchim, a composite breed of beef cattle. Canchim was originated in the early 1960's in Brazil from crosses between Bos primigenius indicus (zebu) and Bos primigenius taurus (Charolais) animals [16]. The final composition of Canchim animals is 5/8 parts Charolais and 3/8 parts zebu (currently, the Nellore breed is the most common zebu breed used in the crosses to obtain Canchim animals). However, the Canchim Breeding Association allows for four different crossing schemes to generate Canchim animals, and one of these schemes, called UEPAE (Unidade de Execução de Pesquisa de Âmbito Estadual), produces animals with, on average, 3.1% more Charolais in the final composition, called MA animals, but they are still evaluated with other Canchim animals taking into account the differences in the percentage of each original breed (zebu and Charolais) [17].

Methods

Animals

In respect to the Canchim composition particularity, a sample of 399 animals (285 Canchim, 114 MA) was selected, and Canchim animals will be referred to as CA, the MA animals as MA, and the joint group of Canchim and MA animals as CAN. These 399 animals belong to seven herds located in two Brazilian states (São Paulo and Goiás), and are registered in the Canchim Breeding Association database. This study has been performed with the approval of the Embrapa Southeast Livestock Ethical Committee of Animal Use (CEUA-CPPSE) under protocol number 02/2009.

The Canchim population is considered to be rather small, especially when compared to other breeds in Brazil. The herd is estimated to contain approximately 30,000 animals, according to data from the Canchim Breeding Association [18], and supplied just 0.12% of all beef cattle semen sales in Brazil [19]. However, this data set originated from 50 different bulls, comprised of Canchim and MA animals from the breed developer farm and other farms. This is representative of 0.01% of the entire Canchim population. For these reasons, we ignored the probable founder effects, and considered this sample to be representative of the current Canchim population.

Genotyping and SNP quality control

The 399 animals described above were genotyped using the BovineHD BeadChip (Illumina Inc., San Diego, CA), which consists of 786,799 SNPs evenly distributed along the bovine genome with an average distance of 3 kb. In this study, sex chromosomes and loci without an assigned position in the Cattle Genome Assembly UMD 3.1 [20] were discarded, as well as animals with a call rate < 0.90. The SNP quality control was carried out according to low call rates (< 0.90), Hardy-Weinberg equilibrium (< 0.0001), and minor allele frequency (MAF) < 0.05, as extreme values of MAF can reduce the power to properly estimate linkage disequilibrium and persistence of LD phase [21].

Linkage disequilibrium and persistence of linkage disequilibrium phase

The data set was divided into Canchim and MA animals for an exploratory analysis of persistence of LD phase between both genetic groups, and LD estimation was performed using the SNPPLD tool available in the gebv software [22]. The LD measurement adopted in this study was r² [23], which is the correlation coefficient between SNP pairs, and was calculated according to the following equation:

r_{i j}^{2} = \frac{p_{i j} - p_{i} \times p_{j}^{2}}{p_{i} (1 - p_{i}) \times p_{j} (1 - p_{j})}

where p_ij is the frequency of the two-marker haplotype, and p_i, and p_j are the marginal allelic frequencies in the i^th and j^th SNP, respectively [24]. The value of r² can vary from 0 to 1, where zero means no correlation between SNP pairs, and 1 means perfect correlation between the SNP pairs. Due to the significant amount of possible SNP pair-wise comparisons, the r² calculation was limited to SNPs within maximum distances of 500 kb from each other, since r²-values obtained using SNPs with distances over 500 kb presented low LD values (data not shown), and to estimate all SNPs pair-wise comparisons would exponentially increase computing processing. Paternal and maternal haplotypes were utilized for the estimation of LD.

The persistence of LD phase was evaluated across genetic groups (CA and MA) by the Pearson correlation of the square root of r² (r), by attributing the appropriate sign based on the calculated D value, called signed r. Persistence of LD phase calculation was performed according to the following equation:

D = p_{i j} - (p_{i} \times p_{j})

where p_ij, p_i, and p_j were defined as stated above. The results were ordered by chromosome (chr) and distance between SNPs.

Haplotype block structure

Haplotype block refers to a combination of alleles linked along a chromosome, which are inherited together from a common ancestor [25]. The haplotype block structure study was carried out using the joint CAN group. For doing this, the same quality control filters were applied through the PLINK v1.07 software [26], and the phase and haplotype reconstruction were performed using the BEAGLE Version 3.3.1 software [27] for each chromosome. Afterwards, the Haploview [28] software, which uses haplotype block definition by Gabriel et al. [29] by default, was used for estimating haplotype block patterns for the 29 pairs of autosome chromosomes, within SNPs at a maximum distance of 500 kb.

Results

A total of 395 animals (283 CA and 112 MA) passed the quality control filters from the SNPPLD software, yielding a total of 716,089 SNPs for CA, 658,132 SNPs for MA, and 713,615 SNPs for CAN animals. The number of SNP pairs showed small variation among CA, MA, and CAN due to the quality control filtering, in which some SNPs were included for CA, MA, and CAN, and some other SNPs were not included, as shown in Table 1. The maximum average r² of 0.40 was obtained for CA animals at a distance of < 2.5 kb, while MA animals obtained an average r² of 0.33, and the joint population (CAN) resulted in an average r² of 0.38 (Table 1). The average minimum values of r² (0.07 CA, 0.05 MA, and 0.06 CAN) were obtained at a distance from 400 to 500 kb (Table 1), clearly showing that the average r² reduced with the increase in SNP pair distances. The average r², up to the physical genomic distance of 500 kb, was slightly higher for CA animals, followed by CAN, and lastly, by MA animals (Figure 1). The persistence of LD phase between CA and MA animals showed higher values at shorter genomic distances, decreasing with the increase in physical distance (Figure 1). The persistence of LD phase varied from 0.96 at distances < 2.5 kb, to 0.66 at distances between 400 and 500 kb, with an overall average of 0.85 (Table 1). Even though the LD decay with the increase in distance is clear (Figure 1), it is possible to identify some chromosomes (chrs) with lower LD decay among genetic groups, as in chrs 5, 14, and 21 (Figure 2).

Table 1 Summary of SNP pairs, average linkage disequilibrium (r²), standard deviation (SD), median, persistence of LD phase (PL) between CA and MA genetic groups by genomic distance.

Full size table

The quality control filtering for the haplotype block study was performed using the PLINK v1.07 software [26], which permitted 664,027 SNPs to remain in this study, with an average distance between SNP markers of 3.8 kb, leading to a total autosomal genome size of 2.51 Mb. Haplotype blocks formed by only two SNPs were discarded to avoid spurious block formation. A descriptive summary of the haplotype block analysis can also be found in Table 2. From the previous SNPs, 517,393 were clustered into haplotype blocks, which correspond to 78% of all SNPs, covering 1.57 Mb of the total autosomal genome size (Table 2). Chr 1 showed the highest number of SNPs (41,830) and haplotype blocks (4,787), while chr 25 presented the smallest number of SNPs (11,671) and haplotype blocks (1,396) (Table 2), with an overall average haplotype block length of 20 kb (Figure 3). Overall, 61% of chromosome lengths were covered by haplotype blocks, with chrs 23 and 28 showing the smallest coverage (53%), while chrs 2, 7 and 9 showed the greatest coverage (66%) (Table 2). Figure 4 displays the haplotype distributions on chrs 7, 10, 12, 15, 16, 21, 23, 24, 25, and 29 with some uncommon haplotype distributions (gaps without any haplotype - chrs 10, 12, 23; areas with higher frequencies of larger haplotypes - chrs 7, 12, 16, 24, 29; and uncommon haplotype pattern distributions at the extreme ends - chrs 15, 21, 25).

Table 2 Haplotype block summary per chromosome (chr).

Full size table

Discussion

The choice of using r² instead of D' for assessing LD measurements was due to the fact that it is less affected by allele frequencies in a finite population size compared to D', which tends to overestimate LD in small samples and low frequency alleles [12, 30, 31]. According to the literature, mean r²-values above 0.30 can be considered a strong LD and useful for QTL mapping [31], while an mean r²-value of 0.20 is considered enough to achieve an accuracy of 0.85 for genomic breeding value (GBV) estimation [32]. Mean values of r²=0.30 and above were found in CA animals extending to the physical genomic distance of 10 kb, being followed by CAN animals at a distance of 7.5 kb, and lastly by MA animals to the physical genomic distance of 5 kb (Table 1). Considering the great appeal of using SNPs for GBV estimation and the above mentioned threshold of r²=0.20, for CA animals GBV can be estimated by using SNP markers at a distance between 30-40 kb, for CAN animals by using SNP markers at distances between 20-30 kb, and for MA animals by using SNP markers at distances between 10-20 kb. Even though making comparisons between LD studies is difficult, as the level of LD varies due to sample size, marker types, density, and population history [33], the results obtained in this study are in agreement with a recent LD study using approximately 700 kb SNPs in Nellore cattle, which reports levels of LD (r²) higher than 0.30 for SNP markers spanning to a physical genomic distance of 3 kb, and an r² higher than 0.20 for SNP markers continuing to a physical genomic distance of 20 kb [13].

Animals from the MA genetic group showed lower levels of LD compared to CA animals (Table 1). This might be explained in part by the crossing system used to obtain MA animals (CA × zebu = F1, and F1 × Charolais = MA), while most of the CA animals are obtained by CA × CA crosses. Some studies have shown that the average LD decay with the increase in physical genomic distance between loci is more accentuated in crossbred and admixed populations compared to purebred populations [34, 35]. One of the reasons is that individuals from crossbred populations are less related to each other (further common ancestor), leading to LD haplotypes in crossbred populations being narrower than LD haplotypes in purebreds [35]. In the MA situation, these animals are obtained by initially crossing Canchim with zebu animals, and the progeny are then crossed with Charolais animals. Despite the decrease of LD levels along with the increase in physical genomic distance, the LD behavior also showed variability among chromosomes and chromosomal regions. These variations can be attributed to many factors, such as differences in recombination rates between and within chromosomes, heterozygosity, selection effects, and genetic drift.

The understanding of the persistence of LD phase is essential for effective genomic selection across admixed populations or crossbred animals, as a pair of SNPs can exhibit the same value of r² between two populations, but in different LD phases [36]. The correlation of the signed r value represents the degree of genetic relationship between populations [37], and between MA and CA animals it was over 0.80 continuing to a genomic distance of 100 kb between both genetic groups (Table 1), and decreased to a minimum of 0.66 extending to a genomic distance of 500 kb. According to previous studies [35, 37], high correlation values imply in consistency of LD phase, and considering a threshold of r² = 0.20 to achieve an accuracy of 0.85 for genomic breeding value estimation [32], the persistence of LD phase between MA and CA animals stayed over 0.91 until reaching a genomic distance of 30 kb, meaning that one population can be used to predict the performance of the other (e.g. MA animals) [12], which allows for considering both genetic groups (CA and MA animals) together as one breed (CAN animals) for future studies and for genetic evaluation purposes.

There are many published studies on LD and haplotype block properties for cattle, which vary in many aspects (breed of interest, marker types, marker density, and chromosome regions), yielding average haplotype block sizes from a few kb in length (5.7 kb considering 2 or more SNPs [9], 26.2 kb considering 4 or more SNPs [8]) to hundreds of kb in length (700 kb [6]), and covering from 2.18% to 4.67% [4, 10] of the detected genome. However, these studies used smaller marker densities, with an average distance of 62 kb between adjacent markers [10]. Another study, which considered only high-density markers in specific areas of the bovine genome (approximate distances of 5 kb between adjacent markers), reported an average block size of 10.3 kb across many breeds, an average of 3.8 SNPs per block, and a total of 34% of the high-density areas covered by haplotype blocks [9]. These values are still smaller than the ones reported in this study for percentage of covered genome (61%), average number of SNPs per block (6.64 SNPs), and a total of 78% SNPs in haplotype blocks, corroborating the assumption that as the marker density increases, the more haplotype blocks of smaller sizes will be identified. On the other hand, this is not supported in total by a study in German Holstein cattle, which did not report relevant variation on haplotype block number with the increase of marker density, but an increase in block coverage percentage [10].

There are many aspects reported in the literature involved in LD shaping, which in turn affects the haplotype block structure, such as meiotic recombination, natural and artificial selection, population bottle necks, genetic drift, and admixture [29, 38–42]. However, most of these studies were carried out in humans [39–42], which, among other differences, are not affected by artificial selection and have a higher effective population size than cattle [43]. All these factors play important roles in the haplotype block structure which could be the cause of some of the uncommon haplotype distributions found in Figure 4.

Conclusions

This study describes the first high-density linkage disequilibrium map and haplotype block structure for a composite beef cattle population. Considering an r² ≥ 0.20 as being useful for genomic breeding value estimations, the results demonstrate that the high density SNP panel used here can be implemented for genomic selection of Canchim beef cattle. The persistence of LD phase between CA and MA animals was consistent, which supports the decision of considering both genetic groups together in future studies and in genetic evaluation programs. Further studies on factors affecting the uncommon haplotype block distribution still need to be carried out in order to better understand the way these factors are shaping the LD and haplotype blocks.

Availability of supporting data

The genomic data and further supporting data are available upon request from Dr. Luciana Correia de Almeida Regitano (Embrapa Southeast Livestock. Address: Rodovia Washington Luiz, km 234, São Carlos, São Paulo, 13560-970, Brazil; e-mail: luciana.regitano@embrapa.br; telephone: +55 16 34115600).

References

De La Vega FM, Dailey D, Ziegle J, Williams J, Madden D, Gilbert DA: New generation pharmacogenomic tools: a SNP linkage disequilibrium Map, validated SNP assay resource, and high-throughput instrumentation system for large-scale genetic studies. Biotechniques. 2002, 52-54. 48-50
Hedrick PW: Genetics Of Populations. 2005, Jones and Bartlett Publishers, Biological Science (Jones and Bartlett) Series
Google Scholar
Farnir F, Coppieters W, Arranz J, Berzi P, Cambisano N, Grisart B, Karim L, Marcq F, Moreau L, Mni M, Nezer C, Simon P, Vanmanshoven P, Wagenaar D, Georges M: Extensive Genome-wide Linkage Disequilibrium in Cattle. Genome Res. 2000, 10: 220-227. 10.1101/gr.10.2.220.
Article PubMed CAS Google Scholar
Khatkar MS, Zenger KR, Hobbs M, Hawken RJ, Cavanagh JAL, Barris W, McClintock AE, McClintock S, Thomson PC, Tier B, Nicholas FW, Raadsma HW: A primary assembly of a bovine haplotype block map based on a 15,036-single-nucleotide polymorphism panel genotyped in holstein-friesian cattle. Genetics. 2007, 176: 763-72.
Article PubMed CAS PubMed Central Google Scholar
Odani M, Narita A, Watanabe T, Yokouchi K, Sugimoto Y, Fujita T, Oguni T, Matsumoto M, Sasaki Y: Genome-wide linkage disequilibrium in two Japanese beef cattle breeds. Anim Genet. 2006, 37: 139-44. 10.1111/j.1365-2052.2005.01400.x.
Article PubMed CAS Google Scholar
Gautier M, Faraut T, Moazami-Goudarzi K, Navratil V, Foglio M, Grohs C, Boland A, Garnier J-G, Boichard D, Lathrop GM, Gut IG, Eggen A: Genetic and haplotypic structure in 14 European and African cattle breeds. Genetics. 2007, 177: 1059-70. 10.1534/genetics.107.075804.
Article PubMed CAS PubMed Central Google Scholar
McKay SD, Schnabel RD, Murdoch BM, Matukumalli LK, Aerts J, Coppieters W, Crews D, Dias Neto E, Gill CA, Gao C, Mannen H, Stothard P, Wang Z, Van Tassell CP, Williams JL, Taylor JF, Moore SS: Whole genome linkage disequilibrium maps in cattle. BMC Genet. 2007, 8: 74-
Article PubMed PubMed Central Google Scholar
Kim E-S, Kirkpatrick BW: Linkage disequilibrium in the North American Holstein population. Anim Genet. 2009, 40: 279-88. 10.1111/j.1365-2052.2008.01831.x.
Article PubMed CAS Google Scholar
Villa-Angulo R, Matukumalli LK, Gill CA, Choi J, Van Tassell CP, Grefenstette JJ: High-resolution haplotype block structure in the cattle genome. BMC Genet. 2009, 10: 19-
Article PubMed PubMed Central Google Scholar
Qanbari S, Pimentel ECG, Tetens J, Thaller G, Lichtner P, Sharifi AR, Simianer H: The pattern of linkage disequilibrium in German Holstein cattle. Anim Genet. 2010, 41: 346-56.
PubMed CAS Google Scholar
Flury C, Tapio M, Sonstegard T, Drögemüller C, Leeb T, Simianer H, Hanotte O, Rieder S: Effective population size of an indigenous Swiss cattle breed estimated from linkage disequilibrium. J Anim Breed Genet. 2010, 127: 339-47. 10.1111/j.1439-0388.2010.00862.x.
Article PubMed CAS Google Scholar
Lu D, Sargolzaei M, Kelly M, Li C, Vander Voort G, Wang Z, Plastow G, Moore S, Miller SP: Linkage disequilibrium in Angus, Charolais, and Crossbred beef cattle. Front Genet. 2012, 3: 152-August
Article PubMed PubMed Central Google Scholar
Espigolan R, Baldi F, Boligon AA, Souza FR, Gordo DG, Tonussi RL, Cardoso DF, Oliveira HN, Tonhati H, Sargolzaei M, Schenkel FS, Carvalheiro R, Ferro JA, Albuquerque LG: Study of whole genome linkage disequilibrium in Nellore cattle. BMC Genomics. 2013, 14: 305-10.1186/1471-2164-14-305.
Article PubMed CAS PubMed Central Google Scholar
Johnson GC, Esposito L, Barratt BJ, Smith AN, Heward J, Di Genova G, Ueda H, Cordell HJ, Eaves IA, Dudbridge F, Twells RC, Payne F, Hughes W, Nutland S, Stevens H, Carr P, Tuomilehto-Wolf E, Tuomilehto J, Gough SC, Clayton DG, Todd JA: Haplotype tagging for the identification of common disease genes. Nat Genet. 2001, 29: 233-7. 10.1038/ng1001-233.
Article PubMed CAS Google Scholar
Sabeti PC, Reich DE, Higgins JM, Levine HZP, Richter DJ, Schaffner SF, Gabriel SB, Platko J V, Patterson NJ, McDonald GJ, Ackerman HC, Campbell SJ, Altshuler D, Cooper R, Kwiatkowski D, Ward R, Lander ES: Detecting recent positive selection in the human genome from haplotype structure. Nature. 2002, 419: 832-7. 10.1038/nature01140.
Article PubMed CAS Google Scholar
Vianna AT: Formação Do Gado Canchim Pelo Cruzamento Charoles-Zebu. Ministério da Agricultura. 1960, 48: SIA. Estudos T{é}cnicos
Google Scholar
Alencar MM: Bovino - Raça Canchim: Origem E Desenvolvimento. EMBRAPA-UEPAE de São Carlos. 1986
Google Scholar
Associação Brasileira de Criadores de Canchim: Raça Canchim: O precoce brasileiro. Merc da Carne. 2008, 28-31.
Google Scholar
Associação Brasileira de Inseminação Artificial: Índex Asbia: Importação, Exportação E Comercialização de Sêmen. 2012, 19:
Google Scholar
Zimin A V, Delcher AL, Florea L, Kelley DR, Schatz MC, Puiu D, Hanrahan F, Pertea G, Van Tassell CP, Sonstegard TS, Marçais G, Roberts M, Subramanian P, Yorke JA, Salzberg SL: A whole-genome assembly of the domestic cow, Bos taurus. Genome Biol. 2009, 10: R42-10.1186/gb-2009-10-4-r42.
Article PubMed PubMed Central Google Scholar
Goddard KA, Hopkins PJ, Hall JM, Witte JS: Linkage disequilibrium and allele-frequency distributions for 114 single-nucleotide polymorphisms in five populations. Am J Hum Genet. 2000, 66: 216-34. 10.1086/302727.
Article PubMed CAS PubMed Central Google Scholar
Sargolzaei M, Schenkel F, VanRaden P: gebv: Genomic breeding value estimator for livestock. Tech Rep to Dairy Cattle Breed Genet Comm. 2009, 1-7. University of Guelph
Google Scholar
Hill W, Robertson A: Linkage disequilibrium in finite populations. Theor Appl Genet. 1968, 38: 226-231. 10.1007/BF01245622.
Article PubMed CAS Google Scholar
Badke YM, Bates RO, Ernst CW, Schwab C, Steibel JP: Estimation of linkage disequilibrium in four US pig breeds. BMC Genomics. 2012, 13: 24-10.1186/1471-2164-13-24.
Article PubMed CAS PubMed Central Google Scholar
The International HapMap Consortium: A haplotype map of the human genome. Nature. 2005, 437: 1299-320. 10.1038/nature04226.
Article PubMed Central Google Scholar
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, Maller J, Sklar P, de Bakker PIW, Daly MJ, Sham PC: PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007, 81: 559-75. 10.1086/519795.
Article PubMed CAS PubMed Central Google Scholar
Browning SR, Browning BL: Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am J Hum Genet. 2007, 81: 1084-97. 10.1086/521987.
Article PubMed CAS PubMed Central Google Scholar
Barrett JC, Fry B, Maller J, Daly MJ: Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005, 21: 263-5. 10.1093/bioinformatics/bth457.
Article PubMed CAS Google Scholar
Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, Higgins J, DeFelice M, Lochner A, Faggart M, Liu-Cordero SN, Rotimi C, Adeyemo A, Cooper R, Ward R, Lander ES, Daly MJ, Altshuler D: The structure of haplotype blocks in the human genome. Science. 2002, 296: 2225-9. 10.1126/science.1069424.
Article PubMed CAS Google Scholar
Hedrick PW: Gametic disequilibrium measures: proceed with caution. Genetics. 1987, 117: 331-41.
PubMed CAS PubMed Central Google Scholar
Ardlie KG, Kruglyak L, Seielstad M: Patterns of linkage disequilibrium in the human genome. Nat Rev Genet. 2002, 3: 299-309. 10.1038/nrg777.
Article PubMed CAS Google Scholar
Meuwissen TH, Hayes BJ, Goddard ME: Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001, 157: 1819-1829.
PubMed CAS PubMed Central Google Scholar
Pritchard JK, Przeworski M: Linkage disequilibrium in humans: models and data. Am J Hum Genet. 2001, 69: 1-14. 10.1086/321275.
Article PubMed CAS PubMed Central Google Scholar
Shifman S: Linkage disequilibrium patterns of the human genome across populations. Hum Mol Genet. 2003, 12: 771-776. 10.1093/hmg/ddg088.
Article PubMed CAS Google Scholar
Toosi A, Fernando RL, Dekkers JCM: Genomic selection in admixed and crossbred populations. J Anim Sci. 2010, 88: 32-46. 10.2527/jas.2009-1975.
Article PubMed CAS Google Scholar
Goddard ME, Hayes BJ, McPartlan H, Chamberlain AJ: Can the same genetic markers be used in multiple breeds?. Proc 8th World Congr Genet Appl to Livest Prod Belo Horizonte, Brazil 2006. 2006, 16-22.
Google Scholar
De Roos APW, Hayes BJ, Spelman RJ, Goddard ME: Linkage disequilibrium and persistence of phase in Holstein-Friesian, Jersey and Angus cattle. Genetics. 2008, 179: 1503-12. 10.1534/genetics.107.084301.
Article PubMed CAS PubMed Central Google Scholar
Daly MJ, Rioux JD, Schaffner SF, Hudson TJ, Lander ES: High-resolution haplotype structure in the human genome. Nat Genet. 2001, 29: 229-32. 10.1038/ng1001-229.
Article PubMed CAS Google Scholar
Zhang K, Akey JM, Wang N, Xiong M, Chakraborty R, Jin L: Randomly distributed crossovers may generate block-like patterns of linkage disequilibrium: an act of genetic drift. Hum Genet. 2003, 113: 51-9.
PubMed Google Scholar
Greenwood TA, Rana BK, Schork NJ: Human haplotype block sizes are negatively correlated with recombination rates. Genome Res. 2004, 14: 1358-61. 10.1101/gr.1540404.
Article PubMed CAS PubMed Central Google Scholar
Greenawalt DM, Cui X, Wu Y, Lin Y, Wang H-Y, Luo M, Tereshchenko I V, Hu G, Li JY, Chu Y, Azaro MA, Decoste CJ, Chimge N-O, Gao R, Shen L, Shih WJ, Lange K, Li H: Strong correlation between meiotic crossovers and haplotype structure in a 2.5-Mb region on the long arm of chromosome 21. Genome Res. 2006, 16: 208-14.
Article PubMed CAS PubMed Central Google Scholar
Phillips MS, Lawrence R, Sachidanandam R, Morris AP, Balding DJ, Donaldson MA, Studebaker JF, Ankener WM, Alfisi S V, Kuo F-S, Camisa AL, Pazorov V, Scott KE, Carey BJ, Faith J, Katari G, Bhatti HA, Cyr JM, Derohannessian V, Elosua C, Forman AM, Grecco NM, Hock CR, Kuebler JM, Lathrop JA, Mockler MA, Nachtman EP, Restine SL, Varde SA, Hozza MJ, et al: Chromosome-wide distribution of haplotype blocks and the role of recombination hot spots. Nat Genet. 2003, 33: 382-7. 10.1038/ng1100.
Article PubMed CAS Google Scholar
Hayes BJ, Visscher PM, McPartlan HC, Goddard ME: Novel multilocus measure of linkage disequilibrium to estimate past effective population size. Genome Res. 2003, 13: 635-43. 10.1101/gr.387103.
Article PubMed CAS PubMed Central Google Scholar

Download references

Acknowledgements

The authors would like to acknowledge all technicians involved in this study for providing excellent assistance, the Embrapa Bioinformatics Multi-Users Laboratory for its facility and human resources, and also to the Canchim Breed Association for providing DNA material. We thank Dr. David Z. Mokry for English revisions and corrections. This article was funded by grants from CAPES/PNPD n^o 02663/09-0 (Coordination for Enhancement of Higher Education Personnel), and Embrapa (Brazilian Agricultural Research Corporation). The funding agencies had no role in study design, data collection and analysis, interpretation, decision to publish, or preparation of the manuscript.

Marcos Vinicius Gualberto Barbosa da Silva, Maurício Mello de Alencar, Luciana Correia de Almeida Regitano, and Marcos Eli Buzanskas (142053/2010-4) are CNPq fellows. Fabiana Barichello Mokry, Andressa Oliveira de Lima, and Marcos Eli Buzanskas (5285-11-9) are CAPES fellows.

This article has been published as part of BMC Genomics Volume 15 Supplement 7, 2014: Proceedings of the 9th International Conference of the Brazilian Association for Bioinformatics and Computational Biology (X-Meeting 2013). The full contents of the supplement are available online at http://www.biomedcentral.com/bmcgenomics/supplements/15/S7.

Author information

Authors and Affiliations

Department of Genetics and Evolution, Federal University of São Carlos, Rodovia Washington Luiz, km 235, PO BOX 676, 13565-905, São Carlos, Brazil
Fabiana Barichello Mokry & Andressa Oliveira de Lima
Departamento de Ciências Exatas, UNESP - Univ Estadual Paulista, Faculdade de Ciências Agrárias e Veterinárias, Jaboticabal, SP, 14884-900, Brazil
Marcos Eli Buzanskas & Danísio Prado Munari
Embrapa Southeast Livestock, Rodovia Washington Luiz, km 234, PO BOX 339, 13560-970, São Carlos, Brazil
Maurício de Alvarenga Mudadu, Simone Cristina Méo Niciura, Maurício Mello de Alencar & Luciana Correia de Almeida Regitano
Centre for Genetic Improvement of Livestock, University of Guelph, 50 Stone Road East, N1G 2W1, Guelph, Ontario, Canada
Daniela do Amaral Grossi, Ricardo Vieira Ventura, Mehdi Sargolzaei & Flávio Schramm Schenkel
Embrapa Agricultural Informatics, Avenida André Tosello, 209, PO BOX 6041, 13083-886, Campinas, Brazil
Roberto Hiroshi Higa
Department of Animal Science, Federal University of Lavras, PO BOX 3037, 37200-00, Lavras, Brazil
Sarah Laguna Conceição Meirelles
Embrapa Dairy Cattle, Rua Eugênio do Nascimento, 610, 36038-330, Juiz de Fora, Brazil
Marcos Vinicius Gualberto Barbosa da Silva

Authors

Fabiana Barichello Mokry
View author publications
You can also search for this author in PubMed Google Scholar
Marcos Eli Buzanskas
View author publications
You can also search for this author in PubMed Google Scholar
Maurício de Alvarenga Mudadu
View author publications
You can also search for this author in PubMed Google Scholar
Daniela do Amaral Grossi
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Hiroshi Higa
View author publications
You can also search for this author in PubMed Google Scholar
Ricardo Vieira Ventura
View author publications
You can also search for this author in PubMed Google Scholar
Andressa Oliveira de Lima
View author publications
You can also search for this author in PubMed Google Scholar
Mehdi Sargolzaei
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Laguna Conceição Meirelles
View author publications
You can also search for this author in PubMed Google Scholar
Flávio Schramm Schenkel
View author publications
You can also search for this author in PubMed Google Scholar
Marcos Vinicius Gualberto Barbosa da Silva
View author publications
You can also search for this author in PubMed Google Scholar
Simone Cristina Méo Niciura
View author publications
You can also search for this author in PubMed Google Scholar
Maurício Mello de Alencar
View author publications
You can also search for this author in PubMed Google Scholar
Danísio Prado Munari
View author publications
You can also search for this author in PubMed Google Scholar
Luciana Correia de Almeida Regitano
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Luciana Correia de Almeida Regitano.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

FBM, MEB: contributed equally for the development of this research with data analysis, interpretation, figure compositions, manuscript writing and revision. MAM: data analysis, manuscript discussion and revision. DAG: data analysis, interpretation of results, manuscript revision. RHH: data analysis, manuscript discussion and revision. RVV: data analysis, interpretation. AOL: data analysis, interpretation of results, manuscript revision. MS: provided the SNPPLD software, interpretation. SLCM: experimental design, manuscript revision. FSS: results interpretation and discussion. MVGBS: experimental design, manuscript revision. SCMN: experimental design, DNA handling, manuscript revision. MMA: experimental design. DPM: data interpretation. LCAR: experimental design, interpretation, and manuscript revision.

Rights and permissions

This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.

About this article

Cite this article

Mokry, F.B., Buzanskas, M.E., de Alvarenga Mudadu, M. et al. Linkage disequilibrium and haplotype block structure in a composite beef cattle breed. BMC Genomics 15 (Suppl 7), S6 (2014). https://doi.org/10.1186/1471-2164-15-S7-S6

Download citation

Published: 27 October 2014
DOI: https://doi.org/10.1186/1471-2164-15-S7-S6

Linkage disequilibrium and haplotype block structure in a composite beef cattle breed