Copy number variation in the porcine genome inferred from a 60 k SNP BeadChip

Ramayo-Caldas, Yuliaxis; Castelló, Anna; Pena, Romi N; Alves, Estefania; Mercadé, Anna; Souza, Carla A; Fernández, Ana I; Perez-Enciso, Miguel; Folch, Josep M

doi:10.1186/1471-2164-11-593

Copy number variation in the porcine genome inferred from a 60 k SNP BeadChip

Research article
Open access
Published: 22 October 2010

Volume 11, article number 593, (2010)
Cite this article

Download PDF

You have full access to this open access article

BMC Genomics Aims and scope Submit manuscript

Copy number variation in the porcine genome inferred from a 60 k SNP BeadChip

Download PDF

Yuliaxis Ramayo-Caldas¹,
Anna Castelló¹,
Romi N Pena^2,5,
Estefania Alves³,
Anna Mercadé¹,
Carla A Souza^1,6,
Ana I Fernández³,
Miguel Perez-Enciso^1,4 &
…
Josep M Folch¹

7942 Accesses
87 Citations
Explore all metrics

Abstract

Background

Recent studies in pigs have detected copy number variants (CNVs) using the Comparative Genomic Hybridization technique in arrays designed to cover specific porcine chromosomes. The goal of this study was to identify CNV regions (CNVRs) in swine species based on whole genome SNP genotyping chips.

Results

We used predictions from three different programs (cnvPartition, PennCNV and GADA) to analyze data from the Porcine SNP60 BeadChip. A total of 49 CNVRs were identified in 55 animals from an Iberian x Landrace cross (IBMAP) according to three criteria: detected in at least two animals, contained three or more consecutive SNPs and recalled by at least two programs. Mendelian inheritance of CNVRs was confirmed in animals belonging to several generations of the IBMAP cross. Subsequently, a segregation analysis of these CNVRs was performed in 372 additional animals from the IBMAP cross and its distribution was studied in 133 unrelated pig samples from different geographical origins. Five out of seven analyzed CNVRs were validated by real time quantitative PCR, some of which coincide with well known examples of CNVs conserved across mammalian species.

Conclusions

Our results illustrate the usefulness of Porcine SNP60 BeadChip to detect CNVRs and show that structural variants can not be neglected when studying the genetic variability in this species.

Genome-wide mapping of copy number variations in commercial hybrid pigs using a high-density SNP genotyping array

Article 01 January 2016

A genome-wide single nucleotide polymorphism and copy number variation analysis for number of piglets born alive

Article Open access 27 April 2019

Using pre-selected variants from large-scale whole-genome sequence data for single-step genomic predictions in pigs

Article Open access 26 July 2023

Background

The pig (Sus scrofa) is one of the most widespread livestock species and one of the most economically important worldwide. The porcine genome has a total of 18 autosomes plus the X/Y sex chromosome pair; it is similar in size, complexity and chromosomal organization to the human genome. In contrast to SNPs and microsatellites, structural variations have received considerably less attention in pigs. Copy number variants (CNVs) are DNA segments ranging in length from kilobases to several megabases with a variable number of repeats among individuals [1]. Segmental duplications and CNVs have been extensively studied in other organisms [2–7]. Previous studies at genome scale suggest that CNVs comprise 5-12% of the human and ~4% of the dog genome [5, 8–10]. CNVs can influence gene expression, affect several metabolic traits and have been associated with Mendelian and complex genetic disorders [1].

Recent studies in pigs have detected CNVs using the Comparative Genomic Hybridization (CGH) technique in arrays designed to cover specific porcine chromosomes [11, 12]. An alternative, cheaper method for CNV detection is based on whole genome SNP genotyping chips [13–15], but it has not been tested yet, to our knowledge, in the swine species. A high-density porcine SNP BeadChip has recently been released by Illumina, which contains probes to genotype 62,163 SNPs covering the whole genome. This platform has an average distance between SNPs of 39.61 kb in autosomes and 81.28 kb in chromosome X (based on Sscrofa9 genome sequence assembly) and is a very valuable resource to study pig genetic variability and the molecular dissection of complex traits of economic importance [16].

The goal of this study was to detect CNV regions (CNVRs) from the Porcine SNP60 BeadChip data on autosomal chromosomes using a pedigree from an Iberian x Landrace (IBMAP) cross and to validate them in a collection of unrelated pigs from different origins.

Results and Discussion

Detection of structural variants

The Porcine SNP60 BeadChip data from 55 IBMAP animals were analyzed by multiple predictions from three different programs: cnvPartition (Illumina), PennCNV [17] and GADA [18]. The initial number of CNVs called by each software was 94, 84, and 200, respectively. Figure 1 summarizes the CNVs identified and compares the results obtained from the three programs.

For further analyses, we retained only CNVs applying a more stringent criterion, namely CNV regions (CNVRs) containing overlapping CNVs recalled by at least two programs, spanning three or more consecutive SNPs and detected in a minimum of two animals. A total of 49 CNVRs located in 13 of the 18 analyzed autosomal chromosomes were identified (Figure 2). All of these CNVRs showed Mendelian inheritance in animals across several generations of the IBMAP cross and therefore are unlikely to be artefacts or false positives, suggesting that our empirical criterion to retain CNVRs is reasonable.

The percentage of CNVRs confirmed by at least two programs was 52.38% for PennCNV, 21% for GADA and 40.42% for cnvPartition. A total of 26 CNVRs (53.06%) were detected by the three algorithms (Figure 1). Similar results were reported by Winchester et al. (2009) comparing different algorithms for CNV detection, suggesting that PennCNV is the most accurate program in the prediction of CNVs for the Illumina's platform [19]. In a recent study [20], the relative performance of seven methods for CNV identification was evaluated showing that the PennCNV algorithm has a moderate power and the lowest false positive rate. This is likely explained by the unique ability of this algorithm to integrate family relationships and signal intensities from parent-offspring trios data. The low percentage of CNVs called by the GADA software might be explained by the relative low coverage of the Porcine SNP60 BeadChip.

The size of the CNVRs detected ranged from 44.7 kb to 10.7 Mb, with a median size of 754.6 kb (Table 1). The Porcine SNP60 BeadChip was originally developed for high-throughput SNP genotyping in association studies. Although CNV detection is feasible with this technology, it is impaired by low marker density, non-uniform distribution of SNPs along pig chromosomes and lack of non-polymorphic probes specifically designed for CNV identification [16]. Hence, only the largest CNVRs are expected to be assessed with the Porcine SNP60 BeadChip. This explains the difference in minimum CNV length between our study (44.7 kb) and the work of Fadista et al., 2008 (9.3 kb) using the CGH technique.

Table 1 Description of the 49 CNVRs detected in the pig genome

Full size table

Among the first 55 animals analyzed, a single CNVR (CNVR35) was called in two animals whereas the remaining 48 CNVRs were called in three or more animals. A segregation analysis was performed in 372 additional animals from the IBMAP cross and the distribution of the CNVRs was additionally studied in 133 unrelated pig samples from different geographical origins (see Methods). All initially detected 49 CNVRs were segregating in the IBMAP cross and 41 were also detected in American pig populations (Additional file 1, Table S1). The number of animals with alternative alleles for the CNVRs ranged from five (CNVR13, CNVR46) to 270 (CNVR15). The predicted status for the CNVRs was 19 (38.7%) for gain, eight (16.3%) for loss and 22 (45%) for regions with gain or loss status in different animals (Table 1). This proportion may be related to natural selection, as it is assumed that the genome is more tolerant to duplications than to deletions [21–24]. The high percentage of CNVRs with gain or loss status may be the result of including in the analysis pig breeds with different genetic origins and from different countries. However, to establish the real status of CNVRs, validation by other techniques such as quantitative PCR (qPCR) will be necessary.

Genes located within CNVRs

The Biomart software in the Ensembl Sscrofa9 Database was used to retrieve genes annotated within the genomic regions of CNVRs. A total of 153 protein-coding genes, four miRNA, six miscRNA, three pseudogenes, two rRNA, two snoRNA and nine snRNA were annotated within the 49 CNVRs (Additional file 2, Table S2). Two or more annotated genes were found in 15 CNVRs, whereas one gene only was located in 14 CNVRs. No annotated genes were identified in 20 CNVRs, but this can be due to the incomplete annotation of the Sscrofa9 genome sequence assembly. In contrast to the high number of genes found in this study, it has been suggested that CNVs are located preferably in gene-poor regions [25, 26], probably because CNVs present in gene-rich regions may be deleterious and therefore removed by purifying selection [24].

Validation by quantitative PCR

Real time quantitative assays were designed for CNVR validation on seven genomic regions simultaneously detected with the three programs (CNVRs 1, 3, 15, 17, 22, 32, and 36; Table 1). Five of these CNVRs (15, 17, 22, 32, and 36) were confirmed by qPCR, nevertheless fewer animals were validated for CNVRs 15, 17, and 32 (Additional file 3, Fig. S1). Thus, the false discovery rate (FDR) for the seven analyzed CNVRs was 29%; it should be noted that the percentage of CNVRs validated in this study (71%) is higher than previously reported in pigs (50%) [11]. This result might be explained by the stringent criteria used in our analysis, which was proposed in order to increase confidence and minimize the false positives. Nevertheless, we were not able to confirm two of the CNVRs..Several factors may account for the discrepancy in CNVR prediction between the in silico analysis of Porcine SNP60 BeadChip data and the qPCR method. First, the incomplete status of the 4× sequence depth Sscrofa9 assembly and the low probe density of the Porcine SNP60 BeadChip makes it difficult to establish the true boundaries of CNVRs and may result in an over estimation of their real size. Therefore, it cannot be ruled out that the primers used to validate the CNVRs by qPCR may have been designed outside the structural polymorphic region. Second, polymorphisms such as SNPs and indels may influence the hybridization of the qPCR primers, changing the relative quantification (RQ) values for some animals. Finally, the true CNVR boundaries may be also polymorphic between the analyzed animals.

For the qPCR validation of CNVR36, a PCR protocol for the Cytochrome P4502 C32 Fragment gene [EMBL: ENSSSCG00000010487] was designed. A total of 37 animals were analyzed: 21 with statistical evidence for CNVR and 16 without the CNVR (control group). One of the animals from the control group was selected as reference. Six false positive animals were observed, indicating a FDR of 29% for CNVR36 (Figure 3).

A qPCR assay with primers located in the SLC16A7 gene [EMBL: ENSSSCG00000000456] was used for CNVR22 validation. A total of 50 animals were analyzed: 21 with statistical evidence for CNVR (12 from the IBMAP cross and nine unrelated individuals belonging to six different breeds of American populations) and 29 without the CNVR (control group). One of the animals from the control group was selected as reference. Nine of the IBMAP cross animals were validated by qPCR (FDR = 25%). Conversely, only three animals from the American populations were validated by qPCR, suggesting a higher FDR (67%) (Figure 4). These differences in FDR may be explained by the higher accuracy of the PennCNV algorithm when family information is available and stress the usefulness of including family information in CNV detection. However, this conclusion should be taken with caution due to the limited number of animals analyzed.

For CNVRs 22 and 36, copy number changes were also identified by qPCR in animals where CNVs were not detected initially in the statistical analysis (three and eight animals, respectively). This represents a false negative rate of 10% (3/29) for CNVR22 and 50% (8/16) for CNVR36. The three false negative animals for CNVR22 were classified as deletions by qPCR protocol. A similar situation, but with a different copy number status, was observed for CNVR36, where the eight false negative animals showed a duplication pattern by qPCR. False negative identification is common in CNV detection, and has been reported previously using the CGH technique in pigs and other mammalian species [5, 11].

Three of the validated CNVRs (17, 22, and 36) showed differential patterns of copy number variants between breeds. For instance, CNVR22 showed a loss (deletion) in Landrace and in animals from other breeds (Figure 4). Assuming that Iberian pigs have two copies of CNVR22 (qPCR RQ = 1), five animals showing an RQ = 0 by qPCR are predicted to be homozygous for a deletion on this genomic region. In CNVR36, a loss was found in Iberian pigs relative to Landrace animals (Figure 3). In agreement with the Mendelian segregation of this CNVR, hybrid animals show intermediate RQ values. The RQ mean values were 0.49 for Iberian, 2.51 for Landrace and 1.2 for hybrid animals.

CNVR36 contains a miRNA gene [EMBL: ENSSSCG00000019484] and the Cytochrome P4502 C32 Fragment gene (Additional file 2, Table S2), which is a member of the Cytochrome P450 gene family (CYTP45O). Proteins coded by this gene family constitute the major catalytic component of the liver mixed-function oxidase system and play a pivotal role in the metabolism of many endogenous and exogenous compounds [27]. Interestingly, CNVs comprising genes of the CYTP45O family have been described in humans and dogs [5, 10, 28], but had not been previously reported in pigs. In humans, copy number variations of CYTP45O genes have been associated with variation in drug metabolism phenotypes [29–31]. Differential expression of genes of the CYTP450 family has been correlated with androsterone levels in pigs from Duroc and Landrace breeds [32]. It has also been demonstrated that the total CYTP450 activity was slightly higher in minipigs compared to conventional pigs [33]. CNVR36 lays close to the peak position of a QTL for androsterone leves described in a cross between Large White and Chinese Meishan [34]. This suggests a possible role of this structural variation in determining androsterone levels; however, more studies will be necessary to validate this hypothesis.

CNVR22, also validated by qPCR, comprises the SLC16A7 gene. This gene belongs to the solute carrier family 16 gene family, which encodes 14 proteins that are largely known as monocarboxylate transporters (MCTs). The human SLC16A7 gene encodes the MCT2 protein [35] and it is expressed in several normal human tissues. In pigs, MCT2 may function as a housekeeping lactate transporter, regulating the acidification of glycolytic muscles [36]. Remarkably, CNVR22 is located in the middle of the confidence interval of a QTL for meat pH described in four pig populations [37].

Duplication events have also been validated by qPCR for SOX14 [EMBL: ENSSSCG00000011656] (CNVR32) and INSC [EMBL: ENSSSCG00000013385] (CNVR15). Copy number changes have not been previously reported in either of them in pigs. SOX14 is a member of the SOX gene family [38] of transcription factors involved in the regulation of embryo development and cell fate determination. SOX14 may have a major role in the regulation of nervous system development and it is a mediator of the neuronal death process [39]. SOX14 is an intronless gene that may has arisen by duplication from an ancestral SOX B gene, which likely was the product of a retrotransposition event [40]. Inscuteable (INSC) was first described in Drosophila and it plays a central role in the molecular machinery for mitotic spindle orientation and regulates cell polarity for asymmetric division [41, 42]. Inscuteable homologs have been found in several species, including vertebrates and insects [43]. In mammals, INSC is functionally conserved and it is required for correct orientation of the mitotic spindle in retina [43] and skin [44] precursor cells.

The qPCR assay for CNVR17 validation was designed over the sequence of one expressed sequence tag [EMBL: EW037329]. From four Cuban creole pigs tested, three animals showed a deletion and one animal a duplication event (Additional file 3, Fig. S1).

Other relevant CNVRs

Although other CNVRs have not been analyzed by qPCR, there is evidence of structural polymorphism in the literature. For instance, CNVR45 contains the KIT gene, a well-characterized and functionally important CNV in pigs. The dominant white coat phenotype in pigs is caused by KIT gene duplication or triplication and a splice mutation in one of the KIT gene copies [45–49]. In addition, studies in other mammals [5, 6, 50–55] have described CNVRs overlapping other gene families including: Olfactory receptor family, Glutamate receptor family, Solute carrier family, Cytochrome P450 family, Cyclic nucleotide phosphodiesterases family and Fucosyltransferase family. Twelve of the CNVRs detected in our study include or overlap porcine orthologues of these genes. Furthermore, 13 of the detected CNVRs include 47 genes previously reported in the Human Database of Genomic Variants http://projects.tcag.ca/variation/?source=hg19[56] (Additional file 4, Table S3).

Conclusions

We have described the first CNVRs in swine based on whole genome SNP genotyping chips. A total of 49 CNVRs were identified in 13 autosomal chromosomes. These CNVRs showed Mendelian inheritance across 427 individuals belonging to several generations of an Iberian x Landrace cross, and were also confirmed in different pig breeds. Five out of seven selected CNVRs were validated by qPCR; among the remaining CNVRs we found well known examples of CNVs conserved across mammalian species. Although these results illustrate the usefulness of Porcine SNP60 BeadChip to detect CNVRs, the number detected here is probably a gross underestimate given the wide interval between SNPs in the Porcine 60 k BeadChip.

Methods

Animal Material

We analyzed a total of 560 animals, including 427 individuals (150 males and 277 females) belonging to several generations of the IBMAP cross. This population was originated by crossing three Iberian (Guadyerbas line) boars with 31 Landrace sows [57, 58] (Additional file 5, Fig. S2). The remaining 133 pig samples were obtained from different geographical origins: 127 from American local breeds and village pigs [59], four black Sicilian pigs, one Hungarian Mangalitza and one Chinese Wild boar (Additional file 6, Table S4). We adhered to our national and institutional guidelines for the ethical use and treatment of animals in experiments.

Genotyping

All 560 animals were genotyped with the Porcine SNP60 BeadChip (Illumina Inc., USA) using the Infinium HD Assay Ultra protocol (Illumina). Raw data had high-genotyping quality (call rate >0.99) and were visualized and analyzed with the GenomeStudio software (Illumina). For subsequent data analysis, a subset of 50.572 SNPs was selected by removing the SNPs located in sex chromosomes and those not mapped in the Sscrofa9 assembly.

Statistical analysis

Following the recommendations of Winchester et al. (2009) to increase the confidence in CNV detection and limit the number of false positives, we used predictions from multiple programs. First, we used the Illumina's proprietary software GenomeStudio to check data quality and the cnvPartition v2.4.4 Analysis Plug-in for CNV detection. The minimum probe count employed was three and the remaining parameters were used according to the default criteria provided (Illumina). Then, we exported the signal intensity data of logRratio and B allele frequency to employ the R package for Genome Alteration Detection Algorithm (GADA) [18], which includes one algorithm based in sparse Bayesian learning to predict CNV changes. The multiple array analysis option was employed and the parameters defined for the Bayesian learning model and the backward elimination (BE) were: 0.8 for sparseness hyperparameter (a_α), 8 for critical value of the BE and 3 as the minimum number of SNPs at each segment.

Next, we used the command line version of PennCNV software that integrates, in a joint-calling algorithm, a Hidden Markov Model (HMM) with family relationships, signal intensities for parent-offspring trios, marker distance and population frequency of allele B [17]. The CNV calling was performed using the default parameters of the HMM model with 0.01 of UF factor. The "-trio" and "-quartet" arguments were employed to make use of our family information.

It is unclear with this kind of data, where the statistical properties of the methods are unknown, which is the optimum strategy to balance false positives and power. Here we chose to follow a pragmatic approach, requiring that the CNV was called by at least two algorithms, detected in at least two animals and contained three or more consecutive SNPs. Hence, these genomic regions should be referred as copy number variable regions (CNVRs). To define the size of each CNVR in the genome, we used the overlapping region between CNV predictions from different programs.

Pipeline analysis for CNVR detection was initially performed in 55 individuals of the IBMAP cross (13 males and 42 females), including all founder Iberian boars (three males), 24 founder Landrace sows, 17 F1, three F2, and eight backcross animals. Subsequently, we tested the segregation of these initially detected CNVRs in the rest of the IBMAP cross animals (372), and described their distribution in 127 unrelated pig samples from American local pigs, four black Sicilian pigs, one Hungarian Mangalitza and one Chinese Wild boar.

Gene annotation within the CNVRs was retrieved from the Ensembl Genes 57 Database using the Biomart [http://www.biomart.org] software.

Quantitative real time PCR

Quantitative real time PCR (qPCR) was used to validate seven genomic regions detected by the three methods and representing different predicted status of copy numbers. We used the 2^-ΔΔCt method for relative quantification (RQ) of CNVs [5, 60, 61]. This comparative method uses a target assay for the DNA segment being interrogated for copy number variation and a reference assay for an internal control segment, which is normally a known single copy gene; moreover a reference sample is included. The method requires the target and reference PCR efficiencies to be nearly to equal. Experiments were performed on the test and control primers to verify comparable efficiency in amplification prior to analysis of copy number.

CNVRs were quantified using the Taqman chemistry in an ABI PRISM^® 7900HT instrument (Applied Biosystems, Inc., Foster City, CA); results were analyzed with the SDS software (Applied Biosystems). Primers and hydrolysis probes (Taqman-MGB labeled with FAM) were designed for the seven CNVR regions with the Primer Express software (Applied Biosystems). A previously described [62] design on the glucagon gene [EMBL:GCG] was used as single copy control region, but a single nucleotide substitution on primer forward was introduced to adapt the primer to the porcine species. Primers and probes are shown in Additional file 7, Table S5.

PCR amplifications were performed in a total volume of 20 μl containing 10 ng of genomic DNA. Taqman PCR Universal Master Mix (Applied Biosystems) was used in all reactions except in GCG amplifications, where TaqMan^® PCR Core Reagents (Applied Biosystems) with 2.5 mM MgCl₂ were utilized. All primers and probes were used at 900 nM and 250 nM respectively, except CytochromeP450 2C32 Fragment forward primer, which was used at 300 nM. Each sample was analyzed in triplicate. The thermal cycle was: 2 min at 50°C, 10 min at 95°C and 40 cycles of 15 sec at 95°C and 1 min at 60°C. One sample without copy number variation for each of the genomic regions analyzed was used as reference.

Data availability

The full data set have been submitted to dbVAR [63] under the accession number nstd44.

Abbreviations

CNV :: copy number variation
CNVR :: CNV region
PCR :: polymerase chain reaction
IBMAP :: Iberian x Landrace intercross
qPCR :: quantitative real time PCR
RQ :: relative quantification value
CYTP450 :: Cytochrome P450 gene family
SLC16A7 :: solute carrier family 16 member 7
MCT2 :: monocarboxylic acid transporter 2
SOX14 :: SRY (sex determining region Y)-box 14
INSC :: inscuteable homolog (Drosophila).

References

Orozco LD, Cokus SJ, Ghazalpour A, Ingram-Drake L, Wang S, van Nas A, Che N, Araujo JA, Pellegrini M, Lusis AJ: Copy number variation influences gene expression and metabolic traits in mice. Hum Mol Genet. 2009, 18: 4118-4129. 10.1093/hmg/ddp360.
CAS PubMed PubMed Central Google Scholar
Perry GH, Ben-Dor A, Tsalenko A, Sampas N, Rodriguez-Revenga L, Tran CW, Scheffer A, Steinfeld I, Tsang P, Yamada NA, Park HS, Kim JI, Seo JS, Yakhini Z, Laderman S, Bruhn L, Lee C: The fine-scale and complex architecture of human copy-number variation. Am J Hum Genet. 2008, 82: 685-695. 10.1016/j.ajhg.2007.12.010.
CAS PubMed PubMed Central Google Scholar
Guryev V, Saar K, Adamovic T, Verheul M, van Heesch SA, Cook S, Pravenec M, Aitman T, Jacob H, Shull JD, Hubner N, Cuppen E: Distribution and functional impact of DNA copy number variation in the rat. Nat Genet. 2008, 40: 538-545. 10.1038/ng.141.
CAS PubMed Google Scholar
She X, Cheng Z, Zollner S, Church DM, Eichler EE: Mouse segmental duplication and copy number variation. Nat Genet. 2008, 40: 909-914. 10.1038/ng.172.
CAS PubMed PubMed Central Google Scholar
Nicholas TJ, Cheng Z, Ventura M, Mealey K, Eichler EE, Akey JM: The genomic architecture of segmental duplications and associated copy number variants in dogs. Genome Res. 2009, 19: 491-499. 10.1101/gr.084715.108.
CAS PubMed PubMed Central Google Scholar
Bae JS, Cheong HS, Kim LH, Namgung S, Park TJ, Chun JY, Kim JY, Pasaje CF, Lee JS, Shin HD: Identification of copy number variations and common deletion polymorphisms in cattle. BMC Genomics. 2010, 11: 232-10.1186/1471-2164-11-232.
PubMed PubMed Central Google Scholar
Wang X, Nahashon S, Feaster T, Bohannon-Stewart A, Adefope N: An initial map of chromosomal segmental copy number variations in the chicken. BMC Genomics. 2010, 11: 351-10.1186/1471-2164-11-351.
PubMed PubMed Central Google Scholar
Li J, Yang T, Wang L, Yan H, Zhang Y, Guo Y, Pan F, Zhang Z, Peng Y, Zhou Q, He L, Zhu X, Deng H, Levy S, Papasian CJ, Drees BM, Hamilton JJ, Recker RR, Cheng J, Deng HW: Whole Genome Distribution and Ethnic Differentiation of Copy Number Variation in Caucasian and Asian Populations. PLoS One. 2009, 4: e7958-10.1371/journal.pone.0007958.
PubMed PubMed Central Google Scholar
Choy KW, Setlur SR, Lee C, Lau TK: The impact of human copy number variation on a new era of genetic testing. BJOG.
Yim SH, Kim TM, Hu HJ, Kim JH, Kim BJ, Lee JY, Han BG, Shin SH, Jung SH, Chung YJ: Copy number variations in East-Asian population and their evolutionary and functional implications. Hum Mol Genet. 19: 1001-1008. 10.1093/hmg/ddp564.
PubMed PubMed Central Google Scholar
Fadista J, Nygaard M, Holm LE, Thomsen B, Bendixen C: A snapshot of CNVs in the pig genome. PLoS One. 2008, 3: e3916-10.1371/journal.pone.0003916.
PubMed PubMed Central Google Scholar
Tang H, Li F, Finlayson HA, Smith S, Lu Z, Langford C, Archibald AL: Structural And Copy Number Variation In The Pig Genome. Book Structural And Copy Number Variation In The Pig Genome. Edited by: City: Plant & Animal Genomes XVIII Conference. 2010, Town & Country Convention Center, January 9-13, 2010
Google Scholar
Tuefferd M, Bondt AD, Wyngaert IVD, Talloen W, Verbeke T, Carvalho B, Clevert DA, Alifano M, Raghavan N, Amaratunga D, Göhlmann H, Broët P, Camilleri-Broët S: Genome-wide copy number alterations detection in fresh frozen and matched FFPE samples using SNP 6.0 arrays. Genes, Chromosomes and Cancer. 2008, 47: 957-964. 10.1002/gcc.20599.
CAS PubMed Google Scholar
Komura D, Shen F, Ishikawa S, Fitch KR, Chen W, Zhang J, Liu G, Ihara S, Nakamura H, Hurles ME, Lee C, Scherer SW, Jones KW, Shapero MH, Huang J, Aburatani H: Genome-wide detection of human copy number variations using high-density DNA oligonucleotide arrays. Genome Res. 2006, 16: 1575-1584. 10.1101/gr.5629106.
CAS PubMed PubMed Central Google Scholar
Peiffer DA, Le JM, Steemers FJ, Chang W, Jenniges T, Garcia F, Haden K, Li J, Shaw CA, Belmont J, Cheung SW, Shen RM, Barker DL, Gunderson KL: High-resolution genomic profiling of chromosomal aberrations using Infinium whole-genome genotyping. Genome Research. 2006, 16: 1136-1148. 10.1101/gr.5402306.
CAS PubMed PubMed Central Google Scholar
Ramos AM, Crooijmans RPMA, Affara NA, Amaral AJ, Archibald AL, Beever JE, Bendixen C, Churcher C, Clark R, Dehais P, Hansen MS, Hedegaard J, Hu ZL, Kerstens HH, Law AS, Megens HJ, Milan D, Nonneman DJ, Rohrer GA, Rothschild MF, Smith TPL, Schnabel RD, Van Tassell CP, Taylor JF, Wiedmann RT, Schook LB, Groenen MAM: Design of a High Density SNP Genotyping Assay in the Pig Using SNPs Identified and Characterized by Next Generation Sequencing Technology. PLoS One. 2009, 4: e6524-10.1371/journal.pone.0006524.
PubMed PubMed Central Google Scholar
Wang K, Li M, Hadley D, Liu R, Glessner J, Grant SF, Hakonarson H, Bucan M: PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. Genome Res. 2007, 17: 1665-1674. 10.1101/gr.6861907.
CAS PubMed PubMed Central Google Scholar
Pique-Regi R, Monso-Varona J, Ortega A, Seeger RC, Triche TJ, Asgharzadeh S: Sparse representation and Bayesian detection of genome copy number alterations from microarray data. Bioinformatics. 2008, 24: 309-318. 10.1093/bioinformatics/btm601.
CAS PubMed PubMed Central Google Scholar
Winchester L, Yau C, Ragoussis J: Comparing CNV detection methods for SNP arrays. Briefings in Functional Genomics and Proteomics. 2009, 8: 353-366. 10.1093/bfgp/elp017.
CAS PubMed Google Scholar
Dellinger AE, Saw SM, Goh LK, Seielstad M, Young TL, Li YJ: Comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays. Nucl Acids Res. 2010, gkq040-
Google Scholar
Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, Andrews TD, Fiegler H, Shapero MH, Carson AR, Chen W, Cho EK, Dallaire S, Freeman JL, Gonzalez JR, Gratacos M, Huang J, Kalaitzopoulos D, Komura D, MacDonald JR, Marshall CR, Mei R, Montgomery L, Nishimura K, Okamura K, Shen F, Somerville MJ, Tchinda J, Valsesia A, Woodwark C, Yang F: Global variation in copy number in the human genome. Nature. 2006, 444: 444-454. 10.1038/nature05329.
CAS PubMed PubMed Central Google Scholar
Locke DP, Sharp AJ, McCarroll SA, McGrath SD, Newman TL, Cheng Z, Schwartz S, Albertson DG, Pinkel D, Altshuler DM, Eichler EE: Linkage disequilibrium and heritability of copy-number polymorphisms within duplicated regions of the human genome. Am J Hum Genet. 2006, 79: 275-290. 10.1086/505653.
CAS PubMed PubMed Central Google Scholar
Brewer C, Holloway S, Zawalnyski P, Schinzel A, FitzPatrick D: A chromosomal duplication map of malformations: regions of suspected haplo- and triplolethality--and tolerance of segmental aneuploidy--in humans. Am J Hum Genet. 1999, 64: 1702-1708. 10.1086/302410.
CAS PubMed PubMed Central Google Scholar
Conrad DF, Hurles ME: The population genetics of structural variation. Nat Genet. 2007
Google Scholar
Conrad DF, Andrews TD, Carter NP, Hurles ME, Pritchard JK: A high-resolution survey of deletion polymorphism in the human genome. Nat Genet. 2006, 38: 75-81. 10.1038/ng1697.
CAS PubMed Google Scholar
Freeman JL, Perry GH, Feuk L, Redon R, McCarroll SA, Altshuler DM, Aburatani H, Jones KW, Tyler-Smith C, Hurles ME, Carter NP, Scherer SW, Lee C: Copy number variation: New insights in genome diversity. Genome Research. 2006, 16: 949-961. 10.1101/gr.3677206.
CAS PubMed Google Scholar
Anzenbacher P, Anzenbacherova E: Cytochromes P450 and metabolism of xenobiotics. Cell Mol Life Sci. 2001, 58: 737-747. 10.1007/PL00000897.
CAS PubMed Google Scholar
Stankiewicz P, Lupski JR: Genome architecture, rearrangements and genomic disorders. Trends in Genetics. 2002, 18: 74-82. 10.1016/S0168-9525(02)02592-1.
CAS PubMed Google Scholar
Daly AK: Pharmacogenetics of the cytochromes P450. Curr Top Med Chem. 2004, 4: 1733-1744. 10.2174/1568026043387070.
CAS PubMed Google Scholar
Ledesma MC, Agundez JA: Identification of subtypes of CYP2D gene rearrangements among carriers of CYP2D6 gene deletion and duplication. Clin Chem. 2005, 51: 939-943. 10.1373/clinchem.2004.046326.
CAS PubMed Google Scholar
Ouahchi K, Lindeman N, Lee C: Copy number variants and pharmacogenomics. Pharmacogenomics. 2006, 7: 25-29. 10.2217/14622416.7.1.25.
CAS PubMed Google Scholar
Grindflek E, Berget I, Moe M, Oeth P, Lien S: Transcript profiling of candidate genes in testis of pigs exhibiting large differences in androstenone levels. BMC Genet. 11: 4-10.1186/1471-2156-11-4.
Skaanild MT, Friis C: Cytochrome P450 sex differences in minipigs and conventional pigs. Pharmacol Toxicol. 1999, 85: 174-180. 10.1111/j.1600-0773.1999.tb00088.x.
CAS PubMed Google Scholar
Lee GJ, Archibald AL, Law AS, Lloyd S, Wood J, Haley CS: Detection of quantitative trait loci for androstenone, skatole and boar taint in a cross between Large White and Meishan pigs. Anim Genet. 2005, 36: 14-22. 10.1111/j.1365-2052.2004.01214.x.
CAS PubMed Google Scholar
Lin RY, Vera JC, Chaganti RS, Golde DW: Human monocarboxylate transporter 2 (MCT2) is a high affinity pyruvate transporter. J Biol Chem. 1998, 273: 28959-28965. 10.1074/jbc.273.44.28959.
CAS PubMed Google Scholar
Sepponen K, Koho N, Puolanne E, Ruusunen M, Poso AR: Distribution of monocarboxylate transporter isoforms MCT1, MCT2 and MCT4 in porcine muscles. Acta Physiol Scand. 2003, 177: 79-86. 10.1046/j.1365-201X.2003.01051.x.
CAS PubMed Google Scholar
Srikanchai T, Murani E, Wimmers K, Ponsuksili S: Four loci differentially expressed in muscle tissue depending on water-holding capacity are associated with meat quality in commercial pig herds. Mol Biol Rep. 37: 595-601. 10.1007/s11033-009-9856-0.
Google Scholar
Arsic N, Rajic T, Stanojcic S, Goodfellow PN, Stevanovic M: Characterisation and mapping of the human SOX14 gene. Cytogenet Cell Genet. 1998, 83: 139-146. 10.1159/000015149.
CAS PubMed Google Scholar
Osterloh JM, Freeman MR: Neuronal death or dismemberment mediated by Sox14. Nat Neurosci. 2009, 12: 1479-1480. 10.1038/nn1209-1479.
CAS PubMed PubMed Central Google Scholar
Kirby PJ, Waters PD, Delbridge M, Svartman M, Stewart AN, Nagai K, Graves JA: Cloning and mapping of platypus SOX2 and SOX14: insights into SOX group B evolution. Cytogenet Genome Res. 2002, 98: 96-100. 10.1159/000068539.
CAS PubMed Google Scholar
Kraut R, Chia W, Jan LY, Jan YN, Knoblich JA: Role of inscuteable in orienting asymmetric cell divisions in Drosophila. Nature. 1996, 383: 50-55. 10.1038/383050a0.
CAS PubMed Google Scholar
Chia W, Yang X: Asymmetric division of Drosophila neural progenitors. Curr Opin Genet Dev. 2002, 12: 459-464. 10.1016/S0959-437X(02)00326-X.
CAS PubMed Google Scholar
Zigman M, Cayouette M, Charalambous C, Schleiffer A, Hoeller O, Dunican D, McCudden CR, Firnberg N, Barres BA, Siderovski DP, Knoblich JA: Mammalian inscuteable regulates spindle orientation and cell fate in the developing retina. Neuron. 2005, 48: 539-545. 10.1016/j.neuron.2005.09.030.
CAS PubMed Google Scholar
Lechler T, Fuchs E: Asymmetric cell divisions promote stratification and differentiation of mammalian skin. Nature. 2005, 437: 275-280. 10.1038/nature03922.
CAS PubMed PubMed Central Google Scholar
Moller M, Chaudhary R, Hellmén E, Höyheim B, Chowdhary B, Andersson L: Pigs with the dominant white coat color phenotype carry a duplication of the KIT gene encoding the mast/stem cell growth factor receptor. Mammalian Genome. 1996, 7: 822-830. 10.1007/s003359900244.
CAS Google Scholar
Marklund S, Kijas J, Rodriguez-Martinez H, RÃ¶nnstrand L, Funa K, Moller M, Lange D, Edfors-Lilja I, Andersson L: Molecular Basis for the Dominant White Phenotype in the Domestic Pig. Genome Research. 1998, 8: 826-833.
CAS PubMed PubMed Central Google Scholar
Johansson Moller M, Chaudhary R, Hellmen E, Hoyheim B, Chowdhary B, Andersson L: Pigs with the dominant white coat color phenotype carry a duplication of the KIT gene encoding the mast/stem cell growth factor receptor. Mamm Genome. 1996, 7: 822-830. 10.1007/s003359900244.
CAS PubMed Google Scholar
Seo BY, Park EW, Ahn SJ, Lee SH, Kim JH, Im HT, Lee JH, Cho IC, Kong IK, Jeon JT: An accurate method for quantifying and analyzing copy number variation in porcine KIT by an oligonucleotide ligation assay. BMC Genetics. 2007, 8: 81-10.1186/1471-2156-8-81.
PubMed PubMed Central Google Scholar
Larson G, Dobney K, Albarella U, Fang M, Matisoo-Smith E, Robins J, Lowden S, Finlayson H, Brand T, Willerslev E, Rowley-Conwy P, Andersson L, Cooper A: Worldwide Phylogeography of Wild Boar Reveals Multiple Centers of Pig Domestication. Science. 2005, 307: 1618-1621. 10.1126/science.1106927.
CAS PubMed Google Scholar
Niimura Y, Nei M: Extensive gains and losses of olfactory receptor genes in mammalian evolution. PLoS One. 2007, 2: e708-10.1371/journal.pone.0000708.
PubMed PubMed Central Google Scholar
Young JM, Friedman C, Williams EM, Ross JA, Tonnes-Priddy L, Trask BJ: Different evolutionary processes shaped the mouse and human olfactory receptor gene families. Hum Mol Genet. 2002, 11: 535-546. 10.1093/hmg/11.5.535.
CAS PubMed Google Scholar
Quignon P, Giraud M, Rimbault M, Lavigne P, Tacher S, Morin E, Retout E, Valin AS, Lindblad-Toh K, Nicolas J, Galibert F: The dog and rat olfactory receptor repertoires. Genome Biol. 2005, 6: R83-10.1186/gb-2005-6-10-r83.
PubMed PubMed Central Google Scholar
Young JM, Endicott RM, Parghi SS, Walker M, Kidd JM, Trask BJ: Extensive copy-number variation of the human olfactory receptor gene family. Am J Hum Genet. 2008, 83: 228-242. 10.1016/j.ajhg.2008.07.005.
CAS PubMed PubMed Central Google Scholar
Poot M, Eleveld MJ, Van 't Slot R, Ploos van Amstel HK, Hochstenbach R: Recurrent copy number changes in mentally retarded children harbour genes involved in cellular localization and the glutamate receptor complex. Eur J Hum Genet. 2009, 18: 39-46. 10.1038/ejhg.2009.120.
PubMed Central Google Scholar
Nozawa M, Kawahara Y, Nei M: Genomic drift and copy number variation of sensory receptor genes in humans. Proc Natl Acad Sci USA. 2007, 104: 20421-20426. 10.1073/pnas.0709956104.
CAS PubMed PubMed Central Google Scholar
Iafrate AJ, Feuk L, Rivera MN, Listewnik ML, Donahoe PK, Qi Y, Scherer SW, Lee C: Detection of large-scale variation in the human genome. Nat Genet. 2004, 36: 949-951. 10.1038/ng1416.
CAS PubMed Google Scholar
Clop A, Ovilo C, Perez-Enciso M, Cercos A, Tomas A, Fernandez A, Coll A, Folch JM, Barragan C, Diaz I, Oliver MA, Varona L, Silio L, Sanchez A, Noguera JL: Detection of QTL affecting fatty acid composition in the pig. Mamm Genome. 2003, 14: 650-656. 10.1007/s00335-002-2210-7.
PubMed Google Scholar
Perez-Enciso M, Clop A, Noguera JL, Ovilo C, Coll A, Folch JM, Babot D, Estany J, Oliver MA, Diaz I, Sanchez A: A QTL on pig chromosome 4 affects fatty acid metabolism: evidence from an Iberian by Landrace intercross. J Anim Sci. 2000, 78: 2525-2531.
CAS PubMed Google Scholar
Souza CA, Ramayo Y, Megens HJ, Rodríguez MC, Loarca A, Caal E, Soto H, Melo M, Revidatti MA, de la Rosa SA, Shemereteva IN, Okumura N, Cho IC, Delgado JV, Paiva SR, Crooijmans RPMA, Schook LB, Groenen MAM, Ramos-Onsins SE, Pérez-Enciso M: Porcine Colonization Of The Americas: A 60 k SNP Story. World Congress on Genetics Applied to Livestock Production . 2010, Leipzig, Germany. August 1-6
Google Scholar
Livak KJ, Schmittgen TD: Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods. 2001, 25: 402-408. 10.1006/meth.2001.1262.
CAS PubMed Google Scholar
Graubert TA, Cahan P, Edwin D, Selzer RR, Richmond TA, Eis PS, Shannon WD, Li X, McLeod HL, Cheverud JM, Ley TJ: A High-Resolution Map of Segmental DNA Copy Number Variation in the Mouse Genome. PLoS Genet. 2007, 3: e3-10.1371/journal.pgen.0030003.
PubMed PubMed Central Google Scholar
Ballester M, Castello A, Ibanez E, Sanchez A, Folch JM: Real-time quantitative PCR-based system for determining transgene copy number in transgenic animals. Biotechniques. 2004, 37: 610-613.
CAS PubMed Google Scholar
NCBI: Database of Genomic Structural Variation. Book Database of Genomic Structural Variation. City, [http://www.ncbi.nlm.nih.gov/dbvar]

Download references

Acknowledgements

This work was funded by MICINN project AGL2008-04818-C03/GAN (Spanish Ministry of Science) and by Innovation Consolider-Ingenio 2010 Programme (CSD2007-00036 "Centre for Research in Agrigenomics"). Y. Ramayo-Caldas was funded by a FPU PhD grant from Spanish Ministerio de Educación (AP2008-01450). C.A. Souza was funded by a PhD grant from CAPES, Brazil. We thank S. Scherrer, R. Pique-Regi, Yang Bin and A. Esteve for the help with data analysis. We also thank Martien Groenen (Wageningen, NL) for providing information about SNP positioning in Assembly 9.

Author information

Authors and Affiliations

Departament de Ciència Animal i dels Aliments, Facultat de Veterinària, Universitat Autònoma de Barcelona, 08193, Bellaterra, Spain
Yuliaxis Ramayo-Caldas, Anna Castelló, Anna Mercadé, Carla A Souza, Miguel Perez-Enciso & Josep M Folch
Genètica i Millora Animal, IRTA Lleida, 25198, Lleida, Spain
Romi N Pena
Departamento de Mejora Animal SGIT-INIA, 28040, Madrid, Spain
Estefania Alves & Ana I Fernández
Institut Català de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
Miguel Perez-Enciso
Department of Animal Production, University of Lleida, 25198, Lleida, Spain
Romi N Pena
Universidade Católica de Brasília, Brazil
Carla A Souza

Authors

Yuliaxis Ramayo-Caldas
View author publications
You can also search for this author in PubMed Google Scholar
Anna Castelló
View author publications
You can also search for this author in PubMed Google Scholar
Romi N Pena
View author publications
You can also search for this author in PubMed Google Scholar
Estefania Alves
View author publications
You can also search for this author in PubMed Google Scholar
Anna Mercadé
View author publications
You can also search for this author in PubMed Google Scholar
Carla A Souza
View author publications
You can also search for this author in PubMed Google Scholar
Ana I Fernández
View author publications
You can also search for this author in PubMed Google Scholar
Miguel Perez-Enciso
View author publications
You can also search for this author in PubMed Google Scholar
Josep M Folch
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuliaxis Ramayo-Caldas.

Additional information

Authors' contributions

YRC, AIF, MPE and JMF conceived and designed the experiment. JMF was the principal investigator of the project. YRC performed the data analysis and drafted the manuscript. EA, RNP, AIF, CAS, YRC, MPE and JMF collected samples. RNP and CAS performed DNA isolation. AM and AC performed the SNP genotyping and AC did the qPCR and RT-PCR assays. All authors read and approved the final manuscript.

Electronic supplementary material

Additional file 1: Table S1. Results of the distribution analysis in American pig populations. (XLS 23 KB)

12864_2010_10005_MOESM2_ESM.XLS

Additional file 2: Table S2. Gene annotation within the CNVRs retrieved from the Ensembl Genes 57 Database using the Biomart software. (XLS 508 KB)

12864_2010_10005_MOESM3_ESM.TIFF

Additional file 3: Fig. S1. Results of quantitative PCR (qPCR) for CNVRs 15 (top), 17 (middle), and 32 (bottom). A total of 17 animals are showed in each plot. Breed abbreviations are: Ib: Iberian; Ld: Landrace; Hib: animals belonging to several generations of the IBMAP cross (F1, F2, and BC); CC: Cuban creole pig; Gu: Guatemala local breed; Yu: Yucatan miniature pig; Pe: Peruvian creole pig. (TIFF 736 KB)

12864_2010_10005_MOESM4_ESM.DOC

Additional file 4: Table S3. List of pig genes previously reported in the Human Database of Genomic Variants. (DOC 57 KB)

12864_2010_10005_MOESM5_ESM.TIFF

Additional file 5: Fig. S2. Structure of the IBMAP cross. Abbreviations are: Ib: Iberian; Ld: Landrace; F1: first generation; F2: second generation; F3: third generation; BC: first backcross; BC1_LD: second backcross. (TIFF 57 KB)

Additional file 6: Table S4. Description of samples from American local breeds. (DOC 32 KB)

Additional file 7: Table S5. Primers and probes used in quantitative PCR validation (DOC 36 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Ramayo-Caldas, Y., Castelló, A., Pena, R.N. et al. Copy number variation in the porcine genome inferred from a 60 k SNP BeadChip. BMC Genomics 11, 593 (2010). https://doi.org/10.1186/1471-2164-11-593

Download citation

Received: 15 June 2010
Accepted: 22 October 2010
Published: 22 October 2010
DOI: https://doi.org/10.1186/1471-2164-11-593

Copy number variation in the porcine genome inferred from a 60 k SNP BeadChip

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Background

Results and Discussion

Detection of structural variants

Genes located within CNVRs

Validation by quantitative PCR

Other relevant CNVRs

Conclusions

Methods

Animal Material

Genotyping

Statistical analysis

Quantitative real time PCR

Data availability

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Authors' contributions

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation