Identification of polymorphisms in the bovine collagenous lectins and their association with infectious diseases in cattle

Fraser, R. S.; Lumsden, J. S.; Lillie, B. N.

doi:10.1007/s00251-018-1061-7

Identification of polymorphisms in the bovine collagenous lectins and their association with infectious diseases in cattle

Original Article
Open access
Published: 10 May 2018

Volume 70, pages 533–546, (2018)
Cite this article

Download PDF

You have full access to this open access article

Immunogenetics Aims and scope Submit manuscript

Identification of polymorphisms in the bovine collagenous lectins and their association with infectious diseases in cattle

Download PDF

R. S. Fraser¹,
J. S. Lumsden^1,2 &
B. N. Lillie¹

1818 Accesses
11 Citations
Explore all metrics

Abstract

Infectious diseases are a significant issue in animal production systems, including both the dairy and beef cattle industries. Understanding and defining the genetics of infectious disease susceptibility in cattle is an important step in the mitigation of their impact. Collagenous lectins are soluble pattern recognition receptors that form an important part of the innate immune system, which serves as the first line of host defense against pathogens. Polymorphisms in the collagenous lectin genes have been shown in previous studies to contribute to infectious disease susceptibility, and in cattle, mutations in two collagenous lectin genes (MBL1 and MBL2) are associated with mastitis. To further characterize the contribution of variation in the bovine collagenous lectins to infectious disease susceptibility, we used a pooled NGS approach to identify short nucleotide variants (SNVs) in the collagenous lectins (and regulatory DNA) of cattle with (n = 80) and without (n = 40) infectious disease. Allele frequency analysis identified 74 variants that were significantly (p < 5 × 10⁻⁶) associated with infectious disease, the majority of which were clustered in a 29-kb segment upstream of the collectin locus on chromosome 28. In silico analysis of the functional effects of all the variants predicted 11 SNVs with a deleterious effect on protein structure and/or function, 148 SNVs that occurred within potential transcription factor binding sites, and 31 SNVs occurring within potential miRNA binding elements. This study provides a detailed look at the genetic variation of the bovine collagenous lectins and identifies potential genetic markers for infectious disease susceptibility.

Genome-wide association study identifies loci associated with milk leukocyte phenotypes following experimental challenge with Streptococcus uberis

Article 03 June 2018

Allele substitution and dominance effects of CD166/ALCAM gene polymorphisms for endoparasite resistance and test-day traits in a small cattle population using logistic regression analyses

Article 24 October 2019

The genomic architecture of mastitis resistance in dairy sheep

Article Open access 16 August 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Infectious diseases are a major source of morbidity, mortality, and economic loss to the cattle industry. Infectious respiratory diseases alone account for close to $0.5–1 billion USD annually in North America (Miles 2009), and while estimates for other common infectious diseases, such as mastitis and gastrointestinal disease, are difficult to obtain, they undoubtedly add significantly to the economic impact of infectious disease (Schepers and Dijkhuizen 1991; Halasa et al. 2007; Heikkilä et al. 2012). Infectious diseases also represent a large source of agricultural antimicrobial use, which contributes to the development of antimicrobial resistance (Prescott et al. 2012). The approach to managing the impact of infectious disease has traditionally been to control the pathogen, largely ignoring the potential contributions of an immunologically deficient host (Miles 2009). Recently, however, there has been a broadening in focus to include host factors that contribute to infectious disease susceptibility.

The innate immune system represents the first line of defense against infectious diseases. Pattern recognition receptors (PRRs), a key part of the innate immune system, recognize conserved motifs on pathogens called pattern associated molecular patterns (Janeway 1989). The collagenous lectins are a subset of membrane-bound and/or soluble, circulating C-type lectins that function as PRRs, recognizing carbohydrate residues on the surfaces of bacteria, viruses, and fungi. The collagenous lectin family includes the collectins and ficolins, which share structural and functional similarities. Eleven collagenous lectin genes have been identified in cattle, including the genes encoding mannose-binding lectins A and C (MBL1 and MBL2); surfactant proteins A and D (SFTPA1 and SFTPD); collectins (CL) 10, 11, 12, 43, and 46; conglutinin (CGN1); and ficolin-1 (FCN1). CL43, CL46, and CGN1 are found in cattle and a few select herbivores, and structural similarities between these collagenous lectins and SFTPD suggest that they are evolutionarily related (Hansen and Holmskov 2002; Gjerstorff et al. 2004a).

Certain collagenous lectins can activate the lectin pathway of complement and can agglutinate or opsonize pathogens (Fujita 2002, 2004). The lectin pathway of complement is activated in part by four MBL-associated serine proteases (MASPs), encoded by two MASP genes, MASP1 and MASP2. The MASP proteins are structurally and functionally similar to C1r and C1s of the classical complement pathway (Thiel et al. 1997; Matsushita et al. 2013) and bind in proenzyme form to the collagen-like domain of the MBLs, FCN-1, and CL-11 (Matsushita and Fujita 1992; Matsushita et al. 2001; Hansen et al. 2010). Following ligand recognition, the MASP proteins lead to the cleavage of complement components C2 and C4, resulting in the activation of complement.

Short nucleotide variants (SNVs) in the collagenous lectin genes are associated with infectious disease susceptibility in a variety of species. A dominant negative missense mutation in human MBL2 leads to an opsonic defect and is a cause of recurring infections in children (Sumiya et al. 1991) and adults (Summerfield et al. 1995). Deficiencies of MBL-C resulting from MBL2 polymorphisms in humans are also associated with HIV, hepatitis B and C, meningococcal disease, and parasitic infections (Eisen and Minchinton 2003). Polymorphisms in the human ficolin genes are associated with leprosy (Boldt et al. 2013; Andrade et al. 2017), pneumonia (van Kempen et al. 2017) and Chagas disease (Luz et al. 2016), while variation in SFTPA2 is associated with different outcomes to influenza A virus infection (Herrera-Ramos et al. 2014). In animals, mutations in the promoter region of porcine MBL2 are associated with decreased expression of MBL-C and are more frequent in animals diagnosed with pneumonia, enteritis, serositis, or septicemia (Lillie et al. 2007). A missense mutation in porcine MBL1 is associated with decreased serum concentrations of MBL-A (Juul-Madsen et al. 2011). Relatively little is known about genetic variation in bovine collagenous lectins, and the few investigations that have been done have focused solely on mastitis and the MBL genes. Both a missense mutation in exon 2 and a promoter mutation of MBL1 are associated with altered activity of the classical complement pathway as well as with somatic cell score (SCS), a measure of the inflammatory cell content of milk and an indicator of mastitis (Wang et al. 2011; Liu et al. 2011; Yuan et al. 2012). Mutations in the coding region of MBL2 are also associated with SCS and complement activity (Zhao et al. 2012; Wang et al. 2012).

In order to address this gap in knowledge, we designed a targeted, next-generation sequencing study that captured the bovine collagenous lectin and related MASP genes as well as surrounding regulatory DNA. We sequenced our target regions in 120 cattle, 80 of which were diagnosed with infectious disease, and 40 of which lacked any evidence of infectious disease. We provide a comprehensive look at the variation in the collagenous lectin genes of these cattle, including in silico predictions of functional effects of identified variants. We also performed association analysis and identified 74 variants significantly associated with infectious disease in cattle.

Materials and methods

Sample selection and library preparation

Samples of liver or lung were collected from cattle presenting to the postmortem service at the Ontario Veterinary College or the Animal Health Laboratory at the University of Guelph. Cattle underwent a complete autopsy under the supervision of a veterinary pathologist certified by the American College of Veterinary Pathologists. Ancillary testing (e.g., bacterial culture, viral PCR, etc.) was performed as necessary to confirm the presence of pathogens. Cattle were broadly divided into two major populations: those with and without evidence of infectious disease (referred to as infectious and noninfectious in this article). They were then subdivided into pools of five animals each based on the similarity of their diagnosis at autopsy (Table 1). The study population consisted predominantly of female Holstein-Friesians (66.7%), with fewer intact and castrated male Holstein-Friesians (8.3%). The remainder (25.0%) was composed of a variety of other breeds and crosses (Table 2).

Table 1 Cattle were placed into groups of five animals each based on the similarity of the diagnosis determined at autopsy

Full size table

Table 2 Breakdown of the study population by breed and gender

Full size table

Tissue samples were stored at − 20 °C until processed. DNA was extracted using a commercial column based DNA extraction kit (QIAGEN DNeasy Blood and Tissue kit, Mississauga, ON, Canada), and sample concentration was evaluated via fluorometry (Qubit 2.0, Thermo Fisher Scientific, Mississauga, ON, Canada). Equimolar amounts of DNA from cattle in each group were pooled to obtain a final concentration of 1 μg of DNA in 50 μl of low-EDTA buffer TE. Each pool of DNA was acoustically sheared to a target range of 600 bp (Covaris M220, Woburn, MA, USA).

Following acoustic shearing, each pool of DNA underwent end repair, A-tailing, and adapter ligation (including a unique index) using a KAPA Library Preparation Kit for Illumina Platforms (KAPA Biosystems, Wilmington, MA, USA) as per the manufacturer’s instructions, with the following exception: a single cleanup step was performed following adapter ligation. The cleanup was performed by adding 0.6X PEG/NaCl SPRI solution and only the DNA bound to the magnets was retained. The pools were then combined in equimolar amounts to create a single sequencing library.

Target enrichment was performed using a SeqCap EZ Developer Enrichment Kit as per the SeqCap EZ Library SR User’s Guide v.4.2. Target regions consisted of the collagenous lectins and related MASP genes and were based on coordinates obtained from the UMD3.1.1 (bosTau8) genome (The Bovine Genome Sequencing and Analysis Consortium et al. 2009) hosted by the University of Santa Cruz, CA (Karolchik et al. 2004) (Table 3). Up to 50 kb upstream and 3 kb downstream of each gene was targeted for sequencing, in an attempt to capture a portion of regulatory DNA. Due to some inconsistencies in the annotation of the bovine collagenous lectins, annotations from NCBI and Ensembl were compared and reviewed, and the most appropriate annotation for each gene was chosen. For example, the MBL1 gene is not annotated in Ensembl or UCSC, while the SFTPA1 gene, located nearby on the same chromosome, has four annotated transcript variants in Ensembl. The NCBI RefSeq accession for MBL1, NM_001010994.3, is identical to the Ensembl SFTPA1 transcript ENSBTAT00000001165, leading to some uncertainty surrounding the true identity of these transcripts. Alignment of the four bovine SFTPA1 transcripts to the coding and protein sequences SFTPA1 and MBL1 from other species using Clustal Omega (Goujon et al. 2010; Sievers et al. 2011) showed that ENSBTAT00000031298 and ENSBTAT00000001165 had the highest percent identity matrices to SFTPA1 and MBL1, respectively. Thus, in contradiction to the annotation found in Ensembl, ENSBTAT00000001165 was considered to represent the bovine MBL1 gene, while only the ENSBTAT00000031298 transcript was considered to represent the SFTPA1 gene. Similarly, we believe the gene annotated as FCNB in Ensembl and FCN1 in NCBI more closely resembles FCN1, and is referred to as such in this study. Coordinates for COLEC10 and MASP2 were adjusted slightly based on sequence homology to other species in order to ensure they contained start codons. CL43 was adjusted to correspond to the findings of Hansen et al. (2003). A complete list of the coordinates for gene annotation in this study is provided in Online Resource 1.

Table 3 The regions targeted for resequencing are given for each gene included in the study. Genes in close proximity to each other were sequenced as a single unit

Full size table

Following enrichment, the library was sequenced using MiSeq Reagent Kit v3 (600-cycle) sequencing chemistry on an Illumina MiSeq (San Diego, CA, USA). In order to achieve adequate depth of sequencing, the same library was sequenced twice. All statistical analyses were performed in R unless otherwise specified (R Core Team 2017).

Bioinformatic analysis of NGS data

The sequencing data was processed in two stages (Fig. 1). In the first stage, data from each run was processed separately, while in the second stage, data from the same pool from different runs was merged and then further processed. The data from both sequencing runs was first trimmed using Trimmomatic v. 0.36 (Bolger et al. 2014) based on the following criteria: (a) leading and trailing bases with a quality score of less than 20 were trimmed, (b) reads were trimmed if quality dropped below an average score of 20 over a 5 bp sliding window, and (c) reads were dropped if they were less than 75 bp in length. Reads were then mapped to the bovine genome UMD3.1.1 using BWA-MEM algorithm of BWA v. 0.75 (Li and Durbin 2009). PCR and optical duplicates were removed with Picard v. 1.127 (http://broadinstitute.github.io/picard/, accessed 2018-01-12). The Genome Analysis Toolkit (GATK) Best Practices Guidelines (DePristo et al. 2011; Van der Auwera et al. 2013) were followed for in/del realignment and base quality score recalibration (BQSR) using GATK v. 3.6 (McKenna et al. 2010). At this point, BAM files for each pool generated during the different runs were merged. Each merged BAM file was reprocessed for PCR duplicates and in/del realignment. Variant calling was performed on merged BAM files using the joint genotyping protocol outlined in the GATK Best Practices guidelines. Variants were filtered using separate hard filters for SNVs and in/dels. Multiallelic variants and spanning deletions were excluded, and known variants were obtained from dbSNP v. 150 (Sherry et al. 2001). Evaluation of target capture between the noninfectious and infectious populations was performed by comparing the overall mean of the mean of each pool within each population using a two-way ANOVA and a least-square means post-hoc test.

Variant analysis

In silico analysis of the variants was performed for coding region variants, downstream 3 kb, and the upstream 5 kb of potential regulatory DNA. Variant density was evaluated both in terms of target regions and functional genomic regions using a one-way ANOVA and Kruskal-Wallis post hoc test. Correlations between variant density and GC content as well as indel number were assessed using a Pearson’s correlation coefficient. Missense coding variants were analyzed using Polyphen2 (Adzhubei et al. 2010) and the SIFT algorithm (Sim et al. 2012) run by the Variant Effect Predictor (VEP) hosted by Ensembl (McLaren et al. 2010). For Polyphen2, batch submission was used and a FASTA file containing the protein sequences was submitted. For VEP, options were left at their default settings.

The 3 kb downstream from the stop codon of the targeted genes was analyzed for potential miRNA recognition elements (MREs). Although the 3′ UTR was annotated within Ensembl for all genes with the exception of COLEC10 and SFTPD, we opted to analyze the entire 3 kb for each gene to maintain consistency. Multiple MRE discovery algorithms were used to maximize the accuracy of the predictions (Riffo-Campos et al. 2016). miRanda v. 3.3a (Enright et al. 2003) and Targetscan v. 7.0 (Agarwal et al. 2015) were used to identify MREs binding mature miR sequences from cattle accessed from miRbase 21 (Kozomara and Griffiths-Jones 2013). The energy threshold parameter of miRanda was set to − 20 kcal/mol, with other parameters for both programs left at their default settings. The intersection of the seed region of predicted MREs from both programs was obtained using BEDtools v. 2.25.0 (Quinlan and Hall 2010). BEDtools was then used to identify variants in our dataset that intersected with the seed sequence of MREs predicted by both algorithms.

Transcription factor (TF) binding site (TFBS) analysis was performed on the 5 kb upstream to the start codon for each target gene using CIS-BP v. 1.02 (Weirauch et al. 2014). The species was set as Bos taurus, and a motif model of “PWM - LogOdds” with a minimum threshold of 8 was selected; only cis acting TFBSs were considered. Variants falling within predicted TFBSs were identified using BEDtools. Many of the TFBSs shared identical sequences and bound TFs belonging to the same family, and were thus collapsed into a single result with results reported by TF family. Putative TFBSs were further refined by identifying conserved 50-bp-long DNA sequence motifs within the 5 kb upstream from the start codon for each gene across eight different species, cattle (UMD3.1.1), pig (Sscrofa10.2), horse (EquCab2), rat (Rnor_6.0), mouse (GRCm38), gorilla (gorGor3.1), chicken (Galgal4), and human (GRCh38), using MEME v 4.10.1 (Bailey and Elkan 1994). The bovine specific genes CGN, CL43, and CL46 were aligned to the sequences of SFTPD from other species, as they are believed to be evolutionarily related (Hansen and Holmskov 2002; Gjerstorff et al. 2004b). A minimum E-value of 0.05 was used to consider a motif conserved. The conserved motifs were then examined for the presence of TFBSs predicted by CIS-BP and containing variants.

Allelic association

The estimated frequency of variant alleles was compared between the noninfectious and infectious populations using Popoolation2 (Kofler et al. 2011). Processed BAM files for each pool were combined using Samtools into a single BAM file for each population, and an mpileup file was generated using minimum mapping and base qualities of 20 (Li et al. 2009). Popoolation2 was used to transform the mpileup file into a sync file, which was then down-sampled according to the recommendations of Popoolation2 to 400 reads per population using the method “fraction” to mitigate the impact of variable read depths on statistical testing. Fisher’s exact test was used to determine the significance of allele frequency estimates between the two populations. A minimum of 5% of the reads (20) across both populations combined was required for the allele to be considered in the allele frequency estimation. The Benjamini-Hochberg procedure was used to correct for multiple testing (Benjamini and Hochberg 1995) with adjusted p value cutoffs labeled as described in the BADGE system (Manly 2005).

Results

Evaluation of target capture efficiency showed that the median of the mean target coverage was 172.5; however, there were significant differences (p < 0.05) both in terms of the depth of coverage for each target region, as well as the total depth of coverage between the noninfectious and infectious populations (Fig. 2). The depth of coverage between the two populations at individual target regions was not statistically different.

Joint genotyping identified a total of 5439 unique variants, all of which were present in both the noninfectious and infectious populations. These included 5418 SNVs and 21 in/dels. The majority (5317, 97.7%) of the variants identified were present in dbSNP 150, while 122 were novel discoveries. A further 32,794 variants were present in dbSNP 150 within our target intervals. Of these, 2922 were present in our population but were excluded due to various filtering parameters, 29,696 loci were not variant in our population, and 356 loci were not successfully sequenced.

A total of 83 coding variants (40 synonymous, 43 missense), 2297 intronic variants, 309 variants within the downstream 3 kb of the stop codon, 414 variants within the 5 kb upstream of the start codon, and 2915 variants in the region between 5 and 50 kb upstream of the start codon were identified. The density of variants was examined both by gene and by region (Fig. 3). Significant differences (p < 0.05) in variant density were observed between genes, with the highest density of variants found in the FCN1 gene and surrounding DNA (Fig. 3a). Variant density by region was not significantly different, though the coding region tended to have the lowest density (Fig. 3b).

Among the coding variants, 43 missense variants were identified. In silico predictions of the effects of the mutations using either algorithm identified a total of 11 SNVs expected to have damaging or possibly damaging effects on protein structure or function (Table 4), with three of the variants (within SFTPD, MBL2, and CL43) predicted to be damaging by both algorithms.

Table 4 Potentially deleterious missense mutations as determined by in silico analysis with two different prediction algorithms

Full size table

A total of 20,147 potential cis TFBSs were predicted in the upstream 5 kb of the targeted genes. Of these, 1351 contained a SNV identified in this study. In highly conserved DNA sequence motifs (based on our multispecies comparison of upstream regulatory regions), 148 TFBSs containing a SNV were found across 10 of the targeted genes (Fig. 4a). These TFBSs were members of 30 TF families (Fig. 4b).

Analysis of the 3 kb downstream from the stop codon using both algorithms identified 469 potential miRNA recognition elements (MREs). Within the seed region of the predicted MREs there were 31 SNVs (Fig. 5), two of which impacted two separate miRs. A total of 28 unique miRs were predicted to bind in these regions, 5 of which bound multiple loci. A single MREs intersected with the annotated 3′ UTR of SFTPA1.

Evaluation of the frequency of variant alleles identified 25 BADGE class I (p < 2 × 10⁻⁷) and 49 class II (p < 5 × 10⁻⁶) variants that were significantly associated with either the Non-infectious or Infectious populations (Fig. 6). Seventeen associations were found clustered in intron 2 of MASP1 (Fig. 6b), and a further 48 associations were present in a ~ 21 kb region ~ 29 kb upstream from CGN1, the first gene of the bovine collectin locus (Fig. 6c). Four associations were found distributed upstream, downstream, and within intron 8 of FCN1 (Online Resource 2a); four associations were found in the introns 4 and 5 and the downstream region of COLEC11 (Online Resource 2b); and a single association was found in intron 2 of COLEC12 (Online Resource 2c).

Discussion

Associating polymorphisms with infectious disease susceptibility has important implications in agricultural economics, animal breeding, and animal health and welfare. Probing the underlying genetics of complex traits like innate immunity is challenging and requires large numbers of animals with well-defined phenotypes (Ron and Weller 2007). The phenotypes of our study samples were determined through complete autopsies supervised by certified veterinary pathologists. This included evaluation of gross tissues, ancillary testing as required, and review of histopathological specimens to define the extent and nature of disease. In order to address the considerable expense required to sequence large numbers of animals, we opted to use a pooled and targeted next-generation sequence approach. Pooled sequencing has been shown to be an accurate and cost-effective method of variant discovery and allele frequency estimation in next-generation sequencing experiments (Bansal et al. 2011; Mullen et al. 2012; Rellstab et al. 2013; Bertelsen et al. 2016), as well as in genome-wide association studies (Keele et al. 2015). While whole genome sequencing would provide a more complete picture of disease associated variants, the cost remains prohibitive, despite decreasing sequencing costs. Instead, targeted resequencing allows specific regions of interest in the genome to be queried, providing much more detail than the available high density SNP array from Illumina: of the 5439 variants found in our study, only 223 are present in the Illumina BovineHD BeadChip (https://support.illumina.com/array/array_kits/bovinehd_dna_analysis_kit/downloads.html, accessed 2018-01-12).

Allele frequency estimation identified 74 significant variants (BADGE class II or higher), after correction for multiple testing, that were associated with infectious disease. Over half of the significant alleles occurred in a 29-kb segment of DNA upstream from CGN1, and these alleles met the highest level of statistical significance proposed by the BADGE system (Manly 2005). CGN1 is the first gene in the bovine collectin locus, a 260-kb region on chromosome 28 which includes the CL46, CL43, SFTPD, MBL1, and SFTPA1 genes (Gjerstorff et al. 2004a), and several of these genes have been implicated in infectious disease susceptibility. Plasma concentration of conglutinin is known to be heritable and low levels of plasma conglutinin are associated with increased incidence of respiratory disease in cattle (Holmskov et al. 1998). Whole transcriptome sequencing of the abomasum of parasite-susceptible and resistant calves found that resistant animals expressed higher levels of CL46 (Li et al. 2011). In cattle breeds native to China, certain MBL1 haplotypes are associated with somatic cell score, an indirect marker of mastitis (Wang et al. 2011). The block of highly significant associations discovered here could theoretically impact any of the genes within the collectin locus, as DNA kilobases to megabases upstream of genes can have a regulatory impact through enhancer, silencer, insulator, or locus control region elements (Maston et al. 2006) that are difficult to predict, and for which the bovine genome lacks annotation. The relative similarity of the allele frequencies in the Non-infectious and Infectious populations was similar for all of these variants, suggesting that they are likely in linkage disequilibrium (Online Resource 3). The pooled design of this study precludes haplotype analysis so further study of these variants, and the genes of the nearby collectin locus, both in terms of RNA expression and epigenetics, is warranted.

Two class I and 15 class II associations were found in intron 2 of MASP1. Human MASP1 encodes three protein isoforms with distinct functions, MASP-1, MASP-3, and MAP1 (Beltrame et al. 2015). Only a single transcript is annotated for bovine MASP1, which best corresponds to the MASP-3 isoform, and an additional 4 transcripts are predicted by NCBI. As with human MASP1, the transcript and predicted transcripts that encode the three MASP1 isoforms share the first 8 exons; thus, the cluster of variants noted in intron 2 is also present in intron 2 for all of these predicted transcripts. Genetic mutations resulting in human MASP1 deficiency are associated with infectious disease, and several intronic mutations leading to MASP1 deficiency have been identified, though none are identical to the associations found here (Ingels et al. 2013; Beltrame et al. 2015). Again, the relative similarity of the allele frequencies for the different loci suggests significant linkage disequilibrium. It should be noted that mutations that inhibit the function of the MASPs (and complement-activating collagenous lectins) may also confer benefits: excess activation of complement can contribute to tissue damage or autoimmune disease, and decreased or more moderate levels of complement activity may therefore be beneficial in some scenarios (Beltrame et al. 2015).

A further four associations were found in COLEC11 and two in COLEC12; however, no in silico consequences were predicted for any of the six. There is little known about the relative importance of CL-K1 and CL-P1, the proteins encoded by these two genes, in the innate immune response to infectious disease, and, to the authors knowledge, these are the first reported associations between mutations in these genes and infectious disease of cattle. Recently, CL-P1 was shown to have a soluble form that can activate the alternative pathway of complement (Ma et al. 2015), and CL-K1 is capable of activating the lectin pathway of complement (Ma et al. 2013), presenting possible pathways through which genetic mutations could hamper the immune system.

Although in silico analysis of the disease-associated alleles did not identify any biological effects, several interesting predictions were made regarding other variants present in our dataset. Two missense mutations found in MBL2 and one in MASP2 (Table 3) were previously shown to be associated with SCS in Chinese Holstein cattle (Wang et al. 2012; Wu et al. 2015). The frequency of these three variants was not significantly different in our populations of infectious and noninfectious cattle. Variant rs210611099:c.125C>A was rare, with only eight alleles predicted across both populations (MAF 3.3%). Though rare, this is substantially higher than the reported allele frequency of 0.27% in Chinese Holstein cattle (Wang et al. 2012), and may be the result of different breeds (Chinese Holsteins versus the mixture of breeds common to North America in our study). The mutation occurs in the collagen-like domain (CLD) of MBL-C and is predicted in silico to have a significant impact on protein structure and function. Previously reported mutations in the CLD had an impact on MBL-C driven complement activation (Larsen et al. 2004), as well as on higher-order oligomerization through disruption of the Gly-X-Y collagen-like repeats (Sumiya et al. 1991). Thus, despite its rarity, the role of this variant in infectious disease susceptibility remains of interest. Genotyping of a larger number of animals may be useful in clarifying the discrepancy between studies in allele frequencies and may provide the statistical power required to determine whether this rare variant plays a role in bovine innate immune defense.

The second MBL2 missense variant associated with SCS, rs210426415:c.92G>A, leads to the substitution of glutamine for arginine in the N-terminal domain of MBL-C. This domain is believed to utilize conserved cysteine residues to facilitate the functionally critical oligomerization of MBL-C (Wallis and Drickamer 1999). This variant was predicted by both in silico algorithms to have a low impact on protein structure and function, and was not associated with disease in our study, possibly the result of a smaller cohort of animals diagnosed with mastitis. Thus, although genotyping of larger numbers of North American cattle may reveal an association with infectious disease, this study does not provide evidence for further investigation.

The MASP2 variant previously shown to be associated with SCS, rs207667073.G>A, results in an amino acid change in the CUB domain that is predicted by the SIFT algorithm, but not Polyphen2, to be deleterious to protein structure or function. This allele showed no significant difference between the noninfectious and infectious populations; however, only two alleles were found in the entire population of cattle studied. Again, genotyping of larger number of North American cattle may prove useful in further defining the role of this variant.

Only one variant, rs381773088 in SFTPA1, was present within an annotated 3′ UTR and intersected with a predicted MRE for bta-miR-328. To our knowledge, there are no published studies on the role of bta-miR-328 in cattle; however, a study on the human homolog demonstrated that it plays a role in vitro in the innate immune defense against Haemophilus influenza through negative regulation of phagocytosis (Tay et al. 2015). Gram-negative pulmonary pathogens related to H. influenza, notably M. haemolytica, P. multocida, and H. somni, are important causes of respiratory disease in cattle; thus, this predicted MRE affected by a variant may hold relevance for future investigations.

In silico prediction of transcription factor binding sites relies on the observation that the amino acid sequence of the DNA-binding domain of transcription factor proteins largely predicts their DNA-binding specificity, and does so in a highly conserved manner (Kasahara et al. 2006; Jolma et al. 2013; Weirauch et al. 2014). Regulatory DNA is also highly conserved in animals (Nitta et al. 2015). Thus, to reduce the large number of predicted TFBSs identified in the targeted genes, conserved DNA sequences present in the potential promoter region were identified by comparing sequences from up to eight different animals including domestic livestock and more distantly related species (human, gorilla, cattle, horse, pig, rat, mouse, and chicken). This conservative approach narrowed the results to 148 potential TFBSs, some of which were similar to TFBSs predicted or shown in previous studies to be involved in the regulation of collagenous lectins. For example, we identified binding sites for SRF (a member of the MADS box family of transcription factors) in SFTPA1, and an SRF binding site was also identified in human SFTPA2 (Grageda et al. 2014). Initial characterization of the promoter of CL43 included in silico prediction of TFBSs (Hansen et al. 2003), and the same binding sites were predicted in our study for Myb, ARNT, cEBP, Myc, MyoD, Mzf-1, N-Myc, and USF; however, only Mzf-1 was both present within a conserved motif and contained a SNV. This discrepancy may be the result of more stringent requirements in our study (including exclusion of trans binding sites and sites outside of conserved sequences), or potentially due to differing prediction algorithms. HNF3alpha (also known as FOXA1) is known to regulate the expression of human and chicken MBL2 (Naito et al. 1999; Kjærup et al. 2013), and while binding sites were predicted in CL43 and COLEC11, no binding site was predicted for bovine MBL2.

The density of variants varied significantly by gene, but not by gene region (Fig. 3). FCN1 and the two surfactant protein genes, SFTPA1 and SFTPD, had a significantly higher variant density than the bottom two (for FCN1) and four (for SFTPA1 and SFTPD) genes. Both intrinsic and extrinsic factors can affect the degree of sequence variation. Intrinsically, the GC content and the number of in/dels in a region are predictive of variation (Hodgkinson and Eyre-Walker 2011). The GC content of the FCN1 gene was the second highest of the targeted genes at 55.1%; however, the overall correlation between GC content and the density of variants was not significant (R = 0.36, p = 0.23, Online Resource 4). Although the FCN1 target contained the highest number of indels (13/21), there was no correlation between number of indels and variation density (R = 0.36, p = 0.23, Online Resource 5). Extrinsically, genes that interact with the environment (such as innate immune response genes) can often be found in “hot spots” in the genome, which show higher levels of variation in response to increased adaptive pressures (Chuang and Li 2004). Interestingly, a recent study on the equine collagenous lectins found that two equine ficolin genes, FCN1 and FCN1-like, also had the highest variant density amongst the equine collagenous lectins (Fraser, unpublished). Five of the eight coding region mutations found in FCN1 were located within the fibrinogen-like domain, and three were missense. Although the relative role of the ficolin genes in the innate immune response of these two species is still under investigation, the increased variation observed in both species might suggest increased evolutionary pressure to adapt to different pathogens.

Variation in innate immune genes can significantly impact the susceptibility of animals to infectious diseases. Here, we have comprehensively documented the variation in a subset of innate immune genes, the collagenous lectins, in cattle with and without infectious diseases. Comparison of mutant alleles in the two populations identified 74 alleles associated with infectious disease. These alleles warrant further investigation, both in terms of population level frequencies, and, if confirmed to be significantly associated with disease susceptibility, in terms of their biological mechanisms of action.

References

Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR (2010) A method and server for predicting damaging missense mutations. Nat Methods 7:248–249. https://doi.org/10.1038/nmeth0410-248
Article PubMed PubMed Central CAS Google Scholar
Agarwal V, Bell GW, Nam JW, Bartel DP (2015) Predicting effective microRNA target sites in mammalian mRNAs. elife 4. https://doi.org/10.7554/eLife.05005.001
Andrade FA, Beltrame MH, Bini VB, Gonçalves LB, Boldt AB, de Taborda Messias-Reason IJ (2017) Association of a new FCN3 haplotype with high ficolin-3 levels in leprosy. PLoS Negl Trop Dis 11:e0005409
Bailey TL, Elkan C (1994) Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol 2:28–36
PubMed CAS Google Scholar
Bansal V, Tewhey R, Leproust EM, Schork NJ (2011) Efficient and cost effective population resequencing by pooling and in-solution hybridization. PLoS One 6:e18353. https://doi.org/10.1371/journal.pone.0018353
Article PubMed PubMed Central CAS Google Scholar
Beltrame MH, Boldt ABW, Catarino SJ, Mendes HC, Boschmann SE, Goeldner I, Messias-Reason I (2015) MBL-associated serine proteases (MASPs) and infectious diseases. Mol Immunol 67:85–100. https://doi.org/10.1016/j.molimm.2015.03.245
Article PubMed CAS Google Scholar
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Stat Methodol. https://doi.org/10.2307/2346101
Bertelsen HP, Gregersen VR, Poulsen N, Nielsen RO, Das A, Madsen LB, Buitenhuis AJ, Holm LE, Panitz F, Larsen LB, Bendixen C (2016) Detection of genetic variation affecting milk coagulation properties in Danish Holstein dairy cattle by analyses of pooled whole-genome sequences from phenotypically extreme samples (pool-seq). J Anim Sci 94:1365–1313. https://doi.org/10.2527/jas.2015-9884
Article PubMed CAS Google Scholar
Boldt ABW, Sanchez MIN, Stahlke ERS, Steffensen R, Thiel S, Jensenius JC, Prevedello FC, Mira MT, Kun JFJ, Messias-Reason IJT (2013) Susceptibility to leprosy is associated with M-ficolin polymorphisms. J Clin Immunol 33:210–219. https://doi.org/10.1007/s10875-012-9770-4
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:btu170–bt2120. https://doi.org/10.1093/bioinformatics/btu170
Article CAS Google Scholar
Chuang JH, Li H (2004) Functional bias and spatial organization of genes in mutational hot and cold regions in the human genome. PLoS Biol 2:E29. https://doi.org/10.1371/journal.pbio.0020029
Article PubMed PubMed Central CAS Google Scholar
DePristo MA, Banks E, Poplin R et al (2011) A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet 43:491–498. https://doi.org/10.1038/ng.806
Article PubMed PubMed Central CAS Google Scholar
Eisen DP, Minchinton RM (2003) Impact of mannose-binding lectin on susceptibility to infectious diseases. Clin Infect Dis 37:1496–1505. https://doi.org/10.1086/379324
Article PubMed CAS Google Scholar
Enright AJ, John B, Gaul U, Tuschl T, Sander C, Marks DS (2003) MicroRNA targets in Drosophila. Genome Biol 5:R1. https://doi.org/10.1186/gb-2003-5-1-r1
Article PubMed PubMed Central Google Scholar
Fujita T (2002) Evolution of the lectin–complement pathway and its role in innate immunity. Nat Rev Immunol 2:346–353. https://doi.org/10.1038/nri800
Article PubMed CAS Google Scholar
Fujita T (2004) Primitive complement system—recognition and activation. Mol Immunol 41:103–111. https://doi.org/10.1016/j.molimm.2004.03.026
Article PubMed CAS Google Scholar
Gjerstorff M, Hansen S, Jensen B, Dueholm B, Horn P, Bendixen C, Holmskov U (2004a) The genes encoding bovine SP-A, SP-D, MBL-A, conglutinin, CL-43 and CL-46 form a distinct collectin locus on Bos taurus chromosome 28 (BTA28) at position q.1.8-1.9. Anim Genet 35:333–337. https://doi.org/10.1111/j.1365-2052.2004.01167.x
Article PubMed CAS Google Scholar
Gjerstorff M, Madsen J, Bendixen C et al (2004b) Genomic and molecular characterization of bovine surfactant protein D (SP-D). Mol Immunol 41:369–376
Article PubMed CAS Google Scholar
Goujon M, McWilliam H, Li W, Valentin F, Squizzato S, Paern J, Lopez R (2010) A new bioinformatics analysis tools framework at EMBL-EBI. Nucleic Acids Res 38:W695–W699. https://doi.org/10.1093/nar/gkq313
Article PubMed PubMed Central CAS Google Scholar
Grageda M, Silveyra P, Thomas NJ, DiAngelo SL, Floros J (2014) DNA methylation profile and expression of surfactant protein A2 gene in lung cancer. Exp Lung Res 41:93–102. https://doi.org/10.3109/01902148.2014.976298
Article PubMed PubMed Central CAS Google Scholar
Halasa T, Huijps K, Østerås O, Hogeveen H (2007) Economic effects of bovine mastitis and mastitis management: a review. Vet Q 29:18–31. https://doi.org/10.1080/01652176.2007.9695224
Article PubMed CAS Google Scholar
Hansen S, Holmskov U (2002) Lung surfactant protein D (SP-D) and the molecular diverted descendants: conglutinin, CL-43 and CL-46. Immunobiology 205:498–517. https://doi.org/10.1078/0171-2985-00150
Article PubMed CAS Google Scholar
Hansen S, Holm D, Moeller V, Vitved L, Bendixen C, Skjoedt K, Holmskov U (2003) Genomic and molecular characterization of CL-43 and its proximal promoter. Biochim Biophys Acta 1625:1–40
Article PubMed CAS Google Scholar
Hansen S, Selman L, Palaniyar N, Ziegler K, Brandt J, Kliem A, Jonasson M, Skjoedt MO, Nielsen O, Hartshorn K, Jorgensen TJD, Skjodt K, Holmskov U (2010) Collectin 11 (CL-11, CL-K1) is a MASP-1/3–associated plasma collectin with microbial-binding activity. J Immunol 185:6096–6104. https://doi.org/10.4049/jimmunol.1002185
Article PubMed CAS Google Scholar
Heikkilä A-M, Nousiainen JI, Pyörälä S (2012) Costs of clinical mastitis with special reference to premature culling. J Dairy Sci 95:139–150
Article PubMed CAS Google Scholar
Herrera-Ramos E, López-Rodríguez M, Ruíz-Hernández JJ, Horcajada J, Borderías L, Lerma E, Blanquer J, Pérez-González M, García-Laorden M, Florido Y, Mas-Bosch V, Montero M, Ferrer J, Sorlí L, Vilaplana C, Rajas O, Briones M, Aspa J, López-Granados E, Solé-Violán J, de Castro F, Rodríguez-Gallego C (2014) Surfactant protein A genetic variants associate with severe respiratory insufficiency in pandemic influenza A virus infection. Crit Care 18:R127. https://doi.org/10.1186/cc13934
Article PubMed PubMed Central Google Scholar
Hodgkinson A, Eyre-Walker A (2011) Variation in the mutation rate across mammalian genomes. Nat Rev Genet 12:756–766. https://doi.org/10.1038/nrg3098
Article PubMed CAS Google Scholar
Holmskov U, Jensenius JC, Tornoe I, Lovendahl P (1998) The plasma levels of conglutinin are heritable in cattle and low levels predispose to infection. Immunology 93:431–436
Article PubMed PubMed Central CAS Google Scholar
Ingels C, Vanhorebeek I, Steffensen R, Derese I, Jensen L, Wouters PJ, Hermans G, Thiel S, van den Berghe G (2013) Lectin pathway of complement activation and relation with clinical complications in critically ill children. Pediatr Res 75:99–108. https://doi.org/10.1038/pr.2013.180
Article PubMed CAS Google Scholar
Janeway CA (1989) Approaching the asymptote? Evolution and revolution in immunology. Cold Spring Harb Symp Quant Biol 54(Pt 1):1–13. https://doi.org/10.1101/SQB.1989.054.01.003
Article PubMed CAS Google Scholar
Jolma A, Yan J, Whitington T, Toivonen J, Nitta KR, Rastas P, Morgunova E, Enge M, Taipale M, Wei G, Palin K, Vaquerizas JM, Vincentelli R, Luscombe NM, Hughes TR, Lemaire P, Ukkonen E, Kivioja T, Taipale J (2013) DNA-binding specificities of human transcription factors. Cell 152:327–339. https://doi.org/10.1016/j.cell.2012.12.009
Article PubMed CAS Google Scholar
Juul-Madsen HR, Norup LR, Jørgensen PH, Handberg KJ, Wattrang E, Dalgaard TS (2011) Crosstalk between innate and adaptive immune responses to infectious bronchitis virus after vaccination and challenge of chickens varying in serum mannose-binding lectin concentrations. Vaccine 29:9499–9507. https://doi.org/10.1016/j.vaccine.2011.10.016
Article PubMed CAS Google Scholar
Karolchik D, Hinrichs AS, Furey TS et al (2004) The UCSC Table Browser data retrieval tool. Nucleic Acids Res 32:D493–D496. https://doi.org/10.1093/nar/gkh103
Article PubMed PubMed Central CAS Google Scholar
Kasahara T, Miyazaki T, Nitta H et al (2006) Evaluation of methods for duration of preservation of RNA quality in rat liver used for transcriptome analysis. J Toxicol Sci 31:509–519. https://doi.org/10.2131/jts.31.509
Article PubMed CAS Google Scholar
Keele JW, Kuehn LA, McDaneld TG et al (2015) Genomewide association study of lung lesions in cattle using sample pooling. J Anim Sci 93:956–964. https://doi.org/10.2527/jas.2014-8492
Article PubMed CAS Google Scholar
Kempen G, Meijvis S, Endeman H, Vlaminckx B, Meek B, Jong B, Rijkers G, Bos WJ (2017) Mannose-binding lectin and l-ficolin polymorphisms in patients with community‐acquired pneumonia caused by intracellular pathogens. Immunology. 151:81–88
Kjærup RM, Norup LR, Skjødt K, Dalgaard TS, Juul-Madsen HR (2013) Chicken mannose-binding lectin (MBL) gene variants with influence on MBL serum concentrations. Immunogenetics 65:461–471. https://doi.org/10.1007/s00251-013-0689-6
Article PubMed CAS Google Scholar
Kofler R, Pandey RV, Schlotterer C (2011) PoPoolation2: identifying differentiation between populations using sequencing of pooled DNA samples (Pool-Seq). Bioinformatics 27:3435–3436. https://doi.org/10.1093/bioinformatics/btr589
Article PubMed PubMed Central CAS Google Scholar
Kozomara A, Griffiths-Jones S (2013) miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res 42:D68–D73. https://doi.org/10.1093/nar/gkt1181
Article PubMed PubMed Central CAS Google Scholar
Larsen F, Madsen HO, Sim RB, Koch C, Garred P (2004) Disease-associated mutations in human mannose-binding lectin compromise oligomerization and activity of the final protein. J Biol Chem 279:21302–21311. https://doi.org/10.1074/jbc.M400520200
Article PubMed CAS Google Scholar
Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760. https://doi.org/10.1093/bioinformatics/btp324
Article PubMed PubMed Central CAS Google Scholar
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079. https://doi.org/10.1093/bioinformatics/btp352
Article PubMed PubMed Central CAS Google Scholar
Li RW, Rinaldi M, Capuco AV (2011) Characterization of the abomasal transcriptome for mechanisms of resistance to gastrointestinal nematodes in cattle. Vet Res 42:114. https://doi.org/10.1186/1297-9716-42-114
Article PubMed PubMed Central CAS Google Scholar
Lillie BN, Keirstead ND, Squires EJ, Hayes MA (2007) Gene polymorphisms associated with reduced hepatic expression of porcine mannan-binding lectin C. Dev Comp Immunol 31:830–846. https://doi.org/10.1016/j.dci.2006.11.002
Article PubMed CAS Google Scholar
Liu J, Ju Z, Li Q, Huang J, Li R, Li J, Ma L, Zhong J, Wang C (2011) Mannose-binding lectin 1 haplotypes influence serum MBL-A concentration, complement activity, and milk production traits in Chinese Holstein cattle. Immunogenetics 63:727–742. https://doi.org/10.1007/s00251-011-0548-2
Article PubMed CAS Google Scholar
Luz PR, Miyazaki MI, Chiminacio Neto N, Padeski MC, Barros ACM, Boldt ABW, Messias-Reason IJ (2016) Genetically determined MBL deficiency is associated with protection against chronic cardiomyopathy in Chagas disease. PLoS Negl Trop Dis 10:e0004257–e0004216. https://doi.org/10.1371/journal.pntd.0004257
Article PubMed PubMed Central CAS Google Scholar
Ma YJ, Skjoedt M-O, Garred P (2013) Collectin-11/MASP complex formation triggers activation of the lectin complement pathway—the fifth lectin pathway initiation complex. J Innate Immun 5:242–250. https://doi.org/10.1159/000345356
Article PubMed CAS Google Scholar
Ma YJ, Hein E, Munthe-Fog L, Skjoedt MO, Bayarri-Olmos R, Romani L, Garred P (2015) Soluble Collectin-12 (CL-12) is a pattern recognition molecule initiating complement activation via the alternative pathway. J Immunol 195:3365–3373. https://doi.org/10.4049/jimmunol.1500493
Article PubMed CAS Google Scholar
Manly KF (2005) Reliability of statistical associations between genes and disease. Immunogenetics 57:549–558. https://doi.org/10.1007/s00251-005-0025-x
Article PubMed Google Scholar
Maston GA, Evans SK, Green MR (2006) Transcriptional regulatory elements in the human genome. Annu Rev Genomics Hum Genet 7:29–59. https://doi.org/10.1146/annurev.genom.7.080505.115623
Article PubMed CAS Google Scholar
Matsushita M, Fujita T (1992) Activation of the classical complement pathway by mannose-binding protein in association with a novel C1s-like serine protease. J Exp Med 176:1497–1502
Article PubMed CAS Google Scholar
Matsushita M, Endo Y, Hamasaki N, Fujita T (2001) Activation of the lectin complement pathway by ficolins. Int Immunopharmacol 1:359–363
Article PubMed CAS Google Scholar
Matsushita M, Endo Y, Fujita T (2013) Structural and functional overview of the lectin complement pathway: its molecular basis and physiological implication. Arch Immunol Ther Exp 61:273–283. https://doi.org/10.1007/s00005-013-0229-y
Article CAS Google Scholar
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA (2010) The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20:1297–1303. https://doi.org/10.1101/gr.107524.110
Article PubMed PubMed Central CAS Google Scholar
McLaren W, Pritchard B, Rios D, Chen Y, Flicek P, Cunningham F (2010) Deriving the consequences of genomic variants with the Ensembl API and SNP effect predictor. Bioinformatics 26:2069–2070. https://doi.org/10.1093/bioinformatics/btq330
Article PubMed PubMed Central CAS Google Scholar
Miles DG (2009) Overview of the North American beef cattle industry and the incidence of bovine respiratory disease (BRD). Anim Health Res Rev 10:101–103. https://doi.org/10.1017/S1466252309990090
Article PubMed Google Scholar
Mullen MP, Creevey CJ, Berry DP, McCabe MS, Magee DA, Howard DJ, Killeen AP, Park SD, McGettigan PA, Lucy MC, MacHugh DE, Waters SM (2012) Polymorphism discovery and allele frequency estimation using high-throughput DNA sequencing of target-enriched pooled DNA samples. BMC Genomics 13:16. https://doi.org/10.1186/1471-2164-13-16
Article PubMed PubMed Central CAS Google Scholar
Naito H, Ikeda A, Hasegawa K, Oka S, Uemura K, Kawasaki N, Kawasaki T (1999) Characterization of human serum mannan-binding protein promoter. J Biochem 126:1004–1012
Article PubMed CAS Google Scholar
Nitta KR, Jolma A, Yin Y, Morgunova E, Kivioja T, Akhtar J, Hens K, Toivonen J, Deplancke B, Furlong EEM, Taipale J (2015) Conservation of transcription factor binding specificities across 600 million years of bilateria evolution. elife 4:403. https://doi.org/10.7554/eLife.04837
Article CAS Google Scholar
Prescott JF, Szkotnicki J, McClure JT, Reid-Smith RJ, Léger DF (2012) Conference report: antimicrobial stewardship in Canadian agriculture and veterinary medicine. How is Canada doing and what still needs to be done? Can Vet J 53:402–407
Quinlan AR, Hall IM (2010) BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26:841–842. https://doi.org/10.1093/bioinformatics/btq033
Article PubMed PubMed Central CAS Google Scholar
R Core Team (2017) A language and environment for statistical programming. Available at: https://www.R-project.org
Rellstab C, Zoller S, Tedder A, Gugerli F, Fischer MC (2013) Validation of SNP allele frequencies determined by pooled next-generation sequencing in natural populations of a non-model plant species. PLoS One 8:e80422. https://doi.org/10.1371/journal.pone.0080422
Article PubMed PubMed Central CAS Google Scholar
Riffo-Campos ÁL, Riquelme I, Brebi-Mieville P (2016) Tools for sequence-based miRNA target prediction: what to choose? Int J Mol Sci 17:1987. https://doi.org/10.3390/ijms17121987
Article PubMed Central CAS Google Scholar
Ron M, Weller JI (2007) From QTL to QTN identification in livestock—winning by points rather than knock-out: a review. Anim Genet 38:429–439. https://doi.org/10.1111/j.1365-2052.2007.01640.x
Article PubMed CAS Google Scholar
Schepers JA, Dijkhuizen AA (1991) The economics of mastitis and mastitis control in dairy cattle: a critical analysis of estimates published since 1970. Prev Vet Med 10:213–224
Article Google Scholar
Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K (2001) dbSNP: the NCBI database of genetic variation. Nucleic Acids Res 29:308–311. https://doi.org/10.1093/nar/29.1.308
Article PubMed PubMed Central CAS Google Scholar
Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, Soding J, Thompson JD, Higgins DG (2011) Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol 7:539–539. https://doi.org/10.1038/msb.2011.75
Article PubMed PubMed Central Google Scholar
Sim N-L, Kumar P, Hu J, Henikoff S, Schneider G, Ng PC (2012) SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Res 40:W452–W457. https://doi.org/10.1093/nar/gks539
Article PubMed PubMed Central CAS Google Scholar
Sumiya M, Super M, Tabona P et al (1991) Molecular basis of opsonic defect in immunodeficient children. Lancet 337:1569–1570
Article PubMed CAS Google Scholar
Summerfield JA, Ryder S, Sumiya M, Thursz M, Gorchein A, Monteil MA, Turner MW (1995) Mannose binding protein gene mutations associated with unusual and severe infections in adults. Lancet 345:886–889
Article PubMed CAS Google Scholar
Tay HL, Kaiko GE, Plank M, Li JJ, Maltby S, Essilfie AT, Jarnicki A, Yang M, Mattes J, Hansbro PM, Foster PS (2015) Antagonism of miR-328 increases the antimicrobial function of macrophages and neutrophils and rapid clearance of non-typeable Haemophilus influenzae (NTHi) from infected lung. PLoS Pathog 11:e1004549. https://doi.org/10.1371/journal.ppat.1004549
Article PubMed PubMed Central CAS Google Scholar
The Bovine Genome Sequencing and Analysis Consortium, Elsik CG, Tellam RL et al (2009) The genome sequence of taurine cattle: a window to ruminant biology and evolution. Science 324:522–528. https://doi.org/10.1126/science.1169588
Article PubMed Central CAS Google Scholar
Thiel S, Vorup-Jensen T, Stover CM, Schwaeble W, Laursen SB, Poulsen K, Willis AC, Eggleton P, Hansen S, Holmskov U, Reid KBM, Jensenius JC (1997) A second serine protease associated with mannan-binding lectin that activates complement. Nature 386:506–510. https://doi.org/10.1038/386506a0
Article PubMed CAS Google Scholar
Van der Auwera GA, Carneiro MO, Hartl C et al (2013) From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr Protoc Bioinformatics 11:11.10.1–11.10.33. https://doi.org/10.1002/0471250953.bi1110s43
Article Google Scholar
Wallis R, Drickamer K (1999) Molecular determinants of oligomer formation and complement fixation in mannose-binding proteins. J Biol Chem 274:3580–3589
Article PubMed CAS Google Scholar
Wang C, Liu M, Li Q, Ju Z, Huang J, Li J, Wang H, Zhong J (2011) Three novel single-nucleotide polymorphisms of MBL1 gene in Chinese native cattle and their associations with milk performance traits. Vet Immunol Immunopathol 139:229–236. https://doi.org/10.1016/j.vetimm.2010.10.023
Article PubMed CAS Google Scholar
Wang X, Ju Z, Huang J, Hou M, Zhou L, Qi C, Zhang Y, Gao Q, Pan Q, Li G, Zhong J, Wang C (2012) The relationship between the variants of the bovine MBL2 gene and milk production traits, mastitis, serum MBL-C levels and complement activity. Vet Immunol Immunopathol 148:311–319. https://doi.org/10.1016/j.vetimm.2012.06.017
Article PubMed CAS Google Scholar
Weirauch MT, Yang A, Albu M, Cote AG, Montenegro-Montero A, Drewe P, Najafabadi HS, Lambert SA, Mann I, Cook K, Zheng H, Goity A, van Bakel H, Lozano JC, Galli M, Lewsey MG, Huang E, Mukherjee T, Chen X, Reece-Hoyes JS, Govindarajan S, Shaulsky G, Walhout AJM, Bouget FY, Ratsch G, Larrondo LF, Ecker JR, Hughes TR (2014) Determination and inference of eukaryotic transcription factor sequence specificity. Cell 158:1431–1443. https://doi.org/10.1016/j.cell.2014.08.009
Article PubMed PubMed Central CAS Google Scholar
Wu J, Bai JY, Li L, Huang S, Li CM, Wang GL (2015) Genetic polymorphisms of the BMAP-28 and MASP-2 genes and their correlation with the somatic cell score in Chinese Holstein cattle. Genet Mol Res 14:1–8. https://doi.org/10.4238/2015.January.15.1
Article PubMed CAS Google Scholar
Yuan Z, Li J, Li J, Gao X, Xu S (2012) SNPs identification and its correlation analysis with milk somatic cell score in bovine MBL1 gene. Mol Biol Rep 40:7–12. https://doi.org/10.1007/s11033-012-1934-z
Article PubMed CAS Google Scholar
Zhao ZL, Wang CF, Li QL, Ju ZH, Huang JM, Li JB, Zhong JF, Zhang JB (2012) Novel SNPs of the mannan-binding lectin 2 gene and their association with production traits in Chinese Holsteins. Genet Mol Res 11:3744–3754. https://doi.org/10.4238/2012.October.15.6
Article PubMed CAS Google Scholar

Download references

Acknowledgements

This work was financially supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) [Discovery grant 418356-2012] and an Ontario Veterinary College PhD Fellowship to Russell Fraser. The authors wish to acknowledge Dr. Jutta Hammermueller for her technical help.

Author information

Authors and Affiliations

Department of Pathobiology, Ontario Veterinary College, University of Guelph, Guelph, ON, Canada
R. S. Fraser, J. S. Lumsden & B. N. Lillie
St. George’s University, True Blue, Grenada
J. S. Lumsden

Authors

R. S. Fraser
View author publications
You can also search for this author in PubMed Google Scholar
J. S. Lumsden
View author publications
You can also search for this author in PubMed Google Scholar
B. N. Lillie
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to B. N. Lillie.

Electronic supplementary material

The variants identified in this study have been submitted to the European Variant Archive (EVA) accession PRJEB24763.

ESM 1

(XLSX 28 kb)

ESM 2

(PDF 238 kb)

ESM 3

(XLSX 19 kb)

ESM 4

(PDF 91 kb)

ESM 5

(PDF 88 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Fraser, R.S., Lumsden, J.S. & Lillie, B.N. Identification of polymorphisms in the bovine collagenous lectins and their association with infectious diseases in cattle. Immunogenetics 70, 533–546 (2018). https://doi.org/10.1007/s00251-018-1061-7

Download citation

Received: 20 February 2018
Accepted: 01 May 2018
Published: 10 May 2018
Issue Date: August 2018
DOI: https://doi.org/10.1007/s00251-018-1061-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Identification of polymorphisms in the bovine collagenous lectins and their association with infectious diseases in cattle

Abstract

Similar content being viewed by others

Genome-wide association study identifies loci associated with milk leukocyte phenotypes following experimental challenge with Streptococcus uberis

Allele substitution and dominance effects of CD166/ALCAM gene polymorphisms for endoparasite resistance and test-day traits in a small cattle population using logistic regression analyses

The genomic architecture of mastitis resistance in dairy sheep

Introduction