Development and Experimental Validation of a 20K Atlantic Cod (Gadus morhua) Oligonucleotide Microarray Based on a Collection of over 150,000 ESTs

Booman, Marije; Borza, Tudor; Feng, Charles Y.; Hori, Tiago S.; Higgins, Brent; Culf, Adrian; Léger, Daniel; Chute, Ian C.; Belkaid, Anissa; Rise, Marlies; Gamperl, A. Kurt; Hubert, Sophie; Kimball, Jennifer; Ouellette, Rodney J.; Johnson, Stewart C.; Bowman, Sharen; Rise, Matthew L.

doi:10.1007/s10126-010-9335-6

Development and Experimental Validation of a 20K Atlantic Cod (Gadus morhua) Oligonucleotide Microarray Based on a Collection of over 150,000 ESTs

Original Article
Open access
Published: 03 December 2010

Volume 13, pages 733–750, (2011)
Cite this article

Download PDF

You have full access to this open access article

Marine Biotechnology Aims and scope Submit manuscript

Development and Experimental Validation of a 20K Atlantic Cod (Gadus morhua) Oligonucleotide Microarray Based on a Collection of over 150,000 ESTs

Download PDF

Marije Booman¹,
Tudor Borza²,
Charles Y. Feng¹,
Tiago S. Hori¹,
Brent Higgins²,
Adrian Culf³,
Daniel Léger³,
Ian C. Chute³,
Anissa Belkaid³,
Marlies Rise^1,2,
A. Kurt Gamperl¹,
Sophie Hubert²,
Jennifer Kimball⁴,
Rodney J. Ouellette³,
Stewart C. Johnson⁵,
Sharen Bowman² &
…
Matthew L. Rise^1,6

1804 Accesses
35 Citations
6 Altmetric
Explore all metrics

Abstract

The collapse of Atlantic cod (Gadus morhua) wild populations strongly impacted the Atlantic cod fishery and led to the development of cod aquaculture. In order to improve aquaculture and broodstock quality, we need to gain knowledge of genes and pathways involved in Atlantic cod responses to pathogens and other stressors. The Atlantic Cod Genomics and Broodstock Development Project has generated over 150,000 expressed sequence tags from 42 cDNA libraries representing various tissues, developmental stages, and stimuli. We used this resource to develop an Atlantic cod oligonucleotide microarray containing 20,000 unique probes. Selection of sequences from the full range of cDNA libraries enables application of the microarray for a broad spectrum of Atlantic cod functional genomics studies. We included sequences that were highly abundant in suppression subtractive hybridization (SSH) libraries, which were enriched for transcripts responsive to pathogens or other stressors. These sequences represent genes that potentially play an important role in stress and/or immune responses, making the microarray particularly useful for studies of Atlantic cod gene expression responses to immune stimuli and other stressors. To demonstrate its value, we used the microarray to analyze the Atlantic cod spleen response to stimulation with formalin-killed, atypical Aeromonas salmonicida, resulting in a gene expression profile that indicates a strong innate immune response. These results were further validated by quantitative PCR analysis and comparison to results from previous analysis of an SSH library. This study shows that the Atlantic cod 20K oligonucleotide microarray is a valuable new tool for Atlantic cod functional genomics research.

Development and validation of a mixed-tissue oligonucleotide DNA microarray for Atlantic bluefin tuna, Thunnus thynnus (Linnaeus, 1758)

Article Open access 25 November 2015

Developing specific molecular biomarkers for thermal stress in salmonids

Article Open access 16 October 2018

Gene Expression Profiling in Fish Toxicology: A Review

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The collapse of wild populations of Atlantic cod (Gadus morhua) has had a large impact on the cod fishery, which has historically been an important industry in several countries including Canada and Norway. To allow wild cod stocks to recover while still satisfying consumer demands, Atlantic cod aquaculture is an important alternative to wild fishery. However, attempts at commercial-scale cod aquaculture have not been fully successful and are hampered by slow growth, early maturation (Hansen et al. 2001), infectious diseases (Samuelsen et al. 2006), and stress associated with handling (Brown et al. 2003) and with fluctuating temperatures in sea cages (Gollock et al. 2006). In order to solve these problems, we need to learn more about cod biology, especially the response to stress and pathogens as the high-density of fish populations in aquaculture makes them more prone to exposure to these factors.

Atlantic cod has a peculiar immune system compared to other teleosts (reviewed in Pilstrom et al. 2005; Samuelsen et al. 2006; Solem and Stenvik 2006). Specific antibody responses to pathogens are weak or absent, although Atlantic cod does respond to vaccination (Gudmundsdóttir et al. 2009; Lund et al. 2006, 2007). Atlantic cod has a relatively high level of natural antibodies with a broad specificity and an affinity at least comparable to that of acquired antibodies, which might play an important role in the immune response (Magnadottir et al. 2009). Compared to other teleost species, Atlantic cod express an exceptionally high number of MH class I genes (Miller et al. 2002; Persson et al. 1999). However, despite significant efforts, MH class II sequences have not yet been identified in Atlantic cod. Further studies of the genes, proteins, cells, and molecular pathways involved in Atlantic cod immune responses will greatly benefit from large-scale genomics projects that are currently being conducted on this species. By understanding the genes and pathways involved in Atlantic cod responses to pathogens and other stressors, we will gain valuable knowledge to use in the management of Atlantic cod aquaculture and to improve broodstock quality.

The Atlantic Cod Genomics and Broodstock Development Project (CGP, www.codgene.ca) aims to provide genomic tools that can be used to increase our understanding of cod biology and to identify molecular markers for the selection of broodstock with favorable commercial phenotypes (such as rapid growth characteristics and resistance to stress and disease). These tools will also be useful in the development of new vaccines and therapeutics against the pathogens that threaten Atlantic cod aquaculture. As part of the CGP, we have previously used suppression subtractive hybridization (SSH) cDNA library construction, and sequencing and quantitative reverse transcription–polymerase chain reaction (QPCR) analysis of SSH-identified transcripts, to study gene expression responses of Atlantic cod to different stimuli such as heat stress (Hori et al. 2010; Pérez-Casanova et al. 2008), subclinical high nodavirus carrier state (Rise et al. 2008, 2010), and intraperitoneal (IP) injection of viral mimic polyriboinosinic polyribocytidylic acid [poly(I:C)] (Rise et al. 2008, 2010) or formalin-killed, atypical Aeromonas salmonicida (Feng et al. 2009). In total, the CGP created 42 SSH and normalized cDNA libraries and generated over 150,000 expressed sequence tags (ESTs; Bowman et al. 2010).

Aquaculture-relevant genomics research has benefitted greatly from the development of microarray platforms for a variety of teleost species (reviewed in Rise et al. 2009). For Atlantic cod, only two microarray platforms have been published to date. The small-scale CodStress microarray contains 746 ESTs (Lie et al. 2009) that were selected from four cDNA libraries constructed from tissues of Atlantic cod that were exposed to toxicants, and this microarray was designed for use as a diagnostic tool in ecotoxicology studies. The IMR 16K cDNA microarray (Edvardsen et al. 2010) contains 16,384 sequences that were selected from a relatively small set of ~44,000 ESTs from 20 un-normalized cDNA libraries.

Here, we describe the development of a large-scale microarray platform, the Atlantic cod oligonucleotide microarray containing 20,000 elements (20K), based on the EST resources that were generated by the CGP containing over 150,000 ESTs from 42 cDNA libraries, representing multiple tissues, developmental stages, and stimuli (Bowman et al. 2010). To test the newly developed microarray platform, we used it in a study of the spleen transcriptome response to IP injection with formalin-killed, atypical A. salmonicida using the same samples as were previously used in our SSH study (Feng et al. 2009). Results from the current microarray study were validated by QPCR assays, by comparing them to results from the SSH study (Feng et al. 2009), and by functional analysis of the resulting informative gene list. This study shows that the Atlantic cod 20K oligonucleotide microarray is a valuable new tool for Atlantic cod functional genomics research.

Materials and Methods

EST Clustering and the Selection of Representative Sequences for Microarray Probe Design

The CGP has generated a database of 158,877 Atlantic cod ESTs from 23 normalized cDNA libraries and 19 SSH cDNA libraries, representing a variety of tissue types, developmental stages, and stimuli (Bowman et al. 2010). The SSH libraries were constructed using tissues from fish stimulated with IP-injected formalin-killed, atypical A. salmonicida (Feng et al. 2009), IP-injected viral mimic poly(I:C) (Rise et al. 2008, 2010), or heat stress (Hori et al. 2010). The “All Version 2.0” sequence set consists of all ESTs from the CGP database that passed screening criteria of Paracel Transcript Assembler (PTA; Paracel Inc., Pasadena, CA), and this set of 154,142 ESTs was clustered and assembled using PTA as described by Bowman et al. (2010). Our use of the terms “cluster,” “contig,” “cluster-singleton,” and “singleton” is based on the PTA procedure. During the clustering process, PTA starts by placing each EST in its own unique cluster. Then, a pairwise comparison is performed between each EST and all other ESTs to determine which pairs of sequences are similar. PTA then considers all pairwise comparison hits (i.e., sequences that were found to be similar) and merges the two sequences that generated the hit into one cluster. PTA then continues by iterating through the pairwise comparison hits, each time merging the two clusters that contain the two sequences that generated a hit. By the end of this clustering process, some clusters will contain multiple ESTs, while others will only contain one EST. These latter sequences are no longer called “clusters” but are defined as “singletons,” i.e., ESTs that did not show sequence similarity to any other ESTs. For the subsequent assembly procedure, each cluster is processed separately and PTA assembles the ESTs within each cluster into one or more contiguous sequences (contigs). During this process, it might not be able to assemble all ESTs from a cluster into a contig. The single ESTs that were clustered (i.e., showed sequence similarity to other ESTs) but are not part of a contig are defined as “cluster-singletons.” “Clusters” are thus defined as groups of ESTs that show sequence similarity and that can form one or multiple contigs, or remain as “cluster-singletons.”

Clustering and assembly of the 154,142 ESTs resulted in 51,814 putative transcripts corresponding to 23,838 contigs and 27,976 singletons (Bowman et al. 2010). From this collection of 51,814 putative transcripts, a selection was made of candidate sequences for probe design.

To reduce redundancy, multiple contigs from the same cluster were only selected if they showed <60% identity over their entire length, and cluster-singletons within contig-containing clusters were discarded. For those clusters containing only cluster-singletons, the longest cluster-singleton within the cluster was selected. This screening step allowed for the selection of one or a few representative contigs or cluster-singletons from each cluster and, along with non-clustered singletons, produced a total number of 39,247 unique candidate sequences. For this microarray, oligonucleotide probes would be designed in sense (5′–3′) orientation, which requires that the orientation of selected sequences is known. For sequences from normalized libraries, which were generated using directional cloning (Bowman et al. 2010), the orientation could be inferred; however, for sequences from SSH libraries, which were generated using non-directional TA cloning (Feng et al. 2009; Hori et al. 2010; Rise et al. 2008, 2010), this was not possible since the orientation of inserts was unknown. In order to design probes with a proper 5′–3′ orientation, the orientation of all candidate sequences was checked in two steps. In the first step, a BLASTX search was performed against GenBank database release 165.0; all sequences having significant BLASTX hits (E value <10⁻⁵) in the opposite orientation were reversed and complemented. In the second step, sequences lacking significant BLASTX hits were reversed and complemented, when necessary, based on directional cloning information. Candidate sequences were thus divided into two categories: 34,355 “known orientation” candidate sequences with known or inferred orientation (determined by BLASTX alignment and/or directional cloning information) and 4,892 “unknown orientation” candidate sequences (i.e., sequences originating from SSH libraries without significant BLAST hits). Based on homology search in UniRef90 (UniProt), nr (NCBI), Kyoto Encyclopedia of Genes and Genomes, Clusters of Orthologous Groups, Protein Families, Large Subunit Ribosomal RNA, and Small Subunit Ribosomal RNA databases using AutoFACT (Koski et al. 2005), the category of 34,355 “known orientation” candidate sequences was split further into three groups: 11,152 “annotated” sequences showing significant homology (bit score >40) to sequences with functional annotation; 2,861 “unassigned” sequences showing significant homology to sequences with non-informative annotation, i.e., to sequences corresponding to proteins with unknown function; and 20,342 “unclassified” sequences showing no significant homology to any sequences in these databases. A schematic representation of the complete selection process is given in Fig. 1.

In addition to the four sequence categories described above (“annotated,” “unassigned,” “unclassified,” and “unknown orientation”), a fifth category of 58 “custom” sequences was chosen based on gene function. These sequences originated from CGP and included hemoglobin genes (Borza et al. 2009), CC chemokines (Borza et al. 2010), and genes involved in immune-relevant signaling pathways such as the RIG-I-like receptor signaling pathway, Toll-like receptor signaling, and interferon stimulation.

Oligonucleotide Probe Design

The design of 50-mer oligonucleotide probes was performed separately on each of the five sequence categories using ArrayDesigner version 4.2 (Premier Biosoft International, Palo Alto, CA) with the following relevant parameters: melting temperature, 72 ± 5°C; length, 50 nucleotides; position from the 3′ end: anywhere (for “annotated,” “unassigned,” and “custom” sequences) or 500 (for “unclassified” and “non-directed” sequences); hairpin dG, −6.0; dimer dG, −8.0; run/repeat, 5 max; word size, 15; BLASTN-DUST filtering: on; avoid homology: checked. Using these parameters, good probes could be designed for 25,695 “known orientation” and 2,695 “unknown orientation” sequences (Electronic Supplementary Material (ESM) Fig. S1). However, more than 60% of the “known orientation” sequences were represented by “unclassified” sequences, i.e., by sequences for which the homology search failed to reveal any relevant hits in databases. In order to maximize the number of probes designed from “annotated” and “unassigned” sequences, a second pass of probe design was carried out with sequences from these two categories that were rejected in the first pass; in this second pass, homology was allowed to occur among probes. The second pass produced good probes for 1,956 “annotated” and “unassigned” sequences. Probes from both design steps were checked by reciprocal BLASTN search to remove highly homologous probes. All probes were also checked for the presence of simple sequence repeats using Tandem Repeat Finder (Benson 1999) and two online versions of Repeat Masker (http://lucy.ceh.uvic.ca/repeatmasker/cbr_repeatmasker.py; http://www.repeatmasker.org/cgi-bin/WEBRepeatMasker); the probes identified as having repeated sequences or harboring low complexity regions were replaced with non-repetitive probes designed from the same transcript where possible. Finally, all probes having homologues in databases were checked for correct 5′ to 3′ orientation by running BLAST searches against GenBank database release 168.0 for confirmation purposes.

For the “annotated,” “unassigned,” and “custom” sequences, the selection process was now complete, resulting in 10,318 probes from the “annotated” sequences, 2,625 from the “unassigned” sequences, and 202 from the “custom” sequences. For the “unclassified” sequences, an additional selection step followed based on the number of ESTs contributing to each contig sequence, resulting in 6,397 probes from the “unclassified” sequences (5,953 with three or more ESTs per contig, 444 with two ESTs per contig). For the “unknown orientation” sequences, an additional selection step followed based on the abundance in SSH libraries and predicted roles in the immune and stress responses, resulting in 458 probes (one sense and one antisense for each of 229 selected sequences) from the “unknown orientation” sequences. Only one probe per candidate sequence was included for all categories of sequences, except for the “custom” sequences where two or more probes per candidate sequence were included. The total number of designed oligonucleotide probes was 20,000.

Oligonucleotide Synthesis and Microarray Production

Oligonucleotide probes were synthesized by the phosphoramidite method at a 1-nmol scale and desalted (Integrated DNA Technologies, Coralville, IA). Isolated DNA was analyzed by matrix-assisted laser desorption/ionization–time of flight mass spectrometry. DNA of low quality was resynthesized. In addition to the 20,000 specific probes described above, the following positive and negative control features were included: cod actin oligo (192 spots), human actin oligo (192 spots), SpotReport Alien Oligos (nos. 4, 6, 7, and 8; Stratagene, La Jolla, CA; 96 spots each), and buffer-only spots (704 spots). Probes were resuspended in sodium phosphate buffer (Schott-Nexterion Spot; Schott, Louisville, KY), to a final concentration of 30 μM, in Genetix X7020 384-well plates (Genetics, Boston, MA) using a Janus liquid-handling robot (PerkinElmer, Waltham, MA) and spotted in duplicate on epoxy microarray slides (Schott-Nexterion Slide E) using an OmniGrid 100 microarrayer (Genomic Solutions, Ann Arbor, MI) equipped with SMT-S50 silicon print pins (Parallel Synthesis Technologies, Santa Clara, CA). The microarray contains 41,472 features (i.e., 20,000 specific probes spotted in duplicate and 1,472 control spots) spotted in 48 blocks of 29 rows and 30 columns each (the last row of each block contains only 24 features). Microarrays were produced at the Atlantic Microarray Facility of the Atlantic Cancer Research Institute (ACRI, Moncton, NB).

Two microarrays of each print batch were analyzed by salt reflection scan and by hybridization with Cy3 SpotQC Detector Oligo (Integrated DNA Technologies) and subsequent scanning on an Axon GenePix 4200AL scanner (Molecular Devices, Sunnyvale, CA) for any defects that may have occurred in spotting. Before use, microarrays were stored at room temperature in vacuum-sealed microarray pouches (Corning, Corning, NY).

The microarray platform is available in the NCBI Gene Expression Omnibus (GEO) under accession no. GPL10532. The annotation file for the probes on the microarray is available as ESM Table S1.

Bacterial Antigen Stimulation and Tissue Sample Collection

The spleen tissue samples used for the microarray study were part of a larger study on Atlantic cod immune response to bacterial antigens, and study setup, bacterial antigen preparation, bacterial antigen stimulation, and tissue collection have been described previously (Feng et al. 2009). In short, 150 juvenile, healthy-appearing Atlantic cod from a single family were divided over three tanks corresponding to three treatments. At the pre-injection time point (0 h), eight individuals were sampled from each tank, euthanized, and spleens and brains collected in RNAse-free 1.5-ml tubes and flash-frozen in liquid nitrogen and stored at −80°C until RNA extraction. The remaining individuals received one of three treatments: IP injection of 100 μl sterile phosphate-buffered saline (PBS; PBS group), IP injection of 100 μl formalin-killed, atypical A. salmonicida (referred to as Asal) in PBS (Asal group), or no handling at all (undisturbed control group). At 2, 6, 24, and 72 h post-injection (HPI) and 1 week post-injection, eight individuals were sampled from each tank, euthanized, and spleens and brains collected as described for the 0 h individuals. The current study used spleen tissues from PBS and Asal fish from 0 h and 24 HPI time points for microarray and QPCR; spleen tissues from undisturbed control fish from all time points were used for the microarray common reference sample (ESM Fig. S2). Immune stimulation and sampling of the fish were carried out in accordance with an Animal Care Utilization Protocol issued by Memorial University of Newfoundland’s Animal Care Committee.

Nodavirus Testing

To determine if any individuals were asymptomatic carriers of nodavirus, an RT-PCR test was conducted. For each individual, 1 μg of DNAse-I-treated, column-purified total RNA from brain tissue was reverse-transcribed using the Moloney murine leukemia virus reverse transcriptase and random hexamers (Invitrogen, Burlington, ON) at 37°C for 50 min. PCR reactions contained 1 U of DyNAzyme EXT DNA polymerase (MJ Research, Waltham, MA), 1X DyNAzyme EXT Buffer, 0.2 mM dNTPs, 4 μl of 10x diluted cDNA (corresponding to 20 ng of input total RNA), and 0.2 μM each of nodavirus-specific primers AC2F1 (5′-GTGGTTACGTGGCTGGCTTC-3′) and AC2R4 (5′-GTTCTGCTTTCCCACCATTTG-3′). PCR cycling conditions consisted of 40 cycles of (94°C for 30 s, 61°C for 30 s, and 72°C for 10 s). For each of the 50 μl PCR reactions, 5.5 μl of 10X Blue juice (Invitrogen) was added, and 15 μl of this mixture was resolved on 1.5% agarose gels using a 100-bp ladder (Invitrogen) as a marker. Gels were photographed under UV light using G:BOX and GeneSnap software (Syngene, Frederick, MD), and the intensities of the bands from the PCR products were quantified as a measure of carrier state using GeneTools (Syngene). Mean pixel intensity background levels, as measured in the lane where the “no template control” PCR product was loaded, were 5,456. Individuals with mean pixel intensities between 5,200 and 7,310 (no band or very low-intensity bands) were classified as “no/low nodavirus carrier” and individuals with pixel intensities above 11,130 (high-intensity, clearly visible bands) were classified as “high nodavirus carrier” as described previously (Rise et al. 2010; ESM Fig. S3).

RNA Extraction

Total RNA extraction was described in detail in Feng et al. (2009). Briefly, total RNA was extracted from spleen tissue samples from individual fish using TRIzol reagent (Invitrogen) according to the manufacturer’s instructions. Total RNA was treated with DNAse I (RNAse-Free DNAse Set; Qiagen, Valencia, CA) and column-purified using the RNeasy MinElute Cleanup kit (Qiagen) according to the manufacturer’s instructions. Quality and quantity of isolated total RNA samples were assessed by agarose gel electrophoresis and NanoDrop spectrophotometry.

Microarray Hybridization

Spleen DNAse-treated and column-purified total RNA samples from six individual fish each from the PBS 0 h, PBS 24 HPI, Asal 0 h, and Asal 24 HPI groups were used for microarray analysis (ESM Fig. S3). Spleen DNAse-treated and column-purified total RNA samples from 31 “undisturbed control” fish from all time points were pooled (with each individual contributing an equal amount of RNA) and used as common reference. For each individual sample, 5 μg of total RNA was labeled with AlexaFluor 647 using the Invitrogen SuperScript Direct cDNA Labeling kit according to the manufacturer’s protocol (Invitrogen). For the pooled common reference, 5 μg of total RNA was labeled with AlexaFluor 555 using the same protocol. Formamide-based hybridization buffer (2x concentrated) and LNA dT blocker (Genisphere, Hatfield, PA) were added to purified, labeled cDNA, and on each microarray, an experimental sample was co-hybridized with the common reference using LifterSlips (Thermo Scientific, Waltham, MA). Hybridizations were performed overnight (~16 h) at 42°C. Detailed protocols for slide pre-hybridization, hybridization, and washing are described in ESM Methods S1.

Microarray Data Acquisition

Tiff images containing fluorescence data were obtained using a ScanArray Gx Plus scanner and ScanExpress v4.0 (Perkin Elmer), and signal intensity data were extracted using Imagene v7.5 (Biodiscovery, El Segundo, CA). Using R and Bioconductor package marray, control spots and flagged spots were removed and data were log₂-transformed and Loess-normalized per subgrid. Raw signal cutoff levels per microarray per channel were calculated as the average of the median background signal + 2 standard deviations, and data were thresholded by replacing normalized log ratios of spots with raw signal values below cutoff with “NA.” Normalized log ratios of flagged spots were also replaced with “NA.” Normalized log ratios from duplicate probes were averaged. The final normalized, thresholded, and averaged dataset contains 20,000 probes. Detailed protocols including the R scripts for these procedures are available in ESM Methods S2. This microarray dataset is described in GEO series GSE22312, and individual sample processed and raw data are available under GEO accession nos. GSM555358–GS555380.

Microarray Data Analysis

Before further data analysis, all probes that were absent in more than 25% of all microarrays were discarded from the dataset, leaving a final dataset of 15,429 probes. Two-class comparison analysis was performed using the Significance Analysis of Microarrays (SAM) algorithm (Tusher et al. 2001) as implemented in the Bioconductor package siggenes (Schwender et al. 2006). First, missing values were imputed using the EM_array method from the LSimpute package (Bø et al. 2004; Celton et al. 2010). After running SAM, a false discovery rate (FDR) cutoff of 0.01 was used to determine significant differentially expressed genes. An additional analysis was performed using a less stringent FDR cutoff of 0.05; results from this analysis are not discussed (they are available as ESM Table S2), but three genes were selected from this list for QPCR analysis (Table 1). Detailed protocols including the R scripts for these procedures are available in ESM Methods S3. In Genesis (Sturn et al. 2002), the data were clustered using Pearson uncentered correlation and complete linkage hierarchical clustering.

Table 1 QPCR primers

Full size table

A final gene list of 82 probes responsive to Asal was manually annotated by comparing the full sequences that the probes were representing against the nr database from NCBI using BLASTX and by choosing the most significant (bit score >40, E value <10⁻⁴) hit with an informative description. Gene Ontology (GO) annotation was added to the gene list by choosing the most significant (bit score >40) human or mouse hit with a reviewed UniProt entry. That UniProt accession number was used to query QuickGO for the associated GO Biological Process entries.

Quantitative RT-PCR Analysis

Fifteen genes were selected for QPCR analysis. For four genes (HAMP, CAMP, GmSCYA123, and IL8), primers were designed and tested and QPCR analysis was performed previously (Feng et al. 2009). Because the QPCR analysis for these four genes was previously performed on a different real-time instrument and using technical duplicates instead of technical triplicates, these four genes were reanalyzed in the current study using the real-time instrument and protocol described here. For the remaining 11 genes, QPCR primers were designed from the sequences that were used to design the microarray probes (Table 1) using the Primer 3 program (http://frodo.wi.mit.edu). Prior to QPCR analysis, a quality control procedure was performed for all newly designed primer pairs. They were first tested using a no reverse transcription control as template, for which none of the selected primer pairs showed detectable amplification. Each primer pair was further tested using a five-point 5x dilution series of cDNA from pooled Asal 24 HPI samples as template. Primer pairs showed a single peak in the dissociation curve and had no amplification in no-template controls. Amplification efficiencies were determined and are listed in Table 1. Ribosomal RNA 18S was chosen as a normalizer gene, and its amplification efficiency and suitability as a normalizer has been determined previously (Rise et al. 2008). Seven fish each from the PBS 0 h, PBS 24 HPI, and Asal 0 h groups and eight fish from the Asal 24 HPI group were used in the QPCR study (ESM Fig. S3). For each sample, 1 μg of column-purified, DNAse-treated total RNA was reverse-transcribed as described in Rise et al. (2008). PCR amplification was performed using Power SYBR Green chemistry and a 7500 Fast Real-Time PCR machine (Applied Biosystems, Foster City, CA) in a total reaction volume of 13 μl containing 1 μl of cDNA (corresponding to 10 ng of input total RNA), 50 nM each of forward and reverse primers, and 1x PowerSYBR mastermix (Applied Biosystems). The PCR program consisted of one cycle of 50°C for 2 min, one cycle of 95°C for 10 min, followed by 40 cycles of 95°C for 15 s and 60°C for 1 min, with data collection after each 60°C step. For each sample, the gene of interest and the normalizer gene were run on the same plate in triplicate. ABI 7500 software v2.0 (Applied Biosystems) was used to automatically determine baseline and threshold settings, except for 18S and interferon-inducible GTPase_b for which the baseline was set manually. Threshold cycle (Ct) values were averaged over triplicates, and for each gene the relative quantity (RQ) values were calculated using the method described by Pfaffl (2001) using the individual with lowest RQ for that gene as calibrator to avoid RQ values lower than 1. When the standard deviation of triplicates was higher than 0.5, Dixon’s Q test (Rorabacher 1991) was used to determine if there was an outlier within the triplicate (at a confidence level of 95%). If an outlier was detected, it was discarded and the average Ct was calculated using the remaining two Ct values. If no outlier was detected, all three Ct values were discarded. For one of the genes (interferon-inducible GTPase_b), Ct values could not be detected in all samples due to extremely low or absent transcript levels in some individuals. To enable statistical analysis of this gene, all samples without detectable Ct values were assigned a Ct value of 39.01, which corresponds to the highest detectable Ct value among the remaining samples for that gene.

Gene expression differences between treatments within each time point, and between time points within each treatment, were determined by t tests on RQ values using SYSTAT 12.0 with a p value cutoff of 0.05.

Results

Microarray Design

Sequences to be represented on the microarray were selected from the CGP EST sequence set All Version 2.0, which contains 154,142 ESTs that were assembled into 23,838 contigs and 27,976 singletons for a total of 51,814 putative transcripts (Bowman et al. 2010; Fig. 1). Since the number of assembled sequences was significantly higher than the 20,000 unique Atlantic cod probes that our microarray layout design was based on, a selection procedure was designed to reduce the number of sequences that exhibit high homology to each other, to maximize the number of sequences that have functional annotation, and to minimize the number of sequences for which homology search in public databases failed to reveal any significant hits (using AutoFACT with a bit score threshold of 40). These latter sequences, indicated as “unclassified” in Fig. 1, underwent an additional highly stringent selection procedure based on the depth of the contig so that no “unclassified” singletons were included on the microarray. We also aimed to minimize the number of sequences with unknown orientation that show no significant homology to any known sequences (as determined by BLASTX search using an E value threshold of 10⁻⁵). However, these sequences all originated from SSH libraries that were enriched for transcripts responsive to pathogens or other stressors and so could potentially represent genes that play an important role in stress and/or immune responses. Therefore, we chose to include 229 of the most highly abundant sequences from this group and designed probes in both orientations for these sequences. No singletons were included on the microarray unless they had significant sequence homology with an annotated entry in a public database.

Because the majority of sequences (19,798 of 20,000) were selected from the complete CGP EST database, which is based on normalized and SSH cDNA libraries from a variety of tissues, developmental stages, and treatments (Bowman et al. 2010), the microarray provides good coverage of genes from many pathways including those involved in stress and immune responses. To ensure the usefulness of the microarray for specific research areas, 58 Atlantic cod sequences were manually selected to represent genes of special interest to CGP research. These include 9 hemoglobin genes, 18 CC chemokines, and 31 genes from three immune-relevant functional groups: interferon-stimulated genes, genes involved in Toll-like receptor signaling, and genes from the RIG-I-like receptor signaling pathway.

Microarray-Based Gene Expression Analysis

Global gene expression profiles were analyzed using spleen RNA samples from four treatment groups: PBS pre-injection (PBS 0 h), PBS 24 h post-injection (PBS 24 HPI), formalin-killed, atypical A. salmonicida pre-injection (Asal 0 h), and formalin-killed, atypical A. salmonicida 24 h post-injection (Asal 24 HPI; ESM Fig. S2). Fish from the PBS groups and Asal groups were kept in separate tanks. For each treatment group, spleen RNA samples from six fish were analyzed, except from the PBS 0 h group where one of the six samples failed due to consistent low signal intensities on the microarray. RT-PCR with brain RNA templates and nodavirus-specific primers showed that some fish involved in this study were asymptomatic carriers of nodavirus (ESM Fig. S3). Comparison of global gene expression profiles of asymptomatic nodavirus “high carrier” and “no/low carrier” pre-injection (0 h) spleen samples using SAM, with an FDR cutoff of 0.05, showed no significant differences in gene expression (data not shown).

To determine which transcripts were responsive to Asal, we compared the Asal 24 HPI group to both the Asal 0 h group and the PBS 24 HPI group using SAM with an FDR cutoff of 0.01. Only the genes found to be responsive in both comparisons were included in the final gene list (Fig. 2). This highly stringent analysis was employed to minimize false positives in the final informative gene list.

When comparing the Asal 0 h and Asal 24 HPI groups, there were 104 differentially expressed probes, all upregulated in response to Asal (Fig. 2 and ESM Table S3). Since the fish in the pre-injection group (Asal 0 h) were never handled during the experiment, this list of 104 probes represents genes responding to stimulation with Asal as well as handling, anesthesia, and injection stress. We sought to identify genes responding only to handling and injection stress by comparing PBS 0 h and PBS 24 HPI groups, but no significant differences were found. In conclusion, all 104 probes were responsive to stimulation with Asal when comparing Asal 0 h and Asal 24 HPI groups.

When comparing the two post-injection groups, PBS 24 HPI and Asal 24 HPI, we identified 1,313 differentially expressed probes, of which 1,012 were upregulated and 301 were downregulated in response to Asal (Fig. 2 and ESM Table S4). Since the “PBS fish” and “Asal fish” were held in two separate tanks, this list of 1,313 probes represents genes responding to stimulation with Asal and/or differences between tanks. The genes that were associated with differences between tanks were determined by comparing the two pre-injection groups PBS 0 h and Asal 0 h, resulting in 31 informative probes, of which 12 were also present in the list of 1,313 probes from the PBS 24 HPI and Asal 24 HPI comparison (Fig. 2 and ESM Table S4). These 12 probes that were associated with tank differences were removed from the list of 1,313 probes. In conclusion, 1,301 probes were responsive to stimulation with Asal when comparing PBS 24 HPI and Asal 24 HPI groups (indicated in bold type in ESM Table S4).

The final list consisted of 82 probes that were all significantly upregulated in response to Asal in both comparisons (Asal 0 h with Asal 24 HPI and PBS 24 HPI with Asal 24 HPI, i.e., present in both lists of 104 and 1,301 probes; Fig. 2). This final list is available as ESM Table S5. The samples and probes were clustered according to the expression of these 82 probes using a hierarchical clustering algorithm (Fig. 3). Clustering shows a clear general distinction between the Asal 24 HPI samples and the control samples (Asal 0 h, PBS 0 h, and PBS 24 HPI), with the exception of one control sample from the PBS 24HPI group that clusters together with the Asal 24 HPI group. Within the Asal 24 HPI group, the leftmost sample clusters away from the other samples in this group and shows a weaker upregulation of gene expression. The list of 82 Asal-responsive probes was manually annotated with gene names and GO Biological Process entries using BLASTX (E value <10⁻⁵, bit score >40) and QuickGo (ESM Table S5A). There is some redundancy in annotation of the list and the 82 probes represent 71 unique genes, of which 51 had a gene name annotation (ESM Table S5B). The other 20 were “unclassified” (i.e., had no significant homology with any sequence in the NCBI nr database). Twenty-seven genes were associated with one or more Biological Process GO entries. The GO entries that were most common (i.e., represented by three or more genes) were “proteolysis,” “transport,” “immune response,” and “oxidation reduction” (ESM Table S6). Because the number of genes with GO annotation was low, a second functional classification was made based on information from literature, GO annotation, UniProt, and Entrez Gene databases. If no functional information specific for cod or other teleosts was available, information for putative human or mouse orthologs was used. The most abundant categories were “immune, inflammatory and bactericidal response,” “proteolysis,” and “transport.” These categories and the genes associated with them are listed in Table 2.

Table 2 Selection of genes that show increased expression upon stimulation with formalin-killed, atypical A. salmonicida

Full size table

QPCR Analysis

To confirm the results of the microarray, 12 genes from the 71 unique genes responsive to Asal were chosen for QPCR analysis (Table 1): cathelicidin (CAMP), CC chemokine “CCL19 group” GmSCYA123, hepcidin (HAMP), complement component 1, s subcomponent (C1S), CCAAT/enhancer-binding protein beta 2 (CEBPB2), cathepsin L (CTSL), stromal cell-derived factor 1 precursor (SDF1), unclassified gene all_v2.0.2958.C1, unclassified gene all_v2.0.6615.C2, bactericidal permeability increasing protein/lipopolysaccharide binding protein variant b (BPI/LBP), interferon-inducible GTPase_b (IIGP_b), and cytochrome b-245 beta polypeptide (CYBB). Three genes that were Asal-responsive in a less stringent analysis (using an FDR cutoff of 0.05 instead of 0.01; ESM Table S2), interleukin 8 (IL8), CC chemokine “fish group” GmSCYA104, and interferon-inducible GTPase_a (IIGP_a), were added to provide further validation of the microarray results. This makes a total of 15 microarray-identified genes that were subjected to QPCR. Thirteen of those were chosen based on their functional annotation suggesting a role in the immune or defense response; the remaining two were selected from the “unclassified” genes (all_v2.0.2958.C1 and all_v2.0.6615.C2). Eight of the 15 genes we selected that were identified by microarray as Asal-responsive were also identified as Asal-responsive in SSH analysis (Feng et al. 2009). Results from the QPCR analysis of spleen tissue for these eight genes are shown in Fig. 4. Of these genes, CAMP, GmSCYA123, HAMP, and IL8 were analyzed previously by QPCR using the same spleen samples but with a different QPCR instrument and using technical duplicates instead of technical triplicates (Feng et al. 2009). QPCR for these genes was repeated for the current study to ensure that all genes were analyzed using the same protocol. Seven of the 15 genes we selected were newly identified as Asal-responsive by microarray analysis, and results from QPCR analysis for these genes are shown in Fig. 5. Of 15 genes tested, 12 showed a significant difference in gene expression levels (p < 0.05) between Asal 0 h and Asal 24 HPI samples and/or between PBS 24 HPI and Asal 24 HPI samples, as expected from the microarray results. For three genes (CYBB, C1S, and all_v2.0.2958.C1), QPCR could not confirm a significant transcript expression response to Asal.

As was found with SAM analysis of the microarray data, QPCR data did not show significant differences in spleen transcript expression levels between asymptomatic nodavirus “high carrier” and “no/low carrier” pre-injection samples (data not shown). QPCR data for individual samples are included as ESM Table S7.

Discussion

Highly Stringent Analysis of Microarray Data Identifies 82 Probes That Were Responsive to Stimulation with Formalin-Killed, Atypical A. salmonicida

Because this study was set up to test the newly developed Atlantic cod 20K microarray platform, we chose to use a stringent selection of the final Asal-responsive gene list. Therefore, all gene lists were based on an FDR cutoff of 0.01 and the final responsive gene list contained only the 82 probes that were responsive to Asal when compared to both unstimulated and PBS-stimulated fish.

The list of Asal-responsive probes from the comparison between Asal 24 HPI and Asal 0 h was considerably smaller (104 probes) than the list of Asal-responsive probes from the comparison between Asal 24 HPI and PBS 24 HPI (1,313 probes; Fig. 2 and ESM Tables S3 and S4). Closer investigation by hierarchical clustering of the samples using these gene lists gives a possible explanation for this (ESM Fig. S4). When clustering the Asal 24 HPI and PBS 24 HPI samples using the 1,313 probes that were differentially expressed, the results showed that overall within-group variation between individuals was comparable between these two groups, apart from one outlier sample in the Asal 24 HPI group. However, when clustering the Asal 0 h and Asal 24 HPI samples using the 104 probes that were differentially expressed, within-group variation in the Asal 0 h group was much larger than within-group variation in the Asal 24 HPI group. Because the SAM algorithm is based on a modified t test, large variability within a group could negatively impact the statistical power to detect significant gene expression differences, which could explain the shorter list of Asal-responsive genes found when comparing the Asal 0 h and Asal 24 HPI groups. Despite this, we have chosen to restrict the final Asal-responsive gene list to the overlap of both comparisons to minimize the possibility of false positives, which is preferable to minimizing false negatives for validation of the new 20K microarray platform.

The final list of 82 Asal-responsive probes represents 71 unique genes. The list includes nine probes for six related CC chemokines, which belong to a phylogenetic clade of teleost CC chemokines with no equivalent in mammals, and are therefore designated “fish group” CC chemokines (Borza et al. 2010). Because these genes are related (see Supplementary data 2 in Borza et al. 2010) and the probes share different levels of homology with each other, the mRNA expression levels of individual “fish group” CC chemokines as measured by the microarray are probably affected by the presence of mRNA from related “fish group” CC chemokines. Therefore, expression levels of individual “fish group” CC chemokines have to be interpreted with care. In Table 2, all “fish group” CC chemokine genes have been grouped under one entry.

QPCR Analysis and Comparison with SSH Analysis Confirm Validity of Microarray Results

For 12 of the 15 genes (80%) that were chosen for QPCR validation, the QPCR analysis confirmed that they were significantly upregulated (p < 0.05) in response to Asal stimulation (indicated in ESM Table S5B). There are several possible explanations why QPCR and microarray results were different for the other three genes, such as different location of microarray probe and QPCR amplicon, possible misassembly of contigs, and differences in specificity between the QPCR assay and the corresponding informative 50-mer microarray probe.

The spleen tissue samples used in the microarray experiments were previously used for SSH analysis (Feng et al. 2009). When comparing the 71 unique Asal-responsive genes to the 820 assembled contigs and singletons from the spleen forward SSH library (i.e., enriched for transcripts that were upregulated in response to Asal; Supplementary Table S1A in Feng et al. 2009), there was an overlap of 13 genes (indicated in ESM Table S5B). The SSH libraries were constructed from samples from multiple sampling times (2, 6, 24, and 72 HPI), and so it is expected that a large number of SSH sequences will not be found in the current microarray analysis which is limited to one sampling time point (24 HPI).

The three deepest contigs reported in the spleen forward SSH library were small inducible cytokine SCYA (which is GmSCYA123, a CC chemokine of the “CCL19 group”; Borza et al. 2010), cathelicidin, and hemoglobin subunit beta 1 (Feng et al. 2009). Both GmSCYA123 and cathelicidin were found in the list of Asal-responsive genes as determined by microarray and showed some of the highest transcript level changes (respectively 17 and 6 times higher in the Asal 24 HPI group than in the Asal 0 h group). Hemoglobin subunit beta 1 was not found to be responsive to Asal in the current microarray study, but this is not surprising since hemoglobin subunit beta 1 was also represented by two deep contigs (of 15 and 7 ESTs) in the reverse (i.e., enriched for transcripts that were downregulated by Asal) spleen SSH library (Supplementary Table S1B in Feng et al. 2009). These data suggest that hemoglobin subunit beta 1 transcripts were highly abundant in both tester and driver RNA pools involved in SSH library construction rather than up- or downregulated in response to Asal stimulation.

Functional Analysis Shows a Large Number of Formalin-Killed, Atypical A. salmonicida-Responsive Genes are Immune-Related

Analysis of GO annotations and manual functional annotations both showed that a large number of genes were involved in immune response or associated functions such as inflammatory response and bactericidal response. In the hierarchical clustering procedure, a large number of these genes clustered together. In Fig. 3, two colored bars indicate two such clusters; the blue cluster contains genes with antimicrobial functions such as cathelicidin, hepcidin, and bactericidal/permeability-increasing protein (BPI)/lipopolysaccharide-binding protein (LBP), and the orange cluster contains most of the CC chemokine “fish group” probes. Previously, SSH analysis of these samples also showed that “immune response” and “proteolysis” were the most abundant GO categories in the spleen forward library (i.e., genes that were upregulated by stimulation with Asal; Feng et al. 2009).

The current microarray study has identified some additional Asal-responsive genes that were not found in SSH analysis, such as BPI/LBP, several “fish group” CC chemokines, syndecan-2, and a novel protein similar to MALT1. BPI/LBP and the “fish group” CC chemokine GmSCYA104 were also confirmed to be Asal-responsive by QPCR. Atlantic cod BPI/LBP is a probable descendant of the ancestral gene that gave rise to mammalian BPI and LBP genes (Stenvik et al. 2004), like its putative ortholog identified in rainbow trout (Inagawa et al. 2002). Based on mRNA expression, protein charge, and amino acid sequence, Stenvik et al. (2004) suggest that Atlantic cod BPI/LBP might have a function that is more similar to that of mammalian BPI than LBP. Mammalian BPI has strong antimicrobial activity against Gram-negative bacteria and can neutralize endotoxins (reviewed in Schultz and Weiss 2007). Stenvik et al. (2004) also showed that BPI/LBP mRNA expression is upregulated in cod spleen after stimulation with formalin-killed Vibrio anguillarum, similar to the response we have shown to stimulation with Asal. The “fish group” CC chemokines have both a homeostatic and an inflammatory function and have been shown previously to be upregulated by stimulation with Asal (Borza et al. 2010). Syndecan-2 is a cell surface heparan sulfate proteoglycan that is a co-receptor for IL8, thought to immobilize IL8 on the cell surface to mediate communication between endothelial cells and neutrophils in humans (Halden et al. 2004). The human Malt1 protein, when in complex with Bcl10, can activate NFκB (Lucas et al. 2001, 2004). Interestingly, Martin et al. (2009) found that the CBM complex, containing Carma3, Bcl10, and Malt1, was a key component of the activation of NFκB by IL8.

There are some well-known genes involved in the antibacterial response, such as g-type lysozyme, transferrin, ferritin, and MH class I, which are missing from our stringent list of 82 Asal-responsive probes. However, these genes are present in the less stringent list of 1,313 Asal-responsive probes that are informative when comparing Asal 24HPI to PBS 24HPI (ESM Table S4). As mentioned earlier, hierarchical clustering revealed that the Asal 0 h samples’ transcript expression profiles were more heterogeneous than those of the other groups. This may be the cause of the relatively short Asal-responsive probe list when comparing Asal 24 HPI to Asal 0 h (ESM Table S3), and of the absence of a number of “expected” genes from this list, which therefore are also missing from our stringent final gene list.

Recently, Caipang et al. (2010) reported on gene expression changes in Atlantic cod gill tissue in response to A. salmonicida using semiquantitative RT-PCR. They showed an increase in expression of several antibacterial and cytokine genes. Our microarray study corroborated these results and identified a large number of additional genes involved in antimicrobial, inflammatory, and immune responses that are responsive to Asal. Some of the genes that were Asal-responsive in our study were also found to be responsive to A. salmonicida in other species such as Atlantic salmon (Salmo salar; Ewart et al. 2005; Fast et al. 2007; Martin et al. 2006), rainbow trout (Oncorhynchus mykiss; Mulder et al. 2007), and Atlantic halibut (Hippoglossus hippoglossus; Park et al. 2005).

Overall, our analysis shows that in response to stimulation with Asal, a large number of genes involved in the innate immune response are upregulated. Since Atlantic cod is known to mount at best a weak specific antibody response to pathogens, this strong innate immune response might be an alternative mechanism in the defense of Atlantic cod against bacterial pathogens.

The Atlantic cod 20K Oligonucleotide Microarray is a Valuable New Tool for Genomics Research

Our newly developed Atlantic cod 20K oligonucleotide microarray platform is a large-scale microarray that contains genes from a large variety of tissues and involved in many pathways. There are only two other Atlantic cod microarray platforms currently published: the IMR Atlantic cod 16K cDNA microarray (Edvardsen et al. 2010) and the small-scale CodStress microarray with genes involved in stress and immune responses (Lie et al. 2009). The IMR EST database used by Edvardsen et al. (2010) is relatively small (it consists of ~44,000 ESTs from 20 un-normalized cDNA libraries), and sequences for the IMR 16K cDNA microarray were only selected from ten of those cDNA libraries, representing ~27,000 ESTs. In addition, 3,967 of the 16,000 sequences selected were singletons without annotation. In contrast, our Atlantic cod 20K oligonucleotide array contains sequences selected from a much larger collection of 154,142 ESTs from 42 normalized and SSH cDNA libraries, representing a diverse set of tissues, developmental stages, and stimuli, and no un-annotated singletons were included on the array. Therefore, we anticipate our Atlantic cod 20K oligonucleotide microarray to be an excellent representation of the Atlantic cod transcriptome in a broad range of conditions and thus a valuable tool for Atlantic cod functional genomics research. The CodStress microarray contains genes that originated from tissues of Atlantic cod exposed to toxicants (Lie et al. 2009), a treatment that is not represented in the collection of CGP cDNA libraries. Therefore, the CodStress microarray and the newly developed Atlantic cod 20K oligonucleotide microarray could supplement each other in toxicology and other research. Together with the recently constructed SNP platform and linkage map (Hubert et al. 2010), and an ongoing effort to sequence the full Atlantic cod genome (Johansen et al. 2009), they form an important set of tools for marker-assisted selection to improve Atlantic cod aquaculture.

In summary, we developed and validated the CGP Atlantic cod 20K oligonucleotide microarray. The selection of sequences from a diverse collection of ESTs ensures that the microarray can be applied in a broad spectrum of Atlantic cod functional genomics studies. The incorporation of sequences selected from SSH libraries, which were enriched for defense-related genes, makes this microarray platform particularly useful for studies of Atlantic cod global gene expression responses to immune stimuli and other stressors. We demonstrated this by using the 20K microarray to analyze the Atlantic cod spleen response to IP injection with formalin-killed, atypical A. salmonicida, resulting in a gene expression profile that indicates strong antimicrobial and innate immune responses.

References

Benson G (1999) Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res 27:573–580
Article PubMed CAS Google Scholar
Borza T, Stone C, Gamperl AK, Bowman S (2009) Atlantic cod (Gadus morhua) hemoglobin genes: multiplicity and polymorphism. BMC Genet 10:51
Article PubMed Google Scholar
Borza T, Stone C, Rise ML, Bowman S, Johnson SC (2010) Atlantic cod (Gadus morhua) CC chemokines: diversity and expression analysis. Dev Comp Immunol 34:904–913
Article PubMed CAS Google Scholar
Bowman S, Hubert S, Higgins B, Stone C, Kimball J, Borza T, Bussey JT, Simpson G, Kozera C, Curtis BA, Hall JR, Hori TS, Feng CY, Rise M, Booman M, Gamperl AK, Trippel E, Symonds J, Johnson SC, Rise ML (2010) An integrated approach to gene discovery and marker development in Atlantic cod (Gadus morhua). Mar Biotechnol (NY). doi:10.1007/s10126-010-9285-z
Bø TH, Dysvik B, Jonassen I (2004) LSimpute: accurate estimation of missing values in microarray data with least squares methods. Nucleic Acids Research 32:e34
Article PubMed Google Scholar
Brown JA, Minkoff G, Puvanendran V (2003) Larviculture of Atlantic cod (Gadus morhua): progress, protocols and problems. Aquaculture 227:357–372
Article Google Scholar
Caipang CM, Lazado CC, Brinchmann MF, Kiron V (2010) Infection-induced changes in expression of antibacterial and cytokine genes in the gill epithelial cells of Atlantic cod, Gadus morhua during incubation with bacterial pathogens. Comp Biochem Physiol B Biochem Mol Biol 156:319–325
Article PubMed Google Scholar
Celton M, Malpertuy A, Lelandais G, de Brevern AG (2010) Comparative analysis of missing value imputation methods to improve clustering and interpretation of microarray experiments. BMC Genomics 11:15
Article PubMed Google Scholar
Edvardsen RB, Malde K, Mittelholzer C, Taranger GL, Nilsen F (2010) EST resources and establishment and validation of a 16k cDNA microarray from Atlantic cod (Gadus morhua). Comp Biochem Physiol Part D Genomics Proteomics. doi:10.1016/j.cbd.2010.06.002
Ewart KV, Belanger JC, Williams J, Karakach T, Penny S, Tsoi SC, Richards RC, Douglas SE (2005) Identification of genes differentially expressed in Atlantic salmon (Salmo salar) in response to infection by Aeromonas salmonicida using cDNA microarray technology. Dev Comp Immunol 29:333–347
Article PubMed CAS Google Scholar
Fast MD, Johnson SC, Jones SR (2007) Differential expression of the pro-inflammatory cytokines IL-1beta-1, TNFalpha-1 and IL-8 in vaccinated pink (Oncorhynchus gorbuscha) and chum (Oncorhynchus keta) salmon juveniles. Fish Shellfish Immunol 22:403–407
Article PubMed CAS Google Scholar
Feng CY, Johnson SC, Hori TS, Rise M, Hall JR, Gamperl AK, Hubert S, Kimball J, Bowman S, Rise ML (2009) Identification and analysis of differentially expressed genes in immune tissues of Atlantic cod stimulated with formalin-killed, atypical Aeromonas salmonicida. Physiol Genomics 37:149–163
Article PubMed CAS Google Scholar
Gollock MJ, Currie S, Petersen LH, Gamperl AK (2006) Cardiovascular and haematological responses of Atlantic cod (Gadus morhua) to acute temperature increase. J Exp Biol 209:2961–2970
Article PubMed CAS Google Scholar
Gudmundsdóttir S, Magnadóttir B, Björnsdóttir B, Arnadóttir H, Gudmundsdóttir BK (2009) Specific and natural antibody response of cod juveniles vaccinated against Vibrio anguillarum. Fish Shellfish Immunol 26:619–624
Article PubMed Google Scholar
Halden Y, Rek A, Atzenhofer W, Szilak L, Wabnig A, Kungl AJ (2004) Interleukin-8 binds to syndecan-2 on human endothelial cells. Biochem J 377:533–538
Article PubMed CAS Google Scholar
Hansen T, Karlsen O, Taranger GL, Hemre GI, Holm JC, Kjesbu OS (2001) Growth, gonadal development and spawning time of Atlantic cod (Gadus morhua) reared under different photoperiods. Aquaculture 203:51–67
Article Google Scholar
Hori TS, Gamperl AK, Afonso LO, Johnson SC, Hubert S, Kimball J, Bowman S, Rise ML (2010) Heat-shock responsive genes identified and validated in Atlantic cod (Gadus morhua) liver, head kidney and skeletal muscle using genomic techniques. BMC Genomics 11:72
Article PubMed Google Scholar
Hubert S, Higgins B, Borza T, Bowman S (2010) Development of a SNP resource and a genetic linkage map for Atlantic cod (Gadus morhua). BMC Genomics 11:191
Article PubMed Google Scholar
Inagawa H, Honda T, Kohchi C, Nishizawa T, Yoshiura Y, Nakanishi T, Yokomizo Y, Soma G (2002) Cloning and characterization of the homolog of mammalian lipopolysaccharide-binding protein and bactericidal permeability-increasing protein in rainbow trout Oncorhynchus mykiss. J Immunol 168:5638–5644
PubMed CAS Google Scholar
Johansen SD, Coucheron DH, Andreassen M, Karlsen BO, Furmanek T, Jørgensen TE, Emblem A, Breines R, Nordeide JT, Moum T, Nederbragt AJ, Stenseth NC, Jakobsen KS (2009) Large-scale sequence analyses of Atlantic cod. N Biotechnol 25:263–271
Article PubMed CAS Google Scholar
Koski LB, Gray MW, Lang BF, Burger G (2005) AutoFACT: an automatic functional annotation and classification tool. BMC Bioinformatics 6:151
Article PubMed Google Scholar
Lie KK, Lanzen A, Breilid H, Olsvik PA (2009) Gene expression profiling in Atlantic cod (Gadus morhua L.) from two contaminated sites using a custom-made cDNA microarray. Environ Toxicol Chem 28:1711–1721
Article PubMed CAS Google Scholar
Lucas PC, Yonezumi M, Inohara N, McAllister-Lucas LM, Abazeed ME, Chen FF, Yamaoka S, Seto M, Nunez G (2001) Bcl10 and MALT1, independent targets of chromosomal translocation in malt lymphoma, cooperate in a novel NF-kappa B signaling pathway. J Biol Chem 276:19012–19019
Article PubMed CAS Google Scholar
Lucas PC, McAllister-Lucas LM, Nunez G (2004) NF-kappaB signaling in lymphocytes: a new cast of characters. J Cell Sci 117:31–39
Article PubMed CAS Google Scholar
Lund V, Børdal S, Kjellsen O, Mikkelsen H, Schrøder MB (2006) Comparison of antibody responses in Atlantic cod (Gadus morhua L.) to Aeromonas salmonicida and Vibrio anguillarum. Dev Comp Immunol 30:1145–1155
Article PubMed CAS Google Scholar
Lund V, Børdal S, Schrøder MB (2007) Specificity and durability of antibody responses in Atlantic cod (Gadus morhua L.) immunised with Vibrio anguillarum O2b. Fish Shellfish Immunol 23:906–910
Article PubMed CAS Google Scholar
Magnadottir B, Gudmundsdottir S, Gudmundsdottir BK, Helgason S (2009) Natural antibodies of cod (Gadus morhua L.): specificity, activity and affinity. Comp Biochem Physiol B Biochem Mol Biol 154:309–316
Article PubMed Google Scholar
Martin SA, Blaney SC, Houlihan DF, Secombes CJ (2006) Transcriptome response following administration of a live bacterial vaccine in Atlantic salmon (Salmo salar). Mol Immunol 43:1900–1911
Article PubMed CAS Google Scholar
Martin D, Galisteo R, Gutkind JS (2009) CXCL8/IL8 stimulates vascular endothelial growth factor (VEGF) expression and the autocrine activation of VEGFR2 in endothelial cells by activating NFkappaB through the CBM (Carma3/Bcl10/Malt1) complex. J Biol Chem 284:6038–6042
Article PubMed CAS Google Scholar
Miller KM, Kaukinen KH, Schulze AD (2002) Expansion and contraction of major histocompatibility complex genes: a teleostean example. Immunogenetics 53:941–963
Article PubMed CAS Google Scholar
Mulder IE, Wadsworth S, Secombes CJ (2007) Cytokine expression in the intestine of rainbow trout (Oncorhynchus mykiss) during infection with Aeromonas salmonicida. Fish Shellfish Immunol 23:747–759
Article PubMed CAS Google Scholar
Park KC, Osborne JA, Tsoi SC, Brown LL, Johnson SC (2005) Expressed sequence tags analysis of Atlantic halibut (Hippoglossus hippoglossus) liver, kidney and spleen tissues following vaccination against Vibrio anguillarum and Aeromonas salmonicida. Fish Shellfish Immunol 18:393–415
Article PubMed CAS Google Scholar
Persson AC, Stet RJ, Pilström L (1999) Characterization of MHC class I and beta(2)-microglobulin sequences in Atlantic cod reveals an unusually high number of expressed class I genes. Immunogenetics 50:49–59
Article PubMed CAS Google Scholar
Pérez-Casanova JC, Rise ML, Dixon B, Afonso LO, Hall JR, Johnson SC, Gamperl AK (2008) The immune and stress responses of Atlantic cod to long-term increases in water temperature. Fish Shellfish Immunol 24:600–609
Article PubMed Google Scholar
Pfaffl MW (2001) A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Research 29:e45
Article PubMed CAS Google Scholar
Pilstrom L, Warr GW, Stromberg S (2005) Why is the antibody response of Atlantic cod so poor? The search for a genetic explanation. Fisheries Science 71:961–971
Article Google Scholar
Rise ML, Hall J, Rise M, Hori T, Gamperl AK, Kimball J, Hubert S, Bowman S, Johnson SC (2008) Functional genomic analysis of the response of Atlantic cod (Gadus morhua) spleen to the viral mimic polyriboinosinic polyribocytidylic acid (pIC). Dev Comp Immunol 32:916–931
Article PubMed CAS Google Scholar
Rise ML, Liu Z, Douglas SE, Brown LL, Nash JHE, McFall-Ngai MJ (2009) Aquaculture-related applications of DNA microarray technology. In: Overturf K (ed) Molecular research in aquaculture. Blackwell, Ames, pp 63–101
Chapter Google Scholar
Rise ML, Hall JR, Rise M, Hori TS, Browne MJ, Gamperl AK, Hubert S, Kimball J, Bowman S, Johnson SC (2010) Impact of asymptomatic nodavirus carrier state and intraperitoneal viral mimic injection on brain transcript expression in Atlantic cod (Gadus morhua). Physiol Genomics 42:266–280
Article PubMed CAS Google Scholar
Rorabacher DB (1991) Statistical treatment for rejection of deviant values: critical values of Dixon’s “Q” parameter and related subrange ratios at the 95% confidence level. Anal Chem 63:139–146
Article CAS Google Scholar
Samuelsen OB, Nerland AH, Jørgensen T, Schrøder MB, Svåsand T, Bergh O (2006) Viral and bacterial diseases of Atlantic cod Gadus morhua, their prophylaxis and treatment: a review. Dis Aquat Organ 71:239–254
Article PubMed Google Scholar
Schultz H, Weiss JP (2007) The bactericidal/permeability-increasing protein (BPI) in infection and inflammatory disease. Clin Chim Acta 384:12–23
Article PubMed CAS Google Scholar
Schwender H, Krause A, Ickstadt K (2006) Identifying interesting genes with siggenes. RNews 6:45–50
Google Scholar
Solem ST, Stenvik J (2006) Antibody repertoire development in teleosts—a review with emphasis on salmonids and Gadus morhua L. Dev Comp Immunol 30:57–76
Article PubMed CAS Google Scholar
Stenvik J, Solstad T, Strand C, Leiros I, TØ JT (2004) Cloning and analyses of a BPI/LBP cDNA of the Atlantic cod (Gadus morhua L.). Dev Comp Immunol 28:307–323
Article PubMed CAS Google Scholar
Sturn A, Quackenbush J, Trajanoski Z (2002) Genesis: cluster analysis of microarray data. Bioinformatics 18:207–208
Article PubMed CAS Google Scholar
Tusher VG, Tibshirani R, Chu G (2001) Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci USA 98:5116–5121
Article PubMed CAS Google Scholar

Download references

Acknowledgments

We would like to thank the fish husbandry staff at the Dr. Joe Brown Aquatic Research Building (Ocean Sciences Centre, MUN) as well as everyone involved in tissue sampling. We also thank Yves Daigle at the Atlantic Microarray Facility at ACRI (www.atlanticcancer.ca/amf) for coordinating the production of the microarrays. This research was supported in part by Genome Canada, Genome Atlantic, and the Atlantic Canada Opportunities Agency through the Atlantic Cod Genomics and Broodstock Development Project. A complete list of supporting partners can be found at www.codgene.ca/partners.php. Funding was also provided by a Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grant and a Canada Research Chair to MLR, and through a NSERC Major Facilities Grant to the Ocean Sciences Centre (Memorial University of Newfoundland, MUN).

Open Access

This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Author information

Authors and Affiliations

Ocean Sciences Centre, Memorial University of Newfoundland, 1 Marine Lab Road, St. John’s, NL, Canada, A1C 5S7
Marije Booman, Charles Y. Feng, Tiago S. Hori, Marlies Rise, A. Kurt Gamperl & Matthew L. Rise
Genome Atlantic, 1721 Lower Water Street, Halifax, NS, Canada, B3J 1S5
Tudor Borza, Brent Higgins, Marlies Rise, Sophie Hubert & Sharen Bowman
Atlantic Microarray Facility, Atlantic Cancer Research Institute, 35 Providence Street, Moncton, NB, Canada, E1C 8X3
Adrian Culf, Daniel Léger, Ian C. Chute, Anissa Belkaid & Rodney J. Ouellette
NRC Institute for Marine Biosciences, 1411 Oxford Street, Halifax, NS, Canada, B3H 3Z1
Jennifer Kimball
Pacific Biological Station, Fisheries and Oceans Canada, Nanaimo, BC, Canada, V9T 6N7
Stewart C. Johnson
Canada Research Chair (Tier 2) in Marine Biotechnology, Ocean Sciences Centre, Memorial University of Newfoundland, 1 Marine Lab Road, St. John’s, NL, Canada, A1C 5S7
Matthew L. Rise

Authors

Marije Booman
View author publications
You can also search for this author in PubMed Google Scholar
Tudor Borza
View author publications
You can also search for this author in PubMed Google Scholar
Charles Y. Feng
View author publications
You can also search for this author in PubMed Google Scholar
Tiago S. Hori
View author publications
You can also search for this author in PubMed Google Scholar
Brent Higgins
View author publications
You can also search for this author in PubMed Google Scholar
Adrian Culf
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Léger
View author publications
You can also search for this author in PubMed Google Scholar
Ian C. Chute
View author publications
You can also search for this author in PubMed Google Scholar
Anissa Belkaid
View author publications
You can also search for this author in PubMed Google Scholar
Marlies Rise
View author publications
You can also search for this author in PubMed Google Scholar
A. Kurt Gamperl
View author publications
You can also search for this author in PubMed Google Scholar
Sophie Hubert
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Kimball
View author publications
You can also search for this author in PubMed Google Scholar
Rodney J. Ouellette
View author publications
You can also search for this author in PubMed Google Scholar
Stewart C. Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Sharen Bowman
View author publications
You can also search for this author in PubMed Google Scholar
Matthew L. Rise
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Matthew L. Rise.

Electronic Supplementary Material

Below is the link to the electronic supplementary material.

Supplementary Figure S1

Overview of probe design process (GIF 65.7 kb)

High resolution image file (TIFF 405 kb)

Supplementary Figure S2

Experimental setup and tissue collection. Individuals were divided over three tanks corresponding to treatment (UC, PBS, or Asal). Gray boxes indicate which sample groups were used for the common reference in the microarray analysis; black boxes indicate which sample groups were used in the microarray and QPCR analysis (GIF 46.6 kb)

High resolution image file (TIFF TIFF 286 kb)

Supplementary Figure S3

Nodavirus RT-PCR test and sample overview. Gel image of the nodavirus RT-PCR test. Samples with a high intensity, clearly visible band (pixel intensities >11,130) were assigned asymptomatic nodavirus “high carrier” carrier status (indicated by plus signs) and samples with a very low intensity or absent band (pixel intensities 5,200–7,310) were assigned asymptomatic nodavirus “no/low” carrier status (indicated by minus signs). Sample PBS 0 h fish no. 4 showed a weak positive band in previous studies that used a different RT-PCR protocol (i.e., higher quantity of cDNA template, different thermal cycler, and higher amount of PCR product loaded on gel; Rise et al. 2008, 2010); in previous QPCR analysis for four genes (ISG15, DHX58, RSAD2, and SACS), this individual behaved as a “no/low” carrier sample (Rise et al. 2010). Circles and squares indicate which samples were used in microarray and QPCR analysis, respectively. The black circle indicates the sample that failed microarray hybridization (GIF 21.3 kb)

High resolution image file (TIFF 162 kb)

Supplementary Figure S4

Hierarchical cluster analysis shows differences in within-group variability. Left panel Hierarchical clustering of PBS 24 HPI and Asal 24 HPI samples using the 1,313 probes that were differentially expressed. Right panel Hierarchical clustering of Asal 0 h and Asal 24 HPI samples using the 104 probes that were differentially expressed. Not all probes are shown in this picture. A larger vertical distance in the sample tree represents a larger difference between samples. This figure shows that within-group variability between PBS 24 HPI and Asal 24 HPI is comparable (apart from the outlier Asal 24 HPI sample). Within-group variability in Asal 0 h is, however, much larger than in Asal 24HPI (GIF 39.5 kb)

High resolution image file (TIFF 462 kb)

Supplementary Figure S5

Hierarchical clustering of 82 probes that are responsive to stimulation with formalin-killed, atypical A. salmonicida (large version). Sample groups are indicated at the top. Asal 24 HPI (red); Asal 0 h (green); PBS 0 h (blue); PBS 24 HPI (yellow). Two outlier individuals are indicated with an asterisk. Probe ID and description are indicated on the right side. Two gene clusters have been highlighted (see “Discussion”): antimicrobial genes (blue); CC chemokines (orange) (PDF 367 kb)

Supplementary Table S1

Microarray layout and annotation (XLS 5027 kb)

Supplementary Table S2

Probes responsive to injection with formalin-killed, atypical A. salmonicida at less stringent FDR cutoff of 0.05. Supplementary Table S2A: Final probe list. Probes were manually annotated using BLASTX. For probe description annotation, the first significant hit with informative annotation was used. For UniProt annotation, the first significant human or mouse hit with reviewed UniProt entry was used. Supplementary Table S2B: Unique gene list. In green are genes that were present in the spleen SSH forward library of Feng et al. (2009)). In bold are genes that were confirmed by QPCR. For genes that were represented by multiple probes, the average fold change and SEM was calculated. Information on probe ID and name was removed for those genes (XLS 115 kb)

Supplementary Table S3

Probes responsive to Asal when comparing Asal 0 h and Asal 24 HPI groups (XLS 29 kb)

Supplementary Table S4

Differentially expressed probes between PBS 24 HPI and Asal 24 HPI. Differentially expressed probes between PBS 24 HPI and Asal 24 HPI. Twelve probes were also found to be differentially expressed between PBS 0 h and Asal 0 h. These are indicated in the column “associated with tank differences.” The remaining 1,301 probes (in bold type) are responsive to Asal when comparing PBS 24 HPI and Asal 24 HPI groups (XLS 215 kb)

Supplementary Table S5

Probes responsive to injection with formalin-killed, atypical A. salmonicida. Supplementary Table S5A: Final probe list. Probes were manually annotated using BLASTX. For probe description annotation, the first significant hit with informative annotation was used. For UniProt annotation, the first significant human or mouse hit with reviewed UniProt entry was used. Supplementary Table S5B: Unique gene list. In green are genes that were present in the spleen SSH forward library of Feng et al. (2009)). In bold are genes that were confirmed by QPCR. For genes that were represented by multiple probes, the average fold change and SEM was calculated. Information on probe ID and name was removed for those genes (XLS 52 kb)

Supplementary Table S6

GO Biological process annotations of the final unique gene list (XLS 28 kb)

Supplementary Table S7

QPCR data for Asal-responsive genes as determined by microarray (XLS 52 kb)

Supplementary Methods S1

Microarray pre-hybridization, hybridization, and washing protocol (DOC 59 kb)

Supplementary Methods S2

Microarray data acquisition protocol and R scripts (DOC 47 kb)

Supplementary Methods S3

Microarray analysis protocol and R scripts (DOC 32 kb)

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Reprints and permissions

About this article

Cite this article

Booman, M., Borza, T., Feng, C.Y. et al. Development and Experimental Validation of a 20K Atlantic Cod (Gadus morhua) Oligonucleotide Microarray Based on a Collection of over 150,000 ESTs. Mar Biotechnol 13, 733–750 (2011). https://doi.org/10.1007/s10126-010-9335-6

Download citation

Received: 27 August 2010
Accepted: 05 November 2010
Published: 03 December 2010
Issue Date: August 2011
DOI: https://doi.org/10.1007/s10126-010-9335-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Development and Experimental Validation of a 20K Atlantic Cod (Gadus morhua) Oligonucleotide Microarray Based on a Collection of over 150,000 ESTs

Abstract

Similar content being viewed by others

Introduction

Materials and Methods

EST Clustering and the Selection of Representative Sequences for Microarray Probe Design

Oligonucleotide Probe Design

Oligonucleotide Synthesis and Microarray Production

Bacterial Antigen Stimulation and Tissue Sample Collection

Nodavirus Testing

RNA Extraction

Microarray Hybridization

Microarray Data Acquisition

Microarray Data Analysis

Quantitative RT-PCR Analysis

Results

Microarray Design

Microarray-Based Gene Expression Analysis

QPCR Analysis

Discussion

Highly Stringent Analysis of Microarray Data Identifies 82 Probes That Were Responsive to Stimulation with Formalin-Killed, Atypical A. salmonicida

QPCR Analysis and Comparison with SSH Analysis Confirm Validity of Microarray Results

Functional Analysis Shows a Large Number of Formalin-Killed, Atypical A. salmonicida-Responsive Genes are Immune-Related

The Atlantic cod 20K Oligonucleotide Microarray is a Valuable New Tool for Genomics Research

References

Acknowledgments

Open Access

Author information

Authors and Affiliations

Corresponding author

Electronic Supplementary Material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation