Serial Analysis of Gene Expression in the Chicken Otocyst
- First Online:
- Cite this article as:
- Sinkkonen, S.T., Starlinger, V., Galaiya, D.J. et al. JARO (2011) 12: 697. doi:10.1007/s10162-011-0286-z
- 598 Downloads
The inner ear arises from multipotent placodal precursors that are gradually committed to the otic fate and further differentiate into all inner ear cell types, with the exception of a few immigrating neural crest-derived cells. The otocyst plays a pivotal role during inner ear development: otic progenitor cells sub-compartmentalize into non-sensory and prosensory domains, giving rise to individual vestibular and auditory organs and their associated ganglia. The genes and pathways underlying this progressive subdivision and differentiation process are not entirely known. The goal of this study was to identify a comprehensive set of genes expressed in the chicken otocyst using the serial analysis of gene expression (SAGE) method. Our analysis revealed several hundred transcriptional regulators, potential signaling proteins, and receptors. We identified a substantial collection of genes that were previously known in the context of inner ear development, but we also found many new candidate genes, such as SOX4, SOX5, SOX7, SOX8, SOX11, and SOX18, which previously were not known to be expressed in the developing inner ear. Despite its limitation of not being all-inclusive, the generated otocyst SAGE library is a practical bioinformatics tool to study otocyst gene expression and to identify candidate genes for developmental studies.
Keywordsgene array inner ear development otic vesicle SAGE Sox
The otic vesicle, or otocyst, is one of the earliest morphological manifestations of the vertebrate inner ear. It arises by invagination of the otic placode, which is an ectodermal thickening that develops near the developing hindbrain. This process happens in chicken embryos during the second and third days of embryonic development (E2–E3). In intermediate stages, the otic placode has folded inward to form a pouch that is also called otic pit. The otic pit subsequently pinches off from the surface ectoderm into the underlying mesenchyme, resulting in the formation of the otocyst.
It has been hypothesized that axis formation of the developing inner ear already happens during otocyst formation, where signals from the surrounding tissues result in the regionalization of the developing otic placode, pit, and otocyst (reviewed in Fekete 1996; Fekete and Wu 2002). Despite a complex patterning process that is already manifest at the otocyst stage by regionalized expression of specific markers (for a review, see Streit 2007), the otocyst itself is a remarkable structure because it contains all the necessary progenitor cells to form the major cell types of the inner ear. This autonomy was revealed by grafting otocysts into other regions of the developing body (Swanson et al. 1990) as well as by determining that the otic lineage that is specified during otic induction has already reached a largely committed state when the otocyst is formed (Groves and Bronner-Fraser 2000).
Inner ear cell regeneration research has been utilizing the fact that stem cell-derived mammalian otic progenitor cells, defined by the expression of otocyst markers such as PAX2, PAX8, and DLX5, display a certain degree of commitment toward the otic lineage (Li et al. 2003a, b; Oshima et al. 2007, 2010). This was demonstrated by showing the potential of otic progenitors to differentiate into different cell types that express makers indicative of neurons, hair cells, and supporting cells. Based on these findings, it has been hypothesized that stem cell-derived otic progenitors are similar to otocyst cells (Brigande and Heller 2009; Diensthuber et al. 2009).
Here, we present the results of an unbiased interrogation of gene expression in the chicken otocyst. This analysis was spurred by the lack of comprehensive gene expression information at this important stage of development. We decided to utilize serial analysis of gene expression (SAGE), which is a quantitative method that can be used to identify known as well as new genes (Saha et al. 2002; Velculescu et al. 1995). Of the 39,326 seventeen-base pair sequence tags that we found, we evaluated 16,008 unique sequences that resulted in 4,153 unequivocally identified genes. Although our study lacked the sensitivity of more modern high-throughput deep sequencing methods, we consider the results as an important contribution because they provide a comprehensive summary of genes that are expressed at medium and high levels. Our analysis revealed potential signaling proteins and receptors as well as almost 300 transcriptional regulators that are expressed in the chicken otocyst. Some of these regulators have been previously known to play important roles during inner ear development, but we found additional candidate genes, such as several members of the Sox gene family, which have thus far not been evaluated in the context of the developing inner ear.
Tissue dissection, RNA preparation, and SAGE library preparation
Total RNA was isolated using a commercial kit (RNeasy Mini Kit, Qiagen, Hilden, Germany). RNA integrity and quality was confirmed by gel electrophoresis and by visual assessment. Five micrograms total RNA, the combined yield of 200 otocysts, was used for SAGE library synthesis (I-SAGE Long kit, Invitrogen) starting with attaching polyA+ RNA to oligo(dT)-paramagnetic beads, reverse transcription, and second strand synthesis. The resulting cDNA was cleaved with NlaIII, divided into two fractions, and bound to two different adapters containing a type IIS restriction nuclease recognition site. The adapters with adjacent 21-bp cDNA pieces were released from the oligo(dT)-magnetic beads using the type IIS restriction endonuclease MmeI. The two pools of released adapter-linked tags were ligated to 130-bp ditags and amplified with 27 PCR cycles with specific primers to each adapter. The adapters were then cleaved with NlaIII, 34-bp ditags were purified from the adapters by polyacrylamide gel electrophoresis, and concatenated. Concatemers were fractioned by size, gel-purified, and then cloned into pZErO-1 provided with the kit.
Quality control was conducted in two steps. First, 20 colonies were picked, plasmid DNA was prepared, and the resulting plasmids were digested with NsiI, which resulted in the release of the individual inserts. We obtained 20 distinct restriction patterns with a mean insert length of 671 bp (±425 bp (SD)). The smallest fragment was 200 bp and the largest was 1,700 bp. In a second step, we sequenced 24 inserts, obtaining 12,025 bp of raw data. These insert lengths slightly exceeded the expected ≈25 SAGE tags per clone predicted by the kit manufacturer. Three thousand eight hundred forty colonies were robotically picked and the plasmid concatemer inserts were directly sequenced (GeneWiz, South Plainfield, NJ).
Modifications from the manufacturer’s protocol included that the gel electrophoreses for the 130- and 34-bp ditags were performed on 10% polyacrylamide gels (Novex TBE Gels, Invitrogen) and that DNA was isolated from polyacrylamide gels with QIAEXII beads (Qiagen). Concatemers were separated on a 2% agarose gel and purified before subcloning with a column-based gel extraction kit (QIAquick, Qiagen).
At the highest stringency settings for DNA sequence quality and tag extraction, 39,326 individual 17-bp tags were extracted from the 3,512 sequencing data files using SAGE2000 analysis software (version 4.5, Invitrogen). Only unequivocal sequences were used for library construction, which resulted in a library about three times smaller than predicted by the quality control samples. A potential reason for this shortfall might be that the sequence quality achieved with direct sequencing was lower than the sequencing results obtained with plasmid DNA, which was used for the quality control clones. For computerized mapping, we appended the NlaIII restriction site (5′-CATG-3′) to the 5′ end of each tag. The resulting 21-bp tags were mapped using 22,290 G. gallus cDNA sequences available through the Ensembl 52 database (www.ensembl.org/info/data/ftp/), 19,307 G. gallus cDNAs available from the RefSeq database (ftp.ncbi.nih.gov), and 33,383 G. gallus sequences from the Unigene database (www.ncbi.nlm.nih.gov/unigene). Mapping was conducted using MAQ software (Li et al. 2008; maq.sourceforge.net) with all parameters set to default allowing for two mismatches in the sequence alignments. Ingenuity Pathway Analysis (IPA) software was accessed via the Stanford University Bioinformatics Resource (cmgm.stanford.edu).
Reverse transcriptase PCR
Chicken otocyst RNA was isolated (Absolutely RNA Miniprep Kit, Stratagene/Agilent Technologies, La Jolla, CA) and treated with RNase-free DNase I (Roche Diagnostics, Mannheim, Germany). The RNA concentration was determined by spectrophotometric analysis using a NanoDrop (Thermo Fisher Scientific, Wilmington, DE). Total RNA extracts were then used for reverse transcription (RT) into cDNA (first strand) using SuperScript III Reverse Transcriptase (Invitrogen) and Oligo(dT)18 primer (Invitrogen) with 350 ng of total RNA per 20 μl reaction. To prevent RNA degradation, 1 μl RiboLock RNase Inhibitor (Fermentas, Thermo Fisher Scientific, Glen Burnie, MD) was also included in each reaction. Control reactions were done without reverse transcriptase.
Oligonucleotide primer pairs were designed for each gene of interest using NCBI Primer-BLAST (www.ncbi.nlm.nih.gov/tools/primer-blast) with NCBI Reference Sequences as template. For each gene tested, at least two primer pairs covering two non-overlapping 300- to 700-bp regions were used to confirm mRNA expression. A full list of primers tested can be found in Electronic supplementary materials (ESM) Table 1. PCR was performed using GoTaq Green Master Mix (Promega, Madison, WI) with 2 μl cDNA template and 1 μl 400 nM each of forward and reverse gene-specific primers. The following cycling conditions were employed: initial denaturation at 94°C (3 min); 30 cycles of denaturation at 94°C (30 s), annealing at 55°C (1 min), and elongation at 72°C (1 min); and a hold at 4°C. Aliquots of PCR products were electrophoresed in a 2.0% agarose gel, stained with SYBR Safe (Invitrogen) in 1X TAE buffer at 120 V for 35 min, and documented using UV transillumination and digital photography (Kodak Gel Logic 200 Imaging System).
In situ hybridization
The T7 promoter sequence (5′-TAATACGACTCACTATAGGG-3′) was added to the 5′ end of the forward or reverse primer for the different Sox2 cDNAs to allow for conversion of the PCR product to sense and antisense cRNA probes for in situ hybridization. Of the PCR product, 500 ng was used to synthesize digoxigenin-labeled antisense probes (DIG RNA Labeling Kit, Roche Diagnostics), which were resuspended in 30 μl RNAse-free water. Embryos were dissected at HH stage 18–19 (E3) and HH stage 26–28 (E5), fixed overnight with 4% paraformaldehyde in phosphate-buffered saline (PBS, pH 7.4), transferred into 30% sucrose in PBS for 24–36 h, and embedded in O.C.T compound (Tissue-Tek). Sections were cut with a cryomicrotome (CM3050 S, Leica), collected on ultrastick slides (precleaned Gold Seal, Rite-on, Micro Slides), dried at 37°C for 45 min, and stored frozen at −70°C. For hybridization, the sections were brought to room temperature and rehydrated in 100 μl diluted probe (1:100) in 50% formamide, 10% dextran sulfate, 1 mg/ml yeast RNA, 1x Denhardt’s solution, 185 mM NaCl, 5.6 mM NaH2PO4, 5 mM Na2HPO4, 5 mM EDTA, and 15 mM Tris at pH 7.5. After coverslipping and overnight incubation at 65°C in a chamber humidified with 50% formamide in 150 mM NaCl, 15 mM trisodium citrate, pH 7 (1× SSC), the coverslips were removed in 5x SSC and the slides washed twice for 30 min each in 50% formamide and 0.1% Triton X-100 in 1x SSC at 65°C. Thereafter, the slides were washed for 15 min in 0.2x SSC and for 15 min in 150 mM NaCl and 100 mM Tris at pH 7.5 at room temperature. For antibody detection, the sections were blocked for 30 min in 0.5% blocking powder (Roche Diagnostics), 10% heat-inactivated goat serum, 100 mM NaCl, 0.1% Triton X-100, and 100 mM Tris at pH 7.5. The slides were then incubated for 2 h at room temperature in a blocking solution pre-incubated for 1 h with alkaline phosphatase-conjugated anti-digoxigenin Fab fragments (1:500, Roche Diagnostics). Unbound Fab fragments were removed by washing twice for 30 min each in 150 m NaCl and 100 mM Tris at pH 7.5. The sections were first incubated in detection buffer (100 mM NaCl, 50 mM MgCl2, 100 mM Tris at pH 9.5) for 10 min. For detection, the sections were then covered with 200 μl of chromogen solution consisting of 20 μl NBT/BCIP stock solution (Roche Diagnostics) and 50 μl Levamisol stock solution (20x concentrate, Invitrogen) in 1 ml detection buffer, coverslipped, and incubated overnight at room temperature in a humidified chamber. Coverslips were removed and color reaction was stopped in 1 mM EDTA and 10 mM Tris at pH 8.1. Slides were embedded in 50% glycerol in PBS and coverslipped. Analysis and photography was conducted on an Axiovert 25 microscope with an AxioCam MRC camera, using AxioVision software (V 220.127.116.11, Zeiss).
SAGE library of the chicken otocyst
Otocysts were dissected from HH stage 18–19 chicken embryos (Fig. 1A), total RNA was extracted, and subjected to a commercial long-SAGE protocol, resulting in a library of concatemerized tags. Individual clones of SAGE concatemers were sequenced, resulting in 39,326 seventeen-base pair tags with tag counts up to 718 for the most abundant tag; 3,292 tags were represented between two and five times, whereas the majority of tags (11,717) were only found once (Fig. 1B). Overall, we identified 16,008 unique sequence tags (ESM Table 2; NCBI Geo DataSet accession no. GSM651351).
Gene annotation reveals abundance of transcriptional regulators
Genes with the highest expression based on SAGE tag count
Cytochrome c oxidase I
Ariadne homolog, ubiquitin-conjugating enzyme E2 binding protein, 1
Cytochrome c oxidase II
Cytochrome c oxidase III
ATP synthase 6, ATPase subunit 6
Nucleophosmin (nucleolar phosphoprotein B23, numatrin)
NADH dehydrogenase, subunit 4 (complex I)
Ribosomal protein L13
Ribosomal protein L10a
Ribosomal protein L4
Ribosomal protein L23
Ribosomal protein S27a
Midkine (neurite growth-promoting factor 2)
NADH dehydrogenase, subunit 5 (complex I)
Ribosomal protein S3
Eukaryotic translation elongation factor 1 α1
ATP synthase, H+ transporting, mitochondrial F1 complex, β polypeptide
Ribosomal protein S29
Ribosomal protein S27-like
Ribosomal protein S15
Tubulin, β 2A
Ribosomal protein, large, P1
Ribosomal protein S3A
Ribosomal protein L21
Ribosomal protein L36
Ribosomal protein L35
Nucleic acid binding
Heterogeneous nuclear ribonucleoprotein A3
Overall, we identified 299 genes that encode transcriptional regulators (ESM Table 5) which can be categorized into transcription factors containing zinc-coordinating DNA-binding domains (11%), helix-loop-helix domains (13%), basic domains (15%), ß-scaffold factors with minor groove contacts (16%), and others (45%). Fifty-one transcriptional regulators were previously known to be expressed in the developing inner ear. Known examples for each respective category are GATA2 and GATA3 (Lillevali et al. 2007) for zinc-coordinating DNA-binding domains, PAX2 and FOXG1 (Herbrand et al. 1998; Li et al. 2004; Pauley et al. 2006) for helix-loop-helix domains, NEUROG1 and NEUROD1 (Liu et al. 2000; Ma et al. 2000) for basic domains, and SOX10 and NOTCH1 (Lewis et al. 1998; Stone and Rubel 1999; Watanabe et al. 2000) for ß-scaffold factors with minor groove contacts. Two hundred forty-eight transcriptional regulators were previously unknown in the context of early inner ear development.
Secreted proteins and transmembrane proteins
Genes that encode growth factors, cytokines, and other secreted proteins are the second group of developmentally interesting otocyst genes (ESM Table 6). Of the 172 genes that we identified in this group, several were previously known in inner ear development and include BMP7, FGF10, FGF19, FRZB, TGFß2, NETRIN1, SLIT1, WNT3, and WNT5A (Abraira et al. 2008; Alsina et al. 2004; Battisti and Fekete 2008; Hollyday et al. 1995; Liu et al. 2008; Oh et al. 1996; Okano et al. 2005; Sanchez-Calderon et al. 2007; Sienknecht and Fekete 2009). Transcripts encoding the secreted signaling protein midkine (MDK) were by far the most abundantly expressed mRNA that we detected. Midkine has been previously reported in the postnatal mouse cochlea, and it has been shown that the protein is involved in regulating the expression of the tectorial membrane component ß-tectorin (Jia et al. 2001; Zou et al. 2006), but early developmental roles in the inner ear have not been reported. Other proteins, such as opticin (OPTC), have previously been shown in the otic vesicle, but their function in inner ear development remains unknown (Frolova et al. 2004). Several genes emerged in our screen as novel candidates for roles in inner ear development, such as olfactomedin-like 2A, 2B, 3 (OLFML2A, OLFML2B, OOLFML3), which belong to a class of proteins implicated in a variety of developmental processes (reviewed by Tomarev and Nakaya 2009), or neudesin (NENF), which may play roles in neuronal differentiation and development (Kimura et al. 2006). We identified various TGFß antagonists such as twisted gastrulation protein homolog 1 (TWSG1) or follistatin-like 3 (FSTL3). Lastly, we identified various secreted proteins of unknown function during development, but with previous implications in cancer or other cell growth- and death-related processes; examples for these proteins are AGR3, CLU, EGFL7, and HDGF.
Our analysis of transmembrane-spanning proteins revealed high transcript expression levels of many tight junction and cell adhesion proteins such as claudin 1 (CLDN1), CLDN3, and CLDN17; integrins α3 (ITGA3) and α6 (ITGA6); integrins ß1, ß2, ß3, and ß5 (ITGB1, ITGB2, ITGB3, ITGB5); neurexin 1 (NRXN1); as well as cell adhesion molecule 1 (CADM1) and epithelial cell adhesion molecule TACSTD1, among others. One of the most abundant genes identified in this category encodes protein tyrosine kinase 7 (PTK7), a protein implicated in the regulation of planar cell polarity, convergent extension, and Wnt signaling (Lu et al. 2004; Puppo et al. 2011; Yen et al. 2009). Another interesting gene in this regard encodes the Ig superfamily protein protogenin (PRTG), which has been shown to play a role in suppressing premature neural differentiation and whose roles in other tissues might similarly be in controlling the timing of transitions between early progenitor state and differentiation (Ito et al. 2011; Wong et al. 2010). Probably the most interesting group of genes that we identified encodes receptors for signaling proteins because they might reveal information about the developmental processes happening in the otocyst. These include genes that encode receptors for ligands that are already known for playing roles in otic development such as FGFR1, FZD1, FZD2, FZD3, FZD4, FZD7, NGFR, and NOTCH1, which have previously been shown to be expressed in the vertebrate otocyst (Adam et al. 1998; Pirvola et al. 2002; Sienknecht and Fekete 2009; Stevens et al. 2003; von Bartheld et al. 1991; Wright and Mansour 2003). BMPR1, BMPR2, LGFR1, SMO1, PTCH1, DISP1, and TGFBR2 are genes that were presumed to be expressed in the otocyst because their ligands, such as BMPs and other TGFß family members, IGF, as well as hedgehog signaling proteins, have been shown to be expressed and active during inner ear development (Bok et al. 2005; Frenz et al. 1991, 1992; Liu et al. 2002; Oh et al. 1996; Riccomagno et al. 2002; Yamashita and Oesterle 1995). Other identified genes include receptors for somatostatin (SSTR1), interleukin 11 (IL11RA), endothelin (EDNRB), and tumor necrosis factors (TNFRSF1A, TNFRSF6B, TNFRSF19) and orphan receptors such as lathrophilin 3 (LPNHN3; Sudhof 2001).
Other potentially interesting transcripts encoded transmembrane proteins involved in cell recognition and adhesion that play roles in axonal guidance and cell migration such as the semaphorins SEMA4B, SEMA5B, SEMA6D, SEMA7A and some components of their receptor complex such as Plexins A1 and B2 (PLXNA1, PLXNB2; Perrot et al. 2002). Additional genes with similar roles include ephrin B1 (EFNB1) and ephrin receptors (EPHA4, EPHA5, EPHB3), netrin G1 and the netrin receptor UNC5B, the Slit receptors ROBO1 and ROBO2, as well as the Slit-like transmembrane protein SLITRK6. The possible roles of some of these genes in axon guidance and cell migration has been discussed in the context of the inner ear (Fekete and Campero 2007; Webber and Raz 2006), and their expression patterns and potential function are the focus of intensive research (Battisti and Fekete 2008; Katayama et al. 2009; Matilainen et al. 2007).
Index for gene names shown in Figure 4
Apoptosis antagonizing transcription factor
Activity-dependent neuroprotector homeobox
Activating signal cointegrator 1 complex subunit 1
CASK interacting protein 1
Churchill domain containing 1
CBF1 interacting corepressor
CCCTC-binding factor (zinc finger protein)
E2F transcription factor 4, p107/p130-binding
E2F transcription factor 5, p130-binding
Forkhead box M1
Hepatic leukemia factor
Heat shock transcription factor 2
Jumonji, AT rich interactive domain 1B
Limb bud and heart development homolog (mouse)
v-maf musculoaponeurotic fibrosarcoma oncogene homolog F (avian)
Mediator complex subunit 14
Mediator complex subunit 16
Mediator complex subunit 24
MYST histone acetyltransferase (monocytic leukemia) 4
Nucleophosmin (nucleolar phosphoprotein B23, numatrin)
NOTCH-regulated ankyrin repeat protein
Paired box 2
Proteasome (prosome, macropain) 26S subunit, non-ATPase, 9
Regulatory factor X, 2 (influences HLA class II expression)
SAP30 binding protein
Small nuclear RNA activating complex, polypeptide 5, 19 kDa
SRY (sex determining region Y)-box 2
SRY (sex determining region Y)-box 4
SRY (sex determining region Y)-box 7
SRY (sex determining region Y)-box 8
SRY (sex determining region Y)-box 10
SRY (sex determining region Y)-box 11
SRY (sex determining region Y)-box 18
TAF1 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 250 kDa
Transducin (beta)-like 1 X-linked receptor 1
TGFB-induced factor homeobox 2
Trimethylguanosine synthase homolog (S. cerevisiae)
TATA element modulatory factor 1
Tumor protein p53
Vascular endothelial zinc finger 1
Zinc finger protein, multitype 1
Zinc finger protein 326
Fibroblast growth factor receptor 1
Insulin-like growth factor 1 receptor
Integrin, beta 1 (fibronectin receptor, beta polypeptide, antigen CD29 includes MDF2, MSK12)
Integrin, beta 5
Protogenin homolog (Gallus gallus)
PTK7 protein tyrosine kinase 7
Sema domain, seven thrombospondin repeats (type 1 and type 1-like), transmembrane domain (TM) and short cytoplasmic domain, (semaphorin) 5B
Smoothened homolog (Drosophila)
Epithelial cell adhesion molecule
Hepatoma-derived growth factor (high-mobility group protein 1-like)
Midkine (neurite growth-promoting factor 2)
Known and novel Sox genes expressed in the otocyst
One of the most strongly represented groups of transcription factors in the chicken otocyst SAGE library were the Sox genes. Previous reports show the expression of SOX1, SOX2, SOX3, SOX6, SOX9, SOX10, and SOX21 in the chicken otocyst, or in the otic vesicle of various species including African clawed frog, zebrafish, and mouse (Barrionuevo et al. 2008; Liu et al. 2003; Neves et al. 2007; Uchikawa et al. 1999; Watanabe et al. 2000). Clearly highlighting the limitation of SAGE, showing that about 40,000 tags are far from exhaustive, is the fact that tags for SOX2, SOX3, and SOX6 were not represented in our SAGE library. Nevertheless, we found six Sox genes that previously were not known to be expressed in the developing inner ear, which include SOX4, SOX5, SOX7, SOX8, SOX11, and SOX18.
The chicken embryo is one of the major animal models used to study inner ear induction and development. In the past decades, many genes have been found that are expressed by cells of the otocyst, and the specific roles of some of these genes have been elucidated. Nevertheless, no comprehensive study has been conducted on gene expression in the chicken otocyst. We hypothesized that the existing collection of otocyst markers and genes is just the tip of an iceberg, and we consequently decided to investigate, using a high-throughput method, gene expression in this clearly defined transient structure. Unlike the mouse and human genomes, the chicken genome is comparably poorly annotated, which complicated the analysis strategy. We refrained from using gene arrays whose preselected genes are constrained by these shortcomings. Additionally, at the onset of this study, no comprehensive chicken gene arrays were commercially available and next-generation sequencing techniques, likewise, were not yet developed. We decided to employ SAGE, which is a relatively unbiased method, based on sequencing of short tags that are directly adjacent to a NlaIII restriction site in the 3′ region of any given mRNA (Velculescu et al. 1995). The NlaIII recognition sequence is 4-bp long (5′-CATG-3′) and theoretically occurs once in every 256 bp. Using long-SAGE (Saha et al. 2002), which employs 17-mer tags instead of 10-mer tags, which were used in initial SAGE protocols, we were able to utilize a specificity of 421. Indeed, we only found 53 ambiguous tags, which either occur more than once in the transcriptome or were associated with more than one gene as a result of annotation ambiguities.
Our analysis is not based on comparative or subtractive studies, and consequently, many genes identified are widely expressed. Nevertheless, the results of our study do not preclude the use of bioinformatic tools to extract subtractive or otherwise user-defined datasets, and the reader is encouraged to use our dataset as needed. A recent very elegant gene array study focusing on FGF-based otic induction in mouse embryos is an example of the powerful specificity that can be achieved by selecting proper tissues for comparison (Urness et al. 2010). In this specific case, wild-type mouse otic placode tissue was compared with tissue from the prospective otic placode of Fgf3−/−;Fgf10−/− mice in which otic development fails to be initiated. This study revealed several transcriptional regulator genes that depend on FGF-based otic induction, including Hmx2, Hmx3, Foxg1, and Sox9, which we also found in our dataset. Other studies that focused on the identification of otocyst genes used differential display of chicken otocyst RNA against RNA from surrounding tissues (Gong et al. 1997) and on cDNA subtraction of mouse otocyst minus liver cDNA (Powles et al. 2004). The differential display study identified only a small number of unknown genes, and the collection of 280 specific transcripts found in the mouse otocyst cannot be directly compared with our data because the dataset was only partially annotated and has not been deposited in a format usable for in silicio comparison, for example via the NCBI Gene Expression Omnibus (GEO) database (http://www.ncbi.nlm.nih.gov/geo/).
An obvious limitation of the SAGE method is the number of tags which results in libraries that are reasonable large, but that are far from exhaustive, particularly when dealing with complex tissues consisting of different cell types. Analysis of our chicken otocyst dataset clearly revealed this limitation. For example, known and easily detectable otocyst genes such as SOX2, PAX8, and FOXI3 (Groves and Bronner-Fraser 2000; Ohyama and Groves 2004; Uchikawa et al. 1999; Wood and Episkopou 1999) were not represented in our library, and 45% of all annotated tags were only represented once. The consequence of this limitation is probably a major reason why the SAGE method appears to be a transient technology that is in the process of being replaced with much more comprehensive and massive parallel next-generation sequencing methods capable of generating datasets of tens of millions of tags with a single run. Likewise, microarray and cross-species comparison methods are becoming increasingly more accessible to study gene expression in avian species and have already been successfully used in recent years (Hawkins et al. 2007).
Our analysis also revealed many other potentially important genes that have not previously been considered in the context of inner ear development, and some have just recently been investigated. We found a number of secreted proteins that are novel candidates for signaling functions in the developing otocyst. Transmembrane proteins consisted of members of previously known families of proteins that are essentially involved in inner ear development such as receptors for FGFs, BMPs, and WNTs, as well as NOTCH1, among others. Interestingly, we found a relatively large group of proteins belonging to families that have been implicated in axonal guidance and cell migration; some of these proteins have previously been shown to be expressed in the otocyst and other developmental stages of the inner ear (Battisti and Fekete 2008; Matilainen et al. 2007). The expression and function of Slit-like transmembrane protein SLITRK6, for example, has recently been analyzed in mouse inner ear development (Katayama et al. 2009). Slitrk6 is strongly expressed in the prosensory and sensory patches of the auditory and vestibular organs; the innervation density of these organs was reduced or abolished in Slitrk6−/− mice.
In summary, we have used the SAGE method to assemble a list of sequence tags that can be associated with gene expression in the chicken otocyst. Although not all-inclusive, this SAGE library is a practical bioinformatics tool to study otocyst gene expression. For user-defined analyses, the library is available in electronic formats that can be directly queried online such as NCBI GEO, or it can be imported into commercial or public domain bioinformatic software packages such as IPA. We used the Sox gene family as an example to highlight the depth as well as the limitations of the library and to demonstrate that the collection of otocyst SAGE tags is a useful tool for molecular and developmental studies of early inner ear development.
This project was supported by the Sigrid Jusélius Foundation and Instrumentarium Science Foundation (to S.T.S.), a Stanford Dean’s Fellowship, and by fellowship D/06/41764 from the German Academic Exchange Service (to V.S.), as well as grants DC006167, DC010042, and P30 DC010363 from the National Institutes of Health (to S.H.).