Abstract
Retinal degeneration often affects the whole retina even though the disease-causing gene is specifically expressed in the light-sensitive photoreceptors. The molecular basis of the retinal defect can potentially be determined by gene-expression profiling of the whole retina. In this study, we measured the gene-expression profile of retinas microdissected from a zebrafish pde6cw59 (pde6c) mutant. This retinal-degeneration model not only displays cone degeneration caused by a cone-specific mutation, but also other secondary cellular changes starting from 4 days postfertilization (dpf). To capture the underlying molecular changes, we subjected pde6c and wild-type (WT) retinas at 5 dpf/ 120 h postfertilization (hpf) to RNA sequencing (RNA-Seq) on the Illumina HiSeq 2,000 platform. We also validated the RNA-Seq results by Reverse Transcription Quantitative Polymerase Chain Reaction (RT-qPCR) of seven phototransduction genes. Our analyses indicate that the RNA-Seq dataset was of high quality, and effectively captured the molecular changes in the whole pde6c retina. This dataset will facilitate the characterization of the molecular defects in the pde6c retina at the initial stage of retinal degeneration.
Design Type(s) | genotype design |
Measurement Type(s) | transcription profiling assay |
Technology Type(s) | RNA sequencing |
Factor Type(s) | genotype |
Sample Characteristic(s) | Danio rerio • retina |
Machine-accessible metadata file describing the reported data (ISA-Tab format)
Similar content being viewed by others
Background & Summary
In retinal degenerative diseases, light-sensitive photoreceptors often die due to mutations in genes specifically expressed in these cells. This photoreceptor-specific defect will lead to a series of cellular changes, which would ultimately affect the well-being of the whole retina. For example, in a zebrafish mutant pde6cw59 (pde6c), an A>G point mutation was identified in the pde6c (phosphodiesterase 6C, cGMP-specific, cone, alpha prime) gene1. This mutation was predicted to cause a frameshift in the coding sequence and result either in a truncated PDE6C or degradation of pde6c mRNA through nonsense-mediated decay. This mutation ultimately affects both cone and rod photoreceptors.
In wild-type zebrafish retina, photoreceptor progenitors withdraw from the cell cycle at 48 h post-fertilization (hpf) to form the photoreceptor precursors in the outer nuclear layer (ONL)2. These precursors will begin to differentiate into rods and cones at 50 hpf in a ventral part of the retina3. Then, cones gradually distribute evenly throughout retina; whereas rods distribute sporadically in the retina, with higher density in the ventral region. By 3 dpf, these photoreceptors are mature enough that the fish larvae will display the first visually-evoked startle response4. In pde6c, cones seem to develop following the normal course. They form outer segment by 3 dpf, but they degenerate starting at 4 dpf. Pde6c rods die subsequently as bystanders1, even though they do not express the pde6c gene. They also develop first in the retina, but they appear abnormal with more pronounced outer segment at 4 dpf1, when cones first degenerate. Nonetheless, rod number is comparable to WT at least up to 6 dpf5, and only begins to reduce in the central retina by 8 dpf1. The degeneration of these photoreceptors will also cause reactive gliosis as early as 7 dpf6, and abnormal morphology in bipolar cells with displaced nuclei and axonal processes at 8 dpf1. These tissue-level consequences are all initiated by a cone-specific pde6c mutation. The molecular basis of these retinal defects can often be analysed by profiling gene expression of the whole retina7–10, which can detect expression change even in a few cells11. Such expression profiles will facilitate the study of gene-regulatory mechanisms for these pathological events.
In this study, we measured the gene expression in the pde6c mutant retina and WT retina by RNA sequencing (RNA-Seq). We conducted RNA-Seq at 5 dpf/ 120 hpf. At this stage, cones have substantially degenerated, whereas the bystander rods display abnormal morphology1,5. The Müller cells also display phenotype that is consistent with reactive gliosis (unpublished observation). Therefore, measuring retinal gene expression at 5 dpf/ 120 hpf would give us a glimpse of the global retinal defects at the initial stage of retinal degeneration. We analysed the sequencing results by various exploratory analyses, which indicate our RNA-Seq dataset was of high quality. We also evaluated the quality of our RNA-Seq results by reverse-transcription quantitative polymerase chain reaction (RT-qPCR), using a few key phototransduction genes. We observed a high correlation of fold changes of these genes as measured by both techniques. This observation strongly implicates that our RNA-Seq dataset effectively measured the expression changes in the pde6c retina. This dataset will therefore facilitate downstream characterization of the molecular problems in the pde6c retina.
Methods
Fish maintenance and embryo collection
The pde6cw59 (pde6c) mutant line1 was purchased from the Zebrafish International Resource Center (ZIRC; http://zebrafish.org/) and maintained according to standard procedures12. The embryos were collected at 15-min intervals and raised at 28 °C in E3 medium13. The breeding of parents was also staggered so that the embryonic retinas could all be dissected and collected at 120 hpf. The homozygous mutant embryos (i.e., pde6cw59/w59) were obtained from the cross of the pde6c heterozygous parents, whereas the WT-control embryos were collected from the cross of genotyped WT siblings of the pde6c heterozygous parents. All protocols were approved by the Purdue Animal Care and Use Committee.
Optokinetic response (OKR)
An OKR apparatus was constructed based on the specifications as described previously14. This machine was used to measure the OKR of the zebrafish larvae. During the assay, the larvae were partially immobilized in 3% methylcellulose in a 35-mm Petri dish. The dish was placed in the center of a circular drum with 20° black and white vertical stripes attached on the inner surface. These stripes were illuminated by a Fiber Lite M1–150 illuminator (Dolan-Jenner Industries, Boxborough, MA). The illuminance was approximately 20,000 Lux at the level of the Petri dish, as measured by a LX1010B light meter (Mastech, Taipei, Taiwan). During the OKR measurement, the rotation speed of the drum was set at 8 revolutions per minute. In response to stripe rotation, normal larval eyes would display a characteristic smooth pursuit of rotation, and a rapid, corrective movement called saccade.
Genotyping
The pde6cw59 mutation was genotyped as previously described5. In short, DNA was extracted from adult tail fin or larval body after retinal dissection. Then, the extracted DNA was used to amplify a 157-bps fragment from the pde6c gene with a specific pair of primers (Table 1). During amplification, a restriction site BsaXI was created only in the mutated pde6cw59 allele but not in the WT allele. This difference was discriminated by restriction analysis of these DNA fragments by BsaXI. On the resulting agarose gel, a WT allele would give a 157-bps band, whereas the pde6cw59 allele would give a 122-bps and a 35-bps band.
Micro-dissection of zebrafish retina
WT retinas and pde6cw59/w59 retinas were microdissected from 5-dpf larvae using a previously published protocol7,15. These larvae were first screened by the OKR assay. This non-invasive screening streamlined the identification of pde6cw59/w59 larvae from the cross of heterozygous parents, as only ¼ of the clutch would be homozygotes and they looked phenotypically normal. All larvae were dark-adapted for at least 2 h before dissection. Then, three retinas were dissected from three independent larvae and combined as one biological replicate. The remaining larval trunk was further genotyped to confirm its genetic identity. Three biological replicates were finally collected for each genotype (Table 2).
Total RNA extraction, characterization and amplification
Total RNAs was extracted from the biological replicates by an optimized procedure7,8 that combined TRIzol (Life Technologies, Grand Island, NY) and RNeasy Micro kit (Qiagen, Valencia, CA). The quality of the extracted total RNAs were evaluated by Bioanalyzer electrophoresis (Agilent Technologies, Santa Clara, CA). The RIN values were 9.1, 9.1 and 8.9 for the three replicates of WT retinas, and were 9.5, 9.1 and 9.3 for the three replicates of pde6c retinas (Table 2). Then, one nanogram of the total RNA from each group was amplified by Ovation RNA-Seq System V2 (NuGEN Technologies, San Carlos, CA). The yield of the amplified cDNA products was evaluated by Nanodrop spectrophotometry (Thermo Scientific).
RNA sequencing and analysis
For each replicate, one microgram of the amplified cDNA was paired-end sequenced by Illumina HiSeq 2,000 (Illumina, Inc., San Diego, CA). The six samples—three WT retinas (WR1, WR2, and WR3 in Table 2) and three pde6c retinas (PR1, PR2, and PR3 in Table 2)—were combined and run on three different lanes. Running all samples in one lane eliminated lane-to-lane variation16, whereas running the samples on multiple lanes increased coverage. It should be noted that another six unrelated samples were included in these three lanes during RNA sequencing. This reduced the theoretical coverage of each sample by half.
Sequencing quality check and read alignment
The raw fastq reads were trimmed for adapters and preprocessed to remove low-quality reads using Trimmomatic version 0.317 with default parameter setting for Illumina paired-end reads. The reads were aligned to primers and/or adaptors with no more than two mismatches. Reads shorter than 36 bases were removed. After adapter removal, the quality of each paired-end sequence file was evaluated using FastQC analysis http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (see Code Availability 1) based on the following parameters: 1) Distribution of quality score (Phred score) per base, 2) Distribution of quality scores of the raw sequences, 3) Distribution of duplicated reads, and 4) GC content (%) distribution of the raw sequences. The analyses indicate that our sequencing was of high quality, as illustrated by a typical sample in Fig. 1. First, we evaluated quality of sequences by plotting the boxplots for Phred scores per base (Fig. 1a) and distribution of average Phred scores of all sequences (Fig. 1b). Both the Phred score per base and average Phred score of all sequences are generally above 28, the default threshold for a high-quality base call. Second, we confirmed the diversity of library sequences by plotting a duplication plot (Fig. 1c), which shows the proportion of library sequences with different levels of duplication. The blue line shows the full sequence set and how its duplication levels are distributed. The red line shows the sequences after removal of duplicated reads, i.e., de-duplicated set. In both cases, most sequences fall into the far left of the plot, indicating that the library was diverse and most sequences occurred only a small number of times in the final set. The blue lines are relatively flat compared to red line, thus do not indicate significant contamination or any severe technical duplication. The duplicated reads constitute about 30% of all reads (Fig. 1c). Third, we compared the distribution of sequence GC content (%) to the theoretical normal distribution (Fig. 1d). The two curves substantially overlap with each other, indicating that the adaptors removal was successful and there is no obvious contamination.
These reads were then aligned to the zebrafish genome, release 89 <ftp://ftp.ensembl.org/pub/release-89/> using STAR v2.5.3a18 (see Code Availability 1). The aligned reads were then sorted and indexed using SAMTools v1.4.119. The key outputs of alignment are shown in Table 3. These include the number of input reads, average input read length, uniquely mapped read, and mismatch rate per base. The uniquely mapped reads rate is around 70% and the mismatch rate per base is only about 1%. These observations indicate that the quality of all sequencing results was satisfactory.
Read counts, normalization, and differential gene expression
To determine transcript abundance, we used the aligned reads to calculate the Fragments Per Kilobase of transcript per Million mapped reads (FPKM) by Cufflinks v2.2.120(see Code Availability 1). The percentage of genes with nonzero FPKM in all samples are around 65% and shown in Table 4. We compared the FPKMs of the three biological replicates of the same genotype in pairwise scatterplots (Fig. 2). Most values on the scatterplots fall along the y=x line, supporting the consistency among the biological replicates.
We also summarized read counts aligned to protein-coding genes in the gene transfer format (gtf) file from Ensemble Genome release 89 by featureCounts v1.5.221 (see Code Availability 1). The output of featureCounts was fed to the package DESeq2 v1.16.122 in R statistical environment (see Code Availability 1) to identify differentially expressed genes. This package normalized the count data against library size, log2-transformed the normalized data, calculated fold change between samples, inferred significance using a model with negative binomial distribution, and adjusted for multiple hypothesis testing. The resulting log2 fold changes of all samples were then plotted in an MA plot (Fig. 3a), which shows the fold-change values of genes against the mean of normalized counts of all samples. These fold changes show a typical pattern for genome-wide experiments: the majority of the genes did not show a significant change in expression (log2 fold change about 0); whereas only a few genes showed a significant differential expression (Wald significance test22; adjusted P-value less than 0.1). These differentially-expressed genes are highlighted as red points in the plot. If the genes have a log2 fold change either greater than 2 or less than −2, they are plotted as open triangles.
Then, we further validate the reproducibility across the biological replicates by hierarchical clustering (HC) (Fig. 3b) and principle component analysis (PCA) (Fig. 3c), using the replicates’ log2-transformed counts after library-size normalization and variance stabilization (see Code Availability 1). In the HC heatmap (Fig. 3b), the 3 WT samples (WR1, WR2, WR3) are clustered together and the 3 pde6c samples (PR1, PR2, PR3) are clustered together. In the PCA plot (Fig. 3c), these WT and pde6c samples are well separated from each other by their difference in the first PC, which explained 76% of the variance in the expression variables. Together, these sequencing analyses suggest that our RNA-seq had consistently measured the WR and PR samples, and effectively captured their biological difference.
Code availability
1. Code used for quality assessment and data analysis in this study is available at: https://gist.github.com/coralzhang/fc4e51609ff316486c1682feed6404a9/471afb15b7f7b230a38e4eedaadcf5f679412a07.
Data Records
Raw FASTQ files for the RNA-seq libraries were deposited to the NCBI Gene Expression Omnibus (GEO) with accession number GSE101544 and have been assigned BioProject accession PRJNA394760 (Data Citation 1) and SRA number SRP112616 (Data Citation 2). Key outputs of the analysis were deposited to the GEO with the same accession number (Data Citation 1) providing access to all relevant data files.
Technical Validation
To validate the RNA-Seq results, we collected independent samples and analyzed the expression difference of selected genes by RT-qPCR. First, 30 eyes were microdissected from 15 larvae in both WT and pde6c groups. Then, total RNAs were extracted from these eye samples and reverse-transcribed into cDNAs as described7. These cDNAs were used for RT-qPCR reactions using SYBR Green PCR Master Mix (Life Technologies, Grand Island, NY). The RT-qPCR primers were designed with the RealTime PCR program (Integrated DNA Technologies, Coralville, IA) (Table 1). These RT-qPCR reactions were run on an Applied Biosystems 7,300 Real-Time PCR System (Life Technologies, Grand Island, NY) according to manufacturer’s instructions.
In the RT-qPCR analysis, we measured the expression of seven phototransduction genes: rhodopsin (rho), four cone opsins (red, green, blue, and uv), and two transducins (gnat1 and gnat2). β-actin was used as the normalization control. We collected gene expressions in duplicates from independently-dissected eyes, and calculated a fold change between pde6c and WT samples for each duplicate using the ΔΔCt method23. Then, we averaged the fold changes of the duplicates. The resulting fold changes highly correlated to those fold changes obtained by RNA-Seq (Fig. 4; r=0.91, t-test P value=0.0039). This high correlation strongly suggests that the RNA-Seq data effectively captured the gene-expression change of the pde6c retina. This RNA-Seq dataset will likely facilitate the characterization of the molecular defects in different cell types of the pde6c retina.
Additional information
How to cite this article: Zhang, L. et al. Expression profiling of the retina of pde6c, a zebrafish model of retinal degeneration. Sci. Data 4:170182 doi: 10.1038/sdata.2017.182 (2017).
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
References
Stearns, G., Evangelista, M., Fadool, J. M. & Brockerhoff, S. E. A mutation in the cone-specific pde6 gene causes rapid cone photoreceptor degeneration in zebrafish. J. Neurosci. 27, 13866–13874 (2007).
Hu, M. & Easter, S. S. Retinal Neurogenesis: The formation of the initial central patch of postmitotic cells. Dev. Biol. 207, 309–321 (1999).
Schmitt, E. A. & Dowling, J. E. Comparison of topographical patterns of ganglion and photoreceptor cell differentiation in the retina of the zebrafish, Danio rerio. J. Comp. Neurol. 371, 222–234 (1996).
Easter, S. S. & Nicola, G. N. The development of vision in the zebrafish (Danio rerio). Dev. Biol. 180, 646–663 (1996).
Zhang, L. et al. A naturally-derived compound Schisandrin B enhanced light sensation in the pde6c Zebrafish model of retinal degeneration. PLoS ONE 11, e0149663 (2016).
Morris, A. C., Scholz, T. L., Brockerhoff, S. E. & Fadool, J. M. Genetic dissection reveals two separate pathways for rod and cone regeneration in the teleost retina. Dev. Neurobiol. 68, 605–619 (2008).
Leung, Y. F. & Dowling, J. E. Gene expression profiling of Zebrafish embryonic retina. Zebrafish 2, 269–283 (2005).
Leung, Y. F., Ma, P. & Dowling, J. E. Gene expression profiling of zebrafish embryonic retinal pigment epithelium in vivo. Invest. Ophthalmol. Vis. Sci. 48, 881–890 (2007).
Leung, Y. F., Ma, P., Link, B. A. & Dowling, J. E. Factorial microarray analysis of zebrafish retinal development. Proc. Natl. Acad. Sci. USA 105, 12909–12914 (2008).
Zhang, L. et al. Expression profiling of the RPE in zebrafish smarca4 mutant revealed altered signals that potentially affect RPE and retinal differentiation. Mol. Vis. 20, 56–72 (2014).
Hensley, M. R. et al. Cellular expression of Smarca4 (Brg1)-regulated genes in zebrafish retinas. BMC Dev. Biol. 11, 45 (2011).
Westerfield, M. The zebrafish book : a guide for the laboratory use of zebrafish (Danio rerio University of Oregon Press, 2000.
Nusslein-Volhard, C. & Dahm, R. Zebrafish : a practical approach. (Oxford University Press, 2002).
Brockerhoff, S. E. Measuring the optokinetic response of zebrafish larvae. Nat. Protoc. 1, 2448–2451 (2006).
Zhang, L. & Leung, Y. F. Microdissection of zebrafish embryonic eye tissues. J. Vis. Exp., pii 2028 10.3791/2028 (2010).
Auer, P. L. & Doerge, R. W. Statistical design and analysis of RNA sequencing data. Genetics 185, 405–416 (2010).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Trapnell, C. & Salzberg, S. L. How to map billions of short reads onto genomes. Nat. Biotechnol. 27, 455–457 (2009).
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2(−Delta Delta C(T)) Method. Methods 25, 402–408 (2001).
Data Citations
Acknowledgements
We thank Purdue Genomics Core Facility for their technical assistance on RNA sequencing. Liyun Zhang was partially supported by a Charles Kelman M.D. Postdoctoral Fellowship from the International Retinal Research Foundation. Xinlian Zhang and Wenxuan Zhong was partially supported by National Science Foundation (Grant No. DMS-1440037 DMS-1438957) and NIH (R01GM113242, R01GM122080). Chi Pui Pang was partially supported by a Direct grant (Grant No. 2041771) from the Medical Panel, The Chinese University of Hong Kong, and a General Research Fund (Grant No. 2140880) from the Research Grants Council of Hong Kong. Mingzhi Zhang was partially supported by the National Natural Science Foundation of China (Grant No. 81470636).
Author information
Authors and Affiliations
Contributions
L.Z. and Y.F.L. designed the study; L.Z. conducted the experiments; L.Z., X.Z., G.Z., C.P.P., Y.F.L., M.Z., and W.Z. analysed the data; L.Z., X.Z., Y.F.L., and W.Z. wrote the paper.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing financial interests.
ISA-Tab metadata
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article.
About this article
Cite this article
Zhang, L., Zhang, X., Zhang, G. et al. Expression profiling of the retina of pde6c, a zebrafish model of retinal degeneration. Sci Data 4, 170182 (2017). https://doi.org/10.1038/sdata.2017.182
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/sdata.2017.182
- Springer Nature Limited