RiboStreamR: a web application for quality control, analysis, and visualization of Ribo-seq data

Perkins, Patrick; Mazzoni-Putman, Serina; Stepanova, Anna; Alonso, Jose; Heber, Steffen

doi:10.1186/s12864-019-5700-7

RiboStreamR: a web application for quality control, analysis, and visualization of Ribo-seq data

Research
Open access
Published: 06 June 2019

Volume 20, article number 422, (2019)
Cite this article

Download PDF

You have full access to this open access article

BMC Genomics Aims and scope Submit manuscript

RiboStreamR: a web application for quality control, analysis, and visualization of Ribo-seq data

Download PDF

Patrick Perkins¹,
Serina Mazzoni-Putman²,
Anna Stepanova²,
Jose Alonso² &
…
Steffen Heber^1,3

10k Accesses
11 Citations
5 Altmetric
Explore all metrics

Abstract

Background

Ribo-seq is a popular technique for studying translation and its regulation. A Ribo-seq experiment produces a snap-shot of the location and abundance of actively translating ribosomes within a cell’s transcriptome. In practice, Ribo-seq data analysis can be sensitive to quality issues such as read length variation, low read periodicities, and contaminations with ribosomal and transfer RNA. Various software tools for data preprocessing, quality assessment, analysis, and visualization of Ribo-seq data have been developed. However, many of these tools require considerable practical knowledge of software applications, and often multiple different tools have to be used in combination with each other.

Results

We present riboStreamR, a comprehensive Ribo-seq quality control (QC) platform in the form of an R Shiny web application. RiboStreamR provides visualization and analysis tools for various Ribo-seq QC metrics, including read length distribution, read periodicity, and translational efficiency. Our platform is focused on providing a user-friendly experience, and includes various options for graphical customization, report generation, and anomaly detection within Ribo-seq datasets.

Conclusions

RiboStreamR takes advantage of the vast resources provided by the R and Bioconductor environments, and utilizes the Shiny R package to ensure a high level of usability. Our goal is to develop a tool which facilitates in-depth quality assessment of Ribo-seq data by providing reference datasets and automatically highlighting quality issues and anomalies within datasets.

QoRTs: a comprehensive toolset for quality control and data processing of RNA-Seq experiments

Article Open access 19 July 2015

Estimation of ribosome profiling performance and reproducibility at various levels of resolution

Article Open access 10 May 2016

QuickRNASeq lifts large-scale RNA-seq data analyses to the next level of automation and interactive visualization

Article Open access 08 January 2016

Background

The rapid developments of next-generation sequencing technologies have made it possible to probe gene transcription reliably on a genome-wide level [1]. However, transcript abundance is often an insufficient proxy for protein abundance [2]. Ribosome profiling, also known as Ribo-seq, has been developed to close this gap [3, 4]. In a Ribo-seq experiment, the mRNA-ribonucleoprotein complexes formed by translating ribosomes are isolated and subjected to nuclease digestion. The mRNA fragments that are associated with ribosomes are protected from digestion and can be isolated and sequenced. Each of the resulting sequences correspond to the position of an active ribosome on the translated transcript, see Fig. 1. Typically, a Ribo-seq experiment also includes an accompanying RNA-seq component where the abundance of all transcripts is measured. Having access to both types of data allows users to estimate the number of ribosomes that are associated with an individual transcript. Hence, Ribo-seq can be used to infer high-resolution information about ribosome occupancy, translation initiation, elongation, and termination, as well as translational efficiency and translational regulation. Ribo-seq has become a popular research tool, and the amount of publicly available Ribo-seq data is rapidly increasing. Comprehensive data sets are now available for all major model organisms including yeast, bacteria, human, mouse, worm, fly, zebrafish, and Arabidopsis [3, 5,6,7,8,9,10,11]. The complicated experimental protocol in combination with the lack of standardization make Ribo-seq data analysis and quality control challenging tasks that consist of multiple steps and require a considerable amount of expert knowledge. Initial data pre-processing steps usually include removal of adapter sequences, low-quality reads, ribosomal RNAs, and reads that map to multiple locations. The remaining reads are then mapped to a genome or transcriptome using mapping tools such as Tophat or STAR [12, 13]. After preprocessing and mapping, a typical Ribo-seq analysis begins with multiple quality control steps, including investigating read length distributions (RLDs), assessing trinucleotide footprint periodicity, and computing read counts for various feature types, such as coding sequences, 3′ and 5′ UTRs, and non-coding RNAs. Often, meta-gene plots are computed as well. Meta-gene plots are visualizations of the aggregated read densities over a set of genes, and can aid in investigating ribosome occupancy patterns during ribosome initiation, elongation, and termination. Once a thorough quality control analysis has been performed, users often annotate translated features (ORFs, translation start and stop sites) and search for differentially translated genes. Such genes show significant changes in ribosome occupancy across different treatments, tissues, genotypes, or time-points relative to their RNA-seq levels. Additionally, Ribo-seq data can be used to investigate various aspects of translational control, such as ribosome pausing, codon usage, alternative splicing and nonsense mediated decay [14,15,16].

The various Ribo-seq applications have given rise to several software packages and statistical methods for data processing and quality control. For example, RiboProfiling, riboSeqR, and SystemPipeR are R packages which provide various functions for building Ribo-seq data analysis workflows and performing data QC [17,18,19]. Additionally, RiboGalaxy is a web based platform which hosts various standalone tools, including RiboTools, Proteoformer, plastid, and RUST, which can be used to process, analyze, and visualize Ribo-seq data [20,21,22,23,24]. Riboviz is another tool which allows browser based comparative analyses of published ribosome-profiling datasets [25]. See [26] for a detailed review of computation resources for ribosome profiling. A comprehensive Ribo-seq analysis can typically involve multiple of these software packages, and often requires a considerable amount of expert knowledge, as well as experience in programming or command line usage. Hence, there is a distinct need for a flexible and user-friendly environment for comprehensive Ribo-seq analyses which is accessible to mainstream biologists who lack training in bioinformatics. The goal of the riboStreamR platform is to provide such an environment. RiboStreamR is a web application written in R that hosts a suite of custom tools for Ribo-seq data analysis. These tools aid in data processing, quality control, and visualization of Ribo-seq and RNA-seq data, and facilitate user customization and reproducibility.

The rest of this paper is organized as follows: the implementation section discusses the design of the riboStreamR environment, including data processing steps and platform design. Next, the results section discusses the basic functionality and performance of riboStreamR, including graphic customization and anomaly detection. In the discussion section, we describe advantages and shortcomings of our web platform, as well as our planned future work. A worked example of a riboStreamR data quality analysis is also provided in Additional file 1.

Implementation

Environment

RiboStreamR is publicly available as a web application. We have chosen R and Bioconductor as the basis for our implementation. R is an open source and extremely popular programming language for applications in the field of bioinformatics. The Bioconductor project within R provides various previously established functions and packages for next generation sequencing (NGS) data analysis [27]. In addition, it also offers vast options for customization of graphics and visualizations, and it provides easy access to a wealth of statistical and machine learning tools. The R code for the application is available at https://github.com/pjperki2/riboStreamR.

In order to make riboStreamR an accessible and user-friendly web application, the Shiny package was used. Shiny is an R package which supports the development of interactive web applications by converting R code into CSS and HTML. The web interface of riboStreamR gives the user the ability to interact with their data via a streamlined graphical user interface instead of a command line or programming language. This supports our aim to make data analysis more manageable for researchers that lack experience in programming. Additionally, a server-based platform places the computational load on the host server instead of the user’s computer and removes the burden of the user having to download and install multiple different pieces of software or R packages.

Data processing and platform design

An analysis with riboStreamR begins at the ‘Data Upload and Preprocessing’ page, where the user uploads sets of aligned reads (.bam files), as well as a genome sequence file (.fasta file), and a reference genome annotation file (.gff3 or .gtf file). The flow of data in a typical Ribo-seq experiment is depicted in Fig. 2. BAM files for both Ribo-seq and RNA-seq data are supported. Each BAM file is read into the R environment as a distinct GRanges objects, from the GenomicRanges package [28], while multiple BAM files can be combined and represented as a GRangesList object. When initially generated, these matrix-like objects contain information on each individual alignment in the BAM files, including the chromosome, strand, and start/end position. A set of additional descriptive attributes (see Table 1) are then computed for each alignment. The attributes are used to dynamically filter and subset the data in order to increase graphical customization. Subsequently, p-site computation and adjustment is performed for each Ribo-seq read. The p-site is the position at which ribosomes process codons, and, as compared to the start or end position of the read, is a more accurate approximation of the specific site where the ribosome is interacting with the mRNA molecule [11]. This adjusted position can then be used to determine whether the ribosome is ‘in-frame’ with a corresponding start codon. Figure 2d shows an example how the p-site is inferred for various read lengths.

Table 1 Description of alignment attributes. Each attribute is contained within a separate metadata column in the GRanges object

Full size table

Once data pre-processing is completed for all BAM files, the analysis proceeds with downstream tools which perform quality control, analysis, and visualizations of the processed data. All tools in our platform are essentially standalone applications, and therefore the user may choose to use them in any order.

Results

Functionality

RiboStreamR includes 10 individual tools which facilitate quality control, downstream analysis, and result visualization. The individual tools can be accessed through tabs across the top of the application. Each tool consists of a toolbar, where graphical parameters can be adjusted, and an output pane, which displays the graphical output of the tool. A ‘Submit’ button at the bottom of each toolbar generates output for the selected set of parameters; see Fig. 3 for an example. The tools are described in Table 2 and depicted in Fig. 3. The platform is not structured as a step-by-step pipeline, but as a set of standalone tools which can all be accessed once data input and processing is complete. While facilitating a comprehensive evaluation of Ribo-seq data quality is the main intention of the platform, riboStreamR also supports the analysis of RNA-seq experiments, including differential expression analyses. Our tools use several previously established R packages for NGS applications including GenomicRanges, GenomicFeatures, and GenomicAlignments for the storage and manipulation of the alignments; ShortReads, systemPipeR, BioStrings, riboSeqR, and RSamTools for the processing of the alignments; edgeR for inferring differentially translated genes, and ggplot2, cowplot, and grid for producing the wide range of different visualizations [28,29,30,31,32,33]. A case study, which demonstrates the use of the platform’s tools within the context of a typical Ribo-seq quality analysis has been provided in Additional file 1.

Table 2 Description of Tools within riboStreamR

Full size table

Graphic customization

RiboStreamR facilitates graphical customization via various options for plot aesthetic and layout, as well as through dynamic filtering and sub-setting of data. Visualizations are customized through the use of the toolbar, which is located on the left side or bottom of each tool. The toolbar has three different types of adjustable parameters, see Fig. 4 for examples. Filtering parameters allow users to select the alignments they want to include in their analysis; alignments can be filtered by any of the attributes listed in Table 1. Organizational parameters change how the selected alignments are grouped and positioned within the output. For example, users may wish to compare the alignments from one sample against alignments from all other samples combined, or they might want to compare subsets across a specific set of attributes, such as feature type, read length, and GC content. Plotting parameters affect the appearance or dimension of the output. Examples include adjustable axis values, axis labels, color schemes, and line types. Through the adjustment of these three types of customization parameters, users are able to create presentation quality graphics which are specific to their exact experimental inquiries. The output from each individual tool is also downloadable as a PDF image.

Reference datasets and anomaly detection

RiboStreamR allows users to integrate a set of high-quality reference datasets, together with their own data, into their analyses. The reference datasets were collected from Arabidopsis roots and shoots, and were generated using a protocol which yields high-resolution and high-quality Ribo-seq data [11]. The reference datasets can be chosen as an input option from the data upload page, and included as a sample in any of the downstream tools. In addition, these datasets can be used as references for anomaly detection. One of the goals of our platform is to identify abnormal datasets. We have implemented four independent anomaly detection strategies with the Summary Table tool:

1.
Anomaly detection based on expert defined thresholds for read periodicity, and percent of reads mapped to rRNA, tRNA, and CDS regions;
2.
Outlier detection based on interquartile ranges derived from user samples, using Tukey’s fence [34].
3.
Outlier detection based on user-selected controls. We compute summary QC metrics (including periodicity, feature percentages, and percentage of uniquely mapped reads) from the selected controls, and compare them to the corresponding values derived from the user’s samples. Outliers are defined using percent error calculation, with a threshold of 25% difference [35].
4.
Outlier detection based on our supplied reference data sets, using the method describe in item 2 above. We have precomputed a large set of quality metrics for various analysis parameter configurations to facilitate fast and accurate comparisons between the user-samples and our reference data.

Suspicious values found using these methods are flagged and reported in a separate table within the output pane.

Report generation

The Report Generation tool within riboStreamR produces a comprehensive R Markdown report of the performed quality control and data analysis results. This tool can run and summarize a full analysis in one click, or it can be used to combine the output of any number of the individual tools into one PDF or HTML document. Textual notes or summaries which describe the processing or analysis steps may also be appended to each graphic. Based on the chosen configuration, the generated reports include important information about the selected analysis parameters, as well as user-provided text as notes, figure summaries, or bibliographies for inclusion in publications. Additionally, individual graphics generated from each tool maybe downloaded independently within each tool for incorporation in publications or research papers.

Performance

A complete Ribo-seq analysis using riboSeqR typically takes around 30 min per sample, where we assume that the underlying BAM file contains around 20 million reads. This time does not include fastq file processing and read mapping steps, which are not handled by our platform at this time. The main performance bottlenecks in the riboStreamR framework are data upload, data pre-processing, and output generation. The BAM file will be read into R as a 400 MB initial GRanges object approximately. The process of uploading data from the user’s machine to the web server took us approximately 15 min per sample. Once the data are uploaded, the pre-processing steps, which include computing alignment attributes and p-site adjustments, typically take between 10 and 15 min per sample. The additional alignment attributes computed during preprocessing typically result in a about five-fold increase in the size of the initial GRanges objects. Finally, the time it takes to generate output graphics is usually less than one minute per sample and tool. Certain tools, including the summary table and the meta distribution plots, utilize an alignment subsampling strategy in order to reduce the time it takes to produce output even further.

Discussion

Need for a web application based Ribo-seq analysis platform

As described above, various tools for streamlining quality control, analysis, and visualization of Ribo-seq data exist. Although extremely useful, the environments, user-interfaces, and infrastructures, of these tools vary considerably, and some of them require the knowledge of command line usage, R, or some other programming language. There exists a need for a platform which provides a consolidated suite of analysis tools that are accessible through a user-friendly GUI. The RiboGalaxy [20] server is a tool that addresses some of these pain points. However, despite its popularity and usefulness, RiboGalaxy relies on combining processing steps and outputs from various third party tools, and does not focus on consistency and compatibility amongst the different tools. Our riboStreamR application is designed to fill this gap. The platform has the advantage of being an open-ended environment in which it is not a requirement for the user to follow a specific step-by-step procedure to ensure output/input compatibility. RiboStreamR takes advantage of the notable flexibility and functionality that R provides, but in a manner which removes the need for users to have programming experience in R, or another programming language. Of course, there are also certain advantages of alternative software tools over riboStreamR. For example, providing more upstream tools, such as fastq file QC; supporting a wider range of QC metrics, such as codon density analysis; functionality to map footprints to a reference genomes; and supporting integration with command line tools or other NGS software.

Future work

We plan to expand the utility of riboStreamR as a quality control and data analysis platform. As riboStreamR is still currently in development, there are many prospective features for use in both upstream processing and downstream analysis which are not yet fully implemented. These include additional tools for fastq quality analyses, optimized algorithms to map fastq files of ribosomal footprints to a reference genome, and additional downstream analysis tools to investigate codon usage, and to identify ribosome pause-sites and functional uORFs. Currently our tool has primarily been tested using data from Arabidopsis thaliana, but our goal is to perform extensive validation experiments using data from other species and to include corresponding datasets as references. A further research goal is the development and integration of more robust anomaly detection algorithms tailored for Ribo-seq data. We also plan to expand riboStreamR’s use of automation to simplify user’s analyses. Examples include the automatic generation of textual summaries and the automatic optimization of plotting parameters. Lastly, we plan to further optimize the platform’s underlying R code, for example by compressing sequence attributes to decrease memory consumption.

Conclusion

The riboStreamR platform aims to harness the wealth of resources provided by the R and Bioconductor universe within a user-friendly web application. In contrast to the pipeline infrastructure of many NGS analysis systems, riboStreamR is designed as a suite of tools, where each tool is available to the user once their data have been uploaded and preprocessed. RiboStreamR is designed around an easy-to-use GUI which provides users with various options for graphical customization via dynamic filtering and arranging of data. This more accessible environment eases many of the burdens that typically make Ribo-seq data analysis a daunting task. Further improvements of platform automation and anomaly detection algorithms are priorities for future work.

Availability and requirements

Project name: riboStreamR Ribo-seq analysis platform.

Project home page: http://uhura.cos.ncsu.edu:3842/

Archived version: riboStreamR.

Operating system(s): Platform independent.

Programming language: R.

Other requirements: R version ≥3.2, Bioconductor version ≥3.2.

License: Artistic-2-0.

Any restrictions to use by non-academics: none.

Abbreviations

BAM :: Binary version of sequence alignment map format
CSS:: Cascading Style Sheets is a style sheet language used for describing the presentation of a document written in a markup language
FASTQ :: short read sequence file format
HTML:: Hypertext Markup Language is the standard markup language for creating web pages and web applications
NGS :: Next generation sequencing
PDF:: Portable document format
QC:: Quality control
Ribo-Seq :: NGS profiling of mRNA populations bound to ribosomes
RLD:: Read length distribution
RNA-Seq :: NGS profiling of mRNA
SAM :: Sequence alignment map format
TSC:: Translation stop coon
TSS:: Translation start site
UTR:: Untranslated region

References

Wang Z, Gerstein M, Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nature Rev Genet. 2009;10:57–63.
Article CAS Google Scholar
Greenbaum D, Colangelo C, Williams K, Gerstein M. Comparing protein abundance and mRNA expression levels on a genomic scale. Genome Biol. 2003;4:117.
Article Google Scholar
Ingolia NT, Ghaemmaghami S, Newman JR, et al. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science. 2009;324:218–23.
Article CAS Google Scholar
Guo H, Ingolia NT, Weissman JS, Bartel DP. Mammalian microRNAs predominantly act to decrease target mRNA levels. Nature. 2010;466:835–40.
Article CAS Google Scholar
Li GW, Oh E, Weissman JS. The anti-Shine-Dalgarno sequence drives translational pausing and codon choice in bacteria. Nature. 2009;484:538–41.
Article CAS Google Scholar
Michel AM, et al. Observation of dually decoded regions of the human genome using ribosome profiling data. Genome Res. 2012;22:2219–29. https://doi.org/10.1101/gr.133249.111.
Article CAS PubMed PubMed Central Google Scholar
Ingolia NT, Lareau LF, Weissman JS. Ribosome profiling of mouse embryonic stem cells reveals the complexity and dynamics of mammalian proteomes. Cell. 2011;147(4):789–802.
Article CAS Google Scholar
Stadler M, Artiles K, Pak J, Fire A. Contributions of mRNA abundance, ribosome loading, and post- or peri-translational effects to temporal repression of C. elegans heterochronic miRNA targets. Genome Res. 2012;22:2418–26.
Article CAS Google Scholar
Dunn JG, Foo CK, Belletier NG, Gavis ER, Weissman JS. Ribosome profiling reveals pervasive and regulated stop codon readthrough in Drosophila melanogaster. Elife. 2013;2:e01179.
Article Google Scholar
Bazzini AA, Lee MT, Giraldez AJ. Ribosome profiling shows that miR-430 reduces translation before causing mRNA decay in zebrafish. Science. 2012;336:233–7.
Article CAS Google Scholar
Hsu P, Calviello WHY, Li FW, Rothfels C, Ohler U, Benfey P. Super-resolution ribosome profiling reveals unannotated translation events in Arabidopsis. Proc Natl Acad Sci. 1016:113. https://doi.org/10.1073/pnas.1614788113.
Article CAS Google Scholar
Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25:1105–11.
Article CAS Google Scholar
Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21.
Article CAS Google Scholar
Li GW, Oh E, Weissman JS. The anti-Shine-Dalgarno sequence drives translational pausing and codon choice in bacteria. Nature. 2012;484:538–41. https://doi.org/10.1038/nature10965.
Article CAS PubMed PubMed Central Google Scholar
Juntawong P, Girke T, Bazin J, Bailey-Serres J. Translational dynamics revealed by genome-wide profiling of ribosome footprints in Arabidopsis. Proc Natl Acad Sci U S A. 2014;111:E203–12. https://doi.org/10.1073/pnas.1317811111.
Article CAS PubMed Google Scholar
Smith JE, Baker KE. Nonsense-mediated RNA decay - a switch and dial for regulating gene expression. Bioessays. 2015;37:612–23. https://doi.org/10.1002/bies.201500007.
Article CAS PubMed PubMed Central Google Scholar
Popa A, Lebrigand K, Paquet A, Nottet N, Robbe-Sermesant K, Waldmann R, Barbry P. RiboProfiling: a Bioconductor package for standard Ribo-seq pipeline processing. F1000Res. 2016;5. https://doi.org/10.12688/f1000research.8964.
Chung BY, Hardcastle TJ, Jones JD, Irigoyen N, Firth AE, Baulcombe DC, Brierley I. The use of duplex-specific nuclease in ribosome profiling and a user-friendly software package for Ribo-seq data analysis. Rna. 2015;21(10):1731–45. https://doi.org/10.1261/rna.052548.115.
Article CAS PubMed PubMed Central Google Scholar
Backman TWH, Girke T. systemPipeR: NGS workflow and report generation environment. BMC Bioinformatics. 2016;17:388. https://doi.org/10.1186/s12859-016-1241-0.
Article Google Scholar
Michel AM, Mullan JP, Velayudhan V, Oconnor PB, Donohue CA, Baranov PV. RiboGalaxy: a browser based platform for the alignment, analysis and visualization of ribosome profiling data. RNA Biol. 2016;13(3):316–9. https://doi.org/10.1080/15476286.2016.1141862.
Article PubMed PubMed Central Google Scholar
Legendre R, Baudin-Baillieu A, Hatin I, Namy O. RiboTools: a galaxy toolbox for qualitative ribosome profiling analysis. Bioinformatics. 2015;31(15):2586–8. https://doi.org/10.1093/bioinformatics/btv174.
Article CAS PubMed Google Scholar
Crappé J, Ndah E, Koch A, Steyaert S, Gawron D, De Keulenaer S, Menschaert G. PROTEOFORMER: deep proteome coverage through ribosome profiling and MS integration. Nucleic Acids Res. 2014;43(5). https://doi.org/10.1093/nar/gku1283.
Article Google Scholar
Dunn JG, Weissman JS. Plastid: nucleotide-resolution analysis of next-generation sequencing and genomics data. BMC Genomics. 2016;17:958.
Article Google Scholar
O’Connor PB, Andreev DE, Baranov PV. Comparative survey of the relative impact of mRNA features on local ribosome profiling read density. Nat Commun. 2016;7:12915. https://doi.org/10.1038/ncomms12915.
Article CAS PubMed PubMed Central Google Scholar
Carja O, Xing T, Plotkin JB, Shah P. Riboviz: analysis and visualization of ribosome profiling datasets. BMC Bioinformatics. 2017;18:461. https://doi.org/10.1101/100032.
Article CAS PubMed PubMed Central Google Scholar
Wang H, Wang Y, Xie Z. Computational resources for ribosome profiling: from database to web server and software. Brief Bioinform. 2017. https://doi.org/10.1093/bib/bbx093.
Article Google Scholar
Huber W, Carey VJ, Gentleman R, Anders S, Carlson M, Carvalho BS, Bravo HC, Davis S, Gatto L, Girke T, Gottardo R, Hahne F, Hansen KD, Irizarry RA, Lawrence M, Love MI, MacDonald J, Valerie O, Oleś AK, Pagès H, Reyes A, Shannon P, Smyth GK, Tenenbaum D, Waldron L, Morgan M. Orchestrating high-throughput genomic analysis with Bioconductor. Nat Methods. 2015;12(2):115–21.
Article CAS Google Scholar
Lawrence M, Huber W, Pagès H, Aboyoun P, Carlson M, Gentleman R, Morgan M, Carey V. Software for computing and annotating genomic ranges. PLoS Comput Biol. 2013;9. https://doi.org/10.1371/journal.pcbi.1003118.
Article CAS Google Scholar
Morgan M, Anders S, Lawrence M, Aboyoun P, Pagès H, Gentleman R. ShortRead: a Bioconductor package for input, quality assessment and exploration of high-throughput sequence data. Bioinformatics. 2009;25:2607–8. https://doi.org/10.1093/bioinformatics/btp450.
Article CAS PubMed PubMed Central Google Scholar
Pagès H, Aboyoun P, Gentleman R, Biostrings DRS. Efficient manipulation of biological strings; 2017.
Google Scholar
Morgan M, Pagès H, Obenchain V, Hayden N. Rsamtools: binary alignment (BAM), FASTA, variant call (BCF), and tabix file import; 2017.
Google Scholar
Wickham H. ggplot2: elegant graphics for data analysis. New York: Springer:Verlag; 2009.
Book Google Scholar
Robinson MD, McCarthy DJ, Smyth GK. edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–40.
Article CAS Google Scholar
Hoaglin DC, John W. Tukey and data analysis. Stat Sci. 2003;18(3):311–8.
Article Google Scholar
Abramowitz M, Stegun IA. Handbook of mathematical functions with formulas, graphs, and mathematical tables. New York: Dover; 1972. p. 14.
Google Scholar

Download references

Acknowledgements

The authors would like to thank Cranos Williams for his valuable input in the design of riboStreamR, and Hao Chen for his help in evaluating and testing riboStreamR.

Funding

This research and the publication costs have been supported by the National Science Foundation grant IOS1444561.

Availability of data and materials

The application can be found at uhura.cos.ncsu.edu:3842. The R code can be found at https://github.com/pjperki2/riboStreamR. The reference datasets are deposited in the Gene Expression Omnibus (GEO) database, www.ncbi.nlm.nih.gov/geo (accession no. GSE81332).

About this supplement

This article has been published as part of BMC Genomics Volume 20 Supplement 5, 2019: Selected articles from the 7th IEEE International Conference on Computational Advances in Bio and Medical Sciences (ICCABS 2017): genomics. The full contents of the supplement are available online at https://bmcgenomics.biomedcentral.com/articles/supplements/volume-20-supplement-5.

Author information

Authors and Affiliations

Bioinformatics Research Center, North Carolina State University, Raleigh, NC, 27607, USA
Patrick Perkins & Steffen Heber
Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, 27607, USA
Serina Mazzoni-Putman, Anna Stepanova & Jose Alonso
Computer Science Department, North Carolina State University, Campus Box 8206, Raleigh, NC, 27695-8206, USA
Steffen Heber

Authors

Patrick Perkins
View author publications
You can also search for this author in PubMed Google Scholar
Serina Mazzoni-Putman
View author publications
You can also search for this author in PubMed Google Scholar
Anna Stepanova
View author publications
You can also search for this author in PubMed Google Scholar
Jose Alonso
View author publications
You can also search for this author in PubMed Google Scholar
Steffen Heber
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed in conceiving the idea for riboStreamR, PP implemented the platform in R, and PP and SH wrote the article. All authors have read and approved the final version of this manuscript.

Corresponding author

Correspondence to Steffen Heber.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Case study A case study demonstrating the functionality of the various tools in riboStreamR for QC analysis. (DOCX 80850 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Perkins, P., Mazzoni-Putman, S., Stepanova, A. et al. RiboStreamR: a web application for quality control, analysis, and visualization of Ribo-seq data. BMC Genomics 20 (Suppl 5), 422 (2019). https://doi.org/10.1186/s12864-019-5700-7

Download citation

Published: 06 June 2019
DOI: https://doi.org/10.1186/s12864-019-5700-7

RiboStreamR: a web application for quality control, analysis, and visualization of Ribo-seq data

Abstract

Background

Results

Conclusions

Similar content being viewed by others

QoRTs: a comprehensive toolset for quality control and data processing of RNA-Seq experiments

Estimation of ribosome profiling performance and reproducibility at various levels of resolution

QuickRNASeq lifts large-scale RNA-seq data analyses to the next level of automation and interactive visualization

Background

Implementation

Environment

Data processing and platform design

Results

Functionality

Graphic customization

Reference datasets and anomaly detection

Report generation

Performance

Discussion

Need for a web application based Ribo-seq analysis platform

Future work

Conclusion

Availability and requirements

Abbreviations

References

Acknowledgements

Funding

Availability of data and materials

About this supplement

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Additional files

Additional file 1:

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation