The Role of Spike-In Standards in the Normalization of RNA-seq

  • Davide Risso
  • John Ngai
  • Terence P. Speed
  • Sandrine Dudoit

Abstract

Normalization of RNA-seq data is essential to ensure accurate inference of expression levels, by adjusting for sequencing depth and other more complex nuisance effects, both within and between samples. Recently, the External RNA Control Consortium (ERCC) developed a set of 92 synthetic spike-in standards that are commercially available and relatively easy to add to a typical library preparation. In this chapter, we compare the performance of several state-of-the-art normalization methods, including adaptations that directly use spike-in sequences as controls. We show that although the ERCC spike-ins could in principle be valuable for assessing accuracy in RNA-seq experiments, their read counts are not stable enough to be used for normalization purposes. We propose a novel approach to normalization that can successfully make use of control sequences to remove unwanted effects and lead to accurate estimation of expression fold-changes and tests of differential expression.

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Davide Risso
    • 1
  • John Ngai
    • 2
  • Terence P. Speed
    • 1
    • 3
    • 4
  • Sandrine Dudoit
    • 5
  1. 1.Department of StatisticsUniversity of CaliforniaBerkeleyUSA
  2. 2.Department of Molecular and Cell Biology, Helen Wills Neuroscience Institute, and Functional Genomics LaboratoryUniversity of CaliforniaBerkeleyUSA
  3. 3.Bioinformatics DivisionWalter and Eliza Hall InstituteMelbourneAustralia
  4. 4.Department of Mathematics and StatisticsThe University of MelbourneVictoriaAustralia
  5. 5.Division of Biostatistics and Department of StatisticsUniversity of CaliforniaBerkeleyUSA

Personalised recommendations