After the publication of this work [1] it was noticed that there were typographical errors in the following equations: equation 5 in column 2, equation 7 in column 2, equation 8 in column 1.

The bracket was placed incorrectly, so it should read:

\ log _2 (Y_{gij} + 0.5) rather than (\ log _2 Y_{gij} + 0.5)

It was brought to our attention that a new submission to the webtool for the eXpress algorithm for the ENCODE GM12878 dataset performs better than what is reported in the paper. While looking into the reason for this discrepancy we found two errors. First, the commands and parameter settings provided in the log information on the webtool were incorrect. Second, we realized that we ran the eXpress submission differently from the other methods for this particular dataset. One cause for the discrepancy was the accidental use of a different transcript FASTA file. We reran eXpress controlling for these differences and confirmed that better results are attained. Row 2 in Table 1 is changed, and the updated row is below.

Table 1 Summarized metrics for analyzed pipelines based on an experimental dataset

The comparative figures for GM12878 change (panel A Figures 3, 4, 5, 6 and Additional file 1: Figure S5). The new figures are below.

Fig. 3
figure 1

Standard deviations of transcript quantifications based on a an experimental dataset (GM12878) and b a simulation dataset (one of the cell lines). Seven quantification methods are shown here

Fig. 4
figure 2

Proportions of discordant expression calls based on a an experimental dataset (GM12878) and b a simulation dataset (one of the cell lines). Seven quantification methods are shown here

Fig. 5
figure 3

Proportion differences of transcript quantifications in genes with only two annotated transcripts based on a an experimental dataset (GM12878) and b a simulation dataset (one of the cell lines). Seven quantification methods are shown

Fig. 6
figure 4

ROC curves indicating performance of quantification methods based on differential expression analysis of a an experimental dataset and b a simulation dataset. Seven quantification methods are shown. FP false positive, TP true positive

The following statements should now read:

  • Performance was generally poor, with one method clearly underperforming and RSEM slightly outperforming the rest.

  • In the first dataset, Flux Capacitor clearly underperform s compared with the other methods in the regions with most data (A between 3 and 8).

  • Here we see Flux Capacitor underperforming and RSEM slightly outperforming the other methods in the simulation dataset.

  • With the exception of the underperforming Flux Capacitor, we found that the other algorithms performed similarly.

The eXpress entry in the webtool, including the log-file entry which includes the scripts, has also been updated. You can see this in the ENCODE: 2 reps, high depth tab here: http://rafalab.rc.fas.harvard.edu/rnaseqbenchmark

The authors apologize for this error.