Introduction

The inbred rat strain SHR/Ola (spontaneously hypertensive rat) is frequently used to study cardiovascular and metabolic diseases [1]. In a previous study, we investigated translational regulation in this strain compared to the BN-Lx reference strain [2], in which small mRNA fragments were sequenced alongside the ribosome-protected fragments. However, since these sequencing reads are on average only 40 nucleotides long, we proceeded to generate also long RNA sequencing reads from the same animals. This enables investigation of features that are more depending on mapping length, such as splicing, circular RNA detection, or variant calling.

Main text

Total RNA was isolated from frozen, pulverized liver and heart left ventricle. Here, we describe the ribosomal RNA-depleted RNA sequencing obtained from these samples. An overview with SRA and BioSample identifiers is given in Table 1. The animal identifiers here (BN-03 etc.) correspond to the previously used identifiers [2], which can be found in the BioProject https://www.ncbi.nlm.nih.gov/bioproject/PRJEB7498.

Table 1 Overview of sequenced samples

The Illumina Ribo-Zero kit (Illumina Cat.-No. MRZH11124) was applied to remove ribosomal RNA and the sequencing libraries were prepared using the Illumina TruSeq Stranded mRNA Library Prep Kit (Illumina Cat.-No. RS-122-2101). The raw sequencing reads can be downloaded from the NCBI SRA (short read archive) under the accession number SRP095829 (https://www.ncbi.nlm.nih.gov/sra/?term=SRP095829), or accessed through the NCBI BioProject PRJNA314751 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA314751). Per sample, 30–40 million reads were obtained. The adapter sequence GATCGGAAGAGCACACGT was trimmed using Flexbar [3] and trimmed reads with 18 nucleotides or longer were aligned to version rn6 of the rat genome in conjunction with the Ensembl annotation version 81 using tophat2 [4]. The numbers of raw, trimmed, aligned, and unique aligned reads are shown in Fig. 1. The absolute and relative numbers of aligned and unique aligned reads are shown alongside the number of reads mapping to the rat 45S ribosomal RNA in Table 2.

Fig. 1
figure 1

Number of reads of the eight RNA sequencing samples described here. Shown are, from left to right: raw read numbers; reads after adapter trimming and removal of reads shorter than 18 nucleotides; reads mapping to the rat genome/transcriptome; unique reads mapping to the rat genome/transcriptome

Table 2 Alignment statistics

To quantify the de novo detection of splice sites, we applied the FindCirc pipeline [5], which identifies back-splicing events indicative of circular RNAs. Comparing the long RNA reads presented here with the smaller RNA fragments that were sequenced alongside the ribosome protected fragments [2], we found the following improvements in the average numbers of detected back-splice junctions per sample: for liver samples, 273 vs. 38, and for heart left ventricle, 679 vs. 61.

In summary, we provide here a high quality RNA sequencing dataset for two rat strains, with two different tissues from two different animals for each strain. This data can be used to analyze differences between tissues and strains, e.g. variant calling, de novo transcriptome assembly, differential expression and splicing, or circRNA detection. Since the samples originate from the same animals as the ones in the study about translational differences between a healthy rat (BN-Lx) and a model for hypertensive cardiomyopathy and metabolic syndrome (SHR) [2], the ribosome profiling data from that study can readily be integrated into the analysis.

Limitations

  • The data described here is limited to two organs, heart left ventricle and liver.

  • The sequencing depth does not allow for reliable quantification and characterization of very low abundant transcripts.

  • Genomic DNA was not sequenced, and has to be used from other animals of the same strains.