In-depth Temporal Transcriptome Profiling of Monkeypox and Host Cells using Nanopore Sequencing

Kakuk, Balázs; Dörmő, Ákos; Csabai, Zsolt; Kemenesi, Gábor; Holoubek, Jiří; Růžek, Daniel; Prazsák, István; Dani, Virág Éva; Dénes, Béla; Torma, Gábor; Jakab, Ferenc; Tóth, Gábor E.; Földes, Fanni V.; Zana, Brigitta; Lanszki, Zsófia; Harangozó, Ákos; Fülöp, Ádám; Gulyás, Gábor; Mizik, Máté; Kiss, András Attila; Tombácz, Dóra; Boldogkői, Zsolt

doi:10.1038/s41597-023-02149-4

In-depth Temporal Transcriptome Profiling of Monkeypox and Host Cells using Nanopore Sequencing

Data Descriptor
Open access
Published: 09 May 2023

Volume 10, article number 262, (2023)
Cite this article

Download PDF

You have full access to this open access article

Scientific Data

In-depth Temporal Transcriptome Profiling of Monkeypox and Host Cells using Nanopore Sequencing

Download PDF

Balázs Kakuk ORCID: orcid.org/0000-0002-4314-5707¹^na1,
Ákos Dörmő¹^na1,
Zsolt Csabai¹,
Gábor Kemenesi^2,3,
Jiří Holoubek^4,5,6,
Daniel Růžek ORCID: orcid.org/0000-0003-4655-2380^4,5,6,
István Prazsák¹,
Virág Éva Dani¹,
Béla Dénes ORCID: orcid.org/0000-0002-9889-529X⁷,
Gábor Torma¹,
Ferenc Jakab^2,3,
Gábor E. Tóth^2,3,
Fanni V. Földes^2,3,
Brigitta Zana^2,3,
Zsófia Lanszki^2,3,
Ákos Harangozó¹,
Ádám Fülöp¹,
Gábor Gulyás¹,
Máté Mizik¹,
András Attila Kiss¹,
Dóra Tombácz ORCID: orcid.org/0000-0001-5520-2978¹ &
…
Zsolt Boldogkői ORCID: orcid.org/0000-0003-1184-7293¹

2412 Accesses
4 Citations
9 Altmetric
Explore all metrics

Abstract

The recent human Monkeypox outbreak underlined the importance of studying basic biology of orthopoxviruses. However, the transcriptome of its causative agent has not been investigated before neither with short-, nor with long-read sequencing approaches. This Oxford Nanopore long-read RNA-Sequencing dataset fills this gap. It will enable the in-depth characterization of the transcriptomic architecture of the monkeypox virus, and may even make possible to annotate novel host transcripts. Moreover, our direct cDNA and native RNA sequencing reads will allow the estimation of gene expression changes of both the virus and the host cells during the infection. Overall, our study will lead to a deeper understanding of the alterations caused by the viral infection on a transcriptome level.

Time course profiling of host cell response to herpesvirus infection using nanopore and synthetic long-read transcriptome sequencing

Article Open access 09 July 2021

Direct RNA sequencing on nanopore arrays redefines the transcriptional complexity of a viral pathogen

Article Open access 14 February 2019

Analyzing viral epitranscriptomes using nanopore direct RNA sequencing

Article 24 August 2022

Background & Summary

Monkeypox virus (MPXV) belongs to the Poxviridae family, which contains many viruses that infect various animal taxa including invertebrates, reptiles, and mammals. MPXV is the member of the human pathogenic Orthopoxvirus genus, which also includes the cowpox virus, the vaccinia virus (VACV) and the highly dangerous variola virus, the causative agent of smallpox^1,2. Smallpox infections caused millions of deaths throughout the history until a global vaccination program has successfully eradicated the virus from the human population³. Infections of MPXV, have also been reported, although with lower mortality and milder morbidity³.

Monkeypox is a zoonotic pathogen, endemic to West and Central Africa and with the exception of some rare cases, human MPXV infections were localized only here during the last decades. However, due to a recent outbreak, a growing number of cases were reported from countries where the disease is not endemic^4,5. The genomic monitoring of the 2022 MPXV outbreak revealed that the circulating MPXV strain is related to the less pathogenic West African clade of MPXVs but forms a highly divergent novel clade with an elevated mutation rate^6,7,8. Consequently, the Public Health Emergency of International Concern (PHEIC) highlighted the epidemic potential of the virus outside its endemic region as well.

The orthopoxviruses are one of the largest of all animal viruses. Their virion is brick-shaped, membrane-coated and approximately 200–300 nm in diameter. Orthopoxviruses possess a large, linear double-stranded DNA genome, around 200 kbp in length⁹. In contrast to most other mammalian DNA viruses, which replicate in the nucleus (such as herpesviruses and adenoviruses), poxviruses remain in the cytoplasm. Viral DNA replication and the transcription of MPXV genes take place within compartments called “viral factories”, independently of the host cell¹⁰. This extraordinary feature draws attention to the means through MPXV regulates the gene expression of its host cell.

The transcriptional effect of MPXV infection on different cell types has been characterized before using micro-array-based techniques^11,12,13. Rubins and colleagues used a high-resolution poxvirus-human microarray covering 24 h of infection and classified all MPXV genes for the first time according to their temporal expression¹⁴. They also compared the expression profile of MPXV to VACV and found that only the minority of transcripts are species-specific¹⁴. And though recent studies have re-evaluated these data using comparative pathway analyses, the detailed transcriptomic characteristics of MPXV-infected cells remains undescribed¹⁵. Thus, while micro-array-based techniques reveal useful insights, they are unable to resolve many aspects of the transcriptome, including the detection of the plethora of different transcript isoforms, which have been detected in closely related viruses, for example in VACV¹⁶.

RNA-sequencing has become the most widely applied method in transcriptome research. Short-read sequencing (SRS) techniques generate sufficient depth of sequencing and have a high accuracy, but transcriptome annotations may remain incomplete because of the fragmented nature of the sequenced cDNAs^17,18,19. This is especially true in the case of viruses, which have gene-dense genomic regions where transcripts substantially overlap each other. Additionally, SRS has a severe limitation in distinguishing the different transcript isoforms²⁰. Long-read sequencing methods (LRS), including Pacific Biosciences and Oxford Nanopore Technologies (ONT) offer an alternative for transcriptome sequencing that enables the recovery of full-length RNA molecules, which is invaluable for a precise transcriptome annotation²¹. Although these methods generate fewer reads and have higher error rates, compared to SRS, with sufficient read-depth, the assembly of complete transcriptomes of well-annotated genomes, like that of MPXV becomes possible^22,23,24,25. Moreover, with the MinION platform it is possible to sequence native RNA molecules directly (dRNA-seq). This way the false products arising from either the reverse-transcription or PCR steps during the library preparation can be avoided. A drawback of dRNA sequencing technique however, is its inefficiency to precisely annotate the 5′ termini of mRNAs²⁶. However, this problem can be overcome via the combined usage of dRNA-seq and 5′-end sensitive PCR-free direct cDNA sequencing methods (dcDNA)^22,27,28,29. Furthermore, direct cDNA-seq can be used to accurately quantify gene expression, as it is not affected by biases introduced in the RT-PCR of traditional PCR-cDNA-sequencing³⁰.

As of now, only a few transcriptomes have been analyzed by next generation sequencing (NGS) methods. This includes the VACV, a model for orthopoxviruses and a close relative of MPXV^{31,32,33,34,35}. LRS methods have been used to redefine the highly intricate structure of VACV transcriptome³⁶, moreover the dynamic gene expression changes were analyzed in detail during the time course of the infection^16,37,38. However, to our best knowledge there is a lack of RNA sequencing datasets on the MPXV transcriptome. Hence, our goal in this work is to present an LRS dataset that will enable an accurate transcriptome annotation of MPXV.

In this study, the transcriptomes of the MPXV along with its host cell were sequenced using an Oxford Nanopore Technologies (ONT) MinION long-read sequencing device. Two sequencing approaches were utilized in this study: a dcDNA-seq of 6 different time-points (1-, 2-, 4-, 6-, 12- and 24-hours post infection) from the virus-infected cells, each with 3 biological replicates, and a dRNA-seq library from a mixture of the time-point samples.

This dataset can be used for the analysis of temporal transcriptomes of MPXV and the infected cells. Since even short-read transcriptomic data are completely missing of MPXV, our long-read RNA-seq dataset should serve as a gap-filler and will enable the in-depth characterization of its transcriptome. The transcriptomic landscape of human MPXV presented here will contribute to our better understanding of the virus and can ultimately aid the development of effective treatments in the future.

Methods

Figure 1 shows the detailed workflow of the study.

Cells

CV-1 (CCL-70, African green monkey, kidney) cell line was used which was obtained from American Type Culture Collection (ATCC). For the experiment 75 cm² tissue culture flasks (CELLSTAR®; Greiner Bio-One GmbH, Frickenhausen, Germany) were plated with 2 × 10⁵ cells in Minimum Essential Medium Eagle culture medium (MEM) with 10% fetal bovine serum (FBS). The CV-1 cells were cultivated until ~80% (~1.2 × 10⁶) confluency at 37 °C in humified 5% CO₂ atmosphere. Before the infection, the monolayer was washed with 1 X PBS (Thermo Fisher Scientific, Waltham, MA, USA).

Collection, detection, isolation and propagation of the virus

The MPXV (MPXV_NRL 4279/2022) was isolated from skin lesions and kindly provided by Dr. Jirincova (The National Institute of Public Health, Prague, Czech Republic). All procedures with infectious materials were performed under BSL-4 conditions at the National Laboratory of Virology, University of Pécs. The virus was passaged once on CV-1 cells to reach a sufficient amount of infective particles. The same batch of working stock was used during the experiment. The viral titer of the working stock was determined with plaque assay on CV-1 cells. Non-infected control cultures were inoculated with MEM and treated the same way as the infected ones. For the infection, 2 ml MPXV with 5 plaque-forming units (pfu)/cell (MOI = 5) was used, which was diluted with MEM to reach the sufficient concentration. Cells were incubated with monkeypox inoculum at 37 °C for 1 hour while were shaken gently in every ten minutes. The virus inoculum was removed, then the cell monolayer was washed once with 1 x PBS. For the flasks 10 mL MEM medium was added which was supplemented with 2% FBS, 2 mM L-glutamine and 1% penicillin and streptomycin solution. The cells were incubated at 37 °C for 1, 2, 4, 6, 12 and 24 hours in a humidified 5% CO₂ atmosphere. Each time, the experiment was done in triplicate and subjected to direct cDNA sequencing. Prior to direct RNA sequencing extra flask was used to sample the following time points: 2-, 6-, 12- and 24-hours post-infection. Direct RNA sequencing was carried out without replicates. After the incubation, the supernatant was removed, and the cells were washed with PBS. The dry flasks were stored at −80 °C until further processes. The cells were washed and scraped down into lysis buffer and transferred to 1,5 mL Eppendorf Tubes® (Thermo Fisher Scientific, Inc.).

Isolation of total RNA

Total RNA was purified from the MPXV-infected and from mock-infected CV-1 cells at various time points after infection from 1 to 24 hours. For this, the NucleoSpin RNA Kit (Macherey-Nagel) was used, following the manufacturer’s recommendations. Briefly, cells were collected by centrifugation (1000 × g), then 350 µl RA1 lysis buffer (part of the NucleoSpin RNA Kit) and 3.5 µl β-Mercapthoethanol (Sigma Aldrich) were added to the samples and then, mixtures were centrifuged at 11,000 × g for 1 min in NucleoSpin Filter tubes. Filters were discarded, and the lysate was washed using 70% EtOH (350 µl) on NucleoSpin RNA Column with centrifugation at 11,000 × g for 30 sec. Membrane Desalting Buffer (350 µl, from the NucleoSpin RNA Kit) was then added to desalt the membrane, which was finally dried with centrifugation (11,000 × g). Residual DNA was removed using rDNase enzyme [rDNase:rDNase reaction buffer (1:9 ratio, NucleoSpin Kit)]. The enzymatic reaction was carried out at room temperature (RT) for 15 min. The NucleoSpin Kit’s RAW2 Buffer (200 µl) was used on the NucleoSpin Filter, which inactivated the enzyme. After a short centrifugation (11,000 × g, 30 min) the Filter was placed in a new Eppendorf tube. The next washing step was carried out with RAW3 Buffer (600 µl, from the NucleoSpin RNA Kit) and centrifugation (11,000 × g, 30 min). This step was repeated with 250 µl RAW3 Buffer. The purified total RNA samples were eluted from the Filter in 60 µl nuclease-free water (NucleoSpin RNA Kit) and they were stored at −80 °C (Table 1).

Table 1 RNA quantities. Obtained yield of total RNAs and polyA-selected RNAs (ng/µl).

Full size table

Poly(A) selection

Polyadenylated RNA was enriched using the Lexogen’s Poly(A) RNA Selection Kit V1.5. This method is based on oligo(dT) beads, which hybridize RNAs with polyadenylated 3′ ends, but RNAs without poly(A) stretches (e.g. rRNAs) do not captured by the beads and therefore, they will be washed out. The applied protocol is as follows: the beads (from of the Lexogen Kit) were resuspended and 4 µl for each RNA samples was used. Beads were collected in a magnet, and the supernatant was discarded. RNAs were resuspended in Bead Wash Buffer (75 μl, Lexogen Kit) and then were placed on the magnet, and supernatant was discarded. This washing step was repeated. Beads were resuspended in RNA Hybridization Buffer (20 μl, Lexogen Kit). Ten μg from the total RNA samples were diluted to 20 µl in nuclease-free water (UltraPure™, Invitrogen) and then they were denatured at 60 °C for 1 min. Denatured RNA samples were mixed with 20 µl beads. The mixtures were incubated in a shaker incubator with 1250 rpm agitation at 25 °C for 20 min. Next, the samples were placed in a magnetic rack. Supernatant was discarded, the tubes were removed from the magnet, the collected samples were resuspended in 100 µl Bead Wash Buffer (Lexogen Kit), and finally, they were incubated for 5 min at 25 °C with 1250 rpm agitation. Supernatant was discarded and this washing step was repeated once. Beads were resuspended in 12 µl nuclease-free water, then kept at 70 °C for 1 min. After this incubation step, tubes were placed on a magnetic rack and supernatant, containing the polyadenylated fraction of RNA samples were placed to new DNA LoBind (Eppendorf) tubes (Table 1). Samples were stored at −80 °C.

Direct cDNA sequencing

Direct (d)cDNA libraries were generated with the aim of analyzing the dynamic pattern of MPXV transcripts and the effect of viral infection on the host cell gene expression profile. RNA samples from different time points (1, 2, 4, 6, 12 and 24 h p.i., and from the mock, three biological replicates from each) were used individually for library preparation. The ONT’s Direct cDNA Sequencing Kit (SQK-DCS109, ONT) was applied according to the manufacturer’s recommendations. Briefly, first-strand cDNAs were synthesized from the polyA(+) RNA samples using the Maxima H Minus Reverse Transcriptase enzyme (Thermo Fisher Scientific) and the SSP and VN primers (supplied in the ONT kit). The potential RNA contamination was eliminated by applying RNase Cocktail Enzyme Mix (Thermo Fisher Scientific).

The second cDNA strands were generated with LongAmp Taq Master Mix (New England Biolabs). The ends of the double-stranded cDNAs were repaired with NEBNext End repair/dA-tailing Module (New England Biolabs) and then the adapters were ligated using the NEB Blunt/TA Ligase Master Mix (New England Biolabs). The Native Barcoding (12) Kit (ONT) was used for multiplex sequencing. The samples (200 fmol/flow cell) were loaded onto MinION R9.4 SpotON Flow Cells (ONT, Table 2).

Table 2 Sequencing barcodes and amount of libraries (in µl) used for sequencing.

Full size table

Direct RNA sequencing

Direct RNA sequencing (SQK-RNA002; Version: DRS_9080_v2_revO_14Aug2019, Last update: 10/06/2021) was used to sequence the native RNA strands to avoid any potential bias from reverse transcription or PCR. Fifty ng (in 9 μl) from a mixture of polyA(+) RNAs from various time points (2, 6, 12 and 24 h p.i.) was used for library preparation. As a first step, 1 μl RT Adapter (110 nM; ONT Kit) was ligated to the RNA sample using 3 μl NEBNext Quick Ligation Reaction Buffer (New England BioLabs), 0.5 μl RNA CS (ONT Kit), and 1.5 μl T4 DNA Ligase (2 M U/ml New England BioLabs) at RT for 10 min. The first cDNA strand was generated using SuperScript III Reverse Transcriptase (Life Technologies), as recommended by the Direct RNA sequencing (DRS) manual (ONT). The reaction was carried out at 50 °C for 50 min and it was followed by the inactivation step at 70 °C for 10 min. Next, the sequencing adapters (ONT’s DRS kit) were ligated to the cDNA at RT for 10 min using the T4 DNA ligase enzyme and NEBNext Quick Ligation Reaction Buffer. The dRNA library was sequenced on an R9.4 SpotON Flow Cell.

RNAClean XP beads and AMPure XP beads (both from Beckman Coulter) were used after each of the enzymatic reactions for washing the dRNA-seq and dcDNA-seq libraries, respectively.

Bioinformatics

The generated sequencing reads were basecalled with the Guppy software (available at ONT’s community site https://community.nanoporetech.com/), with the following parameters:–flowcell FLO-MIN106–kit SQK-DCS109–barcode_kits EXP-NBD114–min_qscore 8–recursive–calib_detect. Based on a quality threshold of 8, the basecalled reads were separated into a ‘pass’ and a ‘fail’ group – the subsequent analyses were carried out on the passed reads. The .fastq files containing the passed reads for the respective samples were merged.

The resulting sequences were then mapped to a combined reference, containing the host genome (GenBank assembly accession: GCF_015252025.1³⁹) and the viral genome (GenBank assembly accession: GCA_023516015.3⁴⁰, GenBank nucleotide accession: (ON563414.3⁴¹), using minimap2⁴². The reference genomes were downloaded from NCBI GenBank. The mapping parameters were the following: minimap2 -ax splice -Y -C5–cs–MD -un -G 10000. The generated .bam files were uploaded to the European Bioinformatics Institute’s European Nucleotide Archive (EBI ENA) under the following BioProject ID: PRJEB56841⁴³ and to the Sequence Read Archive (SRA) under accession ERP141806⁴⁴. Supplementary Table S1 contains the ENA accession IDs and read files uploaded to ENA.

The subsequent analyses were carried out within the R environment – all scripts are available in our GitHub repository https://github.com/Balays/MPOX_ONT_RNASeq⁴⁵. The workflow implements functions from the tidyverse⁴⁶ collection of R packages. The complete workflow can be re-run to produce all the analysis results, including generation of figures and tables. The first step in the MPOX-wf is to import the .bam files into the R workspace using Rsamtools⁴⁷. Raw alignment counts were calculated using idxstats. Then reads with secondary alignments were filtered out, as these are putatively chimeric RNAs. Viral and host read counts, according to the mapping results (Fig. 2) and read lengths (Fig. 3 and Supplementary Figure S1) were visualized with the ggplot2 package⁴⁸. Next, per-base coverage values and their statistics across the whole genome and also in 100 nt windows were calculated. Supplementary Table S2 contains the mean, median and standard deviation of the coverage of each time-point across the whole genome, while Supplementary Table S3 contains the more detailed (per-window) coverage statistics. The coverages were used for generating Supplementary Figure S2 and Supplementary Figure S3. The gene arrows for the genome annotation were generated using gggenes (https://github.com/wilkox/gggenes). The mean coverage on monkeypox genome in the dRNA sample and in the dcDNA samples (after log10 normalization) was visualized using the circlize package⁴⁹ (Fig. 4 and Fig. 5, respectively). The links in the center of the circle represent transcripts, as in the connections of the 5′- and 3′-ends of the reads. These putative transcripts were filtered to a read count threshold of 10. The transparency of the links is correlated with the abundance of the transcripts.

Data Records

Data (bam files containing the alignment and the sequence and its quality information as well) were uploaded to the EBI’s European Nucleotide Archive (ENA), under the following BioProject: PRJEB56841⁴³ and the files are located at NCBI SRA under accession ERP141806⁴⁴. Metadata of the uploaded files are available in the Supplementary Table S1. All data can be used without restrictions. In the case of dcDNA samples, from each time point, three biological replicates were generated; these were named according to the following scheme: 1h_A, 1h_B, 1h_C, 2h_A, …; where the ‘h’ stand for hours past infection (hpi).

The 21 dcDNA sequencing yielded a substantial amount of 15,062,290 reads that passed guppy’s QC filtering threshold of 8 (Table 3) and could be mapped onto either the host or to the viral reference genome (Fig. 2, left panel). The distribution of the lengths of these reads are shown in Fig. 3 and of the viral reads in Supplementary Figure S1. The mean of the read lengths did not change significantly, most of the reads were in the 800–1000 nt bin.

Table 3 Sequencing summary.

Full size table

The ratio of viral reads showed a steady increase from around 1.52% ± 0.036% in the 1 hpi samples to 37.70% ± 1.45% in the 24 hpi samples (Fig. 2, right panel). The median coverage across the whole viral genome also increased: from 11 to 571 (Fig. 5). The total read count peaked at 4- and 6-hours post-infection and decreased afterwards. We observed a remarkable cytopathic effect after 12 hours, which reached a significant level on the cell monolayer and disrupted the coherence of cells. Most cells were perished at or after this time point. This is supported by the significant decrease in the host read counts and the increase in the viral read ratio.

The dRNA sequencing yielded 576,622 host and 318,802 reads of viral origin, corresponding to a 35.6% of viral read ratio and a mean coverage of 244 across the viral genome (Fig. 4). The two sequencing libraries compromise a total of 1,793,855 and 13,408,375 good quality viral and host reads, respectively.

Technical Validation

RNA

Qubit RNA BR and HS Assay Kits (Invitrogen) were used to measure the amount of total RNA and polyA-selected RNA samples, respectively. The final concentrations of the RNA samples were determined by Qubit 4.0.

cDNA

The amount of the cDNA samples and the ready cDNA libraries were measured using Qubit 4.0 fluorometer and Qubit dsDNA HS Assay Kit (Invitrogen). The quality of RNA was detected with the Agilent 4150 TapeStation System. RNA samples with RIN values ≥ 9.0 were used for sequencing (Fig. 6).

Three biological replicates were used for each of the infection time points. To analyze the effect of MPXV infection on the transcriptome profile of the host cells, mock-infected CV-1 cells were also harvested and sequenced.

Usage Notes

Our dataset can be used to annotate novel viral transcripts and transcript isoforms, but possibly from the host as well. There are several bioinformatic tools that can be used to achieve this, including: TALON⁵⁰; LIQA⁵¹; LoRTIA (https://github.com/zsolt-balazs/LoRTIA); EPI2ME’s transcriptomes workflow (https://github.com/epi2me-labs/wf-transcriptomes) or SQUANTI3 (https://github.com/ConesaLab/SQANTI3⁵²). Transcript annotation can be carried out from both types of sequencing data (dcDNA and dRNA), however as dRNA-seq yields less artificial or false products, it is suggested to use these reads for validating the dcDNA-seq derived transcripts³⁰. Although it is possible that some rare transcripts that are expressed in a subset of the time-points exclusively (e.g., some immediate early isoforms) could not be captured in the dRNA sequencing library. After identification, the novel transcripts should be annotated to ORFs, their coding capacity be estimated, their TSS and TES sites be analyzed and accordingly their isoform categories be assessed (long or short TSS, alternative termination, etc.).

The gene-wise and/or transcript-wise gene counts from the cDNA-seq data can be subjected to differential gene expression (DGE) or differential transcript expression (DTE), respectively. Furthermore, differential transcript usage analyses (DTU) can be carried out as well, for example with RATS⁵³. The https://github.com/nanoporetech/pipeline-transcriptome-de pipeline, based loosely on the workflow presented in⁵⁴, carries out these analyses from the annotated transcriptome, while EPI2ME’s transcriptomes workflow (https://github.com/epi2me-labs/wf-transcriptomes) carries out the transcript annotation and the above analyses in succession. The DGE, DTE and DTU analyses can be carried out both on the viral and on the host data and they can be based upon several comparisons, for example mock vs each time-point. In addition, the longitudinal expression data from cDNA-seq can be subjected to a time-series analysis as well⁵⁵.

Besides focusing on individual genes or transcripts, gene-set enrichment analysis (GSEA) or pathway enrichment analyses can also be carried out to identify biological pathways that are affected by the viral infection in the host cells, for example with pathfindR⁵⁶.

A combined workflow would be: 1.) detect transcripts using both sequencing approaches, but 2.) use the dRNA reads for validation, 3.) annotate them and carry out the transcript isoform analyses, 4.) quantify these validated transcripts in the cDNA data to estimate transcript counts, and finally 4.) carry out the above mentioned DGE, DTE, DTU and biological pathway analyses. Taken together, the almost 1.5 million viral and almost 13 million host reads enable the in-depth and temporal characterization of the Monkeypox transcriptome and the effect of the viral infection on the host gene expression.

Code availability

The complete workflow, from mapping to the generation of figures is available at the GitHub repository (https://github.com/Balays/MPOX_ONT_RNASeq).

References

Diven, D. G. An overview of poxviruses. J. Am. Acad. Dermatol. 44, 1–16 (2001).
Article CAS PubMed Google Scholar
Moss, B. & Smith, G. L. Poxviridae: The Viruses and Their Replication. in Field’s Virology 573–613 (2021).
Elwood, J. M. Smallpox and its eradication. J. Epidemiol. Community Heal. 43, 92–92 (1989).
Article Google Scholar
Adler, H. et al. Clinical features and management of human monkeypox: a retrospective observational study in the UK. Lancet Infect. Dis. 22, 1153–1162 (2022).
Article PubMed PubMed Central Google Scholar
Noe, S. et al. Clinical and virological features of first human monkeypox cases in Germany. Infection, https://doi.org/10.1007/s15010-022-01874-z (2022).
Luna, N. et al. Phylogenomic analysis of the monkeypox virus (MPXV) 2022 outbreak: Emergence of a novel viral lineage? Travel Med. Infect. Dis. 49, 102402 (2022).
Article CAS PubMed PubMed Central Google Scholar
Isidro, J. et al. Phylogenomic characterization and signs of microevolution in the 2022 multi-country outbreak of monkeypox virus. Nat. Med. 28, 1569–1572 (2022).
Article CAS PubMed PubMed Central Google Scholar
Kumar, N., Acharya, A., Gendelman, H. E. & Byrareddy, S. N. The 2022 outbreak and the pathobiology of the monkeypox virus. J. Autoimmun. 131, 102855 (2022).
Article PubMed PubMed Central Google Scholar
Hendrickson, R. C., Wang, C., Hatcher, E. L. & Lefkowitz, E. J. Orthopoxvirus genome evolution: The role of gene loss. Viruses 2, 1933–1967 (2010).
Article CAS PubMed PubMed Central Google Scholar
Walsh, D. Poxviruses: Slipping and sliding through transcription and translation. PLoS Pathogens vol. 13 at https://doi.org/10.1371/journal.ppat.1006634 (2017).
Alkhalil, A. et al. Gene expression profiling of monkeypox virus-infected cells reveals novel interfaces for host-virus interactions. Virol. J. 7, 1–19 (2010).
Article Google Scholar
Rubins, K. H., Hensley, L. E., Relman, D. A. & Brown, P. O. Stunned silence: Gene expression programs in human cells infected with monkeypox or vaccinia virus. PLoS One 6 (2011).
Bourquain, D., Dabrowski, P. W. & Nitsche, A. Comparison of host cell gene expression in cowpox, monkeypox or vaccinia virus-infected cells reveals virus-specific regulation of immune response genes. Virol. J. 10 (2013).
Rubins, K. H. et al. Comparative analysis of viral gene expression programs during poxvirus infection: A transcriptional map of the vaccinia and monkeypox genomes. PLoS One 3, 1–12 (2008).
Article Google Scholar
Xuan, D. T. M. et al. Comparison of Transcriptomic Signatures between Monkeypox-Infected Monkey and Human Cell Lines. J. Immunol. Res. 2022 (2022).
Tombácz, D. et al. Time-course transcriptome profiling of a poxvirus using long-read full-length assay. Pathogens 10, 1–17 (2021).
Article Google Scholar
Nagalakshmi, U., Waern, K. & Snyder, M. RNA-seq: A method for comprehensive transcriptome analysis. Current Protocols in Molecular Biology at https://doi.org/10.1002/0471142727.mb0411s89 (2010).
Mutz, K.-O., Heilkenbrinker, A., Lönne, M., Walter, J.-G. & Stahl, F. Transcriptome analysis using next-generation sequencing. Curr. Opin. Biotechnol. 24, 22–30 (2013).
Article CAS PubMed Google Scholar
Anamika, K., Verma, S., Jere, A. & Desai, A. Transcriptomic Profiling Using Next Generation Sequencing - Advances, Advantages, and Challenges. in Next Generation Sequencing: Advances, Applications and Challenges (ed. Kulski, J. K.). https://doi.org/10.5772/61789 (IntechOpen, 2016).
Patterson, J. et al. Impact of sequencing depth and technology on de novo RNA-Seq assembly. BMC Genomics 20, 604 (2019).
Article PubMed PubMed Central Google Scholar
Grünberger, F., Ferreira-Cerca, S. & Grohmann, D. Nanopore sequencing of RNA and cDNA molecules in Escherichia coli. RNA 28, 400–417 (2022).
Article PubMed PubMed Central Google Scholar
Torma, G. et al. Combined Short and Long-Read Sequencing Reveals a Complex Transcriptomic Architecture of African Swine Fever Virus. Viruses 13, 579 (2021).
Article CAS PubMed PubMed Central Google Scholar
Torma, G. et al. Dual isoform sequencing reveals complex transcriptomic and epitranscriptomic landscapes of a prototype baculovirus. Sci. Rep. 12, 1291 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Shchelkunov, S. N. et al. Analysis of the Monkeypox Virus Genome. Virology 297, 172–194 (2002).
Article CAS PubMed Google Scholar
Prazsák, I. et al. Long-read sequencing uncovers a complex transcriptome topology in varicella zoster virus. BMC Genomics 19, 1–20 (2018).
Article Google Scholar
Soneson, C. et al. A comprehensive examination of Nanopore native RNA sequencing for characterization of complex transcriptomes. Nat. Commun. 10, 1–14 (2019).
Article CAS Google Scholar
Depledge, D. P. et al. Direct RNA sequencing on nanopore arrays redefines the transcriptional complexity of a viral pathogen. Nat. Commun. 10, 1–13 (2019).
Article Google Scholar
Olasz, F. et al. Short and Long-Read Sequencing Survey of the Dynamic Transcriptomes of African Swine Fever Virus and the Host Cells. Front. Genet. 11 (2020).
Fülöp, Á. et al. Integrative profiling of Epstein–Barr virus transcriptome using a multiplatform approach. Virol. J. 19, 7 (2022).
Article PubMed PubMed Central Google Scholar
Tombácz, D. et al. In-Depth Temporal Transcriptome Profiling of an Alphaherpesvirus Using Nanopore Sequencing. Viruses 14, 1289 (2022).
Article PubMed PubMed Central Google Scholar
Yang, Z., Bruno, D. P., Martens, C. A., Porcella, S. F. & Moss, B. Simultaneous high-resolution analysis of vaccinia virus and host cell transcriptomes by deep RNA sequencing. Proc. Natl. Acad. Sci. USA https://doi.org/10.1073/pnas.1006594107 (2010).
Article PubMed PubMed Central Google Scholar
Yang, Z., Bruno, D. P., Martens, C. A., Porcella, S. F. & Moss, B. Genome-Wide Analysis of the 5′ and 3′ Ends of Vaccinia Virus Early mRNAs Delineates Regulatory Sequences of Annotated and Anomalous Transcripts. J. Virol. https://doi.org/10.1128/jvi.00428-11 (2011).
Article PubMed PubMed Central Google Scholar
Yang, Z. et al. Expression Profiling of the Intermediate and Late Stages of Poxvirus Replication. J. Virol. https://doi.org/10.1128/jvi.05446-11 (2011).
Article PubMed PubMed Central Google Scholar
Yang, Z., Martens, C. A., Bruno, D. P., Porcella, S. F. & Moss, B. Pervasive initiation and 3′-end formation of poxvirus postreplicative RNAs. J. Biol. Chem. https://doi.org/10.1074/jbc.M112.390054 (2012).
Article PubMed PubMed Central Google Scholar
Yang, Z., Maruri-Avidal, L., Sisler, J., Stuart, C. A. & Moss, B. Cascade regulation of vaccinia virus gene expression is modulated by multistage promoters. Virology https://doi.org/10.1016/j.virol.2013.09.007 (2013).
Article PubMed Google Scholar
Tombácz, D. et al. Long-read assays shed new light on the transcriptome complexity of a viral pathogen. Sci. Rep. 10, 1–13 (2020).
Article Google Scholar
Tombácz, D. et al. Dynamic transcriptome profiling dataset of vaccinia virus obtained from long-read sequencing techniques. Gigascience 7 (2018).
Maróti, Z. et al. Time-course transcriptome analysis of host cell response to poxvirus infection using a dual long-read sequencing approach. BMC Res. Notes 14, 239 (2021).
Article ADS PubMed PubMed Central Google Scholar
Sène, M. A. et al. Haplotype-resolved de novo assembly of the Vero cell line genome. npj Vaccines https://doi.org/10.1038/s41541-021-00358-9 (2021).
Article PubMed PubMed Central Google Scholar
NCBI Genbank https://www.ncbi.nlm.nih.gov/assembly/GCA_023516015.3 (2022).
NCBI Genbank https://www.ncbi.nlm.nih.gov/nuccore/ON563414 (2022).
Li, H. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics https://doi.org/10.1093/bioinformatics/bty191 (2018).
Article PubMed PubMed Central Google Scholar
ENA European Nucleotide Archive https://identifiers.org/ena.embl:PRJEB56841 (2022).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:ERP141806 (2022).
Kakuk, B. GitHub. https://github.com/Balays/MPOX_ONT_RNASeq (2022).
Wickham, H. et al. Welcome to the Tidyverse. J. Open Source Softw. 4, 1686 (2019).
Article ADS Google Scholar
Morgan M, Pagès H, Obenchain V, H. N. Rsamtools: Binary alignment (BAM), FASTA, variant call (BCF), and tabix file import (2022).
Wickham, H. ggplot2., https://doi.org/10.1007/978-0-387-98141-3 (Springer New York, 2009).
Gu, Z., Gu, L., Eils, R., Schlesner, M. & Brors, B. circlize implements and enhances circular visualization in R. Bioinformatics 30, 2811–2812 (2014).
Article CAS PubMed Google Scholar
Wyman, D. et al. A technology-agnostic long-read analysis pipeline for transcriptome discovery and quantification. bioRxiv 672931, https://doi.org/10.1101/672931 (2020).
Hu, Y. et al. LIQA: long-read isoform quantification and analysis. Genome Biol. https://doi.org/10.1186/s13059-021-02399-8 (2021).
Article PubMed PubMed Central Google Scholar
Tardaguila, M. et al. SQANTI: Extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification. Genome Res. https://doi.org/10.1101/gr.222976.117 (2018).
Article PubMed PubMed Central Google Scholar
Froussios, K., Mourão, K., Simpson, G., Barton, G. & Schurch, N. Relative abundance of transcripts (RATs): Identifying differential isoform abundance from RNA-seq [version 1; referees: 1 approved, 2 approved with reservations]. F1000Research 8, 1–21 (2019).
Article Google Scholar
Love, M. I., Soneson, C. & Patro, R. Swimming downstream: Statistical analysis of differential transcript usage following Salmon quantification. F1000Research https://doi.org/10.12688/f1000research.15398.3 (2018).
Article PubMed PubMed Central Google Scholar
Varoquaux, N. & Purdom, E. A pipeline to analyse time-course gene expression data. F1000Research 9, 1447 (2020).
Article Google Scholar
Ulgen, E., Ozisik, O. & Sezerman, O. U. PathfindR: An R package for comprehensive identification of enriched pathways in omics data through active subnetworks. Front. Genet. https://doi.org/10.3389/fgene.2019.00858 (2019).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This research was supported by National Research, Development and Innovation Office (NRDIO), Researcher-initiated research projects (Grant numbers: K 128247 and K 142674) to ZB and by the NRDIO Research projects initiated by young researchers (Grant number: FK 128252) to DT. The work was also supported by National Laboratory of Virology (RRF-2.3.1-21-2022-00010) to GK and FJ. IP was supported by the New National Excellence Program of the Ministry for Innovation and Technology (ÚNKP-22-4-SZTE-310). ÁH was supported by Hungarian Ministry of Innovation and Technology, National Academy of Scientist Education, (FEIF/646–4/2021-ITM_SZERZ). The APC fee was covered by the University of Szeged, Open Access Fund: 5954.

Funding

Open access funding provided by University of Szeged.

Author information

These authors contributed equally: Balázs Kakuk, Ákos Dörmő.

Authors and Affiliations

Department of Medical Biology, Albert Szent-Györgyi Medical School, University of Szeged, Somogyi u. 4., 6720, Szeged, Hungary
Balázs Kakuk, Ákos Dörmő, Zsolt Csabai, István Prazsák, Virág Éva Dani, Gábor Torma, Ákos Harangozó, Ádám Fülöp, Gábor Gulyás, Máté Mizik, András Attila Kiss, Dóra Tombácz & Zsolt Boldogkői
National Laboratory of Virology, Szentágothai Research Centre, University of Pécs, Pécs, Hungary
Gábor Kemenesi, Ferenc Jakab, Gábor E. Tóth, Fanni V. Földes, Brigitta Zana & Zsófia Lanszki
Institute of Biology, Faculty of Sciences, University of Pécs, Pécs, Hungary
Gábor Kemenesi, Ferenc Jakab, Gábor E. Tóth, Fanni V. Földes, Brigitta Zana & Zsófia Lanszki
Veterinary Research Institute, Hudcova 70, CZ-62100, Brno, Czech Republic
Jiří Holoubek & Daniel Růžek
Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, Branisovska 31, CZ-37005, Ceske Budejovice, Czech Republic
Jiří Holoubek & Daniel Růžek
Department of Experimental Biology, Faculty of Science, Masaryk University, Kamenice, 753/5, Brno, CZ-62500, Czech Republic
Jiří Holoubek & Daniel Růžek
Department of Microbiology and Infectious Diseases, University of Veterinary Medicine Budapest, 1143, Budapest, Hungária krt. 23-25, Hungary
Béla Dénes

Authors

Balázs Kakuk
View author publications
You can also search for this author in PubMed Google Scholar
Ákos Dörmő
View author publications
You can also search for this author in PubMed Google Scholar
Zsolt Csabai
View author publications
You can also search for this author in PubMed Google Scholar
Gábor Kemenesi
View author publications
You can also search for this author in PubMed Google Scholar
Jiří Holoubek
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Růžek
View author publications
You can also search for this author in PubMed Google Scholar
István Prazsák
View author publications
You can also search for this author in PubMed Google Scholar
Virág Éva Dani
View author publications
You can also search for this author in PubMed Google Scholar
Béla Dénes
View author publications
You can also search for this author in PubMed Google Scholar
Gábor Torma
View author publications
You can also search for this author in PubMed Google Scholar
Ferenc Jakab
View author publications
You can also search for this author in PubMed Google Scholar
Gábor E. Tóth
View author publications
You can also search for this author in PubMed Google Scholar
Fanni V. Földes
View author publications
You can also search for this author in PubMed Google Scholar
Brigitta Zana
View author publications
You can also search for this author in PubMed Google Scholar
Zsófia Lanszki
View author publications
You can also search for this author in PubMed Google Scholar
Ákos Harangozó
View author publications
You can also search for this author in PubMed Google Scholar
Ádám Fülöp
View author publications
You can also search for this author in PubMed Google Scholar
Gábor Gulyás
View author publications
You can also search for this author in PubMed Google Scholar
Máté Mizik
View author publications
You can also search for this author in PubMed Google Scholar
András Attila Kiss
View author publications
You can also search for this author in PubMed Google Scholar
Dóra Tombácz
View author publications
You can also search for this author in PubMed Google Scholar
Zsolt Boldogkői
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.K.: carried out bioinformatics, analysis and interpretation of the data, wrote the manuscript. Á,D,: took part in RNA isolation, carried out PolyA-selection and direct cDNA sequencing. Z.C.: isolated the total RNA and participated in dRNA sequencing. G.K.: carried out viral infection. J.H.: propagated the virus. D.R.: propagated the virus. I.P.: participated in data analysis and wrote the manuscript. V.É.D.: participated and interpretation of data. B.D.: propagated and maintained the CV-1 cell line. G.T.: participated in interpretation of data. F.J.: supervision, participated in viral infection. G.E.T.: propagated the cells and the virus, participated in viral infection. F.V.F.: propagated the cells and the virus, participated in viral infection. B.Z.: propagated the cells and the virus, participated in viral infection. Z.L.: propagated the cells and the virus, participated in viral infection. Á.H.: participated in bioinformatics analysis. Á.F.: participated in bioinformatics analysis. G.G.: participated in bioinformatics analysis. M.M.: participated in cell culture experiments. A.A.K.: participated in bioinformatics analysis. D.T.: participated in the design of the experiments, in data analysis and wrote the manuscript. Z.B.: conceived and designed the experiments, supervised the project and wrote the manuscript. All authors read and approved the final paper.

Corresponding author

Correspondence to Zsolt Boldogkői.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Figures

Supplementary Table 1

Supplementary Table 2

Supplementary Table 3

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kakuk, B., Dörmő, Á., Csabai, Z. et al. In-depth Temporal Transcriptome Profiling of Monkeypox and Host Cells using Nanopore Sequencing. Sci Data 10, 262 (2023). https://doi.org/10.1038/s41597-023-02149-4

Download citation

Received: 19 December 2022
Accepted: 12 April 2023
Published: 09 May 2023
DOI: https://doi.org/10.1038/s41597-023-02149-4
Springer Nature Limited

This article is cited by

Monkeypox virus genomic accordion strategies
- Sara Monzón
- Sarai Varona
- Gustavo Palacios
Nature Communications (2024)

In-depth Temporal Transcriptome Profiling of Monkeypox and Host Cells using Nanopore Sequencing

Abstract

Similar content being viewed by others

Background & Summary

Methods

Cells

Collection, detection, isolation and propagation of the virus

Isolation of total RNA

Poly(A) selection

Direct cDNA sequencing

Direct RNA sequencing

Bioinformatics

Data Records

Technical Validation

RNA

cDNA

Usage Notes

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation