Genomic sequencing of SARS-CoV-2 in Rwanda reveals the importance of incoming travelers on lineage diversity

Butera, Yvan; Mukantwari, Enatha; Artesi, Maria; Umuringa, Jeanne d’arc; O’Toole, Áine Niamh; Hill, Verity; Rooke, Stefan; Hong, Samuel Leandro; Dellicour, Simon; Majyambere, Onesphore; Bontems, Sebastien; Boujemla, Bouchra; Quick, Josh; Resende, Paola Cristina; Loman, Nick; Umumararungu, Esperance; Kabanda, Alice; Murindahabi, Marylin Milumbu; Tuyisenge, Patrick; Gashegu, Misbah; Rwabihama, Jean Paul; Sindayiheba, Reuben; Gikic, Djordje; Souopgui, Jacob; Ndifon, Wilfred; Rutayisire, Robert; Gatare, Swaibu; Mpunga, Tharcisse; Ngamije, Daniel; Bours, Vincent; Rambaut, Andrew; Nsanzimana, Sabin; Baele, Guy; Durkin, Keith; Mutesa, Leon; Rujeni, Nadine

doi:10.1038/s41467-021-25985-7

Article
Open access
Published: 29 September 2021

Volume 12, article number 5705, (2021)

Abstract

COVID-19 transmission rates are often linked to locally circulating strains of SARS-CoV-2. Here we describe 203 SARS-CoV-2 whole genome sequences analyzed from strains circulating in Rwanda from May 2020 to February 2021. In particular, we report a shift in variant distribution towards the emerging sub-lineage A.23.1 that is currently dominating. Furthermore, we report the detection of the first Rwandan cases of the B.1.1.7 and B.1.351 variants of concern among incoming travelers tested at Kigali International Airport. To assess the importance of viral introductions from neighboring countries and local transmission, we exploit available individual travel history metadata to inform spatio-temporal phylogeographic inference, enabling us to take into account infections from unsampled locations. We uncover an important role of neighboring countries in seeding introductions into Rwanda, including those from which no genomic sequences were available. Our results highlight the importance of systematic genomic surveillance and regional collaborations for a durable response towards combating COVID-19.

Introduction

Coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), continues to impose a heavy death toll globally and represents a major global health challenge. SARS-CoV-2 is a single-stranded positive-sense ribonucleic acid (RNA) virus that typically undergoes one to two single nucleotide mutations per month. Real-time whole-genome sequencing provides invaluable insights into the pandemic’s transmission dynamics and enables effective surveillance. Moreover, genomic data provide useful information required for the ongoing development of vaccines, therapeutics, and diagnostic tools. Analysis of SARS-CoV-2 mutations is particularly crucial when these affect epitopes involved in the induction of host immune responses, as they may lead to immune evasion, with potential implications for vaccine (and immunotherapy) efficacy.

The global SARS-CoV-2 lineage nomenclature has already been proposed with A and B as the initial epidemiological lineages representing the two original haplotypes in Wuhan¹, followed by a number of sub-lineages. As described by Rambaut et al.¹, Pango lineages are monophyletic clusters of SARS-CoV-2 that are linked to an epidemiological event. Such an event can be an introduction into a distinct geographic area, evidence of increased transmission, or a series of functionally relevant mutations. Variants of SARS-CoV-2 are defined by having a constellation of biologically relevant mutations, and many variants are now being monitored closely by the WHO and other public health agencies around the world. Variants may correspond to lineages directly as they operate on the same resolution, but some variants do not (e.g. B.1.1.7 + E484K is a variant, but does not correspond to a specific lineage as it has arisen many times independently). A number of variants of concern (VOCs) have been formalized by the WHO, such as the Alpha VOC (B.1.1.7, 20I/501Y.V1 or VOC 202012/01), characterized by 23 mutations (13 non-synonymous mutations, four deletions and six synonymous mutations), that is associated with higher transmissibility² and increased mortality^3,4; and the Beta VOC (B.1.351 or 20H/501Y.V2), which emerged independently of B.1.1.7, shares some mutations with the B.1.1.7 VOC and has recently also been associated with low vaccine efficacy in South Africa⁵. In Rwanda, the first case of SARS-CoV-2 was confirmed in the capital city of Kigali on March 14th, 2020, following a series of tests at the borders and at Kigali International Airport (KIA), and was linked to incoming travelers from Mumbai, India. Subsequently, a countrywide total lockdown, coupled with strict prevention measures including contact tracing, was enforced for nearly 2 months, aiming to contain the spread of the virus (Fig. 1A and Supplementary Table S1).
From May 2020, lockdown restrictions were lifted progressively, a number of commercial activities resumed, and the KIA reopened on the 1st of August 2020 (Supplementary Table S1). However, despite continued massive testing⁶, contact tracing, hotspot mapping, and preventive measures⁷, the number of cases continued to increase (Fig. 1 and Supplementary Fig. S1), mainly associated with cross-border land travel by truck drivers⁸ and imported cases (Supplementary Fig. S2). This culminated in a ‘first wave’ of local transmission between July and September 2020. Additional containment measures led to a decline in cases until November 2020, when schools and most activities resumed. In December 2020, another ‘wave’ of infections hit the country, peaking in January–February 2021. As a result, new movement restrictions were enforced, including a total lockdown in the capital city and a 7-day quarantine for international travelers in addition to two negative polymerase chain reaction (PCR) tests, one pre-departure and another upon arrival (Supplementary Table S1).

**Fig. 1: Comparison of the number of sequences taken and case counts over time and space.**

In this study, we reconstruct the introduction and subsequent dispersal of lineages A.23.1 and B.1.380 based on genomic analysis of isolates from the first and second waves of the epidemic in Rwanda. In particular, we highlight a shift from the ancestral dominant B.1.380 lineage in the early stages of local transmission to a new lineage, A.23.1, that is currently dominating throughout the country. Combining the collected genomic sequence data with associated individual travel histories to perform travel history-aware phylogeographic inference, we infer introductions into Rwanda from all of its surrounding countries including those from which no genomic sequences are available. Given the importance of these findings on regional surveillance of SARS-CoV-2, we emphasize the need for strengthening genomic surveillance at the country’s points of entry following the detection of the first cases of the B.1.1.7 and B.1.351 VOCs among travelers arriving at KIA.

Results

Patient characteristics

As of the 10th of February 2021, a total of 16,865 cases had been confirmed in the country, and the sequences analyzed represent 1.2% of the total confirmed cases. The proportion of daily confirmed cases versus the number of sequences taken is illustrated in Fig. 1. In Supplementary Fig. S1, we show a breakdown per month of these numbers, illustrating differences in genome sampling intensity compared to case counts throughout our study period, with the difference being most pronounced during the first 6 weeks of 2021. We sampled a total of 203 cases (reflecting the national screening efforts at points of entry and emerging hotspots) with an average age of 36.7 years, of whom 131 were male and 70 female (sex was unknown for two). Of these, location data were available for 152 individuals, of whom 99 lived in Kigali while the others lived in different districts of the country. Significant efforts were made to obtain associated metadata for all cases, with specific attention to individual travel history data, as these may shed light on the origins of viral variants introduced from neighboring countries (Supplementary Data 1). Of the 203 cases, 28 had recorded travel history (mainly sampled at the airport and other points of entry through national monitoring and testing efforts) from Tanzania (6), Kenya (4), Democratic Republic of the Congo (3), Uganda (3), United States of America (2), United Arab Emirates (2), South Sudan (1), Italy (1), Morocco (1), Senegal (1), Canada (1), China (1), Gabon (1), and Burundi (1). We show the origin of these collected travel cases for which we have genomic data in Fig. 2, with a focus on neighboring countries, which reveals that most travel cases originated in Tanzania, a country that has not yet made any genomic data available. Other important countries from which travelers originated were Kenya, Uganda, and Burundi, representative of the collected data on infected travelers arriving in Rwanda via air travel (Supplementary Fig. S2). We show the number of genome sequences and individual travel cases into Rwanda from these countries in Fig. 3. We also provide the GISAID accession identifiers associated with these genomes in Supplementary Table S2. For many African countries, few or no sequences were available in GISAID, with sequencing heterogeneously clustered throughout the time period considered in this study. The availability of travel history data is thus critical in these cases, as it allows us to characterize the viral population in these countries despite the absence of samples. We note that the travel cases from surrounding countries originate from the second half of 2020, with no such data being available from earlier time periods.
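The counts reported in this paragraph can be cross-checked with a few lines of Python. This is only a sanity-check sketch: the per-country figures and the totals are copied from the text, while the variable and function names are illustrative and not part of the study's analysis code.

```python
# Travel-history counts as listed in the text (country -> number of sequenced cases).
travel_cases = {
    "Tanzania": 6, "Kenya": 4, "Democratic Republic of the Congo": 3,
    "Uganda": 3, "United States of America": 2, "United Arab Emirates": 2,
    "South Sudan": 1, "Italy": 1, "Morocco": 1, "Senegal": 1,
    "Canada": 1, "China": 1, "Gabon": 1, "Burundi": 1,
}

TOTAL_SEQUENCED = 203      # genomes described in this study
TOTAL_CONFIRMED = 16_865   # confirmed cases as of 10 February 2021


def sequenced_percentage(n_sequenced: int, n_confirmed: int) -> float:
    """Percentage of confirmed cases for which a genome was sequenced."""
    return 100.0 * n_sequenced / n_confirmed


n_travel = sum(travel_cases.values())
pct = sequenced_percentage(TOTAL_SEQUENCED, TOTAL_CONFIRMED)

print(f"cases with travel history: {n_travel}")
print(f"sequenced share of confirmed cases: {pct:.1f}%")
```

The per-country counts add up to the 28 travel cases stated in the text, and 203 of 16,865 rounds to the reported 1.2%.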

**Fig. 2: Map showing the number of sequences with recorded travel history per country (n = 28).**

**Fig. 3: Availability of whole genome sequences for African countries from which travelers entered Rwanda.**

Lineage characterization

The available genomes were analyzed using the Pangolin module¹. We show in Fig. 3 that the majority of the SARS-CoV-2 sequences in Rwanda belong to two distinct lineages, A.23.1 and B.1.380. However, the dynamics of their distribution changed over time, as shown in Fig. 4. Indeed, the early stages of local transmission were characterized by circulation of a dominant B.1.380 lineage, which has only been observed in Rwanda and Uganda. The diverse viral strains observed from May to July 2020 most likely reflect early imports from Europe and Asia before suppressive measures (such as the countrywide lockdown and the airport closure; see Supplementary Table S1) were enforced. Nevertheless, increased strain diversity is observed in the period August–October 2020, most likely reflecting introductions through cross-border land travel for goods and cargo⁸.

**Fig. 4: Lineage diversity sampled in Rwanda across four time points: May–Jul 2020 (n = 28), Aug–Oct 2020 (n = 86), Nov–Dec 2020 (n = 74), Jan–Feb 2021 (n = 28).**

Towards the end of 2020, we observed a selective sweep, with lineage A.23.1 taking over. This sub-lineage, first observed in Uganda in late 2020, was reported to contain at least four amino acid changes in the spike protein and amino acid changes in the proteins nsp3, nsp6, ORF8, and ORF9⁹. In particular, these authors suggest that the Q613H mutation in spike may be functionally equivalent to the D614G mutation that arose early in 2020 and is associated with increased viral transmissibility¹⁰. Bugembe et al.⁹ describe a selective sweep of this lineage across Uganda, where it is now also the dominant circulating lineage. Rwandan genome sequencing shows the presence of A.23.1 as early as October 21st, 2020, and a sweep of this lineage was observed from late November (Fig. 4). A.23.1 continues to be the dominant lineage within Rwanda up until February 2021. More recently, a number of infections associated with travel have been identified as variants of concern. The first imported cases of the B.1.1.7 and B.1.351 variants were sampled on December 28th, 2020 and January 4th, 2021, respectively. Analysis by Volz et al.² suggests that B.1.1.7 is a more transmissible lineage, and a recent study suggests that B.1.1.7 is not only more transmissible than preexisting SARS-CoV-2 variants, but may also cause more severe illness and is associated with increased mortality⁴. However, the data available up to and including this paper do not indicate onward transmission of these VOCs.

Phylogeographic reconstruction accommodating individual travel histories

We made use of publicly available data and the sequenced Rwandan SARS-CoV-2 genomes - all available in GISAID^11,12 (Supplementary Data 2) - to infer a time-scaled phylogenetic tree using maximum-likelihood inference (see “Methods”). This phylogeny enabled us to identify two subtrees with predominantly Rwandan sequences (Supplementary Fig. S3). Both of these subtrees consist of genetically distinct lineages, with the larger cluster belonging to lineage B.1.380 (and hence referred to as subtree B.1) and the smaller one to A.23.1 (referred to as subtree A). The considerable difference in sampling dates and genetic distance between the sequences suggests that the currently circulating SARS-CoV-2 Rwandan lineages are a result of at least two independent introduction events that established local transmission. Subtrees A and B.1 have 172 and 218 sequences, and contain a total of 49 and 134 Rwandan sequences, respectively.

To more accurately understand the pattern of SARS-CoV-2 introductions into Rwanda, we performed a Bayesian discrete phylogeographic analysis on subtrees A and B.1 (Supplementary Data 3, 4). The 172 genomes in subtree A originated from 33 locations and included all sequences from lineage A.23.1. The 218 genomes in subtree B.1 originated from 37 locations and included the B.1.380 lineage. In our analysis of both subtrees, we fit a travel history-aware asymmetric discrete-state diffusion process to model the spatial spread between countries. Our phylogeographic reconstructions included a total of 17 sequences with travel history, 11 for the analysis of subtree A and six for subtree B.1. Interestingly, some of these sequences have associated travel histories originating from Tanzania (four in subtree A and one in subtree B.1), a country that had not reported any COVID-19 cases since May 8th, 2020¹³, and also has no publicly available genomes on GISAID. While Burundi and South Sudan have been consistently reporting case numbers, no genomic sequences are available on GISAID from these countries yet. Our joint phylogeographic reconstructions are able to include those countries as locations with SARS-CoV-2 infections (which can then be considered as possible ancestral locations) by exploiting data on infected incoming travelers from those countries. This type of phylogeographic reconstruction enables a more accurate reconstruction of the spread of pathogens by exploiting additional observed data, in the form of documented individual travel histories (which do not need to be inferred).

Figures 5 and 6 show the estimated location-annotated phylogenies that enable us to track the geographic spread of SARS-CoV-2 through time for subtrees A and B.1, with a focus on the available Rwandan sequences. In our analysis of subtree A (Fig. 5), which contains sequences from lineages A.23 and A.23.1, we inferred a minimum number of 22 (95% HPD: [16–29]) introduction events into Rwanda, with a minimum of 13 and 4 of these events originating from Uganda and Kenya, respectively (Fig. 7 and Supplementary Table S3). We found an expected number of two introduction events from Tanzania into Rwanda, corresponding to and being derived from the two arriving traveler cases, as well as single introduction events from South Sudan and China into Rwanda. Figure 5 also shows frequent mixing between Rwanda, Uganda, and Kenya, with the latter two estimated to have seeded introductions into Tanzania, from where no genomic sequences are available to date. However, by carefully collecting metadata of infected individuals, we are able to confirm the presence of lineage A.23.1 among travelers from Tanzania, despite the absence of genomic data. Our travel history-aware inference methodology further enables us to consider such unsampled countries to determine the intensity of exchanges between countries and potentially even infer one of the unsampled countries as the origin of the lineage. In our analysis of subtree B.1, which includes Rwandan lineage B.1.380, we inferred a minimum number of nine (95% HPD: [8–12]) introduction events into Rwanda, with three of these events originating from Kenya (Fig. 7 and Supplementary Table S3). We also found an expected number of two introduction events from both Uganda and Italy. Using Bayesian stochastic search variable selection (BSSVS), we identified seven statistically supported (Bayes Factor > 3) transition routes into Rwanda for subtree A and six for subtree B.1 (Supplementary Table S3).
Our analysis on subtree A showed that Uganda accounted for the majority of SARS-CoV-2 introductions into Rwanda (mean number of Markov jumps: 13.1; 95% HPD: [7–20]), whereas our analysis on subtree B.1 identified Kenya as the main source of SARS-CoV-2 introductions into Rwanda (mean number of Markov jumps: 3.2; 95% HPD: [0–5]).
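The two summary statistics used above, the 95% HPD interval of the number of Markov jumps and the Bayes factor for a transition route, can be sketched as follows. This is a minimal illustration under stated assumptions, not the software used in the study: `hpd_interval` takes the shortest interval covering the requested share of posterior samples, and `bayes_factor` is the generic ratio of posterior odds to prior odds for a rate indicator. The prior inclusion probability depends on the BSSVS prior and is therefore left as an input; all numbers in the example are hypothetical.

```python
import math


def hpd_interval(samples, mass=0.95):
    """Shortest interval that contains `mass` of the posterior samples."""
    xs = sorted(samples)
    n = len(xs)
    k = max(1, math.ceil(mass * n))  # number of samples the interval must cover
    # Slide a window of k consecutive sorted samples and keep the narrowest one.
    best = min(range(n - k + 1), key=lambda i: xs[i + k - 1] - xs[i])
    return xs[best], xs[best + k - 1]


def bayes_factor(posterior_prob, prior_prob):
    """Posterior odds divided by prior odds for inclusion of a transition rate."""
    posterior_odds = posterior_prob / (1.0 - posterior_prob)
    prior_odds = prior_prob / (1.0 - prior_prob)
    return posterior_odds / prior_odds


# Hypothetical posterior samples of the number of jumps along one route.
jump_counts = [11, 12, 12, 13, 13, 13, 14, 14, 15, 19]
low, high = hpd_interval(jump_counts, mass=0.8)
print(f"mean = {sum(jump_counts) / len(jump_counts):.1f}, 80% HPD = [{low}, {high}]")

# Hypothetical indicator frequencies; a route counts as supported when BF > 3.
print(f"BF = {bayes_factor(posterior_prob=0.75, prior_prob=0.5):.1f}")
```

A route whose indicator is switched on more often in the posterior than the prior would suggest yields a Bayes factor above 1; the threshold of 3 used in the text is the cut-off for calling a route statistically supported.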

**Fig. 5: Maximum clade credibility phylogeny for subtree A, representing diversity of lineages A.23 and A.23.1.**

**Fig. 6: Maximum clade credibility tree for subtree B.1, which includes Rwandan lineage B.1.380.**

**Fig. 7: Supported transitions into Rwanda.**

Consistent with previously published analyses of SARS-CoV-2, we observe that our discrete Bayesian phylogeographic reconstructions resulted in MCC trees of which the internal nodes can be poorly supported, a common phenomenon in SARS-CoV-2 phylogenies (Figs. 5 and 6). The considerable uncertainty in phylogenetic clustering results in a variety of diverging phylogeographic histories, which end up not being captured in the MCC trees as these only represent point estimates of the posterior distribution. To this end, we explored the ancestral spatial histories of individual samples of interest using Markov jump trajectory plots^14,15 (Fig. 8). In the case of subtree A, the travel-aware reconstructions showed four sequences consistently forming two clusters with posterior support > 0.9. However, the first two of these four cases correspond to cross-border truck drivers of Tanzanian nationality (sampled on the same day at the same sampling location, i.e. the Rusumo border), with no such metadata available for the other two cases in subtree A. Hence, the two inferred introductions actually correspond to four introduction events from Tanzania into Rwanda, which are clustered together by location in our joint inference, likely as a result of additional samples currently lacking from the border region. Because of this, sequences in each cluster result in nearly identical spatial histories. Figure 8A, B shows the Markov jump trajectory plots for these two introductions. Overall, we see considerable ambiguity in the ancestral locations prior to Tanzania, as seen by the density of lines landing in “Other” alternate locations. More broadly, we see that in both cases the Rwandan sequences diverged from ancestors in Tanzania, Kenya, and Uganda, with considerable uncertainty placed at the root, among the Democratic Republic of the Congo (DRC), Sierra Leone, and Mali. The introduction in subtree B.1, on the other hand, presents us with a different ancestral relation with Tanzania (Fig. 8C). Although we also generally observe considerable uncertainty in the ancestral paths, we observe a strong signal for an ancestry in Rwanda prior to the introduction from Tanzania. This would imply a transmission chain starting in Rwanda, spreading into Tanzania, and then being reintroduced into Rwanda. A similar dynamic of outflow and inflow of Rwandan lineages can be seen in the ancestral histories for the sequences with travel history to Morocco, Italy, and the DRC (Supplementary Fig. S5). This suggests a bidirectional exchange of SARS-CoV-2 genomes between each of these countries and Rwanda. However, because of the differences in sequencing efforts across the globe, we cannot dismiss the possibility of intermediary locations in these cases. Nonetheless, all spatial trajectory plots imply the presence of SARS-CoV-2 lineages circulating in Tanzania after May 2020. The difference in ancestral histories, coupled with the fact that these travel history sequences are genetically distant from each other, implies that multiple SARS-CoV-2 lineages have circulated in Tanzania to this day.
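The outflow-and-inflow pattern described above can be made concrete with a toy example. The sketch below assumes that one sampled ancestral history is summarized as an ordered list of locations from root to tip, which is a simplification of the Markov jump trajectories shown in Fig. 8; the function names and the example path are hypothetical.

```python
def jumps_into(trajectory, target):
    """Source location of every transition that ends in `target` (root-to-tip order)."""
    return [
        prev
        for prev, cur in zip(trajectory, trajectory[1:])
        if cur == target and prev != target
    ]


def is_reintroduction(trajectory, target):
    """True if the lineage was in `target`, left it, and later came back."""
    # Collapse consecutive repeats so each stay in a location counts once.
    visits = [loc for i, loc in enumerate(trajectory) if i == 0 or loc != trajectory[i - 1]]
    return visits.count(target) >= 2


# Hypothetical root-to-tip history resembling the pattern discussed for Fig. 8C.
path = ["Kenya", "Rwanda", "Rwanda", "Tanzania", "Rwanda"]

print(jumps_into(path, "Rwanda"))
print(is_reintroduction(path, "Rwanda"))
```

Applied to every tree in a posterior sample, counts of this kind are what the expected numbers of introductions summarize; a single path only illustrates the bookkeeping.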

**Fig. 8: Ancestral spatial trajectories for individual patients.**

In addition, subtree A contains a sequence with travel history to South Sudan. Although over 10,000 COVID-19 cases have been reported to date¹³, no genomic sequences were publicly available for South Sudan before June 2021. The sample tested at arrival in Rwanda presents us with evidence of lineage A having circulated in South Sudan during the months of May and June 2020 (Fig. 8D). As expected, the Markov jump trajectory plots for this sample also show considerable uncertainty in the reconstruction of the ancestral locations prior to South Sudan. Regardless, we see some support for ancestry in Kenya, Uganda and the DRC, which provides further evidence for viral transmission between the neighboring countries in the area. We compare in “Supplementary Materials” the diversity of lineages in Rwanda to that of two of its neighboring countries that have released a similar number of SARS-CoV-2 genomes, i.e. Uganda and Kenya. In Supplementary Fig. S6-S8, we show that, while each country has its own dynamics, Rwanda and Uganda have seen a similar rise in the number of infections with lineage A.23.1, whereas the surge in infections with B.1.380 was specific to Rwanda. In Supplementary Fig. S9 and S10, we show the differences between the number of recorded travel cases and the estimated Markov jumps in both subtrees A and B.1, illustrative of the ability of travel history-aware phylogeographic reconstruction to estimate transition between countries beyond what has been collected as part of the metadata associated with the available genomes.

To assess whether virus populations were structured per country, we performed compartmentalization analyses using tree-based methods on a posterior distribution of phylogenies in BaTS¹⁶. BaTS yielded significant values for all three statistics: p < 0.001 for PS and AI, p < 0.01 for MC(Rwanda) in both the A.23.1 and B.1.380 subtree analyses. The significant degree of clustering suggests that for both of these lineages, local transmission chains have played an important role in driving the Rwandan epidemic. Because we find a significant tendency for Rwandan SARS-CoV-2 genomes to cluster according to the location of sampling, we subsequently investigated the spatial patterns of virus spread within Rwanda. Our continuous phylogeographic analysis of SARS-CoV-2 lineages highlights an important inter-connection of those lineages centered around Kigali, after having been introduced in Rwanda (Supplementary Fig. S11). Most sequences sampled outside the city appeared to be evolutionarily linked to sequences sampled within this city area, and would then correspond to independent dispersal events from Kigali. However, this phylogeographic pattern, i.e. the central importance of Kigali within the dispersal history of SARS-CoV-2 lineages, might to some extent result from the higher sampling effort within the capital city. To assess the effect of sampling bias, we performed multiple replicate analyses on subsampled data sets, which showed a pattern of spread consistent with the one inferred on the original data set (Supplementary Fig. S12; see “Supplementary Materials” for additional information on the sensitivity analysis performed). Regardless, it is likely that a higher sampling effort outside Kigali would highlight more local transmission, and this represents an important avenue of further research.

Discussion

Here we describe the pattern of transmission of SARS-CoV-2 in Rwanda from May 2020 to February 2021. In particular, we report the spread of a SARS-CoV-2 variant of the A lineage (A.23.1) with notable amino acid changes in the spike protein as well as several non-spike protein changes first detected in Uganda⁹. Indeed, most SARS-CoV-2 sequence diversity in Rwandan strains belongs to two distinct lineages: A.23.1 and B.1.380. The latter dominated throughout the early stages of the pandemic before a shift towards the A.23.1 lineage occurred in November 2020. A similar pattern was observed in neighboring Uganda, as described by Bugembe et al.⁹. The authors describe the lineage as a variant of concern (VOC) in the sense that it shares mutations with the currently known lineage B VOCs, such as the changes in key spike protein regions (the furin cleavage site and the 613/614 change). However, functional analyses are needed to determine whether these mutations have effects on transmission rates, immune evasion, vaccine efficacy, and/or case-fatality rates.

In this study, we reported on the ongoing genomic sequencing efforts in Rwanda, which are complemented with careful collection of associated travel history metadata of incoming travelers. These efforts enabled us to exploit this information by performing joint Bayesian travel history-aware phylogeographic inference on these data. By applying this recently developed approach, we demonstrated considerable contributions of neighboring countries to sequence introductions into Rwanda (as well as possible bidirectional exchanges). Of particular interest to this study, we were able to include traveler cases from Tanzania, Burundi, and South Sudan, while none of these three countries had made any SARS-CoV-2 genomes available throughout the study period. According to the data we collected, two infected Rwandan travelers returned from Tanzania on the 16th of June, 2020 and two more on the 4th of January, 2021. Our findings also complement a statement from the WHO¹⁷ that a number of travelers from Tanzania who have traveled to neighboring countries and beyond have tested positive for COVID-19. Incorporating travel history information in phylogeographic analysis can mitigate sampling bias (from unsampled or under-sampled countries)¹⁴, although this cannot fully compensate for the lack of sequences from other countries.

The reported import into Rwanda of two VOCs, i.e. B.1.1.7 and B.1.351, sampled at the Kigali International Airport in late December 2020 and early January 2021, is of particular interest. The patient infected with the B.1.1.7 variant was a Burundian traveling from Burundi, while the patient infected with the B.1.351 variant was a Zimbabwean coming from the DRC, suggesting that VOCs may be actively circulating in neighboring countries. Indeed, although Burundi and Tanzania currently have no SARS-CoV-2 sequences uploaded onto GISAID, and South Sudan had none until June 2021, the DRC has shared a total of 416 sequences, of which 21 are VOCs (eight B.1.1.7 and 13 B.1.351), while Kenya has shared a total of 1478 sequences.

Ongoing genomic surveillance in Rwanda revealed additional infections with these VOCs (mostly B.1.351) from travelers sampled at the airport. In an effort to curb the spread of the different lineages and variants, and following the upsurge of cases in November–December 2020, several measures were taken by the Rwandan government including a 7-day quarantine to all incoming passengers followed by an RT-PCR test, in addition to presenting a COVID-19 negative test upon arrival. Furthermore, the capital city of Kigali went through a total lockdown from mid-January to early February 2021, and travels between districts were prohibited until mid-March 2021. A 7 pm to 4 am curfew was instituted in early February 2021; public offices were closed and employees were working from their homes. All schools in Kigali were closed as well, and classes were being held online. Cafés and restaurants were only providing takeaway services. Churches, public swimming pools, and gyms were closed (Office of the Prime Minister - Republic of Rwanda 2021; Supplementary Table S1). Such suppression mechanisms (population-wide social distancing, lockdown, school closure, case isolation) have been shown to have the greatest impact (as far as non-pharmaceutical approaches are concerned) in terms of transmission control¹⁸. Additionally, all public health facilities received free antigen rapid diagnostic tests for every single person presenting COVID-19 related symptoms. Moreover, a vaccination campaign was initiated in March 2021, with the aim to vaccinate all front liners and vulnerable populations (elderly and people with other underlying health conditions) in the first phase. To this end, Rwanda received both Pfizer and AstraZeneca vaccines. A rapid and efficacious vaccination coverage will ease the social and economic disruptions associated with non-pharmaceutical interventions. 
However, a number of published studies^19,20,21 demonstrate evidence of escape of SARS-CoV-2 VOCs from vaccine-induced immunity. For example, Becker et al.²¹ reported a ‘substantially reduced Ab neutralization for the B.1.351 variant’ in sera obtained from vaccinated people, highlighting the importance of genomic surveillance, monitoring of incoming travelers, and efficient contact tracing upon appearance of new variants.

Our results suggest that neighboring countries play an important role in establishing the circulation of (different strains of) SARS-CoV-2 in Rwanda. However, due to the unevenness in sampling across countries, with several not yet having provided any genomic sequences, additional data are required to accurately assess the effect of short-distance (e.g. crossing the borders with neighboring countries) versus long-distance travel in shaping the Rwandan epidemic.

Methods

Study design

This is an in-depth study of SARS-CoV-2 strains that circulated in Rwanda from May 2020 to February 2021, in which we describe the demography and epidemiology of 203 SARS-CoV-2 genomes from collected SARS-CoV-2 positive oropharyngeal swabs. These swabs were obtained from two distinct groups: from individuals residing in different provinces of Rwanda (n = 189) and from returning travelers, whose samples were collected at the airport (n = 14). All samples were retrieved from the biorepository of the National Reference Laboratory (NRL) in Kigali, Rwanda. Samples with a cycle threshold (Ct) value below 33 were selected, ensuring a wide geographical representation including ports of entry, and case description variables (date and place of RT-PCR test, age, gender, occupation, residence, nationality, travel history) were reported.
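The Ct-based selection criterion can be written as a simple filter. The sketch below only illustrates the stated cut-off (Ct below 33); the record layout, field names, and example values are hypothetical, and the geographic balancing described in the text is not modeled.

```python
from dataclasses import dataclass

CT_CUTOFF = 33.0  # samples with a Ct value below this were eligible (from the text)


@dataclass
class Sample:
    sample_id: str            # hypothetical identifier
    ct: float                 # RT-PCR cycle threshold
    collected_at_airport: bool


def select_for_sequencing(samples, ct_cutoff=CT_CUTOFF):
    """Keep samples whose Ct value is strictly below the cut-off."""
    return [s for s in samples if s.ct < ct_cutoff]


# Hypothetical biorepository records.
records = [
    Sample("S001", 13.4, False),
    Sample("S002", 32.7, True),
    Sample("S003", 33.0, False),   # excluded: not below the cut-off
    Sample("S004", 36.1, True),    # excluded: low viral load
]

selected = select_for_sequencing(records)
print([s.sample_id for s in selected])
```

A lower Ct value corresponds to more viral RNA in the swab, which is why a Ct ceiling is a common way to favor samples likely to yield complete genomes.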

Sequencing

RNA Extraction

Ribonucleic acid (RNA) of the virus was extracted from confirmed SARS-CoV-2 positive clinical samples with Ct values ranging from 13.4 to 32.7 on a Maxwell 48 device using the Maxwell RSC viral RNA kit (Promega) following a viral inactivation step using proteinase K according to the manufacturer’s instructions.

SARS-CoV-2 whole genome sequencing

Reverse transcription was performed using SuperScript IV VILO master mix, and 3.3 μl of RNA was combined with 1.2 μl of master mix and 1.5 μl of H₂O. This was incubated at 25 °C for 10 min, 50 °C for 10 min, and 85 °C for 5 min. PCRs used the primers and conditions recommended in the nCoV-2019 sequencing protocol (ARTIC Network, 2020) or the 1,200 bp amplicons described by Freed and colleagues²² (Supplementary Table S4).

Primers from version 3 of the ARTIC Network and the 1,200 bp amplicons were used and were synthesized by Integrated DNA Technologies. Samples were multiplexed using the Oxford Nanopore native barcoding expansion kits 1–12, 13–24, or the native barcoding expansion 96 in combination with the ligation sequencing kit 109 (Oxford Nanopore). Sequencing was carried out on a MinION using R9.4.1 flow cells.

Genome assembly

The data generated on the Oxford Nanopore Technologies (ONT) MinION were processed using the ARTIC bioinformatic protocol (https://artic.network/ncov-2019/ncov2019-bioinformatics-sop.html). Briefly, the FAST5 sequence files were basecalled and demultiplexed using Guppy 4.2.2 in high accuracy mode, requiring barcodes at both ends of the read. FASTQ reads associated with each sample were filtered and concatenated via the guppyplex module. Consensus SARS-CoV-2 sequences were generated via the ARTIC nanopolish pipeline and assembled for each sample by aligning the respective sample reads to the Wuhan-Hu-1 reference genome (GenBank Accession: MN908947.3) with the removal of sequencing primers, followed by a polishing step using the raw FAST5 signal files. Positions with insufficient genome coverage were masked with N.
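The final masking step can be illustrated with a minimal Python sketch. It is not the ARTIC pipeline code: the function name `mask_low_coverage`, the per-position depth list, and the default threshold of 20 reads are all assumptions made for illustration, since the text does not state which depth counts as insufficient.

```python
from typing import Sequence


def mask_low_coverage(consensus: str, depth: Sequence[int], min_depth: int = 20) -> str:
    """Replace every consensus base whose read depth is below min_depth with 'N'.

    min_depth=20 is an assumed value, not one taken from the Methods text.
    """
    if len(consensus) != len(depth):
        raise ValueError("consensus and depth must have the same length")
    return "".join(
        base if d >= min_depth else "N"
        for base, d in zip(consensus, depth)
    )


# Toy example: positions 2 and 4 fall below the assumed threshold.
masked = mask_low_coverage("ACGT", [30, 5, 20, 0])
```

Masking instead of deleting keeps every consensus sequence in the coordinate system of the reference, which is what allows the genomes to be combined into one alignment later on.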

Phylogenetic and phylogeographic analysis

We downloaded all SARS-CoV-2 genomes from the available Nextstrain build²³ with Africa-focused subsampling (https://nextstrain.org/ncov/africa) on February 23, 2021. These sequences were complemented with all 203 Rwandan sequences generated in this study and available on GISAID on February 24, 2021. The 203 Rwandan whole-genome SARS-CoV-2 sequences were assigned Pango lineages, as described by Rambaut et al.¹, using pangolin v2 and pangoLEARN model v2021-02-21 by O’Toole et al.²⁴. We used Squarify to construct the square treemaps of lineage diversity across three time points²⁵. We mapped the combined data set against the canonical reference (GISAID ID: EPI_ISL_406801) using minimap2²⁶, trimmed the data to positions 265–29,674, and padded with Ns in order to mask out the 5′ and 3′ UTRs. We used the resulting alignment to estimate an unrooted maximum-likelihood phylogeny (Supplementary Data 5) with IQ-TREE v2.1.2²⁶, whose automated model selection identified the general time-reversible model with empirical base frequencies and an auto-discrete-gamma model for varying rates across sites with eight rate categories (GTR + F + R8) as best fitting the data. We subsequently calibrated this phylogeny in time using TreeTime²⁷ while estimating the molecular clock and skyline coalescent model parameters and using three SARS-CoV-2 genomes from Wuhan, 2019, as the outgroup.
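The trimming and padding step can be sketched as follows. This is an illustrative stand-in, not the tool used in the study: the helper name `mask_outside` is hypothetical, and the sketch assumes that 265 and 29,674 are 1-based, inclusive reference coordinates, which the text does not specify.

```python
def mask_outside(aligned: str, start: int = 265, end: int = 29674) -> str:
    """Keep positions start..end (assumed 1-based, inclusive) and replace
    everything outside that window with 'N', preserving sequence length."""
    if not 1 <= start <= end:
        raise ValueError("expected 1 <= start <= end")
    n = len(aligned)
    left = min(start - 1, n)   # number of leading positions to mask
    right = min(end, n)        # index just past the last position kept
    return "N" * left + aligned[left:right] + "N" * (n - right)


# Toy example on a 10-base sequence, keeping positions 3..8.
trimmed = mask_outside("ACGTACGTAC", start=3, end=8)
```

Because the masked sequence keeps its original length, all genomes remain column-aligned to the reference and can be passed directly to tree inference.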

We went on to perform a discrete Bayesian phylogeographic analysis in BEAST 1.10.5²⁸ using a recently developed model that is able to incorporate available individual travel history information associated with the sequenced Rwandan samples^14,15. Exploiting such information can yield more realistic reconstructions of virus spread, particularly when travelers from unsampled or undersampled locations are included to mitigate sampling bias. Because such an analysis is not feasible on the full data set owing to the large number of sequences, we selected two subtrees in the overall phylogeny (see “Results” section) that predominantly consisted of Rwandan sequences. These comprised 172 sequences (subtree A) and 218 sequences (subtree B.1), of which, respectively, 11 and 6 infected individuals have associated travel history information (Supplementary Table S2). We incorporated the collection dates for those sequences into our analyses, and treated the time when a traveler started the return journey to Rwanda as a random variable, given that the time of traveling to the sampling location (in Rwanda) was not known with sufficient precision. We specified normal prior distributions over these 17 random variables, informed by an estimate of the time of infection and truncated to be positive (back-in-time) relative to the sampling date. As in the work of Lemey et al.¹⁴, we used a mean of 10 days before sampling, based on a mean incubation time of 5 days²⁹ and a constant ascertainment period of 5 days between symptom onset and testing¹⁸, and a standard deviation of 3 days to incorporate the uncertainty on the incubation time. We retrieved the 172 and 218 sequences from the full alignment and performed joint discrete phylogeographic inference on each resulting data set using BEAST 1.10.5, employing the BEAGLE 3.2.0 high-performance computational library³⁰ to improve performance.
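To make the prior on the travel times concrete, the following Python sketch draws values from a normal distribution with mean 10 days and standard deviation 3 days, truncated to positive values by simple rejection. It only illustrates the shape of the prior described above; it is not how BEAST evaluates it, and the function and constant names are illustrative assumptions.

```python
import random

INCUBATION_DAYS = 5.0      # mean incubation time, as stated in the text
ASCERTAINMENT_DAYS = 5.0   # symptom onset to testing, as stated in the text
PRIOR_MEAN = INCUBATION_DAYS + ASCERTAINMENT_DAYS  # 10 days before sampling
PRIOR_SD = 3.0


def sample_days_before_sampling(rng: random.Random,
                                mean: float = PRIOR_MEAN,
                                sd: float = PRIOR_SD) -> float:
    """Draw from Normal(mean, sd) truncated to positive values (rejection sampling)."""
    while True:
        x = rng.gauss(mean, sd)
        if x > 0:
            return x


rng = random.Random(1)
draws = [sample_days_before_sampling(rng) for _ in range(10_000)]
prior_sample_mean = sum(draws) / len(draws)
```

With these parameters the truncation at zero lies more than three standard deviations below the mean, so it removes very little probability mass and mainly guarantees that the return journey precedes sampling.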
For each of these phylogeographic analyses, we make use of Bayesian stochastic search variable selection (BSSVS) to simultaneously determine which migration rates are zero depending on the evidence in the data and to efficiently infer the ancestral locations, in addition to providing a Bayes factor test to identify significant non-zero migration rates³¹. We also estimated the expected number of transitions (known as Markov jumps)³² into Rwanda from all other countries in the data set. These analyses ran for a total of 200 and 250 million iterations, respectively, with the Markov chains being sampled every 100,000th iteration, in order to reach an effective sample size (ESS) for all relevant parameters of at least 200, as determined by Tracer 1.7³³. We used TreeAnnotator to construct maximum clade credibility (MCC) trees for each subtree.
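As a sketch of the Bayes factor test, the snippet below compares the posterior odds that a migration rate is included (its BSSVS indicator equals 1) with the corresponding prior odds. The prior inclusion probability depends on the prior placed on the number of non-zero rates, which is not reported here, so it is left as an input; the function names and the toy indicator values are hypothetical.

```python
from typing import Sequence


def inclusion_probability(indicators: Sequence[int]) -> float:
    """Posterior inclusion probability: fraction of MCMC samples with indicator == 1."""
    if not indicators:
        raise ValueError("need at least one sample")
    return sum(indicators) / len(indicators)


def bssvs_bayes_factor(posterior_prob: float, prior_prob: float) -> float:
    """Posterior odds divided by prior odds that a migration rate is non-zero."""
    if not 0.0 < prior_prob < 1.0:
        raise ValueError("prior_prob must lie strictly between 0 and 1")
    if not 0.0 <= posterior_prob <= 1.0:
        raise ValueError("posterior_prob must lie between 0 and 1")
    if posterior_prob == 1.0:
        return float("inf")  # rate included in every sample
    posterior_odds = posterior_prob / (1.0 - posterior_prob)
    prior_odds = prior_prob / (1.0 - prior_prob)
    return posterior_odds / prior_odds


# Toy example: indicator on in 9 of 10 samples, assumed prior inclusion probability 0.1.
p = inclusion_probability([1, 1, 1, 0, 1, 1, 1, 1, 1, 1])
bf = bssvs_bayes_factor(p, prior_prob=0.1)
```

Dividing by the prior odds matters because a sparse prior makes inclusion unlikely a priori, so even a moderate posterior inclusion probability can correspond to substantial support.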

For each subtree analysis, we assessed whether the SARS-CoV-2 lineages were structured according to country. To this end, we investigated the association between phylogeny and sampling location using Bayesian Tip-association Significance testing as implemented in the BaTS software package¹⁶. BaTS allows testing for a significant degree of taxon-trait clustering by evaluating three different statistics: parsimony score (PS), association index (AI), and monophyletic clade (MC) size on a posterior sample of trees. These computed statistics are then compared to a null distribution of permuted taxon-trait values, corresponding to a situation of randomly mixed locations, implying a dominant role of importations over local circulation in establishing the (local) epidemic. We performed our BaTS analyses on a sample of 1000 posterior trees and computed 100 null replicates.
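The logic of this permutation test can be sketched for the parsimony score on a single tree. This is a simplified illustration and not the BaTS implementation: BaTS evaluates the statistics over a posterior sample of trees, whereas the sketch uses one hypothetical rooted binary tree written as nested tuples, with invented tip names and locations.

```python
import random


def parsimony_score(tree, traits):
    """Fitch parsimony: minimum number of location changes on a rooted binary tree.

    tree is a tip name (str) or a pair (left, right); traits maps tip -> location.
    """
    def visit(node):
        if isinstance(node, str):
            return {traits[node]}, 0
        left_set, left_cost = visit(node[0])
        right_set, right_cost = visit(node[1])
        shared = left_set & right_set
        if shared:
            return shared, left_cost + right_cost
        return left_set | right_set, left_cost + right_cost + 1

    return visit(tree)[1]


def permutation_p_value(tree, traits, n_null=100, seed=0):
    """Share of label permutations whose score is at most the observed score."""
    rng = random.Random(seed)
    observed = parsimony_score(tree, traits)
    tips = list(traits)
    labels = [traits[t] for t in tips]
    hits = 0
    for _ in range(n_null):
        rng.shuffle(labels)  # randomly mix locations across tips
        if parsimony_score(tree, dict(zip(tips, labels))) <= observed:
            hits += 1
    return observed, hits / n_null


# Hypothetical example: locations cluster perfectly by clade.
tree = ((("r1", "r2"), ("r3", "r4")), (("u1", "u2"), ("u3", "u4")))
traits = {t: ("Rwanda" if t.startswith("r") else "Uganda")
          for t in ["r1", "r2", "r3", "r4", "u1", "u2", "u3", "u4"]}
observed_ps, p_value = permutation_p_value(tree, traits, n_null=100)
```

A low parsimony score relative to the permuted labels indicates that sequences from the same location cluster together more than expected under random mixing.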

To explore the local spread of SARS-CoV-2 lineages introduced in Rwanda, we also performed a continuous phylogeographic analysis following a procedure similar to the one defined by Dellicour et al.³⁴. Specifically, we used the relaxed random walk (RRW) diffusion model³⁵ available in BEAST 1.10.5²⁸ to infer the dispersal history of Rwandan lineages along Rwandan clades identified within the two subtree-specific MCC trees that resulted from the discrete Bayesian phylogeographic inference described above. To achieve a sufficient level of spatial precision, the continuous phylogeographic analysis was based only on those sampled genomes for which the Rwandan sector of origin was known, which is the maximal level of spatial precision available for these samples. For each sampled genome associated with this level of sampling precision, which corresponds to 57% of available Rwandan genomes, we retrieved geographic coordinates from a point randomly sampled within its sector of origin. The MCMC chain was run in BEAST 1.10.5 for 30 million iterations and sampled every 10,000th iteration. Its convergence and mixing properties were again assessed with Tracer³³, and an appropriate number of sampled trees was discarded as burn-in (10%). The resulting sets of plausible trees were used to obtain subtree-specific MCC summary trees using TreeAnnotator, and we then used functions available in the R package “seraphim”³⁶ to extract spatio-temporal information embedded within posterior trees and visualize the continuous phylogeographic reconstructions. Finally, we used the baltic Python library to visualize the phylogenies³⁷.
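Drawing a random point within a sector can be sketched with rejection sampling inside the bounding box of the sector polygon. The code below is a minimal illustration, not the procedure used in the study: the polygon coordinates are invented, and longitude and latitude are treated as planar x and y, which is an approximation that is reasonable only for small areas.

```python
import random


def point_in_polygon(x, y, polygon):
    """Ray-casting test; polygon is a list of (x, y) vertices in order."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        if (y1 > y) != (y2 > y):  # edge straddles the horizontal line through y
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def random_point_in_polygon(polygon, rng):
    """Sample uniformly in the bounding box until a point falls inside the polygon."""
    xs = [p[0] for p in polygon]
    ys = [p[1] for p in polygon]
    while True:
        x = rng.uniform(min(xs), max(xs))
        y = rng.uniform(min(ys), max(ys))
        if point_in_polygon(x, y, polygon):
            return x, y


# Hypothetical sector outline (longitude, latitude); not a real Rwandan sector.
sector = [(30.00, -2.00), (30.10, -2.00), (30.10, -1.90), (30.04, -1.86), (30.00, -1.90)]
lon, lat = random_point_in_polygon(sector, random.Random(42))
```

Sampling a point within the polygon, as opposed to always using its centroid, avoids assigning identical coordinates to all genomes from the same sector.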

Ethical approval

The study was approved by the Rwanda National Ethics Committee (FWA Assurance No. 00001973 IRB 00001497 of IORG0001100/15April2020). An exemption from informed consent was issued based on the use of retrospective anonymous data and no medical intervention. The study was further approved by the IRB of the University of Rwanda, College of Medicine and Health Sciences (Approval notice No 325/CMHS IRB/2020).

Reporting Summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The reported SARS-CoV-2 genomes are available on GISAID (www.gisaid.org) under the accession numbers EPI_ISL_614763, EPI_ISL_614980, EPI_ISL_615063, EPI_ISL_615064, EPI_ISL_615067, EPI_ISL_615069, EPI_ISL_615071, EPI_ISL_615074, EPI_ISL_615075, EPI_ISL_707711-EPI_ISL_707713, EPI_ISL_707771-EPI_ISL_707774, EPI_ISL_707776, EPI_ISL_707777, EPI_ISL_707779, EPI_ISL_707780, EPI_ISL_707783, EPI_ISL_707787-EPI_ISL_707790, EPI_ISL_735436-EPI_ISL_735438, EPI_ISL_735444-EPI_ISL_735448, EPI_ISL_925847-EPI_ISL_925915, EPI_ISL_930567, EPI_ISL_930634, EPI_ISL_930853, EPI_ISL_960227-EPI_ISL_960302, EPI_ISL_1063900-EPI_ISL_1063901, EPI_ISL_1063905, EPI_ISL_1063915, EPI_ISL_1063994, EPI_ISL_1064022, EPI_ISL_1064147-EPI_ISL_1064149, EPI_ISL_1064152-EPI_ISL_1064154, EPI_ISL_1064163-EPI_ISL_1064166, EPI_ISL_1064168, EPI_ISL_1064170, EPI_ISL_1064171. We have also deposited the reads used to generate the SARS-CoV-2 genomes into the European Nucleotide Archive (ENA) under the accession number PRJEB45303.

References

Rambaut, A. et al. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat. Microbiol. https://doi.org/10.1038/s41564-020-0770-5 (2020).
Volz, E. et al. Transmission of SARS-CoV-2 Lineage B.1.1.7 in England: insights from linking epidemiological and genetic data. medRxiv https://doi.org/10.1101/2020.12.30.20249034 (2021).
Horby, P. et al. NERVTAG note on B.1.1.7 severity. SAGE https://www.gov.uk/government/publications/nervtag-paper-on-covid-19-variant-of-concern-b117 (2021).
Davies, N. G. et al. Increased mortality in community-tested cases of SARS-CoV-2 lineage B.1.1.7. Nature https://doi.org/10.1038/s41586-021-03426-1 (2021).
Cele, S. et al. Escape of SARS–CoV-2 501Y.V2 variants from neutralization by convalescent plasma. medRxiv https://doi.org/10.1038/s41586-021-03471-w (2021).
Mutesa, L. et al. A pooled testing strategy for identifying SARS–CoV-2 at low prevalence. Nature https://doi.org/10.1038/s41586-020-2885-5 (2020).
Clarisse, M. et al. Use of technologies in COVID-19 containment in Rwanda. Rw. Public Health Bul. 2, 7–12 (2020).
Musanabagnwa, C. et al. Easing lockdown restrictions during COVID-19 outbreak in Rwanda. Rw. Public Health Bul. 2, 24–29 (2020).
Bugembe, D. L. et al. Emergence and spread of a SARS-CoV-2 lineage A variant (A.23.1) with altered spike protein in Uganda. Nat. Microbiol. https://doi.org/10.1038/s41564-021-00933-9 (2021).
Zhang, L. et al. SARS–CoV-2 spike-protein D614G mutation increases virion spike density and infectivity. Nat. Commun. https://doi.org/10.1038/s41467-020-19808-4 (2020).
Elbe, S. & Buckland-Merrett, G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Glob. Chall. 1, 33–46 (2017).
Shu, Y. & McCauley, J. GISAID: Global initiative on sharing all influenza data – from vision to reality. Euro Surveill. https://doi.org/10.2807/1560-7917.ES.2017.22.13.30494 (2017).
WHO. WHO Coronavirus (COVID-19) dashboard. WHO Web https://www.who.int/emergencies/diseases/novel-coronavirus-2019 (2020).
Lemey, P. et al. Accommodating individual travel history and unsampled diversity in Bayesian phylogeographic inference of SARS–CoV-2. Nat. Commun. 11, 1–14 (2020).
Hong, S., Lemey, P., Suchard, M. & Baele, G. Bayesian phylogeographic analysis incorporating predictors and individual travel histories in BEAST. Curr. Protoc. https://doi.org/10.1002/cpz1.98, PMID: 33836121 (2021).
Parker, J., Rambaut, A. & Pybus, O. G. Correlating viral phenotypes with phylogeny: accounting for phylogenetic uncertainty. Infect. Genet. Evol. 8, 239–246 (2008).
WHO. WHO Director General’s Statement on Tanzania and COVID-19. https://www.who.int/news/item/20-02-2021-who-director-general-s-statement-on-tanzania-and-covid-19 (2021).
Ferguson, N. M. et al. Impact of non-pharmaceutical interventions (NPIs) to reduce COVID-19 mortality and healthcare demand. Imperial College COVID-19 Response Team, London, https://doi.org/10.25561/77482 (2020).
Supasa, P. et al. Reduced neutralization of SARS–CoV-2 B.1.1.7 variant from naturally acquired and vaccine induced antibody immunity. SSRN https://doi.org/10.2139/ssrn.3775873 (2021).
Zhou, D., Supasa, P., Ren, J., Stuart, D. I. & Screaton, G. R. Evidence of escape of SARS-CoV-2 variant B.1.351 from natural and vaccine-induced sera. Cell 184, 2348–2361 (2021).
Becker, M. et al. Immune response to SARS–CoV-2 variants of concern in vaccinated individuals. Nat. Commun. https://doi.org/10.1038/s41467-021-23473-6 (2021).
Freed, N. E., Vlková, M., Faisal, M. B. & Silander, O. K. Rapid and inexpensive whole-genome sequencing of SARS-CoV-2 using 1200 bp tiled amplicons and Oxford Nanopore Rapid Barcoding. Biol. Methods Protoc. 5, 1–7 (2021).
Hadfield, J. et al. Nextstrain: real-time tracking of pathogen evolution. Bioinformatics 34, 4121–4123 (2018).
O’Toole, Á. et al. Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool. Virus Evol. 7, 1–9 (2021).
Bruls, M., Huizing, K. & van Wijk, J. J. Squarified treemaps. Eurographics (Springer, 2000).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Sagulenko, P., Puller, V. & Neher, R. A. TreeTime: maximum-likelihood phylodynamic analysis. Virus Evol. 4, 1–9 (2018).
Suchard, M. A. et al. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 4, 1–5 (2018).
Lauer, S. A. et al. The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application. Ann. Intern. Med. 172, 577–582 (2020).
Ayres, D. L. et al. BEAGLE 3: improved performance, scaling, and usability for a high-performance computing library for statistical phylogenetics. Syst. Biol. 68, 1052–1061 (2019).
Lemey, P., Rambaut, A., Drummond, A. J. & Suchard, M. A. Bayesian phylogeography finds its roots. PLoS Comput. Biol. https://doi.org/10.1371/journal.pcbi.1000520 (2009).
Minin, V. N. & Suchard, M. A. Counting labeled transitions in continuous-time Markov models of evolution. J. Math. Biol. 56, 391–412 (2008).
Rambaut, A., Drummond, A. J., Xie, D., Baele, G. & Suchard, M. A. Posterior summarization in Bayesian phylogenetics using Tracer. Syst. Biol. 67, 901–904 (2018).
Dellicour, S. et al. A phylodynamic workflow to rapidly gain insights into the dispersal history and dynamics of SARS–CoV-2 lineages. Mol. Biol. Evol. https://doi.org/10.1093/molbev/msaa284 (2020).
Lemey, P., Rambaut, A., Welch, J. J. & Suchard, M. A. Phylogeography takes a relaxed random walk in continuous space and time. Mol. Biol. Evol. 27, 1877–1885 (2010).
Dellicour, S., Rose, R., Faria, N. R., Lemey, P. & Pybus, O. G. SERAPHIM: studying environmental rasters and phylogenetically informed movements. Bioinformatics 32, 3204–3206 (2016).
Dudas, G. & Rambaut, A. MERS-CoV recombination: implications about the reservoir and potential for adaptation. Virus Evol. 2, 1–11 (2016).

Acknowledgements

This research was commissioned by the National Institute of Health Research (NIHR) Global Health Research programme (16/136/33) using UK aid from the UK Government (funding to E.M. and N.R. through TIBA partnership) and additional funds from the Government of Rwanda through RBC/National Reference Laboratory in collaboration with the Belgian Development Agency (ENABEL) for additional genomic sequencing at the GIGA Research Institute-Liege/Belgium. The views expressed in this publication are those of the authors and not necessarily those of the NIHR, the Department of Health and Social Care, or the Rwandan Government. G.B. acknowledges support from the Internal Fondsen KU Leuven/Internal Funds KU Leuven (Grant No. C14/18/094) and the Research Foundation–Flanders (“Fonds voor Wetenschappelijk Onderzoek - Vlaanderen,” G0E1420N, G098321N). S.L.H. acknowledges support from the Research Foundation-Flanders (“Fonds voor Wetenschappelijk Onderzoek - Vlaanderen,” G0D5117N). S.D. is supported by the Fonds National de la Recherche Scientifique (FNRS, Belgium). V.H. was supported by the Biotechnology and Biological Sciences Research Council (BBSRC) [grant number BB/M010996/1]. A.O.T. is supported by the Wellcome Trust Hosts, Pathogens & Global Health Programme [grant number: grant.203783/Z/16/Z] and Fast Grants [award number: 2236]. A.R. acknowledges the support of the Wellcome Trust (Collaborators Award 206298/Z/17/Z – ARTIC network) and the European Research Council (grant agreement no. 725422 – ReservoirDOCS).

Author information

These authors contributed equally: Yvan Butera, Enatha Mukantwari, Maria Artesi, Jeanne D’Arc Umuringa, Vincent Bours, Andrew Rambaut, Sabin Nsanzimana, Guy Baele, Keith Durkin, Leon Mutesa, and Nadine Rujeni.

Authors and Affiliations

Center for Human Genetics, College of Medicine and Health Sciences, University of Rwanda, Kigali, Rwanda
Yvan Butera, Jacob Souopgui & Leon Mutesa
Rwanda National Joint Task Force COVID-19, Rwanda Biomedical Centre, Ministry of Health, Kigali, Rwanda
Yvan Butera, Marylin Milumbu Murindahabi, Misbah Gashegu, Jean Paul Rwabihama, Djordje Gikic, Robert Rutayisire, Swaibu Gatare, Tharcisse Mpunga, Daniel Ngamije, Sabin Nsanzimana, Leon Mutesa & Nadine Rujeni
Laboratory of Human Genetics, GIGA Research Institute, Liège, Belgium
Yvan Butera, Maria Artesi, Bouchra Boujemla, Vincent Bours & Keith Durkin
National Reference Laboratory, Rwanda Biomedical Center, Kigali, Rwanda
Enatha Mukantwari, Jeanne d’arc Umuringa, Onesphore Majyambere, Esperance Umumararungu, Alice Kabanda, Patrick Tuyisenge, Reuben Sindayiheba, Robert Rutayisire & Swaibu Gatare
Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, Scotland
Áine Niamh O’Toole, Verity Hill, Stefan Rooke & Andrew Rambaut
Department of Microbiology, Immunology and Transplantation, Rega Institute KU Leuven, Leuven, Belgium
Samuel Leandro Hong, Simon Dellicour & Guy Baele
Spatial Epidemiology Laboratory, Université Libre de Bruxelles, Brussels, Belgium
Simon Dellicour
Department of Clinical Microbiology, University Hospital of Liège, Liège, Belgium
Sebastien Bontems
University of Birmingham, Birmingham, England
Josh Quick & Nick Loman
University College London, London, England
Paola Cristina Resende
Laboratory of Respiratory Viruses and Measles, Oswaldo Cruz Institute, FIOCRUZ, Rio de Janeiro, Brazil
Paola Cristina Resende
School of Science, College of Science and Technology, University of Rwanda, Kigali, Rwanda
Marylin Milumbu Murindahabi
Department of Molecular Biology, Institute of Biology and Molecular Medicine, IBMM, Université Libre de Bruxelles, Gosselies, Belgium
Jacob Souopgui
African Institute for Mathematical Sciences, Kigali, Rwanda
Wilfred Ndifon
Department of Human Genetics, University Hospital of Liège, Liège, Belgium
Vincent Bours
School of Health Sciences, College of Medicine and Health Sciences, University of Rwanda, Kigali, Rwanda
Nadine Rujeni

Contributions

Y.B. and E.M.: Study design, data collection. M.A., Y.B., B.B., E.M., J.d.U.: RT-PCR, library preparation, whole genome sequencing. K.D., Y.B., S.R.: Whole genome sequencing, sequences cleaning and assembling, sequences fast files production. Y.B., E.M., S.B., O.M., J.d.U.: RNA extraction. P.T., R.S., M.G., R.R., E.U., S.D., A.K., O.M., R.M., S.G.: Sample selection, data collection. S.D., S.L.H., V.H., G.B.: Spatial and phylogeographic analysis. J.R., D.G., J.S., W.N., J.Q., M.M.M., A.K., P.C.R., N.L., J.P.R., S.N., T.M., D.N.: Provided technical guidance and review of the paper. G.B., A.O.T.: Phylogenetic analysis. V.B., J.S., W.N., G.B., A.R., S.N., K.D., M.A., L.M., N.R.: Provided scientific and technical guidance, review of the paper. V.B., N.R., G.B., K.D., L.M.: Secured funding and coordinated the study.

Corresponding authors

Correspondence to Guy Baele, Keith Durkin, Leon Mutesa or Nadine Rujeni.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Carla Mavian and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Butera, Y., Mukantwari, E., Artesi, M. et al. Genomic sequencing of SARS-CoV-2 in Rwanda reveals the importance of incoming travelers on lineage diversity. Nat Commun 12, 5705 (2021). https://doi.org/10.1038/s41467-021-25985-7

Received: 14 April 2021
Accepted: 10 September 2021
Published: 29 September 2021
DOI: https://doi.org/10.1038/s41467-021-25985-7
Springer Nature Limited
