Molecular epidemiology of hepatitis B virus infection in Norway
Hepatitis B virus (HBV) infection remains a serious global health challenge. The widespread distribution of HBV is highlighted by multiple HBV genotypes associated with different geographical origin and transmission patterns, as well as, clinical outcomes. Investigating population HBV genotype composition and origin is therefore highly warranted.
In this molecular epidemiological study we analysed 1157 HBV S-gene sequences collected from patients in Norway, primarily in the period 2004–2011, and linked them to epidemiological data from the Norwegian surveillance system for communicable diseases.
Of the patients with reported country of infection (n = 909), 10% (n = 93) were infected in Norway, but the majority (n = 816; 90%) stated that they became infected outside of Norway. Of the patients infected outside of Norway, most became infected in Southeast and East Asia (n = 465; 51%) and Central, West, and North Africa (n = 254; 28%). The distribution of HBV genotypes in Norway is dominated by genotype D (32%) followed by genotype A (22%), B and C (18 and 18%, respectively), and E (7%). Genotype B, C and E were phylogenetically categorized by a majority of sequences originating from distinct geographical regions, either Asia or Africa, whereas genotype A and D originated from multiple geographic regions. However, within genotype A and D, our molecular epidemiology analysis indicated a geographical clustering of sequences depending on their geographical origin.
The majority of HBV patients in Norway became infected outside of Norway and were represented by most common genotypes. Patients stated to have been infected in Norway were found primarily within genotype A and D, and were phylogenetically characterized by both small local clusters and interspersed sequences that clustered with non-Norwegian sequences, indicating that a proportion of the patients assumed to have been infected in Norway likely became infected outside of Norway although assumed the contrary.
KeywordsHepatitis B virus Molecular epidemiology Phylogenetics Genotyping
General time reversible model
Hepatitis B surface antigen
Hepatitis B virus
Intravenous drug use
Markov chain Monte Carlo
Norwegian surveillance system for communicable diseases
Men who have sex with men
Mother to child transmission
People who inject drugs
Regional committee for medical and health research ethics
Viral hepatitis and hepatitis B virus (HBV) infection in particular is one of the world’s most significant public health problems . Globally, ca. 260 million people are estimated to be chronically infected with HBV and annually there are close to 900,000 HBV related deaths, mainly due to cirrhosis or hepatocellular carcinoma [2, 3, 4, 5, 6]. Since the introduction of an effective vaccine in 1982, the global immunisation coverage of infants has gradually increased to 87% in 2016 and hence the number of new chronic infections has dramatically decreased among immunised children [5, 7].
However, the global prevalence of HBV, indicated by the proportion of chronic HBV carriers in the population that is seropositive for the hepatitis B surface antigen (HBsAg), varies strongly between different geographical regions. Broadly viewed, high endemic regions (≥ 8%) include the majority of Africa, parts of Asia, and parts of the Middle East. Intermediate endemic regions (2–8%) include eastern- and southern Europe, Russia, parts of Central- and South America, and parts Africa and eastern and southeast Asia. Low endemic regions (< 2%) include Australia and New Zealand, Northern- and Western Europe, North America and the majority of Central and South America [3, 4]. As such, Norway is classified as a low endemic country for HBV infection . Both acute and chronic infections have been notifiable to the Norwegian surveillance system for communicable diseases (MSIS) since 1975 and 1992, respectively. Similar to many other low endemic regions, acute HBV infections in Norway are most commonly acquired through either intravenous drug use (> 60%) or sexual contact . However, the majority of people with chronic HBV infection in Norway are people from high- and intermediate endemic regions who became infected before arriving to Norway . In Norway, all migrants are offered HBV screening within 3 months on arrival. However, the test is voluntary, and no data are available for the proportion of migrants who choose not to be tested. Vaccination in Norway has until recently only been recommended for defined risk groups, such as people who inject drugs (PWID), men who have sex with men (MSM), immigrants and contacts of known carriers . However, from 2017, universal HBV vaccination was implemented in the child immunisation program in Norway.
The HBV is an enveloped virus in the family Hepadnaviridae with a circular and partially double-stranded DNA genome of approximately 3.2 kbp in length. The genome consists of four partially overlapping open reading frames (S, C, P and X) encoding seven proteins (preS1, preS2 and S surface antigen; precore and core protein; polymerase protein; X protein) . Based on a sequence divergence of > 7.5–8% when comparing complete genomes, HBV is currently divided into at least eight phylogenetically distinct groups, referred to as genotypes A–H with a further two tentatively proposed, genotypes I and J [9, 10, 11, 12]. Furthermore, some genotypes have been divided into sub-genotypes with > 4% sequence diversity based on comparison of whole genome sequences. These genotypes, or even sub-genotypes, have distinct or overlapping geographical distribution [9, 13, 14]. In general, genotype A is common in sub-Sahara and Western Africa, as well as Northern Europe. In Asia genotypes B and C are highly prevalent. Genotype D is distributed worldwide, but sub-genotypes show geographic patterns. The various genotypes have in several studies shown to be associated with differences in pathogenicity, transmission modes, disease progression and response to treatment [12, 14, 15, 16, 17, 18, 19, 20]. For example, genotype C is more strongly associated with development to advanced liver disease compared to all other genotypes [15, 17, 21]. HBV genotyping may therefore serve as an important clinical and epidemiological marker . HBV genotyping has consequently been a part of patient management in Norway since 2004.
This is the first molecular epidemiological study of HBV in Norway where we analyse genotype distribution related to transmission routes and geographical origin of infection among patients. The aim of the current study was to better understand the molecular epidemiology of HBV in Norway by linking HBV sequences with epidemiological data from the Norwegian surveillance system for communicable diseases (MSIS). Importantly, this will help understand the origin and distribution of HBV genotype in patients in Norway.
Samples submitted to the national reference laboratory for hepatitis at the Norwegian Institute of Public Health in the period 1979–2011 (n = 1160) for HBV genotype analysis were included in the study. Further, epidemiological data from MSIS on gender (n = 998), probable country of infection (n = 909) and route of transmission (n = 277) was linked to HBV-genotype and sequence data. The routes of transmission used were (i) mother to child transmission (MTCT), (ii) intravenous drug use (IDU) and (iii) sexual contact. The data include both acute and chronic infections, but the majority of cases are chronic infections as these samples were analysed for HBV genotype as part of their patient management. The study was approved by the Regional committee for medical and health research ethics (REC) South East.
HBV DNA was extracted from 200 μl plasma or serums samples by different methods used at the reference laboratory over the years, including the QIAamp DNA mini kit (QIAgen), Affigene extraction kit (Cepheid) and Abbot sp2000 (Abbott). The elution volume was 35–100 μl depending on the extraction method used above according to these manufactures’ protocol. HBV DNA was amplified from 5 μl extract using HBV-specific primers covering the S-gene region 5′– GACCCCTGCTCGTGTTA –3` (forwards) and 5′– TGAATACTTTCCAATCAATAGG – 3′ (reverse) using AB-gene PCR-buffer with SYBR green (0.33x), Pt-Taq, 3 mM Mg, 0.5 μM primer, 0.2 mM dNTP and an annealing temperature of 65 °C. The PCR products (808 bp) were sequenced using Big Dye Terminator v1.1 on the ABI Prism 3100 instrument (Applied Biosystems). All sequences produced in the present study have been deposited in NCBI GenBank (Accession numbers MK173066–MK174222). The viral samples were genotyped by submitting sequence of the S-gene to the HBV genotyping databases at NCBI (https://www.ncbi.nlm.nih.gov/projects/genotyping/formpage.cgi) and/or at the Max-Planck-Institute for informatics (https://hbv.geno2pheno.org/). The genotyping results were also confirmed in subsequent phylogenetic analyses.
Sequence alignment and model-testing
An HBV S-gene multiple sequence alignment was constructed with MAFFT v.7  using the G-INS-i standard settings and was visualized and edited in AliView . The alignment consisted of a total of 1512 S-gene HBV sequences, that included 1157 sequences sampled in Norway, in the period 1979–2011, and 355 reference sequences, sampled between 1980 and 2013, retrieved from NCBI (Additional file 1: Table S1). The NCBI sequences were chosen based on available information on year of isolation, genotype, geographical origin, as well as genetic similarity to the Norwegian sequences determined by the NCBI BLAST. The alignment was trimmed to a size of 740 bp. To select the most suitable evolutionary nucleotide substitution model, model-testing was conducted using jModelTest 2 . In all subsequent analyses, a general time reversible model (GTR) of nucleotide substitution, with a proportion of invariant sites (I) and gamma distribution of rates across sites with four rate categories (G4) was used.
Phylogenetic interference and molecular epidemiology
To place the Norwegian S-gene HBV sequences in context with those of other studies, a phylogeny was inferred based on the alignment consisting of 1157 sequences collected in Norway and 355 reference sequences from NCBI using MrBayes v.3.2.2 . This was done by executing two parallel runs with four Metropolis-coupled chains for 25 million Markov chain Monte Carlo (MCMC) generations, using GTR + I + G4, sampling every 1000 generations and run with default dirichlet priors, discarding the first 25% as burn-in and then summarized as a consensus tree. The phylogenetic tree was viewed and edited in FigTree v.1.4.2 (http://tree.bio.ed.ac.uk/software/figtree/).
Information on likely transmission route was only available for 24% (n = 277) of the cases with the majority being transmitted through mother to child transmission 36% (n = 100), sex 33% (n = 91) and intravenous drug use 21% (n = 58). Of the cases claimed to be infected in Norway (n = 91) information on transmission was available for 76 cases, the majority being transmitted sexually, 33% (n = 30), or by IDU, 34% (n = 31).
The phylogeny of the Norwegian HBV S-gene sequences was inferred using a Bayesian approach that included 1157 sequences collected in Norway and 355 reference sequences from NCBI GenBank. The result of the Bayesian analysis, conducted in MrBayes, was summarized as a consensus tree and is visualised in Fig. 3. There was ≥0.95 posterior probability support for the monophyletic clustering of genotypes A–F and H. However, the phylogenetic relationships between genotypes were largely unresolved. The phylogeny also showed structure with regards to geographical origin. Genotypes B, C and E were categorized by a majority of sequences originating from a single geographic region (Asia, Asia and Africa, respectively), whereas genotypes A and D included sequences from multiple geographic regions. However, within genotypes A and D, the phylogeny indicated a clustering of sequences depending on their geographical origin. That is, African, Asian and European sequences were more likely to be found in separate clusters rather than a mixture thereof. The sequences from patients stated to have become infected in Norway were found primarily within genotype A and D and were characterized by both small clusters and interspersed sequences.
In our data we found that genotype D (61%) and genotype A (28%) were the dominant genotypes among persons infected in Europe (16% in total), including Norway. When exclusively looking into the cases infected in Norway, genotype D (47%) dominated whereas genotype A was found in 33% of the cases. Previous studies from Europe have similarly reported that genotype A and D dominate in Europe, but that genotype A is frequent in Northern Europe whereas genotype D is frequent in Southern Europe [26, 27]. The latter is not supported by our study, as we observe a higher prevalence of genotype D rather than genotype A among persons assumed to be infected in Norway.
The majority of the sequences from people stated to have become infected in Norway were found in clusters together or in clusters with sequences of European origin. However, some individual sequences were found to cluster with sequences from other non-European regions, indicating that these patients may have acquired their infection outside Europe or in Norway via a person previously infected outside Europe. We believe that this is partly due to the challenge of accurate reporting of origin of infection to MSIS due to the high number of cases among migrants that may not know their infections status before symptoms in adulthood and the difficulty to get accurate and verifiable information from these cases. It is important to note that we have not been able to distinguish between native Norwegians and migrants in this study. As such, the relatively high prevalence of genotype D in Norway may therefore reflect the origin of the migrant population from regions where genotype D is more prevalent. Furthermore, the transmission route is not known in the majority (83%) of cases according to MSIS . Therefore, likely route of transmission could not be presented in detail.
The interpretation of the molecular epidemiology and the transmission dynamics of HBV in our study was limited by sequence length and epidemiological data. Although the phylogenetic tree accurately reconstructs the various genotypes into distinct clades, complete genome sequence information would be needed to make detailed epidemiological inferences as well as to identify sub-genotypes and distinct clusters . Sub-genotyping, that may have given a better understanding of the different genotype pattern observed in Norway compared to other Northern European countries in particular, was therefore not performed as part of the study. Norway was reported as the second most frequent country of infection (Fig. 2), representing 8% of the total study population, but this is not supported by the phylogenetic analysis in several cases. Lack and uncertainty of the epidemiological data make the interpretation of the transmission dynamics difficult. Further, the ethical approval for combing epidemiological data from MSIS with HBV sequence data was limited to samples collected and analysed until 2011. Analysed sequence beyond this period was therefore excluded. Regardless of these limitations, this is the first molecular epidemiological study on HBV in Norway with more than 1100 sequences collected from all over Norway as genotyping is only performed at the national reference laboratory for hepatitis in Norway at the Norwegian Institute of Public Health as part of patient management. Given that the genotype distribution and that the immigrant population has been relatively stable, the data presented herein for the years 1979–2011 may inform current public health and treatment strategies. This is important, as highlighted previously, as the diversity of genotypes considerably differ with respect to geographical distribution, transmission routes, disease progression, responses to antiviral therapy and clinical outcomes [14, 28, 29, 30]. HBV genotyping is therefore relevant in diagnostics both from a clinical and epidemiological perspective, and to better understand the source of HBV in Norway and other countries.
In this study we observed a great mix of genotypes among patients in Norway. HBV infections in Norway are mainly driven by an influx of people with chronic infection who acquired their HBV infection prior to arrival in Norway. The overall genotype distributions in Norway therefore mirror that of the origin of the migrant population to a large extent. Given a population with a great mixture of genotypes, genotyping is important to identify chronic HBV infected at higher risk of liver disease progression enabling optimisation of management and antiviral therapy for these patients.
The authors wish to acknowledge Edward C. Holmes, Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Centre, School of Life and Environmental Sciences and Sydney Medical School, the University of Sydney, Sydney, New South Wales 2006, Australia, for language editing and Hans Blystad, Department of Tuberculosis, Blood Borne and Sexually Transmitted Infections, Norwegian Institute of Public Health, Oslo, Norway, for valuable epidemiological contribution to the manuscript.
JHOP is supported by the Swedish research council FORMAS (grant 2015–710). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Availability of data and materials
All relevant data have been made available in the article and the additional file. All sequence data used has been made available in public databases (NCBI GenBank accession numbers MK173066–MK174222).
Designed the study: JHOP and KSJ. Performed lab-work: HE, KIEB and KSJ. Performed epidemiological and bioinformatics work: JHOP, SM and KSJ. Wrote the manuscript: JHOP, SM and KSJ. All authors read and approved the manuscript.
Ethics approval and consent to participate
The study was approved by the Regional committee for medical and health research ethics (REC) South East, Norway (Approval number: 2009/849) with a dispensation from consent from participants as (a) it is difficult to get consent from this patients group (migrants, drug users), (b) the data was already available in the notification system and the lab analysis was performed prior to the start of the study, (c) some samples are old why it will be very hard to trace the patients, (d) any patient contact was made between the patient and the responsible medical doctor who sent inn samples for genotyping analysis before the project started. Personal identification was only needed to link origin and route of infection to the sequence. Linkage between patient data was only made temporarily whereafter all data, and subsequently all publicly available data, was anonymised.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 4.World Health Organization, World Health Organization, Global Hepatitis Programme. Global hepatitis report, 2017. 2017. https://www.who.int/hepatitis/publications/global-hepatitis-report2017/en/. Accessed 9 Jan 2019.
- 12.Tong S, Revill P. Overview of hepatitis B viral replication and genetic variability. J Hepatol 2016;64 1 Suppl:S4–16.Google Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.