On March 11, 2020, the World Health Organization (WHO) declared COVID-19 a pandemic. Since then, unprecedented genomic surveillance efforts have been made to track the diversity of circulating SARS-CoV-2 lineages and, in particular, to monitor the spread of variants of concern (VOCs) [1, 2]. In November 2021, the Omicron VOC was identified in South Africa, and it rapidly spread all over the world [2]. Since then, the Omicron VOC evolved into several sublineages, including BA.1, BA.2, BA.3, BA.4, and BA.5 [3]. Like some other VOCs, SARS-CoV-2 BA.1 and BA.2 share amino acid substitutions in the spike protein (G339D, S477N, T478K, and N501Y) that enhance binding to the human receptor angiotensin-converting enzyme 2 (ACE2) [4]. Some receptor-binding domain mutations (G496S, A67V, T95I, Del 69-70, Del 143-145, and the insertion EPE between amino acids 214 and 215) that are specific to the SARS-CoV-2 BA.1 lineage have been predicted to substantially reduce the protection provided by SARS-CoV-2 vaccines [5]. Humoral immunity induced by vaccines, to a certain extent, fails to protect against BA.1 and BA.2 [6]. Moreover, the reproduction number of BA.1 is 1.4-fold lower than that of BA.2, which is characterized by higher viral loads in human nasal epithelial cells [6].

On November 30, 2021, SARS-CoV-2 BA.1 was identified in Brazil by our DASA network (https://dasa.com.br/en/genov/). We began screening for BA.1 by targeted sequencing of S-gene target failures (SGTFs) when the presence of the spike protein del69/70 signature was found in our routine use of the COVID-19 TaqPath assay from Thermo Fisher [7]. By January 10, 2022, the Omicron BA.1 lineage represented 99% of all circulating SARS-CoV-2 variants in Brazil (https://portal.fiocruz.br/observatorio-covid-19). However, to investigate the presence of the BA.2 variant, which does not possess the del69/70 mutation, we conducted targeted sequencing of samples without the SGTF signature.

From January 17 to 25, 2022, we identified 47 samples that were S-gene target positive (SGTP). SGTP samples were subjected to next-generation sequencing (NGS) using the Illumina COVIDSeq Test on a NovaSeq 6000 instrument (Illumina, CA, USA) (local ethical approval, CAAE 45540421.0.0000.5455).

One of these samples, collected on January 19, 2022, from a 34-year-old male resident of the city of Volta Redonda, Rio de Janeiro state, was identified as lineage BA.1 using the DragenTM Covid Lineage App, with S:Del 69-70. Intriguingly, TaqPath-based diagnostic PCR testing did not show the SGTF signature (Ct values: N, 19.64; S, 23.13; ORF1ab, 19.35), so we decided to investigate this sample in more detail.

A maximum-likelihood phylogenetic tree including this and other Brazilian sequences sampled in the same period was built using IQTREE2 [8]. Our background reference dataset also included complete genome sequences of Omicron BA.1, BA.2, and BA.3, Delta, and the recombinant XD as references. As expected for recombinants or consensus sequences resulting from coinfection, the sample clustered with a long branch and low bootstrap support within BA.1 clade (Fig. 1A). This sequence, deposited in the GISAID database under ID no. EPI_ISL_11271349, had 99.59% non-N bases with a mean coverage of 2,833. However, NextClade pointed to several individual mutations that differed between the query sequence and the nearest neighbor sequence that are typically associated with the BA.2 clade, although the PANGO designation assigned it as lineage BA.1. We then analyzed the mutation profile of the sequencing reads in detail and observed the presence of BA.1 characteristic mutations in the spike protein (A67V, T95I, Y145D, G496S, T547K, N856K, L981F, del 69-70, and del 142-144), as well as BA.2-specific amino acid mutations (T19I, V213G, S371F, T376A, D405N, and R408S) in different proportions, strongly suggesting that the sequence obtained was not due to a BA.1/BA.2 recombinant but instead to a coinfection with both of these lineages (Table 1).

Fig. 1
figure 1

(A) Maximum-likelihood phylogenetic tree including the study sequence and other Brazilian sequences sampled in the same period, constructed using IQTREE2. (B) Assembly of the reads of the study sequence showing lower coverage in the regions 21765-21770del and 21633-21641del, where SARS-CoV-2 BA.1 and BA.2, respectively, were expected to have deletions.

Table 1 Percentage of bases in key positions of the spike gene that can be used to discriminate between SARS-CoV-2 BA.1 and SARS CoV-2 BA.2

Figure 1B illustrates the mix present in the regions 21765-21770 and 21633-21641, where SARS-CoV-2 BA.1 and BA.2, respectively, are expected to have deletions. To confirm this observation, we performed an RT-PCR using specific probes to detect del69-70 and wild type, and both variants were amplified. Like most of the samples sent to us for routine SARS-CoV-2 surveillance, this sample did not have associated clinical data such as vaccination history.

Coinfections with SARS-CoV-2 sublineages have been reported previously [9,10,11,12]. However, to our knowledge, this is the first report of coinfection with sublineages BA.1 and BA.2. SARS-CoV-2 coinfection should be closely monitored, since it is a sine qua non precondition for the emergence of recombinant viruses. Unfortunately, SARS-CoV-2 genomic surveillance is suboptimal in many countries [13], and rare events like these are likely to go unnoticed. Moreover, standard bioinformatics protocols using the DragenTM Covid Lineage App and the absence of detailed genetic analysis of NGS reads often obscure the detection of putative recombinants or mixed infections. Therefore, we conclude that the number of coinfections with different SARS-CoV-2 sublineages most likely remains underestimated.