Introduction

SARS-CoV-2 has been reported third amongst coronaviruses which caused severe acute respiratory disease followed by SARS-CoV (2003) and MERS-CoV (2012) all through the last two decades. The whole world has been affected by coronavirus disease 2019 (COVID-19) which is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), inflicting havoc on medical systems along with costing millions of lives compared to limited minor populations affected by SARS and MERS. The ~30kb genome of SARS-CoV-2 encodes sixteen non-structural proteins (NSP1-NSP16), four structural proteins comprising Nucleocapsid (N), Membrane (M), Envelope (E), and Spike (S) proteins and six other accessory proteins which are encoded by independent open reading frames (ORF- ORF3a, ORF6, ORF7a, ORF7b, ORF8 and ORF10) [1]. To gain access into a cell and initiate infection, the S-protein interacts with the human Angiotensin-converting enzyme 2 (ACE2) receptor [2]. ACE2 was initially identified in 2003 as the receptor for SARS-CoV [3]. Furin, a proteolytic enzyme, cleaves the SARS-CoV-2 spike protein at a unique cleavage site (SPRRAR↓S) at the interface between the S1 and S2 subunits to produce two subunits, S1 and S2 [4,5,6,7]. The S1 subunit comprises of an N-terminal domain (NTD) and the receptor binding domain (RBD) and is accountable for interaction with the host-cell ACE2 for entry. The S2 subunit comprises of the spike trimeric core and aids in membrane fusion. The structural proteins make up the mature virion while the non-structural proteins are crucial for replication and transcription of the viral genome. The entry of the viral particles encompasses its attachment to the host cell membrane and fusion which are facilitated by the S glycoprotein. The homotrimeric S protein is introduced in numerous copies into the cell membrane of the virion providing it a crown-like form. The monomeric form of S protein is a type I membrane protein and contains 66 N-linked glycans per protein trimer and thus it is ascribed to class I viral fusion proteins epitomized by the influenza virus haemagglutinin protein [4, 8, 9]. According to previous reports [1, 4, 10], S proteins of coronaviruses and haemagglutinin share a common structural organization and conformational transition that aids membrane fusion. Viral infectivity, pathogenesis and host range depend on the receptor recognition by coronaviruses. The SARS-CoV-2 spike protein’s RBD region uniquely selects ACE2 as its receptor making it a popular target for vaccines and antiviral drugs [11]. Pathogenic potential of the SARS-CoV-2 genome may alter due to adaptive variants which might escalate the difficulty in drug and vaccine developments. The current vaccines might pose pronounced challenges owing to viral variants [12]. Global efforts are required to track these variants.. Numerous research groups had isolated and sequenced the SARS-CoV-2 and the resultant sequences were deposited in public databases to expedite the tracking of virus evolution [12]. To tackle the SARS-CoV-2 pandemic and as well as possible future related outbreaks it is imperative to understand the structural details of RBD-ACE2 interactions. This review seeks to provide a detailed summary of the structural information.

The structural details of angiotensin-converting enzyme 2 (ACE2)

The 805 amino acid carboxypeptidase, Angiotensin-converting enzyme 2 (ACE2), shares 42% amino acid homology with angiotensin converting enzyme (ACE), a critical component of the renin-angiotensin system (RAS) [13]. The RAS is an endocrine system that regulates hydromineral balance and cardiovascular function. Through a series of proteolytic cleavage events, the effector peptide angiotensin-II (Ang-II), an active end product of RAS, is formed from angiotensinogen. Angiotensinogen is primarily synthesized in the liver and secreted into the circulation, where it is cleaved at the N-terminus by renin to produce the decapeptide angiotensin-I (Ang-I). The N-terminal residues of Ang-I: Phe-8, His-9 and Ile-12 formed 9 H-bonds with renin. The interactions were mapped with the COCOMAPS web server [14] using the crystal structure of human angiotensinogen complexed with renin (PDB ID: 2 X0B) [15] as a template and the H-bond interactions are presented in Table 1. The Ang-I is then processed by the endothelial ACE into an octapeptide Ang-II by the cleavage of the dipeptide, His-Leu, from the C-terminus of angiotensin-I (Fig. 1). The ACE-Ang-II complex is stabilized by 12 H-bonds. Table 1 shows the details of the H-bond interactions based on the crystal structure of the human angiotensin-converting enzyme in complex with angiotensin-II (PDB ID: 4APH) [16]. When angiotensin I (Ang I) is converted to angiotensin II (Ang II), it binds to angiotensin II type I (AT) and type II receptors in the kidney, adrenal cortex, arterioles, and the brain. Ang II stimulates the release of aldosterone in the adrenal cortex, resulting in sodium and water retention [13, 15,16,17]. ACE can cleave a variety of other substrates, including the vasodilator bradykinin and N-acetyl-Ser-Asp-Lys-Pro (Ac-SDKP), a physiological modulator of haematopoiesis.

While ACE2 (EC 3.4.17.23) is homologous to ACE (EC 3.4.15.1) and functions as an important regulator of the renin-angiotensin system (RAS), there are multiple structural and functional differences. The ACE2 gene is located on chromosome Xp22, in contrast to the ACE gene, which is located on chromosome 17. It consists of 18 exons and 20 introns, most of which are similar to the exons found in the ACE gene [13, 18].

Fig. 1
figure 1

The renin-angiotensin system: a the N-terminal of angiotensinogen is processed by renin (PDB ID: 2 X0B) to release b angiotensin-I (PDB ID: 1N9U)[19]. Conversion of angiotensin-I to angiotensin-II (PDB ID: 1N9V)[19] by c ACE (PDB ID: 4APH) by the cleavage of the C-terminal dipeptide, His-Leu.

Table 1 Details of H-bond interactions of Renin-Ang-I complex and ACE- Ang-II complex

The major structural difference in both zinc metallopeptidases is the active site regions due to which the two enzymes counterbalance rather than reinforce each other’s actions. In contrast to two active sites of ACE (N- and C-terminal domains, each containing a Zn binding motif, HEMGH), ACE2 possesses only a single catalytic domain (Fig. 2a). ACE cleaves C-terminal dipeptide residues from susceptible substrates (a peptidyl dipeptidase), ACE2 acts as a simple carboxypeptidase able to hydrolyze Ang I, forming Ang 1–9 and Ang II to Ang 1–7[18].

The type I membrane protein ACE2 is a homodimer with a claw-shaped extracellular head domain, a small transmembrane domain, and a short intracellular segment (Fig. 2b). The catalytic zinc-binding peptidase domain (PD; residues 19–615) and the smaller neck domain (residues 616–726) compose the head domain, which appears to be where most of the homodimer interactions occur. The PD contains two lobes encompassing the peptide substrate binding site between them. A long linker connects the neck domain to the single-helix transmembrane (TM) domain [20, 21]. The production of Ang 1–9 is processed by the PD which cleaves Ang-I, which is further processed by other enzymes to form Ang 1–7. However, ACE2 can also directly process Ang II to Ang-1-7. The Neck domain is the major contributor to the homodimerization of ACE2, while PD provides a minor interface. The homodimer is stabilized by a network of cation-π interactions and H-bonds. The cation-π interactions are made by Arg-652 and Arg-710 of one protomer (A) with Tyr-641 and Tyr-633 of another protomer (B). The list of H-bond interactions is presented in Table 2. However, no such interactions are observed in PD which depends on non-bonded contacts. Due to the weak interactions of the PDs, they are able to rotate themselves and separate by a distance of ~25 Å while keeping the Neck domain unchanged for the transition from close to open conformation [22].

Fig. 2
figure 2

The structural details of ACE2: a Positions of active sites of ACE and ACE2 and b Dimeric structure of ACE2 with its domains: PD, Neck and TMD. The zinc ions are shown in red spheres. The PD with two lobes with substrate binding cleft is shown in the inset

Table 2 The details of H-bond interactions between two protomers of ACE2 homodimer

The structural details of SARS-CoV-2 RBD

The receptor binding domain (RBD) of both SARS CoV and SARS-CoV-2 uses ACE2 as its receptor. Hence, RBD has become a key therapeutic target. The RBD is a part of the trimeric class I fusion protein Spike (S). To fuse the viral membrane with the host cell membrane, the S-protein undergoes a substantial structural rearrangement from pre-fusion to post-fusion conformations [23]. The 1273 amino acid long S-protein comprises two functional subunits: S1 (residues 14–685), responsible for anchoring to the host cell receptor and S2 (residues 686–1273), responsible for fusion of the viral and cellular membranes (Fig. 3a) [24]. The SARS-CoV-2 RBD (residues 319–541) is present in the S1 subunit, and it appears in two distinct conformations “down” and “up” due to hinge-like conformational movements of the S1 subunit (Fig. 3b). The “down” conformation represents the pre-fusion receptor inaccessible state while the “up” corresponds to the accessible state that is required for ACE2 binding and epitope availability for antibodies [23, 25]. The S1 subunit contains the N-terminal domain (NTD), RBD, sub-domain 1 (SD1) and sub-domain 2 (SD2). There is a high degree of structural homology between individual domains of SARS CoV and SARS-CoV-2 S1 subunit. However, the major difference lies in the RBDs in their respective down conformations: the SARS-CoV RBD in the down conformation packs tightly against the NTD of the protomer partner and is angled closer to the central cavity of the trimer [25]. The crystal structure of SARS-CoV-2 RBD-ACE2 complex (PDB ID: 6M0J) reveals that the RBD contains 7 β strands: β1(residues 354–358), β2 (residues 376–379), β3 (residues 394–403), β4 (residues 432–437), β5 (residues 452–454), β6 (residues 492–494) and β7 (residues 507–516). Out of these, 5 strands namely β1, β2, β3, β4 and β7 are anti-parallel with short connecting helices and loops that form the core [20]. Between the β4 and β7 strands in the core, there is an extended insertion containing the short β5 and β6 strands, α4 and α5 helices and loops. This extended insertion is the receptor-binding motif (RBM) (residues 438–506), which contains most of the contacting residues of SARS-CoV-2 that bind to ACE2.

Fig. 3
figure 3

Structural features of SARS-CoV-2 RBD: a structure of trimeric spike protein indicating positions of S1 & S2 subunit, RBD and NTD; b Two monomeric units of spike showing two conformations of RBD : up & down (inset shows the RBD with RBM highlighted in cyan)

Details about SARS-CoV-2 RBD-ACE2 interactions

To enter the target cells, the SARS-CoV-2 Spike protein (S) uses human ACE2 as an entry receptor. Hence, the interaction between the SARS-CoV-2 -S and ACE is critical for fusion thereby transmission and pathogenicity of the virus. The importance of this interaction can be inferred from two recent studies which showed that HeLa cells that expressed ACE2 are prone to SARS-CoV-2 infection as opposed to the non ACE2 expressing cells [26] and low nanomolar range (1.2 nM) of binding affinity between RBD and ACE2 [24].The RBM (residues 438–506) of the RBD makes the maximum contact with the N-terminal PD domain of ACE2. A total of 13 H-bonds and 1 salt bridge were observed in the complex (Table 3). The extended RBM contacts the bottom lobe of ACE2 (Fig. 4), where its concave outer surface accommodates the N-terminal helix of the PD [20].

Fig. 4
figure 4

The interactions of SARS-CoV-2 RBD with ACE2 (PDB ID: 6M0J): Interaction of RBD with the bottom lobe of ACE2 (few interactions are shown with distances in the inset)

Table 3 Details of H-bond & salt bridge interactions of RBD and ACE2 complex

Details about mutational effect on the RBD-ACE2 interactions

Several variants have been reported till date and the structural basis of the effect of the specific mutations on ACE2 binding will be discussed here. Three strains namely, the UK strain with an N501Y mutation in the spike, the South African strain with three concurrent mutations K417N, E484K and N501Y and finally the Brazilian strain with mutations K417T, E484K and N501Y spread rapidly since their first detection [27]. From a structural perspective, N501Y mutation is not expected to contribute significantly towards enhancing complex stabilization since both asparagine and tyrosine can form hydrogen bonds with Tyr-41 of ACE2. Tyrosine substitution could also form pi-pi stacking interaction with Tyr-41 [28]. Indeed, in silico binding energy analysis shows that wild-type spike and N501Y mutant have similar total binding energy values [27, 29]. N501Y might be beneficial for this strain in terms of differential interaction with ACE2 [28]. However, as per earlier in silico reports [27, 29] the African and Brazilian strains of the spike RBD with triple mutations have very strong binding energy interactions with ACE2 as compared to wild type (WT) spike. In the South African variant Asn-417, Lys-484 and Tyr-501 form hydrogen bonds with residues Glu-30, Glu-35 and Glu-75 of ACE2. Lys-484 forms an additional salt bridge with Glu-35 of ACE2. In the case of the Brazilian variant, Glu-35 and Lys-353 of ACE2 formed hydrogen bonds with Lys-484 and Tyr-501 respectively. Salt bridge interaction was observed between Lys-483 and Glu-35. In general, electrostatic interactions were more abundant in both variants as compared to the wild-type spike in complex with ACE2. Similar situation was observed for E484K mutant spike as well. E484K formed H-bonds and salt bridges with Glu-35 of ACE2 which enhanced binding affinity [27].

Two of the most frequently encountered mutations in the SARS-CoV-2 spike are S477N and N439K respectively. In the wild-type spike, Asn-439 forms intra-chain hydrogen bonds with the Spike Ser-443 and Pro-499, thus, not contributing towards ACE2 interaction. However, mutation to the Lys would result in electrostatic interaction with Glu-329 of ACE2 thereby leading to complex stabilization. A similar effect is expected for the S447N mutation which is located in a surface loop of the spike. In the wild-type situation Ser-447 does not engage with ACE2, however, the presence of asparagine would result in the formation of an added H-bond with Ser-19 of ACE2. Both mutations lead to the stabilization of interaction with ACE2 [28].

Spike Q493R is an escape mutation that was first reported in October 2021 [30]. This particular mutation allowed the virus to acquire resistance to Bamlanivimab and Etesevimab which is a monoclonal antibody cocktail therapy [30]. The variant was identified from a patient who contracted COVID and was administered antibody therapy [30]. Gln-493 is an interfacial residue that forms a hydrogen bond with Glu-35 of the ACE2 receptor [28]. Q493R mutation would allow the positively charged arginine side chain to be accommodated within a surface exposed pocket of ACE2 formed by Asp-30, His-34 and Glu-35 respectively [28]. This would result in markedly increased stability of binding. In fact, in silico molecular dynamics simulations demonstrate that the Q493R mutation allows the creation of two extra salt bridges with Glu-35 (more stable) and Asp-38 with ACE2, which are not seen in the wild-type spike.[31]. However, the simulation also indicated the loss of interaction with ACE2 Lys31 and formation of an unfavourable interaction with Lys-353. Q498R similarly enhance binding affinity towards ACE2. In this case, it formed an additional salt bridge with Asp-38. Interactions with Tyr-41 and Gln-42 of ACE2 were stronger compared to WT spike interaction [31].

Spike variants with mutations like N440K and G476S are high affinity ACE2 binders and are located in solvent accessible loop regions. Compared to the WT spike, both mutations lead to drastic conformational alterations wherein, it reorients the RBM loop towards the binding interface thereby allowing additional interactions [31, 32]. Deleterious mutations like G446S and Y505H were also high affinity binders to ACE2 but were found to be destabilizing for RBD [31, 33].

L452R is another notable mutation which interestingly exists in the Delta variant but is absent from the Omicron [34]. Presence of arginine at position 452 results in enhanced infectivity and fusogenic potential [35]. Presence of Leu-452 results in the formation of a hydrophobic patch on the surface of spike RBD due to hydrophobic intramolecular interactions with residues Phe-490 and Leu-492. This patch is non-existent when Leu-452 is replaced by an arginine which affects ACE2 binding resulting in the decreased fusogenic potential of Omicron as compared to the Delta variant [32, 36]. Effects of the RBD mutations on ACE2 binding categorized based on their role are shown in Table 4.

Table 4 Effect of RBD mutations on ACE2 binding

Summary

The rapid outbreak of COVID-19 in Wuhan, China, in December 2019 stunned the world in early 2020. In early March 2020, the situation was designated a pandemic by the World Health Organization (WHO) [37]. Scientists continue to work actively to find new information about the virus that could help combat not only SARS-CoV-2 and its variants, but also future pandemics of this type. Although considerable progress has been made in developing therapeutics against SARS-CoV-2, there are still several obstacles to overcome. One of these problems is that the molecular basis of the interactions of SARS-CoV-2 with the host needs to be further explored. Thus, although a considerable number of host proteins have been reported to interact with SARS-CoV-2 proteins, knowledge of the complex structures of SARS-CoV-2 viral proteins with host proteins is still insufficient [38]. The (S)-glycoprotein of SARS-CoV-2 is the major determinant of host tropism and susceptibility and the main target of antibody responses and the enhanced binding of RBD to the human receptor is one of the critical factors for the transmissibility of the virus. As a result, the appearance of adaptive mutations in the spike protein can have a significant impact on host tropism and viral transmission [39]. Apart from the details of the residues in terms of binding mode, information on how their interaction energies change upon mutation and how these interaction differences affect the fusogenic potential of the virus is critical for the development of future therapeutics.