Introduction

The SARS-CoV-2 is a single, positive-strand RNA virus that causes severe respiratory syndrome in humans [1]. The Coronavirus disease 2019 (COVID-19) has emerged as a severe pandemic, claiming more than 0.8 million lives worldwide between December 2019 and August 2020 [2, 3]. Compared to SARS-CoV, SARS-CoV-2 human-to-human infection is more readily transmitted and spread to almost all continents leading to the WHO’s declaration of a Public Health Emergency of International Concern (PHEIC) on January 30, 2020 [3,4,5]. Generally, coronaviruses can cause respiratory, gastrointestinal, and central nervous system diseases in humans and animals, threatening humans life and causing economic loss [6, 7]. These viruses also have the capacity to adapt to a new environment through mutations and are programmed to modify host tropism; thus, the threats are constant and long-term [6, 8, 9].

Including SARS-CoV-2, other coronaviruses cross the species barrier into humans, which lead to outbreaks of severe and fatal respiratory diseases. The SARS-CoV was first identified in bats, and spread to other animals in different geographic regions. The SARS-CoV outbreak was first emerged in humans in 2003, through transmissions from animals in open-air markets in China [10, 11]. Thereafter, a higher number of genetically related viruses were also identified in Chinese Horseshoes bats (Rhinolophus sinicus) [11,12,13]. Coronaviruses belong to the family Coronaviridae and are divided into alpha (α-CoV), beta (β-CoV), gamma (γ-CoV), and delta (δ-CoV) coronaviruses. The alpha and betacoronaviruses can infect mammals, and the viruses found in humans are genetically similar to β-CoV genus. The β-CoVs are further divided into different lineages (A, B, C, and D lineages): SARS-CoV and SARS-CoV-2 are grouped in lineage B, which has approximately 200 published virus sequences, whereas MERS-CoV belongs to lineage C, which has ~ 500 viral sequences [11].

The HCoV-229E and HCoV-NL63 belong to the Alphacoronavirus family, whereas HCoV-OC43, HCoV-HKU1 and SARS-CoV are Betacoronaviruses [14,15,16]. The phylogenetic analysis shows that SARS-CoV-2 protein is firmly rooted in the β-genus lineage of bat coronaviruses [14]. The whole genome of SARS-CoV-2 shares 80% identity with that of SARS-CoV and is 96% identical to the bat coronavirus BatCoV-RaTG13 [17]. The spike protein sequence similarity between SARS-CoV-2 and SARS-CoV is around 76–78%. The RBD alone shares a similarity of 73–76%, and RBM shares 50–53%. In contrast, the human MERS-CoV is related to Tylonycteris bat coronavirus HKU4, shares less sequence similarity (~ 54%) and recognizes DPP4 as their receptor. The sequence similarity between SARS-CoV-2 and SARS-CoV spike proteins explains the possibility of binding to the same receptor angiotensin converting enzyme 2 (ACE2) in the host cell [14].

Coronavirus is one of the largest genomes among all RNA viruses ranging from 27 to 32 kb. Receptor-mediated endocytosis is the main process of virus entry to the host cells. SARS-CoV-2 uses ACE2, a cell-surface receptor that is present in the kidney, blood vessels, heart, and importantly, in the lung AT2 alveolar respiratory tract epithelial cells for viral infection [18]. The spike protein, which is responsible for the viral entry, has N-terminal and C-terminal domains, and two major subunits S1 and S2 are present in almost all coronaviruses [6]. One of these S1 or S2 subunits binds with the host receptors and acts as a receptor-binding domain (RBD) (Fig. 1a).

Fig. 1
figure 1

a Domain arrangement of SARS-CoV-2 spike protein. SS signal sequence, NTD N-terminal transactivation domain, RBD receptor binding domain, SD subdomain, FP fusion peptides, HR1 heptad repeat 1, HR2 heptad repeat 2, CD connector domain, S1/S2 and S2′ protease cleavage sites, TM transmembrane domain, CT cytoplasmic tail. b Host cell entry and replication of SARS-CoV-2. SARS-CoV-2 infection starts with the binding of spike protein with ACE2 receptor and the invasion process is triggered by host cell proteases (furin, trypsin, TMPRSS2 and cathepsin). SARS-CoV-2 releases RNA into the host cell, and the RNA is translated into viral replicase polyproteins pp1a and pp1ab, and subsequently cleaves into NSPs. The full-length negative strand RNA copies of the viral genome are produced by the enzyme replicase using the full-length positive-strand RNA genome as a template. During transcription, RNA polymerase produces a series of subgenomic mRNAs and translates into viral proteins [S (Spike), E (Envelope), N (Nucleocapsid), and M (Membrane)]. The viral proteins and the genome RNA are assembled into virions in Golgi and ER (endoplasmic reticulum), which are budding into ERGIC (ER–Golgi intermediate compartment) and released out of the cell via vesicles

Next-generation sequencing technology (NGS) revolutionized the biological sciences, including virus discovery. The NGS made it easy to recognize thousands of novel virus sequences from wild animal and human populations around the world. Despite these vast coronavirus sequences are published, very little work has been performed for further studies. Further, the lack of tools to test these novel viruses and their ability to infect humans hindered the efforts to predict the subsequent zoonotic viral outbreaks [11]. Understanding the virology of coronaviruses and the methods to control their spread is currently a necessary task to maintain the global health and economic stability.

In this review, we discuss the history of coronaviruses in both humans and animals, their transmissions, mechanism of host cell entry and the structural studies, explaining active and inactive receptor binding of spike protein and the key residues playing an important role in the receptor binding. The drug repurposing and the therapeutic targets for SARS-CoV-2 are also discussed.

History known so far

Until the first identification of human coronaviruses 229E and OC43, in the late 1960s, coronavirus infections were witnessed as harmless for humans [14, 15]. The outbreak of SARS-CoV in southern China in the winter of 2002, took a fatality rate of 10% of the infected patients [18,19,20]. The virus had been rapidly spreading throughout the world, especially in Asia, and controlled after July 2003 [21]. Viral analysis of the outbreak of SARS showed that bats are natural reservoirs for SARS-CoVs, and civet cats and raccoon dogs are the intermediate hosts. In the year 2012, a novel highly pathogenic Middle East Respiratory Syndrome coronavirus (MERS-CoV) was identified in humans, demonstrating that the coronaviruses are transmitted from animals to humans at any time and with unexpected consequences for the public health [22]. MERS-CoV, the slow-spreading virus, has affected ~ 1700 people with a fatality rate of ~ 36% [6, 22]. The animal sources of SARS-CoV-2 infections are bats and SARS-CoV-2 can be transmitted to cats, pangolins, and dogs [23].

Other than humans, the coronaviruses have a big impact on the animal kingdom. Animal coronaviruses can cause severe threat to their hosts; since 1984, an unrecognized infection massively spread among the swine population in Europe, and later it was identified as porcine epidemic diarrhea coronavirus (PEDV), which is derived from the porcine enteric coronavirus TGEV [24]. In 2013, the same PEDV in less than a year had caused a 100% fatality rate in piglets and rubbing out more than 10% of America’s pig population [5, 25, 26]

Therefore, the total number of human coronaviruses identified has been increasing throughout the years. HCoV-229E and HCoV-OC43 are the first two discovered human coronaviruses in the 1960s, HCoV-NL63 and HCoV-HKU1 human coronaviruses identified after the SARS-CoV outbreak in 2003. The sixth coronavirus MERS-CoV was identified in Saudi Arabia in 2012 [22]. However, the story continues with the new identification of SARS-CoV-2 in December 2019 at the seafood wholesale market in Wuhan, China. SARS-CoV-2 is the seventh member of the family of coronaviruses that infects humans and it is different from both MERS-CoV and SARS-CoV.

Although some infections caused by human coronaviruses are mild and associated with common colds, certain animal and human coronaviruses can make a severe impact on the human population. Especially in young children, elderly people, and immune-deficient patients, the infections can be lethal [9, 27,28,29]. Therefore, it is important to understand the mechanism of invading viruses to theirs hosts, transmission and prevention of these processes.

Transmission and epidemiology

The respiratory droplets are the main routes of transmissions; SARS-CoV-2 can be transmitted to a healthy person if he happens to have contact with the infected person or any of his belongings, including clothes, doorknobs, etc. Studies have been reported that aerosol transmission (Airborne transmission) is also possible for SARS-CoV-2, but there is no clear study on neonatal infections (mother to child) [30,31,32,33,34,35,36]. However, the transmission can be avoided by keeping a distance of 2 m between two people, wearing masks while going out, and the isolation of infected people.

During the initial phase of the COVID-19 outbreak, a dataset was obtained from 1099 patients with laboratory-confirmed COVID-19 from 552 hospitals in 30 provinces of China on January 29, 2020. Only 2% of the patients had a history of contact with animals; more than three quarters have either visited the Wuhan city or are residents. Hence, the outbreak patterns or the source of infection could not be predicted from their study. The incubation period of the SARS-CoV-2 was from 1 to 12 days; however, the median incubation period was 4 days [32]. The most common symptoms are fever (43% on admission, and 88.7% during hospitalization), cough (67.8%), diarrhea (3.8%), and fatigue [32, 37]. The SARS-CoV-2 was detected in saliva, blood, sputum, and urine before the development of viral pneumonia, and some patients do not develop pneumonia at all. Asymptomatic persons are potential sources of SARS-CoV-2 infection, which control the transmission dynamics of the current outbreak [32, 38].

The SARS-CoV-2 first identified in Wuhan, China has spread all over the globe. As of August 20, 2020, more than 22 million confirmed infection cases and 0.8 million deaths had been reported across the world, including almost all the countries. The rate of infection or the average number of people getting infected by an individual (R0) was 2.75 in the case of SARS pandemic in 2003. The R0 value of Ebola 2014 was in the range of 1.51–2.53, and H1N1 influenza 2009 was from 1.46 to 1.48, and for MERS, it was around 1. The SARS-CoV-2 R0 value was estimated to be in the range of 1.5–3.5. The comparison of R0 values of various coronaviruses shows that the difference is minimal. However, the difficulties arising for SARS-CoV-2 infection are due to the following: (1) basic properties of the viral infection and the infection periods are uncertain, (2) most of the infected individuals do not show symptoms, but are capable of spreading the infection, (3) changing susceptibility of the population in affecting the spread of infection remains unanswered. In addition, there are no control measures for this spread [39].

Spike protein structure and mechanism of SARS-CoV-2 entry

Coronaviruses consist of four structural proteins: the nucleocapsid protein (N) forms the helical capsid to accommodate its genome. The whole structure is further surrounded by a lipid envelope, which is made of S (spike), E (envelope) and M (membrane) proteins. The membrane and the envelope proteins are needed for the virus assembly and the spike protein is for virus entry and host cell recognition [6]. The spike protein forms large protrusions (Peplomers) on the virus surface (looks like the virus has crowns), and hence it is named as “CORONA” (corona is a Latin word which means crown). It comprises three segments: (1) large ectodomain, (2) transmembrane domain and (3) intracellular tail. The receptor-binding subunits S1 and S2 are placed in the ectodomain region. During the infection, the S1 binds with the host receptor, and S2 fuses the host and viral membranes, thereby releasing the viral genome into the cell (Fig. 1). The spike protein is a clove-shaped trimer with three S1 heads and a trimeric S2 stalk [6].

During viral infection, spike protein (~ 1300 amino acid residues) is cleaved by host proteases into receptor binding subunit S1 and membrane fusion subunit S2. During cell entry, the S1 subunit binds directly to the sugar receptors [40] and ACE2 of the host cell surface, and the S2 subunit undergoes conformational changes and obtains post-fusion state [41]. During this state, the three pairs of heptad repeat region HR-N and HR-C in trimeric S2 form a six-helix bundle structure [42]. The buried hydrophobic fusion peptides become exposed and insert into the target host membrane. These fusion peptides and the transmembrane anchors are positioned at the end of a six-helix bundle structure, bringing the viral and host membranes to fuse [42, 43]. During this process, a large amount of energy is released, which accelerates the membrane fusion forward. Along with this, receptor binding and low pH can also trigger this membrane fusion [6].

Since the spike protein has a good binding affinity for sugar receptors of human cells, it uses them as a mechanism of cell entry [6]. Notably, the SARS-CoV-2 has a higher affinity to human ACE2 than the SARS-CoV virus strain. The ectodomain of the SARS-CoV-2 spike protein binds to the peptidase domain (PD) of ACE2 with a Kd (equilibrium dissociation constant) of ~ 15 nM [4]. Spike protein priming is done by transmembrane protease serine 2 (TMPRSS2), which is also essential for the entry of SARS-CoV-2 [44]. Generally, SARS-CoV enters host cells through endocytosis, where its spike protein is processed by cathepsin L and cathepsin B lysosomal proteases. The extracellular proteases, including elastase in the respiratory tract and TMPRSS2 on the surface of flung cells are also known to activate spike membrane fusion [6].

The viral entry to the host cells can be via: (1) the endocytic pathway and (2) non-endosomal pathway [45] (Fig. 1b). The endocytic pathway, especially clathrin-dependent endocytosis is extensively studied for SARS-CoV and MERS-CoV viral entry. Since the SARS-CoV-2 also uses the same receptor as SARS-CoV, it is reported that SARS-CoV-2 also uses the same viral entry mechanism. Wang et al. [46] reported clathrin- and caveolae-independent endocytic pathways for SARS-CoV entry. Despite the common use of endocytic pathway as a viral entry mechanism, the discrepancies about the same are unavoidable. Thus, the exact nature of the viral entry is context-dependent, including the type of the virus and the host cells [47].

In addition, the macrophages can also act as a viral reservoir and support minimally for the SARS-CoV-2 entry and its replication. Although the dendritic cells and other immune cells are not infected by SARS-CoV-2, they may serve as a transporter for the viruses, which is also responsible for the pathogenesis [48, 49].

Active and inactive state of spike protein

The S1 subunit of SARS-CoV spike glycoprotein is predominantly composed of β-strand structures, composed of an N-terminal domain (NTD, residues 14–294) and three C-terminal domains (CTD1, residues 320–516; CTD2, residues 517–578 and CTD3, residues 579–663). The NTD is linked with CTD1 through the linker residues 295–319, where CTD1 acts as a potential RBD for SARS-CoV and binds explicitly with the ACE2 receptor.

Gui et al. [43] reported that the SARS-CoV spike glycoprotein trimer could obtain different conformations, which are necessary for effective binding with ACE2. All these conformations are termed based on the position of CTD1 of the spike glycoprotein (Fig. 2). The three spike glycoprotein monomers intertwine with each other and form densely packed homotrimer. The head portion of this trimer is taken place by NTDs and CTD1s of S1 subunits, where the CTD1s are located at the center and the NTDs are located outside of this triangular head. The S2 subunits represented as a stem for this trimer, which is further surrounded by CTD2s and CTD3s of the S1 subunits (Fig. 2). In the inactive state (Fig. 2a, b), S2 subunits (stem portion) are completely covered by the “down” position of CTD1s (head portion), which causes steric clashes for the binding between spike protein and ACE2. In the active state (Fig. 2c), two CTD1s adapt the “down” conformation and one CTD1 rotates outward and obtains “up” conformation, which does not cover the S2 subunit and allows the interactions between spike protein and ACE2. The “up” position also paves the way for the S2 subunit to expose and insert its fusion peptides into the host cell membrane [43] (Fig. 2).

Fig. 2
figure 2

Inactive and active state conformations of spike protein with its receptor binding. a, b Represent the inactive state conformations in which S2 subunits (stem portion) are completely covered by the “down” position of CTD1s (head portion). It causes steric clashes and inhibits the binding between spike protein and ACE2. c Represents the active state conformation, in which one of the CTD1s is in open state (shown in red stars) and facilitates the binding between spike protein and ACE2. TM and CT stands for transmembrane domain and cytoplasmic tail, respectively

Key residues of SARS-CoV-2 and SARS-CoV

The sequence similarity between SARS-CoV-2 and SARS-CoV spike protein explains the possibility of having the same receptor ACE2 in the host cell [14]. The sequence and the structural studies revealed the key residues, which are involved in spike–ACE2 interactions. The key residue at position 493 in RBD of SARS-CoV-2 is Gln, wherein SARS-CoV (479 is the corresponding residue in SAR-CoV) of civets and humans, it is Lys and Asn, respectively. Since the residue 479 of RBD is near to the virus-binding hotspot residue Lys31 of human ACE2, the Lys residue present in civet causes steric clashes and not favoring human ACE2 receptor binding. However, the Lys479Asn mutation revealed that the Asn present in 479 of human SARs-CoV enhances the viral binding with human ACE2 receptors. The Gln493 in SARS-CoV-2 RBD is compatible with the human ACE2 receptor hotspot residue Lys31, which explains its target cell identification [14].

Similarly, the residue 501 in SARS-CoV-2 RBD is Asn, wherein civet RBD, it is Ser (487 is the corresponding residue in SARS-CoV), and for humans, it is Thr. This residue also plays an important role in making interaction with the hot spot residue Lys353 of receptor ACE2. Ser487Thr mutation analysis shows encouraging results upon human ACE2 receptor binding and plays a role in human-to-human transmission. Thus, the interactions with Lys353 will be favorable for threonine (human SARS-CoV) and Asn (SARS-CoV-2) than serine (civet) [14].

The residues Lue455, Phe486, and Ser494 in SARS-CoV-2 RBD (Tyr442, Leu472, and Asp480 in human and civet SARS-CoV) are considered as important for the human ACE2 receptor binding. Tyr442 of human and civet SARS-CoV RBDs shows unfavorable interactions with hotspot residue Lys31 of human ACE2; however, Lue455 of SARS-CoV-2 provides favorable interactions. Compared with Leu472 of human civet SARS-CoV RBD, Phe486 of SARS-CoV-2 provides better interaction with hotspot residue Lys31 of human ACE2. Although Ser494 provides positive support for the human ACE2 receptor hotspot residue Lys353, the SARS-CoV Asp480 also makes favorable interaction with hotspot residue Lys353. Throughout this viral entry process, the Lys31 and Lys 353 of human ACE2 receptors are termed as “hot spot” residues, which consist of a salt bridge buried in a hydrophobic environment and contribute critically for the virus and host cell receptor binding. Thus, this key residue comparison of SARS-CoV-2 with the civet and human SARs-CoV explains how actively SARS-CoV-2 is choosing and binding with the human ACE2 receptors, which is likely to cause the human-to-human transmission [14]. The heterogeneity of amino acids in ACE2 receptors is also responsible for the wavering binding affinities between the host and the virus, which is associated with the viral transmission. However, the variants in the host cell receptors (ACE2) can confer resistance against the invading pathogens. Hussain et al. reported that the mutations S19P and E329G in ACE2 disrupt the intermolecular interactions and have low binding affinity with viral spike protein. In addition to the variations in the viral spike protein, ACE2 allelic variants can also drive the potential resistance against SARS-CoV-2 infection [50].

Intrinsic disorder propensities in coronaviruses

Intrinsically disordered proteins (IDPs) and intrinsically disordered regions (IDRs) also play major roles in a number of biological functions, including DNA/RNA binding, protein binding, and facilitating access to the binding sites between the binding partners [51, 52]. The RNA–protein recognition often needs conformational changes in both RNA and protein, which is facilitated by the structural flexibility of disordered residues [51]. Also, the functional importance of intrinsically disordered regions in proteins includes transcription, translation, post-translational modifications, and cell signaling [51, 53]. The categorization of coronaviruses, based on the intrinsic disorder propensities, can represent useful identification for the viral life cycle and its pathogenicity.

The IDRs in SARS-CoV nucleocapsid protein comprise three segments, such as 1–44, 182–247 and 366–422 [54, 55]. The highly flexible intrinsic disordered linker region, which connects the NTD and CTD is rich in serine and arginine residues. An intrinsically disordered domain that flanks the CTD (C-terminal tail peptide) plays a significant role in dimer–dimer association in human coronavirus 229E (hCoV-229E). Likewise, the coronavirus HKU1, has a partially disordered conserved linker loop (amino acids 428–587) structure [56]. HKU1 S2 subunit also shows the presence of disordered residues at its protease cleavage site [57].

In SARS-CoV-2, the attachment of spike protein with the host cell is activated by the host cell enzymes trypsin, cathepsin L, furin and TMPRSS2 (Fig. 1b). The sequence comparison of SARS-CoV-2 against other lineage B betacoronaviruses shows that the unique amino acid pattern “RRAR” is present at the S1/S2 junction of the spike protein, which is cleaved by the furin enzyme. However, the structure reported for SARS-CoV-2 spike protein (PDB code: 6VSB), shows that S1/S2 junction is in a disordered, solvent-exposed loop [4]. Hence, it has been hypothesized that the unique amino acid sequence “RRAR” present in SARS-CoV-2 is responsible for their effective transmission [58,59,60].

The binding with ACE2 is governed by the intrinsically flexible receptor binding motif, and the binding interfaces along with the key residues are reported in the literature [61, 62]. In addition, based on our analysis (Fig. 3), the monomeric structure of the spike protein in the free form (PDB ID: 6VSB; green color) [4] shows a number of missing residues in the structure. These missing residues of the spike protein attain a stable conformation upon binding to the ACE2 receptor and hence, they are termed as disorder-to-order transition (DOT) residues (PDB ID: 6LZG; light golden color). Specifically, the regions, Leu455 to Pro491 and Asn501 to Val503 show disorder-to-order transitions [61]. These disordered-to-ordered residues are facilitating a better shape complementary and affinity between ACE2 and spike protein. Interestingly, the key residues responsible for transmission, and interaction with ACE2 receptors (already discussed in section “Key residues of SARS-CoV-2 and SARS-CoV”) are overlapping with these mentioned disordered-to-ordered residues. Based on these observations, the disorder-to-order conformational change is necessary to facilitate the spike protein binding with its receptor. Thus, an in-depth analysis of these disordered residues will shed additional insights on the viral recognition and transmission mechanism.

Fig. 3
figure 3

Structure of the monomeric spike protein (green)—ACE2 receptor (blue) complex. The interface residues are shown in light golden. The disordered-to-ordered transition residues (Leu455 to Pro491 and Asn501 to Val503) have been marked in the figure

The studies by Goh et al. [63] on 1918 H1N1 and H5N1 explain that it is very likely that disordered regions are important for host specificity and recognition, e.g., across species of birds. They also explained the changes created by disorder in crucial regions, increasing the virulence of both the H5N1 and the 1918 H1N1 viruses. Therefore, the increasing reports for the intrinsically disordered regions in coronaviruses need to be pointed out. The importance and functional role of intrinsically disordered regions need to be thoroughly studied. This could identify additional targets for drugs to combat coronavirus through the disruption of their packing and assembly process.

Therapeutic targets

The receptor binding, along with its membrane fusion, is the initial and important step in the coronavirus infection and serves as primary targets for inhibiting the viral entry.

The genome sequencing of SARS-CoV-2 and the comparison of the genomes of related virus proteins suggested the anti-HIV lopinavir plus ritonavir combination can be likely effective [15, 64, 65]. SARS-CoV-2 uses the ACE2 receptor of AT2 cells in the lung as its primary targets. Since the viral entry is governed by receptor-mediated endocytosis, AP2-associated protein kinase 1 (AAK1), a known regulator of endocytosis, can be a good target to interrupt the virus entry. The Janus kinase inhibitor baricitinib, and its binding with the cyclin G-associated kinase (endocytosis regulator) is sufficient to inhibit AAK1 [66, 67]. Similarly, sunitinib and erlotinib, the oncology drugs, have been shown to inhibit viral infection of cells through the inhibition of AAK1 [68]. However, these compounds bring serious side effects and cannot be considered for a safe therapy.

Based on the pathogenicity studies of SARs-CoV and MERS-CoV, an increased amount of inflammatory cytokines in serum is associated with the inflammation and extensive lung damage [69]. Similarly, SARS-CoV-2 also has a high amount of IL1B, IFNγ, IP10, and MCP1/CCL2, which lead to T-helper-1 (Th1) activation. However, SARS-CoV-2 is shown to increase the T-helper-2 (Th2) cytokines (IL4 and IL10) that are known to suppress inflammation, which differs from SARs and MERS-CoV infection. In view of cytokines induced by SARS-CoV-2 and other SARS-CoV, MERS-CoV, corticosteroids were frequently used to reduce the inflammation-induced lung injury [2].

Host-and-viral-targeted drugs

Arabi et al. [70] examined the combination of interferon beta-1b, lopinavir, and ritonavir combination for the MERS infection in Saudi Arabia. The antiviral nucleotide prodrug remdesivir showed a potent efficacy on MERS-CoV and SARS-CoV infections. Since SARS-CoV-2 is an emerging virus, no effective treatment has been developed so far, yet the already available combination of lopinavir and ritonavir is being used [2].

Since the SARS-CoV enters the cell through endocytosis and the lysosomal protease is priming the spike protein, targeting/inhibiting the endosomal acidification or lysosomal cysteine protease can block the SARS-CoV entry [6]. The view of the activation of SARS-CoV and SARS-CoV-2 by TMPRSS2 might also have therapeutic suggestions. A protease inhibitor, camostat mesylate, which is studied as a TMPRSS2 inhibitor and has been used to treat humans, is available and could be employed as a defense against the respiratory viruses [71, 72].

Autophagy has been implicated in viral replications, which is also responsible for the formation of double-membrane vesicles (DMV) in the host cells [73]. Since the autophagosomes are degraded by lysosomes, inhibitors like lysosomotropic agents have been proposed for SARS-CoV-2 [74]. The antimalarial drugs hydroxychloroquine (HCQ) and chloroquine (CQ) have been demonstrated for SARS-CoV-2 antiviral activity. However, data to support the use of HCQ and CQ for COVID-19 are incomplete [75]. These lysosomotropic agents are helpful for neutralizing the endosome-lysosomal acidic pH, thereby blocking protease activity and subsequently blocking the viral entry. However, how autophagy is implicated in the infection of CoVs is still under debate [47, 74, 76]. Table 1 describes the different inhibitors used for blocking the viral entry. The most studied and the common pathway proposed for all coronaviruses is the endocytic pathway, so blocking that pathway is a big hallmark for treating the disease.

Table 1 Inhibitors for SARS-CoV-2 viral entry

Identification of potential drugs for SARS-CoV-2 is a necessary task, so drug repurposing can also work for this scenario. The small molecules, which are already in clinical trials or used for some other diseases and the molecules from the expert’s opinion are also listed in Table 2. The table is categorized into three groups: (1) compounds involving drug repurposing, (2) compounds in clinical and pre-clinical study, (3) compounds targeting the mechanism pathway.

Table 2 Literature searched, and expert opinion derived small molecules, which modulate SARS-CoV-2 [131]

Researchers in government and private sectors are making huge efforts to develop effective vaccines for SARS-CoV-2. The vaccine development approaches are mainly based on inactivated and attenuated viral protein particles, viral vectors and viral DNA/RNAs. A novel RNA-based vaccine uses a part of the genetic code of spike protein mRNA-1273 (ClinicalTrials.gov: NCT04283461) [139]. Several other mRNA-based vaccines by CureVac (Tübingen, Germany), BNT162 by BioNTech (Mainz, Germany), Pfizer (New York, NY, USA) and BioNTech mRNA vaccine (Mainz, Germany) are in different stages of development [139, 140]. CanSino Biologics (Tianjin, China), the company that developed the vaccine for Ebola is also developing a vaccine named Ad5-nCoV for SARS-CoV-2. It is a spike protein-based vaccine that is undergoing phase I clinical trials in healthy individuals in Wuhan china (ClinicalTrials.gov: NCT04313127) [141, 142]. The current status of the vaccines under development is available at the Milken Institute Treatment and Vaccine Tracker:

https://milkeninstitute.org/sites/default/files/2020-03/Covid19%20Tracker%20032020v3-posting.pdf [141].

Summary and perspectives

SARS-CoV-2 is a continuously growing life-threatening disease. This coronavirus has rapidly evolved and spread all over the world, having a mortality of more than 0.8 million lives so far. The exact origin and mechanism of attack and spatial distribution are still not completely explored.

The viral and the host cell proteins assisting in invasion process can be a target for treating the infections. The available crystal structures and the binding mechanism proposed by other coronaviruses can help to study the SARS-CoV-2. The critical structural studies of the viral particles are also helpful to identify the drug targets. The positional changes (up and down conformation) of spike protein trimer determine the active to inactive state. Thus, targeting these conformations and developing small molecules or peptides can also stop the viral entry. In addition, phylogenetic analysis and structural studies revealed that the hot spot residues, the role of Gln493 in human ACE2 receptor binding and the critical residue Asn501 of RBD explain the human-to-human transmission. The intrinsic disorder region and a precise furin-like cleavage site can be responsible for viral cycle and pathogenicity. However, in-depth studies are needed to address this issue, which may represent a potential antiviral strategy.

The small molecules, which are in clinical trials, the drug compounds, which are currently in use for treating different diseases can also help to identify (screen) potential drug candidates. Development of suitable mice models to understand this novel virus infection and comparison with the other coronavirus will speed up the drug discovery. Drug testing techniques are also necessary to be accelerated. Recognizing the risks and commercial benefits, researchers are developing effective vaccines for SARS-CoV-2 infection.