Dear Editor,

Coronaviruses (CoVs) are responsible for three worldwide outbreaks in the last twenty years: Severe Acute Respiratory Syndrome (SARS-CoV) in 2002/2003, Middle-East Respiratory Syndrome (MERS- CoV) in 2012/2013, and the current 2019/2020 Severe Acute Respiratory Syndrome-2 (SARS-CoV-2). Concerning their structural and molecular characteristics, CoVs are members of Nidovirales order which also include Roniviridae and Arteriviridae groups. CoVs family comprises four major sub-classifications: alpha-, beta-, gamma-, and delta-CoVs. The SARS-CoV-2 is responsible for the Coronavirus Disease-19 (COVID-19) pandemic –initially reported in Wuhan, Hubei, Cn- and belongs to lineage b of beta-CoVs demonstrating a strong phylogenetic similarity with BatCoVRaTG13 type. Concerning its genomic structure, a large non-segmented, positive-sense RNA molecule of approximately 30 kb has been detected and analyzed in conjunction with the corresponding RNA-dependent RNA-polymerase (Rd-Rp) that is essential for its replication in the cytoplasm of the target epithelial cells. Analyzing SARS-CoV-2 spherical virion’s structure (diam ~ 100 nm), research groups have confirmed that there are four main proteins including the spike surface glycoprotein (S), the main or matrix protein (M), the envelope protein (E), and finally the nucleocapsid protein (NC), whereas a variety of non-structural proteins have been also identified. S glycoprotein projections -consisting of two subunits S1/S2- provide a unique crown-like formation (corona) on virion’s surface. Concerning their functional role, S1 represents the main receptor-binding domain (RBD), whereas S2 is involved in the virus-cell membrane fusion mechanism interacting with proteases, such as furin, thrypsin, cathepsin or serino-protease TMPRSS2 [1, 2]. Novel molecular and structural/crystallographic analyses have focused on a specific cell membrane receptor - the human angiotensin-converting enzyme 2 (hACE2) - which is the main target-functional receptor for SARS-CoV-2 cell attack, attachment and entry that leads to S1 and S2 subunits activation. Interestingly, the furin protease provides cleavage of S glycoprotein to S1/S2 subunits in lung infected epithelia, dramatically in type II pneumonocytes [3]. Mechanisms of furin interaction with other critical molecules including hypoxia factors seem to be also significant regarding its expression and activity in SARS-CoV-2 cell entry [4].

Rapidly developed bilateral pneumonia is the most severe complication affecting critically the lower respiratory tract anatomy leading also to increased mortality rates, especially in males, older, immunocompromised patients and others with a history of metabolic (diabetes mellitus, obesity) or other chronic diseases (malignancies, cardio-vascular and pulmonary failure). A recently published study - based on autopsy procedures - revealed serious specific pathological findings representative of a distinctive acute interstitial pneumonia in COVID-19 molecularly positive victims. Concerning lung epithelia, infected tissues are characterized by diffuse alveolar damage combined with microvascular lesions and also with intra/extra-vascular fibrin deposition. Intravascular trapping of neutrophils combined with microthrombi in arterioles are also observed, whereas gross pulmonary thromboemboli with pulmonary infarcts and diffuse hemorrhage are also included in the pathological landscape [5]. Lymphocytic pneumonitis, pericarditis and myocarditis were also detected in the corresponding examined tissues of dead patients.

According to the latest published epidemiologic data, SARS-CoV-2-mediated COVID-19 pandemic demonstrates aggressive clinic-pathological profiles in significant subsets of the infected patients –especially in males- and for this reason the role of chromosome X that hosts the hACE2 gene (band Xp22.2) seem to be critical. Genomic analyses in humans have shown that this chromosome contains approximately 800 genes that encode for protein products. It is also well established that males and females retain one of their mother’s X chromosomes, whereas females retain another X chromosome from their father. Based on this inheritance pattern, mutations, deletions, amplifications or single nucleotide polymorphisms (SNPs) in X linked genes - such as hACE2 - affect directly their expression profiles in males leading to altered proteins, since they have only one X chromosome hosting one copy of each gene. In contrast, female carriers of abnormal genes demonstrate a normal phenotype due to the second normal and functional gene copy. A prospective cohort study analyzed a dataset including information for incidence rates, infectivity potential and phenotype characteristics in COVID-19 patients in U.S.A. hospitals. They reported that males demonstrated more frequently a progressively aggressive clinical image than females, needing admission to an intensive care unit and oxygen supply for a long time [6]. Furthermore, they reported significantly higher mortality rates among male infected patients compared to females, although the age-dependent mortality risk was equal for both genders.

Another molecular study based on genome-wide linkage scan analysis, explored the role of SNPs in a specific clinic-pathological entity -familial intracranial aneurysms- in Japanese families. Among the analyzed genes, hACE2 demonstrated specific SNPs in the majority of the examined cases, whereas none of them was strongly involved in the development and progression of the aneurysms [7]. Similarly, another A/G SNP based analysis in Chinese people showed increased level of G allele frequencies –especially in male patients with metabolic syndrome - correlated to hypertension [8]. Concerning SARS-CoV-2 cell membrane binding, the impact of these SPNs in potential structural modifications and expression patterns of the hACE2 molecule remains unknown. Interestingly, differences in response to the current viral infection regarding males and females, could be partially explained by the activation of immune regulatory X-linked genes, such as the Toll –like Receptors 7 and 8 (TLR7/8), which lay equally to hACE2 gene (band Xp22.2) and also the Forkhead Box P3 (FOXP3) gene located on Xp11.23 [9]. Additionally, hACE2 is found to be significantly overexpressed in cigarette smokers especially in men which represent the major percentage worldwide [10]. Besides SNP modifications in the current chromosome X region, epigenetic changes including promoter hypomethylation leads to aberrant hACE2 over expression affecting negatively the response to lung severe infection in patients with immune diseases, such as systemic lupus erythematous [11]. Chromosome X hosts another critical gene, the RGN (band Xp11.3), which encodes for regucalcin, an important calcium (Ca2+) -binding protein involved in intracellular Ca2+ homeostasis principally expressed in hepatocytes. Interestingly, regucalcin is expressed also in lung epithelia, and especially in lung adenocarcinoma higher levels of the molecule were associated with better prognosis, acting as suppressor gene [12]. Because furin protein is a calcium-dependent serine endoprotease, intracellular interactions with regucalcin remain unexplored in SARS-CoV-2 –mediated COVID-19. In contrast, regucalcin is involved in other viral infections -mainly in hepatitis B virus (HBV) - and its expression is correlated to the severity of liver lesions [13].

Based on these critical clinic-pathological and genetic data, we observe that specific chromosome X p arm rearrangements or SNPs that affect predominantly hACE2 - but also TLR7/8, FOXP33, and RGN- genes could potentially be useful molecular biomarkers for explaining differences in infected male and female patients regarding the SARS-CoV-2 - related lung pathology lesions (severe acute interstitial pneumonia), response to specific anti-viral treatment, and also levels of infectivity (transmission dynamics)/mortality rates in two sexes (Fig. 1).

Fig. 1
figure 1

Schematic presentation of X-linked hACE2 and other critical genes in COVID-19. Females host two alleles, whereas males only one. SNPs, rearrangements, epigenetic changes (red spot: abnormal allele, green spot: normal) on Xp22.2 and other Xp bands affect directly normal structure, expression and function of the corresponding proteins in males. (hACE2: human Angiotensin-Converting Enzyme 2, TLR7/8: Toll –like Receptors 7 and 8, FOXP3: Forkhead Box P3, RGN: ReGucalciN; SNP: single nucleotide polymorphisms; Ca2+: calcium)