Introduction

Coronavirus disease 2019 (COVID-19) caused by the severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) poses a huge threat to global public health. Early diagnosis is essentially important for the disease control and clinical treatment. However, the current nucleic acid testing for SARS-CoV-2 carries a negligible false-negative risk [1, 2]. Serological tests are considered to be a powerful complement to the nucleic acid tests, especially for COVID-19 patients with undetectable viral RNA [3]. Among the serological tests, colloidal gold immunochromatography assay (GICA) is a simple and rapid test method, suitable for timely diagnosis and large-scale screening of COVID-19 patients. Most GICA tests are developed based on indirect or capture immunoassay, by using SARS-CoV-2-related recombinant antigen and secondary antibodies to detect IgM/IgG antibodies against SARS-CoV-2 in patient sera. However, these GICA approaches usually have no ideal detection sensitivity; the pooled sensitivity for SARS-CoV-2 was 66.0% (49.3% to 79.3%) according to previous reports [4,5,6]. A recent study showed that the sensitivity of the total antibody (IgM, IgG, IgA, etc.) test is higher than that of the single IgM or IgG test, which was developed based on the enzyme-linked immunosorbent assay (ELISA) in double-antigen sandwich format [7]. This format can also be applied into GICA and is expected to improve the detection sensitivity.

To develop a reliable double-antigen sandwich GICA, it is important to obtain suitable SARS-CoV-2 antigens or related recombinant proteins. SARS-CoV-2 is a β family coronavirus, including the spike (S) protein, envelope (E) protein, membrane (M) protein, and nucleocapsid (N) protein. Among them, the N protein is the most abundant, relatively conservative protein in coronaviruses; thus, it is often used as a diagnostic antigen [8]. Our previous study showed that the antigenicity of the COOH terminus is higher than that of the NH2 terminus in the N protein of SARS-CoV; and the former N protein fragment may have the same antigenicity with the full-length N protein [9]. The S protein is the common target when designing vaccines based on neutralizing antibodies. It contains a receptor binding domain (S-RBD) in the S1 subunit, which mediates receptor binding and membrane fusion [10, 11]. According to report, the S-RBD of SARS-CoV-2 has higher affinity for angiotensin-converting enzyme 2 (ACE2) than SARS-CoV and lacks cross-reactivity with monoclonal antibodies against S-RBD of SARS-CoV [12], indicating the potential diagnostic value of S protein.

In the present study, we prepared six recombinant proteins based on the reported SARS-CoV-2 sequence (GenBank accession MN908947) [13]. Three recombinant N proteins were obtained by prokaryotic expression, including the full-length recombinant N protein (rN), NH2 terminal (rN1) protein fragments, and COOH terminal (rN2) fragments from this protein. Three recombinant S proteins were obtained by eukaryotic expression, including an S1 domain (rS1) fragment, the receptor binding domain (rS-RBD), and S-RBD ligated to the Fc fragment from mouse (rS-RBD-mFc). The recombinant proteins were evaluated using indirect ELISA. Two recombinant proteins with the highest ELISA titers (rS1and rS-RBD-mFc) were chosen to develop a double-antigen sandwich GICA to detect total antibodies against SARS-CoV-2. Performance of the GICA method was evaluated with 1165 clinical samples, which showed a high detection sensitivity (92.09%) and good specificity (99.44%). Moreover, 65.6% (21/32) of COVID-19 patients with undetectable viral RNA were correctly diagnosed by the GICA method.

The results indicate that the proposed GICA test can serve as a reliable complement to current RNA tests for COVID-19.

Material and methods

Materials

DNA polymerase (2× Pfu MasterMix) was purchased from Beijing TransGen Biotech (Beijing, China). T4 DNA ligase and Gibson Assembly kit were from New England BioLabs Inc., (Ipswich, England). Eukaryotic vectors H293 and H293-Fc, which were used for transient expression, were obtained from the Laboratory of Protein Project, Beijing Institute of Biotechnology, China. FreeStyle™ 293 expression medium, Opti-MEM® I (1×), a reduced serum medium, and 293 fectin™ reagents were purchased from Invitrogen Inc. (CA, Carlsbad, USA). The Unique CDSystem for protein purification was from Suzhou Inscinstech Co., Ltd. (Suzhou, China). The DNA extraction/purification kit was from Beijing TransGen Biotech. Primer construction and sequencing work were conducted by Beijing Tianyi Huiyuan Biotech Ltd. and Beijing Biomed Biotech Ltd., respectively. GICA nitrocellulose (NC) membranes were obtained from Millipore Sigma (Saint Louis, MO, USA); glass fiber and absorbent pads were from Shanghai Kinbio Tech. Co., Ltd. (Shanghai, China).

The gene sequences of the recombinant proteins were all derived from the RNA sequence of SARS-CoV-2 strain from Wuhan (GenBank accession MN908947). The nucleotide positions of the gene sequences are rS1 (21602–23584), rS-RBD (22514–23311), rN (28274–29530), rN1 (28274–28900), and rN2 (28901–29530). The N, N1, and N2 gene cloning vectors were constructed by General Biosystems Co., Ltd. (Anhui, China). The S1 DNA sequence was optimized and synthesized by GenScript Co., Ltd. (Nanjing, China). The vector and bacterial strains for prokaryotic expression were pET28a, E. coli Rosetta, or BL21 (DE3).

Preparation of the recombinant S proteins

The full coding region, which was obtained by overlapping extension PCR using primers containing restriction enzyme recognition sites (Supplementary Table 1), was ligated to HEK293 vectors after digestion. The recombinant plasmids were transformed into E. coli DH5α, and bacterial colonies were selected on Luria-Bertani (LB) agar containing ampicillin. Positive colonies were cultured in 500 μL LB liquid medium for 2–4 h, and the resultant plasmids were extracted, PCR-verified, and sequenced. Positive recombinant plasmids (210 μg each) and liposomes (280 μL) were diluted in 7 mL of opti-MEM medium for 5 min, respectively, and mixed for 30 min, and H293 cells (1.2 × 106 /mL) were added. After culturing (120 rpm, 37 °C with 5% CO2, 3–4 days), the cell supernatants collected by centrifugation were purified with a 0.44-μm filter and the Unique CDSystem chromatography workstation. After column protein A balancing (10 column volumes) with phosphate-buffered saline (PBS), the cell supernatants were placed under a flow rate of 2.0 mL/min and then washed and eluted with PBS (five column volumes) and citric acid buffer (pH 3.0) to collect the purified protein. For desalination, a 1/3 sample volume was applied to a G25 column pre-balanced with PBS (5 column volumes), and each protein was collected after column washes with PBS. Their purities were confirmed by sodium dodecyl sulfate–polyacrylamide gel electrophoresis (SDS-PAGE).

Preparation of the recombinant N proteins

The N protein’s full coding region was PCR amplified using primers containing restriction enzyme recognition sites (Supplementary Table 1). The restricted amplicons were ligated into pET28a or pET32a using T4 DNA ligase. The recombinant plasmids were transformed into E. coli Rosetta or BL21 (DE3), and the bacterial colonies were selected on LB agar containing kanamycin. Recombinant plasmids in the bacterial colonies were extracted and confirmed as authentic by DNA sequencing. Protein expression was induced for 5 h in positive bacterial colonies with 0.5 mM isopropyl-D-thiogalactopyranoside (IPTG) at a starting optical density (OD) of 0.6. After centrifugation, the supernatants and precipitation products from the lysed bacteria were analyzed by SDS-PAGE. The rN, rN1, and rN2 recombinant proteins were produced by the bacteria harboring the recombinant plasmids via IPTG induction in 2 L of LB medium (20 °C, 200×g, 10 h), and the bacteria were harvested and lysed by ultrasonication (300 W, 30 min). The supernatants were clarified by centrifugation (10,000×g, 20 min), filtrated (0.4-μm filter membrane) and run through a 3 mL Ni Sepharose column with 30 mL lysis buffer, and then washed extensively with PBS containing imidazole (20–80 mM gradient) to remove non-specific proteins. The target proteins were eluted with PBS containing 250 mM imidazole.

Preliminary evaluation of the recombinant proteins by ELISA

The recombinant proteins were initially evaluated by indirect ELISA. After coating the wells with the prepared recombinant proteins, 50 μL of serially diluted human samples was added to the wells and mixed with 50 μL of horseradish peroxidase (HRP)-labeled goat anti-human IgM or IgG, followed by incubation (37 °C, 60 min). After the solution was removed and the wells had been washed with PBS buffer containing Tween-20 five times, 50 μL of tetramethylbenzidine substrate was added to the wells in the plate for 15 min. OD values were measured on a microplate reader (450/630 nm), and recombinant proteins with high OD values for COVID-19 patient sera and low OD reads for the negative control sera were used for all further experiments.

ELISA plates coated with the six recombinant proteins were first tested with seven negative sera from healthy people. Their average values plus twofold standard deviations (mean + 2SD) were used as the cutoffs. However, they were not completely equivalent for each protein and some were set at 0.2 when the calculated values were below 0.2. Sera from two patients with COVID-19 were diluted at ratios of 1:800, 1:400, and 1:100 for IgM detection and ratios of 1:80, 1:40, and 1:20 for IgG detection. The average value (mean) from the negative serum samples was set as the background value, and the lowest antibody titer with a value higher than the cutoff was set as the sensitivity level.

Preparation of the GICA strip with selected recombinant proteins

Colloidal gold suspension was prepared by reducing gold chloride with citrate. The colloidal gold was conjugated to recombinant protein rS-RBD. Briefly, 1 mg of rS-RBD was added to 100 mL of the colloidal gold suspension. After a 30-min reaction, conjugation was blocked using 10 mL of 10% bovine serum albumin (BSA) for 15 min. The colloidal gold conjugate collected (centrifugation at 12,000 rpm, 30 min, and 4 °C) was resuspended in PBS containing 0.1% BSA and 0.1% Tween-20.

The colloidal gold and rS-RBD (0.5 mg/mL) conjugate was applied to a conjugate pad (glass fiber) (30 μL/cm, dried at 37 °C, 3 h). Using a dispenser (XYZ3000; BioDot, Irvine, CA), rS1 and the secondary polyclonal antibody (2 mg/mL) were coated onto the nitrocellulose membrane as the test and control lines, respectively, at a dispensing rate of 1.0 μL/cm. The membrane was then dried at 37 °C for 1 h. Finally, the nitrocellulose membrane, conjugate pad, sample pad, and the absorbent pads were assembled and cut into 4-mm strips.

Performance evaluation of the GICA test

Positive sera from a COVID-19 patient in the early phase (< 7 days after symptom onset) was collected and diluted with running buffer at a ratio of 1:10–1:320. The limit of detection of the GICA strip was determined using 100 μL of the sample, and 0.9% NaCl was used as the blank control. The results could be obtained by naked eyes after 5–10 min. Each test was repeated three times. Additionally, 41 samples collected from healthy men were tested to preliminarily evaluate the false-positive rate of the GICA method.

Clinical evaluation of the GICA test

A multi-center prospective clinical evaluation was conducted to evaluate performances of the GICA test. The GICA test was evaluated with 1165 serum samples, of which 455 samples were from confirmed COVID-19 patients according to The Diagnosis and Treatment Protocol for COVID-19 (7th edition) in China and 710 samples from healthy people. All the samples were collected from Huoshenshan Hospital, General Hospital of Central Theater Command of the PLA, the Sixth People’s Hospital of Shenyang, Peking Union Medical College Hospital, and Shijiazhuang Fifth Hospital in China. In addition, 32 serum samples from suspected cases with negative RNA test results were collected from the above-mentioned hospitals while the clinical symptom and computed tomography (CT) image changes of typical viral pneumonia were observed for these patients. All samples were tested with the GICA strips, and nucleic acid detection by officially approved RNA test kits was used as control. The Ct cutoff values for positivity of RNA tests were determined according to the kits and real-time PCR machines used in each hospital. In addition, double-antigen sandwich–based ELISA kits (Hotgen Biotech, Beijing, China) for the detection of total antibodies against SARS-CoV-2 were also used as a comparison.

Results

Acquisition of the six recombinant proteins

Through gene subcloning by overlapping extension PCR and ligation to the HEK 293 vector, the S1, S-RBD-mFc (containing the mouse Fc fragment), and S-RBD recombinant plasmids were constructed and verified by PCR and sequencing (Supplementary Fig. 1). Following lipofection, transient expression in eukaryotic HEK293 cells, and protein purification, highly pure rS1, rS-RBD-mFc, and rS-RBD were obtained.

Through gene subcloning and ligation to the pET vector, recombinant N, N1, and N2 plasmids were constructed and verified by PCR and sequencing (Supplementary Fig. 1). The rN, rN1, and rN2 expression products from IPTG induction at 37 °C in E. coli were identified in the culture supernatants and in the precipitates, so the induction temperature was lowered to 20 °C, which increased the protein in the supernatants significantly. The supernatant proteins were purified and used for subsequent studies. That the SARS-CoV-2 N proteins were present in the precipitates implies that their expression in E. coli may influence their accurate folding or conformation, because the viral N protein may be conformationally modified after transcription in human cells. Overall, six high-yield recombinant proteins from SARS-CoV-2 with high purities were obtained, as summarized in Table 1.

Table 1 Information of the prepared recombinant proteins

Preliminary ELISA evaluation of the recombinant proteins

Indirect ELISAs were used to preliminarily evaluate the six recombinant proteins. Using serum samples from seven healthy people as the negative controls, serial samples from two patients in the early phase (< 7 days after symptom onset) of COVID-19 were detected. For the S proteins, the coefficient of variation (CV) for the seven negative samples was 20–30%. For the N proteins, the CV was 74–92%, resulting in a high cutoff value.

For IgM detection, all the S proteins had lower background values and higher OD values than those of the N proteins, as well as higher sensitivities (1:800) (Fig. 1). For IgG detection, rS-RBD-mFc had the highest sensitivity (1:80), while the second highest sensitivity (1:40) was attained by rS1, rS-RBD, and rN (Fig. 1). Although with the same sensitivity and similar background, rS1 produced a higher OD value than rS-RBD. The better performance of rS-RBD-mFc when compared with rS-RBD may be related to the protein Fc fragments, which can increase the half-life and stability for the RBD protein fragment from the S protein. Overall, rS-RBD-mFc and rS1 were best suited for IgM and IgG detection.

Fig. 1
figure 1

Evaluation of the recombinant S and N proteins by indirect IgM-ELISA (a, c) and IgG-ELISA (b, d), respectively, with serum samples from seven healthy people and two COVID-19 patients (P1 and P2). Dashed lines refer to the cutoff values

The IgM-specific sensitivities for ELISA (from 1:100 to 1:800) are all higher than those for IgG (1:20–1:80), mostly because the positive serum samples came from patients during the early phase of COVID-19 infection. Additionally, the antigenicity of full-length rN was higher than that of fragments rN1 and rN2, which is consistent with the findings from our previous study on SARS-CoV [9]. However, the higher sensitivity of rN1 over rN2 suggests that the antigenicity of the protein fragment at the NH2 terminus of the N protein was higher than that at the COOH terminus, which is the opposite result of our previous study on SARS-CoV [9].

Evaluation of the recombinant proteins by GICA tests

A double-antigen sandwich GICA test was developed using the recombinant rS-RBD-mFc and rS1 proteins, which can be used to detect total antibodies against SARS-CoV-2 in serum samples (Fig. 2). Serial dilutions of a serum sample from a COVID-19 patient in the early phase were tested by the GICA test. It showed that a positive result was observed at a 1:160 dilution (Fig. 2), indicating the GICA method was able to detect low-titer antibodies in serum. Moreover, several samples from patients infected with influenza A, influenza B, respiratory syncytial virus, Mycoplasma pneumoniae, and Chlamydia pneumoniae were all tested by the GICA test (Fig. 2). No cross-reaction was observed with these samples. In addition, there was no false-positive result when testing 41 serum samples from healthy people.

Fig. 2
figure 2

Schematic illustration (a), lower detection limit (b), and selectivity (c) of the sandwich-format GICA strip for total antibody detection against SARS-CoV-2

Clinical evaluation of the GICA test

Altogether, 1165 serum samples from healthy people and confirmed cases of COVID-19 were tested by the GICA method. As shown in Table 2, the detection sensitivity and specificity of GICA were 92.09% (419/455) and 99.44% (706/710), respectively. The overall accuracy is 96.57% (1125/1165), demonstrating reliability of the GICA test. In addition, 32 serum samples from suspected cases were collected and tested by the GICA test. All these samples were from patients that have clinical symptoms and CT changes of typical viral pneumonia, but the RNA test results are all negative. Among these samples, 65.63% (21/32) were detectable by the GICA method. Detailed clinical information of the 32 cases is provided in Supplementary Table 2. This result indicates that the GICA test for detection of total antibodies against SARS-CoV-2 could be used as a powerful complement to the nucleic acid tests, especially for COVID-19 patients with undetectable viral RNA.

Table 2 Clinical evaluation performances of the GICA method

The 455 confirmed cases out of the 1165 cases were divided into three groups according to the days of symptom onset (Table 3). For the group of patients in the early phase (≤ 7 days), the GICA test presented a lower detection rate (46.8%, 22/47) than that of the RNA test (85.1%, 40/47). However, the detection rate of the GICA test increased to 83.6% (51/61) and up to 99.7% (346/347) for patients in the middle and late phases (> 7 days), which is comparable to that of the RNA test. The result was supposed to be reasonable, because specific antibodies were commonly produced in humans after about 7–10 days of infection and could maintain for several months. Therefore, the GICA test is more suitable for COVID-19 patients in the middle and late phases.

Table 3 Comparisons of the detection rates of GICA test and RNA test for COVID-19 patients in different phases (days after symptom onset)

In addition, a total of 251 cases were tested by both the GICA method and a double-antigen sandwich ELISA method during the clinical evaluation. The ELISA kit was developed based on the same recombinant spike proteins that are used in the GICA test. The results showed that the two methods displayed a high consistency (kappa = 0.919, p < 0.001) (Supplementary Table 3). And they have comparable detection sensitivity and specificity (Table 4). Both the sensitivities are 88.9% (64/72); the specificity of the GICA test and ELISA is 97.8% (175/179) and 98.9% (177/179), respectively.

Table 4 Clinical performances of the GICA and ELISA methods

Discussion

Rapid and accurate detection methods are urgently required for the diagnosis of patients infected with SARS-CoV-2 [14, 15]. However, the current RNA tests depend on whether there is a sufficient viral load in the patient’s upper respiratory tract and reasonable sample quality, leading to a significant number of false-negative results. Consequently, serological tests, along with the epidemic history, clinical symptoms, and image features of chest CT, should be taken into consideration for the diagnosis of COVID-19 [16, 17]. In this study, the GICA test showed a high sensitivity (92.09%, 419/455) and specificity (99.44%, 706/710), and a significant number (65.63%, 21/32) of COVID-19 patients with undetectable viral RNA were correctly detected by the GICA method, indicating the diagnostic value of serological testing for COVID-19.

Selecting suitable SARS-CoV-2-related recombinant proteins is essential for developing a reliable serological test. In the study, we prepared six recombinant proteins, including three S proteins (rS1, rS-RBD, rS-RBD-mFc) and three N proteins (rN, rN1, rN2). Preliminary evaluation by indirect ELISA revealed that the three S proteins were more suitable for developing serological tests than the three N proteins. However, the antigenicity of S proteins and that of the N proteins could not be confirmed in this study, because the two recombinant proteins are prepared using different protein expression systems. The N protein with high content and strong immunogenicity is a main structural protein of SARS-CoV-2. It can be efficiently expressed through a prokaryotic expression system and has good immunoreactivity. However, as a transmembrane protein with hydrophobic regions, the S protein is better prepared using a eukaryotic expression system, which can express proteins with complex structure and high biological activity. Although the supernatant of the prokaryotic expression recombinant N proteins was used in this study, insoluble precipitates were found during preparation, especially rN2, making it likely that some of their conformations were slightly different from the natural forms, or modifications for some sites were missing. Thus, the antigenicity of N protein should be studied after expression in a eukaryotic system in the future.

In conclusion, six recombinant proteins of SARS-CoV-2 were prepared and evaluated. It showed that the eukaryotic expression spike proteins are more suitable than the prokaryotic expression nucleocapsid proteins for the serological diagnosis of SARS-CoV-2 infections. A double-antigen sandwich GICA test was developed using the selected recombinant proteins (rS1 and rS-RBD-mFc), which can detect total antibodies against SARS-CoV-2 in human serum. Clinical evaluations of the GICA tests showed a high sensitivity and specificity comparable to those of the RNA tests. Given its simplicity, reliability, and cost-effectiveness, the GICA method could be used as a powerful complement to current COVID-19 diagnostic tools.