Introduction

In the context of SARS-CoV-2, we have witnessed rapid development and implementation of various therapeutics and diagnostics to combat the pandemic. However, these diagnostics are often centered around European and North American testing markets, ignoring the different biologic variables at place in varying countries across the world. A number of considerations in material development must be evaluated to have success in all continents, which will be crucial for accurate understanding and control of SARS-CoV-2, along with other emergent infectious diseases (Figure 1).

Figure 1
figure 1

Variables in assay design for global infectious diseases. Multiple considerations must be made when developing an assay or translating an assay to another geographic location. (a) Endemic viruses and antibody responses: can create variability in background reactivity for serology. (b) Cold chain: can cause limitations on the types of assays used and sample stability. (c) Universal Standards: common control samples such as patient serum are not viable to be used in a high number of laboratories and synthetic technical standards (recombinant antibodies) should be considered. (d) Urban versus rural population centers: Density of populations can change transmission dynamics and testing availability. (e) Physical and financial requirements: Assays that require physical instrumentation beyond standard laboratory setup or those that incur large upcharges in reagent cost can be prohibitive. (f) Reagent quality and validation: Precisely controlled manufacturing and distribution of optimized reagents minimize potential exacerbation of other variables in assay development. Figure created in Biorender.

Cold chain considerations for sample acquisition

One limit of biomedical research and therapeutics is the cold chain. The need for an uninterrupted temperature-controlled collection, processing, and transport pathway to ensure sample or therapeutic stability can be highly restraining when it comes to long distances or resource-scarce settings.

This translates to sample collection and analysis for diagnostic and infection tracking data. Viral swabs and sampling often require cold storage prior to detection of viral genomic RNA; although the use of RNA preservatives added at time of sample collection can remove required cold chain without significant RNA degradation but eliminates the opportunity to culture the virus (i.e., Zymo Research, QIAGEN). The sample must then transit to a laboratory capable of viral RNA extraction and polymerase chain reaction (PCR), which may be several hours to days’ drive away in resource-scarce settings. In the case of evaluating for the presence of viral variants, there may be only one national laboratory in a country equipped with sequencing capabilities.1 However, simplification of viral detection through multiple available rapid tests, including loop-mediated isothermal amplification- and probe-based RNA detection and lateral-flow antigen (protein) detection kits, has expanded the ability to diagnose SARS-CoV-2 in constrained health care systems at low costs. Point-of-care detection kits with shelf stable components overcome a multitude of obstacles and could serve as a referral to more sensitive laboratory-based tests, or as a standalone in locations where access to laboratory tests is limited.

With respect to antibody detection assays that look for the presence of anti-SARS-CoV-2 antibodies in a patient’s blood, sample collection method can greatly alter sample stability. Venipuncture blood draws that result in the collection of serum or plasma are not shelf stable and are subject to contamination and bacterial growth if not stored in air-tight sterile storage containers. Serum and plasma samples are also more prone to acting as a reservoir of infectious agents such as HIV and hepatitis. In order to circumvent the need for venipuncture, volumetric serum sampling through subcutaneous and capillary-based blood draw have been developed to allow for home sampling.2 While these sampling devices allow for avoidance of venipuncture blood draw that requires a trained phlebotomist, there is still concern regarding sample stability and cost. An alternative to liquid blood samples is dried blood collection.3,4 These methods include dried blood spots and engineered solutions that collect precise blood volumes. In the most simplistic form, similar to that most often used in resource-scarce settings for malarial diagnostics, dried blood can be collected on filter paper that is then dried out, and stable at room temperature for weeks, and years when frozen. In this case, the only supplies required for blood sampling are a lancet or needle, an alcohol swab or similar material for fingertip skin sterilization, and the blood sampling device such as filter paper. However, the possible drawback of this approach used widely in resource-constrained settings for tropical diseases is a high degree of variability in sensitivity.3 Newer technologies are proving to be more reliable and offer a future for mail-in and room temperature sample collection.

Antigen validation, verification, and evaluation

In the context of serologic tests, there are multiple approaches to analytics, mainly the rapid lateral-flow based assays, and the standard laboratory analysis, enzyme-linked immunosorbent assays (ELISAs), to detect antibodies generated by the patient against specific proteins of the virus.5,6,7,8,9 Multiple ELISAs have been developed for detection of antibodies against SARS-CoV-2, including those that utilize the nucleocapsid protein (N), the spike protein (S), and more specifically the receptor binding domain of the spike protein (RBD).4 Each of these antigens can be engineered to provide an optimal signal to noise ratio on serologic assays, as well as optimal production for materials synthesis.

A variety of factors can contribute to antigen performance, including production hosts, purification processes, and storage/handling considerations. S and RBD proteins are heavily glycosylated and have complex folding, making them amenable only to eukaryotic production for the most part. Glycan patterns can vary from host to host, making it vital that proteins are compared consistently from batch to batch and investigator to investigator to obtain reliable results. In the case of S protein, the natural trimeric structure is required for high-sensitivity serologic tests—to avoid the complexity of the membrane-spanning region usually responsible for this trimerization, investigators have artificially introduced trimers using a foldon domain from bacteriophage T4.10 This produces functional trimers of S, but their stability is likely lower than the natural membrane-bound form, requiring careful analytics of proteins to ensure a consistently high level of functional trimer. For RBD, it has been shown that alterations in purification tags can lead to a significant alteration in ELISA sensitivity, again requiring careful comparison of results between experiments using different antigen design.11 Finally, careful attention to overall protein quality is essential for any proteins used as serology assay antigens. In many cases, lack of quality or batch-to-batch reproducibility of proteins can lead to failed assays. All proteins generated need to be carefully quality controlled with regard to such factors as aggregation, monodispersity, and purity. To control for batch-to-batch variability, processes such as differential scanning fluorimetry (“thermal shift assays”) can be used to ensure that past and present assays are not impacted by differences in the proteins themselves. These techniques can also be used to verify that proteins can be frozen and thawed for use in assays, or if they need to be single-use aliquots to ensure consistent quality.

Additionally, certain antigens can be produced in non-mammalian culture systems with high yield and high purity, making them attractive for laboratories with lower access to funding that makes the more complex antigen production pipelines impractical.12

Specific to the SARS-CoV-2 virus and the human immune response to infection, N antibodies appear rapidly in the infection timeline, but fade within six months of disease onset, making this antigen subject to false negatives.13,14,15,16 RBD antibodies can take time to develop in comparison to antibodies against the full spike protein, making RBD-only assays also subject to missing positives. The full spike protein can have higher background readings from non-specific binding but can be optimized to minimize false positives. Furthermore, regions that may be subject to variant mutations (e.g., RBD in B.1.351 and P1 variants of SARS-CoV-2) can inhibit antibody binding to RBD, and as such are more subject to mutation-induced signal loss, though at the time of writing this manuscript current variants do not alter serologic assay performance.17 Ultimately, a combination of multiple antigens is required for detection of seropositivity. Utilization of antigens that are not present in a vaccine formulation but present in viral infection (e.g., N protein for S-based vaccines) can be used to differentiate between vaccine-induced seropositivity and infection-induced seropositivity.

Endemic virus repertoire and its effect on test specificity

Any reference to the sensitivity and specificity of an assay must be completed within the population in which the assay is implemented. Most commonly, assay development occurs in areas with large funding resources for research and development and access to investors to further develop those technologies, such as North America and Europe; however, the populations present in these geographic areas are not subject to exposure to diseases that are more common in other climates.

For example, mainland China and Southeast Asia have the highest abundance and diversity of bats, which are known to carry zoonotic diseases including coronaviruses. Horseshoe bats (Rhinolophus spp.) are considered the reservoir host for SARS-CoV-2, SARS-CoV (responsible for the 2003 outbreak), and other related bat viruses such as RaTG13 that belong to the Sarbecovirus subgenus of betacoronaviruses.18,19,20 Recent reports of sarbecoviruses, a subgenus of Coronoviridae, highly homologous to SARS-CoV-2 in horseshoe bats that are found in Japan (96% homologous and sequence identity of 81.47%), Thailand (96% sequence identity), and Cambodia (92.6% sequence identity) in 2013 and 2010, respectively.18,21,22 This is of clinical importance in serological testing because in Southeast Asia, as well as more rural provinces in Yunnan Province in southern China, the agricultural practice of bat guano farming is popular and exposes these populations at a much higher frequency to bats than elsewhere. One serosurvey performed after the 2003 SARS-CoV-1 outbreak showed 2.7% seropositivity to SARS-CoV-1 in Yunnan farmers without history of travel.23 Given that guano farming is done via the construction of artificial roostings near personal domiciles or at cave entries, it is plausible to consider increased coronavirus exposure in these populations as they typically do not wear personal protective gear to collect these nutrient-rich fecal droppings that they then sell to farmers.24,25 Contemporary surveys of bats in mainland Asia and Southeast Asia reveal high levels of homology between SARS-CoV-2 and circulating virus populations in bats.22,26 While published serosurveys are lacking in Southeast Asia, we would anticipate higher than expected seroreactivity in these populations compared to their North American or European counterparts. Without proper region-specific optimization of assays in serosurveys, public health decision-making may be misinformed on the burden of SARS-CoV-2 disease in-country.

Furthermore, in other parts of the globe different coronaviruses are prevalent and endemic. Northern Africa and the Middle-East harbor reservoirs of Middle East Respiratory Syndrome Coronavirus (MERS-CoV) in camels that are indigenous to these climates. When SARS-CoV-2 ELISA assays optimized in the United States were tested against pre-pandemic blood samples from West Africa (Mali), a large false positive rate was detected, variant upon the antigen type with N resulting in higher false positive rates than S and RBD antigens.27 Reactivity to S antigen of other coronaviruses did not correlate well with SARS-CoV-2 reactivity, so other factors may contribute in Mali. For example, false positivity on commercial SARS-CoV-2 serological assays was related to elevated anti-Plasmodium IgG levels in Nigeria,28 and a recent study suggests acute malaria infection may generate cross-reactive antibodies to the terminal sialic acids on S antigen.29 When adjusting for the specific population, these assays could be utilized in large-scale seroprevalence studies to accurately measure disease spread in a population.

Samples collected from the United States showed an increased signal intensity in ELISA for seasonal coronaviruses HCoV-OC43 and HCoV-HKU1 when the individual was SARS-CoV-2 seropositive, suggesting cross-reactivity of anti-S SARS-CoV-2 antibodies with OC43 and HKU1 spike proteins. This could potentially confound future serologic studies of these season or “common cold” coronaviruses due to cross-reactivity.30,31

Standardization of controls

Beyond the assays themselves, equally as important are the controls used to quantifiably standardize results. Historically, this has been done through the imprecise evaluation of sensitivity and specificity based off convalescent clinical control samples and seronegative control samples derived from blood donors. While critical for interpretation of the assay in different populations, in order to fully standardize assays and allow for cross-comparisons, there is a need to adopt a long-lasting and widely available standard set of controls. Initial assays of antibody prevalence were developed in the 1940s in the context of hematology (Coombs tests),32 almost 40 years before the development of the first recombinant monoclonal antibodies in the 1980s.33 Monoclonal antibodies can be present in specific known concentrations that allow for generation of a sigmoidal regression through a dilution curve that will produce the upper- and lower limits of detection based on a standardized reagent. Adoption of these engineered constant reagents will allow for direct comparison of various assays without the need for human samples which are limited in their distribution potential and of unknown absolute concentration.

Conclusions

There are multiple steps of materials engineering that need to be considered when developing serologic assays for global infectious diseases. Main considerations include cold chain limitations, ease of sample acquisition, and assay controls and validation specific to the regional population’s exposure to coronaviruses. There are facets of each stage that can be approached with engineering solutions, from development of more stable reagents through chemical and protein engineering, to low-cost systems for scalable technologies. Some technologies already exist in the laboratory and merely need to be adopted in a widespread fashion. All aspects of sample acquisition to analysis will need to be considered for prevention of far-reaching infectious diseases, and support nations that bear the brunt of endemic viruses due to indigenous zoonotic vector species.