Background

It is estimated that 71 million people are infected with hepatitis C (HCV) (defined as those with viraemic infection) and 257 million with hepatitis B (HBV) (defined as HBsAg positive) worldwide [1,2,3,4]. Both HBV and HCV infection predominantly affect persons in low and middle income countries [2, 5]. Late complications of HBV and HCV infection are cirrhosis and hepatocellular carcinoma. Overall, there were 1.34 million deaths attributable to these complications in 2015 with a trend towards increasing deaths since 1990 [2]. Most patients are not aware of their infection until they have advanced complications of the disease [6] due to lack of access to and affordability of testing. Just as an increase in access to treatment is important [7, 8] early identification is paramount and also cost-effective [9] to avoid disease complications.

HBsAg and HCV-Ab screening is traditionally done by serology using either laboratory based enzyme-immuno-assay (EIA) or a point-of-care rapid diagnostic test (RDT). While several rapid lateral flow tests exist for HBsAg and HCV-Ab detection, their quality is variable or unknown and in particular there is a paucity of good quality HBsAg RDTs available on the market [10, 11]. Four HCV-Ab RDTs are WHO prequalified as of August 2017 but none of the HBsAg RDTs have met the requirements for WHO prequalification [12]. The choice of which format of serological assays to use in a programmatic setting will depend on a variety of factors; most importantly, ease of use and the characteristics of the testing site, such as storage facilities, infrastructure, level of staff skills and cost. Confirmation of active viraemic infection is done using nucleic acid testing (NAT) for HCV RNA and HBV DNA or using serological assays for detection of HCV core antigen.

In order to facilitate more widespread uptake of testing for HBV and HCV, there needs to be greater access to diagnostic assays. The use of dried blood spots (DBS) for transportation using fingerstick (in adults and older children), or heel pricking (in neonates and infants) sampling of capillary blood and subsequent analysis with automated high-throughput laboratory-based EIA represents another affordable alternative to testing using RDTs, particularly in settings with limited infrastructure. Another advantage of DBS use in low resource settings [13] is that capillary blood collection does not require a trained health worker to perform venipuncture and that DBS utilizing capillary blood needs less volume than venepuncture. Furthermore, no centrifuge or even basic laboratory facilities with electrical power are needed to prepare plasma [14]; and since transport and handling do not require high skill or a cold chain, the risk of contamination is reduced [15]. The main disadvantage of DBS is that the existing commercial assays have not been validated or received regulatory approval with this method of sample collection and transport.

Despite this limitation, DBS have been increasingly used in recent years to screen for a number of viral diseases, including HIV and viral hepatitis [13, 16,17,18,19]. Some studies have suggested that the use of DBS may increase uptake of hepatitis testing among certain vulnerable risk groups [20,21,22]. While there have been several systematic reviews on diagnostic performance of RDTs for HBsAg and HCV-Ab [10, 11, 23] and on the use of POC tests in viral hepatitis testing [10, 24, 25], and various validation studies of diagnostic accuracy studies aiming to validate DBS have been performed [26, 27] including a systematic review of HCV RNA detection with DBS [28], to our knowledge there has been no summary of evidence on diagnostic accuracy for HBsAg and HCV-Ab testing using DBS. We have conducted two systematic reviews and meta-analyses: one on the diagnostic accuracy of HCV-Ab and the other on the diagnostic accuracy of HBsAg from DBS samples compared to venous samples in persons identified for hepatitis testing. This review informed the WHO guidelines on testing for chronic HBC and HCV in low and middle income countries [29] and was conducted in conjunction with systematic reviews on the diagnostic accuracy of the detection of HBV DNA and HCV RNA from DBS [30].

Methods

Search strategy and selection criteria

PRISMA reporting guidelines were followed [31] and the QUADAS-2 tool was used to estimate quality of studies as a risk of bias tool [32]. We conducted two systematic reviews and meta-analyses on the diagnostic accuracy of HBsAg and HCV antibody detection from DBS compared to venous blood. We searched English language studies using five databases (MEDLINE, Web of Science, EMBASE, Global Health and LILACS) with the following search terms: DBS, Dried blood spot, Dry blood spot, filter paper, Guthrie paper, hepatitis, hepatitis B, hepatitis C, HBsAg, HBV, HCV, HCV RNA, HBV DNA (adapted to databases) on 1.9.2015 and updated it on 22.8.2017.

Abstracts were included for fulltext review if inclusion criteria were fulfilled (namely DBS samples and plasma or serum samples used for detection of HCV-Ab and/or HBsAg) or if exclusion of the abstract could not be performed solely on the basis of the information of the abstract. Eligible studies included comparisons of an index test HCV-Ab and HBsAg using DBS with a reference test HCV-Ab and HBsAg using serum or plasma and following outcomes were analysed: correlation, regression coefficient, specificity, sensitivity and positive/negative predictive values. We included studies regardless of whether the assay used was commercially available or used an in-house technique, and testing on DBS and plasma/serum did not have to use the same assay. There were no date, geographic or population demographic restrictions, and individuals of all age groups were included.

Screening and data extraction

Two independent reviewers performed title, abstract and full-text review to identify eligible studies. Disagreements were resolved by consent of the reviewers. The references of articles selected for inclusion were also reviewed for additional articles to review. The same data extraction procedure was performed in duplicate for each study and included the following variables: author, publication and study dates, country, percentage of children and adults, age range, gender distribution, type of specimen used for DBS, specimen used as gold standard (plasma or serum), serological assay used, storage conditions and effect of storage conditions. Additional data and clarifications were sought by contacting study authors where necessary.

Risk of bias and quality assessment

The QUADAS-2 tool categories of study design, index and reference test conduct and reporting of patient flow were adapted for use to assess the risk of bias in included studies. In particular, studies where there was consecutive sampling of patients were rated as being at low risk of bias, and a case control design as at high risk of bias. Studies reporting use of a consistent protocol for index and reference testing for each sample and description of patient flow were rated at low risk of bias) while lack of reporting or inconsistent use of a protocol were considered at high risk of bias.

Statistical data analysis

Summary estimates of sensitivity and specificity were generated with a bivariate random effects meta-analysis using maximum likelihood estimate and 95% confidence intervals. We calculated positive (Sensitivity/(1-Specificity) and negative (1-Sensitivity/Specificity) likelihood ratios directly from the pooled sensitivity and specificity. Several studies did not have sufficient quantitative data to contribute to both sensitivity and specificity - for example no samples with a negative reference test. In such cases, we performed a univariate random effects meta-analysis of the sensitivity and/or specificity estimates separately to incorporate studies that did not report estimates for both. We then compared univariate analyses with the result of the bivariate analysis, in order to make complete use of all the available data.

Heterogeneity was assessed visually from forest plots and by reporting an estimate of τ2 corresponding to the variance of the logit-transformed specificity and sensitivity, which can be interpreted as a measure of between-study variability [33]. Stratified analyses were performed by type of assay used for the index test and by storage conditions. Some studies exposed individual samples or subsets of samples to varying storage conditions or used them to define limits of detection. Since these subsets were not necessarily part of the diagnostic accuracy evaluation we included them in the narrative analysis of the impact of storage conditions on the validity of results. Statistical analysis of the data was performed using STATA 14 (StataCorp. 2015. Stata Statistical Software: Release 14. College Station, TX: StataCorp LP).

Results

Included studies

For HBsAg detection, our search identified 521 abstracts, of which 65 full text articles were assessed for eligibility, and 23 [34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56] met criteria for inclusion in the review (Fig. 1 & Table 1). Six studies came from Africa [34, 36, 39, 41, 47, 51], nine from Europe [37, 38, 42,43,44, 50, 53, 55, 56], four from the Americas [35, 45, 46, 54], three from Asia [40, 48, 49]and one from Australia [52]. Four studies did not have sufficient data for sensitivity and specificity analysis [40, 43, 44, 53]. Most studies provided limited information on the characteristics of participants. One study only included pregnant women [35], one only children [51] and one only HIV-infected persons [39]. All studies used the same commercial assays for their reference and their index test, respectively. Five studies from the 1980s used Ausria II (Abbott), whereas newer studies used diverse commercial assays, including Enzygnost (Behring) and ARCHITECT (Abbott) (see also Table 1).

Fig. 1
figure 1

PRISMA flow chart of studies included in the systematic review of detection of hepatitis B Surface antigen from DBS samples compared to venous blood sampling (plasma/serum)

Table 1 Characteristics of studies included in the systematic review on HBV surface antigen detection from DBS compared to venous blood sampling

For HCV-Ab detection, our search identified 521 abstracts, of which 101 full text articles were assessed for eligibility, 23 studies met the criteria for inclusion in the review [26, 27, 37, 39, 40, 43, 55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71]. Nineteen had sufficient data for inclusion in the quantitative meta-analysis (Fig. 2 & Table 2). Four studies did not have sufficient data for sensitivity and specificity calculations [60, 61, 67] or described a Receiver Operating Characteristic (ROC) curve only [40], and contacting the authors did not yield further information. Two of the 19 studies provided sufficient data to calculate sensitivity but did not have any negative samples to assess specificity [37, 56] (Fig. 2 & Table 2). Of the 23 studies included in the review, most originated from Europe, North America and Australia (16 in total). Four studies were from South America [26, 59, 69, 70] and three from South-East and Central Asia (India [62], Mongolia [60] and Malaysia [40]). All included studies had been published between 1997 to 2017. Most studies used 50 μl to 100 μl of whole blood on filter paper to test for HCV-Ab. Six studies reported using capillary blood samples [26, 55, 57, 58, 63, 66], while others either did not report on this or used venous blood. One study included children [68], however, age ranges or gender for patients were not always reported. Various assays for detection of HCV-Ab in serum and DBS were used (see also Table 2).

Fig. 2
figure 2

PRISMA flow chart of studies included in the systematic review on detection of hepatitis C antibody detection from DBS samples compared to venous blood sampling (plasma/serum)

Table 2 Characteristics of studies included in the systematic review on HCV antibody detection from DBS compared to venous blood sampling

Assessment of study quality and risk of bias (Tables 3 & 4)

Ten studies investigating HBsAg did not use a random or consecutive sampling method [41,42,43,44,45,46,47,48, 50, 55, 56]. Only one study reported on blinding of laboratory personnel to results of the reference test while performing diagnostic tests [35]. Overall, the study quality was rated as moderate. For HCV-Ab detection, eight of the included studies used case-control designs [27, 43, 64, 66,67,68,69, 71] and only two reported consecutive sampling [39, 40]. Only three studies reported blinding of laboratory personnel to results of the reference test [27, 57, 66]. However, most of the other studies (as with HBsAg detection) used and reported a clear and consistent protocol for both reference and index test, so this was not judged as a major cause of bias.

Table 3 Risk of bias in studies included in the systematic review on detection of HB surface antigen
Table 4 Risk of bias table for HCV antibody

Diagnostic performance

In general, based on the 19 studies evaluated, testing for HBsAg using DBS maintained good accuracy compared to testing using plasma or serum from venipuncture. Reported sensitivity estimates ranged from 79%–100% with a pooled estimate from a meta-analysis of 98% (95%CI 95%–99%), and specificity ranged from 89%–100%, with a pooled estimate of 100% (95%CI 99–100%) (Fig. 3). The positive likelihood ratio was 703 (95%CI 107–4615) and the negative likelihood ratio was 0.02 (95% CI 0.008–0.04). Bivariate and univariate estimates of pooled sensitivity and specificity were similar.

Fig. 3
figure 3

Forest plot of sensitivity and specificity of hepatitis B Surface antigen serological diagnosis in DBS compared to venous blood samples

Based on the 19 studies included in the quantitative meta-analysis for HCV antibody, the reported sensitivity for HCV-Ab using DBS ranged from 70% to 100% and specificity ranged from 95 to 100%. The pooled bivariate estimate of sensitivity and specificity was 98% (95%CI 95–99) and 99% (95%CI 98–100), respectively (Fig. 4). The positive likelihood ratio was 361 (95%CI 61–2163) and the negative likelihood ratio 0.02 (95%CI 0.01–0.05). Four included studies reported also measures of agreement with kappa values ranging from 0.87–1 between DBS and venous blood samples [26, 39, 59, 60]. Bivariate and univariate estimates of pooled sensitivity and specificity were similar.

Fig. 4
figure 4

Forest plot of sensitivity and specificity of hepatitis C antibody detection in DBS samples compared to venous blood samples

Visual assessment as well τ2 and its p-value showed moderate heterogeneity in the bivariate analysis of studies. To further assess the heterogeneity, we evaluated different parameters with potential to affect accuracy, including assay type and cut-off used, and storage conditions.

Effect of test and cut-off used

There are no standardised cut-offs for HBsAg detection using DBS. Eight of the studies reported a cut-off based on ROC analysis from the same set of samples as the validation set. Several of the included studies noted that the ideal cut-off (as suggested by ROC curves) for determining test positivity should be higher for DBS samples than for plasma or serum samples. Other studies have indicated that this was due to the small sample volume used in DBS (commonly 50 μl). Indeed the one study with low sensitivity (79%) and specificity (89%) [36] only used 25 μl of blood on filter paper (37).

Different assays were used in HCV-Ab detection. Cut-offs varied widely and as no standardized cut-offs exist, investigators for many studies determined their own cut-off via ROC curves using the same set of samples. Nine of the included studies reported on cut-offs used for DBS [26, 27, 40, 59, 60, 64,65,66]. Stratification by type of test or cut-off used was not possible as the number of strata would have been large and the results difficult to interpret. Stratification in a pooled analysis based on amount of blood (50 μl versus >50 μl) or sampling method (venous blood vs capillary blood) showed similar sensitivity and specificity estimates (data not shown).

Effect of storage conditions

For HBsAg detection, the effect of a range of storage conditions was evaluated in six studies, including storage temperature ranging from −20 to 33 °C and storage duration ranging from overnight to 180 days. In general, storage at room temperature or higher (30–33 °C) did not clearly affect accuracy of testing and no decrease in sensitivity was found with prolonged storage at ambient temperatures [34, 44,45,46, 48, 51, 54].

Four studies of diagnostic accuracy for HCV-Ab using DBS samples evaluated different storage conditions in a subset of samples that did not contribute to the diagnostic accuracy evaluations. In one study, three out of three previously negative samples exceeded cut-off values (i.e. would have been interpreted as positive) after storage for 3 days at room temperature [66]. Similarly, one of the included studies showed that after 6 days of storage at room temperature, the cut-off values were exceeded and previously negative samples became positive [63]. Two studies showed relative stability at room temperature for up to 60 days, but found that storage at −20 °C was associated with less variation in quantitative values [26, 59].

One of the 19 studies contributing to the diagnostic accuracy calculations kept study samples at room temperature for more than 24 h [55]. A pooled analysis stratifying studies according to whether samples had been left at room temperature for longer than 4 h or not did not find any difference in performance and so did not explain the heterogeneity observed in the meta-analysis (data not shown).

Sensitivity analysis

In a sensitivity analysis, we found no difference in estimates of diagnostic performance between those studies that reported consecutive sampling and those with case-control study design (data not shown).

Discussion

To our knowledge, this is the first comprehensive and systematic review to summarize the utility of DBS for HCV-Ab and HBsAg testing. Overall, we found very good diagnostic accuracy and precision for detection of HCV-Ab and HBsAg using DBS samples. These findings were based on a larger number of studies compared to the companion systematic reviews undertaken for HBV DNA and HCV RNA using DBS [30] and were rated as moderate quality. This provides support for the use of DBS from capillary blood for diagnostic testing of HBV or HCV, especially in community settings and hard to reach populations where there is limited access to venipuncture or inadequate laboratory infrastructure to prepare or transport plasma samples or limited access to rapid diagnostic tests.

The WHO 2017 testing guidelines recommended the use of a single quality-assured serological assay (i.e either a laboratory-based EIA or RDT to detect HBsAg and anti-HCV that meet minimum performance standards, and where possible delivered at the point of care to improve access and linkage to care and treatment. The guidelines also made a conditional recommendation to consider the option of DBS specimens for HBsAg and HCV-Ab serology testings in settings where there are no facilities or expertise to take venous whole blood specimens; or RDTs are not available or their use is not feasible, or there are persons with poor venous access (e.g. drug treatment programmes, prisons) [29]. This recommendation was rated as conditional mainly because of the slightly lower accuracy using DBS compared to venous blood sampling and uncertainty regarding the generalizability of studies included in this review (in particular regarding impact of different storage and transport conditions).

There were several limitations to this review. First, we did not include studies in languages other than English and no unpublished data from laboratories were included. Second, while only a few studies were rated as having a low risk of bias, overall the quality of evidence from studies was rated as moderate. Third, most studies did not use fresh samples and no uniform protocol was applied. Fourth, there is also a risk of overestimation of pooled sensitivity and specificity by including studies that applied cut-off levels derived from the same study population. No stratified analysis was possible for the type of test within this review. Therefore, we are unable to recommend the use of certain commercial tests over others using DBS testing of HCV-Ab or HBsAg or to suggest a cut-off specific to a test that should be used for DBS samples.

Several studies found that when assessing stability in separate sample sets samples became false-positive after longer exposure at ambient temperatures for both HCV-Ab [27, 66] and HBsAg [44, 45]. Further data is needed to understand the stability of DBS with different environmental conditions (i.e. temperature and humidity). The review highlights the need for standardized validation of specific tests with DBS.

While some studies published detailed protocols on how to collect and analyze DBS [37], no manufacturers to date have provided instructions on how to use their assays with DBS (including processing methods and possibly different cut-offs). There is a need for manufacturers to validate their assays and provide instructions for the use of DBS even if no claim for regulatory approval is made.

Implementation of DBS and future work:

Consideration of the use DBS sampling for HBV and C serological or nucleic acid testing or both will depend on the health-care setting and infrastructure, and epidemiological context. If good-quality RDTs are available that can be performed using capillary blood then the focus may be more on prioritizing DBS for NAT testing of HBV DNA and HCV RNA. However, if RDTs are not available and there are no facilities or expertise to take venous blood samples, then DBS testing may be equally important to increase access to serological testing as well as NAT. Conveniently, both could be performed from the same specimen. A further situation where DBS may be applicable is where large numbers of individuals are being tested simultaneously. DBS may also be useful where polyvalent screening for multiple diseases is done, but where multiplex RDTs for this purpose are not available or are more costly. The adoption of DBS sampling in a hepatitis testing programme requires the availability of a centralized laboratory competent at handling and processing this sample type. The current lack of validation of assays on this sample type for HBsAg or HCV-Ab and manufacturer’s guidance on the optimal pre-analytical treatment of specimens makes quality control challenging.

Further data is also needed on demonstrating the stability of DBS with individual tests under different field transport and storage conditions likely to be encountered in low resource settings. as well as on the type of test used and how long DBS samples can be left at different temperature or humidity levels. Since DBS requires only a small sample of blood to maintain sensitivity, a higher analytical cut-off may be required to maintain sensitivity and overcome variability at the lower end of the dynamic range of the test as compared to higher volume plasma samples – even if in our limited stratified analysis on this we did not find an effect of blood volume on diagnostic accuracy of HBV or HCV. Further insight can also be gained from individual patient analyses of the existing data, which was beyond the scope of this review.

Conclusion

While diagnostic accuracy of DBS for HCV-Ab and HBsAg testing is adequate in studies included in this review, lack of standardization of testing protocols and uncertainty about their use in field conditions and the appropriate assay cut-offs limits the wider application of DBS.