Accuracy of clinical pallor in the diagnosis of anaemia in children: a meta-analysis

Chalco, Juan P; Huicho, Luis; Alamo, Carlos; Carreazo, Nilton Y; Bada, Carlos A

doi:10.1186/1471-2431-5-46

Accuracy of clinical pallor in the diagnosis of anaemia in children: a meta-analysis

Research article
Open access
Published: 08 December 2005

Volume 5, article number 46, (2005)
Cite this article

Download PDF

You have full access to this open access article

BMC Pediatrics Aims and scope Submit manuscript

Accuracy of clinical pallor in the diagnosis of anaemia in children: a meta-analysis

Download PDF

Juan P Chalco^1,2,
Luis Huicho^1,2,3,
Carlos Alamo^1,4,
Nilton Y Carreazo^3,5 &
…
Carlos A Bada^3,5

16k Accesses
52 Citations
Explore all metrics

Abstract

Background

Anaemia is highly prevalent in children of developing countries. It is associated with impaired physical growth and mental development. Palmar pallor is recommended at primary level for diagnosing it, on the basis of few studies. The objective of the study was to systematically assess the accuracy of clinical signs in the diagnosis of anaemia in children.

Methods

A systematic review on the accuracy of clinical signs of anaemia in children. We performed an Internet search in various databases and an additional reference tracking. Studies had to be on performance of clinical signs in the diagnosis of anaemia, using haemoglobin as the gold standard. We calculated pooled diagnostic likelihood ratios (LR's) and odds ratios (DOR's) for each clinical sign at different haemoglobin thresholds.

Results

Eleven articles met the inclusion criteria. Most studies were performed in Africa, in children underfive. Chi-square test for proportions and Cochran Q for DOR's and for LR's showed heterogeneity. Type of observer and haemoglobin technique influenced the results. Pooling was done using the random effects model. Pooled DOR at haemoglobin <11 g/dL was 4.3 (95% CI 2.6–7.2) for palmar pallor, 3.7 (2.3–5.9) for conjunctival pallor, and 3.4 (1.8–6.3) for nailbed pallor. DOR's and LR's were slightly better for nailbed pallor at all other haemoglobin thresholds. The accuracy did not vary substantially after excluding outliers.

Conclusion

This meta-analysis did not document a highly accurate clinical sign of anaemia. In view of poor performance of clinical signs, universal iron supplementation may be an adequate control strategy in high prevalence areas. Further well-designed studies are needed in settings other than Africa. They should assess inter-observer variation, performance of combined clinical signs, phenotypic differences, and different degrees of anaemia.

View this article's peer review reports

The Accuracy of Physical Examination to Diagnose Anemia Among Patients Five Years or Older: A Systematic Review

Article 31 May 2022

Preventing overuse of laboratory diagnostics: a case study into diagnosing anaemia in Dutch general practice

Article Open access 31 July 2020

The effect of anaemia and abnormalities of erythrocyte indices on HbA_1c analysis: a systematic review

Article 21 May 2015

Background

The global prevalence of anaemia is estimated in 2 billion people, that is, in about 30% of the worldwide population[1]. An even larger number of people present iron deficiency [1]. Every 9 of 10 persons affected of anaemia live in developing countries [2]. Anaemia prevalence in Latin America is 46% in children [3], with differences within countries. In Peru and Chile it is 50% and 8%, respectively [4, 5].

Anaemia is related to impaired physical growth and mental development [6]. It is also associated to a higher risk of infant and child mortality, particularly when it co-exists with malnutrition and other risk factors [7].

It is therefore important to make a timely and accurate diagnosis and initiate an early intervention to reduce the negative impact of anaemia. The laboratory diagnosis of anaemia through any of several techniques is not widely available and its cost is often unaffordable in poor areas of the world. This stimulated several studies to assess the accuracy of clinical signs for screening of anaemia.

The Integrated Management of Childhood Illness (IMCI) strategy developed by the World Health Organization recommends the use of palmar pallor as the initial screening tool [8]. This recommendation is based mainly on the interpretation of results of studies performed in the Gambia [9], Kenya [10], and Malawi [11]. None of these studies showed in fact a clear superiority of palmar pallor. Only the Kenya study showed that palmar pallor performed better than conjunctival pallor when used by health workers but not by study physicians [10]. One of them used packed red cells volume as the gold standard [9]. Packed red cells volume is a controversial gold standard for anaemia, as it varies with different physiologic and pathologic conditions such as hydration status, and its correlation with haemoglobin is not optimal [12].

Thus we were prompted to perform a systematic review to assess the accuracy of clinical pallor in the diagnosis of anaemia. The specific objective of the study was to answer the question of whether there is a clinical sign that best predicts the presence or absence of anaemia in children. The signs most frequently assessed in primary studies are conjunctival, palmar and nailbed pallor. The review did not include respiratory and cardiovascular signs as they are unspecific for anaemia and are furthermore related to severe anaemia with haemodynamic repercussion.

Methods

The review was aimed to include all studies performed in children aged 0 through 18 years old fulfilling pre-established inclusion criteria.

Inclusion criteria

1. Studies on individual or combined accuracy of conjunctival, palmar or conjunctival pallor in the clinical diagnosis of anaemia.

2. Studies performed in children 0 through 18 years old.

3. Original articles. Review articles and letters to editors were not considered, except when they had enough information to assess the diagnostic performance of clinical signs of anaemia.

4. Prospective or retrospective studies performed in outpatient or inpatient children.

5. Articles with enough information to assess the diagnostic performance of clinical signs of anaemia, namely sensitivity, specificity, likelihood ratios and predictive values.

6. Studies in which haemoglobin was used as the gold standard.

Exclusion criteria

1. Studies not related to assessment of clinical signs in the diagnosis of anaemia.

2. Studies with insufficient information for deriving the diagnostic performance of clinical signs.

3. Studies in which it was not used a gold standard or those in which haemoglobin was not the gold standard

Search strategies

Two independent reviewers (JPC, CA) made an Internet search of the literature. The databases searched were the National Library of Medicine database from 1966 through January, 2002 and EMBASE from 1986 through January, 2002. In addition we searched the American and Caribbean Health Sciences Literature (Literatura Americana y del Caribe en Ciencias de la Salud, LILACS) database from 1986 through February, 2002 and the African Health Anthology database from 1924 through July, 2002. This search was combined with a manual tracking of articles deemed relevant and found in the references section of primary and qualitative review articles. Details of the key words used are presented as an appendix [See Additional File 1].

The abstracts of the primarily identified articles were read by the same two independent reviewers to assess whether they were related to the clinical diagnosis of anaemia. Those deemed to be relevant were then retrieved and read as full papers. Any discrepancy between the reviewers was solved by consensus.

Methodological quality of primary studies

We assessed the methodological quality of primary studies according to modified published recommendations [13]. The quality score was derived by ascribing 2 points for each of the major criteria related to systematic and blind application of clinical signs and gold standard to all patients, and 1 point for each of the remaining criteria. The maximum possible score was 16 and the minimum was 0. The final validity rating was reached by consensus. The quality criteria details are presented as an appendix [see Additional File 2].

Methods for calculating the diagnostic performance of index tests

Table 2 × 2 were reconstructed from the original data. Sensitivity, specificity, predictive values, and likelihood ratios with their corresponding 95% CIs were calculated for each primary study. Calculations were performed separately for each clinical sign and by different haemoglobin thresholds used in the primary studies. Whenever the 2 × 2 tables contained a 0 cell, 0.5 was added to all cells to avoid undefined results.

The diagnostic odds ratio (DOR) of each primary study was calculated according to the following formula [14]:

DOR = [Sensitivity/(1-sensitivity]/[(1-specificty)/specificity]

The DOR represents the ratio of the odds of a positive test result in subjects with the disease to the odds of a positive test result in subjects without the disease. A DOR of 1 means that the test has no discriminative power. When the DOR is more than one, the odds of a positive test result are higher in the diseased group.

Methods of homogeneity assessment

Studies were analyzed separately for homogeneity of results by clinical sign and by haemoglobin threshold through chi-square test for proportions (sensitivity and specificity), through Cochran Q for LR's and DOR's [15] and through DOR graphic plotting of individual studies, along with their 95% CI graphs [16].

Mathematical pooling

Pooled proportions (sensitivity and specificity) were calculated through the weighted averages taking into account the sample size of each study. Likewise, DOR's and LR's were pooled. The Mantel-Haenszel fixed effects model was planned to use if the studies were homogeneous for the diagnostic performance indexes and the DerSimonian Laird random effect model if they showed heterogeneity [17]. The 95% CIs were also calculated for all the pooled diagnostic indexes [16]. LR's and DOR's were recalculated after outlier's exclusion.

Diagnostic performance and 95% CIs of individual studies, homogeneity assessment, mathematical pooling and weighing were performed through the use of Metadisc software version Beta 1.1.1 [18].

Exploration of heterogeneity

Potential sources of heterogeneity on diagnostic performance were assessed through Metaregression [18]. Pre-specified potential influential covariates included clinical setting (outpatients or inpatients), continent of study (Africa, Asia, Latin America), age group (children up to 5 years old, children older than 5 years old), technique of haemoglobin measurement (Hemocue^®, spectrophotometry, Coulter^®), whether or not the study setting was endemic for malaria or for intestinal worms, type of observer (physician, nurse, technician, parents), and methodological quality score (continuous variable). For each haemoglobin threshold category and for each test, multivariate metaregression was run including the above signaled covariates to assess whether any of them showed a significant influence on lnDOR. The metaregression was weighted by study size and the threshold effect was not considered, as there were not additional cutoff points within each pre-specified haemoglobin threshold.

Post-test probabilities of anaemia

To graphically illustrate the relative usefulness of each particular clinical sign of anaemia at each haemoglobin threshold, different pre-test probability values were plotted against post-test probability values for both positive (LR+) and negative (LR-) results, before and after outlier's exclusion. The post-test probability for a disorder is another way to assess the value of a diagnostic test. It represents the chances that your patient has a disease. It incorporates information about the disease prevalence, the patient pool, and specific patient risk factors (pre-test probabilities) and information about the diagnostic test itself (the LR). The LR is used to assess how good a diagnostic test is and to help in selecting an appropriate diagnostic test(s) or sequence of tests. The LRs have advantages over sensitivity and specificity because they are less likely to change with the prevalence of the disorder, they can be calculated for several levels of the symptom/sign or test, they can be used to combine the results of multiple diagnostic test and they can be used to calculate the post-test probability for a target disorder. Post-test probabilities can be calculated for different clinical scenarios or settings with various possible pre-test probabilities (disease prevalence), using positive (LR+) and negative (LR-) results for the interest tests.

Results

Adapted QUORUM statement checklist and flow diagram of the study are included as an appendix [see Additional File 3].

Literature search

The number of primarily found articles was 225. Two hundred and two papers were excluded after abstract reading because they were nor relevant to the study objective. Twelve studies were excluded after reading them as full papers, because they were not performed in children (8 studies) [19–26], did not use haemoglobin as reference test (1) [27], did not assess individual signs of anaemia (1) [28], did not present separately results for children (1) [29], or did not perform clinical assessment of pallor (1) [30]. Finally, eleven articles were included in the meta-analysis [10, 11, 31–39].

All the studies we found had been performed in developing countries, mostly in children underfive. Eight were performed in Africa[10, 11, 31, 35–39], one in Pakistan[32], one in Bangladesh and Uganda [34] and one in Brazil [33]. The Uganda component of one study was excluded as it used packed red cells volume as gold standard [34].

Most studies reported their results using pre-specified thresholds. All used one or more of the following haemoglobin categories: <11 g/dL, <8 g/dL, 7 g/dL, and < 5 g/dL. Only one study reported the results for 7 thresholds [36]. In this case, we re-constructed the results in the above noted 4 categories to allow the comparison of results with the other primary studies.

Table 1 summarizes main characteristics of primary studies, including the scores of methodological quality. There were studies that evaluated more than one sub-group of subjects and such results are shown separately.

Table 1 Summary of primary studies characteristics

Full size table

Homogeneity assessment

Chi-square test for proportions and Cochran Q for LR's and DOR's showed heterogeneity for results of primary studies within each threshold. For graphical display of the heterogeneity, 95% CIs for DOR's of individual studies are shown in Figure 1, 2, 3, 4.

Outliers at each haemoglobin category were identified through the DOR's graphs. Point estimates with confidence limits were plotted for each individual study. Those studies or results whose DOR's graphs were outside the 95% bounds of the pooled DOR were considered outliers. At haemoglobin <11 g/dL there was one outlier for palmar pallor [38]. At haemoglobin <8 g/dL there were 2 outliers for conjunctival pallor [10, 37], one for nailbed [37], and one for palmar pallor [37]. At haemoglobin <5 g/dL there was 1 outlier for conjunctival pallor [39], 2 for palmar pallor [10, 39], and one for nailbed pallor [39].