Introduction

Reliable and timely health information is an essential foundation of public health action and health systems strengthening, both nationally and internationally [1, 3]. Being able to plan for and address the health needs requires information about the health status of a community. This kind of information is mostly unavailable in developing countries, partly due to lack of vital registration. Information that is available is derived from hospitals, yet most illness and death occur outside the health system [8, 11].

The need for sound information is especially urgent in the case of emergent diseases and other acute health threats, where rapid awareness, investigation and response can save lives and prevent broader national outbreaks and even global pandemics [1].

Measurement of population health, its causes and distribution is fundamental to the development of evidence for health policies and for the evaluation and planning of health systems and intervention programs [12]. Valid, reliable and comparable measures of health states of individuals are critical components of the evidence base for health policy [12].

The government of Kenya, through the ministry of public health and sanitation has rolled out the community health strategy as a way of improving health care at the household level. This strategy involves community health workers collecting health status data at the household level, which is then used for dialogue at all the levels to encourage improvement in health status through informed decisions and actions by various stakeholders.

Community health workers have been known to collect health related data at the community level. They have been used to collect data for screening for Tuberculosis in many countries, achieving high rates of case detection in Bangladesh [5]. CHWs can prepare and interpret malaria Rapid Diagnostic Tests correctly and safely when supported by clear instructions and appropriate training. Findings from other studies show that a well-designed job aid and brief training can ensure high CHW performance [9]. This continues to improve in time through continuous practice. Community health workers have also been used to manage neonatal infections in rural India [2].

A lot of health interventions have involved the community health workers in reaching out to the community and successfully implementing these health interventions. Large scale involvement of community health workers in government initiatives and most especially to collect health data for use in the health systems has been minimal due to the assumption that the data may not be accurate, hence not useful by the government.

The Problem Statement

Western Kenya has consistently provided low health and development indicators despite presence of an array of interventions initiated by the NGOs and the government of Kenya [10]. These low performing indicators beg for multiple interventions and concerted efforts on all sides to ensure that a reversal is achieved. These interventions require frequent, timely, valid and accurate information on the health status of the targeted populations for them to be effective. This information is for the most part unavailable.

Population-based sample surveys, Demographic and Health Surveys (DHS), and sentinel surveillance methods, are commonly used as substitutes for more widespread, consistent health and demographic monitoring and intervention studies [7]. Nevertheless, sentinel surveillance methods, has been criticised for being expensive and time consuming, and can be applicable only in isolated populations. It is therefore necessary to determine whether other methods can be used to attain similar results, yet not as expensive in time and financial terms. With the rolling out of the community health strategy, a lot of information that can be used to determine the health status of a community can be available. Additionally, the information can be available to the local stakeholders which is not the case with sentinel surveillance data.

CHWs and other lay community workers have been known to provide health care services such as home visits, health education, referral and management of birth related problems at the household level and collect a wide range of health information [2, 6]. However, little is known as to whether this information can be relied on to measure population health, its causes and distribution. Moving from Demographic Surveillance Systems and Demographic Health Survey data to Community Based data collected by Community Health Workers to estimate burden of disease has not been looked at widely.

It is therefore necessary that the validity and reliability of the data collected by community health workers be determined, whether this kind of data can be used for planning and policy formulation for the communities from which it is collected and processed. This would go a long way to settle speculation on whether the data collected by these workers is of acceptable quality for use in determining the health status, its causes and leading to decisions on actions for its improvement.

General Objective

To investigate the reliability of Community Based Information.

Research Question

What is the reliability of data collected at the Community level by Community health workers?

Methodological Approach

Community health workers register and update household information on identified indicators twice a year. Ten percent of this data was re-collected, 1 week later, using the same tool by a team of well trained research assistants to provide the gold standard for the data collection, in order to validate the data. The team also conducted verbal autopsies in all households experiencing mortality to determine possible causes of death.

This study used the Test–Retest/stability Reliability which compares results from an initial test with repeated measures later on, the assumption being that if the instrument is reliable there will be close agreement over repeated tests if the variables being measured remain unchanged. The Kappa score, sensitivity, specificity and positive predictive values were also used to measure reliability and were arrived at as indicated on Table 1.

Table 1 The framework for calculating the predictive value and reliability of CHW collected data

Findings

Child Health

The comparison between Community health workers (CHWs) data and Research Assistants (RA) collected data on measles immunization coverage shows disparities between the two sets of data. A difference of approximately 10% was evident when the 2 sets were compared as shown in Fig. 1. Sensitivity/specificity analysis shows 49 and 51.4% respectively with a predictive value of 87.5. Pearson’s Chi significance of 0.032 and a Kappa score of −0.097.

Fig. 1
figure 1

Comparison of data collected on measles

A difference of approximately 3% as illustrated in Fig. 2 was noted in the exclusive breastfeeding variable showing little difference in the two sets of data. A significance of 0.000 for the Pearson chi square test was found. Sensitivity/Specificity analysis shows 77.1 and 52.6% respectively with a predictive value of 4.8.

Fig. 2
figure 2

Comparison of data collected on exclusive breast feeding

Maternal Health

Very little difference, less than 1%, in the two sets of data as illustrated in Fig. 3 was noted for the ANC variable. Sensitivity/specificity values noted were 48.7 and 51% respectively, with a predictive value of 65.5. Kappa score of −0.005.

Fig. 3
figure 3

Comparison of data collected on ANC 4+ times

Differences in the two sets of data for the skilled attendant variable was a little over 8% as in Fig. 4 indicating significant disparities between the sets. Sensitivity/specificity values noted were 51.8 and 57.1% with a predictive value of 68.75. Kappa score of 0.075.

Fig. 4
figure 4

Comparison of data collected on skilled attendant

Environmental Health

The latrine availability variable had good correspondence, which was less than 1%, indicating 99% agreement of the two data sets (See Fig. 5). Sensitivity/Specificity values were 48.9 and 52.9% respectively with a Predictive value of 90.5. Chi- square significance of 0.000 and Kappa score of 0.099.

Fig. 5
figure 5

Comparison of data collected on latrine availability

A difference of approximately 10% was noted for the ITN use variable suggesting significant disparities between the two sets of data as shown in Fig. 6. A significance of 0.000 was noted for the Person chi square test, while sensitivity/specificity values of 51.9 and 63.0% were noted respectively. Predictive value of 83.4 was found for the variable. Kappa score of 0.006.

Fig. 6
figure 6

Comparison of data collected on ITN use last night

More than 10% difference was noted for the staple food variable as indicated in Fig. 7. Pearson chi square test revealed a significance of 0.036. A kappa score of −.107 was found which had a significance of 0.036. Sensitivity/specificity analysis showed 44.4 and 43.6% respectively with a predictive value of 60.2.

Fig. 7
figure 7

Comparison of data collected on staple food availability

Discussion

Under child health measles showed less than 10% discrepancy between RA and CHW collected data compared to exclusive breast feeding. The association between the two sets of observation was also stronger in exclusive breast feeding (sig = 0.00, for Pearson chi- square test) than in measles (sig = 0.032, for Pearson chi- square test). There was also little agreement between the two sets of observation as indicated by the kappa score k = 0.097. This concurs with Bang et al. in [2] who indicated that CHWs have been involved in management of child illness. In Kenya, CHWs have also often been used in measles campaigns which increase their awareness of this indicator.

The little difference in ANC did not hold for agreement giving a score of −0.005 indicating that the two observations showed less agreement than would be expected just by chance. However, it is still important to note that this variable had very good concurrence between both sets of data. This may be because the CHWs have been involved in health education of pregnant women during the ANC sessions at health facilities. While for the skilled attendant variable, inasmuch as indicating disparities at a glance, also showed less agreement (k = 0.075). It should also be noted that the discrepancy was of less than 10%.

The latrine availability variable had good correspondence, which was less than 1%. Sensitivity/Specificity values were 48.9 and 52.9 respectively with a Predictive value of 90.5. Chi- square significance of 0.000 and Kappa score of 0.099. Latrine availability is a straightforward and direct to collect data on, and it’s no surprise that it had good correspondences. This is an issue that sheds light that there are variables that we can entirely depend on non professional to collect and still get valid results but those health variables that are a bit complicated makes it very difficult for one to rely on as valid and reliable information that can be used for planning and policy formulation purposes.

A difference of approximately 10% was noted comparing data collected by CHWs and Research Assistants in the use of ITN. Using the health variable it clearly shows that there is a significant disparity between the two sets of data of 10%. With a significance of 0.000 using the Person chi square test, while sensitivity/specificity values of 51.95 and 63.0% were noted respectively. Predictive value of 83.4 was found for the variable. Kappa score of 0.006. This poses a challenge on the reliability of data collected by CHWs for planning and policy formulation purposes given that health issues are critical and sensitive, hence requires great accuracy and precision measures. Variables like ITN use may be difficult to determine too since the only way to ensure accuracy of this kind of information is through observation. Since the variable was looking for ITN use last night, the discrepancy might have come about since the nights of reference were different.

More than 10% difference was noted for the staple food variable. Fisher’s exact test showed a significance of 0.023 while Pearson chi square test revealed a significance of 0.036. A kappa score of −.107 was found which had a significance of 0.036. Sensitivity/specificity analysis showed 44.4 and 43.6% respectively with a predictive value of 60.2.

A difference of 10% noted in the comparison of two sets of data raises the question how valid is the data collected by CHWs who are non professionals. A Reliable and timely health information is an essential foundation of public health action and health systems strengthening, both nationally and internationally [1, 13]. CHWs are key in the health sector and they need to be engaged at all times even in decision making, from the data analysis and comparison there is still hope that they can be trained and it has to be ongoing until the desired results are achieved.

Conclusion and Recommendations

Community based health information collected by CHWs can be used to guide planning and action at local level given that there is 90% concurrence between the RA and CHW collected data in all the variables. However, it would be ill advised to use it to guide policy as since higher levels of reliability would be required to make such decisions. To be considered also is that accurate measures would be expected on the third or subsequent rounds of data collection as hypothesised by [4, 9].

There is need to investigate other factors that may have an influence in the accuracy of the data collected by CHWs such as education level and the number of training sessions the CHWs have undergone. This may shed light on the disparities found on some of the variables.