Comparison of analysis methods to classify cholera hotspots in Ethiopia from 2015 to 2021

Demlie, Yeshambel Worku; Moore, Sandra; Dunoyer, Jessica; Muluneh, Dereje; Hussen, Mukemil; Wossen, Mesfin; Edosa, Moti; Sudre, Bertrand

doi:10.1038/s41598-024-56299-5

Comparison of analysis methods to classify cholera hotspots in Ethiopia from 2015 to 2021

Article
Open access
Published: 03 April 2024

Volume 14, article number 7377, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Comparison of analysis methods to classify cholera hotspots in Ethiopia from 2015 to 2021

Download PDF

Yeshambel Worku Demlie¹^na1,
Sandra Moore²^na1,
Jessica Dunoyer²,
Dereje Muluneh³,
Mukemil Hussen¹,
Mesfin Wossen¹,
Moti Edosa¹^na2 &
…
Bertrand Sudre²^na2

698 Accesses
1 Citation
7 Altmetric
1 Mention
Explore all metrics

Abstract

Cholera continues to represent a major public health concern in Ethiopia. The country has developed a Multi-sectoral National Cholera Elimination Plan in 2022, which targets prevention and control interventions in cholera hotspots. Multiple methods to classify cholera hotspots have been used in several countries. Since 2014, a classification method developed by United Nations Children's Fund has been applied to guide water, sanitation and hygiene interventions throughout Sub-Saharan Africa based on three outbreak parameters: frequency, duration and standardized attack rate. In 2019, the Global Task Force on Cholera Control (GTFCC) proposed a method based on two parameters: average annual cholera incidence and persistence. In 2023, an updated GTFCC method for multisectoral interventions considers three epidemiological indicators (cumulative incidence, cumulative mortality and persistence,) and a cholera-case confirmation indicator. The current study aimed to classify cholera hotspots in Ethiopia at the woreda level (equivalent to district level) applying the three methods and comparing the results to optimize the hotspot targeting strategy. From 2015 to 2021, cholera hotspots were located along major routes between Addis Ababa and woredas adjacent to the Kenya and Somalia borders, throughout Tigray Region, around Lake Tana, and in Afar Region. The multi-method comparison enables decision makers to prioritize interventions according to a sub-classification of the highest-priority areas.

Spatiotemporal dynamics of cholera in the Democratic Republic of the Congo before and during the implementation of the Multisectoral Cholera Elimination Plan: a cross-sectional study from 2000 to 2021

Article Open access 22 August 2023

Spatiotemporal dynamics of cholera hotspots in the Democratic Republic of the Congo from 1973 to 2022

Article Open access 28 March 2024

Descriptive characterization of the 2010 cholera outbreak in Nigeria

Article Open access 16 November 2014

Introduction

Cholera is caused by toxigenic forms of the bacterium Vibrio cholerae O1 and O139, which is primarily contracted by ingesting contaminated water or food¹. Cholera outbreak dynamics are influenced by various risk factors such as limited access to basic water supply, sanitation infrastructure, hygiene facilities (WASH), which renders the population vulnerable to disease transmission². Secondary factors such as severe climate events (e.g. droughts and flooding), conflict and population displacement can exacerbate outbreaks, impede access to healthcare facilities, and hinder response efforts^3,4,5. As a result, cholera outbreak frequency, duration and incidence may vary depending on the location, context, and the population affected.

Cholera continues to represent a major public health concern in Ethiopia. Between 2019 and 2021, Ethiopia reported a total of 15,515 suspected cholera cases⁶. In response to this persistent public health threat, the Ethiopian Public Health Institute (EPHI) has recently developed an evidence-based Multi-sectoral National Cholera Elimination Plan⁷. The strategy aims to interrupt cholera transmission in the country by identifying cholera hotspots at the woreda level (equivalent to district level) and improving access to WASH services in high-risk kebeles within these hotspots⁷.

A cholera hotspot is a “geographically limited area where environmental, cultural and/or socioeconomic conditions facilitate the transmission of the disease, and where cholera persists or reappears regularly”⁸. As these areas play a central role in the spread of cholera outbreaks, multisectoral interventions should target these areas to prevent and control cholera outbreaks, prioritizing the most at-risk hotspots to most efficiently use limited resources. Several hotspot classification methods have been implemented in Africa over the past decade. The first method was developed by the United Nations Children's Fund (UNICEF) West and Central Africa Regional Office (WCARO) in 2014 based on the analysis of outbreak frequency, outbreak duration and standardized outbreak attack rate per cholera surveillance unit (CSU) (ideally at the district level)⁹. Based on these three parameters, CSUs are then classified into four priority categories for targeted interventions. This approach was first applied using long time-series in 12 countries in West Africa in 2014, which was then updated and applied on 14 countries in 2018⁹ and eight countries in East and Southern Africa in 2017–2018¹⁰. A second method was proposed by the Global Task Force on Cholera Control (GTFCC) in 2019, which uses two yearly surveillance parameters: mean annual incidence of suspected cholera cases and cholera persistence (the number of weeks per year with at least one suspected case reported). With the GTFCC 2019 method, CSUs are classified into three priority categories (high-, medium- and low-priority)⁸. The core principle of this method has been used in various countries; in some cases, additional parameters have been included, such as WASH indicators in Kenya or case fatality ratio in Ethiopia^7,11,12. In 2023, an updated method was proposed by the GTFCC to identify priority areas for multisectoral interventions (PAMIs) in countries with moderate to high cholera transmission. With this method, priority areas are identified based on three epidemiological parameters (cumulative incidence, cumulative mortality and cholera persistence, as defined above) and a cholera test positivity indicator¹³. In addition to these three main methods, other country-specific ad-hoc classification methods have been implemented using a combination of different epidemiological indicators and contextual cholera risk factors^14,15,16. To date, no study has compared the three main cholera hotspot classification methods to adapt targeting strategies.

The current study aimed to identify and classify cholera hotspots in Ethiopia at the woreda level applying the three main classification methods. The results of the three methods were compared, highlighting the pros and cons of each approach.

Results

From week 37 2015 to week 52 2021, cholera hotspots in Ethiopia, at the woreda level, were classified using three analysis methods (Fig. 1).

Using Method A, 90 CSUs were classified as cholera hotspots (Types 1–4) over the course of the study period. A total of 54 CSUs were classified as Type 1 or Type 2 hotspots (Table 1). In terms of outbreak duration, Type 1 and 2 hotspots had a median outbreak duration ≥ 10.5 weeks, with maximum outbreak durations of 19 weeks and 27.5 weeks, respectively. In terms of outbreak frequency, 19 CSUs notified ≥ three outbreaks, of which 14 were classified as Type 1 and five were classified as Type 3 (Table 1; Fig. 1, Method A). Type 1 to 4 hotspots were located in Tigray Region (notably around Mekelle), along the borders with Sudan and Eritrea, in northern Amhara Region (around Lake Tana), in Afar Region along the border with Djibouti, in Addis Ababa (the capital city), along main roads toward the east (connecting Addis Ababa with Dire Dawa and Jigjiga), and along roads to Kenya (Shashemene, Hawassa and Arba Minch) and Somalia. The majority of Type 1 hotspots were located in Tigray Region, along routes connecting Addis Ababa with Kenya and Somalia, Bale Zone in Oromia Region, and along the Kenya and Somalia borders (Moyale and Dolo Odo, respectively) (Fig. 2).

Table 1 Epidemiological characteristics of the cholera hotspots identified using methods A, B and C.

Full size table

Using Method B, 86 CSUs were classified as high priority hotspots, 164 CSUs were classified as medium priority hotspots, and the remaining 783 CSUs were classified as low priority hotspots. For the high-priority hotspots, the mean incidence broadly ranged from 1.02 to 105, with an average of approximately 10 cases per 10,000 population and an average cholera persistence of 26 weeks. The medium-priority CSUs can be split in two sub-groups: (1) 11 hotspots with low incidence and high persistence and (2) 153 hotspots with high incidence and low persistence (Table 1; Fig. 1). Method B hotspots were widespread throughout the country. Hotspots with high incidence were located in northern Afar Region, Somali Region, Southern Nations, Nationalities, and Peoples' (SNNP) Region and South West Ethiopia Peoples' (SWEP) Region. Hotspots with high persistence were located in Somali Region, Dire Dawa Region, Harari Region, Oromia Region (including Bale Zone), northern Afar and Tigray Region. This method classified areas affected by a single epidemic as medium-priority cholera hotspots (in Somali Region and SNNP in 2017 as well as SWEP Region in 2021 and 2021) (Fig. 2).

For Method C, the median priority index of six was defined as threshold to define priority areas. A total of 133 hotspots had a priority index ≥ six, which corresponds to an estimated population of 7.8 million (approx. 15% of the total population), 72.2% of all cases and 87.7% of deaths (Table 1; Fig. 1). Method C hotspots were distributed in a similar pattern to that of Method B, although fewer CSUs were identified as hotspots using Method C. Several high-priority hotspots (priority index ≥ 8) were located in Tigray Region, SNNP Region, SWEP Region, along the eastward road from Addis Ababa, and Somali Region (Fig. 2). High-priority hotspots in Somali Region included urban areas with high cumulative incidence and very high persistence due to the epidemic in 2017. Hotspots with very high morality were essentially located along the border with South Sudan (SWEP and Gambela Regions). Maps of the parameters used in each method are available in the Supplementary Materials 1, 2 and 3.

The relationship between the results of the three methods are illustrated in Fig. 3, and the epidemiological features for each comparison are provided in Table 2. When comparing all hotspots identified by the three methods, all hotspots identified by Method A were also identified by Method B (Fig. 3, Panel 1). Likewise, each hotspot identified by Method C was also identified by Method B (Fig. 3, Panel 1). Most of the hotspots identified only by Method B were located in remote areas (especially in Somali Region), whereas hotspots identified by all three methods were located along major roads, around major waterbodies or along international borders. The majority of hotspots in Somali Region were not identified by Method A as many of these CSUs were only affected by a single outbreak (Fig. 4, Panel 1).

Table 2 Epidemiological characteristics for each component of the Venn diagram (Panels 1 and 2).

Full size table

When comparing only high-priority hotspots identified by each method (a total of 112 hotspots), 11.4% (13 hotspots) were identified by all three methods, 28.1% (32 hotspots) were only identified by Methods A and B, 10.5% (12 hotspots) were only identified by Methods B and C, and 2% were only identified by Methods A and C. However, 48.2% of these high-priority hotspots were identified only by a single method, notably using Method B (19 hotspots) and Method C (29 hotspots) (Fig. 3, Panel 2). All seven high-priority hotspots uniquely identified with Method A were affected by multiple outbreaks, while 16 high-priority hotspots identified uniquely with Method B were only affected by a single outbreak. Likewise, high-priority hotspots solely identified using Method C experienced either a single outbreak (13 CSUs) or two outbreaks (six CSUs) over the course of the entire study period (Table 2). High-priority hotspots identified by all three methods represent 9.6% of all cases and 14.1% of all deaths; these hotspots were all affected by multiple outbreaks. These hotspots were located in border areas (e.g. Moyale, etc.), along major roads, near large waterbodies, and in urban areas (Fig. 4, Panel 1). Many high-priority hotspots only identified by Method C were located in the southwest, (SNNP and SWEP Regions) (Fig. 4, Panel 1), where a cholera epidemic spread in previously unaffected remote areas causing significant mortality (Supplementary Material 3).

Discussion

In this comprehensive study, three analysis methods were applied to identify and classify cholera hotspots in Ethiopia, at the woreda level, from September 2015 to December 2021. Overall, high-priority cholera hotspots were mainly located along major routes between Addis Ababa and the Kenya and Somalia borders, throughout Tigray Region, around Lake Tana (in Amhara Region), and in Afar Region along the Ethiopia-Djibouti road. The results of the classification methods were then compared to identify the best approach for Ethiopia to implement targeted strategies to achieve the objective of cholera elimination.

Classification methods A, B and C identified a total of 90, 250 and 133 cholera hotspots, respectively. A total 71 CSUs were identified as hotspots by all three methods. Assessing only the high-priority hotspots (Types 1–2, Method A; high-priority, Method B; and priority index ≥ 8, Method C), a total of 54, 86 and 46 hotspots were identified by each method, respectively, among which only 13 hotspots were identified by all three methods.

Over the course of the study period, multiple regions across Ethiopia were vulnerable to cholera outbreaks due to a variety of factors including poor access to water and sanitation¹⁷, severe weather (e.g. drought and flooding)^18,19 and cross-border transmission with neighboring Somalia and Kenya²⁰. However, due to constrained resources, cholera elimination strategies must prioritize prevention efforts targeting a restricted sub-set of cholera hotspots. A cholera hotspot is defined an area “where cholera persists or reappears regularly” and thus plays a critical role in outbreak diffusion to unaffected areas.

In line with the cholera hotspot definition, Method A classifies hotspots based on both outbreak duration and outbreak frequency. As a result, woredas affected by a single outbreak over the course of the entire study period were not identified as hotspots by Method A, although many of these woredas were classified by Methods B and C. Method A also resulted in a more restricted list of overall hotspots, with four priority sub-groups according to the distinct outbreak dynamics, thus enabling limited resources to be targeted and prioritized according to cholera transmission dynamics. Method A was initially developed to target areas with frequent outbreaks for long-term WASH interventions.

Classification Method B was based solely on cholera persistence and incidence. Due to the widespread nature of cholera epidemics in Ethiopia over the past six years, this approach identified nearly 25% of all woredas in the country as cholera hotspots (250 hotspots). However, targeting such a large number of hotspots would likely challenge the implementation of long-term water and sanitation infrastructure investments, especially with competing public health priorities. To improve this method, the GTFCC updated the classification approach in 2023 (Method C).

Compared with Method B, Method C identified a more restricted set of hotspots affected by recurrent outbreaks and/or outbreaks of long duration with substantial transmission intensity. This method also includes a mortality indicator to account for the objectives of the Global Roadmap to 2030 to reduce cholera deaths by 90%^13,21. However, this indicator is primally based on surveillance data of deaths recorded in healthcare facilities and would overlook community deaths in areas with limited healthcare facility coverage. For this study, the data provided for Somali Region in 2017 did not include cholera-related deaths. In this remote and rural area with a high proportion of pastoralist populations, the community case and death numbers are also likely underestimated²². Due to these surveillance limitations and missing data for Somali Region, Method C alone is likely not currently adapted to the Ethiopia context.

Regardless of the classification method(s) applied, the hotspot classification should be interpreted, adjusted and validated by WASH specialists and public health experts that understand the cholera risk factors in the country¹³. Key areas that play an important role in cholera dynamics may not be prioritized by a given method, which may be due missing surveillance data, etc. A validation workshop provides the opportunity to agree on the final hotspot list based on the analysis and manually adjust the classification according to the specific context as needed. This process is also critical to ensure ownership by the public health authorities and other actors involved in cholera control. Furthermore, the most appropriate method or a multi-method approach should be selected and adapted depending on the parameters included in the analysis, data available for the analysis, and the country-specific conditions.

Some study limitations should be noted. The three methods were applied based on the recommended thresholds; however, these thresholds should ideally be set by a panel of experts during a validation workshop. Suspected cholera cases have not been reported from Tigray Region since 2019. Given the insecurity context in the region in 2020 and 2021, the cholera burden may be underestimated due to challenges in healthcare access and disease surveillance limitations. Nevertheless, it is unlikely that a major cholera outbreak in Tigray Region would spread undetected. Regarding the epidemic in Somali Region in 2017, the aggregated databases provided for the analysis did not include cholera-related deaths; as a result, the deaths in this region were significantly underestimated. The total annual cholera case data by region from the WHO Ethiopian country office used to perform the gap analysis were only available for the period 2015–2018; nevertheless, the gap analysis for the years 2019 and 2021 was conducted using the annual cholera case numbers from the WHO Weekly epidemiological record data. As WASH indicator data at the woreda level was unavailable, we were unable to assess the WASH profile of each hotspot type.

These results highlight several actions to further strengthen cholera elimination efforts in Ethiopia. As cholera hotspot patterns can be dynamic due to various factors such as population movement, socio-economic variables and climate factors, the hotspot analysis should be regularly updated. Indeed, a parallel study conducted by Moore et al.²³ provides a detailed description of the dynamic spatiotemporal characteristics of cholera epidemics in Ethiopia during the same time period. The classification exercise should also be conducted ad hoc if the cholera context in the country evolves or major events occur that may drive cholera transmission (e.g. extreme weather events, conflicts). Additional classification exercises could also test the predictive power of each method applied and monitor cholera elimination progress. To establish a comprehensive understanding of the disease dynamics in each woreda, additional data on underlying factors that contribute to disease transmission should be included in the hotspot classification analysis. Incorporating WASH data into this analytical framework can help to identify high-risk areas where inadequate water and sanitation infrastructures and hygiene practices contribute to cholera transmission, while vaccination data can help to prioritize areas where targeted vaccination is required to maintain herd immunity. Finally, supplementary studies should be performed, especially in high-priority hotspots, to better identify populations regularly affected by cholera and contextual factors driving cholera transmission. In urban areas, this level of analysis is instrumental to target preparedness and prevention interventions to the most relevant populations.

This study sheds light on three complementary methods to classify cholera hotspots, a pivotal step in developing a National Cholera Plan. Comparing the results of each method by analyzing cholera data from Ethiopia for the 2015–2021 period, we have gained a comprehensive understanding of the strengths and limitations of the distinct approaches. These results underscore the importance of a multifaceted approach to cholera hotspot classification. The type of method applied should be context-specific, taking into consideration factors such as data availability, data analysis resources and capacity, and the distinct epidemiological landscape. To inform more effective strategies to identify and classify cholera hotspots going forward, additional efforts should aim to identify country-specific factors that influence cholera dynamics in major hotspot areas and adapt the analysis method accordingly. Regardless of the method(s) applied, it is important to allow for subsequent manual adjustment of the final hotspot ranking during a validation workshop with country stakeholders, thereby enabling the flexibility of a tailored strategy that harnesses the strengths of the method(s) to ultimately enhance cholera elimination efforts. Furthermore, it is critical to identify and detail key interventions per pillar and per hotspot type. Overall, these results provide valuable insights for public health policymakers to prepare for and prevent further outbreaks in a targeted manner, ultimately saving lives in vulnerable communities across Ethiopia and beyond.

Methods

Study design and site

In this retrospective cross-sectional study, we used cholera data from Ethiopia from week 37 2015 to week 52 2021 to apply three different methods to classify cholera hotspots.

Ethiopia is located in the Horn of Africa. According to the 2021 administrative divisions, Ethiopia comprises 13 regional states, 92 zones and 1040 woredas (equivalent to districts). The woredas are further divided into kebeles. The estimated 2021 population of Ethiopia is 103,610,998 inhabitants²⁴. The most populated city and national capital is Addis Ababa, which hosts an estimated 3,780,000 people (approximately 4% of the country’s population)²⁴. Ethiopia is a landlocked country with a vast highland complex of mountains and plateaus divided by the Great Rift Valley, which runs southwest to northeast and is surrounded by lowlands, steppes or deserts²⁵.

Cholera case definition

A suspected cholera case is defined as an individual with one of the two conditions²⁶:

A patient aged 5 years or more who develops severe dehydration or dies from AWD, in an area where the disease is not known to be present.
Any patient who develops AWD, with or without vomiting, in an area where there is a cholera epidemic.

Furthermore, in the health post and community levels, a suspected cholera case (often referred to as the community case definition) can be defined as follows: any person five years of age or more with profuse AWD and/or vomiting²⁶.

A confirmed cholera case refers to a suspected case in which Vibrio cholerae O1 or O139 has been isolated from stool via culture.

Cholera data sources

Four sources of cholera data were available for the epidemiological analysis: regional line lists, regional aggregated databases (daily), and data templates.

WHO databases (total cases per woreda) and WHO Weekly epidemiological record data were used to identify data gaps in the EPHI databases initially provided. Any missing data was subsequently requested from the EPHI.

Line lists

Line lists of suspected cholera cases and deaths for the period week 37 2015 to week 52 2021 were provided by the Disease and Health Event Surveillance and Response Department at the EPHI.

Aggregated databases

Aggregated databases were used to supplement data gaps in the line list data. The daily aggregated data (per woreda) was converted into a weekly database for the analysis. The aggregated databases (region and year) were as follows: Somali 2017 (weeks 1–37), Dire Dawa (2017), SNNP (Southern Nations, Nationalities, and Peoples' Region) (2017), SNNP (2020), Benishangul Gumz (2017) and Sidama (2020).

Data templates

Two types of data templates were completed by the regions to supplement remaining data gaps: (1) aggregated total cases and deaths per woreda and (2) outbreak start and end date, total cases and deaths per woreda.

Annual totals for gap analysis

For the period 2015–2018, the total annual cholera cases by region were obtained from the WHO Ethiopian country office. For the years 2019 and 2021, the annual cholera case numbers were obtained from the WHO Weekly epidemiological record^27,28,29. The total annual cholera cases by region for the period 2019–2021 were unavailable.

Population data

The Ethiopian population data projections per woreda for the year 2021 were obtained from the Humanitarian Data Exchange open data platform (United Nations Office for the Coordination of Humanitarian Affairs, OCHA)²⁴. The populations for the previous years (2015–2020) for each regional state were calculated using decreasing population growth rates provided by the EPHI as follows: Addis Ababa, 2.1%; Afar, 2.2%; Amhara, 1.7%; Benishangul Gumz, 3%; Dire Dawa, 2.5%; Gambela, 4.1%; Harari, 2.6%; Oromia, 2.9%; SNNP and Sidama 2.9%; Somali, 2.6%; and Tigray, 2.5%.

Geographic information system (GIS) data

The original GIS file layerscorrespond to the following administrative units: regional states (13 regional states including Addis Ababa), zones (92 zones) and woredas (1040 woredas). Additionally, public domain vector map data (1:10 m scale) was retrieved from Natural Earth open-source repository and clipped to the Ethiopia national boundary (lakes, rivers, major cities and road networks)³⁰.

Epidemiological analysis

Cholera case-based and aggregated data in Microsoft Excel format were cleaned as described below and assembled after data quality verification into a single database of weekly case and death numbers per woreda using RStudio 2023.03.1³¹ with R-4.3.0 version³² for downstream epidemiological analyses. GIS files were managed using QGIS V3.28 Firenze³³ and R-4.3.0³².

To verify the spatial data, the case locations (region, zone and woreda) were systematically verified (e.g. consistent spelling) according to the corresponding location in the GIS file attribute table. During the study period, two new regions (Sidama Region in 2020 and SWEP Region in 2021) were created within SNNP Region (GIS files, December 2021 version³⁴). Cases were assigned to the new regions according to the reporting kebele localization.

In Tigray Region, to represent the most recent administrative organization, the correct woreda for each case (n = 5945) was identified based on the kebele information by overlaying the kebele-level shapefile.

Furthermore, the data for a few woredas in Amhara Region and Somali Region were merged either because they were already aggregated in the databases or because the initial localization description was ambiguous (e.g. for East Dembia and West Dembiya, many cases were listed simply as “Dembia” or “Dembiya”). For Amhara Region, (1) East Dembia and West Dembiya Woredas were merged into “Dembia (W-E)”; (2) Aykel town, Chilga 1 and Chilga 2 Woredas were merged into “Chilga (T-1–2)”; and (3) East Esite, West Esite and Mekan Eyesuse Woredas were merged into “Esite (W-E)”. For Somali Region, Degahabur Town and Degehabur were merged into “Degahabur (T-Z)” and Kebridehar Town and Kebridehar were merged into “Kebridehar (T-Z)”. In this study, the total number of health surveillance units (woreda level) is 1033.

To verify the dates of onset and admission at the health facility recorded in the line lists, the original Ethiopian dates (Ge’ez calendar) and the derived Gregorian dates were systematically verified. All records and available dates were verified (date of onset, date of admission, date of discharge and date of sampling, if any). The epi-week of onset for each case was then calculated according to the Gregorian calendar dates using the ISO week date system. If the onset date was unavailable, the date seen at the health facility was applied. The case and death observations (in the line lists and aggregated data) were aggregated by week for downstream analysis.

Duplicate case data were removed prior to analysis by identifying multiple identical entries based on the combination of the following case-based information: sex, age, patient identifier, woreda, date of onset, date seen at health facility, date of admission and status. For observations lacking the patient identifier information, duplicate lines were identified based on the following case-based information: sex, age, woreda, date of onset, date seen at health facility, date of admission and status. Observations with similar combinations of case-based information were removed.

All line lists and aggregated databases were then consolidated into a single database for further analysis. We then performed a gap analysis for the period 2016–2018 in which the total case numbers per region were verified using the regional total numbers provided by the WHO. For any data gaps identified, we requested the missing line list data. For the years 2019–2021, the total case numbers nationwide were verified using the annual totals available in the WHO Weekly Epidemiological Records. A region-level gap analysis thus could not be performed for the years 2019–2021. Epidemic curves per zone and per woreda were generated using R-4.1.1 and all times-series per woredas were verified to assess outbreak evolution over time. Any outliers and unusual backlogs were assessed with surveillance experts and corrections were applied accordingly.

Identification and classification of cholera hotspots

Three methods were applied and compared to classify cholera hotspots in Ethiopia at the woreda level.

Method A

This classification method involves the analysis of three epidemiological parameters: (1) outbreak frequency, (2) outbreak duration (median, in weeks) and (3) median standardized outbreak attack rate (in 10,000 person-weeks).

To define an outbreak event in each CSU, the weekly time series were processed as follows. Sporadic cases were removed (i.e. one to two cases without reported cases during the week before and after) to mitigate potential notification biases affecting outbreak duration. The weekly number of cases were interpolated using a local polynomial fit (Package ‘interp’, function locpoly, bandwith = 0.5). The interpolation parameters were optimized to fit the outbreak period and minimum cut-off threshold defined on smoothed values to automatically extract for start and end week of each outbreak event. Observed and smoothed time-series and extracted values were manually verified for all CSUs to assess the start and end weeks of each outbreak. A minimum of ten cases was required for a transmission event to be considered an outbreak. Furthermore, two successive outbreak events separated by an inter-epidemic period ≥ six weeks were considered as two separate outbreaks.

The following epidemiological indicators were extracted for each outbreak in each CSU: the number of reported cases and deaths, outbreak start and end week, outbreak duration (in weeks), and standardized outbreak attack rate. Based on the percentile range of the three outbreak parameters (i.e. frequency, duration and standardized outbreak attack rate), each CSU was classified into four hotspot types: Type 1: area with cholera outbreaks of high frequency and extended duration, Type 2: area with cholera outbreaks of moderate frequency and extended duration, Type 3: area with cholera outbreaks of high frequency and short duration, and Type 4: area with cholera outbreaks of moderate frequency and short duration (Table 3).

Table 3 Summary of cholera hotspot classification methods.

Full size table

Method B

This method was used to classify CSUs based on two epidemiological indicators: (1) mean of the yearly annual incidence (per 10,000 pop.) and (2) total number of weeks with at least one reported cholera case divided by the total number of weeks in the study period (expressed as percentage)⁷. Both indicators are dichotomized into two categories (low and high); however, as no specific cut-off is proposed, the cut-off should be determined by the country authorities. For this analysis, the thresholds applied were defined in the Ethiopian National Cholera Plan as follows: “high incidence” corresponds to values ≥ 10 cases per 100,000 population and “high persistence” corresponds to values ≥ 5% (Table 3). The CSUs were then classified into three priority levels: (1) high (areas with high incidence and high persistence), (2) medium (areas with low incidence and high persistence, or with high incidence and low persistence) and (3) low (areas with low incidence and low persistence).

Method C

This method was developed to rank priority areas for cholera prevention and control interventions in countries with high to moderate cholera transmission based on retrospective data collected over the recent five to 15 years¹³. Indicators used to calculate the priority index were derived from the weekly number of cases for each CSU over the course of the study period as follows: (1) cumulative incidence (cumulative number of cholera cases reported per 10,000 person-years), (2) cumulative mortality (cumulative number of cholera-related deaths reported per 10,000 person-years), (3) persistence (percentage of weeks with at least one reported suspected cholera case over the total number of weeks of the study period). A fourth indicator, cholera test positivity, can be considered according to the representativeness of cholera testing among suspected cases, which is determined using the weekly testing coverage (percentage of weeks with at least one suspected case tested for cholera (regardless of the testing method) among weeks with at least one suspected case reported). If the level of representativeness is acceptable, the cholera test positivity indicator selected is the overall positivity rate (percentage). If the level of representativeness is considered suboptimal, the number of years with at least one case tested positive for cholera (regardless of the testing method) is instead included as a test indicator. If the level representativeness is considered insufficient, the cholera test indicator is not included. To apply Method C, we assessed whether the cholera test coverage indicator could be included in the analysis. Over the course of the study period, 221 CSUs (42% of the total) performed cholera testing of one or more suspected case(s) in at least one week, which indicates that the cholera test representativeness is insufficient. As a result, the priority index for this database was based solely on the three epidemiological indicators (incidence, mortality, and persistence).

The values for incidence, mortality and persistence were then converted into separate scores according to a four-point scale based on the 50th and 80th percentiles of distribution (Table 3). The final priority index was calculated taking the sum of the scores for each indicator. The initial list of priority areas was defined using the median value of the priority score.

Cartography

All maps were generated using the GIS files described above and the software QGIS V3.28 Firenze³³ and R-4.3.0³² (with ggmap package).

Ethical considerations

The research protocol was submitted for ethics approval by the EPHI Institutional Review Board in March 2021. The research protocol was approved by the EPHI Institutional Review Board on June 14, 2021. The EPHI Institutional Review Board waived the informed consent requirement. All experiments were performed in accordance with the relevant guidelines and regulations.

Data availability

Data are available from the authors upon reasonable request and with the permission from Yeshambel Worku (workuyeshambel19@gmail.com) and Moti Edosa (motiedosa7@gmail.com).

Abbreviations

AWD:: Acute watery diarrhea
CSU:: Cholera surveillance unit
EPHI:: Ethiopian Public Health Institute
GTFCC:: Global task force on cholera control
SNNP:: Southern nations, nationalities, and peoples
SWEP:: South West Ethiopia Peoples
UNICEF:: United Nations Children’s Fund
WASH:: Water, sanitation and hygiene
WCARO:: West and Central Africa Regional Office
WHO:: World Health Organization

References

Kaper, J. B., Morris, J. G. & Levine, M. M. Cholera. Clin. Microbiol. Rev. 8(1), 48–86 (1995).
Article CAS PubMed PubMed Central Google Scholar
Lippi, D., Gotuzzo, E. & Caini, S. Cholera. Microbiol. Spectr. https://doi.org/10.1128/microbiolspec.PoH-0012-2015 (2016).
Article PubMed Google Scholar
Charnley, G. E. C. et al. Exploring relationships between drought and epidemic cholera in Africa using generalised linear models. BMC Infect. Dis. 21(1), 1177 (2021).
Article CAS PubMed PubMed Central Google Scholar
Rieckmann, A., Tamason, C. C., Gurley, E. S., Rod, N. H. & Jensen, P. K. M. Exploring droughts and floods and their association with cholera outbreaks in Sub-Saharan Africa: A register-based ecological study from 1990 to 2010. Am. J. Trop. Med. Hyg. 98(5), 1269–1274 (2018).
Article PubMed PubMed Central Google Scholar
Charnley, G. E. C., Jean, K., Kelman, I., Gaythorpe, K. A. M. & Murray, K. A. Association between conflict and cholera in Nigeria and the democratic Republic of the Congo. Emerg. Infect. Dis. 28(12), 2472–2481 (2022).
Article PubMed PubMed Central Google Scholar
World Health Organization. Cholera case and death numbers by country [Internet]. The Weekly Epidemiological Record. https://www.who.int/wer/en/ (2022). Accessed 7 August 2023.
EPHI. Ethiopia - Multi-Sectorial Cholera Elimination Plan 2022–2028 [Internet]. https://reliefweb.int/report/ethiopia/ethiopia-multi-sectorial-cholera-elimination-plan-2022-2028 (2023). Accessed 7 August 2023.
GTFCC. Guidance and tool for countries to identify priority areas for intervention [Internet]. https://www.gtfcc.org/wp-content/uploads/2019/11/guidance-and-tool-for-countries-to-identify-priority-areas-for-intervention.pdf (2019). Accessed 7 August 2023.
Cholera platform. Cholera Fact Sheets by Country [Internet]. https://plateformecholera.info/index.php/unicef-cholera-factsheet (2018). Accessed 7 August 2023.
UNICEF WCARO. ESARO Cholera Factsheets [Internet]. https://www.plateformecholera.info/cholera-fact-sheets (2018). Accessed 7 August 2023.
GTFCC. Each country designs its own national plan in the global fight against cholera [Internet]. https://www.gtfcc.org/country-progress/ (2022). Accessed 7 August 2023.
Kenya Ministry of Health, Kenya Ministry of Water, Sanitation and Irrigation. National Multi-Sectoral Cholera Elimination Plan (NMCEP) (2022–2030) [Internet]. https://www.gtfcc.org/wp-content/uploads/2022/09/national-cholera-plan-kenya.pdf (2022). Accessed 7 August 2023.
GTFCC. Identification of Priority Areas for Multisectoral Interventions (PAMIs) for cholera control [Internet]. https://www.gtfcc.org/wp-content/uploads/2023/04/2023-gtfcc-pami-cholera-control-guidance.pdf (2023). Accessed 7 August 2023.
The Government of the Republic Of Zambia, Zambia National Public Health Institute. Zambia Multisectoral Cholera Elimination Plan 2019–2025 [Internet]. https://www.gtfcc.org/wp-content/uploads/2019/05/national-cholera-plan-zambia.pdf (2019). Accessed 7 August 2023.
Ministere De La Sante Publique. Plan Stratégique Multisectoriel d’Elimination du Choléra en République Démocratique du Congo 2013–2017 [Internet]. https://www.choleraalliance.org/sites/g/files/dvc3616/files/document/2017/07/PLAN_ELIMINATION_CHOLERA_2013_2017_FAcv28fev_10032015.pdf (2013). Accessed 7 August 2023.
Kiama, C. et al. Mapping of cholera hotspots in Kenya using epidemiologic and water, sanitation, and hygiene (WASH) indicators as part of Kenya’s new 2022–2030 cholera elimination plan. PLoS Negl. Trop. Dis. 17(3), e0011166 (2023).
Article PubMed PubMed Central Google Scholar
WHO, UNICEF. WHO/UNICEF Joint monitoring programme for water supply, sanitation and hygiene (JMP). https://washdata.org/data/household# (2020). Accessed 7 August 2023.
World Health Organization. Outbreak news. Acute watery diarrhoeal syndrome, Ethiopia. Wkly. Epidemiol. Rec. 81(41), 385 (2006).
Google Scholar
VOA. Ethiopia declares another diarrhea outbreak (2017).
Green, A. Cholera outbreak in the horn of Africa. Lancet 389, 2179–2179 (2017).
Article PubMed Google Scholar
Global Task Force on Cholera Control. Ending cholera. A global roadmap to 2030. (World Health Organization (WHO), 2017).
Challa, J. M. et al. Inadequate hand washing, lack of clean drinking water and latrines as major determinants of cholera outbreak in Somali region, Ethiopia in 2019. Front. Public Health 6(10), 845057 (2022).
Article Google Scholar
Moore, S. et al. Spatiotemporal dynamics of cholera epidemics in Ethiopia: 2015–2021. Sci. Rep. 14(1), 7170 (2024).
Article CAS PubMed PubMed Central Google Scholar
OCHA. The Humanitarian Data Exchange: Datasets. 2021. Ethiopia—Subnational population statistics. https://data.humdata.org/dataset/ethiopia-population-data-_-admin-level-0-3 (2022). Accessed 7 August 2023.
Gebre-Medhin, M. & Vahlquist, B. O. Famine in Ethiopia-a brief review. Am. J. Clin. Nutr. 29(9), 1016–1020 (1976).
Article CAS PubMed Google Scholar
Ethiopian Health Nutrition Research Institutes. Guideline on cholera outbreak management in Ethiopia [Internet]. http://repository.iifphc.org/bitstream/handle/123456789/373/national-cholera-guideline.pdf?sequence=1&isAllowed=y (2011). Accessed 7 August 2023.
World Health Organization. Weekly epidemiological record annual report: Cholera 2019. https://apps.who.int/iris/bitstream/handle/10665/334241/WER9537-eng-fre.pdf?ua=1 (2020). Accessed 7 August 2023.
World Health Organization. Weekly epidemiological record annual report: Cholera 2020. https://reliefweb.int/sites/reliefweb.int/files/resources/WER9637-eng-fre_0.pdf (2021). Accessed 17 November 2021.
World Health Organization. Weekly epidemiological record annual report: Cholera 2021. https://apps.who.int/iris/handle/10665/362858 (2022). Accessed 7 August 2023.
Natural Earth - Free vector and raster map data at 1:10m, 1:50m, and 1:110m scales. https://www.naturalearthdata.com/. Accessed 24 April 2022.
RStudio team. RStudio: Integrated development for R. PBC. http://www.rstudio.com/ (2023). Accessed 7 August 2023.
R Core Team. R: A language and environment for statistical computing. R Foundation for statistical computing. https://www.R-project.org/ (2023). Accessed 7 August 2023.
QGIS Development Team. QGIS Geographic Information System. Open Source Geospatial Foundation Project. http://qgis.osgeo.org (2023). Accessed 7 August 2023.
OCHA. Ethiopia—Subnational administrative boundaries—Humanitarian data exchange. https://data.humdata.org/dataset/cod-ab-eth (2021). Accessed 4 August 2023.

Download references

Acknowledgements

The authors thank Muhammad Ishaq Khan for assisting with project logistics and data verification. The authors also extend their gratitude to Laure Anquez for insight concerning WASH conditions in Ethiopia. The authors also thank Rajeev Gera, Ngozi Kennedy from the UNICEF Ethiopia Country office as well as Ida Marie Ameda and Mohamed Diaaeldin Omer from the UNICEF Eastern and Southern Regional Office for their guidance and continuous support.

Funding

This study was financed by UNICEF Ethiopia (contract #43298049). The funders had no role in the design of the study, the collection and analysis of the data, the decision to publish or the preparation of the manuscript.

Author information

These authors contributed equally: Yeshambel Worku Demlie and Sandra Moore.
These authors jointly supervised this work: Moti Edosa and Bertrand Sudre.

Authors and Affiliations

Public Health Emergency Management, Ethiopian Public Health Institute (EPHI), Addis Ababa, Ethiopia
Yeshambel Worku Demlie, Mukemil Hussen, Mesfin Wossen & Moti Edosa
Prospective and Cooperation, 1 place Gabriel Péri, Vieux port, 13001, Marseille, France
Sandra Moore, Jessica Dunoyer & Bertrand Sudre
Health Section, UNICEF Ethiopia, UNECA Compound, Zambezi Building, Box 1169, Addis Ababa, Ethiopia
Dereje Muluneh

Authors

Yeshambel Worku Demlie
View author publications
You can also search for this author in PubMed Google Scholar
Sandra Moore
View author publications
You can also search for this author in PubMed Google Scholar
Jessica Dunoyer
View author publications
You can also search for this author in PubMed Google Scholar
Dereje Muluneh
View author publications
You can also search for this author in PubMed Google Scholar
Mukemil Hussen
View author publications
You can also search for this author in PubMed Google Scholar
Mesfin Wossen
View author publications
You can also search for this author in PubMed Google Scholar
Moti Edosa
View author publications
You can also search for this author in PubMed Google Scholar
Bertrand Sudre
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.S., S.M., M.E., J.D., M.W. and D.M. designed the study. B.S., M.E. and Y.W.D. curated the surveillance data. S.M. and B.S. cleaned the data. B.S. conducted the analysis. B.S. produced the figures and tables. B.S., S.M., Y.W.D., M.E., M.H., J.D. and D.M. interpreted the results. S.M. and B.S. wrote the manuscript. All authors reviewed and edited the manuscript. All authors read and approved the final version of the manuscript.

Corresponding author

Correspondence to Moti Edosa.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Demlie, Y.W., Moore, S., Dunoyer, J. et al. Comparison of analysis methods to classify cholera hotspots in Ethiopia from 2015 to 2021. Sci Rep 14, 7377 (2024). https://doi.org/10.1038/s41598-024-56299-5

Download citation

Received: 04 October 2023
Accepted: 05 March 2024
Published: 03 April 2024
DOI: https://doi.org/10.1038/s41598-024-56299-5
Springer Nature Limited

Keywords

This article is cited by

Spatiotemporal dynamics of cholera epidemics in Ethiopia: 2015–2021
- Sandra Moore
- Yeshambel Worku Demlie
- Bertrand Sudre
Scientific Reports (2024)

Comparison of analysis methods to classify cholera hotspots in Ethiopia from 2015 to 2021

Abstract

Similar content being viewed by others

Spatiotemporal dynamics of cholera in the Democratic Republic of the Congo before and during the implementation of the Multisectoral Cholera Elimination Plan: a cross-sectional study from 2000 to 2021

Spatiotemporal dynamics of cholera hotspots in the Democratic Republic of the Congo from 1973 to 2022

Descriptive characterization of the 2010 cholera outbreak in Nigeria

Introduction

Results

Discussion

Methods

Study design and site

Cholera case definition

Cholera data sources

Line lists

Aggregated databases

Data templates

Annual totals for gap analysis

Population data

Geographic information system (GIS) data

Epidemiological analysis

Identification and classification of cholera hotspots

Method A

Method B

Method C

Cartography

Ethical considerations

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Spatiotemporal dynamics of cholera epidemics in Ethiopia: 2015–2021

Search

Navigation