Introduction

Dengue infection has been considered a significant global health problem in many tropical and subtropical countries, with 3.97 billion people worldwide are at risk [1, 2]. Indonesia is among many Southeast Asian countries that are considered hyperendemic [3], with an estimation of 600,000 cases of dengue annually, of which approximately 180,000 lead to hospitalization [4]. One of the regions with the highest incidence of dengue cases per year is Bandung City where the total number has steadily increased from 4000 in 2008 to 6000 in 2013 [5, 6].

However, these incidence figures may actually be an underestimate due to dependence on passive surveillance systems. Passive surveillance systems depend largely on a country’s capacity or resources (e.g., clinical skill and laboratory resources) to identify all cases of a disease. When a country does not possess the required capacity or resources, passive surveillance is less likely to detect all cases, which could lead to periodic outbreaks [7,8,9]. Considering the lack of a reliable dengue surveillance system in Indonesia [10], we propose the use of spatial pattern analysis of a possible outbreak of dengue cases, such as through the Geographic Information System (GIS), based on confirmed dengue cases reported by medical institutions. This approach could help communities and policymakers by providing strategic information without the requirement for extensive capacity or resources.

In recent years, GIS has been utilized to assess, identify, and visualize the potential risk factors involved in disease transmission [9, 11,12,13,14]. Employed by many agencies of public health epidemiology, this technology is considered to be a powerful tool in addressing epidemiological problems and providing spatial models for improving the effectiveness of interventions for various infectious diseases, such as black fever, diarrhea, typhoid, malaria, and dengue [15,16,17,18]. In these studies, spatial analysis and statistics such as spatial autocorrelation have been used to develop models of spatial distribution and patterns of disease that demonstrate the spatial clusters of the disease incidence (”hotspots”) [13]. Hotspots can be identified as a spatial cluster of high value, in this case, an area with a record of high dengue incidences that can be assumed to be where dengue outbreaks are most likely to occur [19,20,21,22]. The application of cluster analysis has provided a better understanding for some cases of dengue, such as (1) the dispersion of the first epidemic of dengue virus serotype in Vitoria, Brazil, as cluster analysis showed the speed of dispersion was highest in the hotspots [23]; (2) the dynamics of dengue cases of Andhra Pradesh, India, as the cluster analysis classified 23 districts into 3 major clusters based on the risk of dengue incidences and 4 districts as the major hotspots [20]; and (3) the pattern of dengue exposure based on the types of the dengue virus in Singapore [24].

Taking into account the lack of reliable surveillance networks in a developing country, spatial epidemiological investigations can be approached by mapping the relative risk, a process which essentially compares the probability of disease in the exposed group to the probability in the unexposed group [25], using the patient data of reported positive cases. A previous study employing this approach in Bucaramanga, a smaller Indonesian city with a low population density, was reasonably successful in estimating and mapping the relative risk of dengue incidence [26]. However, our study might produce different dynamics, as Bandung has a much bigger area (Bandung covers an area of more than 167.29 km2 while Bucaramanga covers only 27 km2). This larger space may lead to broader movement of disease carriers among areas in Bandung.

Furthermore, dengue incidences have a temporal signature, as outbreaks at a particular space-time location can surge and resurface in a cyclic pattern. It is therefore critical to analyze the data through space-time clustering—as the disease tends to exhibit in a non-random pattern—and geosurveillance—as the disease tends to occur in similar regions which could translate into geographical position [27, 28]. However, there are two important concerns. First, uncertainty exists regarding geographic context [29], caused by spatial uncertainty of the areas which, in the case of geospatial error, is influenced by the people being studied. Case reports are based mostly on residential or work address, both of which have a dynamic nature. Second, a possibility of conclusion biases [28] is linked to the temporal dimension, as variability may occur in the time between the occurrence of early symptoms and the report of a positive dengue case) [30]. Resulting conclusion biases may lead to errors in the estimation of the risk of dengue incidences in the form of either underestimation or overestimation of the local risk of the diseases [31,32,33], with the consequences of misevaluation of the spatial data and policymakers’ receiving biased evidence.

The high uncertainty which leads to conclusion biases can be overcome by applying higher and optimal kernel bandwidth [34] which is estimated by kernel density estimation [30]. Kernel density estimation is a non-parametric way to estimate the probability density function of a random variable based on a finite data sample. A curve of distribution is created by weighing the distance of all points in each specific location along with the distribution. As part of GIS analysis, kernel density estimation generates a summary of the spatial distribution (based on the point event when the disease occurred or is reported) in which each grid cell in the geographical positional map reflects the intensity of the corresponding process (e.g., disease incidence, medical cases, accident) at that location [35,36,37]. This method can identify the change of localized risk which is usually blurred in the aggregated data of a regional count [38, 39]. Further, it delimits the spatial extent of disease occurrences, allowing less computational time and thus reducing the time required to develop a dengue prevention strategy [30, 40].

Through this study, we aimed to develop spatial-temporal patterns of dengue case incidence by the application of the kernel density estimation to assess the spatial distribution of relative dengue risk in Bandung over a specific range of years. The study also enabled us to produce a map visualizing the relative risk of dengue in Bandung by indicating the hotspots of dengue transmission. This research improves on previous studies about the pattern of dengue spread which simply showed the total dengue incidence for a specific time without comparison to the number of cases in other areas, or without showing spatial and temporal fluctuations [6]. Furthermore, information on the location of hotspots may help policymakers to direct vector eradication activities as well as to plan more suitable strategies for the prevention of outbreaks and early warning information, including for communities to encourage individuals in reducing the risk of dengue cases in their region.

Methodology

Study area

Bandung is the largest city on the southern island of Java, located at 107o36′ east longitude and 6o55′ south latitude. It has a total area of 16,729.65 hectares and lies at 791 m above sea level (asl). Its highest point is in the northern area at 1050 m asl, while the lowest point is located in the southern area at 675 m asl. The southern part of the city consists mostly of sloping contours, while the northern part is far more mountainous. The city comprises 30 sub-districts with 151 villages and can be divided as north, south, east, and west boundaries [41].

Dengue epidemiological data

Monthly reports of diagnosed dengue cases from 16 hospitals in Bandung were collected from the Dengue Disease Surveillance database of the city health office. These reports were obtained for the period January 2014 to December 2016. Dengue cases were distinguished into five different categories based on symptoms and laboratory results (number of hemoglobin, leukocyte, thrombocyte, hematocrit, and NS1), namely undifferentiated fever, dengue fever, dengue hemorrhagic fever, dengue shock syndrome, and expanded dengue syndrome [42]. Undifferentiated fever is a simple fever similar to other viral infections that may be observed among patients who have been infected with dengue before, in particular for the first time (i.e., primary infection). Expanded dengue syndrome is a category of unusual manifestations with the involvement of dengue-associated symptoms in several organs such as the liver, kidneys, brain, or heart. In this study, we used confirmed dengue hemorrhagic fever cases only.

The demographic profile of each reported case included the patient’s age, gender, residential address, date of confirmed diagnosis, and reporting hospital. The data were analyzed anonymously and it was assumed that each infection occurred at the residential address of the patients. Population size data of 151 villages of Bandung for 2014–2016 were also obtained from the Bandung Central Bureau of Statistics for the year 2017. Data were further categorized by age group (Table 1). All of these data were then statistically analyzed using ANOVA and t-tests. All statistical testing was carried out by R-Studio.

Table 1 Age groups of dengue patients

Kernel density estimation for relative risk

Using the dengue epidemiological data from 2014 to 2016, we employed the kernel smoothing method [43, 44] to estimate a continuous spatial distribution of the relative risk of dengue incidences which we defined as the probability of the cases occurring during the period of dengue outbreak.

The probability was estimated from the spatial distributions of dengue cases (the residence address of the patients with confirmed dengue) and the temporal distribution of dengue cases at each area within Bandung (the time when the positive cases were reported). By defining the spatial distribution of dengue cases (f) as the spatial density function, and distribution of infected population at specific regions (g) as the temporal density function, then the dengue relative risk at spatial coordinate \( \overrightarrow{z}=\left(\begin{array}{c}x\\ {}y\end{array}\right)\epsilon W \) (W is Bandung) can be defined as

$$ r\left(\overrightarrow{z}\right)=\frac{f\left(\overrightarrow{z}\right)}{g\left(\overrightarrow{z}\right)} $$

Suppose \( \overrightarrow{d_i} \) and \( \overrightarrow{p_j} \) is a spatial coordinate for the i-th dengue case and a spatial coordinate for the j-th individual. Suppose m and n i are the number of dengue cases and the number of the total population in a year, respectively. Therefore, density function f and g can be defined as

$$ f\left(\overrightarrow{z}\right)\approx \hat{f}\left(\overrightarrow{z}\right)=\frac{1}{m}\sum \limits_{i=1}^m{K}_h\left(\overrightarrow{z}-{\overrightarrow{d}}_i\right) $$
$$ g\left(\overrightarrow{z}\right)\approx \hat{g}\left(\overrightarrow{z}\right)=\frac{1}{n}\sum \limits_{j=1}^n{K}_h\left(\overrightarrow{z}-{\overrightarrow{p}}_j\right) $$

A variable bandwidth selector was used in the multivariate kernel density estimation [45], which is represented in the following equation:

$$ {K}_h\left(\overrightarrow{w}\right)=\frac{1}{h_x{h}_y}K\left(\frac{\overrightarrow{w}}{h}\right)=\frac{1}{h_x{h}_y}\frac{1}{2\pi}\mathit{\exp}\left(\frac{-1}{2}{\overrightarrow{w}}^T{\left({\left(\begin{array}{cc}{h}_x& 0\\ {}0& {h}_y\end{array}\right)}^{-1}\right)}^2\overrightarrow{w}\right) $$

where \( \left(\begin{array}{cc}{h}_x& 0\\ {}0& {h}_y\end{array}\right) \) is the scaled bandwidth.

The optimum bandwidth used in this study was normal smoothing, which is the minimum asymptotic mean integrated squared error [37], calculated using the normal scale (NS) function in the sparr package of Rstudio. We employed normal smoothing to provide asymptotically optimal fixed bandwidths for spatial or spatiotemporal normal densities that showed for dengue incidences for each observation year.

Results

Hospitalized dengue cases

A total of 10,573 hospitalized cases were registered from January 2014 to December 2016, with an annual increase rate of 11.83% (Fig. 1). In 2014, the peak of dengue incidence happened in January and March, while for 2015 and 2016 the peak happened during the dry season (Fig. 2).

Fig. 1
figure 1

Number of dengue cases reported from the 16 local hospitals in Bandung from 2014 to 2016. Hotspot analysis helps in identifying clustered data points on a map. Therefore, using this method permits the averaging of the results, based on cumulative monthly averaging or seasonal averaging

Fig. 2
figure 2

Monthly dengue cases were reported from the local hospital in Bandung from 2014 to 2016

In general, most patients (on average 42.15 ± 1.4% for 3 years data) with reported dengue cases were children under 10 years old for all observation years, with the total number of patients consistently declining with age (Fig. 3). The pattern was relatively constant among the three observation years (ANOVA, p = 0.283). Also, more male (n = 5533) than female (n = 5040) dengue patients were reported during the observation period (Fig. 4), although the difference was not statistically significant (t test, p = 0.31). Annually, there were increasing trends of positive cases for young patients (Fig. 5) alongside a decreasing pattern of cases for an older, including much older, population (15 to 65 years old).

Fig. 3
figure 3

Age distribution of patients with dengue incidence reported by Bandung hospitals 2014–2016

Fig. 4
figure 4

Gender breakdown of age distribution of patients with dengue incidence reported Bandung hospitals 2014–2016

Fig. 5
figure 5

Distribution of cases among age groups reported by Bandung hospitals 2014–2016

Dengue risk estimation and spatial representation

Following normal smoothing calculation, optimal bandwidth for each observational year was estimated as follows (Table 2):

Table 2 Bandwidth value under normal smoothing

Further analysis employed the bandwidth calculation to estimate the relative risk and subsequently to map the risk into a spatial representation. The pattern of the relative risk of dengue in Bandung did not change significantly between 2014 and 2016. In general, villages with high relative dengue risk were more concentrated in the southern and southeastern regions of the city, while villages with low relative dengue risk were mostly found in the western and northeastern regions (Fig. 6). However, it is observable that a new aggregation of the disease incidence (hotspots) for dengue developed in the northern area of Bandung in 2015.

Fig. 6
figure 6

Spatial analysis of the relative risk of dengue in Bandung at the village level in a 2014, b 2015, and c 2016

Despite the change in 2015, the hotspots were relatively constant over time, with the majority found in southeastern areas over time. To estimate the relative risks over space and time we take into account the population at risk. The basic formulation of the relative risk is defined as:

$$ {\boldsymbol{\theta}}_{\boldsymbol{it}}=\frac{{\boldsymbol{y}}_{\boldsymbol{it}}}{{\boldsymbol{E}}_{\boldsymbol{it}}};\boldsymbol{i}=\mathbf{1},\dots, \mathbf{30}\ \mathbf{and}\ \boldsymbol{t}=\mathbf{2014},\mathbf{2015},\mathbf{2016} $$

where θit, yit and Eit denote the relative risk, number of incidences, and expected rate at area i and time t, respectively. The expected rate Eit is defined as [46]

$$ {\boldsymbol{E}}_{\boldsymbol{i}\boldsymbol{t}}={\boldsymbol{N}}_{\boldsymbol{i}\boldsymbol{t}}\frac{\sum_{\boldsymbol{i}=\mathbf{1}}^{\boldsymbol{n}}{\sum}_{\boldsymbol{t}=\mathbf{1}}^{\boldsymbol{T}}{\boldsymbol{y}}_{\boldsymbol{i}\boldsymbol{t}}/\boldsymbol{nT}}{\sum_{\boldsymbol{i}=\mathbf{1}}^{\boldsymbol{n}}{\sum}_{\boldsymbol{t}=\mathbf{1}}^{\boldsymbol{T}}{\boldsymbol{N}}_{\boldsymbol{i}\boldsymbol{t}}/\boldsymbol{nT}} $$

with Nit as the population at risk at area i and time t.

According to this formulation, the relative risks were estimated by taking into account the population at risk.

In this paper, we used the normal kernel density approach. “The optimum bandwidth used in this study was normal smoothing, which is the minimum asymptotic mean integrated squared error [37], calculated using the normal scale (NS) function in the sparr package of Rstudio to provide the asymptotically optimal fixed bandwidths for spatial or spatiotemporal normal densities that were shown for dengue incidences for each observation year.”

The local Moran’s I statistics is commonly used to identify hotspots. It is able to estimate density distribution of events at the local level, and identify statistically significant hot spots in a dataset. However, very often areas identified as hotspots from LISA results are not grouped into a high-density threshold on a kernel density estimation map. When analyzing and mapping spatial patterns (e.g., dengue incidence), it is important to ensure that the identification of hotspots is as accurate and effective as possible [47].

Underreported data in disease study is difficult to avoid. However, by assuming the underreporting occurs in all areas and times randomly, we believe that the underestimation problem is not an issue due to random error assumption.

Discussion

Dengue cases and seasonal changes

Dengue incidences in Bandung showed an increasing number from 2014 to 2016. In general, the pattern of hospitalized cases was remained relatively stable despite the change in hotspot areas. The pattern showed large numbers of cases early in the year followed by a peak period about May to June. The pattern is strongly related to meteorological factors such as rainfall and humidity which can been observed at different scope levels: village-wide (Faridah, unpublished data), region-wide (West Java province and its nearby areas) [48], and nationwide (other regions in Indonesia) [49,50,51] and consistent with reports from other regions of the world even though an effect of temperature is relatively insignificant in our study area as in other tropical countries [52,53,54].

Demographic analysis of hospitalized dengue cases in Bandung, West Java

This study also found that most of the patients were male, similar to previous studies from Cambodia, Malaysia, Sri Lanka, Singapore, the Philippines, and India [55,56,57]. One contribution to this finding might be the higher number of male workers in government offices and private enterprises in Bandung [58]. Thus, there could be a higher likelihood for males to be exposed to dengue-carrying mosquitoes during working hours (mostly daytime) at the workplace or while traveling to and from their workplace [57]. Another contributing factor may be a gender difference in susceptibility to certain serotypes, For example, Chang et al. (2018) [59] reported that males are more likely to be infected with DENV-1, and that dengue cases in females are mostly caused by DENV-3. In Bandung, DENV-1 is more commonly found than DENV-3 [60, 61]. Although to our knowledge, our study is the first to include information on the gender pattern of dengue cases, and our data did not include information on patient serotype. Thus, further studies are required to examine the higher incidence rate in males and particularly the relationship to serotype

The study also highlighted a strong demographic pattern of cases in children, with a significant increase in 2016. This result is similar to other data from a different city in West Java [48] which suggests that the pattern might occur across West Java. Furthermore, children under 10 years old were at the highest risk of dengue infection, which supports findings in the Philippines and Brazil that suggested a possible relation between age and an increase in immunity [62].

Spatial analysis of hospitalized dengue cases in Bandung, West Java

Our results confirmed that dengue cases in Bandung have a clustered distribution pattern among the local villages. High-risk areas of dengue incidences were mostly located in Bandung’s downtown, in the intersection of the area between Cibeunying, Karees, Tegalega, and Bojanegara. This area is one of the most economically productive in Bandung, where most of the best public services (education, government, health), industries, commercial enterprises, and recreation are concentrated and the rate of population growth is considered higher than in other regions due to a higher birth rate and urbanization [63]. This result supports previous studies that showed dengue incidence correlated with urban growth and urbanization [64,65,66,67,68,69].

The kernel density estimation allowed the generated map to show the development of new hotspots in the northern part of the city since 2015. This was possible because the estimation process adds the pattern of point event changes in an equation and compares it to other point events. Thus, although the number of cases may be much lower than in other regions, a positive change will be registered as a positive result of forecasting—in this case, showing an increase in the risk of dengue incidences. A possible explanation of the phenomenon may be related to the rapid development of the northern part of the city during the last 10 years. The development activities created a region that provides better education, job opportunities, recreation, and standard of living that have encouraged temporary rural-urban immigration from established hot spots [70]. This result is supported by similar studies in Thailand [71] and Brazil [72] that showed a significant impact of human movement on the transmission of dengue in a local area. While this study did not calculate the exact distance of each clustered dengue risk coordinate, the clusters shown in Fig. 6 did not span widely beyond the epicenters of the risk. This suggests that dengue transmission dynamics are limited to a small scale. Guzetta et al. (2018) [73] found that a large proportion of cases were transmitted via short-distance human movement (< 1 km) with a limited contribution of long-distance commuting within the city. Moreover, research conducted in Venezuela showed dengue infection often clustered in a smaller range which was within and around households and blocks in radius 20–110 m [14]. Dengue virus transmission is spatially heterogeneous because the dynamics are determined by a complex interplay between environmental and climatic factors, abundance and competence of vector species, human density and behavior, ecological interactions among viral strains, and profiles of immunity in the population [74, 75].

Another notable point in the generated spatial map (Fig. 6) is the annual changes of cluster size, even though the precise fluctuations of cluster size were not measured in this study. Further study is needed to confirm and analysis this phenomenon as it might also reflect inconsistency in dengue programs or underreporting of dengue cases. Previous studies found inherent inadequacy and inaccuracy in Bandung hospitals’ systems for reporting to the Bandung City Health Office. These studies suggested that the reporting system for dengue in Bandung should be evaluated and strengthened to minimize the number of unreported cases [6, 76]. Inaccuracy might also be one reason why no clusters of dengue risk were detected in the western part of Bandung.

The main limitation of this study is a broader inability to assess the underreporting of hospitalized patients’ data. The data we gathered from Bandung City Health Office covered only 60% of the hospitals that were registered in Bandung. The extent to which such partial data might have influenced the analysis result is unknown. However, the omissions are considered minor and the findings still applicable, as we gathered most of the data from type B hospitals, which generally act as the primary reference for health services in their respective areas.

A study by Tapia-Coyer is relevant to our findings. It showed that community empowerment is a key aspect of disease-eradicating strategy, as it allows a local population to drive the eradication of diseases in its environment. Hence, a sustainable process is necessary to encourage individuals to maintain efforts in keeping their environment free of dengue [74]. Knowledge of an area’s dengue risk may further enhance community members’ efforts to prevent dengue preventing contact with the vector and ultimately eradicating it.

Conclusion

This study demonstrates a strong pattern of reported cases related to specific demographic groups (males and children). Furthermore, spatial analysis using GIS also visualized the dynamic development of aggregation of the disease incidence (hotspots) for dengue cases in Bandung. In the future, observation of clustered hotspots in Bandung could provide strategic information for the planning and design of a dengue control program and may encourage communities to initiate their own dengue prevention programs. Existing surveillance methods should be evaluated and modified as needed based on the spatial analysis presented in this report so that local resources can be optimally utilized in a specific area. Moreover, the clustering distance information, which is around 900 m in radius, and demographic data can be used for detailed technical planning in dengue case countermeasures.