Background

Tuberculosis (TB) is a disease caused by Mycobacterium TB and transmitted through the respiratory tract. According to statistics, the number of TB infections is statistically high, with approximately a quarter of the global population infected with latent TB in 2014 [1]. Despite being preventable and curable, TB has emerged as the second leading cause of death globally in 2022 [2]. According to the latest report of the World Health Organization, 10.6 million people worldwide were suffering from TB in 2022, with 7.5 million new diagnoses of TB and about 1.3 million deaths [2], which showed that TB remains an important public health problem that jeopardizes public health worldwide [2, 3].TB poses a significant threat to public health in China, as of 2022, China reported 710,000 TB cases, making it the country with the third-highest TB burden globally [2]. Statistics from 2022 indicate that TB-related deaths in China rank second only to those caused by acquired immune deficiency syndrome (AIDS) among notifiable infectious diseases [4]. According to Lin et al. [5], systematic predictions using their model suggest that China may struggle to meet the global goal of ending TB by 2035 with its current prevention and control measures. The TB prevention and control situation in China is quite severe, necessitating further exploration of new strategies and methods. In recent years, spatiotemporal analysis has seen widespread application in clinical hotspot surveillance of infectious diseases. For example, using spatiotemporal analysis for epidemiological monitoring of COVID-19 cases [6], sociodemographic information on zoonotic cutaneous leishmaniasis was clustered based on spatiotemporal analysis [7]. This approach facilitates early detection and active case identification in high-risk areas, guiding the development of relevant prevention and control measures. Moran’s spatial autocorrelation analysis and spatial scanning statistics have emerged as widely employed methods to analyze the epidemiological characteristics of TB [8]. A systematic review of the spatial clustering of TB in the global general population revealed that the high-risk attributes of TB correlate with the spatial distribution of hotspots [9]. Identifying the geographical distribution of disease hotspots is helpful for recognizing high-incidence areas and formulating prevention and control measures [10].

Studies conducted in China have demonstrated that the incidence of infectious diseases across different administrative units (provinces, autonomous areas, and municipalities directly under the central government) exhibits temporal and spatial clustering as well as seasonality [11]. The spread and prevalence of TB vary across different cities in China [3]. According to Wei X et al. [12], high-risk areas for TB transmission are primarily located in western and south-central China. However, it is noteworthy that several provinces in eastern China, including Shanghai, are showing a potential upward trend in TB cases. This confirms that the epidemic trend, seasonal patterns, and spatial distribution of TB in different provincial geographical units deserve attention [13]. In addition to spatiotemporal analysis at the provincial and municipal levels [3], the study by Lai P et al. [14]. highlighted the importance of analyzing small-scale TB clusters at the street level. This approach is valuable for determining the prevalence patterns of TB in high-density urban areas. At present, TB prevention and control work in Shanghai is progressing steadily, but special attention should be paid to the prevention and control work in suburban areas [15]. Pudong New Area, the largest district in Shanghai with a permanent population of 5.78 million, warrants a thorough assessment of TB incidence. Understanding the TB situation in Pudong New Area will be essential for effective public health planning and intervention. This study primarily employed spatiotemporal analysis techniques to identify TB clustering areas, investigate the geographical distribution of TB incidence, and uncover gaps in TB prevention and control. The findings will provide a basis for precise TB prevention and control strategies.

Methods

Study area

In this study, the data of Pudong New Area of Shanghai (121°27’l8"E ~ 121°48’43” E, 30°08’20” N ~ 31°23’22” N) were analyzed. It is located in the east of Shanghai, at the mouth of the Yangtze River, and it is the area with the largest population in Shanghai. According to the latest administrative division of Pudong New Area, this study incorporated a total of 36 areas, including 12 streets and 24 towns, as shown in Fig. 1.

Fig. 1
figure 1

Geographical location of 36 towns/streets in Pudong New Area of Shanghai

Data sources

The TB data for the study was sourced from the Chinese Center for Disease Control and Prevention (CISDCP). This system, developed by the Chinese Center for Disease Control and Prevention, serves medical and health users nationwide. It is the world’s largest internet-based disease reporting system, established by the Chinese government after the SARS epidemic in 2003, focusing on the reporting of infectious diseases and public health emergencies [16]. The system utilizes a multi-layer management mechanism to ensure timely and effective data input into CISDCP [17]. After diagnosing cases, hospitals and clinics can directly report infectious diseases via the Internet [16]. Centers for Disease Control can obtain disease information in real time, conduct quality control, and upload the verified data to CISDCP. The number of TB cases was obtained from CISDCP in this study. The demographic data is sourced from the Shanghai Pudong New Area Statistical Yearbook. Considering that the regional division of Pudong New Area in Shanghai has been adjusted many times, to ensure the integrity and availability of demographic data, we extracted all TB case data and demographic data registered and reported in Pudong New Area in Shanghai from January 1, 2014, to December 31, 2023.

Establishment of a spatial information database

At the time of data abstraction, we used Excel tables to organize and construct a database of TB case information, annual total population, and seasonal changes from 2014 to 2023.

Map Digital GIS construction

Geographic Information Systems (GIS) [18] was used to recognize the spatial and temporal patterns of diseases. Taking the administrative planning map of Pudong New Area as the background, the vector maps of 36 towns/streets were established using ArcGIS 10.2 software. Through the coordinate system, GIS was created to implement the visualization of regional TB incidence results. It can be useful in providing readers with explicit and valuable information to comprehend and construe data distributions straightforwardly.

Spatial autocorrelation analysis

Global spatial autocorrelation: This method takes into account all spatial unit data in the entire study, and measures and evaluates the degree of spatial correlation and difference between them. Moran’s I is a spatial autocorrelation metric proposed by Moran [19]. Its calculation formula:

$$I = {{n\sum\nolimits_{i = 1}^n {\sum\nolimits_{j = 1}^n {{w_{ij}}\left( {{x_i} - \bar x} \right)\left( {{x_j} - \bar x} \right)} } } \over {\sum\nolimits_{i = 1}^n {\sum\nolimits_{j = 1}^n {{w_{ij}}{{\left( {{x_i} - x} \right)}^2}} } }}$$
(1)

where: n is the number of spatial units indexed by i and j; x is the variable of interest; \(\bar x\) is the mean of x; \({w_{ij}}\)are the elements of a matrix of spatial weights with zeroes on the diagonal(i.e., w = 0).

Moran’s I measures all the metrics that recognize the similarity of observed values of adjacent spatial unit attributes and determines the clustering pattern presented by measuring geographic elements at the global level. The value of I ranges from − 1 ≤ I ≤ 1. I > 0 indicates a positive spatial correlation. The closer the value is to 1, the higher the degree of spatial clustering. I < 0 indicates a negative spatial correlation. I = 0 indicates no correlation. We used Z-scores or P-values to recognize whether the indicators were significant. When Moran’s I > 0 and Z- score ≥ 1.96, P < 0.05, with statistical significance, the disease shows spatial clustering and statistical clustering.

Local spatial autocorrelation: Global spatial autocorrelation is used to reflect whether the attributes specified in the whole research scope are correlated, but it cannot be pointed out exactly where they are clustered, while local spatial autocorrelation can make up for the above shortcomings. Anselin [20] proposed in 1995 that Local Moran’s I index should be used to measure Local correlation. The formula for calculating Local Moran’s I is as follows:

$${I_i} = {{\left( {{x_i} - \bar x} \right)} \over {\sum\nolimits_{i = 1}^n {{{\left( {{x_i} - \bar x} \right)}^2}} }}\sum\limits_{j = 1}^n {{w_{ij}}\left( {{x_i} - \bar x} \right)}$$
(2)

Where, \({I_{i}}\) represents the local Moran’s I index of the region, and its relationship with Global Moran’s I is

$$I = \sum\limits_{i = 1}^n {{{{I_i}} \over n}}$$

Using local spatial autocorrelation analysis to identify the correlation between local areas and the distribution of diseases [21, 22]. In the Local indicator of spatial autocorrelation (LISA) plot, using local Moran’s I statistics to determine statistical significance, we identified four spatial relational clusters for the analyzed variables, for example, high-high-value clustering areas, low-low-value clustering area, low-high-value clustering areas and high-low-value clustering areas [22]. A positive value I denote that the element has an attribute value that contains the same high or low adjacency factor, which becomes part of the cluster, whereas a negative value I contain the adjacency factor with different values, which is an outlier and not part of the cluster.

Space-time scan analysis

The spatial autocorrelation analysis mentioned above primarily focuses on spatial clustering without incorporating the time factor, thus hindering the determination of temporal trends in geospatial clustering areas. To address this limitation, SaTScan software developed by Kulldorff [23] offers an effective solution. The statistical method of space-time scan analysis considers both space and time, and the statistical method of spatiotemporal scanning analysis considers both space and time simultaneously. Space-time scan analysis is a clustering method based on a dynamic cylindrical window that has a circular geographic graphic basis, the window moves in space and space, and the height corresponds to time. The incidence of TB is assumed to follow the Poisson distribution, and the Poisson probability model is used to construct the log-likelihood ratio (LLR) under this assumption. Statistical tests are then performed using the Monte Carlo simulation method and p-values are calculated. Large LLR values and statistically significant differences indicate that the region contained under this dynamic window has a higher probability of aggregation [24, 25].

Statistical analysis

We utilized GIS 10.2 to illustrate the spatial pattern of annual TB incidence across 36 towns/streets in Pudong New Area from 2014 to 2023. To determine the presence of spatial autocorrelation at the regional level, we conducted an analysis using Geoda 1.22.0.4 software. Additionally, we performed space-time scan analysis with SatScan 10.1.3 software to depict the spatiotemporal distribution of TB incidence.

Results

Epidemiological characteristics

A total of 14,104 cases of TB were reported in the Pudong New Area of Shanghai from January 2014 to December 2023. The characteristics of 14,104 TB cases are shown in Table 1. The male-to-female ratio was 1.99:1, with 9394 males and 4710 females. The age group of > 60 accounted for the largest proportion (35.05%), and the main affected group was retirees (26.71%).

Table 1 Demographic characteristics of patients with TB in Pudong New Area, Shanghai, 2014–2023

From 2014 to 2022, with an average annual incidence of 25.14 cases per 100,000 persons, the incidence of TB showed a downward trend, with the most significant decline in 2022. The annual incidence of TB in 2023 has increased slightly, but it was still lower than the incidence of TB in 2021 and before. As shown in Fig. 2.

Fig. 2
figure 2

Overall TB prevalence in Pudong New Area, Shanghai, 2014-202. Distribution of TB in 36 towns/streets

In the study area comprising 36 towns/streets, the reported annual incidence of TB has exhibited a decreasing trend from 2014 to 2023 across various towns/streets, as illustrated in Fig. 3.

Fig. 3
figure 3

Overall distribution of TB in 36 towns/streets, 2014–2023

Utilizing the annual incidence of the district as the baseline (25.14 cases per 100,000), a comparison was made between the light blue and light red dividing lines, as depicted in Fig. 4. The impact of the novel coronavirus epidemic in 2022 resulted in reported TB incidences in each area falling below the baseline. Furthermore, over time, there has been a decline in the number of towns/streets where TB incidences surpass the baseline, evident by the gradual transition from the red area to the blue area in Fig. 4. In the longitudinal comparison across different years within the same street, it is observed that the reported incidence of TB in Laogang Town has consistently remained lower than the baseline over the past decade. Conversely, the reported TB incidences in Hudongxincun Street and Tangqiao Street have been higher than the baseline for nine consecutive years.

Fig. 4
figure 4

Comparison of TB baseline in 36 towns/streets, 2014–2023

Spatial autocorrelation analysis

Global spatial autocorrelation analysis of the annual incidence of TB in the study in the Pudong New Area from 2014 to 2023 showed that Moran’s I was significant (Table 2). Among them, the incidence of TB from 2014, 2016 to 2018, and 2022 showed a positive spatial clustering correlation (Moran’s I value > 0, and z value > 1.96, and P value < 0.05), of which 2014 (Moran’s I value = 0.457) and 2022 (Moran’s I value = 0.384) had the highest positive spatial correlation, as shown in Table 2.

Table 2 Results of Moran’s I analysis of the annual incidence cases of TB, 2014–2023

From 2014 to 2023, local spatial autocorrelation analysis of the annual incidence of TB in the Pudong New Area revealed the presence of high-high-value clustering areas (hot spots) and low-low-value clustering areas (cold spots). We conducted a summary and analysis of these “hot spot” and “cold spot” clustering areas at a test level of 0.05, identifying the highest statistically significant clustering areas in 2022 (a total of 12 clustering areas), followed by 2014 (11 clustering areas), 2018 (11 clustering areas), and 2019 (11 clustering areas). According to the survey, there have been slight changes in the TB hotspots in the area each year. From 2014 to 2023, TB hotspots were consistently identified in Caolu Town, Gaoqiao Town, Gaohang Town, Gaodong Town, Jinqiao Town, and Hudong Xincun Street in the northeast of the district, with Caolu Town emerging as a hotspot for five years, as shown in Fig. 5.

Fig. 5
figure 5

Local spatial autocorrelation analysis of TB in 36 towns/streets, 2014–2023

Space-time scan analysis

From 2014 to 2023, there exists one most likely cluster spatiotemporal clustering area and one secondary spatiotemporal clustering area. The first-level spatiotemporal clustering area was located in 12 towns/streets, including Cao Lu, Gaodong, Gaohang, Hudong, etc., with a period from 2014 to 2018. As the most likely cluster, the first-level spatiotemporal clustering area contained 3,125 cases, with an expected number of 2,457 cases, and a radiation radius of 15.74 km (log-likelihood ratio LLR = 102.78, relative risk RR = 1.35), with a difference of P < 0.001, which was statistically significant, as shown in Fig. 6.

Fig. 6
figure 6

Spatiotemporal clustering analysis of TB incidence, 2014–2023

Heat map of seasonal distribution of TB incidence

This area exhibits a seasonal correlation. Except for the epidemic years (2020–2022), late spring (March) marks the high period of TB incidence, while February sees lower incidences. The seasonal peak period varies from year to year, with summer (April-June) and autumn (July-September) being the clustered peak periods for TB incidence. This seasonal pattern is most prominent from 2014 to 2021, as illustrated in Fig. 7.

Fig. 7
figure 7

Heat maps of seasonal distribution of TB in 36 towns/streets

Discussion

To the best of our knowledge, this study represented the first attempt to elucidate the trends in TB incidence and its spatial patterns in the Pudong New Area. There were 14,104 TB patients in the region in the past decade, the ratio of male to female patients was 1.99, slightly higher than in another study in China (1.83) [26]. In addition, the number of elderly TB patients > 60 years old in this area is the largest, which is consistent with the study of Li SJ, et al. [27]. It is worth noting that the proportion of retired TB patients in this area is the highest, which indicates that we need to strengthen the prevention and control of TB in the elderly.

Over the past decade, the incidence of TB in this area has exhibited a decreasing trend, consistent with national and some provincial studies [12, 25, 28]. This can be largely attributed to the effective guidance of TB prevention and control management in China, coupled with the high attention paid to TB prevention and control by municipal governments and health administration departments [29]. In recent years, relevant municipal departments in Shanghai have diligently executed the objectives outlined in the “13th Five-Year Plan for TB Prevention and Control in Shanghai.” These efforts include enhancing the capacity and awareness of TB prevention and control, increasing investment in research funds, and effectively managing the diagnosis, treatment, and care of TB patients [30].

It was pertinent to highlight the observed differences in TB incidence among the towns/streets. Throughout the study period, there has been a downward trend in the number of towns/streets with reported TB incidence exceeding the baseline. Upon longitudinal comparison across years within each area, it was found that only Laogang Town consistently exhibited low TB incidence over ten years. Conversely, Hudong Xincun Street and Tangqiao Street reported higher than baseline TB incidence for nine consecutive years, potentially influenced by local population density. It is worth noting that the population density of the latter two areas is three times that of the former. Elevated population density tends to facilitate germ transmission. Moreover, internal migration mobility [31] could emerge as a significant factor influencing TB incidence, given the observed correlation between migration and TB incidence.

Spatial autocorrelation analysis has revealed that the spatial distribution of TB incidence was not random. The high clustering observed in 2014 has not found similar results in relevant studies. We think it may be that the TB prevention and control measures were not perfect in the early years compared with the present, and 2014 is the first year of our research, so we will observe that the clustering is higher than that in other years. Conversely, the heightened concentration in 2022 can be explained by the severe impact of the novel coronavirus epidemic in Shanghai during that year, resulting in delays in diagnosis and treatment for potential TB cases. Moreover, localized transmission of the novel coronavirus, alongside the invasion of TB, heightened the susceptibility of the population to TB infection, thereby increasing the risk of TB clustering. However, it’s noteworthy that low staff turnover during this period likely mitigated the spread of TB on a larger scale. This phenomenon serves as a compelling confirmation that 2022 represented a high-concentration area of TB, despite the overall low annual incidence.

Further local spatial analysis revealed a growing trend in the number of clusters of TB incidence as time progressed, this finding aligned with the results of Guo C [32] in their study on TB in mainland China. Our study highlighted that high TB incidence was primarily concentrated in the northeast of the district, especially, Caolu Town emerged as a hot spot for five years. These trends may be attributed to local population dynamics, economic development, and healthcare infrastructure. The study underscored the importance of strengthening TB-related health education, conducting regular job training, and enhancing the management system for TB. Additionally, variations in TB incidence among different towns/streets and over time suggest that TB prevention and control efforts have not yet reached a stable level. Therefore, each area must consider past trends in the TB epidemic when formulating prevention and control strategies for the upcoming years, ensuring strict adherence to overall prevention and control measures.

In our study, one primary cluster and one secondary cluster indicate divergent time trends in TB prevalence across different towns/streets. The primary spatiotemporal clustering area was observed during 2014–2018, suggesting that the burden of TB incidence decreased after 2018. This decline could be attributed to heightened attention from national and local authorities towards TB prevention and control measures [30]. Notably, the issuance of the “National Tuberculosis Prevention and Control Plan (2011–2015)” by the General Office of the State Council in November 2011 marked a pivotal step, outlining a series of strategies for TB prevention and control [33]. Subsequently, during the implementation of the “Thirteenth Five-Year Plan” (2016–2020), Shanghai continued to enhance its TB prevention and control service system in alignment with these guidelines, further bolstering the effective management of TB [30]. The results of space-time analysis revealed that TB clustering primarily occurred in the northeast of the district. Factors influencing TB clustering areas include population mobility, lifestyle choices, environmental factors, and socioeconomic disparities. In high-concentration areas, the health system, led by the CDC and in collaboration with community health service centers, should carry out targeted health interventions based on community diagnosis results. The northeastern part of the region, characterized by a mix of urban and rural areas and a large population flow, requires focused TB health education and active screening of key and high-risk groups. These efforts aim to maximize the early detection of patients and latent infections.

Further exploration was conducted into the seasonal hot spots of TB distribution. Regarding the reporting of seasonal TB incidence, excluding the three-year epidemic period (2020–2022), the end of spring (March) emerged as the peak clustering period for TB reports. This finding aligns with national TB surveillance results [34]. The high incidence of TB reported in March can be attributed to delays in consultation and diagnosis of TB patients [35]. This delay likely results in a backlog of cases being reported in March. The formation of peaks in March may be linked to the window period of increased susceptibility to TB, which typically commences in winter. Patients infected with TB may exhibit symptoms and seek medical treatment within 2–3 months. Notably, there is a correlation between vitamin D deficiency and the development of TB [36]. Vitamin D deficiency is more common in TB patients than in the normal population [37]. Normal vitamin D metabolism activates macrophages, effectively inhibiting the intracellular production of TB. Low synthesis of vitamin D in winter poses a risk of latent TB infection for susceptible individuals [38].

Our study also found that the peak stage of the disease was different each year, and the specific reasons remain to be explored, but these may be mixed with non-seasonal factors because Wang H, et al. [39] found that there was a correlation between the seasonal appearance of environmental air pollutants and TB. The high-incidence seasons reported in our study are summer (April-June) and autumn (July-September). Our findings coincide with the seasonality of TB reporting observed in various areas, including Hubei (late spring/early summer) [28], Guangxi (spring and summer) [40], Germany (late spring/late summer) [41], Japan (summer-fall) [42], Pakistan (April-June) [43], and New York (March-May) [44]. These variations in reporting periods may be attributed to differences in geographical areas among the subjects included in various studies, as well as the influence of local climate and environmental factors on TB spread [45]. Furthermore, the increased latent disease activity during winter and early spring may contribute to a surge in patient visits and subsequent seasonal reporting in the following months. In summary, the observed seasonal peak in reported TB incidence underscores the temporal concentration of cases. Therefore, intensifying efforts in TB prevention and control during winter and spring, and enhancing awareness of protective measures, will aid in mitigating the issue of seasonal TB incidence.

Our study has several limitations that should be acknowledged. Firstly, the analysis conducted focused solely on spatiotemporal patterns of TB incidence. Regional-scale effects may introduce errors in the results, potentially masking correlations with factors such as regional economy, population education level, and healthcare infrastructure. Secondly, while our study identified seasonal patterns in TB incidence, we did not obtain relevant data to explore the influence of local temperature and environmental factors on seasonal TB incidence. This represents a gap in our analysis and warrants further investigation in future research. Lastly, the data for our study were obtained from the self-monitoring system, and there may have been missing information on TB patients, leading to potential data bias in our analysis. Acknowledging these limitations is crucial for interpreting the findings of our study accurately and for guiding future research efforts to address these gaps.

Conclusions

In our study, spatiotemporal analysis has demonstrated its reliability in epidemiological research. Over the period from 2014 to 2023, the reported incidence of TB in the Pudong New Area of Shanghai exhibited a declining trend. Our study has identified spatiotemporal clustering as well as seasonal correlations in TB incidence. High clustering areas were primarily concentrated in the northeastern part of the district, with March identified as the period of peak TB incidence. Relevant departments should make effective use of information from high-risk areas and selectively expand the scope of health prevention and control, especially in areas with high incidences of TB, which will be the next key work in the Pudong New Area.