Abstract
Background
Geographical variations in mood and psychotic disorders have been found in upper-income countries. We looked for geographic variation in these disorders in Colombia, a middle-income country. We analyzed electronic health records from the Clínica San Juan de Dios Manizales (CSJDM), which provides comprehensive mental healthcare for the one million inhabitants of Caldas.
Methods
We constructed a friction surface map of Caldas and used it to calculate the travel-time to the CSJDM for 16,295 patients who had received an initial diagnosis of mood or psychotic disorder. Using a zero-inflated negative binomial regression model, we determined the relationship between travel-time and incidence, stratified by disease severity. We employed spatial scan statistics to look for patient clusters.
Results
We show that travel-times (for driving) to the CSJDM are less than 1 h for ~50% of the population and more than 4 h for ~10%. We find a distance-decay relationship for outpatients, but not for inpatients: for every hour increase in travel-time, the number of expected outpatient cases decreases by 20% (RR = 0.80, 95% confidence interval [0.71, 0.89], p = 5.67E-05). We find nine clusters/hotspots of inpatients.
Conclusions
Our results reveal inequities in access to healthcare: many individuals requiring only outpatient treatment may live too far from the CSJDM to access healthcare. Targeting of resources to comprehensively identify severely ill individuals living in the observed hotspots could further address treatment inequities and enable investigations to determine factors generating these hotspots.
Plain language summary
The frequencies of mental disorders vary by geographic region. Investigating such variations may lead to more equitable access to mental healthcare and to scientific discoveries that reveal specific localized factors that contribute to the causes of mental illness. This study examined the frequency of three disorders with a major impact on public health – schizophrenia, bipolar disorder, and major depressive disorder – by analyzing electronic health records from a hospital providing comprehensive mental health care for a large region in Colombia. We show that individuals receiving outpatient care mainly live relatively near the facility. Those receiving inpatient care live throughout the region, but cluster in a few scattered locations. Future research could lead to strategies for more equitable provision of mental healthcare in Colombia and identify environmental or genetic factors that affect the likelihood that someone will develop one of these disorders.
Similar content being viewed by others
Introduction
Geographical variation in the distribution of mood and psychotic disorders has been observed, and extensively studied, in upper-income countries (UICs)1,2,3,4,5. These observed patterns may reflect inequities in the geographic accessibility of mental healthcare. Such inequities were first observed, and quantified, in a study published in 18521. This, and subsequent studies3,5,6,7 used statistical models to identify a relationship between mental healthcare utilization and distance. Specifically, these studies identified the phenomenon termed distance-decay, whereby the utilization of a healthcare facility declines in direct proportion to the distance (or travel-time8) to the facility1,2. Studies in UICs have also indicated that geographical variations in the observed frequency of particular mental health disorders and outcomes may reflect localized social, environmental, and genetic factors associated with treatment utilization or disease risk9,10,11,12,13,14,15.
The longstanding availability in UICs of population registries and other public databases has provided crucial resources that have enabled geospatial investigation of treatment for mental health disorders, as exemplified by the several studies referenced above. A lack of comparable data repositories has traditionally limited the opportunities for conducting such research in low- and middle-income countries (LMICs). Recently, however, electronic health record (EHR) databases have become available in some LMICs and enable population-level investigations of individuals who have sought treatment for these disorders. As we describe here for a middle-income country, Colombia, analyses of such databases may provide insights that could not be obtained using previously available forms of data.
The main sources of population-level data on mental health disorders in Colombia have been four National Mental Health Surveys, conducted, most recently, in 201516,17. The 2015 National Survey is an observational cross-sectional household-survey conducted in a nationally representative sample of 15,351 individuals aged seven and above, which obtained interview and self-report data. It has provided information on a wide range of topics (e.g., the prevalence of mental health and substance use disorders and on perceptions of access to care) that are important for our understanding of mental health disorders in Colombia and for efforts to improve the provision of mental healthcare.
Several critical types of information, however, are not available from the National Survey, but may be obtained by analyzing EHR databases; these include the distribution of the most severe and acute presentations of mental health disorders (the National Survey obtained little data on psychotic disorders), and the actual utilization of mental healthcare services. Additionally, as the National Survey sampled sparsely in any given location, its data cannot be used for geospatial investigation of mental health disorders. In contrast, as we demonstrate here, analyses utilizing the EHR database of the Clínica San Juan de Dios Manizales (CSJDM), provide a detailed picture of geospatial variation in treated mental illness across the entire department (state) of Caldas.
The CSJDM provides comprehensive mental healthcare for the one million inhabitants of Caldas. Located in the metropolitan municipality of Manizales, it provides mental health services to all individuals seeking care, regardless of health insurance status or other socioeconomic factors. We have identified and characterized geographic variation in the number of individuals who have sought treatment for the mood and psychotic disorders that most commonly result in psychiatric hospitalizations (bipolar disorder [BPD], schizophrenia [SCZ], and major depressive disorder [MDD]) and quantified differences, over the department, in geographic accessibility to mental healthcare, considering both individuals whose illness required inpatient care as well as those who were treated only as outpatients. We did so by combining diagnostic code data from the CSJDM EHR for the years 2005–2018, with publicly available geospatial data.
We show that travel-times (for driving) to the CSJDM are less than 1 h for ~50% of the population and more than 4 h for ~10%. We find a distance-decay relationship for outpatients, but not for inpatients: for every hour increase in travel-time, the number of expected outpatient cases decreases by 20%, an effect that is primarily driven by outpatients with MDD. Further, we identify nine hotspots where inpatient cases cluster. Notably, one hotspot comprises a nearly six-fold overrepresentation of inpatient BPD cases and crosses the administrative boundaries of two municipalities.
The results of the geospatial analyses that we performed can inform the design of programs aimed at reducing inequities in access to mental healthcare for mood and psychotic disorders. Additionally, they suggest the need for future studies aimed at identifying mechanisms for the geographical variations in disease incidence that we observed across the department of Caldas.
Methods
Inclusion criteria
We employed a retrospective cross-sectional design. Since 2005 the CSJDM has maintained EHRs on all outpatient visits and inpatient admissions. We utilized three forms of structured EHR data covering 2005–2018: demographic information, residential address, and diagnostic codes using the International Statistical Classification of Diseases and Related Health Problems: 10th Revision (ICD-10)18. The Institutional Review Boards (IRBs) at CSJDM and UCLA provided approval for the study, which involved uploading a version of the EHR database with patient names, dates (of birth and healthcare service), and with ID numbers removed, to a HIPAA-compliant Amazon Web Services server. Individual informed consent was not required for this study since only anonymized data were used. Gender was obtained through self-report and used as recorded in the EHR.
Using the EHRs for all adult patients 18–90 years, residing anywhere in Caldas, we focused on the 16,376 individuals who were assigned a diagnosis, at their first visit of BPD (diagnostic code: F31), SCZ (diagnostic code: F20), or MDD (diagnostic codes: F32 or F33). We used the initial diagnoses to eliminate possible bias introduced by right censoring of the diagnostic data, and did not use the longitudinal, repeated measures aspect of the EHR data in these analyses. In prior work, we have shown that diagnostic codes recorded in this EHR are highly accurate compared to diagnoses obtained by manual chart review by a psychiatrist19. We stratified analyses based on hospitalization status.
Geocoding workflow
We identified the subset of diagnosed individuals with available residential addresses (N = 16,308). We assigned longitude and latitude coordinates to each patient’s address using the Spanish language setting of OpenCage’s geocoder through the R package opencage20. As input, OpenCage takes an address as free text. Prior to geocoding, we formatted addresses according to the following procedures. Most addresses (N = 11,745) fit within a grid pattern that is typical throughout Colombia, comprised of parallel streets called “calles” and perpendicular streets called “carreras”. We formatted addresses of this type according to the pattern “Calle [X], Carrera [Y], [Municipality], Caldas”, where X is the street number for the “calle”, and Y is the street number for the “carrera”. The remaining addresses (N = 4563) were situated in small rural areas called “veredas” which have few roads. For these addresses, we took the name of the vereda or other address entry and appended the name of the municipality and department.
OpenCage provides a list of potential longitude and latitude coordinates with corresponding accuracy confidence scores. If multiple potential matches were returned, the most relevant was selected, based on the OpenStreetMap data21 relevance criteria for Colombian addresses. We excluded addresses with OpenCage confidence scores lower than 6. Using this threshold for confidence scores ensured that all addresses included in the study were within a 7.5 km bounding box measured diagonally and yielded geographic coordinates for 16,295 of the 16,308 patient addresses; all analyses reported here focus on these 16,295 patients.
Table 1 provides a summary of the dataset stratified by diagnosis (BPD, SCZ, or MDD), gender, and hospitalization status. All individuals had the location of their residence georeferenced to a 7.5 × 7.5 km bounding box; for 71% of them we could define the location of their residence to a 0.5 × 0.5 km bounding box.
Calculating the incidence rate of mood and psychotic disorders
We first calculated the annual incidence per 100,000 individuals of newly diagnosed patients (both in- and outpatients) for each of the 27 municipalities within the department. For each year between 2005–2018, we divided the number of newly diagnosed patients in that specific year in each municipality by the estimated number of residents of that municipality in that year. The number of residents in a municipality was estimated using the WordPop database22, which includes gridded spatial demographic estimates. For most municipalities, the annual incidence of newly diagnosed cases was very low; we therefore conducted all our analyses based on the 14-year aggregate of the annual incidence. We subsequently referred to this aggregated value as the incidence rate and report this rate per 100,000 individuals.
We also calculated the incidence rate for each diagnosis, separately for in- and outpatients, at a more fine-grained geographic level unconstrained by municipality boundaries. To make these calculations, we imposed a grid (with a cell size of 5 km by 5 km) on the map of Caldas and counted the number of residents in each cell. Following the definition of the incidence rate at the level of the municipality, the incidence rate at the cellular level was defined as the number of newly diagnosed cases in the cell (between 2005 and 2018), divided by the total number of residents in that cell in the year 2018, and report this rate per 100,000. To visualize incidence rates, we generated maps using the R package ggplot2 v3.1.1 and shapefiles from the National Statistics Department of Colombia (DANE)23.
Calculating a friction-surface, and a geographic accessibility, map
To quantify the geographic accessibility of mental healthcare for the population of Caldas, and to identify a relationship between the distribution of cases and the travel-time to the CSJDM, we constructed a friction surface map and subsequently a geographic accessibility map. We first plotted maps of Caldas using shapefiles from the National Statistics Department of Colombia (DANE)23. We then used the software package AccessMod5 v5.6.024 to construct the friction surface map. This map quantifies the minimum amount of time that it takes to traverse one square kilometer, assuming driving time using a motorized vehicle on roads and walking time if no roads exist, while accounting for elevation changes and ground cover. To construct the map we combined geospatial data on topography25, land cover26, water bodies27 and road networks23 in the following stages: (1) We aggregated the topographic data to match the resolution (100 meters by 100 meters) of the WorldPop data. (2) We merged the land cover data, with data on roads and inland (permanent) water bodies, and with the aggregated topographic data. (3) We calculated travel-times using AccessMod, basing the estimated travel-time for each pixel on the quickest mode of transportation; Supplementary Table 1 lists the average speeds used to compute travel time, stratified by mode of transportation (see also Supplementary Material Section C in ref. 28). We then generated a geographic accessibility map by plotting the geographic location of the CSJDM on the friction surface map and computing the travel-time to the CSJDM from the centroid of each one square kilometer grid cell. We employed the option for anisotropic analysis, which uses the elevation data to modify travel speeds for walking.
The incidence rate of mood and psychotic disorder in relation to the travel-time to the CSJDM
To make these calculations, we modeled the expected number of cases by travel-time in hours, accounting for the number of individuals in each grid cell of the geographic accessibility map. We used a zero-inflated negative binomial regression from the R package pscl v1.5.5, a procedure that accounts for both the large number of grid units with zero cases and over-dispersion of cases. Normal 95% confidence intervals and two-sided p-values were calculated for each relative risk (RR) based on the distribution generated from 1200 bootstraps using the R package boot v1.3-20. We performed this computation for inpatients and outpatients separately, considering: (i) all three diagnoses together (i.e., two comparisons), and (ii) the three diagnoses separately (i.e., six comparisons). We applied Bonferroni thresholds to correct for eight independent tests, with a significance threshold of 0.00625 (0.05/8). To evaluate possible heterogeneity in the observed relative risk by gender, we conducted sensitivity analyses, including in the model gender, and a gender by travel-time interaction.
Identification of hotspots for residence of inpatients
To identify possible hotspots of residences of inpatients, we used the gridded map of Caldas showing the incidence rate of cases (for BPD, SCZ, and MDD combined, and separately). We then applied Kulldorff’s Spatial Scan Statistic29, implemented in the software package SaTScan v9.630. SaTScan uses a circular window to scan a gridded surface. The window varies in size from one that encompasses a single grid cell to one that contains a predefined maximum population size. Use of a varying window size enables identification of hotspots unconstrained by municipality administrative boundaries. We used a cell size of 5 km by 5 km and set the maximum window size to encompass 25% of Caldas’ population. SaTScan identifies hotspots by maximizing a likelihood ratio function, and assesses their statistical significance using a one-sided Monte Carlo hypothesis procedure29. To account for multiple testing for four different tests, we applied a Bonferroni correction with a significance threshold of 0.0125 (0.05/4). As a sensitivity analysis, we also performed separate hotspot analyses based on gender. We selected 25% as the maximum hotspot size as it is frequently considered the largest window with potential future utility for policymaking31. To evaluate the effect of this threshold, we conducted a sensitivity analysis exploring, for each diagnosis, how the identified hotspots changed in size/location when varying the maximum allowable size of the reported cluster32.
We also employed SaTScan to quantify and visualize the uncertainty in defining the borders of hotspots, using Oliveira’s F function33. This function assigns an intensity value, ranging from zero to one, to each grid cell surrounding a hotspot, reflecting the probability that the grid cell belongs to the hotspot. To visualize hotspots and the uncertainty in their borders, we used QGIS v3.1034. The Oliveira’s F values were colored in categories split by Jenk’s natural breaks35,36, a setting in QGIS that enables better visualization of potential borders.
Results
Inequities in the geographic accessibility of mental healthcare in Caldas
The Central Cordillera of the Andes Mountains divides Caldas into Eastern and Western sections, with the Western side containing the capital city, Manizales, and most towns (Fig. 1a, b). This division is evident in the geographic accessibility map (Fig. 1c), which shows the locations for which the CSJDM is the most inaccessible (travel-time (driving) of greater than 5 h, orange-red colored areas); virtually all these sites lie east of the Central Cordillera. While these relatively inaccessible locations, some of which are more than 10 h in travel-time from CSJDM, are geographically sizable, they include a relatively small proportion of the department’s population. Most of the population can access the hospital with less than 1 h of travel, while 90% can access it within approximately 4 h (Fig. 1d, see also Supplementary Table 2).
Geographic variation in incidence rates
The pattern of geographic variation in incidence rates, differs substantially between inpatients and outpatients (Figs. 2 and 3). For outpatients, by far the highest incidence rate is in the municipality (Manizales) which includes the CSJDM (Fig. 3). Notably, we found that the observed pattern for outpatients reflects strong distance-decay (as measured here by travel-times8); for every 1-h increase in travel-time to the CSJDM, the number of expected outpatients decreases by 20% (RR = 0.80, 95% confidence interval (0.71, 0.89), p = 5.67E-05, zero-inflated negative binomial model, Supplementary Table 3).
For inpatients, by contrast, we found no significant distance-decay, either for the incidence rate overall (i.e., for BPD, SCZ, and MDD combined) or for that of any of the three diagnoses considered separately (Supplementary Table 3. The overall incidence rate (Fig. 2a) is highest in a municipality, Aranzazu, located ~55 km from the CSJDM. This pattern reflects the high concentration of inpatient mood disorders (BPD and MDD) cases from this municipality compared to the department, overall (Fig. 3a): the incidence rate of inpatients with BPD is nearly fourfold higher (862/100,000 vs. 219/100,000), and the incidence rate of inpatients with MDD is more than 1.5 times higher (364/100,000 vs. 235/100,000). For MDD this incidence rate is nearly as high as that in Manizales (387/100,000). For SCZ (Fig. 3a) the incidence rate is highest in three neighboring municipalities located ~111 km from the hospital, where it is about twofold higher than in the department, overall (93/100,000 vs. 45/100,000). We found no significant interactions between travel-time and gender.
Identification of hotspots for inpatients with mood and psychotic disorders
We identified nine distinct hotspots: eight hotspots were identified for the broad category of all inpatients with either BPD, SCZ, or MDD (Clusters 1–8, Table 2, and Figs. 2b, 3b), while one hotspot was unique to inpatients with BPD (Cluster 9). Seven of the nine hotspots were also found in our diagnosis-specific analyses (Clusters 1–6, 9, Table 2, and Figs 2b, 3b); i.e., they were hotspots for inpatients with either BPD, SCZ, or MDD.
In our diagnosis-specific analyses, we identified seven hotspots in total (Clusters 1–6 and Cluster 9, Table 2, Fig. 3b), all of which show a significant overrepresentation of inpatient BPD cases, and three of which (Clusters 1, 2, and 5) cross municipality boundaries. Cluster 2 is also a hotspot for MDD inpatients, while Clusters 1, 3, and 4 are hotspots for inpatients across all three diagnoses. The hotspot that includes the grid cells around the hospital (Cluster 1) displays the highest likelihood ratio for each of the three diagnoses, as well as for overall cases of MDD inpatients. This observation reflects the fact that many more patients reside in this metropolitan area than in any other region of the department; in particular, this hotspot is home to 1476 inpatients with MDD (60% of MDD inpatients in the department) and shows the greatest overrepresentation of this diagnosis (RR = 5.47, p = 1.00E-17). Other hotspots, however, show patterns of overrepresentation that appear to differentiate them by diagnosis. Cluster 9 (Fig. 3b) is specific to BPD and is not identified in the cross-diagnostic analyses.
Cluster 2 denotes the hotspot comprising the most extreme overrepresentation of inpatient BPD cases (RR = 5.83, p = 1.00E-17, Table 2 and Fig. 3b). It is the largest of the BPD hotspots in terms of geographical extent (covering four grid units: ~20 square kilometers) and crosses the administrative boundaries of two municipalities (Aranzazu, the municipality with the highest incidence rate of BPD [Fig. 3a/b] and Filadelfia); it is located about 1.5 h driving time from the hospital. This hotspot displays an extremely high average annual incidence of BPD hospitalizations (862/100,000) and also shows an overrepresentation of MDD but not of SCZ hospitalizations; it may therefore constitute a hotspot specific for inpatients with mood disorder diagnoses. Cluster 3, by contrast, appears to be a hotspot for inpatients overall, as evidenced by the overrepresentation of all three diagnoses, despite being located 5 h from the hospital. While the total number of cases of SCZ from this location is relatively small (N = 85 inpatients, N=13 with SCZ), the nearly nine-fold overrepresentation of SCZ cases is particularly striking (RR = 8.98, p = 2.11E-05, Table 2 and Fig. 3b).
Finally, we conducted a sensitivity analysis to explore the impact on the identified hotspots of varying the maximum allowable size of the reported cluster. The hotspots for MDD and SCZ are insensitive to the maximum scanning window size parameter as we observed nearly identical hotspots, irrespective of the parameters used (Supplementary Fig. 1). For BPD, when we increased the maximum reported cluster size above 40% of the population, Cluster 2 disappears. This is a consequence of the process by which SaTScan pre-computes the most likely hotspot centered at each grid, and then iteratively selects non-overlapping hotspots with the highest likelihood ratio; at settings above 40% the hotspot centered at Cluster 2 includes the area of Cluster 1. Since Cluster 1 has a higher likelihood ratio, Cluster 2 cannot be selected. The overall consistency of the hotspot patterns in these sensitivity analyses indicates the robustness of our findings. Our findings are also consistent when analyzed separately by gender; no new hotspots emerged, and we identify the same clusters described in Table 2.
Discussion
Increasing the equitable availability and utilization of mental healthcare services is a major international development objective. Reducing inequities in the geographic accessibility of such services is critical to achieving this objective37,38,39,40,41,42 and, in turn, depends on detailed knowledge regarding variations in the geospatial distribution of mental health disorders and elucidation of the factors responsible for such variations. Within Latin America, geospatial methodology has been used to quantify geographic accessibility to different forms of healthcare39,43, but not to mental healthcare.
Our results show the inequitable geographic accessibility of specialty mental healthcare services in the department of Caldas, Colombia; while most of its population can access services at the CSJDM with a travel time of <1 h drive, a substantial proportion resides at >4 h drive distant from the facility. Our geospatial analyses of CSJDM EHR data reveal an important impact of this inequity, as utilization of outpatient care for mood and psychotic disorders demonstrates significant distance-decay in relation to the CSJDM. This observation aligns with findings from studies in UICs1,2,3,4,5, which have shown that most people will travel only short distances to obtain such care. The Government of Colombia recently enacted a strategy of increasing the geographical accessibility of mental healthcare by embedding it within primary care clinics distributed throughout the country44,45. Future geospatial investigations, including construction of geographic accessibility maps for primary care facilities and analyses of mental health treatment data from the EHRs of these facilities, could play a valuable role in evaluating the impact of this strategy.
Factors other than geographic accessibility, however, appear to be the main determinants of inpatient treatment for mood and psychotic disorders, which, as has also been found in UICs46, does not demonstrate significant distance-decay. Instead, utilization of such intensive treatment clusters significantly around nine hotspots, widely dispersed across the department of Caldas. This finding indicates that locally important sociodemographic, environmental, or genetic factors influence geospatial variation in risk, treatment-seeking behavior, or treatment accessibility for clinically severe presentations of these disorders.
Elucidating the above factors is a priority for future research aimed at reducing inequities in mental healthcare in this region, and an important early step will be to explain the approximately five-fold overrepresentation of BPD relative to SCZ in the CSJDM EHR database. Such an extreme excess of BPD among individuals hospitalized for mental disorders, has not, to our knowledge, been observed in studies of inpatients in other countries, where BPD and SCZ usually occur at roughly equal frequencies47,48,49. This excess is particularly striking in the observed BPD hotspots; for example, the hotspot at Cluster 2 displays an incidence for this disorder which may be the highest yet reported worldwide48,49,50. Systematic investigation of each of the hotspots, incorporating several additional types of data, may identify specific factors, contributing to these patterns.
Levels of socioeconomic inequality in Latin America, overall, are among the highest worldwide51, and have been associated with extreme disparities in utilization of mental health services52. Recent studies in Brazil and Peru have used geospatial analyses to document the relationship between inequalities in factors such as education levels, income, employment, and migration and geographic variation in the frequency of mental illness parameters (suicide and depressive symptomatology, respectively)53,54. By incorporating information on sociodemographic factors that is available at a detailed level throughout Colombia (e.g., each residence has an assigned socioeconomic status indicator, from 1 [lowest] to 6 [highest], which determines utility rates39), future studies, using sources such as Colombia’s National Department of Statistics census microdata23, could provide a more comprehensive equity perspective by integrating disaggregated georeferenced population sociodemographic open data.
Among the environmental factors that may contribute to geographic variation in risks of mental disorders, exposure to violent conflict, and forced displacement associated with such exposure are particularly relevant in Colombia; its decades long history of internal armed conflict led to the forced displacement of nearly 6 million people between the 1960s and the 2000s, one of the highest rates of any country worldwide55,56. Previous studies in Colombia have linked exposure to violent conflict to poor health outcomes generally56, and increased rates of mental health disorders, specifically55,57. Georeferenced resources, such as those aggregated by the government58, can be used to test this hypothesis.
An additional possibility, which can be investigated, is that genetic variants have contributed to the generation of hotspots for mood and psychotic disorders and to the overall patterns of geographic variation in their frequencies that we observe in Caldas. The demographic history of its population suggests this possibility. The predominant population of Caldas (“The Paisa”) is an example of a genetic isolate that has expanded rapidly subsequent to a series of migration-related bottlenecks59,60, from population founders of indigenous, African, and European ancestry61. Such rapid expansion from a bottleneck creates the opportunity for even deleterious variants to attain sizable frequencies – and thereby exert a substantial effect on disease risk – in a highly localized manner62. Because Caldas is part of a larger region in which the Paisa predominate (“The Paisa region”), extension of geospatial investigations to the EHR databases of healthcare facilities in neighboring departments provides an opportunity to determine the potential impact of such variants through well-powered genetic studies.
Finally, it is important to consider the possible impact of factors that may influence treatment-seeking behavior for mood and psychotic disorders but for which relevant data are currently unavailable and may be difficult to obtain on a sufficiently large scale to permit their use in detailed geospatial analyses. For example, several studies have suggested that stigma and the fear of discrimination may be important barriers to utilization of mental healthcare, in Colombia, but have focused on outpatient treatment63, excluded individuals living with psychotic disorders17 or were based on qualitative analyses of very small samples64.
We discuss here several limitations of this study. First, for simplicity sake our geospatial analyses have focused only on patients’ clinical presentations and diagnoses at their first visit, while a major strength of the EHR database is that it enables longitudinal investigations of variations in patient trajectories; many patients experience changes in the severity of their illness (e.g., their initial visit is as an outpatient, but they subsequently have at least one inpatient admission) or in diagnosis (e.g., they initially are diagnosed with MDD but subsequently experience manic or hypomanic episodes which leads to a change in diagnosis to BPD). Future analyses that incorporate information on such changes may reveal geographic variations that differ from those reported here. Second, as the size of the inpatient population at CSJDM is smaller than that of the outpatient population, it is possible that the lack of significant distance-decay in the former group reflects a lack of power to detect modest effects; future studies that include additional catchment areas may find some degree of distance-decay for these patients. Third, our estimates of geographic accessibility of treatment are based on average travel speeds using different surfaces, and do not account for potential seasonal or daytime variations. As more granular information becomes available (e.g., through dynamic road use databases), it may be possible to incorporate such variations as has been accomplished in Colombia for geospatial investigations of tertiary emergency healthcare39. Other directions for future work include the development of more sophisticated geostatistical models that could have policy relevance. For example, SaTScan maximizes the likelihood ratio and selects the most likely hotspot sequentially rather than jointly, which may not always lead to the selection of the most precise global pattern of hospots.
It is increasingly recognized that population-level investigations in low- and middle-income countries must be a priority for mental health research – both to reduce health inequities and to increase opportunities for transformative scientific discoveries65. The work reported here extends geospatial investigation of mood and psychotic disorders to a middle-income country that has recently experienced extensive and traumatic social disruptions. Despite such disruptions, Colombia has established infrastructure, such as the CSJDM EHR database, that constitute extraordinary resources for population mental health studies. Future studies can make further use of these resources to contribute to our understanding of sociodemographic, environmental and genetic risk factors of mental illness.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Due to privacy protection concerns the local IRB committee prohibits the authors from making the dataset publicly available or available to other researchers. Source data are made available as Supplementary Data 1.
References
Jarvis, E. On the supposed increase of insanity. Am. J. Psychiat. 8, 333–364 (1852).
Faris, R. E. L., Dunham, H. W. Mental disorders in urban areas: an ecological study of schizophrenia and other psychoses (University of Chicago Press, 1939).
Joseph, A. E. & Boeckh, J. L. Locational variation in mental health care utilization dependent upon diagnosis: a Canadian example. Soc. Sci. Med. D Med. Geogr. 15, 395–404 (1981).
Packness, A. et al. Impact of socioeconomic position and distance on mental health care utilization: a nationwide Danish follow-up study. Soc. Psychiatry Psychiatr. Epidemiol. 52, 1405–1413 (2017).
Stulz, N., Pichler, E.-M., Kawohl, W. & Hepp, U. The gravitational force of mental health services: distance decay effects in a rural Swiss service area. BMC Health Serv. Res. 18, 1–13 (2018).
Selmer, H. Statistiske Meddelelser og Undersøgelser fra Sindssygeanstalten ved Aarhus i dens første 25 Aar (1852–77). (Statistical reports and investigations from the mental hospital in Aarhus during its first 25 years). Reitzel, C. A., (Copenhagen, 1879).
Hunter, J. M., Shannon, G. W. & Sambrook, S. L. Rings of madness: service areas of 19th century asylums in North America. Soc. Sci. Med. 23, 1033–1050 (1986).
Pun-Cheng, L. S. C. Distance Decay. International Encyclopedia of Geography. (John Wiley & Sons, Hoboken, 2016).
Brown, S. C. et al. Health disparities in the relationship of neighborhood greenness to mental health outcomes in 249,405 U.S. medicare beneficiaries. Int. J. Environ. Res. Public Health 15, 430 (2018).
Chen, T. K. et al. Higher depression risks in medium- than in high-density urban form across Denmark. Sci. Adv. 9, eadf3760 (2023).
Jing, F. et al. Association between immigrant concentration and mental health service utilization in the United States over time: a geospatial big data analysis. Health Place 83, 103055 (2023).
Moore, H. E. et al. Characterizing unusual spatial clusters of male mental health emergencies occurring during the First National COVID-19 “Lockdown” in the East Midlands Region, UK: a geospatial analysis of ambulance 999 data. Am. J. Mens Health 16, 15579883221097539 (2022).
Rivera, K. M. & Mollalo, A. Spatial analysis and modelling of depression relative to social vulnerability index across the United States. Geospat. Health 17, 1132–1143 (2022).
Rzhetsky, A. et al. Environmental and state-level regulatory factors affect the incidence of autism and intellectual disability. PLoS Comput. Biol. 10, e1003518 (2014).
Fan, C. C. et al. Spatial fine-mapping for gene-by-environment effects identifies risk hot spots for schizophrenia. Nat. Commun. 9, 5296 (2018).
Ministerio de Salud y Procteccion Social. Encuesta Nacional de Salud Mental 2015; available at https://www.minjusticia.gov.co/programas-co/ODC/Publicaciones/Publicaciones/CO031102015-salud_mental_tomoI.pdf (2015).
Gonzalez, L. M. et al. Determining factors in the access to mental health services by the adult colombian population. Rev. Colomb. Psiquiatr. 45, 89–95 (2016).
World Health Organization (WHO). The ICD-10 Classification of Mental and Behavioural Disorders: Diagnostic Criteria for Research. (World Health Organization, 1993).
De la Hoz, J. F. et al. Analysis of diagnosis instability in electronic health records reveals diverse disease trajectories of severe mental illness. medRxiv 2022: 2022.08.20.22279007.
OpenCage. opencage: Interface to the OpenCage API. (2018).
OpenStreetMap contributors. Planet dump https://planet.osm.org. Retrieved from https://planet.openstreetmap.org (2015).
Sorichetta, A. et al. High-resolution gridded population datasets for Latin America and the Caribbean in 2010, 2015, and 2020. Sci. Data 2, 150045 (2015).
Departamento Administrativo Nacional de Estadística (DANE). Marco Geoestadístico Nacional Versión 2012. Departamento de Caldas. https://geoportal.dane.gov.co/servicios/descarga-y-metadatos/descarga-mgn-marco-geoestadistico-nacional (2018).
Ray, N. & Ebener, S. AccessMod 3.0: computing geographic coverage and accessibility to health care services using anisotropic movement of patients. Int. J. Health Geogr. 7, 1–17 (2008).
NASA/METI/AIST/Japan Spacesystems and U.S./Japan ASTER Science Team (2019). ASTER Global Digital Elevation Model V003 [Data set]. NASA EOSDIS Land Processes DAAC. Accessed 2021-06-03 from https://doi.org/10.5067/ASTER/ASTGTM.003.
European Commission, Joint Research Centre. Global Land Cover 2000 database. https://forobs.jrc.ec.europa.eu/glc2000/products.php (2003).
Defense Mapping Agency. Digital Chart of the World. Defense Mapping Agency, Fairfax, Virginia. Four CD-ROMS available at: https://www.ngdc.noaa.gov/mgg/topo/report/s5/s5Avii.html (1992).
Palk, L., Okano, J. T., Dullie, L. & Blower, S. Travel time to health-care facilities, mode of transportation, and HIV elimination in Malawi: a geospatial modelling analysis. Lancet Glob. Health 8, e1555–e1564 (2020).
Kulldorff, M. A spatial scan statistic. Commun. Stat. Theory Methods 26, 1481–1496 (1997).
Kulldorff, M. and Information Management Services, Inc. SaTScanTM v8.0: Software for the spatial and space-time scan statistics. http://www.satscan.org/ (2009).
Han, J. et al. Using Gini coefficient to determining optimal cluster reporting sizes for spatial scan statistics. Int. J. Health Geogr. 15, 27 (2016).
Chen, J., Roth, R. E., Naito, A. T., Lengerich, E. J. & MacEachren, A. M. Geovisual analytics to enhance spatial scan statistic interpretation: an analysis of U.S. cervical cancer mortality. Int. J. Health Geogr. 7, 57 (2008).
Oliveira, F. L. P., Cançado, A. L. F., de Souza, G., Moreira, G. J. P. & Kulldorff, M. Border analysis for spatial clusters. Int. J. Health Geogr. 17, 5 (2018).
QGIS.org. QGIS Geographic Information System. QGIS Association. http://www.qgis.org (2021).
QGIS.org. QGIS 3.16. Geographic Information System User Guide. QGIS Association. Electronic document: https://docs.qgis.org/3.16/en/docs/user_manual/index.html (2021).
Jenks, G. F. Optimal Data Classification For Choropleth Maps: University of Kansas, Department of Geography Occasional Paper No. 2, Lawrence, Kansas (1977).
57th Directing Council of the Pan American Health Organization & 71st Session of the Regional Committee of WHO for the Americas. Strategy and Plan of Action to Improve Quality of Care in Health Service Delivery 2020–2025/Estrategia y plan de acción para mejorar la calidad de la atención en la prestación de servicios de salud 2020-2025 (Report no. CD57/12). available at: https://www.paho.org/en/governing-bodies/directing-council/57th-directing-council-71st-session-regional-committee-who (2019).
Cuervo, L. G., Martinez-Herrera, E., Cuervo, D. & Jaramillo, C. Improving equity using dynamic geographic accessibility data for urban health services planning. Gac. Sanit. 36, 497–499 (2022).
Cuervo, L. G. et al. Dynamic accessibility by car to tertiary care emergency services in Cali, Colombia, in 2020: cross-sectional equity analyses using travel time big data from a Google API. BMJ Open 12, e062178 (2022).
Dewidar, O. et al. Reporting of health equity considerations in equity-relevant observational studies: Protocol for a systematic assessment. F1000Res 11, 615 (2022).
Frenk, J. & White, K. L. The concept and measurement of accessibility. PAHO Scientific Publication: Pan American Health Organization 534, 842–855 (1992).
53rd Directing Council & 66th Session of the Regional Committee of WHO for the Americas. Strategy for Universal Access to Health and Universal Health Coverage (Report No. CD53/5), Rev. 21. Pan American Health Organization/World Health Organization. available at: https://www3.paho.org/hq/dmdocuments/2014/CD53-5-e.pdf (2014).
Carrasco-Escobar, G., Manrique, E., Tello-Lizarraga, K. & Miranda, J. J. Travel time to health facilities as a marker of geographical accessibility across heterogeneous land coverage in Peru. Front. Public Health 8, 498 (2020).
Ley 1616. Ley 1616 de 2013. available at: https://www.minsaludgovco/sites/rid/Lists/BibliotecaDigital/RIDE/DE/DIJ/ley-1616-del-21-de-enero-2013.pdf (2013).
Ministerio de Salud y Procteccion Social. Politica Nacional Salub Mental. available at : https://www.minsalud.gov.co/sites/rid/Lists/BibliotecaDigital/RIDE/VS/PP/politica-nacional-salud-mental.pdf (2018).
Nascimento, M. et al. No man is an island: spatial clustering and access to primary care as possible targets for the development of new community mental health approaches. BMC Health Serv. Res. 20, 344 (2020).
Service, S. K. et al. Distinct and shared contributions of diagnosis and symptom domains to cognitive performance in severe mental illness in the Paisa population: a case-control study. Lancet Psychiatry 7, 411–419 (2020).
Blader, J. C. & Carlson, G. A. Increased rates of bipolar disorder diagnoses among U.S. child, adolescent, and adult inpatients, 1996-2004. Biol. Psychiatry 62, 107–114 (2007).
Rasanen, P., Tiihonen, J. & Hakko, H. The incidence and onset-age of hospitalized bipolar affective disorder in Finland. J. Affect Disord. 48, 63–68 (1998).
Carlborg, A., Ferntoft, L., Thuresson, M. & Bodegard, J. Population study of disease burden, management, and treatment of bipolar disorder in Sweden: a retrospective observational registry study. Bipolar. Disord. 17, 76–85 (2015).
De Ferranti. D. M. Inequality in Latin America: breaking with history? (World bank publications, 2004).
Kohn, R. et al. Mental health in the Americas: an overview of the treatment gap. Rev. Panam. Salud. Publica. 42, e165 (2018).
Alarcao, A. C. et al. Suicide mortality among youth in southern Brazil: a spatiotemporal evaluation of socioeconomic vulnerability. Braz. J. Psychiatry 42, 46–53 (2020).
Ruiz-Grosso, P. et al. Spatial distribution of individuals with symptoms of depression in a periurban area in Lima: an example from Peru. Ann. Epidemiol. 26, 93–9.e2 (2016).
Tamayo-Agudelo, W. & Bell, V. Armed conflict and mental health in Colombia. BJPsych. Int. 16, 40–42 (2019).
McEniry, M., Samper-Ternent, R. & Cano-Gutierrez, C. Displacement due to armed conflict and violence in childhood and adulthood and its effects on older adult health: The case of the middle-income country of Colombia. SSM Popul. Health 7, 100369 (2019).
Bell, V., Méndez, F., Martínez, C., Palma, P. P. & Bosch, M. Characteristics of the Colombian armed conflict and the mental health of civilians living in active conflict zones. Confl. Health 6, 10 (2012).
Centro de Memoria Historica DoP, Gobierno de Colombia, Bases de datos - ¡Basta ya! Colombia: Memorias de guerra y dignidad (2012).
Bedoya, G. et al. Admixture dynamics in Hispanics: a shift in the nuclear genetic ancestry of a South American population isolate. Proc. Natl Acad. Sci. USA 103, 7234–7239 (2006).
Mooney, J. A. et al. Understanding the hidden complexity of Latin American population isolates. Am. J. Hum. Genet. 103, 707–726 (2018).
Ossa, H. et al. Outlining the ancestry landscape of Colombian admixed populations. PLoS One 11, e0164414 (2016).
Stoll, G. et al. Deletion of TOP3β, a component of FMRP-containing mRNPs, contributes to neurodevelopmental disorders. Nat. Neurosci. 16, 1228–1237 (2013).
Campo-Arias, A., Ceballos-Ospino, G. A. & Herazo, E. Barriers to access to mental health services among Colombia outpatients. Int. J. Soc. Psychiatry 66, 600–606 (2020).
Jassir Acosta, M. P. et al. Characterizing the perceived stigma towards mental health in the early implementation of an integrated services model in primary care in Colombia. A qualitative analysis. Rev. Colomb. Psiquiatr. (Engl. Ed) 50, 91–101 (2021).
Cirillo, A. et al. Population neuroscience: challenges and opportunities for psychiatric research in low- and middle-income countries. Braz. J. Psychiatry 42, 442–448 (2020).
Acknowledgements
This work was supported by R01MH113078 (C.E.B., C.L.J. and N.B.F.), R01MH123157 (L.M.O.L., C.L.J. and N.B.F.), R00MH116115 (L.M.O.L.) and R56AI152759 (S.B. and J.T.O.). We thank Anjene Musick for valuable feedback on the manuscript.
Author information
Authors and Affiliations
Contributions
Conceptualization: C.E.B., N.B.F., L.M.O.L., and S.B.; Methodology: J.S., J.T.O., S.K.S., C.E.B., N.F., L.M.O.L., and S.B.; Software: J.S., J.d.l.H., C.G., and A.A.; Formal analysis: J.S. and S.K.S.; Investigation: M.C.R., J.T.O., A.M.D.Z., C.V.U., A.A., and T.T.; Data curation: J.S., J.d.l.H., C.G., and A.A.; Writing Original Draft: J.S., N.B.F., L.M.O.L., and S.B.; Writing – review & editing: J.S., M.C.R., J.T.O., S.K.S., J.d.l.H., A.M.D.Z., C.V.U., C.G., A.A., A.V.S., T.T., C.S., R.C.G., C.E.B., J.I.E., V.I.R., C.L.J., N.B.F., L.M.O.L., and S.B.; Visualization: J.S., J.T.O., and S.B.; Supervision: C.L.J., N.B.F., L.M.O.L., and S.B.; Project Administration: C.E.B., C.L.J., N.B.F., L.M.O.L., and S.B.; Funding acquisition: C.E.B., C.L.J., L.M.O.L., and S.B.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Medicine thanks Luis Gabriel Cuervo, Winfred Dotse-Gborgbortsi, Angela Suárez and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Song, J., Ramírez, M.C., Okano, J.T. et al. Geospatial investigations in Colombia reveal variations in the distribution of mood and psychotic disorders. Commun Med 4, 26 (2024). https://doi.org/10.1038/s43856-024-00441-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s43856-024-00441-x
- Springer Nature Limited