Registered causes in vital statistics classified as garbage codes (GC) are considered indicators of quality of cause-of-death data. Our aim was to describe temporal changes in this quality in Brazil, and the leading GCs according to levels assembled for the Global Burden of Disease (GBD) study. We also assessed socioeconomic differences in the burden of different levels of GCs at a regional level.
We extracted data from the Brazilian Mortality Information System from 1996 to 2016. All three- and four-digit ICD-10 codes considered GC were selected and classified into four categories, according to the GBD study proposal. GC levels 1 and 2 are the most damaging unusable codes, or major GCs. Proportionate distribution of deaths by GC levels according selected variables were performed. Age-standardized mortality rates after correction of underreporting of deaths were calculated to investigate temporal relationships as was the linear association adjusted for completeness between GC rates in states and the Sociodemographic Index (SDI) from the GBD study, for 1996–2005 and 2006–2016. We classified Brazilian states into three classes of development by applying tertiles cutoffs in the SDI state-level estimates.
Age-standardized mortality rates due to GCs in Brazil decreased from 1996 to 2016, particularly level 1 GCs. The most important GC groups were ill-defined causes (level 1) in 1996, and pneumonia unspecified (level 4) in 2016. At state level, there was a significant inverse association between SDI and the rate of level 1–2 GCs in 1996–2005, but both SDI and completeness had a non-expected significant direct association with levels 3–4. In 2006–2016, states with higher SDIs tended to have lower rates of all types of GCs. Mortality rates due to major GCs decreased in all three SDI classes in 1996–2016, but GC levels 3–4 decreased only in the high SDI category. States classified in the low or medium SDI groups were responsible for the most important decline of major GCs.
Occurrence of major GCs are associated with socioeconomic determinants over time in Brazil. Their reduction with decreasing disparity in rates between socioeconomic groups indicates progress in reducing inequalities and strengthening cause-of-death statistics in the country.
Analyses of causes of death (COD) are fundamental for monitoring the health situation of populations in order to better inform policy decisions about priority health interventions [1, 2]. Brazil has more than 1 million deaths registered with listed causes each year in the Mortality Information System (SIM in Portuguese). All causes related to the death should be declared in the standard death certificate (DC) used. The registration of a well-defined underlying cause that triggered the chain of events that led to the death is fundamental for public health interventions, as mortality statistics are produced with a single cause for each death which allows interventions that target this basic cause. Inadequate or insufficiently specified COD statements can compromise the reliability of these statistics and their proper use by policy-makers to plan suitable interventions .
The traditional approach to assess the accuracy of mortality information is to measure completeness (percent of all deaths registered), and the proportion of total deaths coded to R codes from the ICD 10-chapter 18 of general signs and symptoms, denominated as ill-defined causes [1, 3]. Death registration completeness in Brazil is estimated to be high in recent years, more than 95% in the period 2000–2010, although with large regional differences. On the other hand, quality of cause-of-death data on registered deaths remains a problem, as the country presented high levels of ill-defined causes during the last decade . A national official intervention by the Ministry of Health (MoH) was started in 2005 to investigate ill-defined causes of death in health facilities and at home in the whole country, particularly in the poorest North and Northeast regions . Reported results indicate that this investigation has been effective .
In addition to R codes from the ICD 10-chapter 18, other ill-defined or unspecified causes that do not represent useful underlying causes from a policy perspective were analyzed in the 1990 Global Burden of Disease (GBD) Study by Murray and Lopez (1996) , who introduced the term garbage codes (GC) for those causes. In 2010, an innovative approach was proposed by Naghavi and colleagues which greatly expanded the number of codes to be considered garbage, including several ill-defined and incomplete codes for cardiovascular diseases, neoplasms, injuries, and in other ICD chapters. These causes were classified according to four groups in a proposed typology. To enhance comparability of COD data, they selected most probable target codes and proposed algorithms for GC redistribution to the more appropriate target codes . This approach was updated in the 2010, 2013, and 2015 posterior publications of Global Burden of Disease (GBD) studies.
The GBD 2016 study presented a new classification of GCs in four categories, according to the levels of the GBD cause list across which they can be redistributed . This classification was also used in the subsequent GBD 2017 study. It classifies codes that the correction of which causes the new code to fall into a different 1st level GBD cause group as level 1. Other GCs whose correction has impact at subsequent levels of cause are categorized, on the basis of the GBD cause level impacted by the change, as levels 2, 3, and 4. Level 1 and 2 GCs are considered major GCs and their fractions used in a composite metric to rate the quality of mortality data available for each country . As this new proposed classification is based on the extent of redistribution of GCs across the overall COD distribution, it actually provides an important improvement in the analysis of quality of mortality data.
In this study, we describe temporal changes in the quality of vital statistics in Brazil, and the leading GCs according to levels assembled for the Global Burden of Disease (GBD) study. Some selected variables are also reported due to their relevance for health policy-makers to facilitate adequate interventions. We also assess socioeconomic differences in the burden of different levels of GCs at a regional level.
A descriptive data analysis of GCs was conducted based on all deaths registered in the SIM from 1996 to 2016 for both sexes and all ages. There are mandatory laws for the state to register deaths at civil registry offices, and this information is compiled by the Institute of Geography and Statistics (IBGE). Since 1975, the MoH has created the SIM to complement the work of IBGE in vital statistics, with responsibility for collecting and processing mortality data across the whole country, in particular cause of death data. Municipal teams are responsible for the active recording of deaths in hospitals and collection of home deaths in civil registry offices. Standard forms of DCs constitute the source of information of this system and should be filled in by physicians in accordance with the WHO guidelines. The International Classification of Diseases (ICD) coding rules are used to select the underlying COD through an automated coding system named SCB (Seletor de Causa Básica-Underlying COD Selector) .
We selected all three- and four-digit ICD-10 codes considered GC according to the GBD 2017 study, defined as a cause that cannot be considered an underlying cause of death or is not well specified. These codes were classified into four levels , according the redistribution they generated with respect to the proposed redistribution into the GBD hierarchical cause list. Thus, level 1 GC such as the majority of ill-defined causes from chapter 18-ICD10 or septicemia could have as underlying cause an injury, a non-communicable disease, or an infectious disease, and could potentially be redistributed to any level 1 group within GBD cause hierarchy. Level 2 GCs include all codes that are redistributed to within one of these three broad groups but to different level 2 GBD cause. Examples of these GCs are injury unspecified, essential hypertension, and pulmonary edema. Level 3 and level 4 GC codes comprise those that can potentially be redistributed to different level 3 and level 4 causes of the GBD cause hierarchy, respectively. An example of a level 3 GC is cancer unspecified, which contains sufficient detail to be redistributed across all types of cancer from the level 3 GBD cause list, but not outside that group. Unspecified stroke or unspecified road injuries are examples of GC level 4. It should be highlighted that R codes, which were traditionally named ill-defined causes of death, have been broken down into level 1 and level 2 GCs in our analysis, herein named R-codes level 1 and R-codes level 2, and a few R codes were not classified as GCs (R50.2, R78.0–R78.5. R95). The complete list of ICD codes classified as GCs in the GBD 2017 study according the four levels  is presented in Supplementary file 1.
We identified the number of deaths in the SIM that have been assigned to each level per year, and the leading GCs by sex in the first and last year of the period analyzed, 1996 and 2016. Proportionate distribution of deaths by GC levels and selected variables (age, place of death, and category of certifying physician) were also calculated in the most recent year to address the effect that ill-defined codes could have on more current health policy priorities.
Division of Brazil into states and regions
Brazil has an estimated total population in 2016 of about 209 million divided into 26 states and a Federal District. Those geographies, hereafter referred to as states, are grouped into five regions:
North region: Composed of seven states (Acre, Amapá, Amazonas, Pará, Rondônia, Roraima, and Tocantins)—17.7 million inhabitants (8.6% of total population); responsible for almost half of the Brazilian territorial extension; and has the lowest population density of the country with living standards below the national average
Northeast: Composed of nine states (Alagoas, Bahia, Ceará, Maranhão, Paraíba, Pernambuco, Piauí, Rio Grande do Norte, and Sergipe)—56.9 million inhabitants (28% of the total population) with the lowest levels in almost all of the social indicators
Southeast: Composed of four states (Espírito Santo, Minas Gerais, Rio de Janeiro, and São Paulo)—86.3 million inhabitants (42% of the total population); responsible for almost 50% of the Brazilian GDP
South: Composed of three states (Paraná, Rio Grande do Sul and Santa Catarina)—29.4 million inhabitants, and higher living standards
Center West: Composed of the Federal District and three states (Goiás, Mato Grosso and Mato Grosso do Sul)—15.6 million inhabitants (6% of the total population) with intermediate living standards
Analysis of temporal changes of garbage codes in states and by level of development based on the SDI
Mortality rates of GCs were calculated after correction of underreporting of deaths in the SIM. Estimated GC numbers by level were calculated by applying the proportions of GCs in each level from the SIM database to all-cause mortality estimated by the GBD 2017 study in each state. Age-standardized mortality rates, per 100,000, were calculated by the direct method using the GBD world’s population as reference . Population data for age, sex, and states were obtained from the IHME . For visualizing the geographical distribution of GC rates between 1996 and 2016 at the state level, we aggregated GC levels 1 and 2, considered major GCs [9, 10]. Similarly, GC levels 3 and 4 were summed up.
We investigate the temporal relationship between level 1–2 and 3–4 GC rates in states by the Sociodemographic Index (SDI), a composite measure proposed in the GBD study based on the average total fertility under the age of 25 years, educational attainment in those aged 15 years or older, and lag distributed income per capita. Each indicator was scaled to a value from zero to one, and the composite SDI index is the mean of the 3 rescaled components. This index varies between zero and one, with zero representing the lowest per capita income, lowest level of education and highest total fertility rate under the age of 25 years, i.e., the location with a theoretical minimum level of development relevant to health. An SDI of 1 represents the highest per capita income, the highest level of education and lowest total fertility under the age of 25 years, this location having a theoretical maximum level of development .
The intervention of the MoH implemented in Brazilian states in 2005—the investigation of GC level 1 R-codes, responsible for a higher proportion of GCs in the country (14.3% out of 946,960 deaths recorded at SIM in 2000), impacted on reducing those codes and improved the quality of cause of death data . To control for the effect of this confounding factor in time series analysis of socioeconomic differences, we considered two separate periods in this analysis, 1996–2005 and 2006–2016. Linear regression coefficients were separately modeled in these two periods to examine the linear association between GC mortality rates of each type and SDI and how this association changed after the intervention. Although we controlled for the influence of completeness using mortality rates adjusted for undercount of deaths, we also examined whether this variable was associated with the occurrence of GC levels 1–2 and 3–4 in Brazilian states for each period analyzed.
To better visualize the effect of differences developmental on level 1–2 and 3–4 GC rates at a regional level during the whole analyzed period, we classified states into tertiles of SDI state-level estimates for the first year of analysis, 1996, as follows: (1) low SDI, states with SDI values lower than 0.460—Acre, Alagoas, Bahia, Ceará, Maranhão, Paraiba, Pará, Piauí, and Tocantins; (2) medium SDI, states with SDI values varying from 0.460 to 0.540—Amapá, Amazonas, Goiás, Mato Grosso do Sul, Pernambuco, Rio Grande do Norte, Rondônia, Roraima, and Sergipe; and (3) high SDI: states with SDI values higher than 0.540—Distrito Federal, Espírito Santo, Mato Grosso, Minas Gerais, Paraná, Rio Grande do Sul, Rio de Janeiro, Santa Catarina, and São Paulo.
States in the North and Northeast regions were classified in either the low or middle SDI groups, while all states pertaining to the South or Southeast regions were in the high SDI category.
Age-standardized mortality rates due to deaths coded to GCs registered as COD in Brazil decreased from 417.3 per 100,000 inhabitants in 1996 to 261.3 in 2016. This decrease occurred particularly in the GC level 1 group, which has been the most important group of GCs from 1996 to 2005, but was overtaken by GC level 4 in the latter years (Fig. 1).
Table 1 presents the proportional mortality distribution by sex according to levels and the leading GC groups in the country, in 1996 and 2016. In 1996, 45.3% and 52.5% of registered deaths were classified as GCs in males and females, respectively. These proportions decreased to 36.0% and 42.2% of all deaths in 2016, and the proportions of deaths coded to level 1 GCs dropped in those years from 20.7 to 12.5% in males, and from 25.1 to 14.0% in females. Although GCs were still coded for approximately 507,000 deaths (both sexes) in 2016, levels 3 and 4 were more frequent in that year.
The top five ICD-10 grouping codes of COD classified as GCs in males and females in 2016 were (a) pneumonia unspecified (level 4 classification in the GBD study), (b) stroke not specified as hemorrhage or infarction (level 4), (c) ill-defined causes (level 1), (d) diabetes unspecified (level 4), and (e) heart failure (level 1). Proportions of the majority of leading GC groups showed in Table 1 decreased or maintained relatively stable in 2016 compared to 1996. R-codes level 1 decreased considerably over the same period in males and females. On the other hand, proportions of the level 4 GCs acute lower respiratory infections and unspecified diabetes and the level 2 GC essential hypertension all increased 1 percentage point or more in 2016.
Table 2 shows the percentage of GC and non-GC coded deaths in both sexes combined by age, place of death, and certifying physician in 2016. The age groups with the highest proportion of GCs were the elderly—over 60 years of age, and especially over 80 years (50%), followed by children aged 1–9 years (greater than 34%). While among those under 50 years old GC level 1 predominate, among the elderly (over 60 years) GC level 1 is exceeded by GC level 4. Concerning the place of death, more than 70% of total deaths in the country in 2016 occurred in hospitals. Garbage codes were present as the underlying cause of death in 46% of home deaths and in 38% of hospital deaths. Higher proportions of major GC levels 1–2 were present in deaths occurring at home. On the other hand, level 4 GCs were more frequent for in-hospital deaths. As expected, DCs signed by coroners from forensic institutes and physicians from the autopsy service for natural causes (herein named SVO physicians), had lower proportions of GCs, although more than 35% of these deaths were declared as being due to GCs. GC fractions from the attending and substitute physicians were almost 40%, with slight differences between them.
Supplementary file 2 presents the previous variables in males and in females. A higher percentage of GC levels 1 and 4 in women than in men for deaths occurring on streets and certified by coroners is probably due to the observed higher mortality due to injuries in men and better certification by forensic institute physicians. Deaths in women occurring on streets or certified by forensic institutes have higher probability of so-called natural causes and were certified as with garbage codes. The higher proportion of GCs in the age group of 15 to 29 years in women might be related to this hypothesis.
Figure 2 shows age-standardized mortality rates for deaths coded to garbage codes in Brazilian states, in 1996 and 2016. All Brazilian states had lower rates of GC levels 1–2 in 2016 compared to 1996. Interestingly, for GC levels 3–4, several states from the North and Northeast regions presented higher rates in 2016. On the other hand, all states from the South and Southeast presented lower GC levels 3–4 rates in 2016.
Figures 3 and 4 show the relationship between levels of SDI and rates of GCs coded into two groups, levels 1–2 and 3–4, for all states by year in two separate periods: 1996–2005 and 2006–2016. In the first period, there was an inverse relationship between rates of GC levels 1–2 and SDI, i.e., age-standardized rates of GC levels 1–2 decreased as the SDI increased for states. On the other hand, for GC levels 3–4, we found lower rates in less developed states. In 2006–2016, in contrast, the relationship between GCs and SDIs was inverse for both types of GCs.
We also evaluated the relationship between SDI and rate of deaths coded to GCs with linear regression models presented in Table 3 controlling for completeness. In 1996–2005, there was a significant inverse association between rates of GC levels 1–2 and SDI, and a significant direct association for levels 3–4, indicating that less developed states had higher rates of GC levels 1–2 and lower rates of GC levels 3–4. In 2006–2016, the coefficients were negative for the two GC groups, indicating that states with higher SDIs tend to have lower rates of all types of GCs. It is interesting to note here that although we had adjusted the GC rates for completeness, there was a significant direct association between completeness and rates of GC levels 3–4 in 1996–2005, i.e. states with lower completeness had lower rates of GC levels 3–4.
Figure 5 presents temporal trends of rates of GCs in states grouped according the SDI in the initial analyzed year of 1996 into high (first tertile), medium (second tertile), and low (third tertile) level of development. Mortality rates due to GC levels 1–2 decreased over time in all SDI tertiles (Fig. 5—top panel), with the lowest SDI group having a sharp decline after 2004. On the other hand, for GC levels 3–4, this tertile had increased rates from 1996 to 2005, and decreasing rates occurred only in the higher developed states. All rates have been similar across tertiles in more recent years (Fig. 5—bottom panel). States in the highest tertile of SDI have had high completeness since 1996. On the other hand, states in the lowest tertile of SDI had completeness of death counts in the SIM of about 65% in 1996 (Supplementary file 3).
Estimated probabilities of having a GC as the cause of death decreased 40.4% from 1996 to 2016 in Brazil and the decrease was particularly important for the level 1 codes (58.4% over the period). The sharp decline presented by this group may be related to the investigation in hospitals and at home of an important part of GCs from this group, the R-codes, which has been performed by the Ministry of Health (MoH) since 2005 [5, 14]. States classified in low or medium SDI tertiles were responsible for the most important decline. Greater improvement in Brazil´s less developed states might be related to investments in the public health care system . Although the relationship between SDI and rates was maintained in 2006–2016, rates of level 1–2 GCs in the three different state groups converged over time, indicating reduced inequalities in data quality due to major GCs. This reduction may have been related to the intervention of the MoH focused on poorer areas, and therefore leading to a reduction in those codes .
The decrease of major GCs occurred in almost all Brazilian states, and this encouraging result also indicates progress in strengthening cause-of-death statistics in the country over the past 20 years. Our findings are supported by the results of the classification system developed for the GBD2016 study—the GBD data quality rating. The calculation of this rating is based on the following variables: completeness of death registration, proportions of deaths not coded to GC levels 1 or 2, and fraction of deaths assigned to detailed GBD causes. The score for Brazil improved from 68.9 in 1995–1999 to 82.5 in 2010–2017, although it remained as a 4-star country in this 1 to 5 star classification. In the latter period, some high-income countries like Germany (score = 84.0), the Netherlands (83.3), and Japan (81.3) were also classified with 4 stars. On the other hand, in South America, Chile and Colombia were classified with 5 stars, but Uruguay, Paraguay, and Argentina were also classified with 4 stars .
Although with a decreasing trend, GCs still represent an important percentage of total deaths in the country, influencing the quality of mortality information. Despite the improvement in cause of death quality, R-codes and other ill-defined causes such as heart failure or septicemia, which contain no information of the underlying cause, remain frequent causes. High proportions of deaths due to these causes indicate not only worse quality of cause of death statistics, but also lack of medical care, especially among the economically disadvantaged population . Interesting, proportions of pneumonia unspecified increased in 2016 for men and women, probably related to the observed increase over time in age-specific mortality rates from specific infections of the lower respiratory tract in the population aged 70 years or over .
The highest proportion of GCs occurred among elderly people, especially those over 80 years (49.6%) and were due mainly to level 4 GCs. Higher proportions of ill-defined causes among the elderly are probably due to the higher number of comorbidities, such as neoplasms, hypertension, diabetes, and other cardiovascular diseases, making it difficult to provide information on the underlying cause of death during the completion of the DC [18,19,20], even after the use of verbal autopsies . On the other hand, children aged 1–14 years also had a high fraction of major GCs, more than 20%. High proportions of GCs in this age group, although in accordance with the results of Naghavi et al.  in other settings, are of concern as it is very important for prevention to correctly identify the leading causes of premature deaths.
Regarding the place of death, GCs were registered as the underlying cause of death in almost 50% of home deaths and in more than 38% of those which occurred in a hospital. Although previous studies in some Brazilian cities showed that ill-defined R codes were negatively associated with hospital deaths [22, 23], the percentage found for in-hospital deaths in this study can be considered relatively high. In hospital settings diagnostic procedures are more available, and we expected that better defined CODs would be declared much more frequently.
The highest proportion of GCs in home deaths may be due to difficulties of diagnosing or a lack of knowledge of the diagnosis by the physician. This can be explained by the fact that the certification of a home death is more difficult than deaths occurring inside hospitals due to the difficulty of defining a diagnosis for reasons already mentioned. The majority of home deaths in Brazil had medical assistance before death . This finding reinforces the importance of continuing the investigation of household deaths until a culture valorizing the importance of the correct DC information is establishing among physicians.
Proportions of deaths coded with GCs were similar among categories of certifying physicians, although we had originally expected a smaller proportion to be present when attested by the attending physician, and by those from SVO and forensic institutes. But for major GCs, the proportion of GC level 2 in deaths certified by forensic institute physicians was higher than that found for other physician categories. These findings probably result from deaths from injuries whose intent remained undetermined despite being certified by a coroner.
In order to investigate the impact of socioeconomic conditions on rates of different types of GCs over time, we have presented three sets of calculations at the state level. First, we presented age-standardized rates by SDI in two periods, 1996–2005 and 2006–2016. These results indicate that while there has been a decline of GC levels 1–2 in both periods, mortality rates coded to GC levels 3–4 increased as SDI increased across states and time during 1996–2005 but decreased as SDI increased during 2006–2016. Second, we presented linear regression models in two separate periods for analyzing coefficients of SDIs and completeness related to GCs. The inclusion of completeness was intended to address whether this variable impacted on GC occurrence, as there is evidence of an association of improvements in completeness with declining percentages of deaths being reported as ill-defined . Finally, extended temporal analysis of states classified in tertiles of development according to state of residence ranked by estimated SDIs in 1996 indicated substantial differences in performance between low and high SDI tertile states in 1996–2005, with the gap closing substantially in the 2006–2016 period.
Our results of an inverse association between levels of SDI and rates of levels 1–2 GCs in both periods are consistent with recent publications focused on ill-defined causes of death. Higher proportions of R-codes from chapter 18-ICD10 are more frequent among people who had a lower educational level or less access to health services [25,26,27]. On the other hand, the unexpected significant direct relationship between SDI and mortality rates due to GC levels 3–4 in 1996–2005 constitutes the most striking finding from our study. To our knowledge, this finding was not reported before in the literature. We also observed a significant direct relationship between completeness and proportions of GC levels 3–4 in the first period, i.e., these GCs were higher when completeness increased.
The GBD 2017 study aggregates locations into 5 SDI groupings generated by applying quintile cutoffs from the distribution of national-level SDI for countries with populations greater than 1 million in 2017 to estimates of SDI for all GBD locations. Brazilian states were classified into 3 of those groupings: low-middle SDI, middle SDI and high-middle SDI . We adapted this procedure by applying tertiles cutoffs to estimates of all state-level SDI in Brazil in 1996 to better contextualize the Brazilian reality. When analyzing temporal changes of GC rates across those tertiles of SDI, we observed that the increase of rates of GC levels 3–4 were concentrated in the low SDI tertile states in 1996–2005, and this slight increase occurred simultaneously with the increase in completeness in this group of states, and greater decrease of levels 1–2. It should be due to expanded primary care, with better knowledge of patient problems prior to the admission that led to death .
We suppose that states with better completeness and socioeconomic standards have more hospitals, and physicians were able to fill out the DC with GC levels 3–4 such as pneumonia, diabetes, or stroke unspecified, instead of an R code from level 1. The growing prevalence in poorer areas of some of these GCs such as diabetes of unspecified type, with higher burden in the North and Northeast regions , could also have had contributed. Another hypothesis is related to the correction of undercount of deaths in this study, as this adjustment was based on the assumption of a similar COD distribution in deaths recorded and not recorded in SIM, which is probably not correct for some causes. Different distribution of causes among registered and missing deaths was observed in Minas Gerais State, Brazil, using the verbal autopsy technique . Further detailed investigation of this aspect can help us to identify specific issues of this subject.
Overall, the predominance of level 4 GCs in the country in recent years is encouraging, as these causes have less impact in public health. According to Anaconda, a tool for checking the quality of mortality data, GC levels 1–3 are classified as higher severity unusable causes, given their higher risk of misguiding public policies to avoid premature deaths. Level 4 codes are not included in the broad category of unusable causes by this classification. Furthermore, level 4 GCs include codes that may require access to greater diagnostic sophistication and considerable equipment to accurately determine the COD .
This study has two major limitations. First, GC mortality is corrected for under-registration using the estimates of all-cause mortality from the GBD study, which can be imprecise . Furthermore, as was previously stated, the correction for cause-specific rates was based on the assumption of a similar COD distribution in deaths recorded and not recorded in the SIM, which may not be the case . Secondly, to analyze socioeconomic differences at the regional level, we used the sociodemographic index (SDI), a composite indicator proposed in more recent GBD studies, as a measure of socioeconomic deprivation. We chose the SDI over other deprivation measures as it is frequently used in various studies as an indicator of the socioeconomic level of countries and subnational divisions and uses similar component indicators for countries and states , which facilitates comparisons in the global context. Nevertheless, there are no set criteria for judging the quality of estimates of measures of deprivation, and hence it is difficult to assess whether the SDI is the best indicator. However, it summarizes three important dimensions of human development and is calculated in years from 1990 to 2017, according to methodological adjustments made for the GBD 2017 study .
The analysis plan developed here does not contemplate the potentially important socioeconomic differences by sex in completeness and trends concerning the quality of cause-of-death registry in Brazil, which therefore may also be considered a limitation in this study. Our objective, however, was only to visualize the socioeconomic differences without control for confounding factors such as gender and access to health services. Additionally, if we were to stratify the analysis by sex, we would end up producing a considerably more detailed analysis, longer and more complex, and thus turning more difficult the task of understanding the principal objective, the description of the burden of GCs in Brazil.
In summary, this study provides a more accurate understanding of trends and magnitude of mortality coded to garbage codes according to levels and the ranking of specific groups. This information is of importance for health policy in Brazil as the proportions found in 2016 were still large enough to impact regional inequalities in well-defined causes of death without a redistribution of GCs. Although the GBD study redistributes those GCs among specific causes, this redistribution is based on statistical methods and has some limitations [10, 31]. As better correction is provided by empirical data , this study might also contribute to the recent effort made by the Ministry of Health who in 2017 implemented important interventions in 60 municipalities throughout the country, as part of the Bloomberg Data for Health Initiative, in which hospital deaths with garbage codes were investigated extracting information from hospital records using standardized questionnaires, which then were reviewed by physicians in order to identify the underlying or specific causes. This initiative, reinforced by adequate physician training on the filling in of the DC, has significantly improved data from the SIM . So, it is imperative to have a comprehensive knowledge of GC distribution in the country, therefore permitting further advances in this intervention and the subsequent empirical corrections.
There has been a decreasing burden of deaths coded to GCs in Brazil. Analysis differentiated by types of GCs and level of development shows different temporal trends and substantial inequalities within the Brazilian population in 1996–2005. Decreasing inequalities in mortality coded to GCs over the period 2006–2016 and the predominance of less important GC level 4 in the country in 2016 indicate progress in strengthening cause-of-death statistics in Brazil over the past 20 years.
Availability of data and materials
The dataset used and analyzed in the study is available from the Brazilian Ministry of Health (www.datasus.gov.br).
Cause of death
Global Burden of Disease
International Classification of Disease and Related Health Problems
Institute for Health Metrics and Evaluation
Mortality Information System [Sistema de Informações sobre Mortalidade]
Death Investigation Service [Serviço de Verificação de Óbitos]
World Health Organization
Mathers CD, Fat DM, Inoue M, Rao C, Lopez AD. Counting the dead and what they died from: an assessment of the global status of cause of death data. Bull World Health Organ. 2005;83:171–7.
Phillips DE, Lozano R, Naghavi M, Atkinson C, Gonzalez-Medina D, Mikkelsen L, Murray CJ, Lopez AD. A composite metric for assessing data on mortality and causes of death: the vital statistics performance index. Popul Health Metr. 2014;12:14. https://doi.org/10.1186/1478-7954-12-14.
Laurenti R, Mello Jorge MHP, Gotlieb SL. The accuracy of the official mortality and morbidity statistics related to chronic non-communicable diseases. Ciência & Saúde Coletiva. 2004;9(4):909–21.
Lima EEC, Queiroz BL. Evolution of the deaths registry system in Brazil: associations with changes in the mortality profile, under-registration of death counts, and ill-defined causes of death. Cad Saude Publica. 2014;30(8):1721–30. https://doi.org/10.1590/0102-311X00131113.
Souza MFM, Barea V, Williams D. Improving the mortality information in poor areas: the Brazilian experience. In: Annual Meeting of the WHO-FIC Network, Oct 28- Nov 3; Stazione Marittima,Trieste, Italy. Geneve: WHO; 2007. https://www.who.int/classifications/network/meetings/en/.
Teixeira R, Naghavi M, Guimarães MDC, Ishitani LH, França E. Quality of cause-of-death data in Brazil: garbage codes among registered deaths in 2000 and 2015. Revista Brasileira de Epidemiologia. 2019;22(Suppl. 3):e19002. https://doi.org/10.1590/1980-549720190002.supl.3.
Murray CJL, Lopez AD. Estimating causes of death: new methods and global and regional applications for 1990. In: CJL M, Lopez AD, editors. The global burden of disease. Boston: Harvard School of Public Health; 1996. p. 118–200.
Naghavi M, Makela S, Foreman K, O'Brien J, Pourmalek F, Lozano R. Algorithms for enhancing public health utility of national causes-of-death data. Popul Health Metrics. 2010;8:9. https://doi.org/10.1186/1478-7954-8-9.
GBD 2016 Causes of Death Collaborators. Global, regional, and national age-sex specific mortality for 264 causes of death, 1980–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet. 2017;390:1151–210. https://doi.org/10.1016/S0140-6736(17)32152-9.
GBD 2017 Causes of Death Collaborators. Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2018;392:1736–88. https://doi.org/10.1016/S0140-6736(18)32203-7.
França E, Abreu DMX, Rao C, Lopez AD. Evaluation of cause-of-death statistics for Brazil, 2002-2004. Int J Epidemiol. 2008;37(4):891–901. Disponível em. https://doi.org/10.1093/ije/dyn121.
GBD 2017 Causes of Death Collaborators. Supplementary appendix 1. Supplement to: Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980-2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2018;392:1736–88.
Global Burden of Disease Collaborative Network. Global Burden of Disease Study 2017 (GBD 2017) Population estimates 1950-2017. Seattle, United States: Institute for Health Metrics and Evaluation (IHME); 2018. http://ghdx.healthdata.org/record/ihme-data/gbd-2017-population-estimates-1950-2017 . Accessed Jan 2019.
Brasil. Ministério da Saúde, Secretaria de Vigilância em Saúde, Departamento de Análise de Situação em Saúde. Manual para investigação do óbito com causa mal definida [Investigation handbook for ill-defined death causes]. Brasília: Ministério da Saúde; 2009. p. 48. il. – (Série A. Normas e Manuais Técnicos).
Albuquerque MV, Viana ALA, Lima LD, Ferreira MP, Fusaro ER, Iozzi FL. Regional health inequalities: changes observed in Brazil from 2000-2016. Ciência & Saúde Coletiva. 2017;22(4):1055–64. https://doi.org/10.1590/1413-81232017224.26862016.
Martins Junior DF, Costa TM, Lordelo MS, FelzeMburg RDM. Tendência dos óbitos por causas mal definidas na região Nordeste do Brasil, 1979-2009. Rev Assoc Med Bras. 2011;57(3):338–46. https://doi.org/10.1590/S0104-42302011000300019.
Corrêa RA, São José BP, Malta DC, Passos VMA, França EF, Teixeira RA, Camargos PAM. Burden of disease by lower respiratory tract infections in Brazil: estimates of the Global Burden of Disease 2015 study. Rev Bras Epidemiol. 2017;20(Suppl 1):171–81.
Mello Jorge MHP, Laurenti R, Lima-Costa MF, Gotlieb SLD, Chiavegatto Filho ADP. A mortalidade de idosos no Brasil: a questão das causas mal definidas. Epidemiol. Serv. Saúde. 2008;17(4):271–81. https://doi.org/10.5123/S1679-49742008000400004.
Kanso S, Romero DE, Leite IC, Moraes EN. Diferenciais geográficos, socioeconômicos e demográficos da qualidade da informação da causa básica de morte dos idosos no Brasil. Cad. Saúde Pública. 2011;27(7):1323–39. https://doi.org/10.1590/S0102-311X2011000700008.
Martins Junior DF, Felzemburg RDM, Dias AB, Costa TM, Santos PNP. Trends in mortality from ill-defined causes among the elderly in Brazil, 1979-2013: ecological study. Sao Paulo Med J. 2016;134(5):437–45. https://doi.org/10.1590/1516-3180.2016.0070010616.
Ngo AD, Rao C, Hoa NP, Adair T, Chuc NTK. Mortality patterns in Vietnam, 2006: Findings from a national verbal autopsy survey. BMC Research Notes. 2010;3:78. https://doi.org/10.1186/1756-0500-3-78.
Lima-Costa MF, Matos DL, Laurenti R, Mello-Jorge MHP, Cesar CC. Time trends and predictors of mortality from ill-defined causes in old age: 9 year follow-up of the Bambuí cohort study (Brazil). Cad. Saúde Pública. 2010;26(3):514–22. https://doi.org/10.1590/S0102-311X2010000300009.
Abreu DMX, Sakurai E, Campos LN. A evolução da mortalidade por causas mal definidas na população idosa em quatro capitais brasileiras, 1996-2007. R. bras. Est. Pop. 2010;27(1):75–88. https://doi.org/10.1590/S0102-30982010000100006.
França E, Cunha CC, Vasconcelos AMN, Escalante JC, Abreu DMX, Lima RB, Morais Neto OL. Investigation of ill-defined causes of death: assessment of a program’s performance in a State from the Northeastern region of Brazil. Rev Bras Epidemiol. 2014;17(1):119–34. https://doi.org/10.1590/1415-790X201400010010.
Kulhánová I, Menvielle G, Bopp M, Borrell C, Deboosere P, Eikemo TA, Hoffmann R, Leinsalu M, Martikainen P, Regidor E, Rodríguez-Sanz M, Rychtaříková J, Wojtyniak B, Mackenbach JP. Socioeconomic differences in the use of ill-defined causes of death in 16 European countries. BMC Public Health. 2014;14:1295. https://doi.org/10.1186/1471-2458-14-1295.
Rodrigues NCP, Daumas RP, Almeida AS, O'Dwyer G, Andrade MKN, Flynn MB, Lino VTS. Risk factors for the ill-defined causes of death in the Brazilian states: a multilevel analysis. Cien Saude Colet. 2018;23(11):3979–88. https://doi.org/10.1590/1413-812320182311.27182016.
Rasella D, Aquino R, Barreto ML. Impact of the family health program on the quality of vital information and reduction of child unattended deaths in Brazil: an ecological longitudinal study. BMC Public Health. 2010;10:380. https://doi.org/10.1186/1471-2458-10-380.
Duncan BB, França EF, Passos VMA, Cousin E, Ishitani LH, Malta DC, Naghavi M, Mooney M, Schmidt MI. The burden of diabetes and hyperglycemia in Brazil and its states: findings from the Global Burden of Disease Study 2015. Rev. bras. epidemiol. 2017;20(supl.1):90–101. https://doi.org/10.1590/1980-5497201700050008.
Campos D, França E, Loshi RH, Souza MFM. Uso da autópsia verbal na investigação de óbitos com causa mal definida em Minas Gerais. Brasil. Cadernos de Saúde Pública. 2010;26(6):1221–33.
Mikkelsen L, Richards N, Lopez AD. Redefining ‘garbage codes’ for public health policy: Report on the expert group meeting, 27-28 February 2017. CRVS best practice and advocacy. Melbourne, Australia: Bloomberg Philanthropies Data for Health Initiative, Civil Registration and Vital Statistics Improvement, University of Melbourne; 2019. Available in: https://crvsgateway.info/file/14581/276.
França EB, Passos VMA, Malta DC, Duncan BB, Ribeiro ALP, Guimarães MDC, et al. Cause-specific mortality for 249 causes in Brazil and states during 1990-2015: a systematic analysis for the global burden of disease study 2015. Popul Health Metr. 2017;15(1):39. https://doi.org/10.1186/s12963-017-0156-y.
Omar A, Ganapathy SS, Anuar MFM, Khoo YY, Jeevananthan C, Awaluddin SM, Yn JLM, Rao C. Cause-specific mortality estimates for Malaysia in 2013: results from a national sample verification study using medical record review and verbal autopsy. BMC Public Health. 2019;19:110.
Souza MFM, França EB, Teixeira RA, Ishitani LH, Cunha CC, Santos MR, Frederes A, Cortez-Escalante JJ, Abreu DMX. Iniciativa Dados para a Saúde: impacto na melhoria da qualidade da informação sobre causas de óbito no Brasil. Revista Brasileira de Epidemiologia. 2019;22(Suppl. 3):e19005. https://doi.org/10.1590/1980-549720190005.supl.3.
About this supplement
This article has been published as part of Population Health Metrics, Volume 18 Supplement 1 2020: The GBD Brazil Network. The full contents of the supplement are available at https://pophealthmetrics.biomedcentral.com/articles/supplements/volume-18-supplement-1.
This work was supported by the National Health Fund/Brazilian Ministry of Health (TED 125/2017). Publication costs were also funded by the National Health Fund/Brazilian Ministry of Health. BBD was supported in part by CNPq (Bolsa de produtividade em pesquisa, 304467/2015-4) and IATS (465518/2014-1).
Ethics approval and consent to participate
The study GBD Brazil was approved by the Research Ethics Committee of the Universidade Federal de Minas Gerais, Project CAAE—62803316.7.0000.5149.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary file 1:
Title: List of garbage codes mapped to the Global Burden of Disease 2017 cause list. Description: The source is: GBD 2017 Causes of Death Collaborators. Supplementary appendix 1. Supplement to: Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2018; 392:1736–88.
Supplementary file 2:
Title: Deaths classified as garbage codes according to selected variables in males and females. Brazil, 2016. Description: It shows the percentage of GC and non-GC coded deaths in males and females by age, place of death and certifying physician in 2016.
Supplementary file 3:
Title: Completeness according to SDI tertiles of Brazilian states from 1996 to 2016.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
França, E., Ishitani, L.H., Teixeira, R. et al. Changes in the quality of cause-of-death statistics in Brazil: garbage codes among registered deaths in 1996–2016. Popul Health Metrics 18 (Suppl 1), 20 (2020). https://doi.org/10.1186/s12963-020-00221-4
- Cause of death
- Data quality
- Vital statistics