Introduction

The quality of statistics on causes of death (CoD) does not present consolidated indicators in papers nor in official health or statistical publications [1,2,3,4,5]. The quality of CoD statistics is assessed with several approaches and limited continuity. Only the great group of symptoms, signs, and ill-defined conditions of the International Classification of Diseases has achieved a generalised use [6,7,8,9]. This great quality group was often scored in wide ranges across nations when publicised, to suggest to take action [10]. The quality of statistics literature has also considered the impossible CoD, incompatible CoD to sex and age, and a variety of unspecific CoD; all referred as garbage medical causes [11,12,13,14,15,16,17,18].

Some collaborations have improved the reliability of ill-defined and garbage CoD by applying informatics algorithms to minimize the garbage codes in different ways: a) replacing the underlying CoD (which is the cause used in mortality statistics) with other causes filled out on the same death certificate; b) making expert data redistributions, or applying regression statistic models; or c) matching cases from health administrative databases for CoD extraction [11, 13, 15, 19,20,21,22].

Quality of death statistics have varied among countries for centuries. In the mid-eighteenth century, a little later than in North and Western European countries, Spain started to publish official vital statistics [23]. Recently, with the restoration of the Spanish democracy in 1978 and the decentralisation in the regions, a renewed boost was given to the improvement of civil registration and vital statistics to Western European standards [24]. Although the mortality rate decrease had begun time before, the amount of ill-defined causes of death has gradually decreasing throughout the new democratic time in Spain, to achieve, nowadays, an intermediate international position [10, 12].

The poor quality of CoD statistics involved a double miscertification issue concerning validity and reliability. Several validity studies showed a lack of uniformity across great and leading CoD, in Western context [25,26,27,28,29].

Our objective was to assess the national and regional quality of causes of death by analysing the reliability of the official mortality statistics over the years 1980–2019 across Spain.

Methods

We implemented an observational descriptive epidemiological design over annual mortality cases by the administrative territorial division of Spain in 18 regions (named Autonomous Cities and Communities) (Supplementary Figure 1 Map). Every region supports the National Institute of Statistics (INE acronym in Spanish) by coding the CoD, except the cities Ceuta and Melilla where coding is carried out by the INE itself. We assessed reliability by 3 indicators. We grouped the underlying causes of death (CoD) into unspecific and ill-defined quality categories from the 9th and 10th revisions of the International Classification of Diseases. This assessment was performed by experts (three coding nurses, and epidemiologist and forensic physicians) and by taking into account the literature (Supplementary Annex A) [6, 12, 15]. Territorial case counts and populations of INE data are publicly available by an informed request (https://www.ine.es/infoine/?L=1). Time was tabulated in 4 decades (1980-, 1990-, 2000-, and 2010–2019). We estimated age-adjusted rates of quality categories for 100,000 inhabitants by using the direct method and the European Standard Population (WHO, 1976), as one indicator, and the weights {wa: a = 1,2,…,19} appear as Supplementary Table 1. For the second indicator, we presented the age-adjusted rates ratios (proportions, in percentage) of the quality category divided by its region all death causes. Both indicators, by decade and sex.

Each regional mortality rate was contrasted with the Spanish median of each decade by means of the Comparative Mortality Ratio (CMR) in a Bayesian framework, as the third indicator. The estimation was implemented as the number of deaths by region (\({\text{r}}=1, 2, \dots , 18\)) and age group (\({\text{a}}=1 , 2, \dots , 19\)), and it was modelled as a Poisson random variable \({{\text{d}}}_{{\text{ra}}} \sim \mathrm{ Poisson}({\upmu }_{{\text{ra}}})\), with the Jeffreys’s prior distribution [30\(\uppi ({\upmu }_{{\text{ra}}})\propto {{\upmu }_{{\text{ra}}}}^{-1/2}\). Again, we have used the European Standard Population for the age-adjusted rates. The mortality rate by region and age group is \({\uplambda }_{{\text{ra}}}={\upmu }_{{\text{ra}}}/{{\text{P}}}_{{\text{ra}}}\), where \({{\text{P}}}_{{\text{ra}}}\) is the population in the region \({\text{r}}\) and age group \({\text{a}}\). The mortality rate in Spain by age group is \({\uplambda }_{{\text{a}}}={\upmu }_{{\text{a}}}/{{\text{P}}}_{{\text{a}}}\), where \({\upmu }_{{\text{a}}}={\sum }_{{\text{r}}}{\upmu }_{{\text{ra}}}\) and \({{\text{P}}}_{{\text{a}}}={\sum }_{{\text{r}}}{{\text{P}}}_{{\text{ra}}}\). For each region, the adjusted rate is given by \(\sum_{{\text{a}}}{{\text{w}}}_{{\text{a}}}{\uplambda }_{{\text{ra}}}\), and the comparative mortality rates ratio is the parameter:

$${{\text{CMR}}}_{{\text{r}}}=\frac{\sum_{{\text{a}}}{{\text{w}}}_{{\text{a}}}{\uplambda }_{{\text{ra}}}}{\sum_{{\text{a}}}{{\text{w}}}_{{\text{a}}}{\uplambda }_{{\text{a}}}}$$

The posterior distribution of each \({\upmu }_{{\text{ra}}}\) is a gamma distribution. The posterior distribution of the adjusted rate and of \({{\text{CMR}}}_{{\text{r}}}\) were obtained using the Monte Carlo method, generating \({\upmu }_{{\text{ra}}}^{{\text{i}}} \sim \mathrm{ Gamma}({{\text{d}}}_{{\text{ra}}}+1/2, 1)\) and computing \({\uplambda }_{{\text{ra}}}^{{\text{i}}}={\upmu }_{{\text{ra}}}^{{\text{i}}}/{{\text{P}}}_{{\text{ra}}}\), \({\uplambda }_{{\text{a}}}^{{\text{i}}}={\upmu }_{{\text{a}}}^{{\text{i}}}/{{\text{P}}}_{{\text{a}}}\), and \({{\text{CMR}}}_{{\text{r}}}^{{\text{i}}}=\sum_{{\text{a}}}{{\text{w}}}_{{\text{a}}}{\uplambda }_{{\text{ra}}}^{{\text{i}}}/\sum_{{\text{a}}}{{\text{w}}}_{{\text{a}}}{\uplambda }_{{\text{a}}}^{{\text{i}}}\), \({\text{i}}=1, \dots , 10 000\). The posterior estimation was performed using the median as the point estimator, and the 95% credible intervals (95% CrI) were the intervals from 0.025th to 0.975th quantile of the posterior distribution. The \({{\text{CMR}}}_{{\text{r}}}\) was considered statistically significant if the 95% CrI did not contain the value 1, to assess excess or defect on mortality. The analysis was performed using R version 3.6.1. The CMR function and the database are available on request.

Results

Spain by decade, sex, and quality category in rates indicator

All-cause deaths have decreased in in both sexes Spain as a whole over the years 1980 to 2019 (age-adjusted rates of 1 543 per 100,000 inhabitants in the first decade 1980–1989 to 867 per 100,000 inhabitants in the last decade of 2010–2019). The ill-defined CoD have declined in both sexes (from rates of 181, 101, 75, to 52 in the four decades studied) (Table and Fig. 1). The unspecific CoD have declined in both sexes, as well (from rates of 49, 39, 33, to 25 over decades) (Table 1, and cases distribution are available at Supplementary Table 2).

Fig. 1
figure 1

Regional comparative mortality ratios and 95% credible intervals of ill-defined and unspecific causes of death to Spain by decades. Both sexes, 1980–2019

Table 1 Quality categories of causes of death ratesa by region, sex and decade. Spain, 1980–2019

Women and men rates have decreased over the decades in all quality categories. Men have showed higher rates than women have in all quality categories in Spain (Tables 2 and 3, and cases distribution are available at Supplementary Tables 3 and 4).

Table 2 Proportions* of ill-defined causes of death by region, sex, and decade. Spain, 1980–2019
Table 3 Proportions* of unspecific causes of death by region, sex, and decade. Spain, 1980–2019

In the whole period of 1980–2019, 8 of the 18 regions (that was, Andalusia, Asturias, Balearic and Canary Islands, Ceuta & Melilla, Extremadura, Murcia, and Valencia; with rate range from 1,147 to 1,291) have had higher all-cause age-adjusted rates than Spain (rate of 1,098) (Table 1).

Spain has achieved a 7.6% of ill-defined CoD in the whole period and both sexes. Ill-defined CoD were higher in women than men were (9.0 versus 6.3%). The proportion of Ill-defined CoD by sex has decreased over the decades, but is still prominent in women in all four decades (13.4, 9.8, 8.5, and 6.9%) (Table 2). In unspecific CoD, Spain has registered a 3.1% in the whole period and sex. This proportion has remained steady at rounded 3% over decades by sex (Table 3).

Whole period, region and sex with rates indicator

In the whole period and in both sexes, seven of 18 regions (Ceuta & Melilla’s rate of 131, Andalusia 106, Madrid 103, Extremadura 101, Galicia 92, Castile & Lion 90, and Balearic Islands 85) have presented higher regional age-adjusted rates than Spain for ill-defined CoD. The same regions, excluding Balearic Islands, have displayed higher rates in women. Ill-defined rates were higher in men than women were in the whole period, with Murcia exception (rates of 71 versus 78, respectively). Men have showed higher rates in the same regions than in both sexes, plus Cantabria (Table 1).

All regions rates have decreased in all quality categories over time and sex. Five of the regions (Ceuta & Melilla rate of 107, Andalusia 81, Madrid 68, Extremadura 59, and Valencia 53) have exceed the Spanish rate in the last decade (2000–2019), for ill-defined CoD in both sexes. Women and men have showed the same pattern in the last decade but excluding Valencia in men (Table 1).

On unspecific CoD rates, 7 of 18 regions have exceed the Spanish rate in the last decade in both sexes (Extremadura 34, Andalusia 31, Murcia 30, Ceuta & Melilla 28, and Valencia 25). Women have pointed the same regions than have exceed in both sexes, while men have added one different more (Madrid), but excluding another (Valencia) (Table 1).

Quality categories with proportions indicator

The regional ill-defined CoD have lowered in proportions through the most recent decades and sex, but three regions (Ceuta & Melilla, Andalusia, and Madrid) have maintained or increased proportions by sex, except in women in one region (Madrid), although it was higher than Spain (9 versus 7%) (Table 2). Regional unspecific CoD in both sexes have maintained the proportions over decades, meanwhile 6 of 18 regions have registered the lowest proportions in the last decade compared to Spain (Asturias, Balearic Islands, Cantabria, Castile & Lion, Navarre, and Rioja), and previous regional decades. Women have included 4 of 18 regions, while men have sex-specified this both sexes pattern, including one region (Aragon) (Table 3).

Regional quality versus Spain by decade and sex with CMR indicator

The CMR of ill-defined CoD in both sexes have statistically exceeded Spain in 3 of the 18 regions (Andalusia, Extremadura, and Madrid) in all decades. The same excess has occurred in one other region (Ceuta & Melilla) in the last three decades. Another (Valencia) has registered excess mortality in the last decade (CMR = 1.02, 95% CrI 1.01 to 1.03). On the contrary, two regions (Castile & Lion and Galicia) that exceeded in the first three decades have decreased in the last (0.86, 0.85 to 0.88; and 0.95, 0.93 to 0.96, respectively) (Fig. 1 and Supplementary Table 5).

Women have showed an ill-defined mortality excess over Spain in 2 regions (Andalusia and Extremadura) in all decades; in one region (Ceuta & Melilla) in the last three decades; and in two different regions (Madrid and Valencia) in the last two decades (Fig. 2 and Supplementary Table 6). While, men have showed the same both sexes regional ill-defined pattern excess over all decades in the same 3 regions (Andalusia, Extremadura, and Madrid), and the same one in the last three decades (Ceuta & Melilla) (Fig. 3 and Supplementary Table 7).

Fig. 2
figure 2

Regional comparative mortality ratios and 95% credible intervals of ill-defined and unspecific causes of death to Spain by decades. Women, 1980–2019

Fig. 3
figure 3

Regional comparative mortality ratios and 95% credible intervals of ill-defined and unspecific causes of death to Spain by decades. Men, 1980–2019

The CMR of unspecific CoD in both sexes has statistically exceeded Spain in one region (Ceuta & Melilla) in all decades adding 5 more regions to the last decade (Andalusia, Canary Islands, Extremadura, Galicia, and Murcia) (Fig. 1 and Supplementary Table 8). Women and men have exceeded in 3 regions (Andalusia, Canary Islands, and Murcia) in in the last two decades; but also, Murcia and Extremadura in the last three decades, in women (Figs. 2 and 3, and Supplementary Tables 9 and 10, respectively).

Discussion

The quality of mortality statistics of causes of death has increased throughout the 40 years studied in Spain in women and men. However, quality gaps still remain in specific regions. Meanwhile, the best regional quality results have showed that there is scope for targeted upgrade.

In our experience, two major components comprise reliable quality of CoD statistics: First component involves medical certification (professionalism, health record access, and healthcare administration type) and the second involves post-certification and related to mortality registers [31, 32], coding skills [33], and the capacity for documental health information recovery [12].

Our purpose was the internal comparison of the regions with Spain over a long period. The age-standardised rates to the European Standard Population fulfilled our purpose of national and international comparison. However, given the general rates decrease in the two quality of mortality categories, we chose to describe its proportion composition with respect to all-cause of deaths between year periods, thus giving a better description.

This study has some limitations. The CoD selection and quality grouping may lack of comparability. However, our consensus on ICD10 code selection was based on the ICD10 instructions manual [34] and literature revision [6, 12, 35]. The two major proposals of quality assessment of CoD come from the Centers for Disease Control [6] and Anaconda software® [35], but these showed some qualitative differences. The CDC paper established 3 subtypes of CoD (unknown & ill-defined, immediate & intermediate, and nonspecific). The unknown and ill-defined causes included fewer codes than the ICD10 (18th chapter and annex 7.3). These immediate or intermediate CoD could also be reassigned to a general unspecific group, as well as to the ill-defined group by WHO criteria (ICD10 code I50 for heart failure). The annual national summary of 2.2% for unknown and ill-defined causes versus a 32.5% for the other unsuitable CoD, seems a broad gap to take action (Supplementary Annex B) [6, 12]. The Anaconda software encompasses 3 axes: the 1st axe, five qualify for uninformative subtypes (1- symptoms, sign, and ill-defined conditions; 2- impossible as CoD; 3- intermediate CoD; 4- immediate CoD; and 5- insufficiently specified CoD extracted from Global Burden Disease (GBD) [36]; the 2nd axe, four levels of health impact policies of 800 codes (Supplementary Annex B); and the 3rd axe, a vital performance index (of completeness, and garbage and impossible codes by age and sex). Although, GBD is dynamically updated [37], this praiseworthy effort, also expresses complex assessment outcomes, to take action further than lack of completeness [27, 38] and high numbers for ill-defined CoD, especially in low-income countries [10, 39]. In our case, for example, we considered dementia, ictus, pneumonia, or accidental poisoning by narcotics, reliable as primary health care diagnostics, however a detailed hospital-like testing technology may improve their accuracy.

The health and judicial administration framework matters in medical certification. Spain is supported by a Welfare State with National Health and Social Systems (public funding, universal access, a majority of centres of governmental propriety, and regional competences in health and social care budget and management), as well as a judicial system with forensic pathology and laboratory facilities at every regional centre. The Western European context of public funding and universal health care provision (private versus governmental) could be associated with completeness and validity of causes of death [40]. Likewise, the majority of diseases can be diagnosed through anamnesis and conventional physical examination and complementary tests at the Primary Health Care subsystem.

The process of completing and accurately coding a death certificate according to the ICD is challenging for all countries. Not all of them have achieved a good-quality threshold on mortality data. The WHO included in the medium-quality category several high-income Western European countries (such as Austria, Belgium, Denmark, France and Germany, regulated by universal Health Insurance systems) [14, 41]. Furthermore, in the present decade six high-income countries worldwide achieved a 9—31%, adding ill-defined (ICD10, 18th chapter) to impossible CoD [15]. Similarly, it would be a specificity error to classify 67.3% of the vital registration deaths as least-specific codes, without any further geographical or social context reference [21]. Providing the magnitude of poor quality death certification, health authorities did not seem to play a role in the probable random underestimation of the great and leading CoD [7, 13, 26,27,28, 31]. Currently, the COVID-19 pandemic has probably worsened death certification [42]. Statistics and health authorities may consider implementing the framework conditions to avoid miscertification. In addition, the WHO may include ill-defined conditions in the same ICD chapter in future revisions. Meanwhile, some national CoD registries have achieved top quality [32], and could be a standard target to replicate.

Some papers have emphasised the weakness of imputations made by case identification algorithms based on available health and population record information. The imputations from multiple search assignments of unsuitable CoD were proportionally predicted [11, 13, 15, 19,20,21, 43, 44] without a representative sample of validation [7, 13, 27, 28, 31].

Related to sex differences, we have assessed lower quality death certification in men than women at any territory and decade. Results, that are aligned to multi-country study that stated a no clear bias against women in death registration [45]. The three indicators applied adjusted for age groups. Sex differential in longer life might partially contributed to major medical unspecificity by polipathology bias. The general (not much) higher proportion quality in women than men on bad mortality quality, should consider that the magnitude of the population involved is described by the rates, which were higher in men than in women.

The poor statistical death quality showed in regions is much coincident to regional distribution of the Gross Domestic Product per Capita by regions of Spain (Supplementary Figure 2 Map). Moreover, poor death certification may be linked to individual characteristics (such as medical professionalism, the social stratum of the deceased, etc.) [46, 47]. This misclassification would imply regional and individual differential errors. The long-time evidence of our results is suggestive of a new organisational model with a multilevel health experts support to the National Institute of Statistics for a better regional and national standards upgraded [33].

Some studies have associated the deficiency of medical specialist education on death certification with miscertification in mortality statistics [48]. Courses for the improvement of CoD notification have oftentimes been imparted with diverse approaches and to different alumni, such as medical students or physicians in their specialisation [43, 49, 50]. Nowadays, this training is available through new communication technologies such as mobile phones [44], websites and e-learning platforms [43, 51]. Additionally, the WHO may introduce a certification of “medical competence on certification of causes of death” to foster the quality of mortality statistics worldwide.

As stated before, there is a general need of representative national validity studies of causes of death to address properly the post-certification informatics reassignment in CoD.

Conclusion

The reliability of the CoD has been improving over the last 40 years in Spain in both sexes. Regional gaps have persisted along those years and even in the last decade. Regional gaps mostly focused in Southern regions. Authorities involved might consider to take action and upgrade bad quality, and to develop a systematic medical post-grade training on death certification to improve regional differences and the quality of death statistics of Spain.