1 Introduction

In this paper we present a new composite index for measuring the vulnerability of countries and regions to climate change, natural disasters and other kinds of shocks, which is called the Global Data Lab (GDL) Vulnerability Index or GVI. The GVI focuses on the human aspects of vulnerability without taking physical aspects (like buildings, roads, landscape, soils, vegetation, etc.) or human-environment interactions (like agriculture and nature management) into account.

In recent decades awareness has grown that the impacts of climate change cannot be addressed by focusing on climate-related hazards alone, but that also resilience has to be strengthened by reducing vulnerabilities (Birkman et al., 2013; Parsons et al., 2016). Vulnerability is increasingly seen as part of a broader climate change and disaster risk reduction framework that distinguishes between (a) exposure-related factors, (b) the sensitivity or susceptibility to harm of societies, regions and ecosystems, and (c) their capacity to cope with changes and disasters and adapt to them (IPCC, 2022; Birkman et al., 2022). In this perspective, the occurrence of changes and events related to climate change, together with the degree to which a region is exposed to their effects and the vulnerability of the region, jointly determine the impact that is experienced (Füssel, 2006; UNDRR, 2019). Vulnerability hereby refers to people’s and societies’ sensitivity to harm as well as their capacity to cope and adjust and is defined straightforwardly as “[t]he propensity or predisposition to be adversely affected” (IPCC, 2014, 2022).

In the literature, vulnerability of groups and societies is supposed to encompass several human-related dimensions, including the economy, education, health, the position of women, governance, demographic factors and infrastructure (Hallegatte et al., 2018, 2020; IPCC, 2012, 2022; Muttarak & Lutz, 2014; Cutter, Boruff & Lynn, 2003; Lavell et al., 2012; Ebi et al., 2021). To assess the performance of societies on these dimensions, a broad set of indicators has been developed with which the position of a country or region on a specific dimension can be compared to the position of other countries and regions on that dimension (UNDP, 2024; Birkman et al., 2022; Miola, 2015; Huisman & Smits, 2009; Atwii et al., 2022).

Besides these dimension indicators, also composite indices have been developed that provide an overall picture of the capacity of a society to face adverse consequences of climate change (Garschagen et al., 2021; Birkmann et al., 2022). These composite indices have the advantage that they summarize different aspects of vulnerability into a single number. This makes them very useful for comparing countries, analysing trends, studying relationships between vulnerability impact, resilience and recovery, and helping steer finances and policies aimed at risk reduction and adaptation towards the most vulnerable areas (Garschagen et al., 2021; Feldmeyer et al., 2020; Becker et al., 2017; Füssel, 2006). The GVI, the vulnerability index presented here, is such a composite index. It shows the variation in vulnerability with regard to seven socioeconomic dimensions that are measured with eleven indicators.

The GVI is simpler than earlier developed vulnerability indices like the ND-Gain Vulnerability Index of Notre Dame University (ND-GAIN; Chen, 2015), the INFORM Index of the Joint Research Centre of the European Commission (INFORM; Marin-Ferrer et al., 2017), or the World Risk Index (WRI; Welle & Birkman, 2015). These country-level indices are constructed on a yearly basis by teams of experts and combine large numbers of underlying indicators (ND-GAIN: 45; INFORM: 54; WRI: 27). They are ingenious and encompassing, but also rather complex and difficult to understand for non-specialists.

With the GVI we aim to present an understandable and easy to use alternative. The GVI focuses on the core dimensions of socioeconomic vulnerability and uses a strictly data driven approach that is not dependent on expert judgement. The resulting index is universal and designed for use by experts as well as non-experts in the climate field. Any area of the world can be ranked on the GVI scale by filling in the underlying indicator values in a simple formula. The same formula can be used to compute the vulnerability of a country, grouping of countries (e.g. East Africa, South Asia, the Arabic world) or subnational region (province, district) at any point in time, as long as the required indicators are available. For the situation that one or two indicators are missing, reduced GVI formulas are available that rank the region as well as possible on the GVI scale given the information that is available.

The GVI scores are very similar to those of the existing vulnerability indices mentioned above. Our validity analyses reveal high correlations with the coping capacity, adaptive capacity and susceptibility subcomponents of ND-GAIN, INFORM and WRI. These correlations are in the same order as those between these subcomponents of the three indices themselves. This might be surprising at first glance, given that the GVI is based on far fewer indicators. However, recent evidence indicates that the human aspects of development are highly correlated and that development is in fact a low dimensional phenomenon (Yang & Xian, 2018; Kraemer et al., 2020; Ghislandi et al., 2019). Our finding in the Validation section that GVI scores are hardly influenced by removing one of the indicators also points in this direction.

The fact that GVI resembles these established indices so well offers great opportunities for the use of GVI in situations where no other vulnerability indices are available. In this paper, we demonstrate this possibility by computing GVI values from indicators at the level of 20 global regions. Given that currently more socioeconomic indicators are becoming available for subnational regions, like provinces or districts within countries (e.g. Smits & Permanyer, 2019; Mahecha et al., 2019; Crombach & Smits, 2024), also new possibilities arise for measuring climate change vulnerability at the small-area level. With its simple and flexible formula-based approach, the GVI might play a pivotal role in extending vulnerability measurement to these levels and in this way add more spatial detail to the global vulnerability analyses of the IPCC and other major players in the climate field.

In the next section, we discuss the major dimensions of vulnerability that are combined in the GVI, together with the indicators that are used to measure them. Thereafter the data and methods used in this paper are presented. The main GVI formula and its reduced versions are constructed by applying Principal Component Analysis on a dataset for 162 countries in the period 2015–2020. The index is subsequently validated by comparing it with the above-mentioned existing vulnerability indices, by testing its dependency on specific indices and by studying its relationships with the underlying indicators at different levels of economic development. In the result section, GVI values are presented for 189 countries and 20 global regions in the period 2000–2020. The paper is concluded with a conclusion section in which the outcomes are summarized and discussed.

2 Human Dimensions of Vulnerability

The GVI focuses on the human and societal aspects of vulnerability of communities, countries or other geographic areas of interest. It combines seven major dimensions, which – together with their indicators – are presented in Table 1. Physical aspects (landscape, soils, natural vegetation etc.) as well as exposure-related aspects are not included. Neither are human-environment interactions like agriculture and nature management. While the use of vulnerability indices like the GVI is not restricted to climate related changes and events and may also encompass other natural and human-related disasters and shocks, the focus of the current study and hence of the examples that will be used is on the climate field.

The economic dimension refers to the ability of economic actors, such as households, companies or states, to cope with climate change and related events, as well as the damage and economic loss caused by such events (Birkman, 2013). Both economic development at the level of regions and poverty at the level of households affect the risk of suffering from negative effects of climate change (Lavell et al., 2012; Cutter, Boruff & Lynn, 2003). Less developed countries tend to be more vulnerable to the impacts of natural hazards (Hallegatte et al., 2018, 2020), as they have larger vulnerable populations with less possibilities to adapt to the consequences of climate change, or to prepare for or recover from climate-related impacts (Lavell et al., 2012; Thomas et al., 2019; Andrijevic et al., 2021).

Table 1 Dimensions and indicators on which the GVI is baseda

Education affects the risk of suffering negative consequences of climate change since a well-informed population will be more aware of the possible risks of and the best ways to respond to climate change events (IPCC, 2012). People and societies with a higher level of education are better prepared for disasters and respond better to them and consequently tend to suffer less negative impact and recover faster (Muttarak & Lutz, 2014). Lack of knowledge and lower levels of education increase social vulnerability and reduce recovery capacity and the ability to receive and understand warnings (Cutter, Boruff & Lynn, 2003).

Gender has been found to be an important factor determining the degree of vulnerability to climate change (Eastin, 2018; Cutter, Boruff & Lynn, 2003). Women and girls often are at greater risk of dying in disasters, they are less included in decision making and may even be acted against in recovery and reconstruction projects (Houghton, 2009; Sultana, 2021). Gender does not make people vulnerable through biology, but as a consequence of societal structures and roles (Lavell et al., 2012; Birkmann et al., 2013), for example by discouraging women from participating in survival training.

A weak health status of the population and inadequate health systems, including poor hospital and laboratory infrastructures, may increase vulnerability to climate change events considerably (Ebi et al., 2021). Specific groups, such as the (very) young and old, and people with underlying health conditions might be more vulnerable to disasters and in the aftermath of such events. People with chronical diseases, such as diabetes and obesity, and the elderly are at greater risk during heat waves (Watts et al., 2021). Children are particularly susceptible to dehydration and infectious diseases (Hellden et al., 2021).

Inferior infrastructure may act as a driver of vulnerability to climate change. People without access to clean drinking water and sewage systems are more susceptible to disease in the aftermath of a hazard (Miola, 2015). Electricity is essential for preparing and acting in times of emergencies. Communication means such as mobile phones and internet may help reduce the impact of climate change. People with limited access to information are more vulnerable since they might not be aware of the full scale of the imminent hazard, unaware of how to respond, or might not be alerted about it in the first place (Hansson et al., 2020). Mobile phones may help spread early warning signals in areas prone to disaster and support the organization of post-event responses (Dujardin et al., 2020).

Governance is essential for reducing climate change vulnerability and improving coping capacity of countries and regions (Andrijevic et al., 2020). Good quality governance makes it easier to develop strategies and implement policies to deal with the impacts of climate change and to act in times of crisis.

Demographic characteristics like age structure and population density are important factors related to climate change vulnerability (Lavell et al., 2012; Son, Coco Liu & Bell, 2019). Certain population groups, such as the very young and old are more vulnerable than others due to ‘biological sensitivity’ (Thomas et al., 2019; Cutter, Boruff & Lynn, 2003; Miola, 2015). Another relevant demographic factor is urbanization. Rapid and unplanned urbanization may increase vulnerability, particularly in low and middle income countries (LMICs) where it associated with slums and informal settlements, often located on peripheral lands and areas more at risk of climate-related events (Lavell et al., 2012; Son, Coco Liu & Bell, 2019). Rural communities are potentially more vulnerable because rural areas often have lower priority for governments (Lavell et al., 2012). The demographic dimension is measured by the percentage of the population living in urban areas and the Dependency Ratio (Crombach & Smits, 2021), which is computed by dividing the size of the dependent population (aged 0–15 and 65+) by the size of the working age population (aged 15–65).

3 Methods

3.1 Data

To construct the GVI, a database for the period 2000–2020 was built with indicators for 189 countries derived from three reputable international sources. From the World Development Indicators of the World Bank (WDI, 2024) we derived GDP per capita and the poverty headcount ratio at US$ 3.65 a day (both in 2017 international dollars PPP), the percentages of people using safely managed drinking water services, with access to electricity and living in urban areas, the number of mobile cellular subscriptions per 100 people, and the Dependency Ratio.

From the Human Development Index Database of the Human Development Report Office of the UNDP (UNDP, 2024), we derived mean years of schooling of the adult (25+) population, life expectancy at birth and the Gender Development Index (GDI), which indicates the level of human development of women relative to men (UNDP, 2024).

To indicate governance, we use the Worldwide Governance Indicators (WGI) from the WGI Database of the World Bank (WGI, 2022). This database includes six governance indicators -- voice and accountability, political stability, government effectiveness, regulatory quality, rule of law and control of corruption – measured with standardized variables. To obtain a single figure indicator, the mean of these six indicators was taken. As the WGI values were lacking for 2001, for that year the WGI values were interpolated between the years 2000 and 2002.

The mobile cellular subscription indicator was adjusted, because in many countries people on average have more than one subscription, up to over two per person. After a certain number, more subscriptions do not mean more communication possibilities. We consequently choose to maximize this indicator at 125 subscriptions per 100 people, which is in the order of the numbers found in EU countries and among the values with the highest explained variation in the Principal Component Analysis (PCA).

The dataset constructed on the basis of World Bank and UNDP data still contained some missing values, in particular for LMICs. This can be problematic, as for obtaining comparability over time, the same indicators should be used for each year. We therefore imputed the missing values in the GVI Start Dataset with indicators derived or estimated from databases of the Global Data Lab (GDL; www.globaldatalab.org). This was done in the following ways. Mean years of schooling for Somalia (2006, 2011) was derived from the GDL Area Database; the Gender Development Index (GDI) for Fiji (2015), Myanmar (2015) and Papua New Guinea (2015), was derived from the GDL Human Development Database (version 5.0): GDP per capita for Cuba (2019), Eritrea (2002), Somalia (2006, 2019), South Sudan (2008, 2010), Venezuela (2001, 2013), Yemen (2006, 2013) was estimated based on the GDL International Wealth Index (IWI); the poverty headcount ratio at US$ 3.65 a day for Afghanistan (2010, 2015), Barbados (2012), Belize (2006, 2011), Cambodia (2000, 2005, 2010, 2014), Eritrea (2002), Guyana (2006, 2009, 2014, 2019), Kosovo (2014, 2020), Kuwait (2014), Libya (2014), Somalia (2006, 2019), Suriname (2006, 2010, 2018), Trinidad and Tobago (2006, 2011), Turkmenistan (2006, 2015, 2019), Tuvalu (2019) and Turks and Caicos Islands (2019) was estimated on the basis of the GDL Poverty50 measure; the GDI for Djibouti (2006), Guinea Bissau (2006, 2014, 2019), and Turkmenistan (2015, 2019) was estimated on the basis of mean age difference between spouses, mean age at marriage, IWI and the Human Development Index, all derived from the GDL website.

Because not all missing data could be imputed with GDL indicators, some additional measures had to be taken to complete the database. First, if values were missing for in-between years, the missing values were imputed by linear interpolation between the nearest lower and higher years for which information was available. Second, if this was not possible because no values for an earlier and a later year were available, we allowed the use of neighbouring values for up to four years. For the poverty headcount ratio this period was extended to ten years, because this indicator had substantially more missing values than the others. For a few countries we allowed neighbouring values for more years to be able to include them in the analysis. For poverty this was the case for Algeria (+1), Barbados (+3), Myanmar (+6), Papua New Guinea (+1), Serbia (+2), Seychelles (+3), United Arab Emirates (+3) and Zimbabwe (+1); for GDI for Bosnia and Herzegovina (+1), Guinea Bissau (+2), Palestine (+1), Turkmenistan (+1) and Vanuatu (+1); for water for USA (+1), for Poland (+1) and for electricity for Liberia (+3). If this did not yet solve the issue for a country, the data for the problematic years were removed and the GVI had to be computed over less years. With these steps, we were able to obtain indicator values for 189 countries in our database.

For the construction of the GVI formula in the PCA analysis, data from our database for the period 2015–2020 was used. Given the universal nature this formula should have, the requirements for country selection were even stricter than for the complete 2000–2020 database. All included countries were required to have values for all indicators in all years of the 2015–2020 period, so that each country could have the same weight in the analysis. Missing values within this period were imputed by linear interpolation for in-between years and with the values of neighbouring years for end years. For indicators with missing values for all years in the 2015–2020 period, values were taken from the most recent earlier year after 2010 for which valid information was available. If no valid information was available after 2010, the country was removed from the analysis. After making these selections, 162 countries from all regions of the globe remained.

To validate the GVI, its values at country level are compared with those of (the subindices of) three established climate vulnerability indices: ND-GAIN, INFORM and World Risk Index. For this purpose, data with the values of those (sub) indices for the year 2020 were downloaded from the following websites: ND-GAIN: https://gain.nd.edu/our-work/country-index/download-data/ (downloaded on 6-2-2023); INFORM: https://drmkc.jrc.ec.europa.eu/inform-index/INFORM-Risk/Results-and-data/moduleId/1782/id/453/controller/Admin/action/Results (downloaded on 6-2-2023); World Risk Index: https://weltrisikobericht.de/weltrisikobericht-2022-e/ and https://weltrisikobericht.de/download/2977/ (downloaded on 10-2-2023). The validation dataset contained GVI values for 189 countries, ND-GAIN values for, depending on the (sub)index, between 176 and 192 countries, INFORM values for 191 countries and WRI values for 193 countries. Because of missing values, the validation analyses were performed on between 175 and 188 countries.

3.2 Index Construction

The simplest way to construct a composite index is to give the same weight to all the underlying indicators (McKenzie, 2005; OECD, 2008; Howe et al., 2012). However, this would imply that all indicators have the same effect on the final index, which in most cases is not very realistic. It therefore is recommendable to use a more advanced method to determine the relative weights of the included indicators (Filmer & Pritchett, 2001; Sahn & Stifel, 2003; OECD, 2008). Here we use an empirically based method and apply principal component analysis (PCA) for computing the weights. This has the advantage that the weights are not affected by subjective judgements of experts and are completely determined by the data.

PCA is a multivariate statistical technique that reduces the number of variables in a dataset by converting them into a smaller number of components; each component being a linear weighted combination of the initial variables (Bartholomew et al., 2002; Nardo et al., 2005; Vyas & Kumaranayake, 2006). The first component, which explains the largest part of the variation, can then be chosen as the central index (compare Filmer & Protchett, 2001; Smits & Steendijk, 2015; Crombach & Smits, 2024) for similar approaches in the field of wealth measurement). For GVI the first component explains 66.4% of the variation in the underlying indicators. Compared to the construction of wealth indices, where the first factor generally explains 25–35% (Smits & Steendijk, 2015; Howe et al., 2012), this is a very satisfactory explanatory power.

Table 2 presents an overview of the mean and standard deviation of the indicators included in the construction of the GVI (columns 1 and 2), plus the ‘raw’ indicator weights produced by PCA analysis on this database (column 3). These PCA weights reflect the possibility that a country that has a high (low) value on one of the indicators also has a high (low) value on other indicators (Vyas & Kumaranayake, 2006; OECD, 2008). The weights show that all indicators contribute substantially to the index. The highest weights are observed for poverty, life expectancy and access to clear water. The lowest weights for the Gender Development Index, phone subscriptions and urbanization.

Table 2 Mean and standard deviations of the indicators, raw indicator weights, and coefficients of the GVI formula

On the basis of the raw indicator weights we computed a raw vulnerability score by multiplying the indicator variables with their indicator weights and summing them up. This procedure is shown in the following equation, where \(\:{y}_{r}\) is the raw vulnerability score, \(\:{\beta\:}_{n}\) the estimated raw indicator weight of the \(\:n\)Th indicator and \(\:{x}_{n}\) the (standardized) indicator value of the \(\:n\)Th indicator.

$$\:{y}_{r}=\sum\:{\beta\:}_{n}\cdot\:{x}_{n}$$

When applying this formula to our data, we obtained a country vulnerability distribution with a minimum score of -2.58671677 and a maximum score of 1.76018853. To get a scale with a more intuitively appealing range, we transformed the distribution to values in the 0-100 range. This was done in such a way that some room was left for lower and higher values in the data for earlier and later years than the period for which the index was constructed (2015–2020). To construct this scale, we used the following formula:

$$\:GVI^{\prime\:}=L-\frac{\left(\sum\:{\beta\:}_{n}\cdot\:{x}_{n}-\text{a}\text{b}\text{s}\left(\text{M}\text{i}\text{n}\right)\right)}{\left(\text{a}\text{b}\text{s}\left(\text{M}\text{i}\text{n}\right)+\:Max+\text{H}\right)/100}=-22.63157686+\sum\:{\beta\:}_{n}^{{\prime\:}}\cdot\:{x}_{n}\:\:$$

Where Min is the lowest possible value of the distribution (-2.58671677), Max the highest possible value of the distribution (1.76018853), L the lowest value we would like to give the scale, and H a constant chosen strategically to obtain a preferred highest value of the scale. On the GVI’ scale constructed in this way higher values refer to countries that are less vulnerable, which is not very intuitive. We therefore have reversed the scale to create the final GVI scale in the following way:

$$\:GVI=100-GVI^{\prime\:}\:$$

The values of L and H were chosen in such a way that the final GVI scale runs from 10 to 95 in the period 2015–2020. In this way, countries at the lower (least vulnerable) end of the scale keep ample space for future improvements, while at the upper (most vulnerable) end some room remained for more extreme vulnerability levels in past data. Future or past values below zero or above 100 have to be recoded to zero or 100 to keep the scale in the zero-100 range. The final GVI scores are rounded to one decimal place. The database with the GVI and its underlying indicators for the period 2000-2020 is available in the Supplementary Information and at the website of the Global Data Lab of Radboud University (http://www.globaldatalab.org), where also connections with other indicators can be made and future updates will be presented.

4 Validation

4.1 Correlations with Existing Indices

To validate the GVI and position it in the climate vulnerability field, we compare its performance with the three widely used vulnerability indices mentioned in the introduction: ND-GAIN, INFORM and WRI. All three indices include a number of subindices that aim to measure different aspects of vulnerability. ND-GAIN assumes two broad dimensions, called vulnerability and readiness, whereby vulnerability is defined as the propensity of societies to be negatively impacted by climate hazards and readiness as the degree of safety and efficiency of the business environment in making efficient use of adaptive actions (Chen et al., 2015). The vulnerability dimension is most relevant for our work. It consists of three subcomponents, (1) exposure, which captures the external physical aspects that contribute to vulnerability, (2) sensitivity, which indicates to what extent a society is affected by climate-related hazards, and (3) adaptive capacity, which refers to the possibilities to respond and adapt to the negative consequences of climate change.

The INFORM index is presented as a tool for understanding the risk of humanitarian crises and disasters (Marin-Ferrer et al., 2017). It covers three dimensions of risk: (1) hazards and exposure, referring to averse events that can occur, (2) vulnerability, signalling the susceptibility of societies to those averse events, and (3) coping capacity, indicating the availability of resources that may reduce the impact of those events.

Risk is also a central element in the WRI. Within this framework, risk is defined as the interaction of exposure and vulnerability (Atwii et al., 2022). Exposure relates to the (natural) circumstances in which populations live and the degree to which these circumstances subject them to natural hazards and other negative consequences of climate change. Vulnerability refers to the predisposition of those populations to be affected by those hazards and their negative consequences. Vulnerability in the WRI context is considered to include three dimensions: (1) Susceptibility, which relates to characteristics of a society that affect the probability of suffering damage from a disaster and being able to overcome its negative consequences, (2) coping capacity, which refers to the possibilities and resources available to reduce these negative impacts by direct actions, and (3) adaptive capacity, which specifies the availability of long-term strategies and activities that may help counteract negative consequences in future (Atwii et al., 2022).

Hence all three indices include an external dimension -- referring to the presence and intensity of climate-related influences and shocks – and an internal dimension – referring to characteristics of societies that may or may not help them to resist and overcome the consequences of those influences and shocks. The GVI as a socioeconomic composite index is aimed at measuring the second – internal – dimension. This dimension can be further divided into two subdimensions. On the one hand there is a sensitivity or susceptibility aspect that refers to the strength with which the external factors are felt by the affected society. On the other hand a coping or adaptive capacity aspect that reflects the resources available in a society that help reduce negative consequences and build resistance against future shocks.

To validate the GVI, we have computed Pearson correlations between the national GVI values in the year 2020 and the national values of (the subindices of) ND-GAIN, INFORM and WRI for that year. These correlations are shown in Table 3. We see that GVI is very highly (-0.89) correlated with ND-Gain, its vulnerability subdimension (0.89) and the adaptive capacity component thereof (0.89). It is also highly correlated with the INFORM index (0.84), its vulnerability subdimension (0.82) and particularly strong with its coping capacity component (0.93). Correlations between GVI and the exposure component of ND-GAIN and INFORM are with values around 0.50 clearly lower, but not neglectable. GVI is uncorrelated with the WRI (0.09), but this is mainly due to its lack of association with the WRI exposure component (-0.11). In fact, it is strongly correlated with the vulnerability subdimension (0.77) and particularly its susceptibility (0.78) and adaptive capacity (0.81) components.

Table 3 Pearson correlations between national GVI values in the year 2020 and the national values of (the subindices of) ND-GAIN (G), INFORM (I) and WRI (W) for that year, including two-tailed significance levels and number of cases on which the correlations are based
Table 3 continued

These correlations are in the same order as those between these subcomponents of the three indices themselves. The correlation of adaptive capacity of ND-GAIN with coping capacity of INFORM is 0.91 and with susceptibility and adaptive capacity of WRI 0.70 and 0.77 respectively. The correlations of coping capacity of INFORM with susceptibility and adaptive capacity of WRI are 0.71 and 0.80 respectively. Susceptibility and adaptive capacity of WRI are themselves correlated 0.72.

The high correlations of GVI with the vulnerability dimensions and in particular the coping capacity, adaptive capacity and susceptibility components of the three established indices – also in comparison with the correlations between the subcomponents of these indices themselves – show that despite its simplicity the data-driven GVI represents the human or societal components of climate vulnerability very well.

We therefore can conclude that with the simple and flexible approach used for the construction of GVI, a picture of the global distribution of climate change vulnerability is obtained that is very similar to the outcomes of these more sophisticated indices. This offers great prospects for the application of the GVI to other regional subdivisions and aggregates for which no other vulnerability index is available. In particular the possibility to apply the index to subnational areas like provinces and districts within countries might be very helpful for adding more spatial detail to the global vulnerability analyses of IPCC and other important players in the climate field.

4.2 Reduced GVI Versions

While the PCA analysis made clear that all indicators contributed substantially to the index, it remains to be seen to what extend the GVI depends on the inclusion of specific indicators. This is an important question, not only for validation reasons, but also because it regularly happens – in particular for LMICs – that not all indicators are available for a specific region. If the dependency on specific indicators is not very strong, reduced version of GVI might be constructed that might place a region with missing data as well as possible on the GVI scale given the available information.

To address this issue, we use an approach developed by (Smits & Steendijk, 2015), who performed additional PCA analyses for creating reduced versions of their wealth index. Following them, we conducted a number of additional PCA analyses, each time with one or two of the indicators removed. On the basis of the outcomes of these analyses, new GVI formulas were developed for creating reduced vulnerability indices that were scaled in a similar way as the main GVI. The reduced formulas are available at the website of the Global Data Lab (https://www.globaldatalab.org).

To test the quality of the reduced indices, Pearson correlations between these indices and the original GVI were computed. All correlations turned out to be higher than 0.99, which is so high that hardly any difference between the original and reduced versions of the GVI seems to exist. We therefore are led to the conclusion that the GVI is barely influenced by the removal of specific indicators and that none of the underlying indicators seems essential for assessing the vulnerability of a country.

This finding, together with the relatively high explained variation in the PCA analysis and the fact that all indicators contributed more or less similar to the PCA model, suggests that socioeconomic vulnerability is a low-dimensional phenomenon. This may also imply that sub-versions of the GVI based on restricted information might still provide a reasonable picture of climate change vulnerability in situations with weak data infrastructures, like in resource-poor regions of LMICs.

4.3 Importance of Indicators at Different Development Levels

The finding that all dimension indicators contribute to the composite index does not mean that their importance is everywhere the same. It is possible that at different levels of development the index is dominated by different subdimensions (Permanyer & Smits, 2020). To test for this possibility, we have for the year 2020 computed Pearson correlations between the GVI and all underlying indicators for the four global income regions defined by the World Bank (2024). Because three of our subdimensions – income, education and life expectancy -- are the main subdimensions of the Human Development Index (HDI) of the UNDP (2024), we also include the HDI in this analysis. This gives us the opportunity to increase our understanding of the effect of combining dimensions on index stability.

The results are presented in Table 4. We see that the large majority of correlations is substantial and significant at P < 0.05 or higher level. The figures reveal that the contribution to GVI of the subindices may differ between countries in different income regions. GDP per capita contributes most in the lower middle and high income countries, poverty and education in the lower middle income countries, the GDI in the low income countries, life expectancy at all income levels, water and electricity in low and lower middle income countries, mobile phone subscriptions in low income countries, governance in high income countries, the dependency ratio in low and middle income countries and urbanization in high income countries. The HDI – as a composite of three of the subdimensions -- shows a substantial correlation at all levels of development. This indicates that composite indices created by combining several subindices are not only more encompassing and concise than their subindices, but also more stable across different levels of economic development.

Table 4 Pearson correlations between GVI values in the year 2020 and the values of its subindices for the four global income regions of the World Bank, including two-tailed significance levels and number of cases on which the correlations are based

5 Results

5.1 National GVI Values

In the Supplementary Information, yearly GVI values for the 2000–2020 period are presented for the 189 countries for which the GVI is available, together with the underlying indicators on which the GVI values are based. In Fig. 1, GVI values for 2000 and 2020 are displayed on world maps. As could be expected, socioeconomic vulnerability is low in the most developed countries and high in the poorest countries. Particularly high GVI values, indicating very high levels of vulnerability, are found in sub-Saharan Africa and South and South East Asia, whereas low values can be found in West European countries – predominantly North West Europe – North America and Australia/New Zealand.

Fig. 1
figure 1

World maps with the distribution of the GVI at country level in 2000 and 2020, plus the change in vulnerability between 2000 and 2020

Between 2000 and 2020 the level of socioeconomic vulnerability improved in all countries for which we have data for both years. Whereas in 2000 50 countries had a GVI-value above 75, indicating very high levels of vulnerability, in 2020 this had dropped to 17 countries.

The map at the bottom of Fig. 1 shows the changes in GVI values that countries experienced over the 2000–2020 period. Countries which already had a low level of vulnerability in 2000, i.e. most Western countries, experienced the lowest levels of change. However, while absolute levels of improvement were relatively minor in these countries, relative levels of improvement could still be large. Luxembourg for instance almost halved its GVI-value from 19.4 in 2000 to 10.5 in 2020.

The largest levels of reduction in socioeconomic vulnerability were experienced by some of the countries with the highest levels in 2000, i.e. countries in South and South East Asia, and the South East and North West of Africa. However, other countries with high initial levels of vulnerability, in particular some landlocked African countries, experienced little improvement. Niger, Chad and the Central African Republic for instance were among the countries with the highest levels of vulnerability both in 2000 and 2020.

5.2 GVI Values for Global Regions

The formula approach underlying the GVI makes it possible to compute GVI values for any regional (sub)division for which the required indicators are available. To demonstrate this possibility, we have computed GVI values at the level of 20 UN Geoscheme Regions (the M49 standard; UN, 2023), whereby Micronesia, Melanesia and Polynesia where combined into one region. To create the GVI values, we first have computed values for the eleven underlying indicators at the level of these regions. This was done by taking the population weighted mean of the country values. After that, the GVI values were computed by entering the regional indicators in the GVI formula.

Not surprisingly, the conclusion on the basis of Fig. 2 is rather similar to that of Fig. 1. Sub-Saharan Africa, with the exception of the Southern part, is clearly the most vulnerable region of our globe, followed by the island states in the Pacific Ocean, South Asia, North Africa and the Caribbean. Northern America, Western Europe, Northern Europe and Australia/New Zealand are again from a socioeconomic perspective the least vulnerable regions. The GVI values and underlying indicators for the UN regions in the period 2000–2020 are available in the Supplementary Information.

Fig. 2
figure 2

World maps with the distribution of the GVI at the level of 20 UN Geoscheme regions (M49 standard) in 2000 and 2020. Regions distinguished: Northern Africa, Western Africa, Middle Africa, Eastern Africa, Southern Africa, Northern America, Central America, South America, Western Asia, Central Asia, Eastern Asia, Southern Asia, South-eastern Asia, Australia and New Zealand, Caribbean, Northern Europe, Western Europe, Eastern Europe, Southern Europe, and Pacific (with the regions Micronesia, Melanesia and Polynesia combined)

6 Conclusions

Composite vulnerability indices are widely used instruments for measuring the sensitivity, coping capacity and adaptive capacity of regions regarding the impact of climate change, natural disasters and other kinds of shocks. Established indices like the ND-GAIN, INFORM and WRI are created by teams of experts and released on a yearly basis for most countries of the world. They are ingenious and encompassing – combining large numbers of underlying variables – but at the same time rather complex and not computable by non-specialists. Given their institutional/expert based origin, their scores have to be taken at face value and are only available for the countries and regions for which they are produced.

With the GVI, we aim to present a simple and flexible alternative. GVI uses a strictly data driven approach that is not dependent on expert judgement. It focuses on seven core dimensions of vulnerability -- the economy, education, gender, health, infrastructure, governance and demography – which are measured with eleven indicators. The resulting index is universal, encompassing and easy to use. Any person who knows the values of the underlying indicators for a specific area or community can determine the area’s or community’s GVI score, by entering the indicator values in a simple additive formula.

The GVI summarizes the human components of vulnerability in a number between 0 and 100, with higher values indicating higher levels of vulnerability.

Correlations between GVI scores and subcomponents of ND-GAIN, INFORM and WRI reveal that GVI is an effective indicator of the subcomponents coping capacity, adaptive capacity and sensitivity. The mutual correlations between the coping capacity, adaptive capacity and sensitivity subcomponents of the three indices themselves are in the same order as the correlations of the GVI with these subcomponents. This suggests that that GVI may come close to the greatest common denominator of these components and thus to a large extent represents what these indicators aim to measure.

The finding that a restricted index based on far fewer indicators than these established indices has similar performance in vulnerability measurement might seem surprising at first glance. However, using more indicators is not necessarily better. The selection, normalization and weighting of a larger number of indicators involves more arbitrary choices than of a smaller number of indicators and hence may reduce instead of increase the quality of the resulting index (OECD, 2008). This is particularly the case with the human aspects of development, which are known to be highly correlated (Yang & Xian, 2018; Kraemer et al., 2020; Ghislandi et al., 2019). The PCA approach we use for the GVI may also be better fitted for constructing an overall index than the min-max approach used for the other indices, as PCA analysis as technique is specifically designed to extract the common denominator of sets of correlated indicators (OECD, 2008; Nardo et al., 2005; Vyas & Kumaranayake, 2006).

The GVI scores at country and regional level reveal high levels of vulnerability in the poorest countries, particularly in sub-Saharan Africa, the Pacific island states, and South and South East Asia. In contrast, low GVI scores, indicating low levels of vulnerability, are found in developed countries, particular in North West Europe, North America and Australia. Between 2000 and 2020 vulnerability improved in all countries for which we have data. As might be expected, countries and regions which had already low levels of vulnerability in 2000 showed the smallest reductions, while some countries and regions with the highest levels of initial vulnerability showed the largest reductions. However, the landlocked African countries improved disappointingly little and remain among the most vulnerable countries of our world.