Keywords

The Paris Climate Agreement (UNFCCC, 2015) ‘notes that … emission reduction efforts will be required … to hold the increase in the global average temperature to below 2 °C above pre-industrial levels…’. The Intergovernmental Panel on Climate Change (IPCC) further quantified the carbon budget to achieve this target in its Sixth Assessment Report of the Working Group (IPCC, 2021). According to the IPCC, a global carbon budget of 400 GtCO2 is required to limit the temperature rise to 1.5 °C, with 67% likelihood, by 2050.

To implement these targets, energy and climate mitigation pathways are required. Numerous computer models for the analysis and development of energy and emission pathways have been developed over the last few decades. Many different calculation methods have been established, which mainly differ in the principal task of the model and the level of detail in the GHG emissions and/or energy systems calculated. The various methods of climate-economy modelling use different ways to describe the economy- and climate-relevant parameters as parts of a highly interconnected process (Nikas et al., 2019). In this context, the economy includes all aspects of the energy system and the policy framework, whereas the climate module reflects various GHG emissions from energy-related and non-energy-related processes, such as land use.

A comprehensive review of energy models, focusing on the usability of those models for decision-making, found ‘that a better understanding of user needs and closer co-operation between modellers and users is imperative to truly improve models and unlock their full potential to support the transition towards climate neutrality …’ (Süsser et al., 2022).

1 The OneEarth Climate Model

The UN-convened Net-Zero Asset Owner Alliance (NZAOA) is an international group of institutional investors committed to transitioning their investment portfolios to net-zero emissions by 2050 (NZAOA, 2021). Detailed industry sector-based energy scenarios are required to implement those net-zero commitments. On the basis of the OneEarth Climate Model (OECM; Teske et al. 2019a, b), the Institute for Sustainable Futures, University of Technology Sydney (UTS/ISF), in close co-operation with institutional investors, has developed an integrated energy assessment model for industry-specific 1.5 °C pathways, with high technical resolution, for the finance sector. In this article, we describe the detailed methodology and the architecture of the energy model in the 2021 edition of the advanced OneEarth Climate Model (OECM 2.0).

1.1 The OneEarth Climate Model Architecture

The OneEarth Climate Model has been developed on the basis of established computer models. The energy system analysis tool consisted of three independent modules:

  1. 1.

    Energy system model (EM): a mathematical accounting system for the energy sector (Simon et al., 2018)

  2. 2.

    Transport scenario model TRAEM (transport energy model) with high technical resolution (Pagenkopf et al., 2019)

  3. 3.

    Power system analysis model [R]E 24/7, which simulates the electricity system on an hourly basis and at geographic resolution to assess the requirements for infrastructure, such as grid connections between different regions and electricity storage types, depending on the demand profile and power generation characteristics of the system (Teske, 2015)

The advanced OneEarth Climate Model, OECM 2.0, merges the energy system model (EM), the transport energy model (TRAEM), and the power system model [R]E 24/7 into one MATLAB-based energy system module. The Global Industry Classification Standard (GICS) was used to define sub-areas of the economy. The global finance industry must increasingly undertake mandatory climate change stress tests for GICS-classified industry sectors in order to develop energy and emission benchmarks to implement the Paris climate protection agreement. This requires very high technical resolution for the calculation and projection of future energy demands and the supply of electricity, (process) heat, and fuels that are necessary for the steel and chemical industries. An energy model with high technical resolution must be able to calculate the energy demand based on either projections of the sector-specific gross domestic product (GDP) or market forecasts of material flows, such as the demand for steel, aluminium, or cement in tonnes per year.

To decarbonize the energy supply, fossil fuels must be phased out and replaced by a renewable energy supply. However, the supply of high-temperature process heat for various production processes cannot yet be fully electrical, and a simple fuel switch from oil, gas, or coal to biomass is also impossible, given the limited availability of sustainable bioenergy (Seidenberger et al., 2010; Farjana et al., 2018). To develop a detailed sector-specific solution, the temperature level required must be considered when developing an energy scenario. An energy model with such high technical resolution can provide detailed results for various industry sectors but requires a highly complex and data-intensive model architecture. Separate modules for the calculation of different sectors of the energy system are not practicable for such high technical resolution because high electrification rates lead to increased sector coupling, and the interactions between sectors cannot be captured if the energy model uses separate modules.

Furthermore, the geographic distribution of the energy demand and supply must be accommodated to calculate the import and export of energy, especially for energy-intensive industries. Finally, the simulation of 100% renewable energy systems requires high time resolution to accommodate the high proportions of variable solar and wind energy.

The MATLAB model has an object-oriented structure and two modules—to calculate demand and supply—that can be operated independently of each other. Therefore, an energy demand analysis independent of the specific supply options or the development of a supply concept based on demand from an external source is possible.

1.2 The OECM Demand Module

The demand module uses a bottom-up approach to calculate the energy demand for a process (e.g. steel production) or a consumer (e.g. a household) in a region (e.g. a city, island, or country) over a period of time. One of the most important elements in this approach is the strict separation of the original need (e.g. to get from home to work), how this need can be satisfied (e.g. with a tram), and the kind of energy required to provide this service (in this case, electricity). This basic logic is the foundation for the energy demand calculations across all sectors: buildings, transport, services, and industry. Furthermore, the energy services required are defined: electricity, heat (broken down into four heat levels: <100 °C, 100–500 °C, 500–1000 °C, and > 1000 °C), and fuels for processes that cannot (yet) be electrified. Synthetic fuels, such as hydrogen, are part of both the demand module, because electricity is required to produce it, and the supply module.

The energy requirements are assigned to specific locations. This modular structure allows regions to be defined and, if necessary, the supply from other areas to be calculated.

Demand and generation modules are independent and can be used individually or sequentially. Energy demands can be calculated either as synthetic load profiles, which are then summed to annual energy demands, or as annual consumption only, without hourly resolution. Whether or not hourly resolution is selected depends to a large extent on the availability of data. Load profiles, such as those for the chemical industry, are difficult to obtain and are sometimes even confidential.

1.2.1 Input Parameters

As in basic energy models, the main drivers of the energy demand are the development of the population and of economic activity, measured in GDP. Figure 3.1 shows the basic methodology of the OECM demand module. Tier 1 inputs are population and GDP by region and sector. ‘Population’ defines the number of individual energy services, which determines the energy required per capita, and ‘economic activity’ (in GDP) defines the number of services and/or products manufactured and sold. Tier 1 demand parameters are determined by the effect that a specific service requires. For population, the demand parameters are defined by the need for food, shelter (buildings), and mobility and—depending on the economic situation and/or lifestyle of the population—the demand for goods and services.

Fig. 3.1
Two charts with 2 demand tiers. Demand tier 1 has cause, leading to effect, that leads to demand change impact factors. Examples of population and economy are given below. Tier 2 has energy intensity, leading to quantity, pointing to energy.

Tier 1 and tier 2 input parameters for the assessment of energy demand

Economic activity (measured in GDP) is a secondary input and is directly and indirectly dependent upon the size of the population. However, a large population does not automatically lead to high economic activity. Both population and projected GDP are inputs from external sources, such as the United Nations or the World Bank. Tier 1 input parameters themselves are strictly non-technical. The need to produce food can be satisfied without electricity or (fossil) fuels, just as a service can be provided with physical strength.

Tier 2 demand parameters are energy-relevant factors and describe technical applications, their energy intensities, and the extent to which the application is used. For example, if lighting is required, the technical application ‘light bulb’ is chosen to satisfy the demand.

In this example, the energy intensity is the capacity of a light bulb, e.g. 100 W. The use of the application (e.g. for 5 h per day) defines the daily demand (5 h × 100 W = 500 Wh per day). The quantity of consumption per year is 365 days at 500 Wh per day = 1825 Wh or 182.5 kWh per year. This very basic and simple principle is used for every application in each of the main sectors: residential + buildings, industry, and transport. These sectors are broken down into multiple sub-sectors, such as aviation, navigation, rail, and road for transport, and further into applications, such as vehicle types. The modular programming allows the addition of as many sub-sectors and applications as required.

1.2.2 Structure of the Demand Module

Each of the three sectors, residential and buildings, industry, and transport, has standardized sub-structures and applications. The residential sector R (first layer) has a list of household types (second layer), and each household type has a standard set of services (third layer), such as ‘lighting’, ‘cooling’, and ‘entertainment’. Finally, the applications for each of the services are defined (fourth layer), such as refrigerator or freezer for ‘cooling’. The energy intensity of each application can be altered to reflect the status quo in a certain region and/or to reflect improvements in energy efficiency. An illustrative example of the layers of the residential sector is shown in Fig. 3.2.

Fig. 3.2
A chart of residential sector with 4 layers. Layer 1. Residential. Layer 2. Residential leads to household type 1, 2, 3, and 3 dots. Layer 3. Household type 1 leads to lighting, cooling, entertainment, 3 dots, and cooking. Layer 4. L E D, fridge, freezer, T V, 3 dots, and stove. Each household type leads to respective components.

Residential sector sub-structures

Figure 3.3 shows an example of the model structure of the industry sector. In the second layer are different industries—the OECM uses the GICS classification system for industry sub-sectors. The quantity of energy for each of the sub-sectors is driven by either GDP or the projected quantity of product, such as the tonnes of steel produced per year. The market shares of specific manufacturing processes are defined, and each process has a specific energy intensity for electricity, (process) heat, and/or fuels.

Fig. 3.3
A flow diagram with parts. Industry. Steel, chemical, cement, X plus 1 industries. Quantity in dollars or tons per year. Market shares of primary and secondary steel. Energy intensity calculated by electric process, and process heat at 4 temperature levels. Energy demand measured in T W h per year or P J per year.

Calculation of the industry energy demand

Figure 3.4 shows the structure for the transport sector. Again, the demand is driven by ‘non-energy’ factors, such as passenger-kilometres and freight-kilometres, and energy-related factors, such as the transport mode and the energy intensity of the different vehicle options.

Fig. 3.4
Two charts with transport demand and transport service. Transport demand has cause, leading to effect, that leads to demand change impact factors. Examples of passenger and freight are given below. Transport service has factors for service choice, leading to transport mode, pointing to energy and emissions.

Calculation of transport energy demand

1.2.3 Demand Module Architecture in MATLAB

The demand module is implemented in MATLAB, a widely used programming language for mathematics and science computing. MATLAB allows the integration of a range of tools and databases and has the flexibility to add and develop new functions. Specifically, the model has been developed using an object-oriented programming approach, allowing extensibility and modularity.

Figure 3.5 shows the demand module developed in MATLAB. The demand module encompasses eight classes: (1) demand class, (2) household class, (3) household application class, (4) sub-sector, (5) industry class, (6) industry application class, (7) transport modes, and (8) vehicles class (Fig. 3.6).

  • Demand class: This is the main class, which describes the residential, industry, and transport sectors, which are defined by household type, sub-sector, and transport mode classes, respectively. The attributes that define this class include a range of years, energy consumption forms, energy levels, list of sectors, household types, sub-sectors, applications, appliances, and vehicles. The demand class also has two main types of methods: (i) calculation demand methods and (ii) printing results methods. The calculation methods use equations and algorithms to calculate and find the demand. For example, the ‘Find Demand’ method can be used to find a wide range of calculations and outputs, e.g. the electricity demand of a group of households for a specified year. The calculation method can calculate the demand for single or aggregated sectors, sub-sectors, or applications, for a single year or a range of years, unique or multiple forms of energy consumption, and single or various types of vehicle categories. The printing result methods can be used to export the results into an Excel spreadsheet or to plot the results using the MATLAB interface. Therefore, the outputs of the demand module can be either predefined graphs, tables, or data for a standardized report. See Table 3.1 for a brief description of each method in this class.

  • Household and appliance classes: These classes are used to define the residential sector. The appliance objects are embedded within the household-type objects. Attributes include names, sectors, and regions, which are defined as string inputs (i.e. text or character inputs) or numerical inputs, which are defined as int (i.e. integers) or double (i.e. numeric variables holding numbers with decimal points). Attributes can also include arrays of strings or double values. Array variables are helpful in input time series data, such as load profiles. Because households and appliances have their own classes, this architecture is flexible and allows the addition of households with different attributes and different types of appliances.

  • Sub-sector and industry application classes: These classes are used to define the industry sector. The industry application objects are embedded within the sub-sector objects. As shown in Fig. 3.7, these classes have their own lists of attributes. Therefore, the module developed can accommodate different types of sub-sectors (e.g. steel, cement, etc.) and incorporate various types of applications under each sub-sector.

  • Transport modes and vehicle classes: These classes are used to define the transport sector. The vehicle objects are embedded within the transport mode objects. Therefore, multiple types of transport modes can be defined, such as aviation and navigation, as well as various types of vehicles, such as planes and cruise ships.

Fig. 3.5
A diagram of the demand module in MATLAB, with its classes, attributes, methods, and associations. Household type and appliance, subsectors and industry application, vehicle and transport mode, all connect under demand.

A unified modelling language (UML) diagram of the demand module in MATLAB, showing its classes, attributes, methods, and associations

Fig. 3.6
A class titled Household underscore type. It contains, name, sector, year, region, appliances, share, people h type, load profile weekdays, and load profile weekends.

An example of a household type object, showing the assigned attributes

Table 3.1 Methods within the demand class
Fig. 3.7
A class titled Industry underscore application. It contains, name, sector, subsector, region, year, energy consumption form, energy level, material based production, primary energy intensity, share factors, conversion factors, efficiency, load profiles.

An example of an industry application object, showing the assigned attributes

Figures 3.6 and 3.7 show the high-level class definitions for residential and industry sub-sector objects, respectively. The blue-marked text indicates the defined value for each attribute. For example, one household object with five residents is defined by the name ‘Rural–Phase 1’ and has a list of 35 appliance objects, defined with a string array. It is assigned a share factor for 2020 of 0.2, which means that 20% of the households in that specific region and year are defined by this type of household and its attributes. Furthermore, 24 h load profiles are defined for each application for every day, with numerical arrays. For example, weekend load profiles have a size of 35 rows and 48 columns, representing 35 applications and 24 time slots for each weekend day.

The object-oriented architecture allows all these input attributes to be updated or modified easily. These attributes can also be read from a predefined Excel spreadsheet. This facilitates a data input process that follows the array structure, such as the load profile.

Figure 3.7 shows an example of an industrial application object that belongs to the sub-sector iron and steel. In this case, the energy consumption form is defined as electricity and heat, which means that it considers the electrical and heat demand. The ‘share factors’ represent the portions of the demand assigned to electricity and heat. The energy-level array also allows the predefined network to which the application is connected to be defined, as well as the temperature levels. In this particular case, the demand is defined based on the total annual primary energy intensity and the material-based production, which are 1184 GJ/tonnes and 1178 Mt, respectively, for the specified region and year. The input and output units must be predefined when the MATLAB modules are initialized. Other attributes that can be assigned are conversion factors, such as from primary energy to the final energy via an efficiency factor.

Additional attributes and methods can be defined for each class if required and the data are available. Therefore, the demand module class can be extended by defining new classes, attributes, and methods.

1.3 The OECM Supply Module

The supply module consists of three main elements: supply technologies, storage technologies, and the infrastructure for the power supply (capacities of power lines). For the generation of electricity and heat, the programme considers all the technologies of the energy market, from both renewable and non-renewable sources. In addition to the generation of pure electricity and heat, the entire range of combined heat and power systems is included.

Storage technologies include batteries and the use of hydrogen from electrolysers. The calculation of heat storage is possible, but has not yet been used in the OECM scenarios.

A dispatch strategy is defined for electricity and heat generation that reflects market and policy factors. Whether electricity from photovoltaics and onshore and offshore wind turbines have priority dispatch ahead of fossil-fuel power plants and how storage systems are used can be determined. Each technology has a specific conversion efficiency.

Heat generation technologies are also defined by the temperature levels they can provide. For example, residential solar collectors can only supply low-temperature heat and will therefore not be considered for high-temperature process heat (Table 3.2).

Table 3.2 Example of generation and storage technologies

The regional energy demand—as defined in the previous section—can be met by neighbouring regions, with importation from or, in the case of oversupply, exportation to them. The extent to which electricity can be imported or exported from one region to another is defined by the capacity of regional interconnections, which represent the available power line capacities.

1.3.1 The OECM Dispatch Module

The methodology of the dispatch module of the MATLAB-based OECM is based on the previous version of the model (Teske et al. 2019a). The key inputs are related to the supply technologies, storage types, dispatch strategy, and the interconnections among regions for possible power exchange (Table 3.3). Different supply technologies can be selected, each with its technical characteristics, including its efficiency, available installed capacity, fuel type, and regional meteorological data (solar radiation or wind speed). Meteorological data define the capacity factors of solar and wind energy generators as their levels of availability at 1-h resolution for an entire year (Table 3.4).

Table 3.3 Input parameters for the dispatch model
Table 3.4 Output parameters for the dispatch model

The supply technologies can be either dispatchable (e.g. gas power plants) or non-dispatchable (e.g. solar photovoltaic without storage). The model allows the order in which the supply technologies and storage functions are utilized to be adjusted to satisfy the demand. However, storage and interconnections cannot be selected as the first elements of supply (Table 3.5).

Table 3.5 Technology groups for the selection of dispatch order

Tables 3.6, 3.7, and 3.8 provide an overview of the possible supply technologies and examples of different dispatch scenarios. Although concentrated solar power (CSP) plants with storage are dispatchable to some extent—depending on the storage size and the available solar radiation—they are part of the renewable variable group in the MATLAB model. Although the model allows the dispatch order to be changed, the 100% renewable energy analysis always follows the same dispatch logic. The model identifies excess renewable production, which is defined as any potential wind and solar photovoltaic generation greater than the actual hourly demand in MW during a specific hour. To avoid curtailment, the surplus renewable electricity must be stored with some form of electric storage technology or exported to a different cluster or region. Within the model, the excess renewable production accumulates through the dispatch order. If storage is present, it will charge the storage within the limits of the input capacity. If no storage is present, this potential excess renewable production is reported as ‘potential curtailment’ (pre-storage) (Table 3.9).

Table 3.6 Technology options—variable renewable energy
Table 3.7 Technology options—dispatch generation
Table 3.8 Technology options—storage technologies
Table 3.9 Dispatch module—inputs, intermediate outputs, and outputs

Limitations: It is important to note that calculating the possible interconnection capacities for transmission grids between subregions does not replace technical grid simulations. Grid services, such as the inductive power supply, frequency control, and stability, should be analysed, although this is beyond the scope of the OECM analysis. The results of [R]E 24/7 provide a first rough estimate of whether increased use of storage or increased interconnection capacities or a mix of both will reduce systems costs.

1.3.2 Regional Interconnections

Interconnection capacities are set as a function of the total generation capacity within a cluster. Interconnections between defined regions are the only ones considered, and all intra-regional interconnections or line constraints are excluded. Therefore, a region is considered a ‘copper plate’—and a transmission system where electricity can flow unconstrained from any generation site to any demand site is found in most energy modelling tools (Avrin, 2016). This simplification is required to achieve a short calculation time while maintaining high technical and time resolution. The algorithm devised for the function of the interconnectors is based on the following information for each region:

  • Unmet load in the region

  • Excess generation in other regions

  • Interconnection capacity between the undersupplied region and each of the other regions

  • Priority of the closest region(s) in exporting power to the undersupplied region

The excess generation capacity and unmet load are calculated by running the model without the interconnections to determine the excess or shortfall in generation when the load within the region is met. These excesses and shortfalls are calculated at the point in the dispatch cascade at which the interconnectors provide or consume power, for example, after the variable renewables and dispatchable generators and before the storage technologies.

The interconnection capacity between regions is defined based on a percentage of the maximum regional load. The capacity is defined in a matrix, both to and from each region to every other region. A priority order for each region to every other region is given based on proximity, so that if a region has an unmet load, it will be served sequentially with the excess generation of loads in other regions in their defined order of proximity.

For every hour and every region in each cluster (a cluster is a group of regions), the possible interconnections required for the importation or exportation of energy to balance the load are calculated. Each region is considered in turn, and the algorithm attempts to meet the unmet load with excess generation by other regions, keeping track of the residual excess loads and the interconnector capacities. Each region’s internal load is met first, before its generation resources are considered for other interconnected regions.

For regions sending generation capacity to other regions, the interconnector element behaves as an increase in load, whereas for regions accepting power from neighbouring regions, the interconnector element behaves as an additional generator, from the model’s perspective.

Once the total inflow and outflow of the interconnectors are calculated, the hourly values for the total supply in each region are updated, together with any residual deficit in supply or any curtailed (= forced to shut down) electricity generator that does not have priority dispatch.

Similar to the supply technologies, different storage technologies (electrical, thermal, or hydrogen) can be defined and selected, together with their technical characteristics, such as their round-trip efficiency, new or installed capacities in each year of the modelled period, lifetime, maximum depth of discharge, maximum energy out in a time step, and costs. When the total energy delivered by the supply technologies in a region does not meet the demand, energy is discharged from storage (if the storage technology has energy available), following the constraints of the storage operation (maximum energy out per time step, maximum depth of discharge, maximum depth of charge, state of charge) and the order of operation for the defined storage technologies. In the case of a demand deficit after storage, electricity from other regions will be imported. When there is surplus energy generation, the surplus will charge any storage appliances (if available), also according to the same constraints of energy storage operation and sequential order.

1.3.3 Supply Module Architecture in MATLAB

Analogous to the demand module, inputs can be made directly into the supply module via MATLAB or a standardized Excel sheet. The supply module in MATLAB is also based on an object-oriented structure, in which classes and the objects belonging to those classes are built based on attributes and methods.

Figure 3.8 shows the UML class diagram for the supply module developed in MATLAB. Specifically, the supply module has three main classes:

  1. 1.

    Supply class: This is the main class and it is built on the supply and storage technology objects. Attributes that describe the supply class include years, region, energy supply form, fuel, and generation and storage technologies. The supply class has two main types of methods: (i) calculation supply methods and (ii) printing result methods. The calculation methods implement equations and algorithms to calculate the dispatch and fuel consumption. Table 3.10 presents a brief description of each method.

  2. 2.

    Supply technology class: This class is used to define supply technologies. Attributes include name, type, efficiency, year, region, and energy supply form and are defined as text inputs. Additional attributes are defined as numerical inputs, such as lifetime, cost, and capacity factors. The structure adopted allows the addition of new attributes if required. This class has methods that are used by the main supply class to calculate the primary fuel, emissions, or installed capacity of a specific technology.

  3. 3.

    Storage technology class: This class is used to define storage technologies. The attributes include name, type, efficiency, year, region, and energy storage form and are defined as text inputs. Other numerical attributes include charging and discharging rates, capacity, cost factors, and state of charge.

Fig. 3.8
A diagram of the supply module in MATLAB supply technology and storage technology with its classes, attributes, methods, and associations.

A UML diagram of the supply module in MATLAB, showing its classes, attributes, methods, and associations

Table 3.10 Methods within the supply class

Figures 3.9 and 3.10 show the high-level class definitions for supply technologies and storage objects, respectively. The text in blue indicates the defined value for each attribute. For example, the supply technology object in Fig. 3.9 has the name ‘coal power plant’, its input energy is defined as hard coal, and the object is associated with the electricity energy form. The attributes in Fig. 3.9 consider the year 2020 and a global scenario. For example, the existing capacity is defined as 989.5 GW and the decommissioned capacity is 23 GW. The lifetime of this object is 35 years.

Fig. 3.9
A supply module architecture in MATLAB with the assigned attributes. It is titled supply underscore technology and has 2 columns of attributes.

An example of a supply technology object, showing the assigned attributes

Fig. 3.10
A storage module architecture in MATLAB with the assigned attributes. It is titled storage underscore technology and has 2 columns of attributes.

An example of a storage technology object, showing the assigned attributes

An example of a storage object is shown in Fig. 3.10. The attributes of this object include text inputs, such as its name ‘battery lithium’ and its type ‘electrical’. This object has numerical attributes such as the efficiency (equal to 0.95 for this object) and the charging and discharging rates (fixed at 5 kW). Note that the units for each attribute are defined when the module is initialized in MATLAB.

The supply module architecture developed is flexible to accommodate different types of supply and storage technologies. Additional attributes or methods can be easily added to the model.

1.4 Databases and Model Calibration

The OECM model uses several databases for energy statistics, energy intensities, technology market shares, and other market or socio-economic parameters. The calculation of the energy balance for the base year is based on the International Energy Agency (IEA) Advanced World Energy Balances (IEA, 2020, 2021).

The energy statistics for a calculated country and/or region are uploaded via an interface module. The data for each year from 2005 onwards until the last year for which data are available are used to calibrate the model. This process is based on the energy system model (EM), developed by the German Aerospace Center DLR, and is implemented in the energy simulation platform Mesap/PlaNet (Schlenzig, 1999; Seven2one, 2012). The market shares are calculated based on the IEA statistics and a technical database for energy intensities for various appliances and applications across all sectors. These data are input and the calibration processes performed with a standardized Excel tool. The calibration method is briefly outlined below using the transport sector.

To calibrate the model, the transport demand of the past decade is recalculated on the basis of the available energy statistics. The IEA’s Advanced World Energy Balances provides the total final energy demand by transport mode—aviation, navigation, rail, or road—by country, by region, or globally. However, it provides no further specification of the energy use within each of the transport modes. Therefore, further division into passenger and freight transport is calculated using percentage shares. These proportions are determined with a literature search, together with the average energy intensity for each of the transport modes for passenger and freight vehicles.

The annual transport demand in passenger-kilometres per year (pkm/year) or tonne-kilometres per year (tkm/year) is calculated as the annual energy demand divided by the average energy intensity by mode. These results are then compared with the Organisation for Economic Co-operation and Development (OECD) transport statistics, which provide both parameters——pkm/year and tkm/year. Calibrating the model on the basis of historical data ensures that the basis of the scenario projections for the coming years and decades is correctly mapped and ensures that the changes are calculated most realistically (Tables 3.11 and 3.12).

Table 3.11 Calibration for calculating the transport demand
Table 3.12 Projection of transport demand based on the changing demand in kilometres

For the forward projection of the transport demand, the calculation method is reversed: the transport demand for each transport mode is calculated on the basis of the annual change, as a percentage. The calculated total annual pkm and tkm are the inputs for the energy demand calculation.

This methodology for calibration and projection is used across all sectors.

The developed MATLAB tool can access online data and databases through available application programming interfaces (APIs). For example, the API for the World Bank Indicators provides access to nearly 16,000 time series indicators, including population estimates and projections (World Bank, 2021). Likewise, the OECD provides access to datasets through an API. This allows a developer to easily call the API and access data using the code lines in MATLAB.

1.5 Sectors and Sub-sectors

The OECM was designed to calculate energy pathways for geographic regions, as documented in Chap. 2. The OECM was developed further to meet the requirements of the financial industry and to design energy and emission pathways for clearly defined industry sectors (sectorial pathways). The finance industry uses different classification systems to describe sub-areas of certain branches of industry. The most important system is the Global Industry Classification Standard (GICS; MSCI, 2020). However, the GICS sub-sectors do not match the IEA statistical breakdown of the energy demands of certain industries. Table 3.13 shows examples of the finance sector calculated with the OECM model, the GICS codes, and the statistical information used. Although the OECM model allows all the GICS code sub-sectors to be calculated, the availability of statistics is the factor limiting the resolution of the sectorial pathways. For example, the statistical data for the textile and leather industry are stored in the IEA database, but the database does not separate the two industries further.

Table 3.13 Examples of industry sub-sectors based on the Global Industry Classification Standard (GICS)

1.6 Cost Calculation

The costs linked to the energy supply in each year of the modelled period include the investment costs related to ‘new capacities’ for technologies and storage (including replacement or decommissioning, based on the assumed technical lifetime = vintaging), operation and maintenance (O&M) costs as a percentage of the total installed capacities, and fuel costs. Other inputs for each technology and storage type include the capital cost per unit ($/kW), O&M costs as a percentage of the capital cost, and unit fuel cost ($/GJ).

Therefore, for each technology or storage type:

  • It is assumed that the change in ‘installed capacity’ between each of the years modelled is linear and a linear interpolation between these is considered.

  • The ‘installed capacities’ and ‘new capacities’ are interrelated (one depends upon the other) in each of the modelled and interpolated years, based on the cumulative capacities in the calculated year and the assumed technology lifetime.

  • The capital costs per unit and the fuel costs in each of the modelled years are also interpolated linearly between the modelled years. Therefore, if a scenario is calculated in 5-year steps, e.g. the development from 2025 and 2030, the years 2026 to 2029 are calculated as a linear interpolation.

  • Replacement capacities, if required, are also included in each year as part of the investment costs.

  • The O&M costs in each of the interpolated years are calculated based on the interpolated installed capacities and the annual O&M input costs (as a percentage of the capital cost).

  • Annual fuel costs for non-renewable technologies are calculated based on their output energy (running time) and interpolated fuel costs.

  • The resulting ‘specific costs’ ($/kWh) are also calculated from the interpolated energy supplied in each year.

The total specific costs ($/kWh) of a scenario, as practically distributed over the interpolated years, allow the incurred costs for a scenario to be determined. Limitations: The economic model does not consider the change in the value of money over time. Each year of the modelled period is regarded as if it were the present year, with the multiple costs incurred. Future additions to the model could include the net present costs and the contemporary value of money.

1.7 OECM 2.0 Output and Area of Use

The added value of OECM 2.0 is its high resolution of the sector-specific parameters for both demand and supply, which are required as key performance indicators (KPIs) by the finance industry. Table 3.14 provides an overview of the main parameters and the areas of their use, with a focus on the needs of institutional investors.

Table 3.14 Energy-related key performance indicators (KPIs) for net-zero target setting, calculated with OECM 2.0

Commodities and GDP are the main drivers of the energy demand for industries. The projection of, for example, the global steel demand in tonnes per year over the next decades is discussed with the industry and/or client. The OECM 2.0 can calculate either a single specific sector only or a whole set of sectors. For the development of global scenarios, various industry projections are combined to estimate both the total energy supply required and the potential energy-related emissions. Therefore, a global carbon budget can be broken down into carbon budgets of specific industries.

Energy intensities are both input data for the base year and a KPI for future projections. The effect of a targeted reduction in the energy intensity in a given year and the resulting energy demand and carbon emissions can be calculated, for example, for the steel industry.

All sector demands are supplied by the same energy supply structure in terms of electricity, process heat (for each level), and total final energy. Finally, specific emissions, such as CO2 per tonne of steel or per cubic metre of wastewater treatment, are calculated and can be used to set industry targets.

All input and output OECM data are available as MATLAB-based tables or graphs or as standard Excel-based reports.

1.8 Further Research Demand

Industry-specific energy intensities and energy demands are not available for a variety of industries. In particular, the energy intensities for sub-sectors of the chemical industry are either totally unavailable or confidential. A database of energy intensities is required to develop more detailed scenarios. Although energy intensities can be estimated based on the available data, the input parameters are usually derived from various sources, which may not follow the same methodology. Energy intensities based on GDP, for example, are calculated with either nominal GDP, real GDP, or purchasing power parity GDP. Furthermore, energy intensities can be provided as final energy or primary energy. In some cases, this information is not available at all. A database of industry-specific energy demands and energy intensities, with a consistent methodology, is required to improve the accuracy of calculations in future research.

To capture the complexity of regional and global building demand projection, both in terms of data availability and high technical resolution, the high-efficiency building (HEB) model was used to develop four bottom-up demand scenarios. The HEB was developed by the Central European University (CEU) of Budapest under the scientific leadership of Prof. Dr. Diana Uerge-Vorsatz. The following section documents the methodology of the HEB based on the paper by Chatterjee, S.; Kiss, B.; and Ürge-Vorsatz, D. (2021). The results are documented in Sects. 7.1 and 7.2.

2 The High-Efficiency Building (HEB) Model

Modelling the energy demand for buildings is a complex task because the building sector-related energy demand depends on several factors, such as spatial resolution, temporal resolution, building physics, and the different technologies of building construction (Prieto et al., 2019; Chatterjee & Ürge-Vorstaz, 2020). The majority of demand models do not incorporate these factors and therefore provide insights into the future energy demand scenarios of the building sector that can be far from realistic (Prieto et al., 2019; Chatterjee & Ürge-Vorstaz, 2020). Therefore, in this study, we use the HEB model to understand the future energy demand potentials for building in key regions across the globe.

The HEB model was originally developed in 2012 to calculate the energy demand and CO2 emissions of the residential and tertiary building sectors until 2050 under three different scenarios (Ürge-Vorsatz & Tirado Herrero, 2012). Since then, the model has been developed and updated several times. With the latest update, the model calculates the energy demand under four scenarios until 2060 based on the most recent data for macroeconomic indicators and technological development. This model is novel in its methodology compared with earlier global energy analyses and reflects an emerging paradigm—the performance-oriented approach to the energy analysis of buildings. Unlike component-oriented methods, a systemic perspective is taken: the performance of whole systems (e.g. whole buildings) is studied, and these performance values are used as the input in the scenarios. The model calculates the overall energy performance levels of buildings, regardless of the measures applied to achieve them. It also captures the diversity of solutions required in each region by including region-specific assumptions about advanced and suboptimal technology mixes. The elaborated model uses a bottom-up approach, because it includes rather detailed technological information for one sector of the economy. However, it also exploits certain macroeconomic (GDP) and socio-demographic data (population, urbanization rate, floor area per capita, etc.). The key output of the HEB model is floor area projections for different types of residential and tertiary buildings in different regions and their member states, the total energy consumption of residential and tertiary buildings, the energy consumption for heating and cooling, the energy consumption for hot water energy, the total CO2 emissions, the CO2 emissions for heating and cooling, and the CO2 emissions for hot water energy.

2.1 The High-Efficiency Building Model Methodology

The HEB model conducts a scenario analysis for the entire building sector, in which the building sector is distinguished by location (rural, urban, and slum), building type (single-family, multifamily, commercial, and public buildings, with subcategories), and building vintage (existing, new, advanced new, retrofitted, and advanced retrofitted). This detailed classification of buildings is undertaken for 11 regions (Ürge-Vorsatz & Tirado Herrero, 2012), extended with country-specific results for the EU-27 countries, China, India, and the USA. Furthermore, within each region, different climate zones are considered to capture the differences in building energy uses and renewable energy generation caused by variations in climate. The climate zones are calculated based on four key climatic factors—heating degree days (HDD), cooling degree days (CDD), relative humidity (RH) in the warmest month, and average temperature in the warmest month (T). These parameters are processed using the GIS5 tool—spatial analysis—and performed with the ArcGIS software. The detailed classification categories are summarized in Table 3.2.

The purpose of the detailed classification of building categories and scenario assessments is to explore the consequences of certain policy directions or decisions that inform policy-making (Table 3.15).

Table 3.15 Building classification scheme of HEB

The key input data used in the HEB are region-specific forecasts of GDP, population, rate of urbanization, and the proportion of the population living in urban slums. The time resolution of the model is yearly, so that socio-economic input data can be easily obtained from various credible sources, such as the databases of the World Bank, United Nations Development Programme (UNDP), EUROSTAT, and the OECD. Besides these socio-economic parameters, many others are included, and in the case of data absence, assumptions are made in the HEB model to calculate the final energy demand. Figure 3.11 shows the main workflow of the HEB model.

Fig. 3.11
The main workflow of the H E B. Input data and parameters lead to yearly dynamics of retrofit and new floor area, through some intermediary step. With data and parameters, results and calculations for the floor area, energy consumption and emission and obtained.

The main workflow of the HEB. Input data and parameters can be modified by the user (green). Main outputs are the floor areas of different building vintage types and the energy consumption and CO2 emissions of the stock (blue)

The HEB model includes several calculation steps, from considering the input data to obtaining the final output. Each of these calculation steps is discussed in the sections below.

2.2 Disaggregation

In the first step of the calculation, after all the socio-economic input data are obtained, the input is disaggregated into the detailed building classification scheme, and the total floor area required to satisfy the year-specific population and GDP needs (the year is denoted with Y in subscript) is determined. The core concept for calculating the floor area differs for residential and commercial buildings:

  • For residential buildings, the total occupied floor area correlates with the population, and thus, population forecasts are used to determine the floor areas of buildings in each region.

  • For commercial and public buildings, the floor area correlates more strongly with GDP, so GDP forecasts are used as a proxy to determine the total floor space areas of commercial and public buildings.

The region-specific population data—as the input for the calculation—is further disaggregated into urban and rural populations based on the urbanization rate and into the different climate zones based on GIS data:

$$ {P}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{Y}}={P}_{\mathrm{r},\mathrm{Y}}\times {U}_{\mathrm{r},\mathrm{Y}}\times {Sc}_{\mathrm{r},\mathrm{c}}\kern1em \mathrm{if}\ \mathrm{u}=\mathrm{urban} $$
(3.1)

and

$$ {P}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{Y}}={P}_{\mathrm{r},\mathrm{Y}}\times \left(1-{U}_{\mathrm{r},\mathrm{Y}}\right)\times {Sc}_{\mathrm{r},\mathrm{c}}\kern1em \mathrm{if}\ \mathrm{r}=\mathrm{rural} $$
(3.2)

where

  • Pr, c, u, Y [capita] is the total urban/rural population of region r and climate zone c in year Y

  • Pr, Y [capita] is the total population of region r in year Y

  • Ur, Y [−] is the urbanization rate of region r in year Y

  • Scr, c [%] is the share of the population within region r living in climate zone c

The urban population is then further disaggregated into the population living in slums (in regions where a significant number of people do not have access to standard living conditions) and the population living in conventional residential buildings. The latter group is split into the populations living in single-family and multifamily houses based on region-specific fixed values:

$$ {P}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{b},\mathrm{Y}}={P}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{Y}}\times {Ss}_{\mathrm{r},\mathrm{Y}}\kern1em \mathrm{where}\ \mathrm{u}=\mathrm{urban}\ \mathrm{and}\ \mathrm{b}=\mathrm{slum} $$
(3.3)

and

$$ {P}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{b},\mathrm{Y}}={P}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{Y}}\times \left(1-{Ss}_{\mathrm{r},\mathrm{Y}}\right)\kern1em \mathrm{where}\ \mathrm{u}=\mathrm{urban}\ \mathrm{and}\ \mathrm{b}=\mathrm{residential} $$
(3.4)

then

$$ {P}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{b},\mathrm{t},\mathrm{Y}}={P}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{b},\mathrm{Y}}\times {Ssf}_{\mathrm{r}}\kern1em \mathrm{where}\ \mathrm{u}=\mathrm{urban},\mathrm{b}=\mathrm{residential}\ \mathrm{and}\ \mathrm{t}= SF $$
(3.5)

and

$$ {P}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{b},\mathrm{t},\mathrm{Y}}={P}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{Y}}\times \left(1-{Ssf}_{\mathrm{r}}\right)\kern1em \mathrm{where}\ \mathrm{u}=\mathrm{urban},\mathrm{b}=\mathrm{residential}\ \mathrm{and}\ \mathrm{t}= MF $$
(3.6)

where

  • Pr, c, u, b, Y [capita] is the total urban/rural population of region r, climate zone c, and building category b in year Y.

  • Pr, c, u, Y [capita] is the total urban/rural population of region r and climate zone c in year Y.

  • Pr, c, u, b, t, Y [capita] is the total urban/rural population of region r, climate zone c, building category b, and building type t in year Y.

  • Ssr, Y [%] is the share of the urban population living in slums in region r and year Y.

  • Ssfr [%] is the share of the urban population living in single-family houses in region r.

The population living in rural areas is assumed to live in single-family houses.

The disaggregation of GDP follows the same pattern, except that the share of GDP that can be associated with rural commercial or public buildings is fixed within the modelling period:

$$ {GDP}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{Y}}={GDP}_{\mathrm{r},\mathrm{Y}}\times \left(1-{U}_{\mathrm{r},\mathrm{Y}}\right)\times {Sc}_{\mathrm{r},\mathrm{c}}\kern1em \mathrm{if}\ \mathrm{u}=\mathrm{urban} $$
(3.7)

and

$$ {GDP}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{Y}}={GDP}_{\mathrm{r},\mathrm{Y}}\times {U}_{\mathrm{r},\mathrm{Y}}\times {Sc}_{\mathrm{r},\mathrm{c}}\kern1em \mathrm{if}\ \mathrm{u}=\mathrm{rural} $$
(3.8)

where

  • GDPr, c, u, Y [USD] is the total GDP that can be associated with urban/rural commercial or public buildings in region r and climate zone c in year Y.

  • GDPr, Y [USD] is the total GDP of region r in year Y.

  • Ur, Y [−] is the urbanization rate of region r in year Y.

  • Scr, c [%] is the share of climate zone c within region r.

The share of different commercial building types is also determined with fixed ratios based on data from the literature:

$$ {GDP}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{t},\mathrm{Y}}={GDP}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{Y}}\times {Scp}_{\mathrm{t}} $$
(3.9)

where

  • GDPr, c, u, t, Y [USD] is the total GDP that can be associated with urban/rural commercial or public buildings of type t in region r and climate zone c in year Y.

  • GDPr, c, u, Y [USD] is the total GDP that can be associated with urban/rural commercial or public buildings in region r and climate zone c in year Y.

  • Scpt [%] is the share of commercial and public buildings of type t in the commercial and public building stock.

2.3 Determining the Total Floor Area

Different equations are used for the calculation of the floor area of residential buildings and non-residential buildings. The floor area of residential buildings can be calculated with the following equation, using specific floor area values (the floor area that is occupied by one person):

$$ {TFA}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{b},\mathrm{t},\mathrm{Y}}={\mathrm{P}}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{b},\mathrm{t},\mathrm{Y}}\times {\mathrm{SFAc}}_{\mathrm{r},\mathrm{u},\mathrm{b},\mathrm{t},\mathrm{Y}}\kern1em \mathrm{where}\ \mathrm{b}=\mathrm{residential}/\mathrm{slum} $$
(3.10)

where

  • TFAr, c, u, b, t, Y [m2] is the total urban/rural floor area of building category b and building type t in region r and climate zone c in year Y.

  • Pr, c, u, b, t, Y [capita] is the total urban/rural population of region r, climate zone c, building category b, and building type t in year Y.

  • SFAcr, u, b, t, Y [m2/capita] is the specific floor area of building category b and building type t in region r in year Y.

Similarly, the floor area of commercial and public buildings is calculated using specific floor area values (the floor area that is required to produce one unit of GDP):

$$ {TFA}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{b},\mathrm{t},\mathrm{Y}}={GDP}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{t},\mathrm{Y}}\times {\mathrm{SFAg}}_{\mathrm{r},\mathrm{b},\mathrm{Y}}\kern1em \mathrm{if}\ \mathrm{b}=\mathrm{C}\&\mathrm{P} $$
(3.11)

where

  • TFAr, c, u, b, t, Y [m2] is the total urban/rural floor area of commercial or public buildings of building type t in region r and climate zone c in year Y.

  • GDPr, c, u, t, Y [USD] is the total GDP that can be associated with urban/rural commercial or public buildings of type t in region r and climate zone c in year Y.

  • SFAgr, b, Y [m2/USD] is the specific floor area of commercial or public buildings in region r in year Y.

Specific floor area values are determined from statistical data for each region. To take socio-economic development into account, the floor area per capita and the floor area per GDP are modelled as values that change yearly, reaching the average for OECD countries by the end of the modelling period in developing regions.

2.4 Yearly Dynamics of Floor Area Changes

The yearly dynamics of this floor area model transition the existing building stock into the future state determined by the scenarios. This includes the retrofitting or demolition of existing buildings, as well as the introduction of new buildings to the stock. In some cases, the floor area is left abandoned, which might result from a reduction in the population (e.g. in developed regions) or an increased rate of urbanization due to which buildings located in rural areas are abandoned after a certain time. It is important to capture this phenomenon, because abandoned buildings do not contribute to energy consumption or the emissions of the building stock. This yearly dynamic of the vintage types of buildings is presented in Fig. 3.12.

Fig. 3.12
A diagram of the yearly floor area dynamics in the H E B mode. It has existing floor area on the left and required floor area on the right with details and statuses marked in between, with connections.

Yearly floor area dynamics in the HEB model

The demolished floor area is calculated with region-specific demolition rates. After the demolished floor area is subtracted from the existing total, the remaining existing floor area is classified into different building vintages. Similarly, the retrofitted floor area is calculated by applying the yearly changing region-specific retrofitting rate to the total existing building stock. The retrofitted floor area is further classified into two types: advanced retrofitted floor area and normal retrofitted floor area. For each of the regions, the shares of retrofitted and advanced retrofitted floor area differ, and the shares of advance retrofitted, advance new, and retrofitted floor areas also vary under different scenarios. The floor area from new constructions is classified into two building vintages: new and advanced new. Like the retrofitted floor area, the share of advanced new floor area also varies under different scenarios.

2.5 Calculating the Energy Consumption of Buildings

The energy consumption for heating and cooling depends on the floor area. Therefore, in the HEB model, energy consumption is calculated after the year-specific floor area is calculated. The key input required to calculate the energy consumption for heating and cooling is the average consumption data for heating and cooling, which are usually obtained from data reported in the literature, for each of the regions, climate zones, and building types, because different building vintages have different consumption requirements. Therefore, different vintage types are modelled by assuming different energy intensities (denoted with subscript v). The values also depend on the scenario (denoted with subscript s). Energy intensity is multiplied by the corresponding floor area to determine the energy consumption for heating and cooling the stock:

$$ {HCE}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{b},\mathrm{t},\mathrm{Y},\mathrm{v},\mathrm{s}}={TFA}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{b},\mathrm{t},\mathrm{Y}}\bullet {\mathrm{EUhc}}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{b},\mathrm{t},\mathrm{v},\mathrm{s}} $$
(3.12)

where

  • HCEr, c, u, b, t, Y, v, s [kWh/year] is the total energy demand for heating and cooling of buildings with vintage type v in scenario s, building type t in region r, and climate zone c in year Y.

  • TFAr, c, u, b, t, Y [m2] is the total urban/rural floor area of building category b and building type t in region r and climate zone c in year Y.

  • EIhcr, c, u, b, t, v, s [kWh/m2/year] is the heating and cooling energy intensity of buildings of vintage type v in scenario s and building type t in region r and climate zone c.

After the detailed energy consumption is calculated, the data can be summed to arrive at the region-specific, yearly aggregated results for a given scenario:

$$ \mathrm{Total}\ {\mathrm{Energy}}_{\mathrm{r},\mathrm{Y},\mathrm{s}}=\sum \limits_{\mathrm{c}}\sum \limits_{\mathrm{u}}\sum \limits_{\mathrm{b}}\sum \limits_{\mathrm{t}}\sum \limits_{\mathrm{v}}\mathrm{Total}\ {\mathrm{Energy}}_{\mathrm{r},\mathrm{c},\mathrm{u},\mathrm{b},\mathrm{t},\mathrm{Y},\mathrm{v},\mathrm{s}} $$
(3.13)

2.6 Implementation

The most recent version of the HEB model was developed in the Python programming language, using the PyData ecosystem to handle large datasets. This ecosystem ensures quite large flexibility among the modelling parameters, and the diversity of input data and its granularity can be properly handled. This model is not an open-access model, but the Central European University has received funding from the European Union’s Horizon 2020 research and innovation programme (under grant agreement no. 837089) in the SentinelFootnote 1 project, to develop HEB further. In this project, the HEB model will be made an open-source model that users can use without cost.