Background

Electricity is the pivot in upholding highly technologically advanced industrialisation in every economy [1,2,3]. Almost every activity done in this modern era hinges on electricity. The demand and usage of electric energy increase globally as the years past [4]; however, the process of generating, transmitting, and distributing electrical energy remains complicated and costly. Hence, effective grid management is an essential role in reducing the cost of energy production and increased in generating the capacity to meet the growing demand in electric energy [5].

Accordingly, effective grid management involves proper load demand planning, adequate maintenance schedule for generating, transmission and distribution lines, and efficient load distribution through the supply lines. Therefore, an accurate load forecasting will go a long way to maximise the efficiency of the planning process in the power generation industries [5, 6]. As a means to improve the accuracy of Electrical Energy Demand (EED) forecasting, several computational and statistical techniques have been applied to enhance forecast models [7].

EED forecasting techniques can be clustered into three (3), namely correlation, extrapolation, and a combination of both. The Extrapolation techniques (Trend analysis) involve fitting trend curves to primary historical data of electrical energy demand in a way to mirror the growth trend itself [7, 8]. Here, the future value of electricity demand is obtained from estimating the trend curve function at the preferred future point. Despite its simplicity, its results are very realistic in some instances [8].

On the other hand, correlation techniques (End-use and Economic models) involve relating the system load to several economic and demographic factors [7, 8]. Thus, the techniques ensure that the analysts capture the association existing between load increase patterns and other measurable factors. However, the disadvantage lies in the forecasting of economic and demographic factors, which is more complicated than the load forecast itself [7, 8]. Usually, economic and demographic factors such as population, building permits, heating, employment, ventilation, air conditioning system information, weather data, building structure, and business are used in correlation techniques [7,8,9]. Nevertheless, some researchers group EED forecasting models into two, viz. data-driven (artificial intelligence) methods (same as the extrapolation techniques) and engineering methods (same as correlation the techniques) [9]. All the same, no single method is accepted scientifically superior in all situations.

Also, proper planning and useful applications of electric load forecasting require particular “forecasting intervals,” also referred to as “lead time”. Based on the lead time, load forecasting can be grouped into four (4), namely: very short-term load forecasting (VSTLF), short-term load forecasting (STLF), medium-term load forecasting (MTLF) and long-term load forecasting (LTLF) [6, 7, 10]. The VSTLF is applicable in real-time control, and its predicting period is within minutes to 1 h ahead. The STLF is for making forecasting within 1 h to 7 days or month ahead [11]. It is usually used for the day-to-day operations of the utility industry, such as scheduling the generation and transmission of electric energy. The MTLF is used for forecasting of fuel purchase, maintenance, utility assessments. Its forecasting period ranges from 1 week to 1 year. While the LTLF is for making forecasting beyond a year to 20 years ahead, it is suitable for forecasting the construction of new generations, strategic planning, and changes in the electric energy supply and delivery system [10].

Notwithstanding the above-mentioned techniques and approaches available, EED forecasting is seen to be complicated and cannot easily be solved with simple mathematical formulas [2]. Also, Hong and Fan [12] pointed out that electric load forecasting has been a primary problem for the electric power industries, since the inception of the electric power. Regardless of the difficulty in electric load forecasting, the optimal and proficient economic set-up of electric power systems has continually occupied a vital position in the electric power industries [13]. This exercise permits the utility industries to examine the dynamic growth in load demand patterns to facilitate continuity planning for a better and accurate power system expansion. Consequently, inaccurate prediction leads to power shortage, which can lead to “dumsor” and unneeded development in the power system leading to unwanted expenditure [7, 14]. Besides, a robust EED forecasting is essential in developing countries having a low rate of electrification to facilitate a way for supporting the active development of the power systems [15].

Based on the sensitive nature of electricity demand forecasting in the power industries, there is a need for researchers and professionals to identify the challenges and opportunities in this area. Besides, as argued by Moher et al. [16], systematic reviews are the established reference for generating evidence in any research field for further studies. Our partial search of literature resulted in the following [10, 12, 17,18,19,20,21] papers that focused on comprehensive systematic review concerning the methods, models, and several methodologies used in electric load forecasting. Hammad et al. [10] compared forty-five (45) academic papers on electric load forecasting based on inputs, outputs, time frame, the scale of the project, and value. They revealed that despite the simplicity of regression models, they are mostly useful for long-term load forecasting compared with AI-based models such as ANN, Fuzzy logic, and SVM, which are appropriate for short-term forecasting.

Similarly, Hong and Fan [12] carried out a tutorial review of probabilistic EED forecasting. The paper focused on EED forecasting methodologies, special techniques, common misunderstandings and evaluation methods. Wang et al. [19] presented a comprehensive review of factors that affects EED forecasting, such as forecast model, evaluation metric, and input parameters. The paper reported that the commonly used evaluation metrics were the mean absolute error, MAPE, and RMSE. Likewise, Kuster et al. [22] presented a systematic review of 113 studies in electricity forecasting. The paper examined the timeframe, inputs, outputs, data sample size, scale, error type as criteria for comparing models aimed at identifying which model best suited for a case or scenario.

Also, Zhou et al. [17] presented a review of electric load classification in the smart-grid environment. The paper focused on the commonly used clustering techniques and well-known evaluation methods for EED forecasting. Another study in [21] presented a review of short-term EED forecasting based on artificial intelligence. Mele [20] presented an overview of the primary machine learning techniques used for furcating short-term EED. Gonzalez-Briones et al. [18] examined the critical machine learning models for EED forecasting using a 1-year dataset of a shoe store. Panda et al. [23] presented a comprehensive review of LTLF studies examining the various techniques and approaches adopted in LTLF.

The above-discussed works of literature show that two studies [20, 21] address a comprehensive review on STLF, [23] addresses forecasting models based on LTLF. The study in [24] was entirely dedicated to STLF. Only a fraction (10%) of above systematic review studies included STLF, MTLF and LTLF papers in their review; however, as argued in [10], the lead time (forecasting interval) is a factor that positively influences the performance of a chosen model for EED forecasting studies. Again, a high percentage of these studies [10, 12, 17,18,19,20,21,22, 24] concentrated on the methods (models), input parameter, and timeframe. Nevertheless, Wang et al. [19] revealed that the primary factors that influence EED forecasting models are property (characteristic) parameters of the building and weather parameters include. Besides, these parameters are territorial dependant and cultural bond. Thus, the weather pattern is not the same world-wide neither do we use the same building architecture and materials globally.

Notwithstanding, a higher percentage of previous systematic review studies overlooked the origin of studies and dataset of EED forecasting paper. Also, only a few studies [12, 17, 19] that examined the evaluation metrics used in EED forecasting models. However, as pointed out in [17], there is no single validity index that can correctly deal with any dataset and offer better performance always.

Despite all these review studies [10, 12, 17,18,19,20,21,22, 24] on electricity load forecasting, a comprehensive systematic review of electricity load forecasting that takes into account all possible factors, such as the forecasting load (commercial, residential and combined), the forecast model (conventional and AI), model evaluation metrics and forecasting type (STLF, MTLF, and LTLF) that influences EED forecast models is still an open gate for research. Hence, to fill in the gap, this study presents an extensive systematic review of state-of-the-art literature based on electrical energy demand forecasting. The current review is classified according to the forecasting load (commercial, residential, and combined), the forecast model (conventional, AI and hybrids), model evaluation metrics, and forecasting type (STLF, MTLF, and LTLF). The Preferred-Reporting Items for Systematic-Review and Meta-Analysis (PRISMA) flow diagram was adopted for this study based on its ability to advance the value and quality of the systematic review as compared with other guidelines [16, 25, 26]. The current study contributes to knowledge as follows:

  1. 1.

    A comprehensive and detailed assessment of previous state-of-the-art studies on electricity demand forecasting; based on used methods, timeframe, the train and test split of data, error, and accuracy metrics applied to forecast.

  2. 2.

    We present a concise summary of the useful characteristics of compared techniques in electric load forecasting.

  3. 3.

    We identified the challenges and opportunities for further studies in electric load forecasting.

The remaining sections of the current paper are structured as follows. “Methodology” section presents the methods and materials used in the current study. “Data collection” section presents the results and a detailed discussion of the outcomes, and “Study framework” section presents the summary of findings and direction for future studies.

Methodology

The current study presents a systematic review of pertinent literature on electrical energy forecasting.

Data collection

A total of eighty-one (81) state-of-the-art research works published in journals, conferences, and magazines, and student’s thesis relevant to the scope of the current study were downloaded from the internet, thus using keywords and terms which included AI, Electricity Prediction (EP), Energy Forecasting (EF), Machine Learning (ML), and combination of AI and EP, AI and EF, ML and EP, ML and EF. Each downloaded literature was then carefully studied and categorised into the two methods of electrical load forecasting data-driven (artificial intelligence) methods and engineering methods.

Study framework

According to Moher et al. [16], the quality of every systematic review is based on building protocol, which outlines the justification, hypothesis, and planned methods of the investigation. However, only a few systematic review study reports of their framework. A detailed, well-described structure for systematic reviews facilitates the understanding and evaluation of the methods adopted. Hence, the PRISMA model [26] was adopted in this study (Fig. 1). As shown in Fig. 1, the PRISMA presents the flow of information from one stage to another in a systematic review of the literature and gives the total number of the research identified, excluded, and included and the reasons for inclusion and exclusions.

Fig. 1
figure 1

Source: Moher et al. [26]

The adopted PRISMA flow diagram.

The PRISMA flow diagram involved five (5) phases, as shown in Fig. 1. Phase 1 consists of outlining the review scope, developing questions, and inclusion or exclusion. Phase 2 searches the literature with keywords to identify potential studies. Phase 3 includes determining the addition of a paper by screening its abstracts if it meets inclusion criteria. While phase 4 includes characterisation of paper for mapping by keywords. This review aimed to document an overview of research in the field of electric load forecasting to make way for future studies. As a result, a fifth (5) step offers an in-depth quantitative synthesis (meta-analysis) of studies included in the review.

Our search of literature retrieved one hundred and one (109) papers from online journals and eleven (11) from under sources, making one hundred and twelve (120) records in all (see Fig. 1). Of the 120 records, 21 were duplicates, hence, removed leaving ninety-one (99) record shortlisted for the screening stage. At the screening stage, fifteen (15) records were removed; thus, studies that were not related to electrical energy, and those that the primary publication language was not in English. Leaving eighty-four (84) records, of the 84 records, we further remove five (5) more records that were published before the year 2010, two (2) record omitted due to overlapping, this reduced eligible papers for analysis to seventy-seven (77). The 77 papers were used for the qualitative analysis. Ten (10) records that presented a review of electric load forecasting were also removed, and the remaining (67) were used for the quantitative analysis, as shown in Fig. 1.

Results and discussion

Electrical energy consumption can be classified as residential (domestic), commercial (non-residential), or industrial. Residential or domestic refers to the home or a dwelling where people globally live from day-to-day. At the same time, commercial consumers are the business and industries that require massive supply than residential users for their businesses [27]. Selected literature was on electric load forecasting was classified into two main categories AI methods and engineering methods. However, each category was further grouped into residential and commercial or combined (residential and commercial), and the outcomes are presented. A total of seventy-seven (77) papers were eligible for the qualitative analysis, while sixty-seven (67) were included in the quantitative analysis, as already discussed above in this study.

AI methods used in electrical energy demand forecasting

This section presents the studies that were based on AI techniques.

Combined (commercial and residential)

Most works on EED forecasting sought to forecast the total load (residential and commercial) demand on the supplying authority. This section presents the selected studies that fell in this category of electric load forecasting.

Hybrid models

In a way to harness the strength in different machine learning techniques, some researchers sought to hybrid two or more ML techniques to improve the forecasting accuracy of their models.

A short-term (next-day) EED forecasting model based on the historical meteorological parameter to forecast the future load on the Greek Electric Network Grid using Support Vector Machine (SVM), ensemble XGBoost, Random Forest (RF), k-Nearest Neighbours (KNN), Neural Networks (NN) and Decision Trees (DT) was proposed by [28]. The mean absolute percentage error (MAPE) was used as a performance metric for comparison among selected models by the author. The study achieved a reduction in prediction error or + 4.74% compared with the Operator of the Electricity Market in Greece predictions. In other studies [29], long-term (10 years) EED forecasting using NN and Autoregressive Integrated Moving Average (ARIMA) was proposed to forecast the EED of Kuwait. Weather temperature and humidity, average salary, gross domestic, oil price, population, residence, passengers, currency earning rate, and economic factors like (total import and export in USD) were used as independent variables. The study concluded that NN outperformed ARIMA and weather parameters were found to be more significant than average salary, gross domestic and oil price.

A Particle Swarm Optimisation (PSO) and Differential Evolution (DE) for forecasting the Andhra Pradesh Grid using weather parameters were presented [6]. The reported concluded that better prediction accuracy was achieved with PSO and DE than the conventional time series forecast model. In another study, a Curve Fitting Algorithm (CFA) was proposed for forecasting electricity power demand for an hour/day/week/month [30]. Their study shows that future electricity demand can be effectively forecasted based on past demand. In a process to increase forecasting accuracy, historical electricity data combined with Twitter data was used as an input variable to hybrid ANN and SVM forecast model to forecast the electricity consumption in Dutch [14]. The authors compared the performance of ANN to SVM and concluded that the ANN outperformed the SVM. On the other side, the SVM performance in accuracy increases in long-term forecasting. The authors again admit that inclusion of weather data as input could not increase model performance. Similarly, in Ghana, an attempt to predict the 30-day ahead EED demand of the Bono region using a hybrid ML (MLP, SVM, and DT) based on historical weather and electricity demand was made [7]. The authors achieved 95% prediction accuracy; however, it affirms that the inclusion of household lifestyle as an input variable will improve prediction accuracy.

The hybrid of ML (SVM and RF) and time-series models Generalized Linear Model (GLM) and ARIMA forecast model was proposed for predict electricity consumption in South Africa based on the historical electricity price, load demand, and weather parameter [31]. The outcome of the study showed that the ARIMA-GLM combination performs better for long-term forecasting. Similarly, a combination of quantum search with SVM (quantum computing and the chaotic mechanics) for forecasting yearly EED in Taiwan was presented [32]. The empirical analysis revealed that the proposed model exhibits considerably enhanced forecasting performance than other SVM-based forecasting models. A hybrid of mode decomposition (EMD), PSO, and SVM model was present for forecasting short-term EED demand of the Australian electricity market [33]. Sulandari et al. [34] proposed a hybrid of ANN and Fuzzy algorithm and a recurrent formula (LRF) to predict electricity demand in Indonesian. The study results showed that the hybrid model performed well with low values of RMSE. Likewise, in [35], a hybrid of clustering technique (K-means) and ARIMA forecasting model was presented to forecast university buildings electricity demand. Paper revealed that the hybrid model outperformed the ARIMA model alone as a forecast model.

ANN models

An artificial neural network (ANN)-based forecast model for short-term forecasting of Chhattisgarh State electricity demand was proposed [36], using historical weather data as input variables. The results conclude that ANN can efficiently forecast electricity demand. Likewise, an ANN model was applied to forecast the short-term electricity demand of the Iraqi National Grid [37]. The authors achieved high accuracy and a reasonable error margin. In a similar study, a DT algorithm forecast model was proposed for forecasting future EED on the Yola/Jimeta power transmission company using weather parameters. Again, a short to medium term EED forecasting using deep machine learning (ML) algorithm long short-term memory (LSTM)-based neural network enhanced with genetic algorithm (GA) for feature selection was proposed [9], to forecast France metropolitan’s electricity consumption. The mean absolute error (MAE) and root-mean-square error (RMSE) were used as the performance metrics, and the weather parameter was used in the independent variable. Their results affirm that weather parament is very useful in forecasting future electricity demand.

An enhanced Convolutional Neural Network (CNN) and enhanced SVM-based forecast model was presented for forecasting electricity price and load forecasting using [38]. Despite the enhancement made by authors, they recommended additional enhancement of classifiers to improve prediction. In a similar study, an ANN model to forecast consumer demand in North Cyprus was proposed [39]. Their results affirm the ability for ANN effective automatic modelling of electricity; however, the study concluded this could be achieved when the training and testing datasets are meaningful. A forecast model using a deep belief network using historical EED of Macedonian (2008–2014) was proposed to forecast a short-term (1 day) EED. The outcome of the forecast shows a reduction in MAPE by 8.6% by the proposed model compared with traditional techniques [2]. In Young et al. [40], an ANN with Bayesian regularisation algorithm-based model for short-term load forecasting of commercial building electricity usage was carried out. An ANN and hybrid methods to forecast electricity consumption of Turkey were proposed Aydogdu et al. [41]. The proposed model gave an average absolute prediction error of 2.25%.

Khwaja et al. [11] presented an ensemble ANN predictive model to enhance short-term electricity load forecasting. Different from existing studies, the authors combined both bagging and boosting techniques to train bagged-boosted ANNs. The study results showed that the proposed ensemble technique offered a reduction of both variance and bias compared to a bagged ANN, single ANN, and boosted ANN. Also, Ahmad et al. [42] combined Extreme Learning Machine and an enhanced Support Vector Machine to forecast short-term electricity demand. The study outcome showed that the proposed hybrid model outperformed other state-of-the-art predictive models in terms of performance and accuracy. Atef and Eltawil [43] proposed a deep-stacked LSTM forecasting model to forecast electricity demand. The paper reported that bidirectional (Bi-LSTM) networks outperformed unidirectional (Uni-LSTM) in terms of forecasting accuracy. Also, a generalised regression Neural Network (GRNN) predictive model was proposed in [44] to predict short-term electricity demand. The study results showed that a GRNN of 30 neurons offered better prediction accuracy in terms of MAPE and MAE than a GRNN of 10 neurons.

Fuzzy logic models

A Fuzzy Logic (FL)-based forecasting model for the next-day electricity demand in Albania was presented [45]. The time, the historical and forecasting value of the temperature and the previous day load (L) served as the independent variables for the forecast of the next-day consumption. The outcome of the study yielded accurate forecasting by the FL model. Motepe et al. [46] proposed an adaptive neuro-fuzzy inference system (ANFIS) model for forecasting South African electricity demand. The author concluded that adding temperature as an input parameter to the proposed model did not enhance forecast accuracy, as typically expected.

SVM models

A new economy (stock indices) reflecting the STLF model for electricity demand forecasting was proposed [47]. The authors attempted to forecast the future demand for electricity based on the Taiwan Stock Exchange Capitalization Weighted Stock Index (TAIEX) data. The study revealed a significant relationship between stock market function and energy demand, which was helpful to financial analysis wanting to do reverse engineering.

Residential load forecasting

A survey in 2019 shows that EED forecasting, especially short-term load forecasting for individual (residential) electricity customers, plays an increasingly essential role in the future grid planning and operation Kong et al. [51]. Similarly, the outcome of Leahy and Lyons [48] affirms that water heating styled used by a household is even more essential than the number of electrical appliances when explaining domestic electrical energy usage. Furthermore, a study in Portugal shows that residential electricity consumption since 1990 has been increasing more rapidly than the Gross Domestic Product (GDP) per capita [49]. Therefore, studies that focused on forecasting residential EED is of necessity. However, the results revealed that eight (8) out of the sixty-seven (67) representing (11.94%) reviewed works focused on residential electricity forecasting, and a section of these studies is presented.

An hourly prediction of residential energy consumption using RF and SVM was proposed by Hedén [50], using 187 households in Austin. Error metrics are mean bias (MB), coefficient of variance (CV), and MAPE. The study achieved better forecasting accuracy compared with traditional time series models. Again, an LTSM for short-term residential load forecasting based on residential meter reading [51] and resident behaviour learning [52] was studied. The outcomes of these works showed the effectiveness of the LTSM in electricity demand forecasting.

In the same way, linear regression, RF, and SVM predictive model (data-driven) for estimation of city-scale energy use in buildings were proposed by Kontokosta and Tull [53]. The outcome of the study revealed that adequate electricity consumption in a building could be predicted using actual data from a moderately small subset of buildings. Likewise, unsupervised ML algorithms such as Self-Organising Maps (SOM), k-means, and k-medoid were used to cluster residential electricity based on their trend of electricity use within the day [54]. The study found that households and how they use electricity in the home can be categorised based on specific customer characteristics. Mcloughlin et al. [55] examined residential electricity consumption patterns in Irish based on occupant socio-economic and dwelling variables. The variable examined includes the number of bedrooms, dwelling type, age of household head, social class, household composition, and water heating.

The study outcome showed a positive association between maximum demand periods and several household appliances, especially dishwashers, electric cookers tumble dryers, with electric cook topping. The study further found that the time of use of electrical appliances was dependent on occupant characteristics, and younger occupants of a household tend to use more electricity than the older. Alike, an ARIMA, NN, and exponential smoothening forecast model were proposed for forecasting household electricity demand [56]. The authors concluded that forecasting accuracy varies considerably depending on the choice of forecasting techniques/tactic and configuration/selection of input parameters. Again, a neural network model for predicting residential building energy consumption was proposed by Biswas et al. [57]. The outcome of the study showed that models based on OWO-Newton algorithms and Levenberg–Marquardt outperformed previous literature.

Engineering and traditional electrical energy demand forecasting

This section presents the electrical energy demand forecasting study based on the traditional time-series and engineering models.

A predictive model for the prediction of medium-term (1-year) electricity consumption of general households based on the lifestyle of the household using Lasso and Group Lasso was proposed [58]. Their results showed that household lifestyle such as family composition, age, and house-type is good predictors of electricity consumption in a home. Similarly, in Nigeria, an attempt was made to estimate the electricity demand of residential users to support energy transitions using the engineering approach, such as calculating the total power consumed in a household based on the power rating of appliance and their duration of use [59]. The study concludes that the proposed system can serve policymaking in Nigeria to improve the financial sustainability of the energy supply structure. An extended Autoregressive Distributed Lag (ARDL) model was proposed to estimate residential electricity consumption per capita demand function, which depends on the GDP per capita in Algeria [60]. The study concluded that promoting financial growth among citizens of Algeria would reduce electricity consumption, since wealthier people (higher income earners) mighty use of more efficient appliances.

In Bogomolov et al. [61], authors used general public dynamics derived data from cellular network and energy consumption dataset to predict the next-day energy demand. The study could serve a model to enhance the energy meter to promote energy conservation. Likewise, the historical data of on–off times of residential appliances were used to predict the next-day electricity demand using Bayesian inference [62]. The study concluded that historical electricity consumption data only is not adequate for a decent eminence hourly forecast.

Also, an ARIMA and Holt-Winter model was proposed for forecasting the national electricity consumption of Pakistan from 1980 to 2011 [3]. The study revealed that the demand in household energy consumption would higher as compare with all other sectors. Jain et al. [63] proposed an ARIMA forecaster for forecasting electricity consumption. The proposed model achieved a MAPE of 6.63%. The authors concluded that the ARIMA model has the potential of computing in EED forecasting with other forecast techniques. Integration of three (3) forecasting model, long-range energy alternative planning (LEAP), ARIMA, and Holt-Winter, was proposed for forecasting long-term energy demand in Pakistan [64]. The study would be valid for energy supplies for accurate estimation of users’ demand in the future. The combination of Extreme Learning Machine and Multiple Regression for forecasting China’s electricity demand was proposed [65]. A quantile regression (QR) model for long-term electricity demand forecasting in South Africa within 2012 and 2015 was presented by Mokilane et al. [66]. The model was helpful to power distribution industries in the country.

An attempt was made to forecast the electricity consumption of Ghana by 2030 using ARIMA-based model. The study outcome projected that Ghana’s electricity consumption would grow from 8.5210 billion kWh in 2012 to 9.5597 billion kWh in 2030 [67]. However, in 2017, a report by the energy commission revealed that electricity consumption was 14,247 GWh [68]. A generalised additive model was adopted for forecasting medium-term electric energy demand in a South African power supply system [69]. The outcome of the study revealed a useful application of the proposed model in the power generation and distribution industries in the country. An investigation between the association of causal nexus and (environmental pollution, energy use, GDP per capita, and urbanisation) in an attempt to forecast Nigeria’s energy use by 2030 was carried out using the ARIMA and ETS models [70]. The study outcome showed better forecasting accuracy by both models, and a high rise in energy demand was observed.

Multiple regression analysis approaches for forecasting the yearly electricity demand in commercial sectors and electricity access rates in rural and urban households in some selected West African countries, which included Ghana, were carried out by Adeoye and Spataru [71]. The study showed that there is a very high variation in hourly electricity demand in the dry seasons. Their results affirm Nti et al. [7] report that the demand pattern of electricity in Ghana is highly dependent on the month of the year. In a way, one can say there is a partial agreement in these two studies. A time-series regression model for forecasting South African’s peak load demand was presented [72]. Experimental fallouts indicated that when the temperature is included as an input parameter, improvement in accuracy by the forecast model was realised. ARIMA-based predictive model for predicting both sectoral and total electrical energy consumption of Turkey for the next 15 years was proposed [73]. The study points out that the demand for electrical energy in agriculture sectors, transport, public service, residential, and commercial will keep increasing.

Similarly, partially linear additive quantile regression models for forecasting short-term electricity demand during the peak-demand periods (i.e., from 6:00 to 8:00 pm) were carried out in South African [74]. The authors found out that the use of the proposed system in power utility industries for the planning, scheduling, and dispatching of electricity activities will result in a minimal cost principally during the peak-period hours. Caro et al. [75] predicted the Spanish electricity demand using the ARIMA model. The study achieved an improvement in the short-term predictions of electricity demand with less computational time. A short-term electricity load forecasting model based on dynamic mode decomposition was proposed in [76], the proposed model showed better stability and accuracy compared with other predictive models.

Quantitative analysis of findings

The descriptive statistics of the study outcome is presented under this section with tables and charts.

Algorithms used for forecast models

As part of the aim of the review, we sought to find out the most used algorithms in electricity forecasting models. The most top nine (9) used algorithms found in the sixty-seven (67) article are presented (Fig. 2); this includes only algorithms that were used in more than a single paper. The study outcome revealed that 90% out of the top nine algorithms were AI-based, with ANN representing 28% of AI models used in electricity forecasting. Besides, the ANN models were primarily used for STLF where electrical energy consumption patterns are more intricate than LTLF. The traditional AFRIMA recorded 17.5% due to its efficiency in LTLF, where load fluctuations and periodicity are less critical.

Fig. 2
figure 2

Top nine (9) most used algorithms for electricity forecasting

Additionally, a high percentage of regression models is used for LTLF prediction. The study outcome shows how AI is applied in various sectors of the economy to improve efficiency and profitability. Also, we observed that the SVM, PSO and Fuzzy are gaining more popularity in recent studies, a sign of increasing attention of researchers on these algorithms for EED forecasting.

Study origin

Table 1 shows the studies and their origin (countries). The origin of surveyed studies was examined, in order to find the linkage between the power crisis in the continent and studies on electrical energy demand forecasting. Concerning geographical coverage, it was found that a high number of studies (31.34%) were carried out in Europe, 17.91% in Africa and 19.40% in Asia. Making Africa the third-highest, however, interestingly, most of the studies in Africa were carried in South Africa (five representing 41.67%), three (3) representing 25% in Ghana, with the rest from Nigeria and other African countries. The energy crisis that hit Europe in 2008, reported by [77], can be attributed to the numerous studies in electricity forecast, as shown in Table 1. It was further observed that 2 out of 3 studies in Ghana were based on national or regional electricity demand forecasting (one long-term, one short-term (1 month)). While the third study aimed at identifying the relationship between electricity demand and economic growth. Thus, it suggests that the limited number of studies in electricity demand forecasting by both academicians and professionals in this field might have partially contributed to the power shortages facing the nation, which in 2015 resulted in “Domsur”.

Table 1 Studies and their origin (for N = 67 papers)

Used evaluation metrics

The performance of every forecasting model is examined based on the difference in error between the actual value \(\left( y \right)\) and the predicted value (\(\hat{y}\)). Several of these metrics were identified in the literature. However, we examined the most used in electrical energy demand forecasting to enable new beginners in the field of electricity demand forecasting to have a firm grip on which to apply in their study. Figure 3 shows the top four (4) most used error metrics in two or more studies. The results revealed that root-mean-square error RMSE (38%) was the most used error metric among EED forecasters, followed by mean absolute percentage error MAPE (35%). Due to the effectiveness in measure predictive model performance and their usefulness for short-term prediction, it was also observed that the MAPE is a standard primary metric as it is easy to both calculate and understand. The results affirm the findings in [12, 19, 22] that MAE, MAPE, and RMSE are the most commonly used evaluation metrics in EED forecasting model.

Fig. 3
figure 3

Most used error metrics in electricity load forecasting

Forecast type

Based on short-term load forecasting (STLF), medium-term load forecasting (MTLF), and long-term load forecasting (LTLF), it was observed that 80%, 15%, and 5% of the electrical energy demand forecasting were STLF, MTLF, and LTLF, respectively. The massive number of studies is based on STLF as compared with LTLF and MTLF call for further studies into the challenges associated with LTLF and MTLF electricity load forecasting. Again, 80% STLF forecasting affirms the 38% use of RMSE error metric, since it is for STLF forecasting. The results affirm the report in [22] that 43.6% of electricity forecasting are short-term prediction.

Model input parameters

The efficiency of every predictive model is believed to be partially dependent on the independent (input) variable [7]. At this level, the focus was to examine the different input variables used for electricity load forecasting. The type of independent variables (input features) used by electricity load forecasters was also examined. The current study observed that sixty (60) out of the sixty-seven (67) papers made known the input parameter to the proposed model. Table 2 presents the type of input variables and the percentage of studies that utilised it. It was observed that a high percentage (50%) of electricity demand forecasting was based on weather parameters; next was the historical electricity consumption pattern. The outcome exposed that little attention is given to household lifestyle in electricity demand forecasting. However, Nishida et al. [58] argue that residential (domestic) energy consumption differs depending on the lifestyle of the family. Family lifestyle, according to [56], cannot be undermined in electrical load forecasting. According to these studies, these factors include the life stage family composition, house type, age, home appliances possessed and their usage, family income, cultural background, social life, and lifestyle habits, which include how long to stay at home and how to spend holidays. The observations open the opportunity for further studies on the association between EED and household lifestyle.

Table 2 Parameter used in electricity load forecasting

Conclusions

The current study sought to reviewed state-of-the-art literature on electricity load forecasting to identify the challenges and opportunities for future studies. The outcome of the study revealed that electricity load forecasting is seen to be complicated for both engineers and academician and is still an ongoing area of research. The key findings are summarised as follows.

  1. 1.

    Several studies (90%) have applied AI in electrical energy demand forecasting as compared with traditional engineering and statistical method (10%) to address energy prediction problems; however, there are not enough studies benchmarking the performance of these methods.

  2. 2.

    There are few studies on EED forecasting in Africa countries (12 out of 67). Though the continent has progressive achievement in the creation of Regional Power Pools (PPP) over the last two decades, the continent still suffers from a lousy power network in most of its countries, leaving millions of people in Africa without electricity.

  3. 3.

    Temperature and rainfall as an input parameter to the EED forecasting model are seen to have a divergent view. At the same time, some sections of research recorded an improvement in accuracy and reported no improvement in accuracy when introduced and input. However, the current study attributes this to the difference in automorphic temperature globally and the different economic status among countries. An additional investigation will bring more clarity to the literature.

  4. 4.

    This study revealed that EED forecasting in the residential sector had seen little attention. On the other hand, Guo et al. [78] argue that the basic unit of electricity consumption is home.

  5. 5.

    It was observed that there had been a global increase in residential electricity demand, this according to the report in [49] can be attributed to the growing rate of buying electrical equipment and appliances of low quality due to higher living standards. However, a further probe into Soares et al. [49] assertion will bring clarity to literature because of the discrepancy in opinions in literature.

  6. 6.

    Lastly, the study revealed that there is a limited number of studies on load forecasting studies in Ghana. We, therefore, recommend rigorous researchers in this field in the country to enhance the economic growth of the country.

Our future study will focus on identifying the relationship between household lifestyle factors and electricity consumption in Ghana and predict load consumption based on identified factors since it is an area that has seen little or no attention in Ghana.