Abstract
We present a household and enterprise energy survey dataset collected within the framework of the PeopleSuN project in Nigeria in 2021. Across three Nigerian geopolitical zones, a total of 3,599 households and 1,122 small and medium-sized enterprises were surveyed. The sample is designed to be representative of rural and peri-urban grid-electrified regions of each zone. Our surveys collect data on demographic and socioeconomic characteristics, energy access and supply quality, electrical appliance ownership and usage time, cooking solutions, energy related capabilities, and supply preferences. We encourage academic use of the data presented and suggest three avenues of further research: (1) modelling appliance ownership likelihoods, electricity consumption levels and energy service needs in un-electrified regions; (2) identifying supply-side and demand-side solutions to address high usage of diesel generators; (3) exploring broader issues of multi-dimensional energy access, access to decent living standards and climate vulnerability.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Background & Summary
Rural and peri-urban populations in Nigeria continue to suffer unreliable and expensive energy supply. According to the World Bank, the electricity access rate in Nigeria stood at 55.4% in 2020 with a big gap between urban and rural areas (83.9% vs. 24.6%)1. At the same time, nearly 30 million Nigerian households depend on wood as a source of cooking fuel, the collection of which is time consuming and mainly done by women2. Where there is supply, it is typically unreliable and frequently interrupted by blackouts. The Nigeria Enterprise Survey from the World Bank showed that 27% of Nigerian firms identified reliability of electricity supply as the main obstacle to their business3. On average, 32.8 power outages were reported to occur in a typical month leading to an estimated 11% loss in sales value3. The average grid-connected household receives just 6.6 hours of supply on a typical day, linked to a per capita consumption of just 144kWh per year4. In comparison, the annual per capita consumption in Ghana and South Africa is respectively 351 kWh and 4,198 kWh. Plagued by issues of supply quality, many Nigerians have resorted to self-generation using petrol and diesel generators, spending approximately 1.56 trillion Naira (3.76 billion USD, using an average exchange rate in 2021) per year on fuel5.
While global efforts are accelerating under the banner of achieving Sustainable Development Goal 7 (SDG7) by 2030, progress in Nigeria remains hindered by limited data availability, among other barriers. Data describing the energy access deficit in Nigeria exists (see Table 1), however, there is limited disaggregate information describing the supply quality in the existing network and the unmet demand in ‘un-electrified’ regions. In this data descriptor, we present primary survey data collected to fill this and other gaps through the ‘People Power: Optimizing off-grid electricity supply systems in Nigeria’ project (PeopleSuN)’6. PeopleSuN is funded by the German Federal Ministry of Education and Research (BMBF) within the funding initiative ‘Client II - International Partnerships for Sustainable Innovations. Data collection followed extensive stakeholder discussions in Nigeria under the PeopleSuN project to define the data gap and the necessary survey and sampling strategy to address this. The questionnaires used draw from specific modules within established surveys capturing energy-related data, most directly from the Multi-Tier Framework for Measuring Energy Access surveys7. The surveys provide data on household and enterprise characteristics, energy supply and consumption. They also capture preferences, trust in institutions and several gender-disaggregated variables. Our sample is representative of grid-electrified rural and peri-urban regions across three geopolitical zones with large energy access deficits. This data provides important insight into actual energy supply quality and use among ‘electrified’ communities and can be used to improve models of energy demand in similar but currently ‘un-electrified’ communities. The final sample includes 3,599 households and 1,122 small and medium sized enterprises from 225 enumeration areas across the three geopolitical zones.
Methods
Figure 1 provides an overview of the sample design and survey modules used. Complete codebooks describing the questionnaires are available here: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/GTNEJD6. We now describe the data collection methods in detail.
Sample frame
Three geopolitical zones were selected for sampling: North West, North Central, and South South. These zones were purposefully chosen based on lower rates of urbanization (excluding South West and South East) and safety and associated logistical concerns (excluding North East). The sample frame was then developed using remotely sensed datasets due to the lack of up-to-date administrative data. Four remotely sensed spatial datasets were used, as shown in Table 2. The focus of this work was on understanding energy supply quality and use among grid-electrified households living in rural and peri-urban regions (outside urban centers). Urbanization was defined using the ‘Urban Rural Catchment Area’ dataset (URCA), which stratified the population by travel time to the nearest agglomeration. Further detail on the URCA sub-categories and their definition can be found in8. For the purposes of this work, the URCA categories were aggregated into an urban core and varying degrees of rurality, as shown in Table 3. Figure 2 describes the distribution of households across these categories and separated into those assumed to be electrified as per the ELEC dataset. All Towns (>0.02 M and <0.05 M population) and regions that were within or further than 1 hour travel time from a city boundary were defined as outside the urban core and relevant to this study. Estimated population sizes and electrification rates within each aggregate spatial category across all six geopolitical zones are shown in Fig. 2, providing further intuition regarding the selection of North Central, North West, and South South as zones of study.
Sampling
In total 3,600 households were intended to be surveyed, split evenly into the three geopolitical zones. Households were sampled from 225 enumeration areas (EAs) consisting of 16 households each. In addition, 5 firms were to be surveyed in each enumeration area, giving a total sample of 1,125 firms. Within each zone, half of the states (four in total per state) were selected using systematic probability proportionate to size sampling (PPS) in order to reduce survey costs. The count of households at the state level was used as the weighting variable in the PPS sampling. This approximates a representative zonal sample when not able to visit all states within each zone. The state-level pre-sampling may introduce bias with respect to the rurality of sparsely populated states not included in our sample. To address this the distribution of households with respect to the URCA PeopleSuN categories across all states in each zone is used to adjust subsequent sampling within selected states. Table 4 describes the approximate proportion of grid-connected households in each zone and category. This is done including all states, not just those sampled. The proportions therefore represent the share of people living in each category that should be replicated within the restricted sample of states. This proportional stratification results in a sample size within each zone and category that reflects, as far as possible, a representative sample of grid-electrified rural and peri-urban households across the zone.
The final enumeration areas within each zone are selected using PPS sampling once again, weighted by the total number of households in each cell to achieve the sample size by zone as defined in the Desired EAs column. A total sample of 225 enumeration areas is equivalent to a zonal sample of 75 enumeration areas, distributed as per the proportional stratification across each of the three zones. Within each zone, this results in a sample that is representative of grid-electrified households outside the urban core while reducing the cost of survey implementation. Analysis can therefore be conducted at the zonal level as is, and re-weighted to arrive at a representative ‘national’ sample across all three contiguous zones if desired. The necessary weights for aggregate analyses across all three contiguous zones are provided with the dataset.
Implementation
Survey implementation was conducted by eHealth Africa (eHA), who also secured state-government permission to conduct surveys in each state. The survey was conducted using the Kobo Collect Computer-Assisted Personal Interviews (CAPI) technology. All EAs within security compromised Local Government Areas (LGAs) were identified and shared with the research team for replacement. From the total of 255 EAs originally planned (reflecting a desired sample of 225 EAs and 30 buffer EAs), 247 EAs remained during implementation. From the 21st to the 23rd June 2021, a training of trainers was conducted by the research team for eHA’s key staff and field supervisors who were scheduled to work on the project. Following this, the translation of survey instruments into local languages (Hausa and Pidgin) was finalized on the 12th of July. This was followed by field testing which was conducted on the 13th and 14th of July 2021, and a field testing report was submitted on 16th July 2021. Step-down training of enumerators was simultaneously conducted between 25th - 29th July 2021 in three locations (Kano, Abuja, and Akwa Ibom) to accommodate the enumerators by zone and reduce travel costs. Kano hosted the North West states’ enumerators while Abuja and Akwa-Ibom hosted North Central and South-South enumerators respectively with the report submitted on the 30th of July 2021. Pre- and post-training assessments were conducted and enumerators that met the evaluation benchmarks were finally selected. A total of 60 enumerators and 12 state supervisors were engaged in the quantitative survey activity.
Data collection commenced on August 7th. A team comprising two enumerators (mostly paired male and female) worked to cover 16 households and 5 enterprises in each EA. Community leaders and local authorities were consulted before enumerators’ visits and commencement of activity at the LGA and community levels. The team faced challenges in gathering survey data due to security, communal clashes and inaccessibility in certain EAs, requiring replacement with a list of buffer enumerations areas within each zone. As a result, there is some small level of bias with respect to the zones given the mismatch in URCA group proportions, which is shown in Table 4. This remains small overall and the mechanism underlying this is transparently communicated here. Furthermore, one enumeration area in North Central, and six in North West were not grid-connected (despite being identified as electrified within the AtlasAI data), but were using electricity in other forms such as diesel generators or solar devices. One EA (AK12) contains a sample of 15, rather than 16 households which was not able to be completed due to banditry on the final day of enumeration, giving us a final sample of 3,599 households. Figure 3 depicts the full sample of enumeration areas with household counts shown as labels. Points have been jittered up to 10 km.
Data Records
All data is publicly available here https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/GTNEJD6.
Technical Validation
Prior to data collection, a thorough preparatory program was implemented, starting with a training of trainers session conducted by the research team for eHA’s key staff and field supervisors from June 21st to 23rd, 2021. The sessions aimed to establish a strong foundation in survey methodology and the specific techniques required for this project. Following the training of trainers, field testing was conducted on July 13th and 14th to identify potential issues or inconsistencies in the survey tool or the Hausa and Pidgin translations. A field testing report was submitted on July 16th, providing valuable insights to improve the survey instruments and enumerator guidelines. Subsequently, step-down training of enumerators was carried out between July 25th and 29th, 2021, in three locations (Kano, Abuja, and Akwa Ibom) to accommodate enumerators by zone and reduce travel costs. The training focused on data collection techniques, handling of survey instruments, and ethics in data collection. Pre- and post-training assessments were conducted, and enumerators meeting the evaluation benchmarks were finally selected. In total, 60 enumerators and 12 state supervisors were engaged for the quantitative survey activity.
Data collection finally commenced on August 7th, 2021, with teams of two enumerators covering 16 households and five enterprises in each enumeration area (EA). Community leaders and local authorities were consulted before the enumerators’ visits and commencement of activity at the LGA and community levels. During data collection, standard outlier and consistency checks were performed daily across the dataset. This process involved running outlier detection algorithms and cross-validating the collected data for accuracy and completeness. Inconsistencies or outliers identified were either approved, rejected, or flagged for clarification. This real-time validation process helped maintain the quality of the data collected and minimized the likelihood of errors. Security challenges, communal clashes, and inaccessibility in certain EAs necessitated the use of buffer enumeration areas, resulting in a small level of bias with respect to the zones. This bias, however, remains minimal, and the underlying mechanism is transparently communicated.
Following data collection, a three-week data cleaning process was initiated, which involved extensive exchange with the enumerators and supervisors. This process allowed for the identification and rectification of any remaining inconsistencies, errors, or omissions in the dataset. Enumerators were consulted for clarification and verification of flagged data points, ensuring that the final dataset was both accurate and reliable, and any issues transparently reported. For example, during the cleaning process, it was found that one EA in North Central and six in North West were not grid-connected but used other forms of electricity like diesel generators or solar devices. Additionally, one EA (AK12) contained a sample of 15 instead of 16 households due to banditry on the final day of enumeration, resulting in a final sample of 3,599 households.
Usage Notes
We present here a novel survey dataset describing demographic and socioeconomic characteristics, electricity access and supply quality, electrical appliance ownership and usage time, cooking solutions, capabilities, and preferences across households and enterprises living in grid-electrified communities outside the urban core in Nigeria. The dataset is provided as a set of .csv and .xlsx files. Table 5 describes each file.
This dataset can inform decision makers, project developers and researchers about the current energy supply situation and model future energy use towards securing a decent living standard for all in rural and peri-urban areas of Nigeria. Our preliminary assessment points to three key areas of further research using this data. First, we propose modelling appliance ownership likelihoods, electricity consumption levels and energy service needs in un-electrified regions. Surveyed use of appliances among grid-connected communities can serve as a proxy to estimate future electricity demand for similar households which are currently not grid connected. Second, we propose identifying supply-side and demand-side solutions to address high usage of diesel generators. The collected data provides evidence of grid supply reliability and backup prevalence solutions among rural and peri-urban grid-electrified communities, as well as energy service needs. Thirdly, we propose exploring broader issues of multi-dimensional energy access, access to decent living standards and climate vulnerability. The data captures objective appliance ownership and use as well as subjective satisfaction with access to basic energy services such as keeping oneself at a comfortable temperature. Similarly, with regard to productive uses of electricity, the data describes appliance and equipment ownership and use, as well as subjective satisfaction with energy supply affordability, reliability and adequacy.
A common theme across these three research areas is the importance of increasing the central grid supply quality and in providing cost-competitive decentralized alternatives to improve living conditions and business perspectives in Nigeria. Although our data descriptor focusses on energy access, we understand that achieving SDG7 - providing sustainable energy for all - is not the silver bullet for improving living conditions and business perspectives on its own. A myriad of accompanying factors such as access to other basic services including education, sanitation and health, access to capital and broader issues of gender equality must be addressed. Nevertheless, SDG7 is a key enabling achievement as many other SDGs depend upon sustainable energy access. It is clear that achieving improving energy access in Nigeria is both about new connections and about increasing the supply quality in weak grid areas. The existing grid network needs to be strengthened and decentralised renewables will play a role in both peri-urban and rural regions. Furthermore, in order to satisfy energy service needs in the context of a changing global climate, demand-side policies supporting household acquisition of appliances necessary for a decent living standard must be explored. A cross-cutting analysis combining expertise from institutions responsible for centralised network planning and those responsible for decentralised electrification efforts would be ideal to strategically plan these efforts. We hope that the data we present can be useful in providing evidence for such an analysis.
Code availability
As we are presenting collected survey data, no custom code was used or is necessary to generate or work with this data. Complete codebooks describing the questionnaires are available here: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/GTNEJD6.
Change history
04 December 2023
A Correction to this paper has been published: https://doi.org/10.1038/s41597-023-02771-2
References
World Bank. World development indicators 2021: https://data.worldbank.org/country/nigeria
WHO. World health statistics 2022: monitoring health for the sdgs, sustainable development goals. Tech. Rep. (2022).
Nigeria enterprise survey. Tech. Rep. NGA_2014_ES_v01_M. The World Bank (2014).
Living standard measurement survey/general household survey. Tech. Rep. Nigerian National Bereau of Statistics and World Bank (2019).
Ugwoke, B., Gershon, O., Becchio, C., Corgnati, S. & Leone, P. A review of nigerian energy access studies: The story told so far. Renewable and Sustainable Energy Reviews 120, 109646 (2020).
Pelz, S. et al. RLI-PeopleSuN Energy Audits (RLI-PSN): Household and enterprise energy supply, use and preference data from rural Nigeria. Harvard Dataverse https://doi.org/10.7910/DVN/GTNEJD (2022).
Bhatia, M. & Angelou, N. Beyond Connections: Energy Access Redefined (2015).
Cattaneo, A., Nelson, A. & McMenomy, T. Global mapping of urban–rural catchment areas reveals unequal access to services. Proceedings of the National Academy of Sciences 118, e2011990118 (2021).
Bondarenko, M. & WorldPop. Bottom-up gridded population estimates for nigeria, version 1.2 (2020).
NPC/Nigeria and ICF. Nigeria demographic and health survey 2018. Tech. Rep. National Population Commission. Abuja, Nigeria, and Rockville, Maryland, USA (2019).
Acknowledgements
This research work is part of the three-year research and development project “People Power: Optimizing off-grid electricity supply systems in Nigeria” (PeopleSuN). PeopleSuN is a project funded by the German Federal Ministry of Education and Research (BMBF) within the funding initiative “Client II - International Partnerships for Sustainable Innovations” (FKZ 03SF0606A). We thank eHealth Africa for their impeccable work in collecting the data and their cooperation. We would also like to thank Tomiwa Aileru, Clara Bruegge, Oriyomi Amusa and Ulli Lich who supported us in the preparation of the survey instruments and processing of the data.
Author information
Authors and Affiliations
Contributions
S.P., C.N. and P.B. conducted stakeholder discussions to define the data gap (for details, see https://www.peoplesun.org/). S.P. developed the sampling and data collection methods. eHealthAfrica implemented the data collection. All authors contributed in drafting the accompanying text.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Pelz, S., Chinichian, N., Neyrand, C. et al. Electricity supply quality and use among rural and peri-urban households and small firms in Nigeria. Sci Data 10, 273 (2023). https://doi.org/10.1038/s41597-023-02185-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-023-02185-0
- Springer Nature Limited
This article is cited by
-
Analysis of solar energy potentials of five selected south-east cities in nigeria using deep learning algorithms
Sustainable Energy Research (2024)
-
Assessment of health consequences of fossil fuel energy use in Osun state, Nigeria: burden and implications
Discover Energy (2024)