A training manual for event history data management using Health and Demographic Surveillance System data

Bocquier, Philippe; Ginsburg, Carren; Herbst, Kobus; Sankoh, Osman; Collinson, Mark A.

doi:10.1186/s13104-017-2541-9

A training manual for event history data management using Health and Demographic Surveillance System data

Research note
Open access
Published: 26 June 2017

Volume 10, article number 224, (2017)
Cite this article

Download PDF

You have full access to this open access article

BMC Research Notes Aims and scope Submit manuscript

A training manual for event history data management using Health and Demographic Surveillance System data

Download PDF

Philippe Bocquier^1,2,
Carren Ginsburg^2,4,
Kobus Herbst^3,4,
Osman Sankoh^4,5,6 &
…
Mark A. Collinson^2,4,7

2670 Accesses
10 Citations
11 Altmetric
Explore all metrics

Abstract

Objective

The objective of this research note is to introduce a training manual for event history data management. The manual provides a first comprehensive guide to longitudinal Health and Demographic Surveillance System (HDSS) data management that allows for a step-by-step description of the process of structuring and preparing a dataset for the calculation of demographic rates and event history analysis. The research note provides some background information on the INDEPTH Network, and the iShare data repository and describes the need for a manual to guide users as to how to correctly handle HDSS datasets.

Results

The approach outlined in the manual is flexible and can be applied to other longitudinal data sources. It facilitates the development of standardised longitudinal data management and harmonization of datasets to produce a comparative set of results.

Introduction

The International Network for the Demographic Evaluation of Populations and their Health (INDEPTH) was founded in 1998 and represents a group of currently 47 Health and Demographic Surveillance System (HDSS) sites located in 18 low- and middle-income countries in Africa, Asia and the Pacific. Following its establishment, the Network has endeavoured to build a standardised set of data management protocols pertaining to HDSS data [1]. One of the key challenges of HDSS data management relates to the effective and efficient means of storing and maintaining longitudinal data on health, socio-economic and demographic dynamics that are prospectively updated within a geographically-defined population. These longitudinal datasets capture the dynamic sets of events and episodes pertaining to every individual and household under surveillance (including migration into and out of the demarcated HDSS area). They require sound database structures and protocols for data management and storage. The HDSS platforms form the backbone for high-quality data analysis for scientific enquiry and embedding research projects.

One of the research priorities of the INDEPTH Network is to facilitate comparative demographic analyses across HDSSs. The Multi-centre Analysis of the Dynamics of Internal Migration And Health (MADIMAH) is an INDEPTH project that commenced in 2011 with the aim of producing a set of comparative analyses across HDSSs on questions concerning migration and health [2]. The first phase of MADIMAH involved a study of migration, urbanisation and human capital using datasets from eight HDSSs in Burkina Faso, Kenya, South Africa and Mozambique [3]. Thereafter, a multi-centre study examining the migration effect on mortality across nine sub-Saharan African HDSSs was conducted [4]. These studies illustrate the use of standardised longitudinal data management and harmonization of HDSS data across multiple centres. They further illustrate the scientific potential that may be realised by following a uniform analytical framework to produce a comparative set of results across different locations. The objective of this research note is to introduce a training manual for event history data management which proposes a set of versatile procedures aimed at producing standard data structures for use in longitudinal event history analysis (EHA).

Main text

Data

The INDEPTH iSHARE2 data repository^{Footnote 1} went online in 2014 and provides a unique resource of high-quality, fully documented HDSS longitudinal datasets available for download to a wide range of users, including HDSS-linked scientists and analysts, researchers and students [5]. The repository, which is growing over time, holds amongst others, core micro datasets describing the key demographic events of more than 25 HDSS populations and unique data on cause specific mortality [5]. Recently, the first of a series of multi-centre core micro datasets attached to the MADIMAH project has been released and is structured to examine determinants of in- and out-migration, particularly the education status of the migrant [6].

Methods

The efficient use of these micro datasets requires that users are able to handle HDSS data structures (such as the residency episode files) and understand the range of core events that alter residency status in the HDSS, especially, in- and out-migration, births and deaths. These data structures and properties form the necessary foundation for the statistical analyses of population dynamics. In order to address these requirements, the MADIMAH group developed a manual based on the group’s experiences of conducting comparative analyses across multiple HDSS sites, and of training HDSS data scientists and analysts in these methods. The intention was to provide data managers and analysts who manage raw questionnaire data with a step-by-step description of the process of structuring and preparing a dataset for the calculation of demographic rates and EHA. The approach was to create a common language and set of codes that can create synergies and enable communication across larger communities of data managers and analysts working with longitudinal research designs.

Results: training manual

The training manual is available on-line as Additional file 1 to this note. It provides a general introduction to event history data management. The manual leads the user through all the procedures necessary to format and analyse longitudinal data. It demonstrates how to create a core residency file suitable for EHA and how to check for inconsistencies in the data. The approach is flexible and covers the calculation of basic demographic rates, as well as more complex determinants analysis through the addition of individual and household attributes. The manual illustrates how to enrich the database with new events with precise or imputed dates of occurrence. Finally, the manual explains how to create duration events of several types. The methods outlined in the manual are implemented in detailed coding using Stata software. All sections start with an example of an output file, followed by a check-list and conclude with further examples or programmes needed to solve specific technical issues. Longer, more detailed Stata programmes are available in Additional file 1: Appendix.

This manual is the first comprehensive guide to HDSS longitudinal data management and has become a standard for INDEPTH member HDSS Centres. It can be implemented on longitudinal data from other sources, including register-based, retrospective, or cohort data. It forms the first part of a two-part series. The second manual will guide analysts through the computation of demographic rates and the analysis of determinants and outcomes of demographic processes, using the longitudinal dimension in the data.

Limitations

The procedures outlined in the training manual are most comprehensively applied to HDSS data because these data are inclusive of all entry and exit events in a geographically defined population. In other data sources some entry or exit events might not be relevant, e.g. in-migration for cohort data, or death for retrospective data. Nonetheless, the procedures described in this manual will remain valid and only minor changes to the programming codes will be necessary to apply these methods to such study designs.

Notes

http://www.indepth-ishare.org/index.php/home.

Abbreviations

INDEPTH:: International Network for the Demographic Evaluation of Populations and their Health
HDSS:: Health and Demographic Surveillance System
EHA:: event history analysis
MADIMAH:: Multi-centre Analysis of the Dynamics of Internal Migration And Health

References

Sankoh O, Byass P. The INDEPTH Network: filling vital gaps in global epidemiology. Int J Epidemiol. 2012;41(3):579–88. doi:10.1093/ije/dys081.
Article PubMed PubMed Central Google Scholar
Gerritsen A, Bocquier P, White MJ, Mbacke C, Alam N, Béguy D, Odhiambo F, Sacoor C, Phuc HD, Punpuing S, Collinson MA. Health and demographic surveillance systems: contributing to an understanding of the dynamics in migration and health. Global Health Act. 2013;6:21496. doi:10.3402/gha.v6i0.21496.
Article Google Scholar
Ginsburg C, Bocquier P, Béguy D, Afolabi S, Augusto O, Derra K, Odhiambo F, Otiende M, Soura A, Zabre P, White MJ, Collinson MA. Human capital on the move: education as a determinant of internal migration in selected INDEPTH surveillance populations in Africa. Demogr Res. 2016;4(30):845–84. doi:10.4054/DemRes.2016.34.30.
Article Google Scholar
Ginsburg C, Bocquier P, Béguy D, Afolabi S, Augusto O, Derra K, Herbst K, Lankoande B, Odhiambo F, Otiende M, Soura A, Wamukoya M, Zabre P, White MJ, Collinson MA. Healthy or unhealthy migrants? Identifying internal migration effects on mortality in Africa using Health and Demographic Surveillance Systems. Soc Sci Med. 2016;164:59–73.
Article PubMed Google Scholar
Herbst K, Juvekar S, Bhattacharjee T, Bangha M, Patharia N, Tei T, Gilbert B, Sankoh O. The INDEPTH Data Repository: an international resource for longitudinal population and health data from Health and Demographic Surveillance Systems. J Empir Res Hum Res Ethics. 2015;10(3):324–33. doi:10.1177/1556264615594600.
Article PubMed PubMed Central Google Scholar
Multi-centre analysis of the dynamics of internal migration and human capital in selected INDEPTH centres in Sub-Saharan Africa-release 2016. Provided by the INDEPTH Network Data Repository. 2016. http://www.indepth-network.org. doi:10.7796/INDEPTH.GH004.MIG.2014.v1.

Download references

Authors’ contributions

PB conceptualised the methods and processes outlined in the manual and was a major contributor in writing the manual. CG reviewed and checked the methods and processes outlined in the manual and edited and contributed to the writing of the manual and research note. KH contributed to the conceptualisation of methods outlined in the training manual and contributed to the research note. OS provided INDEPTH team leadership and contributed to the research note. MC contributed to the conceptualisation of methods outlined in the manual, provided MADIMAH team leadership, and contributed to the manuscript. All authors read and approved the final manuscript.

Acknowledgements

We acknowledge institutional support from the School of Public Health, Faculty of Health Sciences, University of the Witwatersrand, South Africa; Centre de Recherche en Démographie et Sociétés, Université Catholique de Louvain, Louvain-la-Neuve, Belgium; and the African Population and Health Research Centre, Nairobi, Kenya, as critical bases for the MADIMAH project leadership.

Competing interests

The authors declare that they have no competing interests.

Funding

The INDEPTH Multi-centre Analysis of the Dynamics of Internal Migration And Health (MADIMAH) project has received funds from the Swedish International Development Agency (Sida: 2012-000379) The research has received a joint financial support from the National Research Foundation, South Africa, and the Wallonia‐Brussels Federation of Belgium (Grant No: 95284). We gratefully acknowledge the South African Medical Research Council (SAMRC) for funding Carren Ginsburg’s Career Development Award.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Authors and Affiliations

Centre de Recherche En Démographie, Université Catholique de Louvain, Louvain‐la‐Neuve, Belgium
Philippe Bocquier
Medical Research Council/Wits Rural Public Health and Health Transitions Research Unit (Agincourt), School of Public Health, Faculty of Health Sciences, University of the Witwatersrand, 27 St Andrews Road, Parktown, Johannesburg, 2193, South Africa
Philippe Bocquier, Carren Ginsburg & Mark A. Collinson
Africa Health Research Institute, University of KwaZulu-Natal, Durban, South Africa
Kobus Herbst
INDEPTH Network, Accra, Ghana
Carren Ginsburg, Kobus Herbst, Osman Sankoh & Mark A. Collinson
School of Public Health, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
Osman Sankoh
Department of Mathematics and Statistics, Njala University, Njala, Sierra Leone
Osman Sankoh
Umeå Centre for Global Health Research, Umeå University, Umeå, Sweden
Mark A. Collinson

Authors

Philippe Bocquier
View author publications
You can also search for this author in PubMed Google Scholar
Carren Ginsburg
View author publications
You can also search for this author in PubMed Google Scholar
Kobus Herbst
View author publications
You can also search for this author in PubMed Google Scholar
Osman Sankoh
View author publications
You can also search for this author in PubMed Google Scholar
Mark A. Collinson
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Carren Ginsburg.

Additional file

Additional file 1. Manual of event history data management using HDSS data.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Bocquier, P., Ginsburg, C., Herbst, K. et al. A training manual for event history data management using Health and Demographic Surveillance System data. BMC Res Notes 10, 224 (2017). https://doi.org/10.1186/s13104-017-2541-9

Download citation

Received: 07 September 2016
Accepted: 17 June 2017
Published: 26 June 2017
DOI: https://doi.org/10.1186/s13104-017-2541-9

A training manual for event history data management using Health and Demographic Surveillance System data

Abstract

Objective

Results

Introduction

Main text

Data

Methods

Results: training manual

Limitations

Notes

Abbreviations

References

Authors’ contributions

Acknowledgements

Competing interests

Funding

Publisher’s Note

Author information

Authors and Affiliations

Corresponding author

Additional file

Additional file 1. Manual of event history data management using HDSS data.

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A training manual for event history data management using Health and Demographic Surveillance System data

Abstract

Objective

Results

Introduction

Main text

Data

Methods

Results: training manual

Limitations

Notes

Abbreviations

References

Authors’ contributions

Acknowledgements

Competing interests

Funding

Publisher’s Note

Author information

Authors and Affiliations

Corresponding author

Additional file

Additional file 1. Manual of event history data management using HDSS data.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation