The Danish Demographic Database—Principles and Methods for Cleaning and Standardisation of Data

Population Reconstruction

Abstract

Since 2001, volunteers have completely transcribed seven Danish censuses dating from 1787 to 1880. Thanks to this effort, the research community now has access to a large body of demographic data. The census data were digitised according to the principle of literal transcription, leaving all interpretation to the users. The disadvantage of this approach is that it complicates the creation of aggregated statistics: the spelling of, for example, position in household and occupation was not standardised, so the same entity is described in many different ways. To overcome this obstacle, the data were cleaned and standardised. Standardisation consists of adding numeric codes for gender, civil status and position in household. For occupations, HISCO has been applied to ensure that the data can be used in comparative research.
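
As a rough illustration of what adding numeric codes alongside a literal transcription can look like, here is a minimal Python sketch. The spelling variants, the code values and the function names are all hypothetical and chosen only for this example; they are not the code tables or procedures used by the Danish Demographic Database.

```python
import re

# Hypothetical code table: these numeric values are illustrative assumptions,
# not the codes used by the Danish Demographic Database.
CIVIL_STATUS_CODES = {
    "ugift": 1,     # unmarried
    "gift": 2,      # married
    "enke": 3,      # widow
    "enkemand": 3,  # widower
}

# Hypothetical spelling variants mapped to a canonical form.
SPELLING_VARIANTS = {
    "ugivt": "ugift",
    "givt": "gift",
    "enkem": "enkemand",
}

UNKNOWN = 0  # assumed code for entries that cannot be classified


def normalise(text):
    """Lower-case the entry and drop punctuation and surrounding whitespace."""
    return re.sub(r"[^a-zæøå ]", "", text.lower()).strip()


def civil_status_code(literal):
    """Map a literally transcribed civil status to a numeric code."""
    key = normalise(literal)
    key = SPELLING_VARIANTS.get(key, key)
    return CIVIL_STATUS_CODES.get(key, UNKNOWN)


def standardise(record):
    """Return a copy of the record with a code added.

    The literal transcription is left untouched, so users can still
    make their own interpretation of the source.
    """
    coded = dict(record)
    coded["civil_status_code"] = civil_status_code(record["civil_status"])
    return coded
```

The design point in this sketch is that the code is stored in a separate field next to the literal value rather than overwriting it, which keeps the transcription principle intact while making aggregation possible.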

Notes

  1. http://ddd.dda.dk

  2. H.J. Marker, senior researcher at DDA 1984–2009. Presently director of the Swedish National Data Service.

  3. The Danish Data Archive became a member of the Danish National Archives in 1993.

  4. http://ddd.dda.dk/Vejledning%20i%20kildeindtastning.htm

  5. A census must be at least 75 years old before it is open to everyone.

  6. History of Work Information System, http://historyofwork.iisg.nl/index.php. Codes for historical occupations based on ISCO, used for comparative research.

  7. www.nappdata.org

  8. https://www.nappdata.org/napp-action/variables/OCCHISCO#codes_section

  9. https://www.ipums.org/

  10. http://www.rhd.uit.no/

  11. http://www.prdh.umontreal.ca/IMAG/

  12. https://www.ipums.org/

Author information

Correspondence to Nanna Floor Clausen.

Copyright information

© 2015 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Clausen, N.F. (2015). The Danish Demographic Database—Principles and Methods for Cleaning and Standardisation of Data. In: Bloothooft, G., Christen, P., Mandemakers, K., Schraagen, M. (eds) Population Reconstruction. Springer, Cham. https://doi.org/10.1007/978-3-319-19884-2_1
