Abstract
Since 2001 seven Danish censuses dating from 1787 till 1880 have been completely transcribed by volunteers. Due to this effort the research community now has access to a large number of demographic data. The census data were digitised according to the principle of literal data transcription in order to leave all interpretations to the users. The disadvantage of this solution is that it induces problems when creating aggregated statistics as the spelling of, e.g. position in household and occupations was not standardised which leads to great variation in the description of the same entities. In order to overcome this obstacle the data were cleaned and standardised. Standardisation consists of adding numeric codes for the gender, civil status and position in household. For occupations, HISCO has been applied to secure that the data can be used in comparative research.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
H.J. Marker, senior researcher at DDA 1984–2009. Presently director of Swedish National Data Service.
- 3.
Danish Data Archive became a member of the Danish National Archives in 1993.
- 4.
- 5.
A census must be at least 75 years old before every one can get access to it.
- 6.
History of Work Information system http://historyofwork.iisg.nl/index.php. Codes for historical occupations based on ISCO. Used for comparative research.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
References
Hall, P. K., McCaa, R., & Thorvaldsen, G. (Eds.). (2000). Handbook of international historical microdata for population research: A project of IMAG. Minneapolis: University of Minnesota, Minnesota Population Center.
Johansen, H. C. (2004). Early Danish census taking. History of the Family, 9, 23–31.
Kirby, K., Carson, J., Dunlop, F., Dearle, A., Dibben, C., Williamson, L., et al. (2015). Automatic methods for coding historical occupation descriptions to standard classifications (This book, Chap. 3).
Mandemakers, K., & Dillon, L. (2004). Best practices with large databases on historical populations. Historical Methods, 37(1), 34–38.
Marker, H. J. (2001). Folketællingen 1801 – på vej mod en forskningsressource. Metode and Data, 84.
Marker, H. J. (2006). Klargøringen af folketællingen 1801. Metode and Data, 92.
Roberts, E. (2006). Reflections on coding 90 million historical occupations. In Paper at 31st Social Science History Conference, Minneapolis.
Ruggles, S. (2005). The Minnesota population center data integration projects: Challenges of harmonizing census microdata across time and place. In Proceedings of the American Statistical Association, Government Statistics Section (pp. 1405–1415). Alexandria, VA: American Statistical Association.
Ruggles, S. (2012). The future of historical family demography. Annual Review of. Sociology, 38, 18.1–18.19.
Solli, A. (2003). Livsløb - familie - samfunn. Endring av familiestrukturar i Norge på 1800-tallet. Tillegg A,B,C.
Thorvaldsen, G., & Erikstad, M. (2006). Statistikk basert på individdata fra folketellingene – nye muligheter. Heimen, 43, 41–53.
Thorvaldsen, G., & Solli, A. (2012). Norway: From colonial to computerized censuses. Revista de Demografía Histórica, XXX(1), 107–136.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Clausen, N.F. (2015). The Danish Demographic Database—Principles and Methods for Cleaning and Standardisation of Data. In: Bloothooft, G., Christen, P., Mandemakers, K., Schraagen, M. (eds) Population Reconstruction. Springer, Cham. https://doi.org/10.1007/978-3-319-19884-2_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-19884-2_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19883-5
Online ISBN: 978-3-319-19884-2
eBook Packages: Humanities, Social Sciences and LawSocial Sciences (R0)