The evolution of web archiving

Article

DOI: 10.1007/s00799-016-0171-9

Cite this article as:
Costa, M., Gomes, D. & Silva, M.J. Int J Digit Libr (2016). doi:10.1007/s00799-016-0171-9

Abstract

Web archives preserve information published on the web or digitized from printed publications. Much of this information is unique and historically valuable. However, the lack of knowledge about the global status of web archiving initiatives hamper their improvement and collaboration. To overcome this problem, we conducted two surveys, in 2010 and 2014, which provide a comprehensive characterization on web archiving initiatives and their evolution. We identified several patterns and trends that highlight challenges and opportunities. We discuss these patterns and trends that enable to define strategies, estimate resources and provide guidelines for research and development of better technology. Our results show that during the last years there was a significant growth in initiatives and countries hosting these initiatives, volume of data and number of contents preserved. While this indicates that the web archiving community is dedicating a growing effort on preserving digital information, other results presented throughout the paper raise concerns such as the small amount of archived data in comparison with the amount of data that is being published online.

Keywords

Web archiving Digital preservation Survey 

Copyright information

© Springer-Verlag Berlin Heidelberg 2016

Authors and Affiliations

  1. 1.Departamento de Informática, Faculdade de CiênciasUniversidade de LisboaLisbonPortugal
  2. 2.Foundation for National Scientific ComputingLisbonPortugal
  3. 3.INESC-ID, Instituto Superior TécnicoUniversidade de LisboaLisbonPortugal