Reference Work Entry

Encyclopedia of Social Network Analysis and Mining

pp 2337-2343

Web Archives

  • Klaus Berberich, affiliated with Max Planck Institute for Informatics

Synonyms

Web archiving; Web preservation

Glossary

Crawler

Software that harvests content from the World Wide Web

URL

Uniform Resource Locator

Scope

The resources that a web archive seeks to preserve

Website

A collection of related URLs

Definition

Web archives are repositories of web contents collected in the past. They act against the ephemeral nature of the World Wide Web, where new contents are constantly added while others are removed and thus lost forever. Web archives counter this loss by preserving web contents as part of the cultural heritage for future generations. To this end, web archives select resources (e.g., specific websites) worth preserving, repeatedly acquire snapshots of these resources, store them together with metadata (e.g., a time stamp or keywords), and provide access to the archived web contents (e.g., via keyword search). Institutions operating web archives include nonprofit organizations, universities, national libraries, and for-profit companies. Users of web archives ...
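The workflow described in the definition (select resources, repeatedly acquire snapshots, store them with metadata, provide access via keyword search) can be illustrated with a minimal in-memory sketch. It is not taken from the entry: the names `Snapshot`, `WebArchive`, `acquire`, and `search` are hypothetical, the page content is passed in directly instead of being fetched by a crawler, and keywords are assumed to be simply the lowercased words of the content.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass(frozen=True)
class Snapshot:
    """One archived version of a URL plus its metadata (hypothetical type)."""
    url: str
    captured_at: datetime   # time stamp of the capture
    content: str
    keywords: frozenset     # assumption: lowercased words of the content


@dataclass
class WebArchive:
    """Toy archive: a scope of selected URLs and the snapshots taken of them."""
    scope: set
    snapshots: list = field(default_factory=list)

    def acquire(self, url, content, captured_at):
        """Store one snapshot of an in-scope URL together with its metadata."""
        if url not in self.scope:
            raise ValueError(f"{url} is outside the archive's scope")
        keywords = frozenset(content.lower().split())
        snapshot = Snapshot(url, captured_at, content, keywords)
        self.snapshots.append(snapshot)
        return snapshot

    def search(self, keyword):
        """Return all snapshots whose keywords include the term, oldest first."""
        hits = [s for s in self.snapshots if keyword.lower() in s.keywords]
        return sorted(hits, key=lambda s: s.captured_at)


# Usage: two snapshots of the same (hypothetical) URL, taken two years apart.
archive = WebArchive(scope={"http://example.org/"})
archive.acquire("http://example.org/", "New front page",
                datetime(2012, 1, 1, tzinfo=timezone.utc))
archive.acquire("http://example.org/", "Old news page",
                datetime(2010, 1, 1, tzinfo=timezone.utc))

for hit in archive.search("page"):
    print(hit.captured_at.date(), hit.content)
```

Keeping every snapshot rather than overwriting the previous one is what separates this from an ordinary cache: the same URL can be retrieved as it appeared at different points in time.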
