A First Experience in Archiving the French Web

  • S. Abiteboul
  • G. Cobéna
  • J. Masanes
  • G. Sedrati
Conference paper

DOI: 10.1007/3-540-45747-X_1

Part of the Lecture Notes in Computer Science book series (LNCS, volume 2458)
Cite this paper as:
Abiteboul S., Cobéna G., Masanes J., Sedrati G. (2002) A First Experience in Archiving the French Web. In: Agosti M., Thanos C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2002. Lecture Notes in Computer Science, vol 2458. Springer, Berlin, Heidelberg

Abstract

The web is a more and more valuable source of information and organizations are involved in archiving (portions of) it for various purposes, e.g., the Internet Archive www.archive.org. A new mission of the French National Library (BnF) is the “dépôt légal” (legal deposit) of the French web. We describe here some preliminary work on the topic conducted by BnF and INRIA. In particular, we consider the acquisition of the web archive. Issues are the definition of the perimeter of the French web and the choice of pages to read once or more times (to take changes into account). When several copies of the same page are kept, this leads to versioning issues that we briefly consider. Finally, we mention some first experiments.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • S. Abiteboul
    • 1
    • 3
  • G. Cobéna
    • 1
  • J. Masanes
    • 2
  • G. Sedrati
    • 3
  1. 1.INRIAFrance
  2. 2.BnFFrance
  3. 3.XylemeFrance

Personalised recommendations