International Journal on Digital Libraries

, Volume 19, Issue 1, pp 39–55 | Cite as

ArchiveWeb: collaboratively extending and exploring web archive collections—How would you like to work with your collections?

  • Zeon Trevor Fernando
  • Ivana Marenzi
  • Wolfgang Nejdl


Curated web archive collections contain focused digital content which is collected by archiving organizations, groups, and individuals to provide a representative sample covering specific topics and events to preserve them for future exploration and analysis. In this paper, we discuss how to best support collaborative construction and exploration of these collections through the ArchiveWeb system. ArchiveWeb has been developed using an iterative evaluation-driven design-based research approach, with considerable user feedback at all stages. The first part of this paper describes the important insights we gained from our initial requirements engineering phase during the first year of the project and the main functionalities of the current ArchiveWeb system for searching, constructing, exploring, and discussing web archive collections. The second part summarizes the feedback we received on this version from archiving organizations and libraries, as well as our corresponding plans for improving and extending the system for the next release.


Working with web archives Collaborative search and exploration Web archive requirements and evaluation 



We especially thank Jefferson Bailey from the Internet Archive who provided us with the contacts to his colleagues at university libraries and archiving institutions, and for his helpful comments during the requirements and evaluation phase. We are also grateful to all experts, who participated with enthusiasm in our evaluation, providing valuable feedback and useful suggestions to improve the ArchiveWeb system. This work was partially funded by the European Commission in the context of the Alexandria project (ERC advanced Grant No. 339233).


  1. 1.
    Alonso, O., Strötgen, J., Baeza-Yates, R., Gertz, M.: Temporal information retrieval: challenges and opportunities. In: Proceedings of the 1st international temporal web analytics workshop (TWAW 2011) associated to WWW’11, pp. 1–8 (2011)Google Scholar
  2. 2.
    Bragg, M., Hanna, K., Donovan, L., Hukill, G., Peterson, A.: The web archiving life cycle model. White Paper. (2013)
  3. 3.
    Cutrell, E., Robbins, D., Dumais, S., Sarin, R.: Fast, flexible filtering with phlat. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI ’06), pp. 261–270 (2006)Google Scholar
  4. 4.
    Dougherty, M., van den Heuvel, C.: Historical infrastructures for web archiving: annotation of ephemeral collections for researchers and cultural heritage institutions. (2009)
  5. 5.
    Dumais, S., Cutrell, E., Cadiz, J., Jancke, G., Sarin, R., Robbins, D.C.: Stuff I’ve seen: a system for personal information retrieval and re-use. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Informaion Retrieval (SIGIR ’03), pp. 72–79 (2003)Google Scholar
  6. 6.
    Fernando, Z.T., Marenzi, I., Nejdl, W., Kalyani, R.: Archiveweb: collaboratively extending and exploring web archive collections. In: Proceedings of 20th International Conference on Theory and Practice of Digital Libraries: Research and Advanced Technology for Digital Libraries (TPDL ’16), pp. 107–118 (2016)Google Scholar
  7. 7.
    Gomes, D., Miranda, J., Costa, M.: A survey on web archiving initiatives. In: Proceedings of the 15th International Conference on Theory and Practice of Digital Libraries: Research and Advanced Technology for Digital Libraries (TPDL ’11), pp. 1045–1050 (2011)Google Scholar
  8. 8.
    Jackson, A., Lin, J., Milligan, I., Ruest, N.: Desiderata for exploratory search interfaces to web archives in support of scholarly activities. In: Proceedings of the 16th Joint Conference on Digital Libraries, JCDL ’16, pp. 103–106 (2016)Google Scholar
  9. 9.
    Lieser, W.: Digital Art (Art Pocket). H.F.Ullmann Publishing GmbH, Berlin (2009)Google Scholar
  10. 10.
    Lin, J., Gholami, M., Rao, J.: Infrastructure for supporting exploration and discovery in web archives. In: Proceedings of the 23rd International Conference on World Wide Web (WWW ’14), pp. 851–856 (2014)Google Scholar
  11. 11.
    Marenzi, I.: Multiliteracies and e-learning2.0. In: Blell, G., Kupetz, R. (eds.) Foreign Language Pedagogy, Content and Learner Oriented, vol. 28. Peter Lang, Frankfurt am Main (2014)Google Scholar
  12. 12.
    Marenzi, I., Nejdl, W.: I search therefore I learn—active and collaborative learning in language teaching: two case studies. In: Okada, A., Connolly, T., Scott, P. (eds.) Collaborative Learning 2.0: Open Educational Resources, pp. 103–125. IGI Global, Hershei, PA (2012)Google Scholar
  13. 13.
    Marenzi, I., Zerr, S.: Multiliteracies and active learning in CLIL—the development of LearnWeb2.0. In: IEEE Trans. Learn. Technol. (TLT) 5, 336–348 (2012)Google Scholar
  14. 14.
    Odijk, D., Gârbacea, C., Schoegje, T., Hollink, L., de Boer, V., Ribbens, K., van Ossenbruggen, J.: Supporting Exploration of Historical Perspectives across Collections. In: Proceedings of 19th International Conference on Theory and Practice of Digital Libraries (TPDL ’15), pp. 238–251 (2015)Google Scholar
  15. 15.
    Padia, K., AlNoamany, Y., Weigle, M.C.: Visualizing digital collections at Archive-It. In: Proceedings of the 12th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL ’12), pp. 15–18 (2012)Google Scholar
  16. 16.
    Ras, M., van Bussel, S.: Web archiving user survey. Technical report, National Library of the Netherlands (Koninklijke Bibliotheek) (2007)Google Scholar
  17. 17.
    Stalker, P.J.: Gaming in Art: A Case Study of Two Examples of the Artistic Appropriation of Computer Games and the Mapping of Historical Trajectories Of ’Art Games’ Versus Mainstream Computer Games. University of Witwatersrand, South Africa (2005)Google Scholar
  18. 18.
    Weikum, G., Ntarmos, N., Spaniol, M., Triantafillou, P., Benczúr, A., Kirkpatrick, S., Rigaux, P., Williamson, M.: Longitudinal analytics on web archive data: it’s about time! In: Proceedings of the \(5^{th}\) biennial Conference on Innovative Data Systems Research (CIDR), Asilomar, CA, USA, January 9-12, pp. 199–202 (2011)Google Scholar
  19. 19.
    Winters, J.: Tackling complexity in humanities big data: From parliamentary proceedings to the archived web. In Hiltunen, T., McVeigh, J., Säily, T. (eds.), Big and Rich Data in English Corpus Linguistics: Methods and Explorations. Studies in Variation, Contacts and Change in English. Helsinki: VARIENG (Forthcoming 2017)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2017

Authors and Affiliations

  • Zeon Trevor Fernando
    • 1
  • Ivana Marenzi
    • 1
  • Wolfgang Nejdl
    • 1
  1. 1.L3S Research CenterHannoverGermany

Personalised recommendations