ArchiveWeb: Collaboratively Extending and Exploring Web Archive Collections

  • Conference paper
Research and Advanced Technology for Digital Libraries (TPDL 2016)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9819)

Abstract

Curated web archive collections contain focused digital content that archiving organizations collect to provide a representative sample of specific topics and events and to preserve it for future exploration and analysis. In this paper, we discuss how best to support the collaborative construction and exploration of these collections through the ArchiveWeb system. ArchiveWeb has been developed using an iterative, evaluation-driven, design-based research approach, with considerable user feedback at all stages. This paper describes the functionalities of our current prototype for searching, constructing, exploring and discussing web archive collections, reports feedback on this prototype from seven archiving organizations, and outlines our plans for improving the next release of the system.

Notes

  1. http://alexandria-project.eu/
  2. http://archive-it.org/
  3. http://archiveweb.l3s.uni-hannover.de/aw/index.jsf
  4. http://archive.org/web/
  5. http://archivethe.net/
  6. https://tools.ietf.org/html/rfc7089
  7. http://www.tibetinfonet.net/
  8. https://archive-it.org/blog/post/only-41-of-occupy-movement-urls-accessible-on-live-web/
  9. http://datamarket.azure.com/dataset/bing/search
  10. https://github.com/internetarchive/wayback/tree/master/wayback-cdx-server

Acknowledgments

We thank Jefferson Bailey from the Internet Archive, who put us in contact with his colleagues at university libraries and archiving institutions. We are also grateful to all the experts who participated with enthusiasm in our evaluation, providing valuable feedback and useful suggestions to improve the system. This work was partially funded by the European Commission in the context of the ALEXANDRIA project (ERC Advanced Grant no. 339233).

Corresponding author

Correspondence to Zeon Trevor Fernando.

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Fernando, Z.T., Marenzi, I., Nejdl, W., Kalyani, R. (2016). ArchiveWeb: Collaboratively Extending and Exploring Web Archive Collections. In: Fuhr, N., Kovács, L., Risse, T., Nejdl, W. (eds) Research and Advanced Technology for Digital Libraries. TPDL 2016. Lecture Notes in Computer Science, vol 9819. Springer, Cham. https://doi.org/10.1007/978-3-319-43997-6_9

  • DOI: https://doi.org/10.1007/978-3-319-43997-6_9

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-43996-9

  • Online ISBN: 978-3-319-43997-6

  • eBook Packages: Computer Science, Computer Science (R0)
