World Wide Web

, Volume 17, Issue 4, pp 799–825 | Cite as

Towards building a blog preservation platform

  • Nikos KasioumisEmail author
  • Vangelis Banos
  • Hendrik Kalb


Social media content and user participation has increased dramatically since the advent of Web 2.0. Blogs have become relevant to every aspect of business and personal life. Nevertheless, we do not have the right tools to aggregate and preserve blog content correctly, as well as to manage blog archives effectively. Given the rising importance of blogs, it is crucial to build systems to facilitate blog preservation, safeguarding an essential part of our heritage that will prove valuable for current and future generations. In this paper, we present our work in progress towards building a novel blog preservation platform featuring robust digital preservation, management and dissemination facilities for blogs. This work is part of the BlogForever project which is aiming to make an impact to the theory and practice of blog preservation by creating guidelines and software that any individual or organization could use to preserve their blogs.


Blog preservation Web archiving 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Ainsworth, S.G., Alsum, A., SalahEldeen, H., Weigle, M.C., Nelson, M.L.: How much of the web is archived? In: Proceeding of the 11th Annual International ACM/IEEE Joint Conference, p. 133. ACM Press, New York, NY (2011)Google Scholar
  2. 2.
    Arango, S., Pinsent, E., Sleeman, P., Gkotsis, G., Stepanyan, K., Rynning, M., Kopidaki, S.: Blogforever: D5.2 implementation of case studies. Tech. rep. (2013)Google Scholar
  3. 3.
    Ashley, K., Davis, R., Guy, M., Kelly, B., Pinsent, E., Farrell, S.: A guide to web preservation. Tech. rep., Joint Information Systems Committee (JISC) (2010)Google Scholar
  4. 4.
    Banos, V., Arango-Docio, S., Pinsent, E., Sleeman, P.: Blogforever: D5.5 case studies comparative analysis and conclusions. Tech. rep. (2013)Google Scholar
  5. 5.
    Bondi, A.B.: Characteristics of scalability and their impact on performance. In: Proceedings of the 2nd International Workshop on Software and Performance, pp. 195–203. ACM (2000)Google Scholar
  6. 6.
    Chittenden, T.: Digital dressing up: modelling female teen identity in the discursive spaces of the fashion blogosphere. J. Youth Stud. 13(4), 505–520 (2010)CrossRefGoogle Scholar
  7. 7.
    Cooper, B.F., Silberstein, A., Tam, E., Ramakrishnan, R., Sears, R.: Benchmarking cloud serving systems with YCSB. In: Proceedings of the 1st ACM Symposium on Cloud Computing, pp. 143–154. ACM (2010)Google Scholar
  8. 8.
    Elsas, J.L., Arguello, J., Callan, J., Carbonell, J.G.: Retrieval and feedback models for blog feed search. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’08, pp. 347–354. ACM, New York, NY (2008)CrossRefGoogle Scholar
  9. 9.
    Fitzner, D.: Requirements specification of the TELEIOS user community. Tech. rep., Fraunhofer IGD (2010)Google Scholar
  10. 10.
    Gomes, D., Miranda, J., Costa, M.: A survey on web archiving initiatives. In: Gradmann, S., Borri, F., Meghini, C., Schuldt, H. (Eds.) Research and Advanced Technology for Digital Libraries. Lecture Notes in Computer Science, vol. 6966, pp. 408–420. Springer, Berlin/Heidelberg (2011)CrossRefGoogle Scholar
  11. 11.
    Group, T.O.: Interoperability requirements (2009). Accessed 17 Oct 2012
  12. 12.
    Hetzel, W.C.: The Complete Guide to Software Testing. QED Information Sciences, Wellesley, MA (1988)Google Scholar
  13. 13.
    Hockx-Yu, H.: The past issue of the web. In: Proceedings of the ACM WebSci’11. Koblenz, Germany (2011)Google Scholar
  14. 14.
    IEEE recommended practice for software requirements specifications. IEEE Std 1–40 (1998). doi: 10.1109/IEEESTD.1998.88286
  15. 15.
    Johnson, R.B., Onwuegbuzie, A.J.: Mixed methods research: a research paradigm whose time has come. Educ. Res. 33(7), 14–26 (2004)CrossRefGoogle Scholar
  16. 16.
    Kalb, H., Kasioumis, N., Llopis, J.G., Postaci, S., Arango-Docio, S.: Blogforever: D4.1 user requirements and platform specifications. Tech. rep. (2011)Google Scholar
  17. 17.
    Kalb, H., Trier, M.: The blogosphere as œuvre: individual and collective influences on bloggers. In: ECIS 2012 Proceedings (2012).
  18. 18.
    Kaplan, A.M., Haenlein, M.: The early bird catches the news: nine things you should know about micro-blogging. Bus. Horizons 54(2), 105–113 (2011)CrossRefGoogle Scholar
  19. 19.
    Kim, Y., Ross, S., Stepanyan, K., Pinsent, E., Sleeman, P., Arango-Docio, S., Banos, V., Trochidis, I., Llopis, J.G., Kalb, H.: Blogforever: D3.1 preservation strategy report. Tech. rep. (2012)Google Scholar
  20. 20.
    Lavoie, B.: The open archival information system reference model: introductory guide. Microform & Digitization Review 33(2), 68–81 (2004)Google Scholar
  21. 21.
    Lindahl, C., Blount, E.: Weblogs: simplifying web publishing. Computer 36(11), 114–116 (2003)CrossRefGoogle Scholar
  22. 22.
    Masanès, J.: Web Archiving. Springer-Verlag, Berlin, Heidelberg (2006)CrossRefGoogle Scholar
  23. 23.
    McEwen, S.: Requirements: An introduction (2004). Accessed 17 Oct 2012
  24. 24.
  25. 25.
    Pennock, M., Davis, R.M.: Archivepress: a really simple solution to archiving blog content. In: 6th International Conference on Preservation of Digital Objects (iPRES 2009), pp. 148–154. California Digital Library, San Francisco (2009)Google Scholar
  26. 26.
    Rusbridge, C.: Preservation for scholarly blogs (2009). Accessed 17 Oct 2012
  27. 27.
    Rynning, M., Banos, V., Stepanyan, K., Joy, M., Gulliksen, M.: Blogforever: D2.4 weblog spider prototype and associated methodology. Tech. rep. (2011)Google Scholar
  28. 28.
    Sanderson, R., Shankar, H., Ainsworth, S., McCown, F., Adams, S.: Implementing time travel for the web. The Code4Lib Journal (13) (2011).
  29. 29.
    Stepanyan, K., Gkotsis, G., Kalb, H., Kim, Y., Cristea, A., Joy, M., Trier, M., Ross, S.: Blogs as objects of preservation: advancing the discussion on significant properties. In: 9th International Conference on Preservation of Digital Objects (iPRES 2012). Toronto, Canada (2012)Google Scholar
  30. 30.
    Stepanyan, K., Gkotsis, G., Pincent, E., Banos V. abd Davis, R.: Blogforever: D2.6 data extraction methodology. Tech. rep. (2012)Google Scholar
  31. 31.
    Stepanyan, K., Joy, M., Cristea, A., Kim, Y., Pinsent, E., Kopidaki, S.: Blogforever: D2.3 weblog data model. Tech. rep. (2011)Google Scholar
  32. 32.
    Vangelis Banos, N.B., Manolopoulos, Y.: Trends in blog preservation. In: ICEIS Conference Proceedings (2012)Google Scholar
  33. 33.
    Weltevrede, E., Helmond, A.: Where do bloggers blog? Platform transitions within the historical dutch blogosphere. First Monday 17(2–6) (2012).
  34. 34.
    Zúñiga, V.T.: Blogs as an effective tool to teach and popularize physics: a case study. Latin-American Journal of Physics Education 3(2), 4 (2009)Google Scholar

Copyright information

© Springer Science+Business Media New York 2013

Authors and Affiliations

  1. 1.CERNGeneva 23Switzerland
  2. 2.Department of InformaticsAristotle University of ThessalonikiThesssalonikiGreece
  3. 3.Technische Universität BerlinBerlinGermany

Personalised recommendations