Abstract
An emerging trend in social media is for users to create and publish “stories”, or curated lists of Web resources, with the purpose of creating a particular narrative of interest to the user. While some stories on the Web are automatically generated, such as Facebook’s “Year in Review”, one of the most popular storytelling services is “Storify”, which provides users with curation tools to select, arrange, and annotate stories with content from social media and the Web at large. We would like to use tools, such as Storify, to present (semi-)automatically created summaries of archival collections. To support automatic story creation, we need to better understand as a baseline the structural characteristics of popular (i.e., receiving the most views) human-generated stories. We investigated 14,568 stories from Storify, comprising 1,251,160 individual resources, and found that popular stories (i.e., top 25 % of views normalized by time available on the Web) have the following characteristics: 2/28/1950 elements (min/median/max), a median of 12 multimedia resources (e.g., images, video), 38 % receive continuing edits, and 11 % of their elements are missing from the live Web. We also checked the population of Archive-It collections (3109 collections comprising 305,522 seed URIs) for better understanding the characteristics of the collections that we intend to summarize. We found that the resources in human-generated stories are different from the resources in Archive-It collections. In summarizing a collection, we can only choose from what is archived (e.g., twitter.com is popular in Storify, but rare in Archive-It). However, some other characteristics of human-generated stories will be applicable, such as the number of resources.
Notes
http://timetravel.mementoweb.org/guide/api/, which provided results from 12 different public web archives.
References
Ainsworth, S.G., AlSum, A., SalahEldeen, H., Weigle, M.C., Nelson, M.L.: How much of the web is archived? In: Proceedings of the 11th ACM/IEEE-CS joint conference on digital libraries, JCDL ’11, pp. 133–136. ACM Press, New York (2011). doi:10.1145/1998076.1998100
AlNoamany, Y.: Using web archives to enrich the live web experience through storytelling. Ph.D. thesis, Old Dominion University (2016)
AlNoamany, Y., AlSum, A., Weigle, M.C., Nelson, M.L.: Who and what links to the internet archive. Int. J. Digit. Libr. 14(3–4), 101–115 (2014). doi:10.1007/s00799-014-0111-5
AlNoamany, Y., Weigle, M.C., Nelson, M.L.: Characteristics of social media stories. In: Proceedings of the 19th International conference on theory and practice of digital libraries, TPDL ’15, pp. 267–279. Springer International Publishing, Cham (2015). doi:10.1007/978-3-319-24592-8_20
AlNoamany, Y., Weigle, M.C., Nelson, M.L.: Detecting off-topic pages in web archives. In: Proceedings of the 19th international conference on theory and practice of digital libraries,TPDL’15, vol. 9316, pp. 225–237. Springer International Publishing (2015). doi:10.1007/978-3-319-24592-8_17
Bar-Yossef, Z., Broder, A.Z., Kumar, R., Tomkins, A.: Sic transit gloria telae: towards an understanding of the web’s decay. In: Proceedings of the 13th international conference on World Wide Web, WWW ’04, pp. 328–337 (2004). doi:10.1145/988672.988716
Brewington, B., Cybenko, G.: Keeping up with the changing web. Computer 33(5), 52–58 (2000). doi:10.1109/2.841784
Cohen, J., Mihailidis, P.: Storify and news curation: teaching and learning about digital storytelling. In: Second annual social media technology conference & workshop, vol. 1, pp. 27–31 (2012)
Duh, K., Hirao, T., Kimura, A., Ishiguro, K., Iwata, T., Yeung, C.M.A.: Creating stories: social curation of twitter messages. In: Proceedings of the 6th International AAAI Conference on Weblogs and Social Media, ICWSM’ 12 (2012)
Hall, C., Zarro, M.: Social curation on the website pinterest.com. Am. Soc. Inf. Sci. Technol. 49(1), 1–9 (2012)
Han, J., Choi, D., Choi, A.Y., Choi, J., Chung, T., Kwon, T.T., Rha, J.Y., Chuah, C.N.: Sharing topics in pinterest: understanding content creation and diffusion behaviors. In: Proceedings of the 2015 ACM on conference on online social networks, COSN ’15, pp. 245–255. ACM, New York (2015). doi:10.1145/2817946.2817961
Kieu, B.T., Ichise, R., Pham, S.B.: Predicting the popularity of social curation. In: Knowledge and systems engineering, pp. 413–424. Springer, Cham (2015)
Klein, M., Nelson, M.L.: Find, new, copy, web, page—tagging for the (re-)discovery of web pages. In: Proceedings of the 15th international conference on theory and practice of digital libraries, TPDL’11, pp. 27–39. Springer, Berlin, Heidelberg (2011). doi:10.1007/978-3-642-24469-8_5
Klein, M., Van de Sompel, H., Sanderson, R., Shankar, H., Balakireva, L., Zhou, K., Tobin, R.: Scholarly context not found: one in five articles suffers from reference rot. PloS One 9(12), e115,253 (2014). doi:10.1371/journal.pone.0115253
Koehler, W.: Web page change and persistence-a four-year longitudinal study. J. Am. Soc. Inf. Sci. Technol. 53(2), 162–171 (2002)
Kruskal, W.H., Wallis, W.A.: Use of ranks in one-criterion variance analysis. J. Am. Stat. Assoc. 47(260), 583–621 (1952). doi:10.1080/01621459.1952.10483441
Laire, D., Casteleyn, J., Mottart, A.: Social media’s learning outcomes within writing instruction in the EFL classroom: exploring, implementing and analyzing storify. Proc. Soc. Behav. Sci. 69, 442–448 (2012)
Lawrence, S., Pennock, D.M., Flake, G.W., Krovetz, R., Coetzee, F.M., Glover, E., Nielsen, F.A., Kruger, A., Giles, C.L.: Persistence of web references in scientific research. Computer 34(2), 26–31 (2001). doi:10.1109/2.901164
Lyman, P.: Archiving the world wide web. building a national strategy for digital preservation: issues in digital media archiving, pp. 38–51 (2002)
Marshall, C., McCown, F., Nelson, M.: Evaluating personal archiving strategies for internet-based information. Proc. Archiv. 2007(1), 151–156 (2007)
Mihailidis, P., Cohen, J.N.: Exploring curation as a core competency in digital and media literacy education. J. Interact. Media Educ. 2013, 1–19 (2013). doi:10.5334/2013-02
Mohr, G., Stack, M., Ranitovic, I., Avery, D., Kimpton, M.: An introduction to heritrix an open source archival quality web crawler. In: Proceedings of the 4th international web archiving workshop, IWAW ’04, pp. 43–49 (2004)
Negulescu, K.C.: Web archiving @ the Internet Archive. Presentation at the 2010 Digital Preservation Partners Meeting, http://www.digitalpreservation.gov/meetings/documents/ndiipp10/NDIIPP072110FinalIA.ppt (2010)
Nelson, M.L.: A plan for curating “Obsolete Data or Resources”. Tech. Rep. arXiv:1209.2664 (2012)
Ottoni, R., Las Casas, D., Pesce, J.P., Meira Jr, W., Wilson, C., Mislove, A., Almeida, V.: Of pins and tweets: investigating how users behave across image-and text-based social networks. In: Proceedings of the 8th international AAAI conference on weblogs and social media, ICWSM’ 14, pp. 386–395 (2014)
Padia, K., AlNoamany, Y., Weigle, M.C.: Visualizing digital collections at archive-It. In: Proceedings of the 12th annual international ACM/IEEE joint conference on digital libraries, JCDL ’12, pp. 15–18 (2012). doi:10.1145/2232817.2232821
Palomo, B., Palomo, B.: New information narratives: the case of storify. Hipertext.net 12 (2014). doi:10.2436/20.8050.01.6
SalahEldeen, H.M., Nelson, M.L.: Losing my revolution: How many resources shared on social media have been lost? In: Proceedings of the 16th international conference on theory and practice of digital libraries, TPDL’12, pp. 125–137. Springer-Verlag, Cham (2012). doi:10.1007/978-3-642-33290-6_14
SalahEldeen, H.M., Nelson, M.L.: Carbon dating the web: estimating the age of web resources. In: Proceedings of 3rd temporal web analytics workshop, TempWeb ’13, pp. 1075–1082 (2013)
Sastry, N.: Predicting pinterest: organising the world’s images with human-machine collaboration. In: Proceedings of the 24th international conference on world wide web, WWW ’15 Companion, pp. 1065–1065. International World Wide Web Conferences Steering Committee (2015). doi:10.1145/2740908.2744719
Seitzinger, J.: Curate me! exploring online identity through social curation in networked learning. In: Proceedings of the 9th international conference on networked learning, pp. 7–9 (2014)
Stanoevska-Slabeva, K., Sacco, V., Giardina, M.: Content curation: a new form of gatewatching for social media? In: Proceedings of the 12th international symposium on online journalism (2012). http://online.journalism.utexas.edu/2012/papers/Katarina.pdf
Taylor, M.: Introduction to javascript object notation: a to-the-point guide to JSON. CreateSpace Independent Publishing Platform, USA (2014)
Tofel, B.: Wayback for accessing web archives. In: Proceedings of international web archiving workshop. IWAW (2007). http://iwaw.europarchive.org/07/IWAW2007_tofel.pdf
Van de Sompel, H., Nelson, M.L., Sanderson, R.: RFC 7089—HTTP framework for time-based access to resource states–Memento (2013). http://tools.ietf.org/html/rfc7089
Zhong, C., Salehi, M., Shah, S., Cobzarenco, M., Sastry, N., Cha, M.: Social bootstrapping: how pinterest and last.fm social communities benefit by borrowing links from facebook. In: Proceedings of the 23rd international conference on World Wide Web, WWW ’14, pp. 305–314. ACM, New York (2014). doi:10.1145/2566486.2568031
Zhong, C., Shah, S., Sundaravadivelan, K., Sastry, N.: Sharing the loves: understanding the how and why of online content curation. In: Proceedings of the 7th international AAAI conference on weblogs and social media, ICWSM’ 13, pp. 659–667 (2013)
Acknowledgments
This work was supported in part by IMLS LG-71-15-0077-15. We thank Kristine Hanna and Jefferson Bailey of the Internet Archive for the Archive-It data and baseline story summaries.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
AlNoamany, Y., Weigle, M.C. & Nelson, M.L. Characteristics of social media stories. Int J Digit Libr 17, 239–256 (2016). https://doi.org/10.1007/s00799-016-0185-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00799-016-0185-3