Abstract
The authors began collecting personal informative lifelogging data in 2011. The data collected in this article is a discontinuous data set, each of which includes one or more images, a GPS message, a description of a location, and a description of the text, we call this informative lifelogging. However, for the purposes automatically building a story from a huge collection of unstructured egocentric data presents major challenges. This paper first introduces the structure and characteristics of the collected data and uses the DB-scan algorithm to classify the data. Then a model for generating a story is proposed, and a model of story generation based on a story template is proposed in the model. The author implemented a complete software system through code, described a story generation model, and gave the key algorithm to generate stories. Through this system, 418 stories were generated automatically, of which 62% of the stories were particularly accurate. The experimental results verify that it is feasible to automatically generate stories based on personal Informative lifelogging data.
Similar content being viewed by others
References
Andrew A, Eustice K, Hickl AJ. Using location Lifelogs to make meaning of food and physical activity behaviors. Presented at PCTHPIC, (2013).
Asnaoui KE, Hamid A, Brahim A, Mohammed O (2017) A survey of activity recognition in egocentric lifelogging datasets. International Conference on Wireless Technologies, IEEE
Belhadi A, Djenouri Y, Lin CWJ, Cano A (2020) Trajectory outlier detection: algorithms, taxonomies, evaluation and open challenges. ACM Transactions on Management Information Systems.
Bolaos M, Dimiccoli M, Radeva P (2017) Toward storytelling from visual lifelogging: an overview. IEEE Transactions on Human-Machine Systems 47(1):77–90
Chen DL, Dolan WB. “Collecting highly parallel data for paraphrase evaluation,” . in Proc 49th Annu Meeting Assoc Comput Linguistics, 2011, pp. 190–200.
Gao L, Li X, Song J (2019) Hierarchical LSTMs with adaptive attention for visual captioning[J]. IEEE Trans Pattern Anal Mach Intell:1–1
Gurrin C, Smeaton AF, Doherty AR (2014) Sources of lifelog data, in lifelogging: personal bigdata.1st ed.,vol. 8, no. 1. Dublin, Ireland: DCU, pp. 35–47.
Jo H, Ryu J-H. Placegram: A diagrammatic map for personal geotagged data browsing. IEEE Educational Activities Department, (2010).
Kim PH (2011) Web-based research collaboration service: crowd lifelog research case study. International Conference on Next Generation Web Services Practices, IEEE
Kybartas B, Bidarra R (2017) A survey on story generation techniques for authoring computational narratives. IEEE Transactions on Computational Intelligence & Ai in Games
Li Z, Ding B, Kays R, Nye P (2010) Mining periodic behaviors for moving objects. In Proceedings of the16th ACM SIGKDD International Conference onKnowledge Discovery and Data Mining. ACM, 1099–1108.
Liu X, Huet B (2016) Event-based cross media question answering. Multimedia Tools & Applications 75(3):1495–1508
Mandler JM, Johnson NS (1977) Remembrance of things parsed: story structure and recall. Cogn Psychol 9(1):111–151
Mann S (1997) Wearable computing: a first step toward personal imaging. Computer 30(2):25–32
Martin WM, Heylighen A, Cavallin H (2005) The right story at the right time. AI & Soc 19(1):34–47
Mazimpaka JD (2016) Trajectory data mining: a review of methods and applications. Journal of Spatial Information Science 13(13)
Rohrbach A, Torabi A, Rohrbach M, Tandon N, Pal C, Larochelle H, Courville A, Schiele B (2017) Movie description. Int J Comput Vis 123(1):94–120
Song J, Gao L, Nie F, Shen HT, Yan Y, Sebe N (2016) Optimized graph learning using partial tags and multiple features for image and video annotation. IEEE Trans Image Process 25(11):4999–5011
Venugopalan S, Xu H, Donahue J, Rohrbach M, Mooney RJ, Saenko K (2015) Translating videos to natural language using deep recurrent neural networks. NAACL HLT:1494–1504
Venugopalan S, Rohrbach M, Donahue J, Mooney R, Darrell T, Saenko K (2015) Sequence to sequence-video to text. in Proc IEEE Int Conf Comput Vis:4534–4542
Vinyals O, Toshev A, Bengio S, Erhan D (2015) Show and tell: a neural image caption generator. Proc. IEEE Conf. Comput. Vis. Pattern Recognit, In, pp 3156–3164
Wang Y, Zheng Y, Xue Y. (2014) Travel time estimation of a path using sparse trajectories. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 25–34.
Wang X, Gao L, Wang P, Sun X, Liu X (2018) Two-stream 3-d convnet fusion for action recognition in videos with arbitrary size and length. IEEE Transactions on Multimedia 20(3):634–644
Wang J, Yuan Y, Ni T, Ma Y, Shen W (2020) Anomalous trajectory detection and classification based on difference and intersection set distance. IEEE Trans Veh Technol 69(99):2487–2500
Xiao S, Li T, Wang J (2019) Optimization methods of video images processing for mobile object recognition[J]. Multimed Tools Appl
Xu K, Ba J, Kiros R, Cho K, Courville AC, Salakhutdinov R, Zemel RS, Bengio Y (2015) Show, attend and tell: Neural image caption generation with visual attention. in Proc 32nd Int Conf Mach Learn:2048–2057
Xu J, Mei T, Yao T, Rui Y (2016) MSR-VTT: A large video description dataset for bridging video and language. in Proc IEEE Conf Comput Vis Pattern Recognit:5288–5296
Yuan J, Zheng Y, Xie X, Sun G (2013) T-Drive: enhancing driving directions with taxi drivers' intelligence. IEEE Transaction on Knowledge and Data Engineering 25(1):220–232
Yuan NJ, Zheng Y, Zhang L, Xie X (2013) T-finder: a recommender system for finding passengers and vacant taxis. IEEE Transaction on Knowledge and Data Engineering 25(10):2390–2403
Zhou Y, Lau BPL, Koh Z, Yuen C, Ng BKK (2020) Understanding crowd behaviors in a social event by passive wifi sensing and data mining. IEEE Internet of Things Journal
Acknowledgments
The authors would like to thank the anonymous reviewers of this paper for their insightful comments.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Liu, G., Rehman, M.U. & Wu, Y. Toward storytelling from personal informative lifelogging. Multimed Tools Appl 80, 19649–19673 (2021). https://doi.org/10.1007/s11042-020-10453-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-10453-z