Analysis of Online User Behaviour for Art and Culture Events

Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10410)


Nowadays people share everything on online social networks, from daily life stories to the latest local and global news and events. Many researchers have exploited this content as a source for understanding user behaviour and profiles in various settings. In this paper, we address the specific problem of user behavioural profiling in the context of cultural and artistic events. We propose an analysis pipeline that examines the profiles of online users based on the textual content they publish. The pipeline covers the following aspects: data extraction and enrichment, topic modeling, user clustering, and prediction of interest. We show our approach at work in monitoring participation in a large-scale artistic installation that attracted more than 1.5 million visitors in just two weeks (The Floating Piers, by Christo and Jeanne-Claude). We report our findings and discuss the pros and cons of the work.
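To make the pipeline stages concrete, the following is a minimal sketch in Python, not the method used in the paper. The abstract does not name the algorithms, so everything here is an assumption: the toy posts are invented, a plain word-count profile stands in for the topic-modeling step, k-means with cosine similarity stands in for user clustering, and "prediction of interest" is reduced to assigning a new user to the nearest cluster.

```python
import math
from collections import Counter

# Hypothetical toy data (user -> posts); invented for illustration only.
POSTS = {
    "anna":  ["floating piers art installation lake", "art exhibition christo"],
    "bruno": ["christo art installation amazing", "museum art exhibition"],
    "carla": ["football match goal tonight", "football league goal"],
    "dario": ["match tonight football team", "team goal league"],
}

def profile(posts):
    """Word-count vector for one user (a stand-in for a topic model)."""
    return dict(Counter(w for p in posts for w in p.lower().split()))

def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(v * b.get(w, 0.0) for w, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def mean(vectors):
    """Component-wise mean of sparse vectors."""
    total = Counter()
    for v in vectors:
        total.update(v)
    return {w: c / len(vectors) for w, c in total.items()}

def nearest(vec, centroids):
    """Index of the centroid most similar to vec."""
    return max(range(len(centroids)), key=lambda i: cosine(vec, centroids[i]))

def cluster_users(profiles, k, iterations=10):
    """k-means on user profiles with deterministic farthest-first seeding."""
    users = list(profiles)
    centroids = [profiles[users[0]]]
    while len(centroids) < k:
        # Seed the next centroid with the user least similar to existing ones.
        far = min(users, key=lambda u: max(cosine(profiles[u], c) for c in centroids))
        centroids.append(profiles[far])
    labels = {}
    for _ in range(iterations):
        new_labels = {u: nearest(profiles[u], centroids) for u in users}
        if new_labels == labels:
            break  # assignments are stable
        labels = new_labels
        for i in range(k):
            members = [profiles[u] for u in users if labels[u] == i]
            if members:
                centroids[i] = mean(members)
    return labels, centroids

profiles = {user: profile(posts) for user, posts in POSTS.items()}
labels, centroids = cluster_users(profiles, k=2)

# "Prediction of interest", reduced here to nearest-cluster assignment.
newcomer = profile(["visiting the art installation on the lake"])
predicted_cluster = nearest(newcomer, centroids)
print(labels, predicted_cluster)
```

On this toy input the two art-oriented users end up in one cluster and the two football-oriented users in the other, and the newcomer is assigned to the art cluster. A real pipeline would replace each stage with the corresponding component (API-based extraction, a proper topic model, a validated clustering, and a trained classifier).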


Social media · Big data · Behaviour analysis · Data mining



Copyright information

© IFIP International Federation for Information Processing 2017

Authors and Affiliations

  1. Politecnico di Milano, Milano, Italy
