Towards the Paradigm of Information Warehousing: Application to Twitter

  • Hadjer Moulai
  • Habiba Drias
Conference paper
Part of the Lecture Notes in Networks and Systems book series (LNNS, volume 50)


Over the last decade, social media have dominated our lives. The exploding volume of data produced by these platforms has triggered a wave of research focused mainly on storing and analyzing this data. In this paper, we propose an original information warehouse architecture for the storage and analysis of social media information. A multidimensional model is defined, and the information is extracted, transformed and loaded into the warehouse through an ETL (Extract, Transform, Load) process. The framework is implemented for Twitter, and the collected tweets are mined with a clustering algorithm to uncover the most discussed topics. The preliminary results are satisfactory, and the proposed paradigm can be applied to other information sources such as newspapers and scientific papers.
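
The ETL step into a multidimensional model can be illustrated with a minimal sketch. It is not the authors' implementation: the record fields, the table names (`dim_user`, `dim_date`, `fact_tweet`) and the in-memory SQLite store are all assumptions chosen for illustration, and the sample records are hypothetical rather than collected tweets.

```python
import sqlite3
from datetime import datetime

# Hypothetical raw records; the field names are illustrative and do not
# follow the Twitter API schema.
RAW_TWEETS = [
    {"id": 1, "user": "Alice", "text": "  Data warehouses  are back! ",
     "created": "2018-03-01T10:15:00"},
    {"id": 2, "user": "bob", "text": "Clustering tweets by topic",
     "created": "2018-03-01T11:40:00"},
    {"id": 3, "user": "alice", "text": "ETL in three steps",
     "created": "2018-03-02T09:05:00"},
]


def extract():
    """Extract: return a copy of the raw records (stands in for a real source)."""
    return list(RAW_TWEETS)


def transform(rows):
    """Transform: normalise user names, collapse whitespace, derive the day."""
    cleaned = []
    for row in rows:
        timestamp = datetime.fromisoformat(row["created"])
        cleaned.append({
            "tweet_id": row["id"],
            "user": row["user"].lower(),
            "day": timestamp.date().isoformat(),
            "text": " ".join(row["text"].split()),
        })
    return cleaned


def load(conn, rows):
    """Load: fill two dimension tables and one fact table (a small star schema)."""
    conn.executescript("""
        CREATE TABLE IF NOT EXISTS dim_user (
            user_id INTEGER PRIMARY KEY, name TEXT UNIQUE);
        CREATE TABLE IF NOT EXISTS dim_date (
            date_id INTEGER PRIMARY KEY, day TEXT UNIQUE);
        CREATE TABLE IF NOT EXISTS fact_tweet (
            tweet_id INTEGER PRIMARY KEY,
            user_id INTEGER REFERENCES dim_user(user_id),
            date_id INTEGER REFERENCES dim_date(date_id),
            text TEXT);
    """)
    for row in rows:
        conn.execute("INSERT OR IGNORE INTO dim_user (name) VALUES (?)",
                     (row["user"],))
        conn.execute("INSERT OR IGNORE INTO dim_date (day) VALUES (?)",
                     (row["day"],))
        user_id = conn.execute("SELECT user_id FROM dim_user WHERE name = ?",
                               (row["user"],)).fetchone()[0]
        date_id = conn.execute("SELECT date_id FROM dim_date WHERE day = ?",
                               (row["day"],)).fetchone()[0]
        conn.execute("INSERT OR REPLACE INTO fact_tweet VALUES (?, ?, ?, ?)",
                     (row["tweet_id"], user_id, date_id, row["text"]))
    conn.commit()


def tweets_per_day(conn):
    """Aggregate the fact table along the date dimension."""
    return conn.execute("""
        SELECT d.day, COUNT(*)
        FROM fact_tweet AS f JOIN dim_date AS d ON f.date_id = d.date_id
        GROUP BY d.day ORDER BY d.day
    """).fetchall()


conn = sqlite3.connect(":memory:")
load(conn, transform(extract()))
```

Calling `tweets_per_day(conn)` then aggregates the facts by day. Keeping users and dates in their own dimension tables is what lets the same fact table be summarised along either axis.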


Keywords: Information warehouse; Social media; Multidimensional model; Data mining; Twitter



Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. LRIA, USTHB, Algiers, Algeria
