Abstract
Social media have changed the world in the last decade. In fact, the rapid and vast adoption of these platforms had an impact on human behavior, in particular, and society in general. All these channels of media are indispensable for our daily life because they enable us to easily communicate, interact and share a variety of content, with one another. In addition, companies have been interested in this phenomenon to discuss and share products with their customers, improving their production from reviews posted on these platforms. However, analyzing and understanding the volume of generated data from social media requires the use of data warehouse methods and techniques to discover new knowledge to assist companies in the decision-making process. In this research paper, we propose an approach for data warehouse construction for opinion analysis, which can help decision-makers to make decisions by analyzing the users' opinions from the data warehouse. In this paper, we are interested more specifically, in the designing of a data mart from social media accounts, such as Facebook, Twitter, and YouTube. Second, we define a set of semantic relations between the multidimensional concepts of data marts to generate the global schema of the data warehouse by integrating three data mart schemes. Then, we proposed a method of opinion analysis from social media based on machine learning techniques to identify the polarity of the users comments posted on social media, and finally, we are focusing on the implementation of our data warehouse on the NoSQL database to make the analysis task easier for the decision-maker.
Similar content being viewed by others
References
Arfaoui, N, and Akaichi, J (2015) Automating schema integration technique case study: generating data warehouse schema from data mart schemas. In: International conference: beyond databases, architectures and structures (pp. 200–209). Springer, Cham https://doi.org/10.1007/978-3-319-18422-7_18
Bringay, S., Bechet, N., Bouillot, F., Poncelet, P., Roche, M., Teisseire, M. (2011). Towards an On-Line Analysis of Tweets Processing. In: International conference on database and expert systems applications (pp. 154-161). Springer, Heidelberg. (DEXA) https://doi.org/10.1007/978-3-642-23091-2_15
Chevalier M, El Malki M, Kopliku A, Teste O, Tournier R (2015). Implementing Multidimensional Data Warehouses into NoSQL. https://doi.org/10.5220/0005379801720183
Cuzzocrea, A, De Maio, C, Fenza, G, Loia, V, and Parente, M (2016) OLAP analysis of multidimensional tweet streams for supporting advanced analytics. In: Proceedings of the 31st annual ACM symposium on applied computing (pp. 992–999). ACM https://doi.org/10.1145/2851613.2851662
Gallinucci E, Golfarelli M, Rizzi S (2013). Meta-stars: multidimensional modeling for social business intelligence. In: Proceedings of the sixteenth international workshop on data warehousing and OLAP. ACM, pp 11–18 https://doi.org/10.1145/2513190.2513195
Girsang AS, Isa SM, Natasya N, Ginzel ME (2020) Implementation of a journalist business intelligence in social media monitoring. System. https://doi.org/10.25046/aj0506182
Gutiérrez-Batista K, Vila MA, Martin-Bautista MJ (2021) Building a fuzzy sentiment dimension for multidimensional analysis in social networks. Appl Soft Comput 108:107390. https://doi.org/10.1016/j.asoc.2021.107390
He W, Zha S, Li L (2013) Social media competitive analysis and text mining: a case study in the pizza industry. Int J Inf Manag 33(3):464–472. https://doi.org/10.1016/j.ijinfomgt.2013.01.001
Inmon W, H (1996) Building the data warehouse, 2nd edn. Wiley, New York
Jenhani F, Gouider MS (2022) Large-scale system for social media data warehousing: the case of twitter-related drug abuse events integration. Int J Data Warehousing Min (IJDWM) 18(1):1–18. https://doi.org/10.4018/IJDWM.290890
Jenhani F, Gouider MS, Said LB (2019) Streaming social media data analysis for events extraction and warehousing using hadoop and storm: drug abuse case study. Proced Comp Sci 159:1459–1467. https://doi.org/10.1016/j.procs.2019.09.316
Jianmin W, Wenbin Z, Tongrang F, Shilong Y, Hongwei L (2020) An improved join-free snowflake schema for ETL and OLAP of data warehouse. Practice and Experience, Concurrency and Computation. https://doi.org/10.1002/cpe.5519
Kimball R (1996) The Data Warehouse Toolkit Practical Techniques for Building Dimensional Data Warehouses. Wiley, New York
Kurnia PF (2018) Business intelligence model to analyze social media information. Proced Comp Sci 135:5–14. https://doi.org/10.1016/j.procs.2018.08.144
Leclercq, É, and Savonnet, M (2018) A tensor based data model for polystore: an application to social networks data. In: Proceedings of the 22nd international database engineering and applications symposium https://doi.org/10.1016/j.procs.2018.08.144
Moalla, I, and Nabli, A (2014) Towards data mart building from social network for opinion analysis. In: International conference on intelligent data engineering and automated learning (pp. 295–302). Springer, Cham https://doi.org/10.1007/978-3-319-10840-7_36
Moalla, I, Nabli, A, and Hammami, M (2017) Integration of a multidimensional schema from different social media to analyze customers' opinions. In: 11th International conference on research challenges in information science (RCIS) (pp. 391–400). IEEE, DOI: https://doi.org/10.1109/RCIS.2017.7956564
Moalla, I, Nabli, A, and Hammami, M (2018) Towards Opinions analysis method from social media for multidimensional analysis. In: Proceedings of the 16th International conference on advances in mobile computing and multimedia (pp. 8–14) https://doi.org/10.1145/3282353.3282367
Rahm E, Bernstein PA (2001) A survey of approaches to automatic schema matching. VLDB J 10(4):334–350. https://doi.org/10.1007/s007780100057
Rehman, N.U, Mansmann, S, Weiler, A, Scholl, M.H (2012) Building a data warehouse for twitter stream exploration. In: proceedings of the 2012 IEEE/ACM International conference on advances in social networks analysis and mining, 1341–1348 https://doi.org/10.1109/ASONAM.2012.230
Valêncio CR, Silva LMM, Tenório W, Zafalon GFD, Colombini AC, Fortes MZ (2020) Data warehouse design to support social media analysis in a big data environment. J Comput Sci. https://doi.org/10.3844/jcssp.2020.126.136
Yangui R, Nabli A, Gargouri F (2016) Automatic transformation of data warehouse schema to NoSQL data base: comparative study. Proced Comp Sci 96:255–264. https://doi.org/10.1016/j.procs.2016.08.138
Author information
Authors and Affiliations
Contributions
All the authors have contributed equally to this work
Corresponding author
Ethics declarations
Conflict of interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Moalla, I., Nabli, A. & Hammami, M. Data warehouse building to support opinion analysis in social media. Soc. Netw. Anal. Min. 12, 123 (2022). https://doi.org/10.1007/s13278-022-00960-2
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-022-00960-2