Abstract
Business intelligence and data analytics projects often involve low-level, ad hoc data wrangling and programming, which increases development effort and reduces usability of the resulting analytics solutions. Conceptual modeling allows to move data analytics onto a higher level of abstraction, facilitating the implementation and use of analytics solutions. In this chapter, we provide an overview of the data analytics landscape and explain, along the (big) data analysis pipeline, how conceptual modeling methods may benefit the development and use of data analytics solutions. We review existing literature and illustrate common issues as well as solutions using examples from cooperative research projects in the domains of precision dairy farming and air traffic management. We target practitioners involved in the planning and implementation of business intelligence and analytics projects as well as researchers interested in the application of conceptual modeling to business intelligence and analytics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Agrawal, D., Bernstein, P., Bertino, E., Davidson, S., Dayal, U., Franklin, M., others: Challenges and opportunities with big data – a community white paper developed by leading researchers across the United States. Tech. rep., Computing Community Consortium (2012), https://cra.org/ccc/resources/ccc-led-whitepapers/, accessed: 23 June 2020
Anderlik, S., Neumayr, B., Schrefl, M.: Using domain ontologies as semantic dimensions in data warehouses. In: Atzeni, P., Cheung, D.W., Ram, S. (eds.) ER 2012. LNCS, vol. 7532, pp. 88–101. Springer (2012). https://doi.org/10.1007/978-3-642-34002-4_7
Bala, M., Boussaid, O., Alimazighi, Z.: A fine-grained distribution approach for ETL processes in big data environments. Data & Knowledge Engineering 111, 114–136 (2017)
Baldacci, L., Golfarelli, M., Graziani, S., Rizzi, S.: QETL: An approach to on-demand ETL from non-owned data sources. Data & Knowledge Engineering 112, 17–37 (2017). https://doi.org/10.1016/j.datak.2017.09.002
Becker, J., Delfmann, P., Knackstedt, R.: Adaptive reference modeling: Integrating configurative and generic adaptation techniques for information models. In: Reference modeling, pp. 27–58. Springer (2007)
Berthold, M.R., Cebron, N., Dill, F., Gabriel, T.R., Kötter, T., Meinl, T., Ohl, P., Thiel, K., Wiswedel, B.: KNIME - the konstanz information miner: version 2.0 and beyond. SIGKDD Explorations 11(1), 26–31 (2009). https://doi.org/10.1145/1656274.1656280
Brambilla, M., Fraternali, P.: Interaction flow modeling language: Model-driven UI engineering of web and mobile apps with IFML. Morgan Kaufmann (2014)
Caracciolo, C., Stellato, A., Morshed, A., Johannsen, G., Rajbhandari, S., Jaques, Y., Keizer, J.: The AGROVOC linked dataset. Semantic Web 4(3), 341–348 (2013)
Ceravolo, P., Azzini, A., Angelini, M., Catarci, T., Cudré-Mauroux, P., Damiani, E., Mazak, A., van Keulen, M., Jarrar, M., Santucci, G., Sattler, K., Scannapieco, M., Wimmer, M., Wrembel, R., Zaraket, F.A.: Big data semantics. Journal on Data Semantics 7(2), 65–85 (2018). https://doi.org/10.1007/s13740-018-0086-2
Delen, D., Ram, S.: Research challenges and opportunities in business analytics. Journal of Business Analytics 1(1), 2–12 (2018). https://doi.org/10.1080/2573234X.2018.1507324
Dobson, S., Golfarelli, M., Graziani, S., Rizzi, S.: A reference architecture and model for sensor data warehousing. IEEE Sensors Journal 18(18), 7659–7670 (2018). https://doi.org/10.1109/JSEN.2018.2861327
Donnelly, K.: SNOMED-CT: The advanced terminology and coding system for eHealth. Studies in Health Technology and Informatics 121, 279 (2006)
El Akkaoui, Z., Mazón, J., Vaisman, A.A., Zimányi, E.: BPMN-based conceptual modeling of ETL processes. In: Cuzzocrea, A., Dayal, U. (eds.) DaWaK 2012. LNCS, vol. 7448, pp. 1–14. Springer (2012)
El Akkaoui, Z., Zimányi, E.: Defining ETL worfklows using BPMN and BPEL. In: Proceedings of the ACM 12th International Workshop on Data Warehousing and OLAP. pp. 41–48 (2009)
Fleckenstein, M., Fellows, L.: Data Analytics, pp. 133–142. Springer (2018). https://doi.org/10.1007/978-3-319-68993-7_13
Francia, M., Gallinucci, E., Golfarelli, M.: Social BI to understand the debate on vaccines on the web and social media: unraveling the anti-, free, and pro-vax communities in italy. Social Network Analysis and Mining 9(1), 46:1–46:16 (2019). https://doi.org/10.1007/s13278-019-0590-x
Francia, M., Gallinucci, E., Golfarelli, M., Rizzi, S.: Social business intelligence in action. In: Nurcan, S., Soffer, P., Bajec, M., Eder, J. (eds.) CAiSE 2016. LNCS, vol. 9694, pp. 33–48. Springer (2016). https://doi.org/10.1007/978-3-319-39696-5_3
Golfarelli, M.: Design issues in social business intelligence projects. In: Zimányi, E., Abelló, A. (eds.) eBISS 2015. LNBIP, vol. 253, pp. 62–86. Springer (2016)
Golfarelli, M., Maio, D., Rizzi, S.: The dimensional fact model: a conceptual model for data warehouses. International Journal of Cooperative Information Systems 7(2-3), 215–247 (1998)
Golfarelli, M., Rizzi, S.: A model-driven approach to automate data visualization in big data analytics. Information Visualization 19(1) (2020). https://doi.org/10.1177/1473871619858933
Gorelik, A.: The enterprise big data lake: Delivering the promise of big data and data science. O’Reilly (2019)
Hilal, M., Schuetz, C.G., Schrefl, M.: Using superimposed multidimensional schemas and OLAP patterns for RDF data analysis. Open Computer Science 8(1), 18–37 (2018). https://doi.org/10.1515/comp-2018-0003
Hitzler, P., Krötzsch, M., Rudolph, S.: Foundations of Semantic Web Technologies. Chapman and Hall/CRC Press (2010), http://www.semantic-web-book.org/
Hitzler, P., Krötzsch, M., Rudolph, S., Sure, Y.: Semantic Web: Grundlagen. Springer (2007)
Inmon, W.H.: Building the data warehouse. Wiley, fourth edn. (2005)
Keim, D.A., Andrienko, G.L., Fekete, J., Görg, C., Kohlhammer, J., Melançon, G.: Visual analytics: Definition, process, and challenges. In: Kerren, A., Stasko, J.T., Fekete, J., North, C. (eds.) Information Visualization - Human-Centered Issues and Perspectives, LNCS, vol. 4950, pp. 154–175. Springer (2008). https://doi.org/10.1007/978-3-540-70956-5_7
Kotu, V., Deshpande, B.: Data Science. Morgan Kaufmann, 2nd edn. (2019). https://doi.org/10.1016/B978-0-12-814761-0.00007-1
Kovacic, I., Schuetz, C.G., Schausberger, S., Sumereder, R., Schrefl, M.: Guided query composition with semantic OLAP patterns. In: Augsten, N. (ed.) Proceedings of the Workshops of the EDBT/ICDT 2018 Joint Conference. CEUR Workshop Proceedings, vol. 2083, pp. 67–74. CEUR-WS.org (2018), http://ceur-ws.org/Vol-2083/paper-11.pdf
Krötzsch, M., Weikum, G.: Editorial for special section on knowledge graphs. Journal of Web Semantics 37-38, 53–54 (2016). https://doi.org/10.1016/j.websem.2016.04.002
Linstedt, D., Olschimke, M.: Building a scalable data warehouse with Data Vault 2.0. Morgan Kaufmann (2015)
Marcel, P.: OLAP query personalisation and recommendation: An introduction. In: Aufaure, M., Zimányi, E. (eds.) 2011. LNBIP, vol. 96, pp. 63–83. Springer (2011). https://doi.org/10.1007/978-3-642-27358-2_3
Marz, N., Warren, J.: Big Data: Principles and best practices of scalable real-time data systems. Manning Publications (2015)
Morgan, R., Grossmann, G., Schrefl, M., Stumptner, M., Payne, T.: VizDSL: A visual DSL for interactive information visualization. In: Krogstie, J., Reijers, H.A. (eds.) CAiSE 2018. LNCS, vol. 10816, pp. 440–455. Springer (2018). https://doi.org/10.1007/978-3-319-91563-0_27
Muñoz, L., Mazón, J., Pardillo, J., Trujillo, J.: Modelling ETL processes of data warehouses with UML activity diagrams. In: Meersman, R., Tari, Z., Herrero, P. (eds.) OTM 2008 Workshops. LNCS, vol. 5333, pp. 44–53. Springer (2008). https://doi.org/10.1007/978-3-540-88875-8_21
Nalchigar, S., Yu, E.: Business-driven data analytics: A conceptual modeling framework. Data & Knowledge Engineering 117, 359–372 (2018). https://doi.org/10.1016/j.datak.2018.04.006
Nalchigar, S., Yu, E.: Designing business analytics solutions. Business & Information Systems Engineering 62(1), 61–75 (2020). https://doi.org/10.1007/s12599-018-0555-z
Nalchigar, S., Yu, E.S.K., Obeidi, Y., Carbajales, S., Green, J., Chan, A.: Solution patterns for machine learning. In: Giorgini, P., Weber, B. (eds.) CAiSE 2019. LNCS, vol. 11483, pp. 627–642. Springer (2019). https://doi.org/10.1007/978-3-030-21290-2_39
Neuböck, T., Schrefl, M.: Modelling knowledge about data analysis processes in manufacturing. IFAC-PapersOnLine 48(3), 277–282 (2015). https://doi.org/10.1016/j.ifacol.2015.06.094, 15th IFAC Symposium onInformation Control Problems inManufacturing
Oliveira, B., Belo, O.: BPMN patterns for ETL conceptual modelling and validation. In: ISMIS 2012, LNCS, vol. 7661, pp. 445–454. Springer (2012)
Oliveira, B., Santos, V., Belo, O.: Pattern-based ETL conceptual modelling. In: MEDI 2013, LNCS, vol. 8216, pp. 237–248. Springer (2013)
Ordonez, C., Maabout, S., Matusevich, D.S., Cabrera, W.: Extending er models to capture database transformations to build data sets for data mining. Data & Knowledge Engineering 89, 38–54 (2014)
Peiritsch, A.R.: Starbucks’ racial-bias crisis: Toward a rhetoric of renewal. Journal of Media Ethics 34(4), 215–227 (2019). https://doi.org/10.1080/23736992.2019.1673757
Pozzi, F.A., Fersini, E., Messina, E., Liu, B. (eds.): Sentiment analysis in social networks. Morgan Kaufmann (2017). https://doi.org/10.1016/C2015-0-01864-0
Romero, O., Marcel, P., Abelló, A., Peralta, V., Bellatreche, L.: Describing analytical sessions using a multidimensional algebra. In: Cuzzocrea, A., Dayal, U. (eds.) DaWaK 2011. LNCS, vol. 6862, pp. 224–239. Springer (2011). https://doi.org/10.1007/978-3-642-23544-3_17
Russell, N., Van Der Aalst, W.M.P., Ter Hofstede, A.H.M.: Workflow patterns: the definitive guide. MIT Press (2016)
Russom, P.: Data lakes: Purposes, practices, patterns, and platforms (2017), https://tdwi.org/research/2017/03/best-practices-report-data-lakes, accessed: 05 August 2019
Saltz, J.S., Grady, N.W.: The ambiguity of data science team roles and the need for a data science workforce framework. In: Nie, J., Obradovic, Z., Suzumura, T., Ghosh, R., Nambiar, R., Wang, C., Zang, H., Baeza-Yates, R., Hu, X., Kepner, J., Cuzzocrea, A., Tang, J., Toyoda, M. (eds.) 2017 IEEE International Conference on Big Data. pp. 2355–2361 (2017). https://doi.org/10.1109/BigData.2017.8258190
Satyanarayanan, M.: The emergence of edge computing. Computer 50(1), 30–39 (2017). https://doi.org/10.1109/MC.2017.9
Schäffer, M.: Modeling genome data processing pipelines. In: Plattner, H., Schapranow, M.P. (eds.) High-Performance In-Memory Genome Data Analysis: How In-Memory Database Technology Accelerates Personalized Medicine, pp. 31–53. Springer (2014). https://doi.org/10.1007/978-3-319-03035-7_2
Schuetz, C.G., Neumayr, B., Schrefl, M., Gringinger, E., Wilson, S.: Semantics-based summarisation of atm information: Managing information overload in pilot briefings using semantic data containers. The Aeronautical Journal (2019). https://doi.org/10.1017/aer.2019.74
Schuetz, C.G., Neumayr, B., Schrefl, M., Neuböck, T.: Reference modeling for data analysis: The BIRD approach. International Journal of Cooperative Information Systems 25(2), 1–46 (2016). https://doi.org/10.1142/S0218843016500064
Schuetz, C.G., Schausberger, S., Kovacic, I., Schrefl, M.: Semantic OLAP patterns: Elements of reusable business analytics. In: Panetto, H., Debruyne, C., Gaaloul, W., Papazoglou, M.P., Paschke, A., Ardagna, C.A., Meersman, R. (eds.) OTM 2017. LNCS, vol. 10574, pp. 318–336. Springer (2017). https://doi.org/10.1007/978-3-319-69459-7_22
Schuetz, C.G., Schausberger, S., Schrefl, M.: Building an active semantic data warehouse for precision dairy farming. Journal of Organizational Computing and Electronic Commerce 28(2), 122–141 (2018). https://doi.org/10.1080/10919392.2018.1444344
Seiter, M.: Business Analytics. Vahlen, 2nd edn. (2019)
Sharda, R., Delen, D., Turban, E.: Business intelligence, analytics, and data science: a managerial perspective. Pearson, 4th global edn. (2018)
Sherman, R.: Business Intelligence Guidebook. Morgan Kaufmann (2015). https://doi.org/10.1016/C2012-0-06937-2
Shi, W., Dustdar, S.: The promise of edge computing. Computer 49(5), 78–81 (2016). https://doi.org/10.1109/MC.2016.145
Thalhammer, T., Schrefl, M., Mohania, M.K.: Active data warehouses: complementing OLAP with analysis rules. Data & Knowledge Engineering 39(3), 241–269 (2001). https://doi.org/10.1016/S0169-023X(01)00042-8
Theodoroua, V., Abelló, A., Thieleb, M., Lehner, W.: Frequent patterns in ETL workflows: An empirical approach. Data & Knowledge Engineering 112, 1–16 (2017)
Vaisman, A., Zimányi, E.: Data Warehouse Systems – Design and Implementation. Springer (2014)
Vrandečić, D., Krötzsch, M.: Wikidata: A free collaborative knowledgebase. Communications of the ACM 57(10), 78–85 (2014). https://doi.org/10.1145/2629489
Wang, L., Schuetz, C.G., Cai, D.: Choosing response strategies in social media crisis communication: an evolutionary game theory perspective. Information & Management (2020). https://doi.org/10.1016/j.im.2020.103371, in press
Weiler, A., Grossniklaus, M., Scholl, M.H.: An evaluation of the run-time and task-based performance of event detection techniques for twitter. Information Systems 62, 207–219 (2016). https://doi.org/10.1016/j.is.2016.01.003
Williams, S.: Business Intelligence Strategy and Big Data Analytics. Morgan Kaufmann (2016). https://doi.org/10.1016/C2015-0-01169-8
Zeng, D., Chen, H., Lusch, R., Li, S.: Social media analytics and intelligence. IEEE Intelligent Systems 25(6), 13–16 (2010)
Acknowledgements
We thank Ilko Kovacic for permission to adapt his figures on enriched multidimensional models and OLAP patterns. We thank Median Hilal for feedback on the graphical presentation of analysis graphs.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer-Verlag GmbH, DE, part of Springer Nature
About this chapter
Cite this chapter
Schuetz, C.G., Schrefl, M. (2023). Conceptualizing Analytics: An Overview of Business Intelligence and Analytics from a Conceptual-Modeling Perspective. In: Vogel-Heuser, B., Wimmer, M. (eds) Digital Transformation. Springer Vieweg, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-65004-2_13
Download citation
DOI: https://doi.org/10.1007/978-3-662-65004-2_13
Published:
Publisher Name: Springer Vieweg, Berlin, Heidelberg
Print ISBN: 978-3-662-65003-5
Online ISBN: 978-3-662-65004-2
eBook Packages: Computer ScienceComputer Science (R0)