Skip to main content

Conceptualizing Analytics: An Overview of Business Intelligence and Analytics from a Conceptual-Modeling Perspective

  • Chapter
  • First Online:
Digital Transformation

Abstract

Business intelligence and data analytics projects often involve low-level, ad hoc data wrangling and programming, which increases development effort and reduces usability of the resulting analytics solutions. Conceptual modeling allows to move data analytics onto a higher level of abstraction, facilitating the implementation and use of analytics solutions. In this chapter, we provide an overview of the data analytics landscape and explain, along the (big) data analysis pipeline, how conceptual modeling methods may benefit the development and use of data analytics solutions. We review existing literature and illustrate common issues as well as solutions using examples from cooperative research projects in the domains of precision dairy farming and air traffic management. We target practitioners involved in the planning and implementation of business intelligence and analytics projects as well as researchers interested in the application of conceptual modeling to business intelligence and analytics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.indyco.com/.

References

  1. Agrawal, D., Bernstein, P., Bertino, E., Davidson, S., Dayal, U., Franklin, M., others: Challenges and opportunities with big data – a community white paper developed by leading researchers across the United States. Tech. rep., Computing Community Consortium (2012), https://cra.org/ccc/resources/ccc-led-whitepapers/, accessed: 23 June 2020

  2. Anderlik, S., Neumayr, B., Schrefl, M.: Using domain ontologies as semantic dimensions in data warehouses. In: Atzeni, P., Cheung, D.W., Ram, S. (eds.) ER 2012. LNCS, vol. 7532, pp. 88–101. Springer (2012). https://doi.org/10.1007/978-3-642-34002-4_7

  3. Bala, M., Boussaid, O., Alimazighi, Z.: A fine-grained distribution approach for ETL processes in big data environments. Data & Knowledge Engineering 111, 114–136 (2017)

    Article  Google Scholar 

  4. Baldacci, L., Golfarelli, M., Graziani, S., Rizzi, S.: QETL: An approach to on-demand ETL from non-owned data sources. Data & Knowledge Engineering 112, 17–37 (2017). https://doi.org/10.1016/j.datak.2017.09.002

    Article  Google Scholar 

  5. Becker, J., Delfmann, P., Knackstedt, R.: Adaptive reference modeling: Integrating configurative and generic adaptation techniques for information models. In: Reference modeling, pp. 27–58. Springer (2007)

    Google Scholar 

  6. Berthold, M.R., Cebron, N., Dill, F., Gabriel, T.R., Kötter, T., Meinl, T., Ohl, P., Thiel, K., Wiswedel, B.: KNIME - the konstanz information miner: version 2.0 and beyond. SIGKDD Explorations 11(1), 26–31 (2009). https://doi.org/10.1145/1656274.1656280

  7. Brambilla, M., Fraternali, P.: Interaction flow modeling language: Model-driven UI engineering of web and mobile apps with IFML. Morgan Kaufmann (2014)

    Google Scholar 

  8. Caracciolo, C., Stellato, A., Morshed, A., Johannsen, G., Rajbhandari, S., Jaques, Y., Keizer, J.: The AGROVOC linked dataset. Semantic Web 4(3), 341–348 (2013)

    Article  Google Scholar 

  9. Ceravolo, P., Azzini, A., Angelini, M., Catarci, T., Cudré-Mauroux, P., Damiani, E., Mazak, A., van Keulen, M., Jarrar, M., Santucci, G., Sattler, K., Scannapieco, M., Wimmer, M., Wrembel, R., Zaraket, F.A.: Big data semantics. Journal on Data Semantics 7(2), 65–85 (2018). https://doi.org/10.1007/s13740-018-0086-2

    Article  Google Scholar 

  10. Delen, D., Ram, S.: Research challenges and opportunities in business analytics. Journal of Business Analytics 1(1), 2–12 (2018). https://doi.org/10.1080/2573234X.2018.1507324

    Article  Google Scholar 

  11. Dobson, S., Golfarelli, M., Graziani, S., Rizzi, S.: A reference architecture and model for sensor data warehousing. IEEE Sensors Journal 18(18), 7659–7670 (2018). https://doi.org/10.1109/JSEN.2018.2861327

    Article  Google Scholar 

  12. Donnelly, K.: SNOMED-CT: The advanced terminology and coding system for eHealth. Studies in Health Technology and Informatics 121,  279 (2006)

    Google Scholar 

  13. El Akkaoui, Z., Mazón, J., Vaisman, A.A., Zimányi, E.: BPMN-based conceptual modeling of ETL processes. In: Cuzzocrea, A., Dayal, U. (eds.) DaWaK 2012. LNCS, vol. 7448, pp. 1–14. Springer (2012)

    Google Scholar 

  14. El Akkaoui, Z., Zimányi, E.: Defining ETL worfklows using BPMN and BPEL. In: Proceedings of the ACM 12th International Workshop on Data Warehousing and OLAP. pp. 41–48 (2009)

    Google Scholar 

  15. Fleckenstein, M., Fellows, L.: Data Analytics, pp. 133–142. Springer (2018). https://doi.org/10.1007/978-3-319-68993-7_13

  16. Francia, M., Gallinucci, E., Golfarelli, M.: Social BI to understand the debate on vaccines on the web and social media: unraveling the anti-, free, and pro-vax communities in italy. Social Network Analysis and Mining 9(1), 46:1–46:16 (2019). https://doi.org/10.1007/s13278-019-0590-x

  17. Francia, M., Gallinucci, E., Golfarelli, M., Rizzi, S.: Social business intelligence in action. In: Nurcan, S., Soffer, P., Bajec, M., Eder, J. (eds.) CAiSE 2016. LNCS, vol. 9694, pp. 33–48. Springer (2016). https://doi.org/10.1007/978-3-319-39696-5_3

  18. Golfarelli, M.: Design issues in social business intelligence projects. In: Zimányi, E., Abelló, A. (eds.) eBISS 2015. LNBIP, vol. 253, pp. 62–86. Springer (2016)

    Google Scholar 

  19. Golfarelli, M., Maio, D., Rizzi, S.: The dimensional fact model: a conceptual model for data warehouses. International Journal of Cooperative Information Systems 7(2-3), 215–247 (1998)

    Article  Google Scholar 

  20. Golfarelli, M., Rizzi, S.: A model-driven approach to automate data visualization in big data analytics. Information Visualization 19(1) (2020). https://doi.org/10.1177/1473871619858933

  21. Gorelik, A.: The enterprise big data lake: Delivering the promise of big data and data science. O’Reilly (2019)

    Google Scholar 

  22. Hilal, M., Schuetz, C.G., Schrefl, M.: Using superimposed multidimensional schemas and OLAP patterns for RDF data analysis. Open Computer Science 8(1), 18–37 (2018). https://doi.org/10.1515/comp-2018-0003

    Article  Google Scholar 

  23. Hitzler, P., Krötzsch, M., Rudolph, S.: Foundations of Semantic Web Technologies. Chapman and Hall/CRC Press (2010), http://www.semantic-web-book.org/

  24. Hitzler, P., Krötzsch, M., Rudolph, S., Sure, Y.: Semantic Web: Grundlagen. Springer (2007)

    Google Scholar 

  25. Inmon, W.H.: Building the data warehouse. Wiley, fourth edn. (2005)

    Google Scholar 

  26. Keim, D.A., Andrienko, G.L., Fekete, J., Görg, C., Kohlhammer, J., Melançon, G.: Visual analytics: Definition, process, and challenges. In: Kerren, A., Stasko, J.T., Fekete, J., North, C. (eds.) Information Visualization - Human-Centered Issues and Perspectives, LNCS, vol. 4950, pp. 154–175. Springer (2008). https://doi.org/10.1007/978-3-540-70956-5_7

  27. Kotu, V., Deshpande, B.: Data Science. Morgan Kaufmann, 2nd edn. (2019). https://doi.org/10.1016/B978-0-12-814761-0.00007-1

  28. Kovacic, I., Schuetz, C.G., Schausberger, S., Sumereder, R., Schrefl, M.: Guided query composition with semantic OLAP patterns. In: Augsten, N. (ed.) Proceedings of the Workshops of the EDBT/ICDT 2018 Joint Conference. CEUR Workshop Proceedings, vol. 2083, pp. 67–74. CEUR-WS.org (2018), http://ceur-ws.org/Vol-2083/paper-11.pdf

  29. Krötzsch, M., Weikum, G.: Editorial for special section on knowledge graphs. Journal of Web Semantics 37-38, 53–54 (2016). https://doi.org/10.1016/j.websem.2016.04.002

    Article  Google Scholar 

  30. Linstedt, D., Olschimke, M.: Building a scalable data warehouse with Data Vault 2.0. Morgan Kaufmann (2015)

    Google Scholar 

  31. Marcel, P.: OLAP query personalisation and recommendation: An introduction. In: Aufaure, M., Zimányi, E. (eds.) 2011. LNBIP, vol. 96, pp. 63–83. Springer (2011). https://doi.org/10.1007/978-3-642-27358-2_3

  32. Marz, N., Warren, J.: Big Data: Principles and best practices of scalable real-time data systems. Manning Publications (2015)

    Google Scholar 

  33. Morgan, R., Grossmann, G., Schrefl, M., Stumptner, M., Payne, T.: VizDSL: A visual DSL for interactive information visualization. In: Krogstie, J., Reijers, H.A. (eds.) CAiSE 2018. LNCS, vol. 10816, pp. 440–455. Springer (2018). https://doi.org/10.1007/978-3-319-91563-0_27

  34. Muñoz, L., Mazón, J., Pardillo, J., Trujillo, J.: Modelling ETL processes of data warehouses with UML activity diagrams. In: Meersman, R., Tari, Z., Herrero, P. (eds.) OTM 2008 Workshops. LNCS, vol. 5333, pp. 44–53. Springer (2008). https://doi.org/10.1007/978-3-540-88875-8_21

  35. Nalchigar, S., Yu, E.: Business-driven data analytics: A conceptual modeling framework. Data & Knowledge Engineering 117, 359–372 (2018). https://doi.org/10.1016/j.datak.2018.04.006

    Article  Google Scholar 

  36. Nalchigar, S., Yu, E.: Designing business analytics solutions. Business & Information Systems Engineering 62(1), 61–75 (2020). https://doi.org/10.1007/s12599-018-0555-z

    Article  Google Scholar 

  37. Nalchigar, S., Yu, E.S.K., Obeidi, Y., Carbajales, S., Green, J., Chan, A.: Solution patterns for machine learning. In: Giorgini, P., Weber, B. (eds.) CAiSE 2019. LNCS, vol. 11483, pp. 627–642. Springer (2019). https://doi.org/10.1007/978-3-030-21290-2_39

  38. Neuböck, T., Schrefl, M.: Modelling knowledge about data analysis processes in manufacturing. IFAC-PapersOnLine 48(3), 277–282 (2015). https://doi.org/10.1016/j.ifacol.2015.06.094, 15th IFAC Symposium onInformation Control Problems inManufacturing

  39. Oliveira, B., Belo, O.: BPMN patterns for ETL conceptual modelling and validation. In: ISMIS 2012, LNCS, vol. 7661, pp. 445–454. Springer (2012)

    Google Scholar 

  40. Oliveira, B., Santos, V., Belo, O.: Pattern-based ETL conceptual modelling. In: MEDI 2013, LNCS, vol. 8216, pp. 237–248. Springer (2013)

    Google Scholar 

  41. Ordonez, C., Maabout, S., Matusevich, D.S., Cabrera, W.: Extending er models to capture database transformations to build data sets for data mining. Data & Knowledge Engineering 89, 38–54 (2014)

    Article  Google Scholar 

  42. Peiritsch, A.R.: Starbucks’ racial-bias crisis: Toward a rhetoric of renewal. Journal of Media Ethics 34(4), 215–227 (2019). https://doi.org/10.1080/23736992.2019.1673757

    Article  Google Scholar 

  43. Pozzi, F.A., Fersini, E., Messina, E., Liu, B. (eds.): Sentiment analysis in social networks. Morgan Kaufmann (2017). https://doi.org/10.1016/C2015-0-01864-0

  44. Romero, O., Marcel, P., Abelló, A., Peralta, V., Bellatreche, L.: Describing analytical sessions using a multidimensional algebra. In: Cuzzocrea, A., Dayal, U. (eds.) DaWaK 2011. LNCS, vol. 6862, pp. 224–239. Springer (2011). https://doi.org/10.1007/978-3-642-23544-3_17

  45. Russell, N., Van Der Aalst, W.M.P., Ter Hofstede, A.H.M.: Workflow patterns: the definitive guide. MIT Press (2016)

    Google Scholar 

  46. Russom, P.: Data lakes: Purposes, practices, patterns, and platforms (2017), https://tdwi.org/research/2017/03/best-practices-report-data-lakes, accessed: 05 August 2019

  47. Saltz, J.S., Grady, N.W.: The ambiguity of data science team roles and the need for a data science workforce framework. In: Nie, J., Obradovic, Z., Suzumura, T., Ghosh, R., Nambiar, R., Wang, C., Zang, H., Baeza-Yates, R., Hu, X., Kepner, J., Cuzzocrea, A., Tang, J., Toyoda, M. (eds.) 2017 IEEE International Conference on Big Data. pp. 2355–2361 (2017). https://doi.org/10.1109/BigData.2017.8258190

  48. Satyanarayanan, M.: The emergence of edge computing. Computer 50(1), 30–39 (2017). https://doi.org/10.1109/MC.2017.9

    Article  Google Scholar 

  49. Schäffer, M.: Modeling genome data processing pipelines. In: Plattner, H., Schapranow, M.P. (eds.) High-Performance In-Memory Genome Data Analysis: How In-Memory Database Technology Accelerates Personalized Medicine, pp. 31–53. Springer (2014). https://doi.org/10.1007/978-3-319-03035-7_2

  50. Schuetz, C.G., Neumayr, B., Schrefl, M., Gringinger, E., Wilson, S.: Semantics-based summarisation of atm information: Managing information overload in pilot briefings using semantic data containers. The Aeronautical Journal (2019). https://doi.org/10.1017/aer.2019.74

    Article  Google Scholar 

  51. Schuetz, C.G., Neumayr, B., Schrefl, M., Neuböck, T.: Reference modeling for data analysis: The BIRD approach. International Journal of Cooperative Information Systems 25(2), 1–46 (2016). https://doi.org/10.1142/S0218843016500064

    Article  Google Scholar 

  52. Schuetz, C.G., Schausberger, S., Kovacic, I., Schrefl, M.: Semantic OLAP patterns: Elements of reusable business analytics. In: Panetto, H., Debruyne, C., Gaaloul, W., Papazoglou, M.P., Paschke, A., Ardagna, C.A., Meersman, R. (eds.) OTM 2017. LNCS, vol. 10574, pp. 318–336. Springer (2017). https://doi.org/10.1007/978-3-319-69459-7_22

  53. Schuetz, C.G., Schausberger, S., Schrefl, M.: Building an active semantic data warehouse for precision dairy farming. Journal of Organizational Computing and Electronic Commerce 28(2), 122–141 (2018). https://doi.org/10.1080/10919392.2018.1444344

    Article  Google Scholar 

  54. Seiter, M.: Business Analytics. Vahlen, 2nd edn. (2019)

    Google Scholar 

  55. Sharda, R., Delen, D., Turban, E.: Business intelligence, analytics, and data science: a managerial perspective. Pearson, 4th global edn. (2018)

    Google Scholar 

  56. Sherman, R.: Business Intelligence Guidebook. Morgan Kaufmann (2015). https://doi.org/10.1016/C2012-0-06937-2

    Article  Google Scholar 

  57. Shi, W., Dustdar, S.: The promise of edge computing. Computer 49(5), 78–81 (2016). https://doi.org/10.1109/MC.2016.145

    Article  Google Scholar 

  58. Thalhammer, T., Schrefl, M., Mohania, M.K.: Active data warehouses: complementing OLAP with analysis rules. Data & Knowledge Engineering 39(3), 241–269 (2001). https://doi.org/10.1016/S0169-023X(01)00042-8

    Article  MATH  Google Scholar 

  59. Theodoroua, V., Abelló, A., Thieleb, M., Lehner, W.: Frequent patterns in ETL workflows: An empirical approach. Data & Knowledge Engineering 112, 1–16 (2017)

    Article  Google Scholar 

  60. Vaisman, A., Zimányi, E.: Data Warehouse Systems – Design and Implementation. Springer (2014)

    Google Scholar 

  61. Vrandečić, D., Krötzsch, M.: Wikidata: A free collaborative knowledgebase. Communications of the ACM 57(10), 78–85 (2014). https://doi.org/10.1145/2629489

    Article  Google Scholar 

  62. Wang, L., Schuetz, C.G., Cai, D.: Choosing response strategies in social media crisis communication: an evolutionary game theory perspective. Information & Management (2020). https://doi.org/10.1016/j.im.2020.103371, in press

  63. Weiler, A., Grossniklaus, M., Scholl, M.H.: An evaluation of the run-time and task-based performance of event detection techniques for twitter. Information Systems 62, 207–219 (2016). https://doi.org/10.1016/j.is.2016.01.003

    Article  Google Scholar 

  64. Williams, S.: Business Intelligence Strategy and Big Data Analytics. Morgan Kaufmann (2016). https://doi.org/10.1016/C2015-0-01169-8

    Article  Google Scholar 

  65. Zeng, D., Chen, H., Lusch, R., Li, S.: Social media analytics and intelligence. IEEE Intelligent Systems 25(6), 13–16 (2010)

    Article  Google Scholar 

Download references

Acknowledgements

We thank Ilko Kovacic for permission to adapt his figures on enriched multidimensional models and OLAP patterns. We thank Median Hilal for feedback on the graphical presentation of analysis graphs.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Christoph G. Schuetz .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer-Verlag GmbH, DE, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Schuetz, C.G., Schrefl, M. (2023). Conceptualizing Analytics: An Overview of Business Intelligence and Analytics from a Conceptual-Modeling Perspective. In: Vogel-Heuser, B., Wimmer, M. (eds) Digital Transformation. Springer Vieweg, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-65004-2_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-65004-2_13

  • Published:

  • Publisher Name: Springer Vieweg, Berlin, Heidelberg

  • Print ISBN: 978-3-662-65003-5

  • Online ISBN: 978-3-662-65004-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics