A Step Foreword Historical Data Governance in Information Systems

  • José Pedro Simão
  • Orlando Belo
Conference paper
Part of the Lecture Notes in Business Information Processing book series (LNBIP, volume 341)


From major companies and organizations to smaller ones around the world, databases are now one of the leading technologies for supporting most organizational information assets. Their evolution allows us to store almost anything, often without determining whether it is in fact relevant to save. Hence, it is predictable that most information systems will sooner or later face data management problems and, consequently, the performance problems that are unavoidably linked to them. In this paper we tackle the data management problem by proposing a solution that uses machine-learning techniques to understand, in an intelligent manner, the data in a database according to its relevance for its users. Identifying what is really important to those who use the system, and being able to distinguish it from the rest of the data, provides a basis for creating new and efficient measures for managing data in an information system.


Information systems management · Databases systems · Data governance · Data quality · Machine learning



This work has been supported by COMPETE: POCI-01-0145-FEDER-007043 and FCT – Fundação para a Ciência e Tecnologia within the Project Scope: UID/CEC/00319/2013.



Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. ALGORITMI R&D Centre, Department of Informatics, School of Engineering, University of Minho, Braga, Portugal
