Quality Assessment of Data Using Statistical and Machine Learning Methods

Conference paper
Part of the Smart Innovation, Systems and Technologies book series (SIST, volume 32)


Data warehouses are used in organization for efficiently managing the information. The data from various heterogeneous data sources are integrated in data warehouse in order to do analysis and make decision. Data warehouse quality is very important as it is the main tool for strategic decision. Data warehouse quality is influenced by Data model quality which is further influenced by conceptual data model. In this paper, we first summarize the set of metrics for measuring the understand ability of conceptual data model for data warehouses. The statistical and machine learning methods are used to predict effect of structural metrics, on understand ability, efficiency and effectiveness of Data warehouse Multidimensional (MD) conceptual model.


Conceptual model Data warehouse quality Multidimensional data model Statistical Understand ability 


  1. 1.
    Serrano, M., Trujillo, J., Calerro, C., Piattini, M.: Metrics for data warehouse conceptual model understandability. Inf. Softw. Technol. 851–890 (2007)Google Scholar
  2. 2.
    Kimball, R.: The Data Warehouse Toolkit. Wiley, New York (2011)Google Scholar
  3. 3.
    Kesh, S.: Evaluating the quality of entity relationship models. Inf. Softw. Technol. 37, 681–689 (1995)CrossRefGoogle Scholar
  4. 4.
    Serrano, M., Calero, C., Trujello, J.: Sergio Lujan-Mora and Mario Riattini. Empirical Validation of Metrics for Conceptual Models of Data Warehouses. In: Pearson, A., Stirna, J. (eds.) CAiSE, LNCS, vol. 3084, pp. 506–520 (2004)Google Scholar
  5. 5.
    Batini, C., Ceri S., Navathe S.: Conceptual database design: an entity relationship approach. Benjamin/CummingsGoogle Scholar
  6. 6.
    Jeusfeld, M., Quix, C., Jarke, M.: Design and analysis of quality information for data warehouses. In: 17th International Conference on Conceptual Modeling (ER‟98), Singapore (1998)Google Scholar
  7. 7.
    Golfarelli, M., Maio, D., Rizzi S.: The dimensional fact model—a conceptual for data warehouses. Int. J. Coop. Inf. Syst. (IJCIS) 7, 215–247 (1998)Google Scholar
  8. 8.
    Basili, V., Romach.: The tame project towards improvement oriented software environments. IEEE Trans. Soft Eng. 14(6) 728–738 (1988)Google Scholar
  9. 9.
    Golfarelli, M., Rizzi, S.: A methodological framework for data warehouse design. In: 1st International Workshop on Data Warehousing and OLAP (Dolap 98) Maryland (USA) (1998)Google Scholar
  10. 10.
    Sapia, C.: On Modeling and Predicting Query Behavior in OLAP Systems. In: International Workshop on Design and Management of Data warehouses (DMDW ‘99), pp. 1–10, Heidelberg (Germany) (1999)Google Scholar
  11. 11.
    Sapia, C., Blaschka, M., Holfing, G., Dinter, B.: Extending use the E/R model for multidimensional paradigm. In: 1st International Workshop on Data Warehouse and Data mining (DWDM ’98), pp. 105–116. Springer Singapore (1998)Google Scholar
  12. 12.
    Husemann, B., Lechtenborger, J., Vossen, G.: Conceptual data warehouse design. In: 2nd International Workshop on Design and Management of Data Warehouses (DMDW 2000), pp. 3–9, Stockholm (Sweden) (2000)Google Scholar
  13. 13.
    Abello, A., Samos, J., Saltor, F.: YAM2 (Yet Another Multi Dimensional Model) An Extension of UML. In: International Database Engineering and Application Symposium (IDEAS 2002), pp. 172–181. IEEE Computer Society Edmonton (Canada) (2002)Google Scholar
  14. 14.
    Caldiera, V.R.B.G., Dieter Rombach, H.: The goal question metric approach. In: Encyclopedia of Software Engineering. Wiley, New York (1994)Google Scholar
  15. 15.
    Moody, D.: Metrics for evaluating the quality of entity relationship models. In: 17th International Conference on Conceptual Modelling, pp. 213–225 (ER‟98) Singapore (1998)Google Scholar

Copyright information

© Springer India 2015

Authors and Affiliations

  1. 1.Jagan Institute of Management StudiesNew DelhiIndia
  2. 2.USICTDwarka, New DelhiIndia

Personalised recommendations