Quality in Data Warehousing

  • Mokrane Bouzeghoub
  • Zoubida Kedad
Part of the Advances in Database Systems book series (ADBS, volume 25)

Abstract

Data warehousing is a new technology which provides a software infrastructure for decision support systems and OLAP applications. Data warehouse systems collect data from heterogeneous and distributed sources, transform and reconcile this data in order to aggregate it and customize it with respect to business and organizational criteria required by decision makers. High level aggregated data is organized by subjects and stored as a multidimensional structure into a data mart. Data quality is very important in database applications in general and very crucial in data warehousing in particular. Indeed, data warehouse systems provide aggregated data to decision makers whose actions and decisions should be very strategic to the enterprise. Providing dirty data, imprecise data or non coherent data may lead to the rejection of the decision support system or may result into non productive decisions. This chapter provides a general framework for data warehouse design based on quality.

Keywords

Quality Factor Data Warehouse Enterprise Model Quality Goal User View 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Baralis E., Paraboschi S., Teniente E. “Materialized View Selection in a Multidimensional Environment”, Proceedings of the VLDB’97, Athens, GR, 1997.Google Scholar
  2. Basili, V. R., Caldiera, G., Rombach. H. D. The Goal Question Metric Approach.Encyclopedia of Software Engineering - 2 Volume Setpp 528–532, John Wiley & Sons, Inc. (1994). Available athttp://www.cs.umd.edu/users/basili/papers.html Google Scholar
  3. Bouzeghoub M., Fabret F., Matulovic M., “Modeling the Data Warehouse Refreshment Process as a Workflow Application ”, Proceedings of the International Workshop on Design and Management of Data Warehouses (DMDW’99), Heidelberg, Germany, June 1999.Google Scholar
  4. Bouzeghoub, M., Kedad, Z. “A Quality-Based Framework for Data Warehouse Design ”, Proceedings of the International Workshop on Design and Management of Data Warehouses(DMDW’2000), Stockholm, June 2000.Google Scholar
  5. Bouzeghoub, M., Lenzerini, M. (guest eds) “Data extraction, cleaning and reconciliation, a special issue of Information Systems Journal, Elsevier Sce, to be published end 2001Google Scholar
  6. Calvanese, D., De Giacomo, G., Lenzerini, M., Nardi, D., Rosati, R., “ Source Integration integration: conceptual modeling and reasoning support ”, Proceedings of the 6thInternat. Conf. On Cooperative Information Systems, (CoopIS’98), 1998.Google Scholar
  7. Galhardas, H., Florescu, D., Shasha, D., Simon, E., Saita, C.A. “Declarative data cleaning: language, model and algorithms, proceed. Of the 27`h VLDB Conf., Rome, 2001.Google Scholar
  8. Gupta A., Mumick I.S., “Maintenance of materialized views: Problems, Techniques, and applications.” IEEE Data Engineering Bulletin, Special Issue on Materialised Views and Data Warehousing, 18(2), june 1995Google Scholar
  9. Hull R., Zhou G. “A Framework for Supporting Data Integration Using the Materialized and Virtual Approches”Procedding of theSIGMOD 96. 1996aGoogle Scholar
  10. Hull, R., Zhou, G., Toward the study of performance trade-off between materialized and virtual integrated views, Proeed. Of the Workshop on Materialized Views, I.S. Mumick & A. Gupta Edits, Montreal, June 1996b.Google Scholar
  11. Jarke M, Jeusfeld M.A., Quix C., Vassiliadis P., “Architecture and Quality in Data Warehouses”, Proceeding of the 10th International Conference on Advanced Information Systems Engineering (CAiSE’98), Pisa, Italy, June 1998.Google Scholar
  12. Jarke, M., Lenzerini, M., Vassiliou, Y., Vassiliadis, P., (edits), “ Fundamentals of Data Warehouses ”, Springer, 1999Google Scholar
  13. Jeusfeld, M. A., Quix C., Jarke M., “Design and Analysis of Quality Information for Data Warehouses”, Proceedings of the 17th Internat. Conf. on Conceptual Modeling (ER’98), Singapore, november 1998.Google Scholar
  14. Kedad Z. “Techniques d’int¨¦gration dans les syst¨¨mes d’information multi-source”, PhD Thesis, Laboratoire PriSM, University of Versailles, January 1999.Google Scholar
  15. Kedad Z., Bouzeghoub M. “Conception de syst¨¨mes d’information multi-source”, to appear in the Proceedings of INFORSID’99, Toulon, France, June 1999.Google Scholar
  16. Levy A.Y., Rajaraman A., Ordille J.J. “Querying Heterogeneous Information Sources Using Source Description”, Proceedings of the VLDB’96.Google Scholar
  17. Ligoudistianos S., Sellis T., Theodoratos D., Vassiliou Y. “Heuristic Algorithms For Designing The Data Warehouse with SPJ Views”, Proceedings of the DAWAK’99.Google Scholar
  18. D. Theodoratos, M. Bouzeghoub. Data Currency Quality Factors in Data Warehouse Design. InProc. of the International Workshop on Design and Management of Data Warehouses (DMDW’99)Heidelberg, Germany (1999). Available at http://www.dbnet.ece.ntua.gr/--dwq/Google Scholar
  19. Theodoratos D., Sellis T. “The Data Warehouse Configuration”, Proceedings of the VLDB’97, Athens, GR, 1997.Google Scholar
  20. Theodoratos D., Sellis T. “Data Warehouse Schema and Instance Design”, Proceedings of ER’98, syngapour, Nov. 1998.Google Scholar
  21. Theodoratos D., Ligoudistianos S., Sellis T. “Designing the Global Data Warehouse with SPJ Views”, Proceedings of CAISE’99.Google Scholar
  22. Vassiliadis, P., Bouzeghoub, M., Quix, C., “Towards quality-oriented data warehouse usage and evolution”, Proceedings of the 11th Conference on Advanced Information Systems Engineering (CAiSE’99), Heidelberg, Germany, June 1999, also published in Information Systems Journal, vol. 25, No. 2, 2000.Google Scholar
  23. R.Y. Wang, V.C. Storey, C.P. Firth. A Framework for Analysis of Data Quality Research.IEEE Transactions on Knowledge and Data Engineering7(4): 623–640 (1995).CrossRefGoogle Scholar
  24. Wang, R. Y., A product perspective on total data quality management, Com. Of the ACM,Vo141, N2, February 1998.Google Scholar
  25. Yang J., Karlapalem K., Li S. “Algorithms for materialized view design in data warehousing Environment”, Proceedings of VLDB 97, Athens, GR, 1997.Google Scholar

Copyright information

© Springer Science+Business Media New York 2002

Authors and Affiliations

  • Mokrane Bouzeghoub
    • 1
  • Zoubida Kedad
    • 1
  1. 1.Laboratoire PRiSMUniversity of VersaillesFrance

Personalised recommendations