Managing Data Quality in Cooperative Information Systems

  • Massimo Mecella
  • Monica Scannapieco
  • Antonino Virgillito
  • Roberto Baldoni
  • Tiziana Catarci
  • Carlo Batini
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2519)

Abstract

Current approaches to the development of cooperative information systems are based on services to be offered by cooperating organizations, and on the opportunity of building coordinators and brokers on top of such services. The quality of data exchanged and provided by different services hampers such approaches, as data of low quality can spread all over the cooperative system. At the same time, improvement can be based on comparing data, correcting them and disseminating high quality data. In this paper, a service-based framework for managing data quality in cooperative information systems is presented. An XML-based model for data and quality data is proposed, and the design of a broker, which selects the best available data from different services, is presented. Sucha broker also supports the improvement of data based on feedbacks to source services.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    C. Batini and M. Mecella, “Enabling Italian e-Government Through a Cooperative Architecture,” IEEE Computer, vol. 34, no. 2, 2001.Google Scholar
  2. 2.
    U. Dayal, M. Hsu, and R. Ladin, “Business Process Coordination: State of the Art, Trends and Open Issues,” in Proceedings of the 27th Very Large Databases Conference (VLDB 2001), Roma, Italy, 2001.Google Scholar
  3. 3.
    F. Casati, D. Georgakopoulos, and M.C. Shan, Eds., Proceedings of the 2nd VLDB International Workshop on Technologies for e-Services (VLDB-TES 2001), Rome, Italy, 2001.Google Scholar
  4. 4.
    P. Bertolazzi and M. Scannapieco, “Introducing Data Quality in a Cooperative Context,” in Proceedings of the 6th International Conference on Information Quality (IQ’01), Boston, MA, USA, 2001.Google Scholar
  5. 5.
    H. Galhardas, D. Florescu, D. Shasha, and E. Simon, “An Extensible Framework for Data Cleaning,” in Proceedings of the 16th International Conference on Data Engineering (ICDE 2000), San Diego, CA, USA, 2000.Google Scholar
  6. 6.
    M. Jarke, M. Lenzerini, Y. Vassiliou, and Panos Vassiliadis, Eds., Fundamentals of Data Warehouses, Springer Verlag, 1999.Google Scholar
  7. 7.
    H.B. Kon, R.Y. Wang and S.E. Madnick, “Data Quality Requirements: Analysis and Modeling,” in Proceedings of the 9th International Conference on Data Engineering (ICDE’ 93), Vienna, Austria, 1993.Google Scholar
  8. 8.
    G. Mihaila, L. Raschid, and M. Vidal, “Querying Quality of Data Metadata,” in Proceedings of the 6th International Conference on Extending Database Technology (EDBT’98), Valencia, Spain, 1998.Google Scholar
  9. 9.
    M. Mecella, M. Scannapieco, A. Virgillito, R. Baldoni, T. Catarci, and C. Batini, “Architectural Support for Data Quality in Cooperative Information Systems,” Technical report of the DaQuinCIS project, Dipartimento di Informatica e Sistemistica, Università di Roma “La Sapienza”, Roma, Italy, 2002.Google Scholar
  10. 10.
    T. Catarci and M. Lenzerini, “Representing and Using Interschema Knowledge in Cooperative Information Systems,” Journal of Intelligent and Cooperative Information Systems, vol. 2, no. 4, 1993.Google Scholar
  11. 11.
    M. Scannapieco, “Data Quality in Cooperative Information Systems,” Doctoral Poster at the 28th Very Large Databases Conference (VLDB 2002), Hong Kong, 2002.Google Scholar
  12. 12.
    T.C. Redman, Data Quality for the Information Age, ArtechHouse, 1996.Google Scholar
  13. 13.
    A. Deutsch, M. Fernandez, D. Florescu, A. Levy, and D. Suciu, “XML-QL: A Query Language for XML,” in Proceedings of the 8th International World Wide Web Conference (WWW8), Toronto, Canada, 1999.Google Scholar
  14. 14.
    R.G.G. Cattell and D.K. Barry, Eds., The Object Database Standard: ODMG 2.0, Morgan Kaufmann Publishers, 1997.Google Scholar
  15. 15.
    M. Scannapieco, V. Mirabella, M. Mecella, and C. Batini, “Data Quality in e-Business,” in Proceedings of the Workshop on Web Services, e-Business, and the Semantic Web: Foundations, Models, Architecture, Engineering and Applications, in conjunction with CAiSE 2002, Toronto, Ontario, Canada, 2002.Google Scholar
  16. 16.
    F.B. Schneider, “What Goods are Models and What Models are Good?,” in Distributed Systems, S. Mullender, Ed. ACM Press, 1994.Google Scholar
  17. 17.
    M.J. Fischer, N.A. Lynch, and M.S. Paterson, “Impossibility of Distributed Consensus withOne Faulty Process,” Journal of the ACM, vol. 32, no. 2, 1985.Google Scholar
  18. 18.
    V. Hadzilacos and S. Toueg, “Fault-Tolerant Broadcasts and Related Problems,” in Distributed Systems, S. Mullender, Ed. ACM Press, 1994.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Massimo Mecella
    • 1
  • Monica Scannapieco
    • 1
    • 2
  • Antonino Virgillito
    • 1
  • Roberto Baldoni
    • 1
  • Tiziana Catarci
    • 1
  • Carlo Batini
    • 3
  1. 1.Università di Roma “La Sapienza”DIS
  2. 2.Consiglio Nazionale delle RicercheIASI
  3. 3.Università di Milano “Bicocca”DISCo

Personalised recommendations