Advertisement

Data Quality

Concepts, Methodologies and Techniques

  • Carlo Batini
  • Monica Scannapieca

Part of the Data-Centric Systems and Applications book series (DCSA)

Table of contents

About this book

Introduction

Poor data quality can seriously hinder or damage the efficiency and effectiveness of organizations and businesses. The growing awareness of such repercussions has led to major public initiatives like the "Data Quality Act" in the USA and the "European 2003/98" directive of the European Parliament.

Batini and Scannapieco present a comprehensive and systematic introduction to the wide set of issues related to data quality. They start with a detailed description of different data quality dimensions, like accuracy, completeness, and consistency, and their importance in different types of data, like federated data, web data, or time-dependent data, and in different data categories classified according to frequency of change, like stable, long-term, and frequently changing data. The book's extensive description of techniques and methodologies from core data quality research as well as from related fields like data mining, probability theory, statistical data analysis, and machine learning gives an excellent overview of the current state of the art. The presentation is completed by a short description and critical comparison of tools and practical methodologies, which will help readers to resolve their own quality problems.

This book is an ideal combination of the soundness of theoretical foundations and the applicability of practical approaches. It is ideally suited for everyone – researchers, students, or professionals – interested in a comprehensive overview of data quality issues. In addition, it will serve as the basis for an introductory course or for self-study on this topic.

Keywords

Data Accuracy Data Availability Data Completeness Data Consistency Data Integration Data Quality Distributed Data Management data mining learning organization

Authors and affiliations

  • Carlo Batini
    • 1
  • Monica Scannapieca
    • 2
  1. 1.Dipartimento di Informatica Sistemistica e Comunicazione Piazza dell’Ateneo NuovoUniversità di Milano BicoccaMilanoItaly
  2. 2.Dipartimento di Informatica e Sistemistica “A. Ruberti”Università di Roma “La Sapienza”RomaItaly

Bibliographic information

  • DOI https://doi.org/10.1007/3-540-33173-5
  • Copyright Information Springer-Verlag Berlin Heidelberg 2006
  • Publisher Name Springer, Berlin, Heidelberg
  • eBook Packages Computer Science
  • Print ISBN 978-3-540-33172-8
  • Online ISBN 978-3-540-33173-5
  • Buy this book on publisher's site