Skip to main content

Analyzing Data Quality Trade-Offs in Data-Redundant Systems

  • Conference paper
Interdisciplinary Aspects of Information Systems Studies

Abstract

For technical and architectural reasons data in information systems are often redundant in various databases. Data changes are propagated between the various databases through a synchronization mechanism, which ensures a certain degree of consistency. Depending on the time delay of propagating data changes, synchronization is classified in real time synchronization and lazy synchronization in case of respectively high or low synchronization frequency. In practice, lazy synchronization is very commonly applied but, because of the delay in data synchronization, it causes misalignments among data values resulting in a negative impact on data quality. Indeed, the raise of the time interval between two realignments increases the probability that data result incorrect or out-of-date. The paper analyses the correlation between data quality criteria and the synchronization frequency and reveals the presence of trade-offs between different criteria such as availability and timeliness. The results illustrate the problem of balancing various data quality requirements within the design of information systems. The problem is examined in selected types of information systems that are in general characterized by high degree of data redundancy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Pacitti, E. and Simon, E. (2000). Update propagation strategies to improve freshness in lazy master replicated databases. VLDB Journal 8 (3-4): 305-318.

    Article  Google Scholar 

  2. Orr, K. (1998). Data quality and systems theory. Communications of the ACM 41 (2): 66-71.

    Article  Google Scholar 

  3. Wand, Y. and Wang, R.Y. (1996). Anchoring data quality dimensions in ontological founda-tions. Communication of the ACM 39 (11): 86-95.

    Article  Google Scholar 

  4. Cappiello, C. , Francalanci, C. , and Pernici, B. (Winter 2003-2004). Time-related factors of data quality in multichannel information systems. Journal of Management Information Systems, 20. (3): 71-91.

    Google Scholar 

  5. Jarke, M., Lenzerini, Vassiliou, Y. , and Vassiliadis, P. (1999). Fundamentals of Data Ware-houses. Springer, Berlin.

    Google Scholar 

  6. Barbara, D. and Garcia-Molina, D. (1981). The cost of data replication. In Proceedings of the Seventh Data Communications Symposium, Mexico, pp. 193-198.

    Google Scholar 

  7. Collins, K. (1999). Data: Evaluating value vs. cost. Tactical Guidelines, TG-08-3321. Gart-ner Group.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Physica-Verlag Heidelberg

About this paper

Cite this paper

Cappiello, C., Helfert, M. (2008). Analyzing Data Quality Trade-Offs in Data-Redundant Systems. In: Interdisciplinary Aspects of Information Systems Studies. Physica-Verlag HD. https://doi.org/10.1007/978-3-7908-2010-2_25

Download citation

Publish with us

Policies and ethics