Abstract
For technical and architectural reasons data in information systems are often redundant in various databases. Data changes are propagated between the various databases through a synchronization mechanism, which ensures a certain degree of consistency. Depending on the time delay of propagating data changes, synchronization is classified in real time synchronization and lazy synchronization in case of respectively high or low synchronization frequency. In practice, lazy synchronization is very commonly applied but, because of the delay in data synchronization, it causes misalignments among data values resulting in a negative impact on data quality. Indeed, the raise of the time interval between two realignments increases the probability that data result incorrect or out-of-date. The paper analyses the correlation between data quality criteria and the synchronization frequency and reveals the presence of trade-offs between different criteria such as availability and timeliness. The results illustrate the problem of balancing various data quality requirements within the design of information systems. The problem is examined in selected types of information systems that are in general characterized by high degree of data redundancy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Pacitti, E. and Simon, E. (2000). Update propagation strategies to improve freshness in lazy master replicated databases. VLDB Journal 8 (3-4): 305-318.
Orr, K. (1998). Data quality and systems theory. Communications of the ACM 41 (2): 66-71.
Wand, Y. and Wang, R.Y. (1996). Anchoring data quality dimensions in ontological founda-tions. Communication of the ACM 39 (11): 86-95.
Cappiello, C. , Francalanci, C. , and Pernici, B. (Winter 2003-2004). Time-related factors of data quality in multichannel information systems. Journal of Management Information Systems, 20. (3): 71-91.
Jarke, M., Lenzerini, Vassiliou, Y. , and Vassiliadis, P. (1999). Fundamentals of Data Ware-houses. Springer, Berlin.
Barbara, D. and Garcia-Molina, D. (1981). The cost of data replication. In Proceedings of the Seventh Data Communications Symposium, Mexico, pp. 193-198.
Collins, K. (1999). Data: Evaluating value vs. cost. Tactical Guidelines, TG-08-3321. Gart-ner Group.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2008 Physica-Verlag Heidelberg
About this paper
Cite this paper
Cappiello, C., Helfert, M. (2008). Analyzing Data Quality Trade-Offs in Data-Redundant Systems. In: Interdisciplinary Aspects of Information Systems Studies. Physica-Verlag HD. https://doi.org/10.1007/978-3-7908-2010-2_25
Download citation
DOI: https://doi.org/10.1007/978-3-7908-2010-2_25
Publisher Name: Physica-Verlag HD
Print ISBN: 978-3-7908-2009-6
Online ISBN: 978-3-7908-2010-2
eBook Packages: Business and EconomicsBusiness and Management (R0)