Integration Integrity for Multigranular Data
When data from several source schemata are to be integrated, it is essential that the resulting data in the global schema be consistent. This problem has been studied extensively for the monogranular case, in which all domains are flat. However, data involving spatial and/or temporal attributes are often represented at different levels of granularity in different source schemata. In this work, the beginnings of a framework for addressing data integration in multigranular contexts are developed. The contribution is twofold. First, a model of multigranular attributes which is based upon partial orders which are augmented with partial lattice-like operations is developed. These operations are specifically designed to model the kind of dependencies which occur in multigranular modelling, particularly in the presence of aggregation operations. Second, the notion of a thematic multigranular comparison dependency, generalizing ordinary functional and order dependencies but specifically designed to model the kinds of functional and order dependencies which arise in the multigranular context, is developed.
The work of M. Andrea Rodríguez, as well as a six-week visit of Stephen J. Hegner to Concepción, during which many of the ideas reported here were developed, were partly funded by Fondecyt-Conicyt grant number 1140428. Loreto Bravo was initially a collaborator, but was unable to continue due to other commitments. The authors gratefully acknowledge her contributions and insights during the early phases of this investigation.
- 3.Bertossi, L.E.: Database Repairing and Consistent Query Answering. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, San Rafael (2011)Google Scholar
- 5.Bonham-Carter, G.F.: Geographic Information Systems for Geoscientists: Modelling with GIS. Pergamon, Oxford (1995)Google Scholar
- 11.Egenhofer, M., Clementine, E., Felice, P.D.: Evaluating inconsistency among multiple representations. In: Spatial Data Handling, pp. 901–920 (1995)Google Scholar
- 12.Egenhofer, M., Sharma, J.: Assessing the consistency of complete and incomplete topological information. Geogr. Syst. 1, 47–68 (1993)Google Scholar
- 16.Hegner, S.J.: Distributivity in incompletely specified type hierarchies: theory and computational complexity. In: Dörre, J. (ed.) Computational Aspects of Constraint-Based Linguistic Description II, DYANA, pp. 29–120 (1994)Google Scholar
- 17.Iftikhar, N., Pedersen, T.B.: Using a time granularity table for gradual granular data aggregation. Fundam. Inform. 132(2), 153–176 (2014)Google Scholar
- 18.Kuijpers, B., Paredaens, J., den Bussche, J.V.: On topological elementary equivalence of spatial databases. In: ICDT, pp. 432–446 (1997)Google Scholar
- 19.Lenzerini, M.: Data integration: a theoretical perspective. In: Popa, L., Abiteboul, S., Kolaitis, P.G. (eds.) PODS, pp. 233–246. ACM (2002)Google Scholar
- 23.Rahm, E., Do, H.H.: Data cleaning: problems and current approaches. IEEE Data Eng. Bull. 23(4), 3–13 (2000)Google Scholar