Data Integration — Problems, Approaches, and Perspectives

  • Chapter
Conceptual Modelling in Information Systems Engineering

Abstract

Data integration is one of the older research fields in the database area; it emerged shortly after database systems were first introduced into the business world. In this paper, we briefly introduce the problem of integration and, from an architectural perspective, give an overview of approaches that address it. We discuss the evolution from structural to semantic integration and briefly present our own research on the SIRUP (Semantic Integration Reflecting User-specific semantic Perspectives) approach. Finally, we give an outlook on challenging areas of future research in data integration.

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Ziegler, P., Dittrich, K.R. (2007). Data Integration — Problems, Approaches, and Perspectives. In: Krogstie, J., Opdahl, A.L., Brinkkemper, S. (eds) Conceptual Modelling in Information Systems Engineering. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72677-7_3

  • DOI: https://doi.org/10.1007/978-3-540-72677-7_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-72676-0

  • Online ISBN: 978-3-540-72677-7

  • eBook Packages: Computer Science, Computer Science (R0)
