Advertisement

European Journal of Information Systems

, Volume 26, Issue 1, pp 84–115 | Cite as

Supporting the design of data integration requirements during the development of data warehouses: a communication theory-based approach

  • Christoph Rosenkranz
  • Roland Holten
  • Marc Räkers
  • Wolf Behrmann
Empirical Research

Abstract

Data warehouses (DW) form the backbone of data integration that is necessary for analytical applications, and play important roles in the information technology landscape of many industries. We introduce an approach for addressing the fundamental problem of semantic heterogeneity in the design of data integration requirements during DW development. In contrast to ontology-driven or schema-matching approaches, which propose the automatic resolution of differences ex-post, our approach addresses the core problem of data integration requirements: understanding and resolving different contextual meanings of data fields. We ground the approach firmly in communication theory and build on practices from agile software development. Besides providing relevant insights for the design of data integration requirements, our findings point to communication theory as a sound underlying foundation for a design theory of information systems development.

Keywords

common ground communication theory data integration requirements data warehouse development agile software development information systems development 

Notes

Acknowledgements

The authors would like to thank the editors and the three anonymous reviewers for their constructive feedback and suggestions that helped to considerably improve the paper. The conference audience at ICIS 2010 also contributed with valuable comments to an earlier version. In addition, we would like to thank zeb.rolfes.schierenbeck.associates gmbh, and especially Mr. Sven Krämer, for his help in data access and implementing the artifacts.

References

  1. Abrahamsson P, Salo O, Ronkainen J and Warsta J (2002) Agile software development methods: review and analysis. VTT Technical Report, Oulu, Finland.Google Scholar
  2. Ågerfalk PJ (2004) Investigating actability dimensions: a language/action perspective on criteria for information systems evaluation. Interacting with Computers 16(5), 957–988.CrossRefGoogle Scholar
  3. Ågerfalk PJ (2014) Insufficient theoretical contribution: a conclusive rationale for rejection&quest. European Journal of Information Systems 23(6), 593–599.CrossRefGoogle Scholar
  4. Agrawal R et al (2009) The claremont report on database research. Communications of the ACM 52(6), 56–65.CrossRefGoogle Scholar
  5. Baskerville R, Pries-Heje J and Venable J (2009) Soft design science methodology. In 4th International Conference on Design Science Research in Information Systems and Technology, ACM, Philadelphia, PA.Google Scholar
  6. Beck K et al (2001) The agile manifesto. [WWW document] http://www.agilemanifesto.org/ (accessed 20 September 2015).
  7. Bergman M, Lyytinen K and Mark G (2007) Boundary objects in design: an ecological view of design artifacts. Journal of the Association for Information Systems 8(11), 546–568.Google Scholar
  8. Bernstein PA, Melnik S, Michalis P and Quix C (2004) Industrial-strength schema matching. ACM SIGMOD Record 33(4), 38–43.CrossRefGoogle Scholar
  9. Beynon-Davies P (2010) The enactment of significance: a unified conception of information, systems and technology. European Journal of Information Systems 19(4), 389–408.CrossRefGoogle Scholar
  10. BITS (2005) Bits Operational Risk Management Working Group. Reconciliation of Regulatory Overlap for the Management and Supervision of Operational Risk in U.S. Financial Institutions: Improving Compliance Efficiencies by Minimizing Redundancy, The Financial Services Roundtable BITS, Washington DC.Google Scholar
  11. Black SE, Boca PP, Bowen JP, Gorman J and Hinchey MG (2009) Formal versus agile: survival of the fittest. IEEE Computer 49(9), 39–45.Google Scholar
  12. Bouchard TJ (1976) Field research methods: interviewing, questionnaires, participant observation, systematic observation, unobtrusive measures. In Handbook of Lndustrial and Organizational Psychology (Dunnette MD Ed) Rand McNally, Chicago, IL.Google Scholar
  13. Bromme R and Jucks R (2001) Wissensdivergenz und Kommunikation: Lernen zwischen Experten und Laien im Netz. In Partizipation und interaktion im virtuellen seminar (Hesse FW and Friedrich HF, Eds), pp 81–103, Waxmann, Münster, Germany.Google Scholar
  14. Bromme R, Jucks R and Rambow R (2004) Experten-Laien-Kommunikation im Wissensmanagement. In Der Mensch im Wissensmanagement: Psychologische Konzepte zum besseren Verständnis und Umgang mit Wissen (Reinmann G and Mandl H, Eds), pp 176–188, Hogrefe, Göttingen, Germany.Google Scholar
  15. Bromme R, Jucks R and Runde A (2005) Barriers and biases in computer-mediated expert-layperson-communication. In Barriers and Biases in Computer-Mediated Knowledge Communication (Bromme R, Hesse FW and Spada H, Eds), pp 89–118, Springer, New York.CrossRefGoogle Scholar
  16. Brown-Schmidt S (2012) Beyond common and privileged: gradient representations of common ground in real-time language use. Language and Cognitive Processes 27(1), 62–89.CrossRefGoogle Scholar
  17. Clark HH (1992) Arenas of Language Use. Chicago University Press, Chicago, IL.Google Scholar
  18. Clark HH (1996) Using Language. Cambridge University Press, New York, NY.CrossRefGoogle Scholar
  19. Clark HH (2005) Coordinating with each other in a material world. Discourse Studies 7(4–5), 507–525.CrossRefGoogle Scholar
  20. Clark HH and Brennan SE (1991) Grounding in communication. In Perspectives on Socially Shared Cognition (Resnick LB, Levine JM and Teasley SD, Eds), pp 127–149, APA, Washington DC.CrossRefGoogle Scholar
  21. Clark HH and Krych MA (2004) Speaking while monitoring addressees for understanding. Journal of Memory and Language 50(1), 62–81.CrossRefGoogle Scholar
  22. Clark HH and Marshall CR (1981) Definite reference and mutual knowledge. In Elements of Discourse Understanding (Koshi AK, Webber B and Sag IA, Eds), pp 10–63, Cambridge University Press, Cambridge.Google Scholar
  23. Cockburn A and Highsmith J (2001) Agile software development: the people factor. IEEE Computer 34(11), 131–133.CrossRefGoogle Scholar
  24. Convertino G, Mentis HM, Rosson MB, Carroll JM, Slavkovic A and Ganoe CH (2008) Articulating common ground in cooperative work: content and process. In Proceeding of the Annual SIGCHI Conference: Human Factors in Computing Systems pp 1637–1646, ACM, Florence, Italy.Google Scholar
  25. Convertino G, Mentis HM, Alex YWT, Rosson MB and Carroll JM (2007a) How does common ground increase? In 2007 International ACM Conference on Supporting Group Work, pp 225–228, ACM, Sanibel Island, FL.Google Scholar
  26. Convertino G, Mentis HM, Rosson MB, Slavkovic A and Carroll JM (2009) Supporting content and process common ground in computer-supported teamwork. In 27th International Conference on Human Factors in Computing Systems, pp 2339–2348, ACM, Boston, MA.Google Scholar
  27. Convertino G, Mentis HM, Slavkovic A, Rosson MB and Carroll JM (2011) Supporting common ground and awareness in emergency management planning: a design research project. In ACM Transactions on Computer-Human Interaction 18(4), 1–34.CrossRefGoogle Scholar
  28. Convertino G, Zhang Y and Asti B (2006) Board-based collaboration questionnaire. [WWW document] http://cscl.ist.psu.edu/public/projects/crossculturecollab/quest.html (accessed 1 July 2010).
  29. Convertino G, Zhang Y, Asti B, Rosson MB and Mohammed S (2007b) Board-based collaboration in cross-cultural pairs. In 1st International Conference on Intercultural Collaboration, Springer, Kyoto, Japan.Google Scholar
  30. Corr L and Stagnitto J (2011) Agile Data Warehouse Design: Collaborative Dimensional Modeling, From Whiteboard to Star Schema. DecisionOne Press, Leeds, UK.Google Scholar
  31. Corvera Charaf M, Rosenkranz C and Holten R (2013) The emergence of shared understanding: applying functional pragmatics to study the requirements development process. Information Systems Journal 23(2), 115–135.CrossRefGoogle Scholar
  32. Couture N (2014) Best practices for adopting agile methodologies for data warehouse development. Business Inteligence Journal 18(2), 8–17.Google Scholar
  33. Cramton CD (2001) The mutual knowledge problem and its consequences for dispersed collaboration. Organization Science 12(3), 346–371.CrossRefGoogle Scholar
  34. Davidson EJ (1999) Joint application design (jad) in practice. Journal of Systems and Software 45(3), 215–223.CrossRefGoogle Scholar
  35. Davis FD (1989) Perceived usefulness, perceived ease of use, and user acceptance of information technology. MIS Quarterly 13(3), 319–340.CrossRefGoogle Scholar
  36. Dibbern J, Winkler J and Heinzl A (2008) Explaining variations in client extra costs between software projects offshored to India. MIS Quarterly 32(2), 333–366.Google Scholar
  37. Do H-H and Rahm E (2007) Matching large schemas: approaches and evaluation. Information Systems 32(6), 857–885.CrossRefGoogle Scholar
  38. Dubé L and Paré G (2003) Rigor in information systems positivist case research: current practices, trends, and recommendations. MIS Quarterly 27(4), 597–635.Google Scholar
  39. Dybå T and Dingsøyr T (2009) What do we know about agile software development? IEEE Software 26(5), 6–9.CrossRefGoogle Scholar
  40. Färber F, Cha SK, Primsch J, Bornhövd C, Sigg S and Lehner W (2012) Sap hana database: data management for modern business applications. ACM SIGMOD Record 40(4), 45–51.CrossRefGoogle Scholar
  41. Finkelstein C (1989) An Introduction to Information Engineering. From Strategic Planning to Information Systems. Addison-Wesley Publishing, Reading, MA.Google Scholar
  42. Fitzgerald B, Hartnett G and Conboy K (2006) Customising agile methods to software practices at intel shannon. European Journal of Information Systems 15(2), 200–213.CrossRefGoogle Scholar
  43. Flanagan JC (1954) The critical incident technique. Psychological Bulletin 51(4), 327–358.CrossRefGoogle Scholar
  44. Frank MC and Goodman ND (2012) Predicting pragmatic reasoning in language games. Science 336(6084), 998.CrossRefGoogle Scholar
  45. Garrod S and Anderson A (1987) Saying what you mean in dialogue: a study in conceptual and semantic co-ordination. Cognition 27(2), 181–218.CrossRefGoogle Scholar
  46. Giorgini P, Rizzi S and Garzetti M (2008) Grand: a goal-oriented approach to requirement analysis in data warehouses. Decision Support Systems 45(1), 4–21.CrossRefGoogle Scholar
  47. Goede R and Huisman M (2010) The suitability of agile systems development methodologies for data warehouse development. In International Conference on Information Management Evaluation, pp 99–106, Academic Conferences Limited, UK.Google Scholar
  48. Golfarelli M, Rizzi S, Turricchia E, Cuzzocrea A and Dayal U (2011) Modern Software Engineering Methodologies Meet Data Warehouse Design: 4wd Data Warehousing and Knowledge Discovery, pp 66–79, Springer, Berlin,Heidelberg, Germany.CrossRefGoogle Scholar
  49. Gregor S and Hevner AR (2013) Positioning and presenting design science research for maximum impact. MIS Quarterly 37(2), 337–355.Google Scholar
  50. Gregor S and Jones D (2007) The anatomy of a design theory. Journal of the Association for Information Systems 8(5), 312–335.Google Scholar
  51. Guarino N, Ed (1998) Formal ontology and information systems. In FOIS’98, pp 3–15, IOS Press, Trento, Italy.Google Scholar
  52. Guest G, Bunce A and Johnson L (2006) How many interviews are enough? Field Methods 18(1), 59–82.CrossRefGoogle Scholar
  53. Gupta R, Tranel D and Duff MC (2012) Ventromedial prefrontal cortex damage does not impair the development and use of common ground in social interaction: implications for cognitive theory of mind. Neuropsychologia 50(1), 145–152.CrossRefGoogle Scholar
  54. Halevy A (2005) Why your data won’t mix. Queue 3(8), 50–58.CrossRefGoogle Scholar
  55. Halevy A, Rajaraman A and Ordille J (2006) Data integration: the teenage years. In 32nd International Conference on Very Large Data Bases (VLDB 2006), VLDB Endowment, Seoul, Korea.Google Scholar
  56. Hanna JE, Tanenhaus MK and Trueswell JC (2003) The effects of common ground and perspective on domains of referential interpretation. Journal of Memory and Language 49(1), 43–61.CrossRefGoogle Scholar
  57. Henderson R, Podd J, Smith M and Varela-Alvarez H (1995) An examination of four user-based software evaluation methods. Interacting with Computers 7(4), 412–432.CrossRefGoogle Scholar
  58. Herbsleb D and Mockus A (2003) An empirical study of speed and communication in globally-distributed software development. IEEE Transactions on Software Engineering 29(6), 1–14.CrossRefGoogle Scholar
  59. Hevner A, March S, Park J and Ram S (2004) Design science in information systems research. MIS Quarterly 28(1), 75–105.Google Scholar
  60. Highsmith J and Cockburn A (2001) Agile software development: the business of innovation. IEEE Computer 34(9), 120–122.CrossRefGoogle Scholar
  61. Holten R (2003) Specification of management views in information warehouse projects. Information Systems 28(7), 709–751.CrossRefGoogle Scholar
  62. Holten R and Rosenkranz C (2011) Designing viable social systems: the role of linguistic communication for self-organization. Kybernetes 40(3/4), 559–580.CrossRefGoogle Scholar
  63. Honkela T, Könönen V, Lindh-Knuutila T and Paukkeri M-S (2008) Simulating processes of concept formation and communication. Journal of Economic Methodology 15(3), 245–259.CrossRefGoogle Scholar
  64. Hughes R (2008) Agile Data Warehousing: Delivering World-Class Business Intelligence Systems Using Scrum and xp. IUniverse, Bloomington, IN.Google Scholar
  65. Hummel M, Rosenkranz C and Holten R (2013) The role of communication in agile systems development: an analysis of the state-of-the-art. Business & Information Systems Engineering 5(5), 343–355.CrossRefGoogle Scholar
  66. Hummel M, Rosenkranz C and Holten R (2015) The role of social agile practices for direct and indirect communication in information systems development teams. Communications of the Association for Information Systems 36, 273–300.Google Scholar
  67. Hwang MI and Xu H (2008) The effect of implementation factors on data warehousing success: an exploratory study. Journal of Information, Information Technology, and Organizations 2, 1–14.Google Scholar
  68. Inmon WH (2005) Building the Data Warehouse. Wiley, Indianapolis, IN.Google Scholar
  69. Jucks R, Becker B-M and Bromme R (2008) Lexical entrainment in written discourse: is experts’ word use adapted to the addressee? Discourse Processes 45(6), 497–518.CrossRefGoogle Scholar
  70. Kankanhalli A, Tan BCY and Wei KK (2005) Contributing knowledge to electronic repositories: an empirical investigation. MIS Quarterly 29(1), 113–143.Google Scholar
  71. Karhatsu H, Ikonen M, Kettunen P, Fagerholm F and Abrahamsson P (2010) Building blocks for self-organizing software development teams a framework model and empirical pilot study. In International Conference on Software Technology and Engineering, pp 297–304, IEEE, San Juan, Puerto Rico.Google Scholar
  72. Kimball R and Caserta J (2004) The Data Warehouse etl toolkit, Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data. Wiley, Indianapolis, IN.Google Scholar
  73. Korkala M, Abrahamsson P and Kyllonen P (2006) A case study on the impact of customer communication on defects in agile software development. In AGILE 2006 (Chao J, Cohn M, Maurer F, Sharp H and Shore J, Eds), pp 76–88, IEEE, Minneapolis, MN.CrossRefGoogle Scholar
  74. Lee AS (1989) A scientific methodology for mis case studies. MIS Quarterly 13(1), 32–50.CrossRefGoogle Scholar
  75. Lee AS (1991) Integrating positivist and interpretive approaches to organizational research. Organization Science 2(4), 342–365.CrossRefGoogle Scholar
  76. Lee G and Xia W (2010) Toward agile: an integrated analysis of quantitative and qualitative field data. MIS Quarterly 34(1), 87–114.Google Scholar
  77. Loftus E (1975) Leading questions and the eye witness report. Cognitive Psychology 7(4), 560–572.CrossRefGoogle Scholar
  78. Lyytinen K and Newman M (2008) Explaining information systems change: a punctuated socio-technical change model. European Journal of Information Systems 17(6), 589–613.CrossRefGoogle Scholar
  79. Malinowski E and Zimányi E (2008) Advanced Data Warehouse Design: From Conventional to Spatial and Temporal Applications. Springer, Berlin, Germany.Google Scholar
  80. March ST and Hevner AR (2007) Integrated decision support systems: a data warehousing perspective. Decision Support Systems 43(3), 1031–1043.CrossRefGoogle Scholar
  81. March TS and Smith G (1995) Design and natural science research on information technology. Decision Support Systems 15(4), 251–266.CrossRefGoogle Scholar
  82. Martin J (1991) Rapid Application Development. Macmillan Publishing Co., Inc., Indianapolis, IN.Google Scholar
  83. Mastrogiacomo S, Missonier S and Bonazzi R (2014) Talk before it’s too late: reconsidering the role of conversation in information systems project management. Journal of Management Information Systems 31(1), 44–78.CrossRefGoogle Scholar
  84. Mazón J-N, Pardillo J and Trujillo J (2007) A model-driven goal-oriented requirement engineering approach for data warehouses. In Advances in Conceptual Modeling – Foundations and Applications, ( Hainaut J-L et al. Eds), pp 255–264, Springer in Berin, Germany.Google Scholar
  85. Mazón J-N and Trujillo J (2008) An MDA approach for the development of data warehouses. Decision Support Systems 45(1), 41–58.CrossRefGoogle Scholar
  86. Miles MB and Huberman AM (1994) Qualitative Data Analysis: A Sourcebook of New Methods. Sage, Beverly Hills, CA.Google Scholar
  87. Mingers J and Willcocks L (2014) An integrative semiotic framework for information systems: the social, personal and material worlds. Information and Organization 24(1), 48–70.CrossRefGoogle Scholar
  88. Mishler EG (1986) Research Interviewing: Context and Narrative. Harvard University Press, Cambridge, MA.Google Scholar
  89. Narayanan LV (2008) Data warehousing and analytics in banking: concepts. In Data Warehousing and Mining: Concepts, Methodologies, Tools, and Applications (Wang J, Ed), pp 1825–1839, IGI Global, Hershey, PA.CrossRefGoogle Scholar
  90. Nelson RR (2007) It project management: infamous failures, classic mistakes, and best practices. MIS Quarterly Executive 6(2), 67–78.Google Scholar
  91. Nidumolu S (1995) The effect of coordination and uncertainty on software project performance: residual performance risk as an intervening variable. Information Systems Research 6(3), 191–219.CrossRefGoogle Scholar
  92. Nielsen J and Landauer TK (1993) A mathematical model of the finding of usability problems. In INTERACT ‘93 and Conference on Human Factors in Computing Systems (INTERCHI’93), pp 206–213, Amsterdam, The Netherlands.Google Scholar
  93. Peffers K, Tuunanen T, Rothenberger MA and Chatterjee S (2007) A design science research methodology for information systems research. Journal of Management Information Systems 24(3), 45–77.CrossRefGoogle Scholar
  94. Pickering MJ and Garrod S (2004) Toward a mechanistic psychology of dialogue. Behavioral and Brain Sciences 27(2), 169–226.Google Scholar
  95. Pikkarainen M, Haikara J, Salo O, Abrahamsson P and Still J (2008) The impact of agile practices on communication in software development. Empirical Software Engineering 13(3), 303–337.CrossRefGoogle Scholar
  96. Plattner H and Zeier A (2012) In-Memory Data Management. Springer, Berlin,Heidelberg, Germany.CrossRefGoogle Scholar
  97. Rahm E and Bernstein PA (2001) A survey of approaches to automatic schema matching. The VLDB Journal 10(4), 334–350.CrossRefGoogle Scholar
  98. Ram S and Jinsoo P (2004) Semantic conflict resolution ontology (scrol): an ontology for detecting and resolving data and schema-level semantic conflicts. IEEE Transactions on Knowledge and Data Engineering 16(2), 189–202.CrossRefGoogle Scholar
  99. Rosenkranz C, Corvera Charaf M and Holten R (2013) Language quality in requirements development: tracing communication in the process of information systems development. Journal of Information Technology 28(3), 198–223.CrossRefGoogle Scholar
  100. Rosenkranz C, Vranesic H and Holten R (2014) Boundary interactions and motors of change in requirements elicitation: a dynamic perspective on knowledge sharing. Journal of the Association for Information Systems 15(6), 306–345.Google Scholar
  101. Saleem K, Bellahsene Z and Hunt E (2008) Porsche: performance oriented schema mediation. Information Systems 33(7–8), 637–657.CrossRefGoogle Scholar
  102. Sein MK, Henfridsson O, Purao S, Rossi M and Lindgren R (2011) Action design research. MIS Quarterly 35(1), 37–56.Google Scholar
  103. Sen A and Sinha AP (2005) A comparison of data warehousing methodologies. Communications of the ACM 48(3), 79–84.CrossRefGoogle Scholar
  104. Sherman R (2014) Business Intelligence Guidebook: From Data Integration to Analytics. Morgan Kaufman, Waltham, MA.Google Scholar
  105. Shvaiko P and Euzenat J (2005) A survey of schema-based matching approaches. In Journal on Data Semantics iv (Spaccapietra S, Ed), pp 146–171, , Springer Berlin, Heidelberg.Google Scholar
  106. Simperl EPB and Tempich C (2006) Ontology engineering: a reality check. In On the Move to Meaningful Internet Systems 2006: Coopis, doa, gada, and odbase (Meersman R and Tari Z, Eds), pp 836–854, Springer, Berlin, Germany.CrossRefGoogle Scholar
  107. Tylén K, Weed E, Wallentin M, Roepstorff A and Chris DF (2010) Language as a tool for interacting minds. Mind & Language 25(1), 3–29.CrossRefGoogle Scholar
  108. Vaishnavi VK and Kuechler W (2008) Design Science Research Methods and Patterns. Innovating Information and Communication Technology. Auerbach Publications, Taylor & Francis Group, Boca Raton, FL.Google Scholar
  109. Venkatesh V, Morris MG, Gordon BD and Davis FD (2003) User acceptance of information technology: toward a unified view. MIS Quarterly 27(3), 425–478.Google Scholar
  110. Vidgen R and Wang X (2009) Coevolving systems and the organization of agile software development. Information Systems Research 20(3), 355–376.CrossRefGoogle Scholar
  111. Wand Y and Wang R (1996) Anchoring data quality dimensions in ontological foundations. Communications of the ACM 39(11), 86–95.CrossRefGoogle Scholar
  112. Wang RY and Strong DM (1996) Beyond accuracy: what data quality means to data consumers. Journal of Management Information Systems 12(4), 5–33.CrossRefGoogle Scholar
  113. Watson HJ and Wixom BH (2007) The current state of business intelligence. Computer 40(9), 96–99.CrossRefGoogle Scholar
  114. Wetherbe JC (1991) Executive information requirements: getting it right. MIS Quarterly 15(1), 51–65.CrossRefGoogle Scholar
  115. Wixom BH and Todd PA (2005) A theoretical integration of user satisfaction and technology acceptance. Information Systems Research 16(1), 85–102.CrossRefGoogle Scholar
  116. Wixom BH and Watson HJ (2001) An empirical investigation of the factors affecting data warehousing success. MIS Quarterly 25(1), 17–41.CrossRefGoogle Scholar
  117. Yin RK (2003) Case Study Research: Design and Methods, 3rd edn, SAGE Publications, Thousand Oaks, CA.Google Scholar

Copyright information

© The OR Society 2016

Authors and Affiliations

  • Christoph Rosenkranz
    • 1
  • Roland Holten
    • 2
  • Marc Räkers
    • 3
  • Wolf Behrmann
    • 4
  1. 1.Faculty of Management, Economics and Social Sciences, University of CologneKölnGermany
  2. 2.Department of Economics and Business Administration, Goethe UniversityFrankfurt am MainGermany
  3. 3.zeb MünsterMünsterGermany
  4. 4.zeb FrankfurtFrankfurt am MainGermany

Personalised recommendations