Abstract
Empirical Software Engineering (ESE) roots back to the 1970s and has since then gained growing recognition as the standard approach to scientific inquiry in the context of software engineering. Many different quantitative and qualitative research methods have been described and supplied with guidelines and checklists and several books have been written about good practice in ESE. With the emerging amount of data being produced during software development, a new paradigm of scientific inquiry has gained much attention, i.e., Data Science (DS). The goal of this chapter is to discuss whether DS could replace traditional ESE or, if it does not replace it, how traditional ESE could benefit from adopting DS practices—and vice versa. In this chapter, we first give some general background information about ESE and DS, then we describe in more detail how both paradigms are typically used in the context of software engineering research and what are their respective strengths and weaknesses. Finally, we illustrate with the help of an industry-driven case example how both paradigms, ESE and DS, could benefit from each other if used in combination.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Alahyari H, Gorschek T, Svensson RB (2019) An exploratory study of waste in software development organizations using agile or lean approaches: a multiple case study at 14 organizations. Inf Softw Technol 105:78–94
Basili VR (1985) Quantitative evaluation of software methodology. In: Keynote address, proceedings of the first pan pacific computer conference, Melbourne, pp 379–398
Basili VR, Caldiera G, Rombach DH (1994) The goal question metric approach. Wiley, Hoboken, pp 528–532
Basili VR, Caldiera G, Rombach HD (2001) The experience factory. In: Marciniak J (ed) Encyclopedia of software engineering. Wiley, Hoboken
Basili VR, Rombach HD, Schneider K, Kitchenham B, Pfahl D, Selby RW (2007) Empirical software engineering issues. Critical assessment and future directions. In: International workshop, Dagstuhl Castle, June, 2006. Revised Papers. LNCS 4336, Springer, Berlin, pp 26–30
Bird C, Menzies T, Zimmermann T (2015) The art and science of analyzing software data. Elsevier, Amsterdam
Blackstone A (2012) Principles of sociological inquiry–Qualitative and quantitative methods. BC open textbook collection, Open Textbook Library
Boehm B, Rombach HD, Zelkowitz MV (2010) Foundations of empirical software engineering: the legacy of Victor R, 1st edn. Springer; Basili Publishing Company, Berlin
Briand LC, Differding CM, Rombach HD (1996) Practical guidelines for measurement-based process improvement. Softw Process Improv Prac 2:253–280
Clark KB, Wheelwright SC (1993) Managing new product and process development: text and cases. The Free Press, New York
Cohn M (2005) Agile estimating and planning. Pearson Education, London
Fayyad U, Piatetsky-Shapiro G, Smyth P (1996) From data mining to knowledge discovery in databases. AI Mag 17:37–37
Hannay J, Sjøberg D, Dybå TA (2007) Systematic review of theory use in software engineering experiments. IEEE Trans Softw Eng 33:87–107
Hayashi C (1998) What is data science? Fundamental concepts and a heuristic example. In: Hayashi C, Yajima K, Bock H-H, Ohsumi N, Tanaka Y, Baba Y (eds) Data science, classification, and related methods. Studies in classification, data analysis, and knowledge organization. Springer, Berlin, pp 40–51
Hayashi C, Yajima K, Bock H-H, Ohsumi N, Tanaka Y, Baba YDS (eds) (1998) Classification, and related methods, studies in classification, data analysis, and knowledge organization. Springer, Berlin
Kitchenham B, Charters S (2007) Guidelines for performing systematic literature reviews in software engineering. Technical report, EBSE-2007-01, School of Computer Science and Mathematics, Keele University
Kitchenham B, Pickard L, Pfleeger SL (1995) Case studies for method and tool evaluation. IEEE Softw 12(4):52–62
Kitchenham BA, Dybå T, Jorgensen M (2004) Evidence-based software engineering. In: Proceedings of the 26th international conference on software engineering (ICSE ’04), pp 273–281
McCollum JK, Sherman JD (1991) The effects of matrix organization size and number of project assignments on performance. IEEE Trans Eng Manag 38(1):75–78
Menzies T, Kocaguneli E, Turhan B, Minku L, Peters F (2014) Sharing data and models in software engineering. Morgan Kaufmann, Burlington
Menzies T, Williams L, Zimmermann T (2016) Perspectives on data science for software engineering. Morgan Kaufmann, Burlington
Morgan JM, Liker JK (2006) The Toyota product development system: integrating people, process and technology. Productivity Press, New York
Petersen K, Feldt R, Mujtaba S, Mattsson M (2008) Systematic mapping studies in software engineering. In: Visaggio G, Baldassarre MT, Linkman S, Turner M (eds) Proceedings of the 12th international conference on evaluation and assessment in software engineering (EASE’08). BCS Learning & Development Ltd., Swindon, pp 68–77
Poppendieck M, Poppendieck T (2003) Lean software development: an agile toolkit: an agile toolkit. Addison Wesley, Boston
Runeson P, Höst M (2009) Guidelines for conducting and reporting case study research in software engineering. Empir Softw Eng 14(2):2009
Seaman CB (1999) Qualitative methods in empirical studies of software engineering. IEEE Trans Softw Eng 25(4):1999
Shull FJ, Singer J (2007) Sjøberg, i. In: ‘K’, guide to advanced empirical software engineering. Springer, Berlin
Stol K, Fitzgerald BC (2017) Continuous software engineering: a roadmap and agenda. J Syst Softw 123:176–189
Stol K, Fitzgerald B (2018) The ABC of software engineering research. ACM Trans Softw Eng Methodol 27:3
Vasilescu B, Blincoe K, Xuan Q, Casalnuovo C, Damian D, Devanbu P, Filkov V (2016) The sky is not the limit: multitasking across GitHub projects. In: IEEE/ACM 38th international conference on software engineering (ICSE). IEEE, Piscataway, pp 994–1005
Wallace WL (1971) The logic of science in sociology. Aldine, New York
Acknowledgements
This research was partly supported by the institutional research grant IUT20-55 of the Estonian Research Council and the Estonian Centre of Excellence in ICT Research (EXCITE).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Scott, E., Milani, F., Pfahl, D. (2020). Data Science and Empirical Software Engineering. In: Felderer, M., Travassos, G. (eds) Contemporary Empirical Methods in Software Engineering. Springer, Cham. https://doi.org/10.1007/978-3-030-32489-6_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-32489-6_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32488-9
Online ISBN: 978-3-030-32489-6
eBook Packages: Computer ScienceComputer Science (R0)