Abstract
Numbers are not data, and data analysis does not necessarily produce information or knowledge. Statistics, data mining, and artificial intelligence are disciplines focused on extracting knowledge from data. They provide tools for testing hypotheses, predicting new observations, quantifying population effects, and summarizing data efficiently. In these fields, measurable data are used to derive knowledge. However, a clean, exact, and complete dataset that is analyzed professionally might still contain no useful information for the problem under investigation. The term Information Quality (InfoQ) was coined by Ref. [15] as the potential of a dataset to achieve a specific (scientific or practical) goal using a given data analysis method. InfoQ is a function of goal, data, data analysis, and utility. Eight dimensions that relate to these components help assess InfoQ: Data Resolution, Data Structure, Data Integration, Temporal Relevance, Generalizability, Chronology of Data and Goal, Construct Operationalization, and Communication. These eight dimensions can be used to develop streamlined evaluation metrics of InfoQ. We describe two studies in which InfoQ was integrated into research methods courses, guiding students in evaluating the InfoQ of prospective and retrospective studies. The results and feedback indicate the importance and usefulness of InfoQ and its eight dimensions for evaluating empirical studies.
References
Angst CM, Agarwal R, Kuruzovich J (2008) Bid or buy? Individual shopping traits as predictors of strategic exit in on-line auctions. Int J Electron Commer 13:59–84
Bapna R, Goes P, Gupta A, Jin Y (2004) User heterogeneity and its impact on electronic auction market design: an empirical exploration. MIS Q 28(1):21
Bapna R, Jank W, Shmueli G (2008) Price formation and its dynamics in online auctions. Decis Support Syst 44:641–656
Berthold MR, Borgelt C, Höppner F, Klawonn F (2010) Guide to intelligent data analysis. Springer, London
Borle S, Boatwright P, Kadane JB (2006) The timing of bid placement and extent of multiple bidding: an empirical investigation using eBay online auctions. Stat Sci 21:194–205
Deming WE (1953) On the distinction between enumerative and analytic studies. J Am Stat Assoc 48:244–255
Figini S, Kenett RS, Salini S (2010) Integrating operational and financial risk assessments. Qual Reliab Eng Int 26(8):887–897
Ghani R, Simmons H (2004) Predicting the end-price of online auctions. Pisa, Italy
Giovanni E (2008) Understanding economic statistics. Technical report
Godfrey AB (2008) Eye on data quality. Six Sigma Forum Magazine, pp 5–6
Hand DJ (2008) Statistics: a very short introduction. Oxford University Press, Oxford
Jank W, Shmueli G (2010) Modeling online auctions. Wiley, Hoboken
Katkar R, Reiley DH (2006) Public versus secret reserve prices in eBay auctions: results from a Pokémon field experiment. Advances in Economic Analysis and Policy 6(2), Article 7, 1–23
Kenett RS, Coleman S, Ograjenšek I (2010) On quality research: an application of InfoQ to the PhD research process. In: Proceedings of the European network for business and industrial statistics (ENBIS) 10th annual conference on business and industrial statistics, Antwerp, Belgium, September 2010
Kenett RS, Shmueli G (2013) On information quality. J Roy Stat Soc Ser A, forthcoming
Kenett RS, Thyregod P (2006) Aspects of statistical consulting not taught by academia. Stat Neerl 60:396–412
Lucking-Reiley D, Bryan D, Prasad N, Reeves D (2007) Pennies from eBay: the determinants of price in online auctions. J Ind Econ 55:223–233
Mallows C (1998) The zeroth problem. Am Stat 52:1–9
Patzer GL (2005) Using secondary data in marketing research. Praeger, Westport
Russom P (2011) Big data analytics. Technical report, Q4
Shmueli G (2010) To explain or to predict? Stat Sci 25:289–310
Shmueli G, Koppius OR (2011) Predictive analytics in information systems research. Manag Inf Syst Q 35:553–572
Tukey JW (1977) Exploratory data analysis. Addison Wesley, New York
Acknowledgments
We thank Professors Joel Greenhouse (Carnegie Mellon University), Shirley Coleman (Newcastle University), and Irena Ograjenšek (University of Ljubljana) for supporting the integration of InfoQ into graduate courses at CMU and the University of Ljubljana, and for helping assess its impact.
Copyright information
© 2013 Springer Science+Business Media Dordrecht
About this paper
Cite this paper
Shmueli, G., Kenett, R. (2013). An Information Quality (InfoQ) Framework for Ex-Ante and Ex-Post Evaluation of Empirical Studies. In: Uden, L., Wang, L., Hong, TP., Yang, HC., Ting, IH. (eds) The 3rd International Workshop on Intelligent Data Analysis and Management. Springer Proceedings in Complexity. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-7293-9_1
DOI: https://doi.org/10.1007/978-94-007-7293-9_1
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-7292-2
Online ISBN: 978-94-007-7293-9
eBook Packages: Physics and Astronomy, Physics and Astronomy (R0)