Abstract
Data Quality assessment is aimed at evaluating the suitability of a dataset for an intended task. The extensive literature on data quality describes the various methodologies for assessing data quality by means of data profiling techniques of the whole datasets. Our investigations are aimed to provide solutions to the need of automatically assessing the level of quality of the records of a dataset, where data profiling tools do not provide an adequate level of information. As most of the times, it is easier to describe when a record has quality enough than calculating a qualitative indicator, we propose a semi-automatically business rule-guided data quality assessment methodology for every record. This involves first listing the business rules that describe the data (data requirements), then those describing how to produce measures (business rules for data quality measurements), and finally, those defining how to assess the level of data quality of a data set (business rules for data quality assessment). The main contribution of this paper is the adoption of the OMG standard DMN (Decision Model and Notation) to support the data quality requirement description and their automatic assessment by using the existing DMN engines.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Batini, C., Cappiello, C., Francalanci, C., Maurino, A.: Methodologies for data quality assessment and improvement. ACM Comput. Surv. (CSUR) 41(3), 16 (2009)
Batini, C., Scannapieco, M.: Data Quality: Concepts, Methodologies and Tea niques. Data-Centric Systems and Applications. Springer, New York (2006). https://doi.org/10.1007/3-540-33173-5
Ceravolo, P., et al.: Big data semantics. J. Data Semant. 7(2), 65–85 (2018)
Dangarska, Z., Figl, K., Mendling, J.: An explorative analysis of the notational characteristics of the decision model and notation (DMN). In: 20th IEEE International Enterprise Distributed Object Computing Workshop, EDOC Workshops 2016, Vienna, Austria, 5–9 September 2016, pp. 1–9 (2016)
Dasseville, I., Janssens, L., Janssens, G., Vanthienen, J., Denecker, M.: Combining DMN and the knowledge base paradigm for flexible decision enactment. In: Supplementary Proceedings of the RuleML 2016 Challenge, 10th International Web Rule Symposium, RuleML 2016, New York, USA, 6–9 July 2016 (2016). http://ceur-ws.org/Vol-1620/paper3.pdf
Figl, K., Mendling, J., Tokdemir, G., Vanthienen, J.: What we know and what we do not know about DMN. Enterp. Model. Inf. Syst. Arch. 13(2), 1–16 (2018)
Hasic, F., Craemer, A.D., Hegge, T., Magala, G., Vanthienen, J.: Measuring the complexity of DMN decision models. In: Business Process Management Workshops - BPM 2018 International Workshops, Sydney, NSW, Australia, 9–14 September 2018, Revised Papers, pp. 514–526 (2018)
Hasić, F., De Smedt, J., Vanthienen, J.: Towards assessing the theoretical complexity of the decision model and notation (DMN). In: CEUR Workshop Proceedings, vol. 1859, pp. 64–71. CEUR Workshop Proceedings (2017). https://lirias.kuleuven.be/1548429?limo=0
Heinrich, B., Klier, M.: Metric-based data quality assessment - developing and evaluating a probability-based currency metric. Decis. Support. Syst. 72, 82–96 (2015)
ISO-25012: Iso/IEC 25012: Software engineering-software product quality requirements and evaluation (square)-data quality model (2008)
Janssens, L., Bazhenova, E., Smedt, J.D., Vanthienen, J., Denecker, M.: Consistent integration of decision (DMN) and process (BPMN) models. In: Proceedings of the CAiSE 2016 Forum, at the 28th International Conference on Advanced Information Systems Engineering (CAiSE 2016), Ljubljana, Slovenia, 13–17 June 2016, pp. 121–128 (2016). http://ceur-ws.org/Vol-1612/paper16.pdf
Lee, S., Ludäscher, B., Glavic, B.: PUG: a framework and practical implementation for why and why-not provenance. VLDB J. 28(1), 47–71 (2019)
Loshin, D.: The Practitioner’s Guide to Data Quality Improvement. Elsevier, Amsterdam (2010)
OMG: Business process model and notation (2017). http://www.omg.org/spec/BPMN/2.0
OMG: Decision Model and Notation (DMN), Version 1.2, January 2019. https://www.omg.org/spec/DMN
Parody, L., Gómez-López, M.T., Bermejo, I., Caballero, I., Gasca, R.M., Piattini, M.: PAIS-DQ: extending process-aware information systems to support data quality in PAIS life-cycle. In: Tenth IEEE International Conference on Research Challenges in Information Science, RCIS 2016, Grenoble, France, 1–3 June 2016, pp. 1–12 (2016)
Pérez-Álvarez, J.M., Gómez-López, M.T., Parody, L., Gasca, R.M.: Process instance query language to include process performance indicators in DMN. In: 20th IEEE International Enterprise Distributed Object Computing Workshop, EDOC Workshops 2016, Vienna, Austria, 5–9 September 2016, pp. 1–8 (2016)
Sebastian-Coleman, L.: Measuring data quality for ongoing improvement: a dataquality assessment framework. Newnes (2012)
Wang, R.Y.: A product perspective on total data quality management. Commun. ACM 41(2), 58–65 (1998)
Wang, R.Y., Reddy, M.P., Kon, H.B.: Toward quality data: an attribute-based approach. Decis. Support. Syst. 13(3–4), 349–372 (1995)
Woodall, P., Oberhofer, M., Borek, A.: A classification of data quality assessment and improvement methods. Int. J. Inf. Qual. 3(4), 298–321 (2014)
Acknowledge
This work has been partially funded by the Ministry of Science and Technology of Spain ECLIPSE (RTI2018-094283-B-C33) and (RTI2018-094283-B-C31) projects, the Junta de Andalucía via the PIRAMIDE and METAMORFOSIS projects, the European Regional Development Fund (ERDF/FEDER), GEMA: Generation and Evaluation of Models for dAta Quality (Ref.: SBPLY/17/180501/000293), and the Cátedra de Telefónica “Inteligencia en la Red” of the Universidad de Sevilla.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Valencia-Parra, Á., Parody, L., Varela-Vaca, Á.J., Caballero, I., Gómez-López, M.T. (2019). DMN for Data Quality Measurement and Assessment. In: Di Francescomarino, C., Dijkman, R., Zdun, U. (eds) Business Process Management Workshops. BPM 2019. Lecture Notes in Business Information Processing, vol 362. Springer, Cham. https://doi.org/10.1007/978-3-030-37453-2_30
Download citation
DOI: https://doi.org/10.1007/978-3-030-37453-2_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37452-5
Online ISBN: 978-3-030-37453-2
eBook Packages: Computer ScienceComputer Science (R0)