Skip to main content

DMN for Data Quality Measurement and Assessment

  • Conference paper
  • First Online:
Business Process Management Workshops (BPM 2019)

Abstract

Data Quality assessment is aimed at evaluating the suitability of a dataset for an intended task. The extensive literature on data quality describes the various methodologies for assessing data quality by means of data profiling techniques of the whole datasets. Our investigations are aimed to provide solutions to the need of automatically assessing the level of quality of the records of a dataset, where data profiling tools do not provide an adequate level of information. As most of the times, it is easier to describe when a record has quality enough than calculating a qualitative indicator, we propose a semi-automatically business rule-guided data quality assessment methodology for every record. This involves first listing the business rules that describe the data (data requirements), then those describing how to produce measures (business rules for data quality measurements), and finally, those defining how to assess the level of data quality of a data set (business rules for data quality assessment). The main contribution of this paper is the adoption of the OMG standard DMN (Decision Model and Notation) to support the data quality requirement description and their automatic assessment by using the existing DMN engines.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Batini, C., Cappiello, C., Francalanci, C., Maurino, A.: Methodologies for data quality assessment and improvement. ACM Comput. Surv. (CSUR) 41(3), 16 (2009)

    Article  Google Scholar 

  2. Batini, C., Scannapieco, M.: Data Quality: Concepts, Methodologies and Tea niques. Data-Centric Systems and Applications. Springer, New York (2006). https://doi.org/10.1007/3-540-33173-5

    Book  MATH  Google Scholar 

  3. Ceravolo, P., et al.: Big data semantics. J. Data Semant. 7(2), 65–85 (2018)

    Article  Google Scholar 

  4. Dangarska, Z., Figl, K., Mendling, J.: An explorative analysis of the notational characteristics of the decision model and notation (DMN). In: 20th IEEE International Enterprise Distributed Object Computing Workshop, EDOC Workshops 2016, Vienna, Austria, 5–9 September 2016, pp. 1–9 (2016)

    Google Scholar 

  5. Dasseville, I., Janssens, L., Janssens, G., Vanthienen, J., Denecker, M.: Combining DMN and the knowledge base paradigm for flexible decision enactment. In: Supplementary Proceedings of the RuleML 2016 Challenge, 10th International Web Rule Symposium, RuleML 2016, New York, USA, 6–9 July 2016 (2016). http://ceur-ws.org/Vol-1620/paper3.pdf

  6. Figl, K., Mendling, J., Tokdemir, G., Vanthienen, J.: What we know and what we do not know about DMN. Enterp. Model. Inf. Syst. Arch. 13(2), 1–16 (2018)

    Google Scholar 

  7. Hasic, F., Craemer, A.D., Hegge, T., Magala, G., Vanthienen, J.: Measuring the complexity of DMN decision models. In: Business Process Management Workshops - BPM 2018 International Workshops, Sydney, NSW, Australia, 9–14 September 2018, Revised Papers, pp. 514–526 (2018)

    Google Scholar 

  8. Hasić, F., De Smedt, J., Vanthienen, J.: Towards assessing the theoretical complexity of the decision model and notation (DMN). In: CEUR Workshop Proceedings, vol. 1859, pp. 64–71. CEUR Workshop Proceedings (2017). https://lirias.kuleuven.be/1548429?limo=0

  9. Heinrich, B., Klier, M.: Metric-based data quality assessment - developing and evaluating a probability-based currency metric. Decis. Support. Syst. 72, 82–96 (2015)

    Article  Google Scholar 

  10. ISO-25012: Iso/IEC 25012: Software engineering-software product quality requirements and evaluation (square)-data quality model (2008)

    Google Scholar 

  11. Janssens, L., Bazhenova, E., Smedt, J.D., Vanthienen, J., Denecker, M.: Consistent integration of decision (DMN) and process (BPMN) models. In: Proceedings of the CAiSE 2016 Forum, at the 28th International Conference on Advanced Information Systems Engineering (CAiSE 2016), Ljubljana, Slovenia, 13–17 June 2016, pp. 121–128 (2016). http://ceur-ws.org/Vol-1612/paper16.pdf

  12. Lee, S., Ludäscher, B., Glavic, B.: PUG: a framework and practical implementation for why and why-not provenance. VLDB J. 28(1), 47–71 (2019)

    Article  Google Scholar 

  13. Loshin, D.: The Practitioner’s Guide to Data Quality Improvement. Elsevier, Amsterdam (2010)

    Google Scholar 

  14. OMG: Business process model and notation (2017). http://www.omg.org/spec/BPMN/2.0

  15. OMG: Decision Model and Notation (DMN), Version 1.2, January 2019. https://www.omg.org/spec/DMN

  16. Parody, L., Gómez-López, M.T., Bermejo, I., Caballero, I., Gasca, R.M., Piattini, M.: PAIS-DQ: extending process-aware information systems to support data quality in PAIS life-cycle. In: Tenth IEEE International Conference on Research Challenges in Information Science, RCIS 2016, Grenoble, France, 1–3 June 2016, pp. 1–12 (2016)

    Google Scholar 

  17. Pérez-Álvarez, J.M., Gómez-López, M.T., Parody, L., Gasca, R.M.: Process instance query language to include process performance indicators in DMN. In: 20th IEEE International Enterprise Distributed Object Computing Workshop, EDOC Workshops 2016, Vienna, Austria, 5–9 September 2016, pp. 1–8 (2016)

    Google Scholar 

  18. Sebastian-Coleman, L.: Measuring data quality for ongoing improvement: a dataquality assessment framework. Newnes (2012)

    Google Scholar 

  19. Wang, R.Y.: A product perspective on total data quality management. Commun. ACM 41(2), 58–65 (1998)

    Article  Google Scholar 

  20. Wang, R.Y., Reddy, M.P., Kon, H.B.: Toward quality data: an attribute-based approach. Decis. Support. Syst. 13(3–4), 349–372 (1995)

    Article  Google Scholar 

  21. Woodall, P., Oberhofer, M., Borek, A.: A classification of data quality assessment and improvement methods. Int. J. Inf. Qual. 3(4), 298–321 (2014)

    Google Scholar 

Download references

Acknowledge

This work has been partially funded by the Ministry of Science and Technology of Spain ECLIPSE (RTI2018-094283-B-C33) and (RTI2018-094283-B-C31) projects, the Junta de Andalucía via the PIRAMIDE and METAMORFOSIS projects, the European Regional Development Fund (ERDF/FEDER), GEMA: Generation and Evaluation of Models for dAta Quality (Ref.: SBPLY/17/180501/000293), and the Cátedra de Telefónica “Inteligencia en la Red” of the Universidad de Sevilla.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Álvaro Valencia-Parra .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Valencia-Parra, Á., Parody, L., Varela-Vaca, Á.J., Caballero, I., Gómez-López, M.T. (2019). DMN for Data Quality Measurement and Assessment. In: Di Francescomarino, C., Dijkman, R., Zdun, U. (eds) Business Process Management Workshops. BPM 2019. Lecture Notes in Business Information Processing, vol 362. Springer, Cham. https://doi.org/10.1007/978-3-030-37453-2_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-37453-2_30

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-37452-5

  • Online ISBN: 978-3-030-37453-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics