Abstract
Extracting meaningful information from regulatory documents such as the General Data Protection Regulation (GDPR) is of utmost importance for almost any company. Existing approaches pose strict assumptions on the documents and output models containing inconsistencies or redundancies since relations within and across documents are neglected. To overcome these shortcomings, this work aims at deriving mixed graphs based on paragraph embedding as well as process discovery and combining these graphs using constraint relations such as “redundant” or “conflicting” detected by the ConRelMiner method. The approach is implemented and evaluated based on two real-world use cases: Austria’s energy use cases plus the contained process models as ground truth and the GDPR. Mixed graphs and their combinations constitute the next step towards an end-to-end solution for extracting process models from text, either from scratch or amending existing ones.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Smart metering use-cases für das advanced meter communication system (AMCS), version 1.0. Technical report 1/88, Österreichs Energie (2015)
Van der Aa, H., Carmona Vargas, J., Leopold, H., Mendling, J., Padró, L.: Challenges and opportunities of applying natural language processing in business process management. In: Computational Linguistics, pp. 2791–2801 (2018)
van der Aa, H., Leopold, H., Reijers, H.A.: Comparing textual descriptions to process models-the automatic detection of inconsistencies. Inf. Syst. 64, 447–460 (2017)
van der Aa, H., Leopold, H., Reijers, H.A.: Checking process compliance against natural language specifications using behavioral spaces. Inf. Syst. 78, 83–95 (2018)
Allen, F.E.: Control flow analysis. In: ACM SIGPLAN Notices, vol. 5, pp. 1–19 (1970)
de AR Goncalves, J.C., Santoro, F.M., Baiao, F.A.: Business process mining from group stories. In: International Conference on Computer Supported Cooperative Work in Design, pp. 161–166 (2009)
Bajwa, I.S., Lee, M.G., Bordbar, B.: SBVR business rules generation from natural language specification. In: AAAI Spring Symposium, pp. 2–8 (2011)
Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python: Analyzing Text with the Natural Language Toolkit. O’Reilly Media, Inc., Massachusetts (2009)
Deeptimahanti, D.K., Babar, M.A.: An automated tool for generating UML models from natural language requirements. In: Automated Software Engineering, pp. 680–682 (2009)
Dragoni, M., Villata, S., Rizzi, W., Governatori, G.: Combining NLP approaches for rule extraction from legal documents. In: MIning and REasoning with Legal Texts (2016)
Friedrich, F., Mendling, J., Puhlmann, F.: Process model generation from natural language text. In: Advanced Information Systems Engineering, pp. 482–496 (2011)
Ghose, A., Koliadis, G., Chueng, A.: Process discovery from model and text artefacts. In: Services, pp. 167–174 (2007)
Group, I.E.W., et al.: ICH harmonized tripartite guideline, quality risk management q9. In: Technical Requirements for Registration of Pharmaceuticals for Human Use (2005)
Hansen, P., Kuplinsky, J., de Werra, D.: Mixed graph colorings. Math. Methods Oper. Res. 45(1), 145–160 (1997)
Kabicher, S., Rinderle-Ma, S.: Human-centered process engineering based on content analysis and process view aggregation. In: Advanced Information Systems Engineering, pp. 467–481 (2011)
Ly, L.T., Maggi, F.M., Montali, M., Rinderle-Ma, S., van der Aalst, W.M.P.: Compliance monitoring in business processes: functionalities, application, and tool-support. Inf. Syst. 54, 209–234 (2015)
More, P., Phalnikar, R.: Generating UML diagrams from natural language specifications. Appl. Inf. Syst. 1(8), 19–23 (2012)
Ren, P., Chen, Z., Ren, Z., Wei, F., Ma, J., de Rijke, M.: Leveraging contextual sentence relations for extractive summarization using a neural attention model. In: Research and Development in Information Retrieval, pp. 95–104 (2017)
Riefer, M., Ternis, S.F., Thaler, T.: Mining process models from natural language text: a state-of-the-art analysis. Multikonferenz Wirtschaftsinformatik, pp. 9–11 (2016)
Saha, T.K., Joty, S., Hassan, N., Hasan, M.A.: Regularized and retrofitted models for learning sentence representation with context. In: Information and Knowledge Management, pp. 547–556 (2017)
Selway, M., Grossmann, G., Mayer, W., Stumptner, M.: Formalising natural language specifications using a cognitive linguistic/configuration based approach. Inf. Syst. 54, 191–208 (2015)
Sinha, A., Paradkar, A.: Use cases to process specifications in business process modeling notation. In: Web Services, pp. 473–480 (2010)
Wang, H.J., Zhao, J.L., Zhang, L.J.: Policy-driven process mapping (PDPM): discovering process models from business policies. DSS 48(1), 267–281 (2009)
Winter, K., Rinderle-Ma, S.: Detecting constraints and their relations from regulatory documents using NLP techniques. In: On the Move to Meaningful Internet Systems, pp. 261–278 (2018)
Winter, K., Rinderle-Ma, S.: Untangling the GDPR using ConRelMiner. arXiv:1811.03399 (2018)
Winter, K., Rinderle-Ma, S., Grossmann, W., Feinerer, I., Ma, Z.: Characterizing regulatory documents and guidelines based on text mining. In: On the Move to Meaningful Internet Systems, pp. 3–20 (2017)
Acknowledgment
This work has been funded by the Vienna Science and Technology Fund (WWTF) through project ICT15-072.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Winter, K., Rinderle-Ma, S. (2019). Deriving and Combining Mixed Graphs from Regulatory Documents Based on Constraint Relations. In: Giorgini, P., Weber, B. (eds) Advanced Information Systems Engineering. CAiSE 2019. Lecture Notes in Computer Science(), vol 11483. Springer, Cham. https://doi.org/10.1007/978-3-030-21290-2_27
Download citation
DOI: https://doi.org/10.1007/978-3-030-21290-2_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-21289-6
Online ISBN: 978-3-030-21290-2
eBook Packages: Computer ScienceComputer Science (R0)