Abstract
In this chapter we present three anaphora resolution approaches for workflow extraction. We introduce a lexical approach and two further approaches based on a set of association rules which are created during a statistical analysis of a corpus of workflows. We implement these approaches in our generic workflow extraction framework. The workflow extraction framework allows to derive a formal representation based on workflows from textual descriptions of instructions, for instance, of aircraft repair procedures from a maintenance manual. The framework applies a pipes-and-filters architecture and uses Natural Language Processing (NLP) tools to perform information extraction steps automatically. We evaluate the anaphora resolution approaches in the cooking domain. Two different evaluation functions are used for the evaluation which compare the extraction result with a golden standard. The syntactic function is strictly limited to syntactical comparison. The semantic evaluation function can use an ontology to infer a semantic distance for the evaluation. The evaluation shows that the most advanced anaphora resolution approach performs best. In addition a comparison of the semantic and syntactic evaluation functions shows that the semantic evaluation function is better suited for the evaluation of the anaphora resolution approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
Collaborative Agile Knowledge Engine.
- 5.
- 6.
- 7.
- 8.
References
Workflow Management Coalition: Workflow management coalition glossary and terminology. http://www.wfmc.org/standars/docs/TC-1011_term_glossary_v3.pdf (1999). Accessed 23 May 2007
Aamodt, A., Plaza, E.: Case-based reasoning: Foundational issues, methodological variations, and system approaches. AI commun. 7(1), 39–59 (1994)
Minor, M., Montani, S., Recio-Garca, J.A.: Process-oriented case-based reasoning. Inf. Syst. 40, 103–105 (2014)
Bergmann, R., Gil, Y.: Similarity assessment and efficient retrieval of semantic workflows. Inf. Syst. 40, 115–127 (2014)
Minor, M., Tartakovski, A., Schmalen, D., Bergmann, R.: Agile workflow technology and case-based change reuse for long-term processes. Int. J. Intell. Inf. Technol. 4(1), 80–98 (2008)
Kendall-Morwick, J., Leake, D.: Facilitating representation and retrieval of structured cases: Principles and toolkit. Inf. Syst. 40, 106–114 (2014)
Minor, M., Bergmann, R., Görg, S.: Case-based adaptation of workflows. Inf. Syst. 40, 142–152 (2014)
Schumacher, P., Minor, M., Walter, K., Bergmann, R.: Extraction of procedural knowledge from the web. In: Proceedings of the Workshop WWW’12, Lyon, France (2012)
Dufour-Lussier, V., Le Ber, F., Lieber, J., Nauer, E.: Automatic case acquisition from texts for process-oriented case-based reasoning. Inf. Syst. 40, 153–167 (2014)
Schumacher, P., Minor, M., Schulte-Zurhausen, E.: Extracting and enriching workflows from text. In: Proceedings of the 2013 IEEE 14th International Conference on Information Reuse and Integration, pp. 285–292 (2013)
Zhu, H.: Software Design Methodology: From Principles to Architectural Styles. Butterworth-Heinemann (2005)
Langer, G.: Textkohärenz und Textspezifität. Europäische Hochschulschriften, vol. 152. Peter Lang, Ireland (1995)
AeroSpace and Defence Industries Association of Europe: ASD simplified technical English. http://www.asd-ste100.org/ (2013). Accessed 19 Sept 2013
Minor, M., Schmalen, D., Bergmann, R.: XML-based representation of agile workflows. In: Bichler, M., Hess, T., Krcmar, H., Lechner, U., Matthes, F., Picot, A., Speitkamp, B., Wolf, P. (eds.) Multikonferenz Wirtschaftsinformatik 2008, pp. 439–440. GITO-Verlag, Berlin (2008)
Riloff, E., Phillips, W.: An introduction to the sundance and autoslog systems. Technical report, Technical report UUCS-04-015, School of Computing, University of Utah, (2004)
Schank, R.: Conceptual dependence: A theory of natural language understanding. Cogn. Psychol. (3)4, 532–631 (1972)
Fillmore, C.: The case for case reopened. Syntax Semant 8(1977), 59–82 (1977)
Tognini-Bonelli, E.: Corpus Linguistics at Work. John Benjamins Publishing, Amsterdam (2001)
Higginbotham, J., Pianesi, F., Varzi, A.C.: Speaking of events. Oxford University Press, New York (2000)
Agrawal, R., Srikant, R.: Mining sequential patterns. In: Proceedings of the Eleventh International Conference on Data Engineering, pp. 3–14. IEEE Press, New York (1995)
Kowalski, G.: Information system evaluation. In: Information Retrieval Architecture and Algorithms, pp. 253–281. Springer, Berlin (2011)
Euzenat, J.: Semantic precision and recall for ontology alignment evaluation. In: Proceedings of the IJCAI, pp. 348–353 (2007)
Pedersen, T., Pakhomov, S.V., Patwardhan, S., Chute, C.G.: Measures of semantic similarity and relatedness in the biomedical domain. J. Biomed. Inform. 40(3), 288–299 (2007)
Lin, D.: An information-theoretic definition of similarity. ICML 98, 296–304 (1998)
Resnik, P.: Using information content to evaluate semantic similarity in a taxonomy. In: Proceedings of the IJCAI, vol. 1, pp. 448–453. Morgan Kaufmann, San Francisco (1995)
Sánchez, D., Batet, M., Isern, D.: Ontology-based information content computation. Knowl. Based Syst. 24(2), 297–303 (2011)
Gasperin, C., Briscoe, T.: Statistical anaphora resolution in biomedical texts. In: Proceedings of the 22nd International Conference on Computational Linguistics, vol. 1. COLING ’08, Stroudsburg, USA, Association for Computational Linguistics, pp. 257–264 (2008)
Markert, K., Modjeska, N., Nissim, M.: Using the web for nominal anaphora resolution. In: Proceedings of the EACL Workshop on the Computational Treatment of Anaphora, Budapest, pp. 39–46 (2003)
Gil, Y., Ratnakar, V., Fritz, C.: Tellme: learning procedures from tutorial instruction. In: Proceedings of the 15th International Conference on Intelligent User Interfaces, pp. 227–236 (2011)
Zhang, Z., Webster, P., Uren, V.S., Varga, A., Ciravegna, F.: Automatically extracting procedural knowledge from instructional texts using natural language processing. In: Calzolari, N., Choukri, K., Declerck, T., Dogan, M.U., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S. (eds.) LREC. European Language Resources Association (ELRA), Istanbul, pp. 520–527 (2012)
Friedrich, F., Mendling, J., Puhlmann, F.: Process model generation from natural language text. In: Rolland, C. (ed.) Advanced Information Systems Engineering, vol. 6741, pp. 482–496. Springer, Heidelberg (2011)
Acknowledgments
This work was funded by the German Research Foundation, project number BE 1373/3-1.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Schumacher, P., Minor, M., Schulte-Zurhausen, E. (2014). On the Use of Anaphora Resolution for Workflow Extraction. In: Bouabana-Tebibel, T., Rubin, S. (eds) Integration of Reusable Systems. Advances in Intelligent Systems and Computing, vol 263. Springer, Cham. https://doi.org/10.1007/978-3-319-04717-1_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-04717-1_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04716-4
Online ISBN: 978-3-319-04717-1
eBook Packages: EngineeringEngineering (R0)