On the Use of Anaphora Resolution for Workflow Extraction

Schumacher, Pol; Minor, Mirjam; Schulte-Zurhausen, Erik

doi:10.1007/978-3-319-04717-1_7

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 263)


Abstract

In this chapter we present three anaphora resolution approaches for workflow extraction: a lexical approach and two further approaches based on a set of association rules that are created during a statistical analysis of a corpus of workflows. We implement these approaches in our generic workflow extraction framework. The framework derives a formal, workflow-based representation from textual descriptions of instructions, for instance, aircraft repair procedures from a maintenance manual. It applies a pipes-and-filters architecture and uses Natural Language Processing (NLP) tools to perform the information extraction steps automatically. We evaluate the anaphora resolution approaches in the cooking domain. Two evaluation functions compare the extraction result with a gold standard. The syntactic function is strictly limited to syntactic comparison. The semantic function can use an ontology to infer a semantic distance for the evaluation. The evaluation shows that the most advanced anaphora resolution approach performs best. In addition, a comparison of the two evaluation functions shows that the semantic function is better suited for evaluating the anaphora resolution approaches.
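To illustrate the difference between a syntactic and a semantic evaluation function, the following Python sketch scores an extraction result against a gold standard in two ways: once with exact string matching and once with a similarity derived from a small taxonomy. The taxonomy, the path-based similarity, the precision variant, and all names used here are illustrative assumptions; they are not taken from the chapter and do not reproduce its evaluation functions.

```python
from typing import Callable, Dict, List, Set

# Hypothetical toy taxonomy (child -> parent). Assumed for illustration only.
PARENT: Dict[str, str] = {
    "dough": "mixture",
    "batter": "mixture",
    "mixture": "ingredient",
    "flour": "ingredient",
}


def ancestors(term: str) -> List[str]:
    """Return the term followed by its ancestors up to the taxonomy root."""
    path = [term]
    while path[-1] in PARENT:
        path.append(PARENT[path[-1]])
    return path


def syntactic_sim(a: str, b: str) -> float:
    """Exact string match: 1.0 if identical, otherwise 0.0."""
    return 1.0 if a == b else 0.0


def semantic_sim(a: str, b: str) -> float:
    """Path-based similarity: 1 / (1 + number of edges between a and b)."""
    path_a, path_b = ancestors(a), ancestors(b)
    common = [t for t in path_a if t in path_b]
    if not common:
        return 0.0
    lca = common[0]  # lowest common ancestor
    distance = path_a.index(lca) + path_b.index(lca)
    return 1.0 / (1.0 + distance)


def precision(extracted: Set[str], gold: Set[str],
              sim: Callable[[str, str], float]) -> float:
    """Average, over extracted items, of the best similarity to any gold item."""
    if not extracted:
        return 0.0
    total = sum(max((sim(e, g) for g in gold), default=0.0) for e in extracted)
    return total / len(extracted)


# Assumed example: the resolver picked "batter" where the gold standard has "dough".
extracted_items = {"flour", "batter"}
gold_items = {"flour", "dough"}

syntactic_score = precision(extracted_items, gold_items, syntactic_sim)
semantic_score = precision(extracted_items, gold_items, semantic_sim)
print(syntactic_score, semantic_score)
```

In this assumed example the syntactic score gives no credit for "batter", whereas the semantic score gives partial credit because "batter" and "dough" share a close ancestor in the toy taxonomy. This is one possible reading of why a semantic function can discriminate more finely between anaphora resolution results.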

Notes

1. http://www.gate.ac.uk
2. http://www.nlp.stanford.edu/software/
3. http://www.opennlp.apache.org
4. Collaborative Agile Knowledge Engine.
5. www.allrecipes.com
6. http://www.semantic-measures-library.org
7. http://www.wikitaaable.loria.fr
8. www.dbpedia.org

Acknowledgments

This work was funded by the German Research Foundation, project number BE 1373/3-1.

Author information

Authors and Affiliations

Goethe Universität Frankfurt - Institut für Informatik - Lehrstuhl für Wirtschaftsinformatik, 60325, Frankfurt am Main, Germany
Pol Schumacher, Mirjam Minor & Erik Schulte-Zurhausen

Corresponding author

Correspondence to Pol Schumacher.

Editor information

Editors and Affiliations

Laboratoire de Communication dans les Systèmes Informatiques, Ecole Nationale Supérieure d’Informatique, Algiers, Algeria
Thouraya Bouabana-Tebibel
SPAWAR Systems Center Pacific, San Diego, USA
Stuart H. Rubin

About this chapter

Cite this chapter

Schumacher, P., Minor, M., Schulte-Zurhausen, E. (2014). On the Use of Anaphora Resolution for Workflow Extraction. In: Bouabana-Tebibel, T., Rubin, S. (eds) Integration of Reusable Systems. Advances in Intelligent Systems and Computing, vol 263. Springer, Cham. https://doi.org/10.1007/978-3-319-04717-1_7

DOI: https://doi.org/10.1007/978-3-319-04717-1_7
Published: 18 February 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04716-4
Online ISBN: 978-3-319-04717-1
eBook Packages: Engineering, Engineering (R0)
