Integrating All Stages of Information Systems Development by Means of Natural Language Processing

  • Conference paper
Requirements Engineering: Foundation for Software Quality (REFSQ 2007)

Part of the book series: Lecture Notes in Computer Science (LNPSE, volume 4542)

Abstract

In this paper, we present a methodology and architecture for integrating natural language processing into all stages of information systems (IS) development. We show that if IS textual documentation is preprocessed and integrated into the development of the business knowledge base, the whole IS modeling process can be accelerated and improved. A self-organizing map derived from the IS documentation, together with formal concept analysis, is suggested as a way to test the comprehensibility and reusability of that documentation. IBM’s Information Framework (IFW) Financial Services Data Model (FSDM) was used for the present research. Using FSDM, we demonstrate that the IS model can be partially recreated from IS textual documents by combining techniques based on self-organizing maps and formal concept analysis. Finally, a numerical experiment is provided to show that IS documents supplemented with the suggested techniques can be reused in natural language interfaces, saving the resources and time needed to develop such interfaces.
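As a rough illustration of the formal concept analysis step named in the abstract, the sketch below enumerates the formal concepts of a tiny document-by-term context. It is not the authors' implementation: the documents, the terms, and the helper names (`context`, `common_terms`, `docs_with`, `formal_concepts`) are all hypothetical, and the brute-force enumeration is only practical for toy data.

```python
from itertools import combinations

# Hypothetical toy context: documents (objects) mapped to the terms (attributes)
# they contain. These names are illustrative and do not come from FSDM.
context = {
    "doc_account": {"account", "customer", "balance"},
    "doc_loan": {"customer", "loan", "balance"},
    "doc_branch": {"branch", "customer"},
}


def common_terms(docs):
    """Return the terms shared by every document in docs.

    For an empty collection this is the full term set, as in standard FCA.
    """
    shared = set().union(*context.values())
    for doc in docs:
        shared &= context[doc]
    return shared


def docs_with(terms):
    """Return the documents that contain every term in terms."""
    return {doc for doc, doc_terms in context.items() if terms <= doc_terms}


def formal_concepts():
    """Enumerate all (extent, intent) pairs by closing every subset of documents."""
    concepts = set()
    docs = sorted(context)
    for size in range(len(docs) + 1):
        for subset in combinations(docs, size):
            intent = common_terms(subset)   # terms shared by the subset
            extent = docs_with(intent)      # all documents having those terms
            concepts.add((frozenset(extent), frozenset(intent)))
    return concepts


if __name__ == "__main__":
    # Print concepts from the most specific (fewest documents) to the most general.
    for extent, intent in sorted(formal_concepts(), key=lambda c: len(c[0])):
        print(sorted(extent), sorted(intent))
```

Each resulting pair groups a set of documents with exactly the terms they all share. Ordering these pairs by inclusion of their document sets gives the concept lattice, which is the kind of structure one could compare against an existing data model.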

Editor information

Pete Sawyer Barbara Paech Patrick Heymans

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Laukaitis, A., Vasilecas, O. (2007). Integrating All Stages of Information Systems Development by Means of Natural Language Processing. In: Sawyer, P., Paech, B., Heymans, P. (eds) Requirements Engineering: Foundation for Software Quality. REFSQ 2007. Lecture Notes in Computer Science, vol 4542. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73031-6_16

  • DOI: https://doi.org/10.1007/978-3-540-73031-6_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-73030-9

  • Online ISBN: 978-3-540-73031-6

  • eBook Packages: Computer Science, Computer Science (R0)
