Abstract
In this paper, we present a methodology and architecture for integrating natural language processing into all stages of information systems (IS) development. We show that if IS textual documentation is preprocessed and integrated into the development of the business knowledge base, the whole IS modeling process can be sped up and improved. We suggest using a self-organizing map derived from the IS documentation, together with formal concept analysis, to test the comprehensibility and reusability of that documentation. IBM’s Information Framework (IFW) Financial Services Data Model (FSDM) was used for this research. Using FSDM, we demonstrate that the IS model can be partially recreated from IS textual documents by combining techniques based on self-organizing maps and formal concept analysis. Finally, a numerical experiment shows that IS documents supplemented with the suggested techniques can be reused in natural language interfaces, saving the resources and time needed to develop such interfaces.
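As an illustration of the formal concept analysis step named in the abstract, the following minimal Python sketch derives formal concepts from a small document-term context. It is not the authors' implementation: the three documents, their terms, and the function names (`common_terms`, `docs_with`, `formal_concepts`) are hypothetical, and the self-organizing map stage is left out entirely. The brute-force enumeration is exponential in the number of documents and is only suitable for toy inputs.

```python
from itertools import combinations

# Hypothetical toy context: each document (object) maps to the set of
# terms (attributes) found in it. Not taken from FSDM.
CONTEXT = {
    "doc_account": {"account", "customer", "balance"},
    "doc_loan": {"loan", "customer", "interest"},
    "doc_deposit": {"account", "balance", "interest"},
}


def common_terms(docs, context):
    """Return the terms shared by every document in `docs`.

    For an empty `docs` this is the full term set, by convention.
    """
    shared = set().union(*context.values())
    for doc in docs:
        shared &= context[doc]
    return shared


def docs_with(terms, context):
    """Return the documents that contain every term in `terms`."""
    return {doc for doc, doc_terms in context.items() if terms <= doc_terms}


def formal_concepts(context):
    """Enumerate all formal concepts as (extent, intent) pairs.

    Each subset of documents is closed: its shared terms form the intent,
    and all documents carrying that intent form the extent.
    """
    concepts = set()
    names = sorted(context)
    for size in range(len(names) + 1):
        for subset in combinations(names, size):
            intent = common_terms(subset, context)
            extent = docs_with(intent, context)
            concepts.add((frozenset(extent), frozenset(intent)))
    return concepts


if __name__ == "__main__":
    for extent, intent in sorted(
        formal_concepts(CONTEXT), key=lambda c: (len(c[0]), sorted(c[0]))
    ):
        print(sorted(extent), "share", sorted(intent))
```

In this toy setting, each concept pairs a group of documents with the vocabulary they have in common, which is the kind of structure a lattice tool would then order by inclusion.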
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Laukaitis, A., Vasilecas, O. (2007). Integrating All Stages of Information Systems Development by Means of Natural Language Processing. In: Sawyer, P., Paech, B., Heymans, P. (eds) Requirements Engineering: Foundation for Software Quality. REFSQ 2007. Lecture Notes in Computer Science, vol 4542. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73031-6_16
DOI: https://doi.org/10.1007/978-3-540-73031-6_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73030-9
Online ISBN: 978-3-540-73031-6
eBook Packages: Computer Science, Computer Science (R0)