Acquiring Textual Relations Automatically on the Web Using Genetic Programming

  • Agneta Bergström
  • Patricija Jaksetic
  • Peter Nordin
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1802)

Abstract

The flood of electronic information is pouring over us, while the technology maintaining the information and making it available to us has not yet been able to catch up. One of the paradigms within information retrieval focuses on the use of thesauruses to analyze contextual/structural information. We have explored a method that automatically finds textual relations in electronic documents using genetic programming and semantic networks. Such textual relations can be used to extend and update thesauruses as well as semantic networks. The program is written in PROLOG and communicates with software for natural language parsing. The system is also an example of computationally expensive fitness function using a large database. The results from the experiment show feasibility for this type of automatic relation extraction.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Banzhaf, W., Nordin, P., Keller, R.E., Francone, F.D.: Genetic Programming: An Introduction on the Automatic Evolution of Computer Programs and Its Applications. Morgan Kaufmann, Germany (1997)Google Scholar
  2. 2.
    Fellbaum, C.: WordNet - An Electronic Lexical Database. MIT Press, Cambridge (1998)Google Scholar
  3. 3.
    Freitas, A.A.: A genetic programming framework for two data mining tasks: Classification and generalized rule induction. In: Genetic Programming 1997: Proceedings of the Second Annual Conference, Stanford University, CA, USA, pp. 96–101 (1997)Google Scholar
  4. 4.
    Gauch, S., Smith, J.B.: An Expert System for Searching Full Text. Information Processing and Management 25(3) (1989)Google Scholar
  5. 5.
    Gazdar, G., Mellish, C.: Natural Language Processing in PROLOG: An Introduction to Computational Linguistics. Addison-Wesley Publishing Company, Wokingham (1994)Google Scholar
  6. 6.
    Hearst, M.: Automatic Acquistion of Hyponyms from Large Text Corpora. In: Proceedings of the Fourteenth International Conference on Computational Linguistics, Nantes, France (1992)Google Scholar
  7. 7.
    Nelson, M.R.: We Have the Information You Want, But Getting It Will Cost You: Being Held Hostage by Information Overload. Crossroads 4(1) (Fall 1997)Google Scholar
  8. 8.
    Marcus, R.S.: Computer and Human Understanding in Intelligent Retrieval Assistance. In: Proceedings of the 54th American Society for Information Science meeting, vol. 28 (1991)Google Scholar
  9. 9.
    Masand, B.: Optimizing Confidence of Text classification by Evolution of Symbolic Expressions. In: Advances in Genetic Programming. MIT Press, USA (1994)Google Scholar
  10. 10.
    Salton, G.O., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill, New York (1983)Google Scholar
  11. 11.
    Salton, G.: Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley Publishing, Reading (1989)Google Scholar
  12. 12.
    Smith, T.C., Witten, I.H.: A genetic algorithm for the induction of natural language grammars. In: Proceeding of IJCAI 1995 Workshop on New Approaches to Learning for Natural Language Processing, Montreal, Canada, pp. 17–24 (1995)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2000

Authors and Affiliations

  • Agneta Bergström
    • 1
  • Patricija Jaksetic
    • 2
  • Peter Nordin
    • 3
  1. 1.Interverbum AB, LocalizationStockholmSweden
  2. 2.PLAY: Applied research on art and technology, Viktoria InstituteGothenburgSweden
  3. 3.Complex SystemsChalmers University of Technology, CTH/GUGothenburgSweden

Personalised recommendations