Information Extraction and Classification from Free Text Using a Neural Approach

  • Ignazio Gallo
  • Elisabetta Binaghi
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4756)


Many approaches to Information Extraction (IE) have been proposed in literature capable of finding and extract specific facts in relatively unstructured documents. Their application in a large information space makes data ready for post-processing which is crucial to many context such as Web mining and searching tools. This paper proposes a new IE strategy, based on symbolic and neural techniques, and tests it experimentally within the price comparison service domain. In particular the strategy seeks to locate a set of atomic elements in free text which is preliminarily extracted from web documents and subsequently classify them assigning a class label representing a specific product.


Information Extraction Neural Network Text Classification 


  1. 1.
    Chang, C.H., Hsu, C.N., Lui, S.C.: Automatic information extraction from semi-structured web pages by pattern discovery. Decis. Support Syst. 35(1), 129–147 (2003)CrossRefGoogle Scholar
  2. 2.
    Muslea, I.: Extraction patterns for information extraction tasks: A survey. In: Califf, M.E. (ed.) Papers from the Sixteenth National Conference on Artificial Intelligence (AAAI-99) Workshop on Machine Learning for Information Extraction, Orlando, FL, AAAI Press (1999)Google Scholar
  3. 3.
    Chang, C.H., Kayed, M., Girgis, M.R., Shaalan, K.F.: A survey of web information extraction systems. IEEE Transactions on Knowledge and Data Engineering 18(10), 1411–1428 (2006)CrossRefGoogle Scholar
  4. 4.
    Damerau, F.J.: A technique for computer detection and correction of spelling errors. Communications of the Association for Computing Machinery 7(3), 171–176 (1964)Google Scholar
  5. 5.
    Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning internal representations by error propagation, 318–362 (1986)Google Scholar
  6. 6.
    Jackson, P., Moulinier, I.: Natural Language Processing for Online Applications: Text Retrieval, Extraction, and Categorization. Natural Language Processing, 5. John Benjamins Publishing Co. (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Ignazio Gallo
    • 1
  • Elisabetta Binaghi
    • 1
  1. 1.Department of Computer Science and Communication, Universitá degli Studi dell’Insubria, via Mazzini 5, VareseItaly

Personalised recommendations