Information Extraction and Classification from Free Text Using a Neural Approach
Many approaches to Information Extraction (IE) have been proposed in the literature that can find and extract specific facts from relatively unstructured documents. Applied to a large information space, they make data ready for post-processing, which is crucial in many contexts such as Web mining and search tools. This paper proposes a new IE strategy, based on symbolic and neural techniques, and tests it experimentally in the price comparison service domain. In particular, the strategy seeks to locate a set of atomic elements in free text that has first been extracted from web documents, and then to classify them by assigning a class label that represents a specific product.
Keywords: Information Extraction, Neural Network, Text Classification