Skip to main content
  • Conference proceedings
  • © 2003

Information Extraction in the Web Era

Natural Language Communication for Knowledge Acquisition and Intelligent Information Agents

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2700)

Part of the book sub series: Lecture Notes in Artificial Intelligence (LNAI)

Conference series link(s): SCIE: International Summer School on Information Extraction

Conference proceedings info: SCIE 2002.

Buying options

eBook USD 34.99
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 49.95
Price excludes VAT (Canada)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

This is a preview of subscription content, access via your institution.

Table of contents (7 papers)

  1. Front Matter

  2. Information Extraction in the Web Era

    1. Acquisition of Domain Knowledge

      • Roman Yangarber
      Pages 1-28
    2. Terminology Mining

      • Béatrice Daille
      Pages 29-44
    3. Measuring Term Representativeness

      • Toru Hisamitsu, Jun-ichi Tsujii
      Pages 45-76
    4. Agents Based Ontological Mediation in IE Systems

      • Maria Teresa Pazienza, Michele Vindigni
      Pages 92-128
  3. Back Matter

Other Volumes

  1. Information Extraction in the Web Era

About this book

The number of research topics covered in recent approaches to Information - traction (IE) is continually growing as new facts are being considered. In fact, while the user’s interest in extracting information from texts deals mainly with the success of the entire process of locating, in document collections, facts of interest, the process itself is dependent on several constraints (e.g. the domain, the collection dimension and location, and the document type) and currently it tackles composite scenarios, including free texts, semi- and structured texts such as Web pages, e-mails, etc. The handling of all these factors is tightly related to the continued evolution of the underlying technologies. In the last few years, in real-world applications we have seen the need for scalable, adaptable IE systems (see M.T.Pazienza, “InformationExtraction: Towards Scalable Adaptable Systems”, LNAI 1714) to limit the need for human intervention in the customization process and portability of the IE application to new domains. Scalability and adaptability requirements are still valid impacting features and get more relevance into a Web scenario, where in intelligent information agents are expected to automatically gather information from heterogeneous sources.

Keywords

  • DOM
  • Information Retrieval
  • Web data mining
  • content summarization
  • data analysis
  • data mining
  • digital libraries
  • information extraction
  • knowledge discovery
  • knowledge extraction
  • natural language processing
  • semi-structured data
  • text mining

Editors and Affiliations

  • DISP, University of Tor Vergata, Rome, Italy

    Maria Teresa Pazienza

Bibliographic Information

Buying options

eBook USD 34.99
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 49.95
Price excludes VAT (Canada)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions