Automatically Mapping and Integrating Multiple Data Entry Forms into a Database

  • Yuan An
  • Ritu Khare
  • Il-Yeol Song
  • Xiaohua Hu
Conference paper

DOI: 10.1007/978-3-642-24606-7_20

Part of the Lecture Notes in Computer Science book series (LNCS, volume 6998)
Cite this paper as:
An Y., Khare R., Song IY., Hu X. (2011) Automatically Mapping and Integrating Multiple Data Entry Forms into a Database. In: Jeusfeld M., Delcambre L., Ling TW. (eds) Conceptual Modeling – ER 2011. ER 2011. Lecture Notes in Computer Science, vol 6998. Springer, Berlin, Heidelberg

Abstract

Forms are a standard way of gathering data into a database. Many applications need to support multiple users with evolving data-gathering requirements. It is desirable to automatically link dynamic forms to the back-end database. We have developed the FormMapper system, a fully automatic solution that accepts user-created data entry forms, and maps and integrates them into an existing database in the same domain. The solution comprises two components: tree extraction and form integration. The tree extraction component leverages a probabilistic model, the Hidden Markov Model (HMM), to automatically extract the semantic tree structure of a form. In the form integration component, we develop a merging procedure that maps and integrates a tree into an existing database and extends the database with desired properties. We conducted experiments evaluating the performance of the system on several large databases designed from a number of complex forms. Our experimental results show that the FormMapper system is promising: it generated databases that are highly similar (87% overlap) to those generated by human experts, given the same set of forms.

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Yuan An (1)
  • Ritu Khare (1)
  • Il-Yeol Song (1)
  • Xiaohua Hu (1)

  1. College of Information Science and Technology, Drexel University, USA
