CASM: Coherent Automated Schema Matcher

  • Rudraneel Chakraborty
  • Faiyaz Ahmed
  • Shazzad Hosain
Conference paper
Part of the Lecture Notes in Electrical Engineering book series (LNEE, volume 156)

Abstract

Schema matching has been one of the basic tasks in almost every data intensive distributed applications such as enterprize information integration, collaborating web services, web catalogue integration, and schema based point to point database systems and so on. Typical schema matchers perform manually and use a set of matching algorithms with a composition function by using them in an arbitrary manner which results in wasteful computations and needs manual specification for different domains. Recently, there has been some schema matching strategy proposed with partial or full automation. Such a schema matching strategy is OntoMatch. In this paper, we propose an element level automated linguistic based schema matching strategy motivated by the concept of OntoMatch, with more powerful matching algorithms and definite property construction for matcher selection that produces better output. Experimental result is also provided to support the claim of the improvement.

Keywords

Matching Algorithm Schema Match Term Matcher String Edit Distance Matcher Selection 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
  2. 2.
  3. 3.
  4. 4.
    Jaro, M.A.: Probabilistic linkage of large public health data files. Stat Med. 14(5-7), 491–498 (1999)CrossRefGoogle Scholar
  5. 5.
    Bhattacharjee, A., Jamil, H.M.: OntoMatch: A monotonically improving schema matching system for autonomous data integration. In: IEEE IRI, Las Vegas, NV (2009)Google Scholar
  6. 6.
    Do, H.H., Rahm, E.: COMA - A system for flexible combination of schema matching approaches. In: VLDB, Hong Kong, China, pp. 610–621 (2002)Google Scholar
  7. 7.
    Hosain, S., Jmail, H.: An algebraic language for semantic data integration on the hidden web. In: Proceedings of the 3rd IEEE ICSC 2009, Berkeley, California, pp. 237–244 (2009)Google Scholar
  8. 8.
    Madhavan, J., Bernstein, P.A., Rahm, E.: Generic schema matching with Cupid. In: VLDB, Italy, pp. 49–58 (2001)Google Scholar
  9. 9.
    Monge, A., Elkan, C.: The field matching problem: Algorithms and applications. In: 2nd International Conference on KDDM, pp. 267–270 (1996)Google Scholar
  10. 10.
    Ristad, E.S., Yianilos, P.N.: Learning string edit distance. IEEE Transactions on Pattern Recognition and Machine Intelligence 20(5), 522–532 (1998)CrossRefGoogle Scholar
  11. 11.
    Winkler, W.E.: The state of record linkage and current research problems. Tech. rep., Statistical Research Division, U.S. Census Bureau (1999)Google Scholar
  12. 12.
    Zobel, J.: Phonetic string matching: Lessons from information retrieval. In: SIGIR 1996, pp. 166–172 (1996)Google Scholar

Copyright information

© Springer-Verlag GmbH Berlin Heidelberg 2013

Authors and Affiliations

  • Rudraneel Chakraborty
    • 1
  • Faiyaz Ahmed
    • 1
  • Shazzad Hosain
    • 1
  1. 1.Department of EECSNorth South UniversityDhakaBangladesh

Personalised recommendations