The Similarity-Aware Relational Intersect Database Operator

  • Wadha J. Al Marri
  • Qutaibah Malluhi
  • Mourad Ouzzani
  • Mingjie Tang
  • Walid G. Aref
Conference paper

DOI: 10.1007/978-3-319-11988-5_15

Part of the Lecture Notes in Computer Science book series (LNCS, volume 8821)
Cite this paper as:
Marri W.J.A., Malluhi Q., Ouzzani M., Tang M., Aref W.G. (2014) The Similarity-Aware Relational Intersect Database Operator. In: Traina A.J.M., Traina C., Cordeiro R.L.F. (eds) Similarity Search and Applications. SISAP 2014. Lecture Notes in Computer Science, vol 8821. Springer, Cham

Abstract

Identifying similarities in large datasets is an essential operation in many applications such as bioinformatics, pattern recognition, and data integration. To make the underlying database system similarity-aware, the core relational operators have to be extended. Several similarity-aware relational operators have been proposed that introduce similarity processing at the database engine level, e.g., similarity joins and similarity group-by. This paper extends the semantics of the set intersection operator to operate over similar values. The paper describes the semantics of the similarity-based set intersection operator, and develops an efficient query processing algorithm for evaluating it. The proposed operator is implemented inside an open-source database system, namely PostgreSQL. Several queries from the TPC-H benchmark are extended to include similarity-based set intersetion predicates. Performance results demonstrate up to three orders of magnitude speedup in performance over equivalent queries that only employ regular operators.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Wadha J. Al Marri
    • 1
  • Qutaibah Malluhi
    • 1
  • Mourad Ouzzani
    • 2
  • Mingjie Tang
    • 3
  • Walid G. Aref
    • 3
  1. 1.Qatar UniversityDohaQatar
  2. 2.Qatar Computing Research InstituteDohaQatar
  3. 3.Purdue UniversityWest LafayetteUSA

Personalised recommendations