Abstract
Recent advances in information and communication technology make huge amount of heterogeneous information available for us. But integration of information semantically and provide machine understandable meaning to information is still a great challenge in current web technology. In overcoming the challenges, ontology matching plays a vital role, which is introduced by semantic web technology. In this paper, we propose a new method of ontology matching using parallelization and distribution technique. To apply parallelism, we develop a partitioning algorithm by using property-by-class and subclass of relationship, which partitions the ontology into smaller clusters. Then the clusters from different ontologies are matched based on terminological and structural similarity with semantic verification. All these tasks of matching are handled in a parallel way and all the tasks are distributed over the computational resources. Thus, we significantly reduce the time complexity and space complexity of large scale matching task. Our proposed method reduces misaligned pairs while increasing correct aligned concepts. Validity of our claims have been substantiated through different experiments on small and large ontologies.
Similar content being viewed by others
References
Algergawy A, Massmann S, Rahm E (2011) A clustering-based approach for large-scale ontology matching. In: Eder J, Bieliková M, Tjoa AM (eds) East European Conference on Advances in Databases and Information Systems, Springer, Vienna, Austria, pp 415–428
Antoniou G, Groth P, van Harmelen F, Hoekstra R (2012) A semantic web primer, 3rd edn. MIT Press, Cambridge
Aumueller D, Do HH, Massmann S, Rahm E (2005) Schema and ontology matching with COMA++. In: Proceedings of the 2005 ACM SIGMOD international conference on Management of data, ACM, pp 906–908
Barney B (2016) Introduction to parallel computing. https://computing.llnl.gov/tutorials/parallel_comp/
Carver RH, Tai KC (2005) Modern multithreading: implementing, testing, and debugging multithreaded Java and C++/Pthreads/Win32 programs. Wiley, New York
Cormen TH, Leiserson CE, Rivest RL, Stein C (2009) Introduction to algorithms, 3rd edn. MIT press, Cambridge
Erdélyi M, Abonyi J (2006) Node similarity-based graph clustering and visualization. In: 7th International symposium of hungarian researchers on computational intelligence
Euzenat J, Shvaiko P (2013) Ontology matching. Springer, Berlin
Grama A, Gupta A, Karypis G, Kumar V (2003) Introduction to parallel computing. Pearson, Upper Saddle River
Gross A, Hartung M, Kirsten T, Rahm E (2010) On matching large life science ontologies in parallel. In: Lambrix P, Kemp GJL (eds) International Conference on Data Integration in the Life Sciences, Springer, Gothenburg, Sweden, pp 35–49
Gustafson JL (1988) Reevaluating amdahl’s law. Communications of the ACM 31(5):532–533
Hu W, Qu Y, Cheng G (2008) Matching large ontologies: a divide-and-conquer approach. Data Knowl Eng 67(1):140–160
Jaccard P (1901) Étude comparative de la distribution florale dans une portion des Alpes et des Jura. Bull Société Vaud Sci Nat 37:547–579 (in French)
Jean-Mary YR, Shironoshita EP, Kabuka MR (2009) Ontology matching with semantic verification. Web Semant Sci Serv Agents World Wide Web 7(3):235–251
Lambrix P, Kaliyaperumal R (2013) A session-based approach for aligning large ontologies. In: Cimiano P, Corcho Ó, Presutti V, Hollink L, Rudolph S (eds) Extended Semantic Web Conference, Springer, Montpellier, France, pp 46–60
Lambrix P, Tan H (2006) Sambo—a system for aligning and merging biomedical ontologies. Web Semant Sci Serv Agents World Wide Web 4(3):196–206
Li J, Tang J, Li Y, Luo Q (2009) Rimom: a dynamic multistrategy ontology alignment framework. IEEE Trans Knowl Data Eng 21(8):1218–1232
Madhavan J, Bernstein PA, Domingos P, Halevy AY (2002) Representing and reasoning about mappings between domain models. In: AAAI-02 proceedings, pp 80–86
Maedche A (2002) Ontology learning for the semantic web. Springer, Berlin
Miller GA (1995) Wordnet: a lexical database for english. Commun ACM 38(11):39–41
Mittra T, Ali MM (2014) Ontology matching by applying parallelization and distribution of matching task within clustering environment. In: IEEE 2014 international conference on electrical and computer engineering (ICECE), pp 445–448
Nagy M, Vargas-Vera M, Motta E (2007) DSSIM: managing uncertainty on the semantic web. In: Proceedings of the 2nd international conference on ontology matching, vol 304, CEUR-WS. org, pp 160–169
Otero-Cerdeira L, Rodríguez-Martínez FJ, Gómez-Rodríguez A (2015) Ontology matching: a literature review. Expert Syst Appl 42(2):949–971
Rahm E (2011) Towards large-scale schema and ontology matching. In: Bellahsene Z, Bonifati A, Rahm E (eds) Schema matching and mapping, Springer, Berlin, Heidelberg, pp 3–27
Seddiqui MH, Aono M (2009) An efficient and scalable algorithm for segmented alignment of ontologies of arbitrary size. Web Semant Sci Serv Agents World Wide Web 7(4):344–356
Shvaiko P, Euzenat J (2013) Ontology matching: state of the art and future challenges. IEEE Trans Knowl Data Eng 25(1):158–176
Solomonik E, Carson E, Knight N, Demmel J (2014) Tradeoffs between synchronization, communication, and work in parallel linear algebra computations. Technical Reports. UCB/EECS-2014-8, EECS Department, University of California, Berkeley. http://www.eecs.berkeley.edu/Pubs/TechRpts/2014/EECS-2014-8.html
Staab S, Studer R (eds) (2004) Handbook on ontologies. International handbooks on information systems. Springer, Berlin
Tenschert A, Assel M, Cheptsov A, Gallizo G (2009) Parallelization and distribution techniques for ontology matching in urban computing environments. In: Proceedings of the fourth international workshop on ontology matching
Valiant LG (1990) A bridging model for parallel computation. Commun ACM 33(8):103–111
Yu L (2011) A developer’s guide to the semantic web. Springer, Berlin
Acknowledgements
This work is a postgraduate research project of Bangladesh University of Engineering and Technology. We thank Wei Hu, Yuzhong Qu and Gong Cheng for providing Russia12 and TourismAB dataset. We avail this opportunity to thank our anonymous reviewers whose comments contributed in improving the quality of the paper.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Mittra, T., Ali, M.M. Parallelized and distributed task based ontology matching in clustering environment with semantic verification. CSIT 5, 265–279 (2017). https://doi.org/10.1007/s40012-017-0165-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40012-017-0165-9