Abstract
Clustering has been widely applied to various domains to explore the useful patterns inside data. Clustering quality can be improved using Knowledge represented by ontology. Nevertheless, most traditional ontology-based clustering algorithms are limited to handle categorical instances. But in real case study, ontology contains both numerical and categorical attributes. In this paper, we propose a new method for clustering knowledge contained in the ontology based on mixed features. The main contribution is the proposition of new similarity measures that combine numerical and nominal variables along different dimensions (instances, attributes, and relation-ships). Three kinds of similarity measures are so defined: instances-based similarity IS, relations-based similarity RS and attributes-based similarity AS. These three measures are then combined into an overall similarity measure. This combined measure is used for clustering.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Gruber, T.: Ontology. In: Encyclopedia of Database, Berlin, Heidelberg, pp. 1963–1965 (2009), http://tomgruber.org/writing/ontology-definition-2007.htm
Maedche, A., Zacharias, V.: Clustering Ontology-based Metadata in the Semantic Web. In: Elomaa, T., Mannila, H., Toivonen, H. (eds.) PKDD 2002. LNCS (LNAI), vol. 2431, p. 348. Springer, Heidelberg (2002)
Kim, J.-M., Park, Y.-T.: Study of Ontology-based Incremental Clustering Technique in Semantic Web. Proceedings of the International Conference on Next Generation Web Services Practices (NWeSP 2005). IEEE (2005)
Esposito, F., Fanizzi, N., d’Amato, C.: Conceptual Clustering Applied to Ontologies. In: Raś, Z.W., Tsumoto, S., Zighed, D.A. (eds.) MCD 2007. LNCS (LNAI), vol. 4944, pp. 42–56. Springer, Heidelberg (2008)
Mikroyannidi, E., Iannone, L., Stevens, R., Rector, A.: Inspecting regularities in ontology design using clustering. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N., Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 438–453. Springer, Heidelberg (2011)
de Amorim, R.C., Mirkin Minkowski, B.: metric, feature weighting and anomalous cluster initializing in K-Means clustering. Pattern Recognition 45(3), 1061–1075 (2012)
Jagannatha Reddy, M.V., Kavitha, B.: Clustering the Mixed Numerical and Categorical Dataset using Similarity Weight and Filter Method. International Journal of Database Theory and Application 5(1) (March 2012)
Shih, M.-Y., Jheng, J.-W., Lai, L.-F.: A Two-Step Method for Clustering Mixed Categorical and Numeric Data. Tamkang Journal of Science and Engineering 13(1), 11–19 (2010)
He, Z., Xu, X., Deng, S.: Scalable algorithms for clustering mixed type attributes in large datasets. International Journal of Intelligent Systems 20(10), 1077–1089 (2005)
Ralambondrainy, H.: A conceptual version of the k-means algorithm. Pattern Recognition Letters 16(11), 1147–1157 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Yangui, R., Nabli, A., Gargouri, F. (2014). SOIM: Similarity Measures on Ontology Instances Based on Mixed Features. In: Ait Ameur, Y., Bellatreche, L., Papadopoulos, G.A. (eds) Model and Data Engineering. MEDI 2014. Lecture Notes in Computer Science, vol 8748. Springer, Cham. https://doi.org/10.1007/978-3-319-11587-0_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-11587-0_17
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11586-3
Online ISBN: 978-3-319-11587-0
eBook Packages: Computer ScienceComputer Science (R0)