Abstract
As an emerging conceptual and computing paradigm of information processing, granular computing has received much attention recently. Many models and methods of granular computing have been proposed and studied. Among them was the granular computing model using information tables. In this paper, we shall demonstrate the application of this granular computing model for the study of a specific data mining problem - outlier detection. Within the granular computing model using information tables, this paper proposes a novel definition of outliers - GrC (granular computing)-based outliers. An algorithm to find such outliers is also given. And the effectiveness of GrC-based method for outlier detection is demonstrated on three publicly available databases.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Yao, Y.Y., Zhong, N.: Granular computing using information tables. In: Lin, T.Y., Yao, Y.Y., Zadeh, L.A. (eds.) Data Mining, Rough Sets and Granular Computing, pp. 102–124. Physica-Verlag (2002)
Zadeh, L.A.: Fuzzy sets and information granularity. In: Gupta, N., Ragade, R., Yager, R. (eds.) Advances in Fuzzy Set Theory and Applications, pp. 3–18. North- Holland, Amsterdam (1979)
Zadeh, L.A.: Some reflections on soft computing, granular computing and their roles in the conception, design and utilization of information/intelligent systems. Soft Computing 2(1), 23–25 (1998)
Skowron, A., Stepaniuk, J.: Towards discovery of information granules. In: Żytkow, J.M., Rauch, J. (eds.) PKDD 1999. LNCS (LNAI), vol. 1704, pp. 542–547. Springer, Heidelberg (1999)
Skowron, A., Stepaniuk, J.: Information Granules in Distributed Environment. In: Zhong, N., Skowron, A., Ohsuga, S. (eds.) RSFDGrC 1999. LNCS (LNAI), vol. 1711, pp. 357–366. Springer, Heidelberg (1999)
Miao, D.Q., Wang, G.Y., Liu, Q., et al.: Granular Computing Past, Present and Future Prospect. Science Press, Beijing (2007) (in Chinese)
Duan, Q.G., Miao, D.Q., Zhang, H.Y., Zheng, J.: Personalized Web Retrieval based on Rough-Fuzzy Method. Journal of Computational Information Systems 3(3), 1067–1074 (2007)
Duan, Q.G., Miao, D.Q., Wang, R.Z., Chen, M.: An Approach to Web Page Classification based on Granules. In: Proc. of 2007 IEEE/WIC/ACM Int. Conf. on Web Intelligence (WI 2007), Silicon Valley, USA, vol. 2-5, pp. 279–282 (2007)
Miao, D.Q., Chen, M., Wei, Z.H., Duan, Q.G.: A Reasonable Rough Approximation of Clustering Web Users. In: Zhong, N., Liu, J., Yao, Y., Wu, J., Lu, S., Li, K. (eds.) Web Intelligence Meets Brain Informatics. LNCS (LNAI), vol. 4845, pp. 428–442. Springer, Heidelberg (2007)
Yao, Y.Y.: A partition model of granular computing. LNCS Transactions on Rough Sets, vol. 1, pp. 232–253 (2004)
Knorr, E., Ng, R.: Algorithms for Mining Distance-based Outliers in Large Datasets. In: Proc. of the 24th VLDB Conf., New York, pp. 392–403 (1998)
Hawkins, D.: Identifications of Outliers. Chapman and Hall, London (1980)
Breunig, M.M., Kriegel, H.-P., Ng, R.T., Sander, J.: LOF: identifying density-based local outliers. In: Proc. of the 2000 ACM SIGMOD Int. Conf. on Management of Data, Dallas, pp. 93–104 (2000)
Jiang, F., Sui, Y.F., Cao, C.G.: Outlier Detection Using Rough Set Theory. In: Ślęzak, D., Yao, J., Peters, J.F., Ziarko, W., Hu, X. (eds.) RSFDGrC 2005. LNCS (LNAI), vol. 3642, pp. 79–87. Springer, Heidelberg (2005)
Johnson, S.C.: Hierarchical Clustering Schemes. Psychometrika 2, 241–254 (1967)
Nguyen, S.H., Nguyen, H.S.: Some efficient algorithms for rough set methods. In: Proc. of the 6th Int. Conf. on Information Processing and Management of Uncertainty (IPMU 1996), Granada, Spain, pp. 1451–1456 (1996)
He, Z.Y., Deng, S.C., Xu, X.F.: An Optimization Model for Outlier Detection in Categorical Data. In: Int. Conf. on Intelligent Computing (ICIC(1) 2005), Hefei, China, pp. 400–409 (2005)
He, Z.Y., Deng, S.C., Xu, X.F.: Discovering Cluster Based Local Outliers. Pattern Recognition Letters 24(9-10), 1651–1660 (2003)
Ramaswamy, S., Rastogi, R., Shim, K.: Efficient algorithms for mining outliers from large datasets. In: Proc. of the 2000 ACM SIGMOD Int. Conf. on Management of Data, Dallas, pp. 427–438 (2000)
Harkins, S., He, H.X., Willams, G.J., Baxter, R.A.: Outlier detection using replicator neural networks. In: Proc. of the 4th Int. Conf. on Data Warehousing and Knowledge Discovery, France, pp. 170–180 (2002)
Willams, G.J., Baxter, R.A., He, H.X., Harkins, S., Gu, L.F.: A Comparative Study of RNN for Outlier Detection in Data Mining. In: Proc. of the 2002 IEEE Int. Conf. on Data Mining (ICDM 2002), Japan, pp. 709–712 (2002)
Bay, S.D.: The UCI KDD repository (1999), http://kdd.ics.uci.edu
Aggarwal, C.C., Yu, P.S.: Outlier detection for high dimensional data. In: Proc. of ACM SIGMOD Int. Conf. on Managment of Data, California, pp. 37–46 (2001)
Liu, B., Hsu, W., Ma, Y.: Integrating Classification and Association Rule Mining. In: Proc.of the 4th Int. Conf. on Knowledge Discovery and Data Mining (KDD 1998), New York, pp. 80–86 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, Y., Miao, D., Wang, R. (2008). Outlier Detection Based on Granular Computing. In: Chan, CC., Grzymala-Busse, J.W., Ziarko, W.P. (eds) Rough Sets and Current Trends in Computing. RSCTC 2008. Lecture Notes in Computer Science(), vol 5306. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88425-5_29
Download citation
DOI: https://doi.org/10.1007/978-3-540-88425-5_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88423-1
Online ISBN: 978-3-540-88425-5
eBook Packages: Computer ScienceComputer Science (R0)