A Novel Cluster Based Algorithm for Outlier Detection

Mahajan, Manish; Kumar, Santosh; Pant, Bhasker

doi:10.1007/978-981-13-1513-8_47

Manish Mahajan¹⁷,
Santosh Kumar¹⁷ &
Bhasker Pant¹⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 810))

1614 Accesses
2 Citations

Abstract

Nowadays an important issue as well as challenge in data mining is obviously is outlier detection. Outlier detection has been used in many areas such as Fraud detection, Intrusion detection, Health care, Fault detection, etc., where detection of outliers is based on the different characteristics of data or datasets. In this current age of ‘Information Technology’, large numbers of processes are obtainable in the domain of data mining to discover the outliers by successfully creating the clusters and after that detecting the outliers from these created clusters. In data mining, cluster methods are highly essential and have been applied from micro- to macro-applications. Basically clusters are a pool of similar data objects put together grounded on the attributes and district features they have. Specifically outlier detection is used to recognize and exclude inconsistency from the available data sets. In the presented work an algorithm has been suggested which is based on clustering approach to the given data sets. The proposed algorithm efficiently detects outliers inside the clusters by using clustering algorithm and weight based approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Cateni, S., Colla, V., Vannucci, M.: Outlier detection methods for industrial applications. Advances in robotics. In: Automation and Control, pp. 274–275 (2008)
Google Scholar
Ahmad, A., Dey, L.: A k-mean clustering algorithm for mixed numeric and categorical data. Data Knowl. Eng. 63, 502–527 (2007)
Article Google Scholar
Hodge, V.J., Austin, J.: A survey of outlier detection methodologies. Artif. Intell. Rev. 22(2), 85–126 (2004)
Article Google Scholar
Fawzy, A., Mokhtar, H.M.O., Hegazy, O.: Outliers detection and classification in wireless sensor networks. Egypt. Inf. J. 14, 157–164 (2013)
Article Google Scholar
Khan, F.: An initial seed selection algorithm for k-means clustering of geo-referenced data to improve replicability of cluster assignments for mapping application. Appl. Soft Comput. 12, 3698–3700 (2012)
Article Google Scholar
Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: a survey. ACM Comput. Surv. 41(3), 1–58 (2009)
Article Google Scholar
Pachgade, S.D., Dhande, S.S.: Outlier detection over data set using cluster-based and distance based approach. Int. J. Adv. Res. Comput. Sci. Soft. Eng. 2(6), 12–16 (2012)
Google Scholar
Zhu, C., Kitagawa, H., Papadimitriou, S., Faloutsos, C.: Outlier detection by example. J. Intell. Inf. Syst. 36, 217–247 (2011)
Article Google Scholar
Shi, Y., Zhang, L.: COID: a cluster–outlier iterative detection approach to multi-dimensional data analysis. Knowl. Inf. Syst. 28, 710–733 (2010)
Google Scholar
Indira Priya, P., Ghosh, D.K.: A survey on different clustering algorithms in data mining techniques. Int. J. Mod. Eng. Res. 3(1), 267–274 (2013)
Google Scholar
Gupta, M., Gao, J., Aggarwal, C.C., Han, J.: Outlier detection for temporal data. In: Proceedings of the 13th SIAM International Conference on Data Mining (SDM) (2013)
Google Scholar
Divya, T., Christopher, T.: A study of clustering based algorithm for outlier detection in data streams. Int. J. Adv. Netw. Appl. (IJANA) (2015). ISSN 0975-0282
Google Scholar
Chugh, N., Chugh, M., Agarwal, A.: Outlier detection in streaming data a research perspective. Int. J. Sci. Eng. Technol. Res. (IJSETR) 4(3) (2015)
Google Scholar
Bhosale, S.V., et al.: Outlier detection in straming data using clustering approached. Int. J. Comput. Sci. Inf. Technol. (IJCSIT) 5(5), 6050–6053 (2014)
Google Scholar
Manoharan, J.J., Hari Ganesh, S.: Improved k-means clustering algorithm using linear data structure list to enhance the efficiency. Int. J. Appl. Eng. Res. 10(20) (2015). ISSN 0973-4562
Google Scholar
Purohit, P.: A new efficient approach towards k-means clustering algorithm. Int. J. Comput. Appl. 65(11) (2013)
Google Scholar
Shunye, W.: An improved k-means clustering algorithm based on dissimilarity. In: 2013 International Conference on Mechatronic Sciences, Electric Engineering and Computer (MEC) Dec 20–22, 2013, Shenyang, China. IEEE
Google Scholar
Fahim, S.A.M., Torkey, F.A., Ramadan, M.A.: An efficient enhanced k-means clustering algorithm. J. Zhejiang Univ. Sci. A. ISSN 1009-3095, ISSN 1862-1775
Google Scholar
Wang, J., Su, X.: An improved k-means clustering algorithm. IEEE (2011)
Google Scholar
Mahmud, Md.S., Rahman, Md.M., Akhtar, Md.N.: Improvement of k-means clustering algorithm with better initial centroids based on weighted average. In: 2012 7th International Conference on Electrical and Computer Engineering, 20–22 Dec 2012, Dhaka, Bangladesh. IEEE (2012)
Google Scholar
Chauhan, P., Shukla, M.: A review on outlier detection techniques on data stream by using different approaches of KMeans algorithm. In: 2015 International Conference on Advances in Computer Engineering and Applications (ICACEA), IMS Engineering College, Ghaziabad, India. IEEE (2015)
Google Scholar

Download references

Author information

Authors and Affiliations

Graphic Era Deemed to be University, Dehradun, India
Manish Mahajan, Santosh Kumar & Bhasker Pant

Authors

Manish Mahajan
View author publications
You can also search for this author in PubMed Google Scholar
Santosh Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Bhasker Pant
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Santosh Kumar .

Editor information

Editors and Affiliations

Department of Electronics and Telecommunication Engineering, Dr. Babasaheb Ambedkar Technological University, Lonere, Raigad, Maharashtra, India
Brijesh Iyer
Department of Electronics and Telecommunication Engineering, Dr. Babasaheb Ambedkar Technological University, Lonere, Raigad, Maharashtra, India
S.L. Nalbalwar
Department of Electronics and Communication Engineering, Indian Institute of Technology Roorkee, Roorkee, Uttarakhand, India
Nagendra Prasad Pathak

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mahajan, M., Kumar, S., Pant, B. (2019). A Novel Cluster Based Algorithm for Outlier Detection. In: Iyer, B., Nalbalwar, S., Pathak, N. (eds) Computing, Communication and Signal Processing . Advances in Intelligent Systems and Computing, vol 810. Springer, Singapore. https://doi.org/10.1007/978-981-13-1513-8_47

Download citation

DOI: https://doi.org/10.1007/978-981-13-1513-8_47
Published: 13 September 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1512-1
Online ISBN: 978-981-13-1513-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics