Abstract
Outlier detection is a preprocessing method that is useful in detecting unusual data if they are of interest to the users, or reducing irrelevant instances in machine learning if outliers are interpreted as noise. In the past decades, there are many outlier detection algorithms that have been developed and reported in the literature. Many have been implemented as software programs across a spectrum of various applications ranging from identifying special customers in marketing to fraud detection in security context. In chapter, a novel outlier detection method named lightweight analysis is proposed. Instead of using a full dataset to compute outlier values like the traditional outlier detection algorithms, the proposed concept of lightweight analysis works by examining only a certain “number of instances as a reference in order to calculate the outlier indicator, like cumulative analysis or lightweight analysis with sliding window, other than global analysis only.” These three mechanisms are then combined with the existing outlier measurements such as “interquartile, local outlier factor and Mahalanobis distance range.” In this study, the computer simulation experiments show encouraging results that support the fact that good classification accuracy is possible using reduced datasets. The outlier detection results are on par or more accurate than the existing state of the arts by using the “proposed classifier-based outlier detection (COD) method coupled with lightweight analysis.”
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Fong, S., Li, T., Han, D., Mohammed, S. (2021). Lightweight Classifier-Based Outlier Detection Algorithms from Multivariate Data Stream. In: Fong, S., Millham, R. (eds) Bio-inspired Algorithms for Data Streaming and Visualization, Big Data Management, and Fog Computing. Springer Tracts in Nature-Inspired Computing. Springer, Singapore. https://doi.org/10.1007/978-981-15-6695-0_6
Download citation
DOI: https://doi.org/10.1007/978-981-15-6695-0_6
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-6694-3
Online ISBN: 978-981-15-6695-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)