Skip to main content

High-Dimensional Data Anomaly Detection Framework Based on Feature Extraction of Elastic Network

  • Conference paper
  • First Online:
Machine Learning and Intelligent Communications (MLICOM 2019)

Abstract

Although appropriate feature extraction can improve the performance of anomaly detection, it is a challenging task due to the complex interaction between features, the mixture of irrelevant features and relevant features, and the unavailability of data tags. When conventional anomaly detection methods deal with the problem of anomaly detection of high dimensional data, the performance of anomaly detection will be degraded due to the existence of irrelevant features. This paper proposed a method of feature extraction and anomaly detection for high dimensional data based on elastic network, which can filter irrelevant features and improve the accuracy and efficiency of anomaly detection. In this paper, an outlier scoring method was used to score the outliers of the original data, and then outliers and the original data were input into the elastic network for sparse regression. After feature extraction of elastic network, those irrelevant features to abnormal data are ignored, thus reducing the dimension of data. Finally, high-dimensional data are detected efficiently according to extracted features. In the experimental stage, we used the high-dimensional anomaly dataset provided by ODDS to detect the performance of the proposed method based on AUC detection accuracy, ROC curve, feature number, convergence speed and other indicators. The results show that the proposed method not only can effectively extract the features related to high-dimensional anomaly data, but also the detection accuracy of outliers has been greatly improved.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hawkins, D.M.: Identification of Outliers, 1st edn, pp. 11–12. Springer, Berlin (1980). https://doi.org/10.1007/978-94-015-3994-4

    Book  MATH  Google Scholar 

  2. Li, L., Hao, Z., Peng, H., et al.: Nearest neighbors based density peaks approach to intrusion detection. Chaos Solitons Fractals 110, 33–40 (2018)

    Article  MathSciNet  Google Scholar 

  3. Wang, C., Dong, H.: Credit card fraud forecasting model based on clustering analysis and integrated support vector machine. Cluster Comput. 16, 1–6 (2018)

    Article  Google Scholar 

  4. Gao, X., Fang, Y.: Penalized weighted least squares for outlier detection and robust regression (2016)

    Google Scholar 

  5. Chen, T., Martin, E., Montague, G.: Robust probabilistic PCA with missing data and contribution analysis for outlier detection. Comput. Stat. Data Anal. 53(10), 3706–3716 (2009)

    Article  MathSciNet  Google Scholar 

  6. Aggarwal, C.C.: Linear models for outlier detection. In: Aggarwal, C.C. (ed.) Outlier Analysis, pp. 75–99. Springer, New York (2013). https://doi.org/10.1007/978-1-4614-6396-2_3

    Chapter  MATH  Google Scholar 

  7. Dalatu, P.I., Fitrianto, A., Mustapha, A.: A comparative study of linear and nonlinear regression models for outlier detection (2016)

    MATH  Google Scholar 

  8. Pérez, B., Molina, I., Peña, D.: Outlier detection and robust estimation in linear regression models with fixed group effects. J. Stat. Comput. Simul. 84(12), 2652–2669 (2014)

    Article  MathSciNet  Google Scholar 

  9. Xu, H., Mao, R., Liao, H., et al.: Closest neighbors excluded outlier detection. In: Online Analysis & Computing Science, pp. 105–110. IEEE (2016)

    Google Scholar 

  10. Liu, J., Wang, G.: Outlier detection based on local minima density. In: IEEE Information Technology, Networking, Electronic & Automation Control Conference, pp. 718–723. IEEE (2016)

    Google Scholar 

  11. Vwema, P., Yadava, R.D.S.: Fuzzy c-means clustering based outlier detection for SAW electronic nose. In: Convergence in Technology, pp. 513–519. IEEE (2017)

    Google Scholar 

  12. Bolón-Canedo, V., Sánchez-Maro, N., Alonso-Betanzos, A.: Recent advances and emerging challenges of feature selection in the context of big data. Knowl.-Based Syst. 86, 33–45 (2015)

    Article  Google Scholar 

  13. Du, L., Shen, Y.D.: Unsupervised feature selection with adaptive structure learning (2015)

    Google Scholar 

  14. Yu, K., Wu, X., Ding, W., et al.: Scalable and accurate online feature selection for big data. ACM Trans. Knowl. Discovery Data 11, 16 (2015)

    Google Scholar 

  15. Zhang, C., Wang, G., Zhou, Y., et al.: Feature selection for high dimensional imbalanced class data based on F-measure optimization. In: International Conference on Security. IEEE (2018)

    Google Scholar 

  16. Feng, S., Sen, P.N.: Percolation on elastic networks: new exponent and threshold. Phys. Rev. Lett. 52(3), 216–219 (1984)

    Article  Google Scholar 

  17. Liu, F., Ting, K.M., Zhou, Z.H.: Isolation-based anomaly detection. ACM Trans. Knowl. Discovery Data 6(1), 1556–4681 (2012)

    Google Scholar 

  18. Yang, Z., Zheng, Y., Gao, Y., et al.: Abnormal data detection for an e-business using object-oriented approach. In: Integration and Innovation Orient to E-Society, vol. 1 (2007)

    Google Scholar 

  19. Zimek, A., Gaudet, M., Campello, R.J.G.B., et al.: Subsampling for efficient and effective unsupervised outlier detection ensembles. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM (2013)

    Google Scholar 

Download references

Acknowledgement

The research was supported by the State Grid Liaoning Electric Power Supply CO., LTD, and we are grateful for the financial support for the “Key Technology and Application Research of the Self-Service Grid Big Data Governance (SGLNXT00YJJS1800110)”.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to KeXin Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Shen, Y., Bo, J., Li, K., Chen, S., Qiao, L., Li, J. (2019). High-Dimensional Data Anomaly Detection Framework Based on Feature Extraction of Elastic Network. In: Zhai, X., Chen, B., Zhu, K. (eds) Machine Learning and Intelligent Communications. MLICOM 2019. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 294. Springer, Cham. https://doi.org/10.1007/978-3-030-32388-2_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-32388-2_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-32387-5

  • Online ISBN: 978-3-030-32388-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics