Skip to main content

Autoencoder Neural Network for Detecting Non-human Web Traffic

  • Conference paper
  • First Online:
Artificial Intelligence and Soft Computing (ICAISC 2022)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13589))

Included in the following conference series:

Abstract

In this paper, a neural network model is presented to identify fraudulent visits to a website, which are significantly different from visits of human users. Such unusual visits are most often made by automated software, i.e. bots. Bots are used to perform advertising scams or to do scraping, i.e., automatic scanning of website content frequently not in line with the intentions of website authors. The model proposed in this paper works on data extracted directly from a web browser when a user or a bot visits a website. This data is acquired by way of using JavaScript. When bots appear on the website, collected parameter values are significantly different from the values collected during usual visits made by human website users. However, just knowing what values these parameters have is simply not enough to identify bots as they are being constantly modified and new values that have not yet been accounted for appear. Thus, it is not possible to know all the data generated by bots. Therefore, this paper proposes a neural network with an autoencoder structure that makes it possible to detect deviations in parameter values that depart from the learned data from usual users. This enables detection of anomalies, i.e., data generated by bots. The effectiveness of the presented model is demonstrated on authentic data extracted from several online stores.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Recaptcha https://www.google.com/recaptcha/about/

  2. Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT press (2016)

    Google Scholar 

  3. Zhou, C., Paffenroth, R.C.: Anomaly detection with robust deep autoencoders. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2017)

    Google Scholar 

  4. Farahnakian, F., Heikkonen, J.: A deep auto-encoder based approach for intrusion detection system. In: 2018 20th International Conference on Advanced Communication Technology (ICACT). IEEE (2018)

    Google Scholar 

  5. Nguyen, Q.P.: GEE: A Gradient-based Explainable Variational Autoencoder for Network Anomaly Detection (2019)

    Google Scholar 

  6. 2019. https://www.emarketer.com/content/digital-ad-fraud-2019

  7. Barker, S.: Future Digital Advertising, Artificial Intelligence & Advertising Fraud 2019–2023, Juniper Research (2019)

    Google Scholar 

  8. Xiong, Y., Zuo, R.: Recognition of geochemical anomalies using a deep autoencoder network. Comput. Geosci. 86, 75–82 (2016)

    Article  Google Scholar 

  9. Gabryel, M., Grzanek, K., Hayashi, Y.: Browser fingerprint coding methods increasing the effectiveness of user identification in the web traffic. J. Artificial Intelligence and Soft Computing Res. 10 (2020)

    Google Scholar 

  10. Gabryel, M., et al.: Decision making support system for managing advertisers by ad fraud detection. J. Artificial Intelligence and Soft Computing Res. 11 (2021)

    Google Scholar 

  11. Kim, T., Park, C.H.: Anomaly pattern detection in streaming data based on the transformation to multiple binary-valued data streams. J. Artificial Intelligence and Soft Computing Res. 12(1), 19–27 (2022)

    Google Scholar 

  12. Brunner, C., Kő, A., Fodor, S.: An autoencoder-enhanced stacking neural network model for increasing the performance of intrusion detection. J. Artificial Intelligence and Soft Computing Res. 12(2), 149–163 (2022)

    Article  Google Scholar 

  13. Bilski, J., Kowalczyk, B., Marjański, A., Gandor, M., Żurada, J.: A novel fast feedforward neural networks training algorithm. J. Artificial Intelligence and Soft Computing Res. 11(4), 287–306 (2021). https://doi.org/10.2478/jaiscr-2021-0017

    Article  Google Scholar 

  14. Bilski, J., Rutkowski, L., Smola̧g, J., Tao, D.: A novel method for speed training acceleration of recurrent neural networks. Information Sciences 553, 266–279 (2021). https://doi.org/10.1016/j.ins.2020.10.025

  15. Grycuk, R., Scherer, R.: Novel fast binary hash for content-based solar image retrieval. In: 2020 International Joint Conference on Neural Networks (IJCNN). IEEE (2020)

    Google Scholar 

  16. Grycuk, R., Scherer, R.: Solar image hashing by intermediate descriptor and autoencoder. In: 2021 International Joint Conference on Neural Networks (IJCNN). IEEE (2021)

    Google Scholar 

Download references

Acknowledgments

The presented results are obtained within the realization of the project “Traffic Watchdog 2.0 – verification and protection system against fraud activities in the on-line marketing (ad frauds) supported by artificial intelligence and virtual finger-print technology” financed by the National Centre for Research and Development; grant number POIR.01.01.01–00-0241/19–01.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marcin Gabryel .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gabryel, M., Lada, D., Kocić, M. (2023). Autoencoder Neural Network for Detecting Non-human Web Traffic. In: Rutkowski, L., Scherer, R., Korytkowski, M., Pedrycz, W., Tadeusiewicz, R., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2022. Lecture Notes in Computer Science(), vol 13589. Springer, Cham. https://doi.org/10.1007/978-3-031-23480-4_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-23480-4_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-23479-8

  • Online ISBN: 978-3-031-23480-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics