Malicious Domain Name Detection Based on Extreme Machine Learning

Shi, Yong; Chen, Gong; Li, Juntao

doi:10.1007/s11063-017-9666-7

Malicious Domain Name Detection Based on Extreme Machine Learning

Published: 03 July 2017

Volume 48, pages 1347–1357, (2018)
Cite this article

Neural Processing Letters Aims and scope Submit manuscript

2030 Accesses
61 Citations
9 Altmetric
Explore all metrics

Abstract

Malicious domain detection is one of the most effective approaches applied in detecting Advanced Persistent Threat (APT), the most sophisticated and stealthy threat to modern network. Domain name analysis provides security experts with insights to identify the Command and Control (C&C) communications in APT attacks. In this paper, we propose a machine learning based methodology to detect malware domain names by using Extreme Learning Machine (ELM). ELM is a modern neural network with high accuracy and fast learning speed. We apply ELM to classify domain names based on features extracted from multiple resources. Our experiment reveals the introduced detection method is able to perform high detection rate and accuracy (of more than 95%). The fast learning speed of our ELM based approach is also demonstrated by a comparative experiment. Hence, we believe our method using ELM is both effective and efficient to identify malicious domains and therefore enhance the current detection mechanism of APT attacks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Ghafir I, Prenosil V (2014) Advanced persistent threat attack detection: an overview. Int J Adv Comput Netw Secur 4:50–54
Google Scholar
Li M, Huang W, Wang Y, Fan W, Li J (2016) The study of APT attack stage model. In: 2016 IEEE/ACIS 15th international conference on computer and information science (ICIS), pp 1–5
Li F APT attribution and DNS profiling. http://www.blackhat.com/docs/us-14/materials/us-14-Li-APT-Attribution-And-DNS-Profiling-WP.pdf
Soltani S, Seno SAH, Nezhadkamali M, Budiarto R (2014) A survey on real world botnets and detection mechanisms. Int J Inf Netw Secur 3:116–127
Google Scholar
Grill M, Nikolaev I, Valeros V, Rehak M (2015) Detecting DGA malware using NetFlow. In: 2015 IFIP/IEEE international symposium on integrated network management (IM). IEEE, pp 1304–1309
Sato K, Ishibashi K, Toyono T, Miyake N (2012) Extending black domain name list by using co-occurrence relation between DNS queries. IEICE Trans Commun 95:794–802
Article Google Scholar
Zhang S (2014) Detecting malware domains on DNS traffic. Master Thesis, Shanghai Jiaotong University
Shi L, Lin D, Fang CV, Zhai Y (2015) A hybrid learning from multi-behavior for malicious domain detection on enterprise network. In: 2015 IEEE international conference on data mining workshop (ICDMW). pp 987–996
Gao Y, Zhen Y, Li H, Chua TS (2016) Filtering of brand-related microblogs using social-smooth multiview embedding. IEEE Trans Multimed 18:2115–2126
Article Google Scholar
Manadhata PK, Yadav S, Rao P, Horne W (2014) Detecting malicious domains via graph inference. In: European symposium on research in computer security. Springer, pp 1–18
Lee J, Lee H (2014) GMAD: graph-based malware activity detection by DNS traffic analysis. Comput Commun 49:33–47
Article Google Scholar
Chau DH, Nachenberg C, Wilhelm J, Wright A, Faloutsos C (2010) Polonium: Tera-scale graph mining for malware detection. In: Acm sigkdd conference on knowledge discovery and data mining
Gao Y, Zhang H, Zhao X, Yan S (2017) Event classification in microblog via social tracking. ACM Trans Intell Syst Technol 8:1–14
Article Google Scholar
Ding G, Guo Y, Zhou J, Gao Y (2016) Large-scale cross-modality search via collective matrix factorization hashing. IEEE Trans Image Process 25:5427–5440
Article MathSciNet Google Scholar
Mashechkin IV, Petrovskii MI, Tsarev DV (2016) Machine learning methods for analyzing user behavior when accessing text data in information security problems. Mosc Univ Comput Math Cybern 40:179–184
Article MathSciNet Google Scholar
Futai Z, Siyu Z, Weixiong R (2013) Hybrid detection and tracking of fast-flux botnet on domain name system traffic. China Commun 10:81–94
Article Google Scholar
Bilge L, Kirda E, Kruegel C, Balduzzi M (2011) EXPOSURE: finding malicious domains using passive DNS analysis. In: Network and distributed system security symposium
Amini P, Azmi R, Araghizadeh M (2014) Botnet detection using NetFlow and clustering. Adv Comput Sci Int J 3:139–149
Google Scholar
Yu X, Zhang B, Kang L, Chen J (2012) Fast-flux botnet detection based on weighted svm. Inf Technol J 11:1048–1055
Article Google Scholar
Lasota K, Kozakiewicz A (2011) Analysis of the similarities in malicious DNS domain names. In: International conference on secure and trust computing, data management, and application, 1006
Google Scholar
Ma J, Saul LK, Savage S, Voelker GM (2009) Beyond blacklists: learning to detect malicious web sites from suspicious URLs. In: Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1245–1254
Shannon CE (2001) A mathematical theory of communication. ACM SIGMOBILE Mob Comput Commun Rev 5:3–55
Article Google Scholar
Passerini E, Paleari R, Martignoni L, Bruschi D (2008) Fluxor: detecting and monitoring fast-flux service networks. In: International conference on detection of intrusions and malware, and vulnerability assessment. pp 186–206
Brisco T DNS support for load balancing. https://tools.ietf.org/html/rfc1794
ICANN WHOIS: WHOIS Search. https://whois.icann.org/en
Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70:489–501
Article Google Scholar
Huang GB (2015) What are extreme learning machines? Filling the gap between Frank Rosenblatt’s dream and John von Neumann’s puzzle. Cogn Comput 7:263–278
Article Google Scholar
Website Traffic, Statistics and Analytics—Alexa. http://www.alexa.com/siteinfo
Malicious Domain List. https://www.malwaredomainlist.com/
PhishTank—Join the fight against phishing. http://www.alexa.com/

Download references

Author information

Authors and Affiliations

School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, 800, Dongchuan Rd., Minhang District, Shanghai, 200240, People’s Republic of China
Yong Shi, Gong Chen & Juntao Li

Authors

Yong Shi
View author publications
You can also search for this author in PubMed Google Scholar
Gong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Juntao Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gong Chen.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shi, Y., Chen, G. & Li, J. Malicious Domain Name Detection Based on Extreme Machine Learning. Neural Process Lett 48, 1347–1357 (2018). https://doi.org/10.1007/s11063-017-9666-7

Download citation

Published: 03 July 2017
Issue Date: December 2018
DOI: https://doi.org/10.1007/s11063-017-9666-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Malicious Domain Name Detection Based on Extreme Machine Learning

Abstract

Access this article

Similar content being viewed by others

A Machine Learning Framework for Studying Domain Generation Algorithm (DGA)-Based Malware

A Hybrid Multiclass Classifier Approach for the Detection of Malicious Domain Names Using RNN Model

Enhanced Domain Generating Algorithm Detection Based on Deep Neural Networks

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Malicious Domain Name Detection Based on Extreme Machine Learning

Abstract

Access this article

Similar content being viewed by others

A Machine Learning Framework for Studying Domain Generation Algorithm (DGA)-Based Malware

A Hybrid Multiclass Classifier Approach for the Detection of Malicious Domain Names Using RNN Model

Enhanced Domain Generating Algorithm Detection Based on Deep Neural Networks

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation