Abstract
Android is the most preferred mobile operating system in the world. Applications are available from both official application repositories and other application stores. For these reasons, there has been a remarkable increase in malware for the Android operating system in recent years. In this study, a novel Android malware detection system is proposed by using filter-based feature selection methods. The proposed approach is static Android malware detection based on machine learning. Permissions extracted from application files are used as features in the developed system. Dimension reduction is carried out with eight different feature selection methods to enhance the running time and efficiency of machine learning algorithms. While four of these methods are used in Android malware detection systems, the remaining four methods are adapted from text classification studies. The adapted methods are compared in terms of both extracted features and classification results. When the results are examined, it is shown that the adapted methods improve the efficiency of the classification algorithms and can be used in this field.
Similar content being viewed by others
References
AAPT2 (2020) Android asset packaging tool. https://developer.android.com/studio/command-line/aapt2. Accessed 1 Sept 2020
Abawajy J, Darem A, Alhashmi AA (2021) Feature subset selection for malware detection in smart IOT platforms. Sensors 21(4):1374
Alazab M, Alazab M, Shalaginov A, Mesleh A, Awajan A (2020) Intelligent mobile malware detection using permission requests and API calls. Futur Gener Comput Syst 107:509–521
Ananya A, Aswathy A, Amal T, Swathy P, Vinod P, Mohammad S (2020) Sysdroid: a dynamic ml-based android malware analyzer using system call traces. Clust Comput 23(4):2789–2808
APKPure (2020) APKPure Android application store. http://apkpure.com/. Accessed 1 Sept 2020
Bhattacharya A, Goswami RT (2018) A hybrid community based rough set feature selection technique in android malware detection. Smart trends in systems. Security and sustainability. Springer, Singapore, pp 249–258
Chandrashekar G, Sahin F (2014) A survey on feature selection methods. Comput Elect Eng 40(1):16–28
Coban O, Ozel SA (2019) Adapting text categorization for manifest based android malware detection. Comput Sci 20(3):383–405
Dataset (2020) Official web site of VirusShare. https://virusshare.com/. Accessed 1 Sept 2020
Deepa K, Radhamani G, Vinod P (2015) Investigation of feature selection methods for android malware analysis. Procedia Comput Sci 46:841–848
Deng X, Li Y, Weng J, Zhang J (2019) Feature selection for text classification: a review. Multimedia Tools Appl 78(3):3797–3816
Dharmalingam VP, Palanisamy V (2020) A novel permission ranking system for android malware detection–the permission grader. J Ambient Intell Human Comput 12(5):5071–5081
Faruki P, Bharmal A, Laxmi V, Ganmoor V, Gaur MS, Conti M, Rajarajan M (2014) Android security: a survey of issues, malware penetration, and defenses. IEEE Commun Surv Tutor 17(2):998–1022
Fatima A, Maurya R, Dutta MK, Burget R, Masek J (2019) Android malware detection using genetic algorithm based optimized feature selection and machine learning. In: 2019 42nd International Conference on Telecommunications and Signal Processing (TSP), IEEE, pp 220–223
Feizollah A, Anuar NB, Salleh R, Wahab AWA (2015) A review on feature selection in mobile malware detection. Digit Investig 13:22–37
Forman G (2003) An extensive empirical study of feature selection metrics for text classification. J Mach Learn Res 3(Mar):1289–1305
Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The weka data mining software: an update. ACM SIGKDD Explor Newsl 11(1):10–18
Jung J, Park J, Sj C, Han S, Park M, Cho HH (2021) Feature engineering and evaluation for android malware detection scheme. J Internet Technol 22(2):423–440
Kaspersky (2020) All about Android app permissions. https://www.kaspersky.com/blog/android-permissions-guide/14014/. Accessed 1 Sept 2020
Kouliaridis V, Potha N, Kambourakis G (2021) Improving android malware detection through dimensionality reduction techniques. In: Machine Learning for Networking: Third International Conference, MLN 2020, Paris, France, November 24–26, 2020, Revised Selected Papers 3, Springer International Publishing, pp 57–72
Kural OE, Şahin DÖ, Akleylek S, Kılıç E (2019) Permission weighting approaches in permission based android malware detection. In: 2019 4th International Conference on Computer Science and Engineering (UBMK), IEEE, pp 134–139
Lan M, Tan CL, Su J, Lu Y (2009) Supervised and traditional term weighting methods for automatic text categorization. IEEE Trans Pattern Anal Mach Intell 31(4):721–735
Liu Z, Wang R, Japkowicz N, Tang D, Zhang W, Zhao J (2021) Research on unsupervised feature learning for android malware detection based on restricted Boltzmann machines. Futur Gener Comput Syst 120:91–108
Morales-Ortega S, Escamilla-Ambrosio PJ, Rodriguez-Mota A, Coronado-De-Alba LD (2016) Native malware detection in smartphones with android OS using static analysis, feature selection and ensemble classifiers. In: 2016 11th International Conference on Malicious and Unwanted Software (MALWARE), IEEE, pp 1–8
Pan Y, Ge X, Fang C, Fan Y (2020) A systematic literature review of android malware detection using static analysis. IEEE Access 8:116363–116379
Peng H, Gates C, Sarma B, Li N, Qi Y, Potharaju R, Nita-Rotaru C, Molloy I (2012) Using probabilistic generative models for ranking risks of android apps. In: Proceedings of the 2012 ACM conference on Computer and communications security, pp 241–252
Peynirci G, Eminağaoğlu M, Karabulut K (2020) Feature selection for malware detection on the android platform based on differences of IDF values. J Comput Sci Technol 35(4):946–962
Şahin DÖ, Kural OE, Akleylek S, Kılıç E (2018) New results on permission based static analysis for android malware. In: 2018 6th International Symposium on Digital Forensic and Security (ISDFS), IEEE, pp 1–4
Şahin DÖ, Kılıç E (2019) Two new feature selection metrics for text classification. Automatika 60(2):162–171
Salah A, Shalabi E, Khedr W (2020) A lightweight android malware classifier using novel feature selection methods. Symmetry 12(5):858
Shabtai A, Kanonov U, Elovici Y, Glezer C, Weiss Y (2012) “andromaly”: A behavioral malware detection framework for android devices. J Intell Inf Syst 38:161–190
Suarez-Tangil G, Tapiador JE, Peris-Lopez P, Blasco J (2014) Dendroid: a text mining approach to analyzing and classifying code structures in android malware families. Expert Syst Appl 41(4):1104–1117
Taşcı Ş, Güngör T (2013) Comparison of text feature selection policies and using an adaptive framework. Expert Syst Appl 40(12):4871–4886
Wang W, Zhao M, Gao Z, Xu G, Xian H, Li Y, Zhang X (2019) Constructing features for detecting android malicious applications: issues, taxonomy and directions. IEEE Access 7:67602–67631
Xu K, Li Y, Deng RH (2016) Iccdetector: Icc-based malware detection on android. IEEE Trans Inf Forensics Secur 11(6):1252–1264
Zhao K, Zhang D, Su X, Li W (2015) Fest: A feature extraction and selection tool for android malware detection. In: 2015 IEEE Symposium on Computers and Communication (ISCC), pp 714–720
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Şahin, D.Ö., Kural, O.E., Akleylek, S. et al. A novel Android malware detection system: adaption of filter-based feature selection methods. J Ambient Intell Human Comput 14, 1243–1257 (2023). https://doi.org/10.1007/s12652-021-03376-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-021-03376-6