Abstract
Improving Smartphone anomaly-based malware detection techniques is widely studied in recent years. Previous studies explore three factors: dataset size, dataset type and normal profile model. These factors improve the performance, but increase computation complexity and the required memory space. In this paper we explore a new factor: the dataset representation. Dataset representation is the format adopted to organize and represent data. To investigate the impact of this factor, we examine four machine learning classifiers with three different dataset representations. Those dataset representations are: successive system calls, bag of system calls and patterns frequency system calls. The used dataset is a collection of system call traces of Smartphone executing Android 2.2. We analyse the performance of each classifier and deduce the influence of dataset representation on accuracy and false positive rates. The results show that the dataset representation has a potential impact on the performance of classifiers with low computational and memory cost.
Chapter PDF
Similar content being viewed by others
References
Amamra, A., Talhi, C., Robert, J.-M.: Performance Evaluation of Multi-pattern Matching Algorithms on Smartphone. In: BWCCA, Vicroria, BC, Canada, pp. 329–334 (2012)
Amamra, A., Talhi, C., Robert, J.-M.: Smartphone Malware Detection: From a Survey Towards Taxonomy. In: Malware, Fajardo, Puerto Rico, USA, pp. 89–96 (2012)
Bose, A., Hu, X., Shin, K.G., Park, T.: Behavioral detection of malware on mobile handsets. In: MobiSys, Breckenridge, CO, USA, pp. 225–238 (2008)
Shabtai, A., Kanonov, U., Elovici, Y., Glezer, C., Weiss, Y.: Andromaly: a behavioral malware detection framework for android devices. Journal of Intelligent Information Systems 38, 161–190 (2012)
Xie, L., Zhang, X.: pBMDS: a behavior-based malware detection system for cellphone devices. In: Third ACM Conference on Wireless Network Security, Hoboken, NJ, USA, pp. 37–48 (2010)
Zhao, M.: RobotDroid: A Lightweight Malware Detection Framework on Smartphones. Journal of Networks 7(4) (2012)
Burguera, I., Zurutuza, U., Nadjm-Tehrani, S.: Crowdroid: Behavior-Based Malware Detection System for Android. In: Workshop on Security and Privacy in Smartphones and Mobile Devices 2011, Chicago, USA (2011)
Amamra, A., Talhi, C., Robert, J.-M., Hamiche, M.: Enhancing Smartphone Malware Detection Performance by Applying Machine Learning Hybrid Classifiers. In: Kim, T.-H., Ramos, C., Kim, H.-K., Kiumi, A., Mohammed, S., Ślęzak, D. (eds.) ASEA/DRBC 2012. CCIS, vol. 340, pp. 131–137. Springer, Heidelberg (2012)
Pathak, A., Hu, Y.C., Zhang, M., Bahl, P., Wang, Y.-M.: Fine-grained power modeling for smartphones using system call tracing. In: EuroSys 2011, Salzburg, Austria, pp. 153–168 (2011)
Buennemeyer, T.K., Nelson, T.M., Clagett, L.M., Dunning, J.P., Marchany, R.C., Tront, J.G.: Mobile Device Profiling and Intrusion Detection using Smart Batteries. In: HICSS 2008, Waikoloa, Hawaii, pp. 1–10 (2008)
Organization, L.K. Linux 2.6 System calls Table (2013), https://www.kernel.org/pub/linux/kernel/v2.6/ (cited 2013)
Waikato, U.o. WEKA, http://www.cs.waikato.ac.nz/ml/weka/ (cited 2012)
Google. Android Market, https://play.google.com/store (cited 2012)
Mobile, B.C. Mobile Malwares (2012), http://contagiominidump.blogspot.ca/
Mukkamala, S., Janoski, G., Sung, A.: Intrusion detection using neural networks and support vector machines. In: International Joint Conference on Neural Networks, New Mexico, USA, pp. 1702–1707 (2002)
Smola, A., Vishwanathan, S.V.N.: Introduction to Machine Learning. Cambridge University Press, United Kingdom (2008)
Ben Amor, N., Benferhat, S., Elouedi, Z.: Naive Bayes vs Decision Trees in Intrusion Detection Systems. In: Symposium on Applied Computing, Nicosia, Cyprus, pp. 420–424 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 IFIP International Federation for Information Processing
About this paper
Cite this paper
Amamra, A., Talhi, C., Robert, JM. (2013). Impact of Dataset Representation on Smartphone Malware Detection Performance. In: Fernández-Gago, C., Martinelli, F., Pearson, S., Agudo, I. (eds) Trust Management VII. IFIPTM 2013. IFIP Advances in Information and Communication Technology, vol 401. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38323-6_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-38323-6_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38322-9
Online ISBN: 978-3-642-38323-6
eBook Packages: Computer ScienceComputer Science (R0)