Filtering for Malice Through the Data Ocean: Large-Scale PHA Install Detection at the Communication Service Provider Level

Chen, Kai; Li, Tongxin; Ma, Bin; Wang, Peng; Wang, XiaoFeng; Zong, Peiyuan

doi:10.1007/978-3-319-66332-6_8

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 10453)

Included in the following conference series:

International Symposium on Research in Attacks, Intrusions, and Defenses

Abstract

As a key stakeholder in mobile communications, the communication service provider (CSP, including carriers and ISPs) plays a critical role in safeguarding mobile users against potentially harmful apps (PHAs), complementing the security protection at app stores. However, a CSP-level scan faces an enormous challenge: hundreds of millions of apps are installed every day; retaining their download traffic to construct their packages places a huge burden on the CSP, forces it to change its infrastructure, and can have serious privacy and legal ramifications. To control the cost and avoid trouble, today's CSPs acquire apps from download URLs for malware analysis. Even this step is extremely expensive and struggles to meet the demands of online protection: for example, a CSP we are working with runs hundreds of machines to check the daily downloads it observes. To rise to this challenge, we present in this paper an innovative "app baleen" framework (called Abaleen) for online security vetting of an extremely large number of app downloads, through high-performance, concurrent inspection of app content from the sources of the downloads. At the center of the framework is the idea of retrieving only a small amount of the content from the remote sources to identify suspicious app downloads and warn the end users, ideally before the installation is complete. Running on 90 million download URLs recorded by our CSP partner, our screening framework achieves a nearly 85× speed-up compared to the existing solution. This level of performance enables online vetting for PHAs at the CSP scale: among all unique URLs used in our study, more than 95% were processed before the completion of unfettered downloads. With the CSP-level dataset, we revealed not only the surprising pervasiveness of PHAs but also their real impact (over 2 million installs in merely 3 days).
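The abstract does not spell out how "only a small amount of the content" is retrieved. One plausible reading, sketched below purely as an illustration, assumes that an Android package is a ZIP archive and that the download server honors HTTP byte-range requests. Under those assumptions, a client could fetch just the tail of the file, locate the ZIP end-of-central-directory record, and then request only the central directory (the file listing) rather than the whole package. The function names `tail_range_header`, `parse_eocd`, and `central_directory_range_header` are hypothetical and do not come from the paper.

```python
import io
import struct
import zipfile

EOCD_SIGNATURE = b"PK\x05\x06"  # ZIP end-of-central-directory signature
EOCD_MIN_SIZE = 22              # size of the record without a trailing comment


def tail_range_header(tail_bytes: int) -> dict:
    """Build an HTTP Range header asking for only the last `tail_bytes` bytes."""
    return {"Range": f"bytes=-{tail_bytes}"}


def parse_eocd(tail: bytes) -> dict:
    """Find the end-of-central-directory record in the tail of a ZIP archive."""
    pos = tail.rfind(EOCD_SIGNATURE)
    if pos < 0 or len(tail) - pos < EOCD_MIN_SIZE:
        raise ValueError("no end-of-central-directory record in tail")
    # Fields: signature, disk no., disk with directory, entries on this disk,
    # total entries, directory size, directory offset, comment length.
    (_sig, _disk, _cd_disk, _entries_here, entries,
     cd_size, cd_offset, _comment_len) = struct.unpack_from("<4sHHHHIIH", tail, pos)
    return {"entries": entries, "cd_size": cd_size, "cd_offset": cd_offset}


def central_directory_range_header(eocd: dict) -> dict:
    """Build a Range header covering only the central directory."""
    start = eocd["cd_offset"]
    end = start + eocd["cd_size"] - 1
    return {"Range": f"bytes={start}-{end}"}


# Offline demonstration on an in-memory archive standing in for a remote APK.
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as archive:
    archive.writestr("AndroidManifest.xml", b"m" * 100)
    archive.writestr("classes.dex", b"d" * 1000)
data = buf.getvalue()

eocd = parse_eocd(data[-64:])  # pretend only the last 64 bytes were fetched
directory = data[eocd["cd_offset"]:eocd["cd_offset"] + eocd["cd_size"]]
```

In this sketch the two range requests together transfer the central directory plus a short tail, which is typically a small fraction of the package. Whether Abaleen works this way is not stated in the text available here.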

Notes

1. Those terms and images were manually inspected to ensure their correctness.
2. For each node, its child nodes with most children are visited first.

Acknowledgements

We thank our shepherd Roberto Perdisci and anonymous reviewers for their valuable comments. We also thank VirusTotal for the help in validating suspicious apps in our study. Kai Chen was supported in part by NSFC U1536106, National Key Research and Development Program of China (Grant No. 2016QY04W0805), Youth Innovation Promotion Association CAS, and strategic priority research program of CAS (XDA06010701). The IU authors are supported in part by NSF CNS-1223477, 1223495, 1527141 and 1618493, and ARO W911NF1610127.

Author information

Authors and Affiliations

SKLOIS, Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China
Kai Chen, Bin Ma & Peiyuan Zong
School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China
Kai Chen, Bin Ma & Peiyuan Zong
Peking University, Beijing, China
Tongxin Li
Indiana University, Bloomington, USA
Peng Wang & XiaoFeng Wang

Corresponding author

Correspondence to Kai Chen.

Editor information

Editors and Affiliations

Qatar Computing Research Institute, Doha, Qatar
Marc Dacier
University of Illinois at Urbana Champaign, Champaign, Illinois, USA
Michael Bailey
Stony Brook University, Stony Brook, New York, USA
Michalis Polychronakis
Georgia Institute of Technology, Georgia, USA
Manos Antonakakis

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (txt 1 KB)

Cite this paper

Chen, K., Li, T., Ma, B., Wang, P., Wang, X., Zong, P. (2017). Filtering for Malice Through the Data Ocean: Large-Scale PHA Install Detection at the Communication Service Provider Level. In: Dacier, M., Bailey, M., Polychronakis, M., Antonakakis, M. (eds) Research in Attacks, Intrusions, and Defenses. RAID 2017. Lecture Notes in Computer Science, vol 10453. Springer, Cham. https://doi.org/10.1007/978-3-319-66332-6_8

DOI: https://doi.org/10.1007/978-3-319-66332-6_8
Published: 12 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-66331-9
Online ISBN: 978-3-319-66332-6
eBook Packages: Computer Science, Computer Science (R0)
