High-Integrity MapReduce Computation in Cloud with Speculative Execution

  • Jing Xiao
  • Zhiwei Xiao
Part of the Communications in Computer and Information Science book series (CCIS, volume 164)

Abstract

Cloud computing involves processing a huge amount of data using massively, distributed computing resources. However, the massive and distributed nature of cloud computing also make the integrity of computation upon easily be easily broken either by deliberate attacks or unconscious machine failures. In this paper, we propose to provide high-integrity feature to MapReduce computation using speculative execution. The key idea of our approach is selectively replicating MapReduce tasks on a random computation node, and comparing the hash of the execution results to determine if the integrity of the task is compromised. A preliminary prototype, called Nessaj, has been implemented on Hadoop MapReduce framework. Experimental results show that Nessaj can detect and recover from our randomly injected attacks in high probability. The performance overhead is also moderate.

Keywords

Cloud Platform Statistical Machine Translation Intermediate Data Hadoop Distribute File System MapReduce Framework 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Amazon Inc.: Amazon Elastic Compute Cloud (Amazon EC2), http://aws.amazon.com/ec2/ (2008)
  2. 2.
    Common Vulnerabilities and Exposures: Xen guest root can escape to domain 0 through pygrub, http://cve.mitre.org/cgi-bin/cvename.cgi?name=CVE-2007-4993 (2007)
  3. 3.
    Common Vulnerabilities and Exposures: Vulnerability in xenserver could result in privilege escalation and arbitrary code execution, http://support.citrix.com/article/CTX118766.access (2007)
  4. 4.
    Kortchinsky, K.: Hacking 3d (and breaking out of vmware) (2009)Google Scholar
  5. 5.
    Schroeder, B., Pinheiro, E., Weber, W.: DRAM errors in the wild: a large-scale field study. In: Proceedings of the Eleventh International Joint Conference on Measurement and Modeling of Computer Systems, pp. 193–204. ACM, New York (2009)Google Scholar
  6. 6.
    Apache Wiki: Applications and organizations using hadoop, http://wiki.apache.org/Hadoop/PoweredBy
  7. 7.
    Ghemawat, S., Gobioff, H., Leung, S.: The Google file system. ACM SIGOPS Operating Systems Review 37(5), 29–43 (2003)CrossRefGoogle Scholar
  8. 8.
    Rivest, R.: The MD5 message-digest algorithm. Request for Comments (RFC1320). Internet Activities Board, Internet Privacy Task Force (1992)Google Scholar
  9. 9.
    Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Communications of the ACM 51(1), 107–113 (2008)CrossRefGoogle Scholar
  10. 10.
    Chu, C., Kim, S., Lin, Y., Yu, Y., Bradski, G., Ng, A., Olukotun, K.: Map-reduce for machine learning on multicore. In: Advances in Neural Information Processing Systems: Proceedings of the 2006 Conference, vol. 281. MIT Press, Redmond (2007)Google Scholar
  11. 11.
    Dyer, C., Cordova, A., Mont, A., Lin, J.: Fast, easy, and cheap: Construction of statistical machine translation models with MapReduce. In: Proceedings of the Third Workshop on Statistical Machine Translation at ACL, pp. 199–207 (2008)Google Scholar
  12. 12.
    Ekanayake, J., Pallickara, S., Fox, G.: MapReduce for Data Intensive Scientific Analyses. In: IEEE Fourth International Conference on eScience, 2008. eScience 2008, pp. 277–284 (2008)Google Scholar
  13. 13.
    Bialecki, A., Cafarella, M., Cutting, D., O’Malley, O.: Hadoop: a framework for running applications on large clusters built of commodity hardware (2005), http://lucene.apache.org/hadoop
  14. 14.
    Zaharia, M., Konwinski, A., Joseph, A., Berkeley, U., Katz, R., Stoica, I.: Improving MapReduce Performance in Heterogeneous Environments. In: Proc. OSDI (2008)Google Scholar
  15. 15.
    Roy, I., Setty, S., Kilzer, A., Shmatikov, V., Witchel, E.: Airavat: Security and privacy for MapReduce. In: Proceedings of the 7th USENIX Conference on Networked Systems Design and Implementation, USENIX Association (2010)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Jing Xiao
    • 1
  • Zhiwei Xiao
    • 2
  1. 1.School of Information SecurityShanghai JiaoTong UniversityChina
  2. 2.Software SchoolFudan UniversityChina

Personalised recommendations