Abstract
As a parallel programming model, Map-Reduce is used for distributed computing of massive data. Map-Reduce model encapsulates the details of parallel implementation, fault-tolerant processing, local computing and load balancing, etc., provides a simple but powerful interface. In case of having no clear idea about distributed and parallel programming, this interface can be utilized to save development time. This paper introduces the method of using Hadoop, the open-source Map-Reduce software platform, to combine PCs to carry out scalable parallel computing. Our experiment using 12 PCs to compute N-body problem based on Map-Reduce model shows that we can get a 9.8x speedup ratio. This work indicates that the Map-Reduce can be applied in scalable parallel computing.
Similar content being viewed by others
References
Wikipedia. SETI@home [EB/OL]. (2011-7-10) [2011-7-15]. http://en.wikipedia.org/wiki/SETI@home.
Wikipedia. Berkeley open infrastructure for network computing [EB/OL]. (2011-6-14) [2011-6-20]. http://en.wikipedia.org/wiki/Berkeley Open Infrastructure for Network Computing.
Boinc’s official website. How BOINC works [EB/OL]. (2011-7-1) [2011-7-15]. http://boinc.berkeley.edu/wiki/How BOINC works.
Wikipedia. MapReduce [EB/OL]. (2011-7-14) [2011-7-15]. http://en.wikipedia.org/wiki/Map-reduce.
Dean J, Ghemawat S. MapReduce: Simplified data processing on large clusters [J]. Communications of the ACM, 2008, 51(1): 107–113.
Hadoop wiki. PoweredBy [EB/OL]. (2011-7-10) [2011-7-15]. http://wiki.apache.org/hadoop/PoweredBy.
Blelloch G, Narlikar G. A practical comparison of N-body algorithms [M]// Parallel Algorithms (series in Discrete Mathematics and Theoretical Computer Science), Providence: American Mathematical Society. 1997: 1–16.
Nyland L, Harris M, Prins J. Fast N-body simulation with CUDA [M]// GPU Gems 3. Boston: Addison-Wesley Professional. 2007: 677–696.
Google. Hadoop-eclipse-plugin [EB/OL]. (2011-5-8) [2011-7-15]. http://hadoop-eclipse-plugin.googlecode.com/files/hadoop-0.20.3-dev-eclipse-plugin.jar.
Chen W Y. Programming Map-Reduce (Hadoop) with eclipse [EB/OL]. (2008-5-27) [2011-7-15]. http://www.trac.nchc.org.tw/cloud/export/256/hadoopeclipse.pdf.
White T. Hadoop: The definitive guide [M]. US: O’Reilly Media. 2009: 1–38.
Author information
Authors and Affiliations
Corresponding author
Additional information
Project supported by the Shanghai Leading Academic Discipline Project (Grant No.J50103), the National High-Technology Research and Development Program of China (Grant No.2009AA012201), the Major Technology R&D Program of Shanghai (Grant No.08DZ501600), and the Science and Technology Pillar Project of Jiangxi (Grant No.2010BGB00604)
About this article
Cite this article
Nguyen, Tc., Shen, Wf., Chai, Yh. et al. Research and implementation of scalable parallel computing based on Map-Reduce. J. Shanghai Univ.(Engl. Ed.) 15, 426–429 (2011). https://doi.org/10.1007/s11741-011-0763-3
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11741-011-0763-3