Skip to main content

Parallel K-Means Clustering of Remote Sensing Images Based on MapReduce

  • Conference paper
Book cover Web Information Systems and Mining (WISM 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6318))

Included in the following conference series:

Abstract

The K-Means clustering is a basic method in analyzing RS (remote sensing) images, which generates a direct overview of objects. Usually, such work can be done by some software (e.g. ENVI, ERDAS IMAGINE) in personal computers. However, for PCs, the limitation of hardware resources and the tolerance of time consuming present a bottleneck in processing a large amount of RS images. The techniques of parallel computing and distributed systems are no doubt the suitable choices. Different with traditional ways, in this paper we try to parallel this algorithm on Hadoop, an open source system that implements the MapReduce programming model. The paper firstly describes the color representation of RS images, which means pixels need to be translated into a particular color space CIELAB that is more suitable for distinguishing colors. It also gives an overview of traditional K-Means. Then the programming model MapReduce and a platform Hadoop are briefly introduced. This model requires customized ‘map/reduce’ functions, allowing users to parallel processing in two stages. In addition, the paper detail map and reduce functions by pseudo-codes, and the reports of performance based on the experiments are given. The paper shows that results are acceptable and may also inspire some other approaches of tackling similar problems within the field of remote sensing applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Gursoy, A.: Data Decomposition for Parallel K-means Clustering. Parallel Processing and Applied Mathematics 14, 241–248 (2004)

    Article  MATH  Google Scholar 

  2. Li-shun, J., Ding-sheng, L.: Research on K-Means Clustering Parallel Algorithm of Remote Sensing Image. Remote Sensing Information 01, 27–30 (2008)

    Google Scholar 

  3. Matlab R2008a Product Help. Demos/Toolboxes/Image Processing/Image Segmentation/Color-Based Segmentation Using the L*a*b* Color Space

    Google Scholar 

  4. Kartikeyan, B., Sarkar, A., et al.: A segmentation approach to classification of remote sensing imagery. International Journal of Remote Sensing 19, 1695–1709 (1998)

    Article  Google Scholar 

  5. MATLAB Central - File detail - RGB2Lab, http://www.mathworks.com /matlabcentral/fileexchange/24009

  6. Color Inspector 3D - Color Space Conversions, http://www.f4.fhtw-berlin.de/~barthel/ImageJ/ColorInspector/HTMLHelp/farbraumJava.htm#rgb2lab

  7. Zhaoqi, B., Xuegong, Z.: Pattern Recognition, 2nd edn., pp. 235–237. Tsinghua University Press, Beijing (2000)

    Google Scholar 

  8. Dean, J., Ghemawat, S.: Mapreduce: Simplified data processing on large clusters. Communications of the ACM 51, 107–113 (2008)

    Article  Google Scholar 

  9. Hadoop Website, http://hadoop.apache.org/

  10. Yao, K.T., Lucas, R.F., et al.: Data Analysis for Massively Distributed Simulations. Interservice/Industry Training, Simulation, and Education Conference(I/ITSEC) (Got from Google Scholar) (2009)

    Google Scholar 

  11. Zhao, W., Ma, H., et al.: Parallel K-Means Clustering Based on MapReduce. In: CloudCom 2009. LNCS, vol. 5931, pp. 674–679 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lv, Z., Hu, Y., Zhong, H., Wu, J., Li, B., Zhao, H. (2010). Parallel K-Means Clustering of Remote Sensing Images Based on MapReduce. In: Wang, F.L., Gong, Z., Luo, X., Lei, J. (eds) Web Information Systems and Mining. WISM 2010. Lecture Notes in Computer Science, vol 6318. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16515-3_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-16515-3_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16514-6

  • Online ISBN: 978-3-642-16515-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics