Abstract
To improve the performance of the traditional support vector machine (SVM), this chapter proposes a method, referred to as MR-SVM, that parallelizes SVM training on MapReduce and mitigates the convergence problems introduced by data partitioning and distributed computation. By splitting the large dataset and computing the support vector set of each chunk concurrently across map units, MR-SVM improves processing capability and efficiency. The partial support vector sets are then combined to form the training set for global training in the reduce phase, and the current global optimum obtained by the reduce operation is fed back to each map unit to determine whether MR-SVM should proceed with another pass. This process iterates until MR-SVM converges to the global optimum; it is proved that convergence is reached within a finite number of iterations. Experimental results show that MR-SVM improves the data processing capability and efficiency of its traditional counterpart while maintaining high accuracy.
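The iterative scheme described in the abstract can be sketched as follows. This is a minimal, single-process simulation under stated assumptions, not the authors' implementation: it substitutes a Pegasos-style subgradient solver for the SMO solver, uses a linear kernel with no bias term, and treats points on or inside the margin as a chunk's support-vector set. All function names (`mr_svm`, `train_linear_svm`, `support_vectors`) are illustrative.

```python
def train_linear_svm(data, lam=0.01, epochs=200):
    """Pegasos-style subgradient trainer for a linear SVM (no bias term);
    a stand-in here for the chapter's SMO solver."""
    dim = len(data[0][0])
    w = [0.0] * dim
    t = 0
    for _ in range(epochs):
        for x, y in data:
            t += 1
            eta = 1.0 / (lam * t)
            margin = y * sum(wi * xi for wi, xi in zip(w, x))
            w = [(1.0 - eta * lam) * wi for wi in w]  # regularization shrink
            if margin < 1.0:                          # hinge-loss subgradient step
                w = [wi + eta * y * xi for wi, xi in zip(w, x)]
    return w


def support_vectors(w, data, tol=0.1):
    """Points on or inside the margin (y * <w, x> <= 1) approximate the
    support-vector set; the tolerance is loose because the solver is
    approximate."""
    return [(x, y) for x, y in data
            if y * sum(wi * xi for wi, xi in zip(w, x)) <= 1.0 + tol]


def mr_svm(chunks, max_iters=10):
    """MR-SVM-style loop: map units train on their chunk plus the fed-back
    global support vectors; the reduce step retrains on the union of partial
    support-vector sets and iterates until that set stops changing."""
    feedback = []  # current global support vectors, fed back to map units
    w_global = None
    for _ in range(max_iters):
        # Map phase: each chunk trains locally and emits its support vectors.
        partial = []
        for chunk in chunks:
            local = chunk + feedback
            w_local = train_linear_svm(local)
            partial.extend(support_vectors(w_local, local))
        # Reduce phase: deduplicate partial sets and run the global training.
        merged = [(list(x), y) for x, y in sorted({(tuple(x), y)
                                                   for x, y in partial})]
        if not merged:  # degenerate case: fall back to the full dataset
            merged = [p for chunk in chunks for p in chunk]
        w_global = train_linear_svm(merged)
        new_feedback = support_vectors(w_global, merged)
        # Converged once the global support-vector set stabilizes.
        if ({(tuple(x), y) for x, y in new_feedback}
                == {(tuple(x), y) for x, y in feedback}):
            break
        feedback = new_feedback
    return w_global
```

In a real deployment the inner loops over `chunks` would run as MapReduce map tasks and the merge-and-retrain step as the reduce task; only the feedback set crosses iterations, which is what keeps the per-pass communication small.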
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Ma, Y., Wang, L., Li, L. (2014). A Parallel and Convergent Support Vector Machine Based on MapReduce. In: Wong, W.E., Zhu, T. (eds) Computer Engineering and Networking. Lecture Notes in Electrical Engineering, vol 277. Springer, Cham. https://doi.org/10.1007/978-3-319-01766-2_67
DOI: https://doi.org/10.1007/978-3-319-01766-2_67
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01765-5
Online ISBN: 978-3-319-01766-2
eBook Packages: Engineering (R0)