Abstract
The present paper describes the formal model of data mining algorithms. These models consider each data mining algorithm as a sequence of operations. This allows us to determine ways for parallel execution of data mining algorithms. The software implementation of the formal model is executed on the Java language. A few data mining algorithms were developed on the basis of the suggested formal modal. The algorithm k-means is described in the paper as the example.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Amol, G., Prabhanjan, K., Edwin, P., Ramakrishnan, K.: NIMBLE: a toolkit for the implementation of parallel data mining and machine learning algorithms on mapreduce. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’11), pp. 334–342. California (2011)
Andrew, Y.Ng., Bradski, G., Chu, C-T., Olukotun, K., Kim, Sang K., Lin, Y-A., Yu, Y.: Map-reduce for machine learning on multicore. In: Proceedings of the Twentieth Annual Conference on Neural Information Processing Systems, pp. 281–288. Vancouver, Canada (2006)
Common Warehouse Metamodel (CWM) Specification. http://www.omg.org/spec/CWM/1.1/
Java Specification Request 73: Java Data Mining (JDM) - JDM Public review Draft 2003/11/25 : JSR-73 Expert Group
Kholod, I.I.: Unified data mining model. In: XV International Conference on Soft Computing and Measurements SCM‘2012, vol. 1, pp. 237–240. Saint-Petersburg (2012)
Kholod, I.I., Karshiyev, Z.A.: Parallelization of the algorithm Naïve Bayes on the basis of block structure. In: XV International Conference on Soft Computing and Measurements SCM‘2012, vol. 1, pp. 182–185. Saint-Petersburg (2012)
Barsegian, A., Kupriyanov, M., Kholod, I., Thess, M.: Analysis of Data and Processes: From Standard to Realtime Data Mining, p. 300. Re Di Roma-Verlag (2014)
Acknowledgments
The work has been performed in Saint Petersburg Electrotechnical University “LETI” within the scope of the contract Board of Education of Russia and science of the Russian Federation under the contract No 02.G25.31.0058 from 12.02.2013. This paper is also supported by the federal project “Organization of scientific research” of the main part of the state plan of the Board of Education of Russia and project part of the state plan of the Board of Education of Russia (task # 2.136.2014/K).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Kholod, I., Karshiyev, Z., Shorov, A. (2015). The Formal Model of Data Mining Algorithms for Parallelize Algorithms. In: Wiliński, A., Fray, I., Pejaś, J. (eds) Soft Computing in Computer and Information Science. Advances in Intelligent Systems and Computing, vol 342. Springer, Cham. https://doi.org/10.1007/978-3-319-15147-2_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-15147-2_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-15146-5
Online ISBN: 978-3-319-15147-2
eBook Packages: EngineeringEngineering (R0)