Abstract
Multi-objective optimization (MOO) is a class of optimization problems in which several objective functions must be optimized simultaneously. Traditional search methods are difficult to extend to MOO problems, so many of these problems are solved using bio-inspired optimization algorithms. One well-known algorithm that has been applied to MOO is the non-dominated sorting genetic algorithm II (NSGA-II). NSGA-II has been used successfully to solve MOO problems owing to its lower computational complexity compared with other optimization algorithms. In this paper we use NSGA-II to solve a MOO problem in time series data mining: determining the optimal weights of a multi-metric distance that is used to perform several data mining tasks. NSGA-II is particularly appropriate for data mining problems, where evaluating the fitness functions is usually computationally intensive. Whereas several previous papers have proposed methods for optimizing time series data mining problems, this paper is, to our knowledge, the first to optimize several time series data mining tasks simultaneously. Our experiments show that the optimized multi-metric distance we propose performs better on time series data mining tasks than the distance metrics that constitute it when they are applied separately.
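As a rough illustration of the two ideas named in the abstract, the Python sketch below assumes that the multi-metric distance is a weighted sum of component distances. The abstract does not give the formula, so this form is an assumption, and the Euclidean and Manhattan distances are placeholder components only. The sketch also shows the Pareto-dominance test on which non-dominated sorting is built. It is not the full NSGA-II and not the author's implementation, and all function names are hypothetical.

```python
import math
from typing import Callable, List, Sequence, Tuple

Series = Sequence[float]
Distance = Callable[[Series, Series], float]


def euclidean(s: Series, t: Series) -> float:
    """Euclidean distance between two equal-length series."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(s, t)))


def manhattan(s: Series, t: Series) -> float:
    """Manhattan (L1) distance between two equal-length series."""
    return sum(abs(a - b) for a, b in zip(s, t))


def combined_distance(
    s: Series,
    t: Series,
    metrics: Sequence[Distance],
    weights: Sequence[float],
) -> float:
    """Weighted sum of component distances (assumed form, not from the paper)."""
    return sum(w * d(s, t) for w, d in zip(weights, metrics))


def dominates(f: Sequence[float], g: Sequence[float]) -> bool:
    """True if objective vector f Pareto-dominates g, assuming minimization."""
    no_worse = all(a <= b for a, b in zip(f, g))
    strictly_better = any(a < b for a, b in zip(f, g))
    return no_worse and strictly_better


def non_dominated(points: Sequence[Tuple[float, ...]]) -> List[Tuple[float, ...]]:
    """Return the points that no other point dominates (the first Pareto front)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical usage: two short series, equal weights on both components.
d = combined_distance([0.0, 0.0], [3.0, 4.0], [euclidean, manhattan], [0.5, 0.5])

# Hypothetical objective vectors, e.g. error rates on two data mining tasks.
front = non_dominated([(1, 2), (2, 1), (2, 2), (3, 3)])
```

In a multi-objective setting, each candidate weight vector would be scored on every task, and candidates whose score vectors are not dominated by any other candidate form the trade-off set from which a final weighting is chosen.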
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Muhammad Fuad, M.M. (2015). Applying Non-dominated Sorting Genetic Algorithm II to Multi-objective Optimization of a Weighted Multi-metric Distance for Performing Data Mining Tasks. In: Mora, A., Squillero, G. (eds) Applications of Evolutionary Computation. EvoApplications 2015. Lecture Notes in Computer Science, vol 9028. Springer, Cham. https://doi.org/10.1007/978-3-319-16549-3_47
DOI: https://doi.org/10.1007/978-3-319-16549-3_47
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16548-6
Online ISBN: 978-3-319-16549-3
eBook Packages: Computer Science (R0)