Skip to main content

Applying Non-dominated Sorting Genetic Algorithm II to Multi-objective Optimization of a Weighted Multi-metric Distance for Performing Data Mining Tasks

  • Conference paper
  • First Online:
Applications of Evolutionary Computation (EvoApplications 2015)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9028))

Included in the following conference series:

Abstract

Multi-objective optimization (MOO) is a class of optimization problems where several objective functions must be simultaneously optimized. Traditional search methods are difficult to extend to MOO problems so many of these problems are solved using bio-inspired optimization algorithms. One of the famous optimization algorithms that have been applied to MOO is the non-dominated sorting genetic algorithm II (NSGA-II). NSGA-II algorithm has been successfully used to solve MOO problems owing to its lower computational complexity compared with the other optimization algorithms. In this paper we use NSGA-II to solve a MOO problem of time series data mining. The problem in question is determining the optimal weights of a multi-metric distance that is used to perform several data mining tasks. NSGA-II is particularly appropriate to optimize data mining problems where fitness functions evaluation usually involves intensive computing resources. Whereas several previous papers have proposed different methods to optimize time series data mining problems, this paper is, to our knowledge, the first paper to optimize several time series data mining tasks simultaneously. The experiments we conducted show that the performance of the optimized combination of multi-metric distances we propose in executing time series data mining tasks is superior to that of the distance metrics that constitute the combination when they are applied separately.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Witten, I.H., Frank, E.: Data Mining Practical Machine Learning Tools and Techniques, Second Edition edn. Elsevier, Amsterdam (2009)

    Google Scholar 

  2. Muhammad Fuad, M.M.: Differential evolution versus genetic algorithms: towards symbolic aggregate approximation of non-normalized time series. In: Sixteenth International Database Engineering & Applications Symposium– IDEAS 2012. BytePress/ACM, Prague, Czech Republic, 8–10 August 2012

    Google Scholar 

  3. Muhammad Fuad, M.M.: Using differential evolution to set weights to segments with different information content in the piecewise aggregate approximation. In: 16th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems, KES 2012, Frontiers of Artificial Intelligence and Applications (FAIA). IOS Press, San Sebastian, Spain, 10–12 September 2012

    Google Scholar 

  4. Bunke, H., Kraetzl, M.: Classification and detection of abnormal events in time series of graphs. In: Last, M., Kandel, A., Bunke, H. (eds.) Data Mining in Time Series Databases. World Scientific, New Jersey (2003)

    Google Scholar 

  5. Gorunescu, F.: Data Mining: Concepts, Models and Techniques. Blue Publishing House, Cluj-Napoca (2006)

    Google Scholar 

  6. Vlachos, M., Gunopulos, D.: Indexing time-series under conditions of noise. In: Last, M., Kandel, A., Bunke, H. (eds.) Data Mining in Time Series Databases. World Scientific, New Jersey (2003)

    Google Scholar 

  7. Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques, 3rd edn. Morgan Kaufmann, Burlington (2011)

    Google Scholar 

  8. Larose, D.T.: Discovering Knowledge in Data: An Introduction to Data Mining. Wiley, New York (2005)

    Google Scholar 

  9. Kanungo, T., Netanyahu, N.S., Wu, A.Y.: An efficient k-means clustering algorithm: analysis and implementation. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 881–892 (2002)

    Article  Google Scholar 

  10. Muhammad Fuad, M.M.: Differential evolution-based weighted combination of distance metrics for k-means clustering. In: Dediu, A.-H., Lozano, M., Martín-Vide, C. (eds.) TPNC 2014. LNCS, vol. 8890, pp. 193–204. Springer, Heidelberg (2014)

    Chapter  Google Scholar 

  11. Zezula, P., et al.: Similarity Search - The Metric Space Approach. Springer, New York (2005)

    Google Scholar 

  12. Bustos, B., Skopal, T.: Dynamic Similarity Search in Multi-metric Spaces. Proceedings of the ACM Multimedia, MIR Workshop. ACM Press, New York (2006)

    Google Scholar 

  13. Bustos, B., Keim, D.A., Saupe, D., Schreck, T., Vranic, D.: Automatic selection and combination of descriptors for effective 3D similarity search. In: Proceedings of the IEEE International Workshop on Multimedia Content-based Analysis and Retrieval. IEEE Computer Society (2004)

    Google Scholar 

  14. Affenzeller, M., Winkler, S., Wagner, S., Beham, A.: Genetic Algorithms and Genetic Programming Modern Concepts and Practical Applications. Chapman and Hall/CRC, Boca Raton (2009)

    Book  MATH  Google Scholar 

  15. El-Ghazali, T.: Metaheuristics: from Design to Implementation. John Wiley & Sons Inc, Hoboken (2009)

    Google Scholar 

  16. Srinivas, N., Deb, K.: Multi-objective function optimization using non-dominated sorting genetic algorithms. J. Evol. Comput. 2(3), 221–248 (1995)

    Article  Google Scholar 

  17. Maulik, U., Bandyopadhyay, S., Mukhopadhyay, A.: Multiobjective Genetic Algorithms for Clustering. Springer-Verlag GmbH, Heidelberg (2011)

    Book  MATH  Google Scholar 

  18. Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.: A fast and elitist multiobjective genetic algorithm: NSGA-II. In: IEEE Trans Evolutionary Computation (2002)

    Google Scholar 

  19. Ma, Q., Xu, D., Iv, P., Shi, Y.: Application of NSGA-II in parameter optimization of extended state observer. In: Challenges of Power Engineering and Environment (2007)

    Google Scholar 

  20. Keogh, E., Zhu, Q., Hu, B., Hao. Y., Xi, X., Wei, L., Ratanamahatana C.A.: The UCR time series classification/clustering homepage (2011). www.cs.ucr.edu/~eamonn/time_series_data/

  21. Ding, H., Trajcevski, G., Scheuermann, P., Wang, X., Keogh, E.: Querying and mining of time series data: experimental comparison of representations and distance measures. In: Proceedings of the 34th VLDB (2008)

    Google Scholar 

  22. Muhammad Fuad, M.M.: One-step or two-step optimization and the overfitting phenomenon: a case study on time series classification. In: The 6th International Conference on Agents and Artificial Intelligence- ICAART 2014. SCITEPRESS Digital Library, Angers, France, 6–8 March 2014

    Google Scholar 

  23. Muhammad Fuad, M.M.: On the application of bio-inspired optimization algorithms to fuzzy c-means clustering of time series. In: The 4th International Conference on Pattern Recognition Applications and Methods - ICPRAM 2015. SCITEPRESS Digital Library, Lisbon, Portugal, 10–12 January 2015

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Muhammad Marwan Muhammad Fuad .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Muhammad Fuad, M.M. (2015). Applying Non-dominated Sorting Genetic Algorithm II to Multi-objective Optimization of a Weighted Multi-metric Distance for Performing Data Mining Tasks. In: Mora, A., Squillero, G. (eds) Applications of Evolutionary Computation. EvoApplications 2015. Lecture Notes in Computer Science(), vol 9028. Springer, Cham. https://doi.org/10.1007/978-3-319-16549-3_47

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-16549-3_47

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-16548-6

  • Online ISBN: 978-3-319-16549-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics