Abstract
In Machine Learning, feature selection is paramount to achieve accurate and correct results as well as to reduce the computational effort. As a first step, it can be used to identify erroneous features or features introducing spurious correlation. This also applies to machine learning in an engineering environment. Here, data is often given as multivariate time series datasets, which require adapted approaches for machine learning as well as preprocessing steps, like feature selection. Because of the importance of feature selection, special methods for this use case are needed. Therefore, we propose an applicable feature selection method for multivariate time series, based on a differential correlation approach. This method aims to detect erroneous features or those introducing spurious correlation through the assessment of variable relationships across multiple multivariate time series instances. For this, the correlation between the time series features is compared across all examined instances through a differential correlation approach. To show the applicability of the proposed selection method, it is evaluated on artificially generated time series datasets, including erroneous features. Additionally, the method is evaluated on a real multivariate time series dataset in context of an engineering environment.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bach, J., Langner, J., Otten, S., Holzäpfel, M., Sax, E.: Data-driven development, a complementing approach for automotive systems engineering. In: IEEE International Systems Engineering Symposium (ISSE). Vienna, vol. 2017, pp. 1–6 (2017). https://doi.org/10.1109/SysEng.2017.8088295
LinkedIn Economic Graph Team: LinkedIn Workforce Report United States, August 2018
Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P.: Knowledge discovery and data mining: towards a unifying framework. In: KDD, vol. 96 (1996)
Pistorius, F., Baumann, D., Seidel, L., Sax, E.: Intuitive time-series-analysis-toolbox for inexperienced data scientists. In: Presented at the CSCI 2020: International Conference on Computational Science and Computational Intelligence (CSCI). Presented on 16 December 2020
Alasadi, S.A., Bhaya, W.S.: Review of data preprocessing techniques in data mining. J. Eng. Appl. Sci. 12(16), 4102–4107 (2017)
Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)
Jović, A., Brkić, K., Bogunović, N.: A review of feature selection methods with applications. In: 2015 38th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO). IEEE (2015)
Bolón-Canedo, V., Sánchez-Maroño, N., Alonso-Betanzos, A.: A review of feature selection methods on synthetic data. Knowl. Inf. Syst. 34(3), 483–519 (2013)
Tang, J., Alelyani, S., Liu, H.: Feature selection for classification: A review. Algorithms and applications, Data classification, p. 37 (2014)
Yoon, H., Yang, K., Shahabi, C.: Feature subset selection and feature ranking for multivariate time series. IEEE Trans. Knowl. Data Eng. 17(9), 1186–1198 (2005)
Han, M., Liu, X.: Feature selection techniques with class separability for multivariate time series. Neurocomputing 110, 29–34 (2013)
Crone, S.F., Kourentzes, N.: Feature selection for time series prediction-a combined filter and wrapper approach for neural networks. Neurocomputing 73(10–12), 1923–1936 (2010)
Christ, M., et al.: Time series feature extraction on basis of scalable hypothesis tests (tsfresh-a python package). Neurocomputing 307, 72–77 (2018)
Christ, M., Kempa-Liehr, A.W., Feindt, M.: Distributed and parallel time series feature extraction for industrial big data applications. arXiv preprint arXiv:1610.07717 (2016)
Fang, L., et al.: Feature selection method based on mutual information and class separability for dimension reduction in multidimensional time series for clinical data. Biomed. Signal Process. Control 21, 82–89 (2015)
Freedman, D., Pisani, R., Purves, R.: Statistics (international student edition). 4th edn. WW Norton & Company, New York (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Pistorius, F., Baumann, D., Sax, E. (2022). Differential Correlation Approach for Multivariate Time Series Feature Selection. In: Arai, K. (eds) Proceedings of the Future Technologies Conference (FTC) 2021, Volume 1. FTC 2021. Lecture Notes in Networks and Systems, vol 358. Springer, Cham. https://doi.org/10.1007/978-3-030-89906-6_59
Download citation
DOI: https://doi.org/10.1007/978-3-030-89906-6_59
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-89905-9
Online ISBN: 978-3-030-89906-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)