Mutual Information with Parameter Determination Approach for Feature Selection in Multivariate Time Series Prediction

Liu, Tianhong; Wei, Haikun; Zhang, Chi; Zhang, Kanjian

doi:10.1007/978-3-319-44188-7_17

Tianhong Liu¹²,
Haikun Wei¹²,
Chi Zhang¹² &
…
Kanjian Zhang¹²

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 629))

Included in the following conference series:

International Conference on Engineering Applications of Neural Networks

2212 Accesses

Abstract

For modeling of multivariate time series, input variable selection is a key problem. Feature selection is to select a relevant subset to reduce the dimensionality of the problem without significant loss of information. This paper presents the estimation of mutual information and its application in feature selection problem. Mutual information is one of the most common strategies borrowed from information theory for feature selection. However, the calculation of probability density function (PDF) according to the definition of mutual information is difficult, especially for high dimensional variables. A k-nearest neighbor (k-NN) method based estimator is widely used to estimate the mutual information between two variables directly from the data set. Nevertheless, this estimator depends on smoothing parameter. There is no theoretically method to choose the parameter. This paper purposes to solve two problems: one is to employ resampling methods to help the mutual information estimator to improve feature selection and the other is to apply these methods to a wind power prediction problem.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Keynia, F.: A new feature selection algorithm and composite neural network for electricity price forecasting. Eng. Appl. Artif. Intell. 25(8), 1687–1697 (2012)
Article Google Scholar
Wu, C.L., Chau, K.W.: Prediction of rainfall time series using modular soft computing methods. Eng. Appl. Artif. Intell. 26(3), 997–1007 (2013)
Article Google Scholar
Du Preez, J., Witt, S.F.: Univariate versus multivariate time series forecasting: An application to international tourism demand. Int. J. Forecast. 19(3), 435–451 (2003)
Article Google Scholar
Franois, D., Rossi, F., Wertz, V.: Resampling methods for parameter-free and robust feature selection with mutual information. Neurocomputing 70, 1276–1288 (2007)
Article Google Scholar
Geman, S., Bienenstock, E., Doursat, R.: Neural networks and the bias/variance dilemma. Neual. Comput. 4(1), 1–58 (1992)
Article Google Scholar
Shannon, C.E.: A mathematical theory of communication. Bell Syst. Tech. J. 27, 379–423 (1948)
Article MathSciNet MATH Google Scholar
Frnay, B., Doquire, G., Verleysen, M.: Is mutual information adequate for feature selection in regression? Neural Netw. 48, 1–7 (2013)
Article MATH Google Scholar
Han, M., Ren, W.J., Liu, X.X.: Joint mutual information-based input variable selection for multivariate time series modeling. Eng. Appl. Rtif. Intel. 37, 250–257 (2015)
Article Google Scholar
Lin, Y., Hu, Q., Liu, J., Chen, J., Duan, J.: Multi-label feature selection based on neighborhood mutual information. Appl. Soft. Comput. 38, 244–256 (2016)
Article Google Scholar
Battiti, R.: Using mutual information for selecting features in supervised neural net learning. IEEE Trans. Neural Netw. 5, 537–550 (1994)
Article Google Scholar
Van Dijck, G., Van Hulle, M.M.: Speeding up the wrapper feature subset selection in regression by mutual information relevance and redundancy analysis. In: Kollias, S.D., Stafylopatis, A., Duch, W., Oja, E. (eds.) Artificial Neural Networks–ICANN 2006. LNCS, vol. 4131, pp. 31–40. Springer, Heidelberg (2006)
Chapter Google Scholar
Fleuret, F.: Fast binary feature selection with conditional mutual information. J. Mach. Learn. Res. 5, 1531–1555 (2004)
MathSciNet MATH Google Scholar
Rossi, F., Franois, D., Wertz, V., Meurens, M., Verleysen, M.: Fast selection of spectral variables with b-spline compression. Chemometr. Intell. Lab. 86(2), 208–218 (2007)
Article Google Scholar
Kraskov, A., Stgbauer, H., Grassberger, P.: Estimating mutual information. Phys. Rev. E 69, 066–138 (2004)
Article MathSciNet Google Scholar
Verleysen, M., Rossi, F., François, D.: Advances in feature selection with mutual information. In: Biehl, M., Hammer, B., Verleysen, M., Villmann, T. (eds.) Similarity-Based Clustering. LNCS, vol. 5400, pp. 52–69. Springer, Heidelberg (2009)
Chapter Google Scholar
Stogbauer, H., Kraskov, A., Astakhov, S.A., Grassberger, P.: Least dependent component analysis based on mutual information. Phys. Rev. E 70, 066–123 (2004)
Article Google Scholar
Peng, H.C., Long, F.H., Ding, C.: Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Pattern. Anal. Mach. Intell. 27(8), 1226–1238 (2005)
Article Google Scholar

Download references

Acknowledgments

The authors gratefully acknowledge the financial support of this research by the National Natural Science Foundation of China (Grant No. 61374006), the Major Program of National Natural Science Foundation of China (Grant No. 11190015) and the Natural Science Foundation of Jiangsu (Grant No. BK20131300).

Author information

Authors and Affiliations

Key Laboratory of Measurement and Control of CSE, Ministry of Education, School of Automation, Southeast University, Nanjing, 210096, People’s Republic of China
Tianhong Liu, Haikun Wei, Chi Zhang & Kanjian Zhang

Authors

Tianhong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Haikun Wei
View author publications
You can also search for this author in PubMed Google Scholar
Chi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Kanjian Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Haikun Wei .

Editor information

Editors and Affiliations

Robert Gordon University, Aberdeen, United Kingdom
Chrisina Jayne
Lab of Forest Informatics (FiLAB), Democritus University of Thrace Lab of Forest Informatics (FiLAB), Orestiada, Greece
Lazaros Iliadis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, T., Wei, H., Zhang, C., Zhang, K. (2016). Mutual Information with Parameter Determination Approach for Feature Selection in Multivariate Time Series Prediction. In: Jayne, C., Iliadis, L. (eds) Engineering Applications of Neural Networks. EANN 2016. Communications in Computer and Information Science, vol 629. Springer, Cham. https://doi.org/10.1007/978-3-319-44188-7_17

Download citation

DOI: https://doi.org/10.1007/978-3-319-44188-7_17
Published: 19 August 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-44187-0
Online ISBN: 978-3-319-44188-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics