Feature Selection Based on Run Covering

Yang, Su; Liang, Jianning; Wang, Yuanyuan; Winstanley, Adam

doi:10.1007/11949534_21

Su Yang¹⁸,
Jianning Liang¹⁸,
Yuanyuan Wang¹⁹ &
…
Adam Winstanley²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4319))

Included in the following conference series:

Pacific-Rim Symposium on Image and Video Technology

1165 Accesses
1 Citations

Abstract

This paper proposes a new feature selection algorithm. First, the data at every attribute are sorted. The continuously distributed data with the same class labels are grouped into runs. The runs whose length is greater than a given threshold are selected as “valid” runs, which enclose the instances separable from the other classes. Second, we count how many runs cover every instance and check how the covering number changes once eliminate a feature. Then, we delete the feature that has the least impact on the covering cases for all instances. We compare our method with ReliefF and a method based on mutual information. Evaluation was performed on 3 image databases. Experimental results show that the proposed method outperformed the other two.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. Journal of Machine Learning Research 3, 1157–1182 (2003)
Article MATH Google Scholar
Liu, H., Yu, L.: Toward integrating feature selection algorithms for classification and clustering. IEEE Trans. Knowledge and Data Engineering 17, 491–502 (2005)
Article Google Scholar
Jain, A., Zongker, D.: Feature selection: Evaluation, application, and small sample performance. IEEE Transactions on Pattern Analysis and Machine Intelligence 19, 153–158 (1997)
Article Google Scholar
Kohavi, R., John, G.H.: Wrappers for feature subset selection. Artificial Intelligence 97, 273–324 (1997)
Article MATH Google Scholar
Robnik-Sikonja, M., Kononenko, I.: Theoretical and empirical analysis of ReliefF and RreliefF. Machine Learning 53, 23–69 (2003)
Article MATH Google Scholar
Kira, K., Rendell, L.: A practical approach to feature selection. In: Proc. Int. Conf. Machine Learning, pp. 249–256 (1992)
Google Scholar
Peng, H., Long, F., Ding, C.: Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy. IEEE Transactions on Pattern Analysis and Machine Intelligence 27, 1226–1238 (2005)
Article Google Scholar
Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proc. Int. Joint Conf. Artificial Intelligence, pp. 1137–1145 (1995)
Google Scholar
Rui, Y., Huang, T.S., Chang, S.: Image retrieval: Current techniques, promising directions and open issues. Visual communication and image representation 10(4), 39–62 (1999)
Article Google Scholar
Ho, T.K., Baird, H.S.: Pattern classification with compact distribution maps. Computer Vision and Image Understanding 70, 101–110 (1998)
Article Google Scholar
Cover, T.M.: The best two independent measurements are not the two best. IEEE Transactions on Systems, Man, and Cybernetics 4, 116–117 (1974)
MATH Google Scholar
Narendra, P.M., Fukunaga, K.: A branch and bound algorithm for feature subset selection. IEEE Transactions on Computers 26, 917–922 (1977)
Article MATH Google Scholar
Somol, P., Pudil, P., Kittler, J.: Fast branch & bound algorithms for optimal feature selection. IEEE Transactions on Pattern Analysis and Machine Intelligence 26, 900–912 (2004)
Article Google Scholar
Kwak, N., Choi, C.H.: Input feature selection by mutual information based on Parzen windows. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(12), 1667–1671 (2002)
Article Google Scholar
Trappenberg, T., Ouyang, J., Back, A.: Input variable selection: Mutual information and linear mixing measures. IEEE Trans. Knowledge and Data Engineering 18, 37–46 (2006)
Article Google Scholar
http://www.ics.uci.edu/~mlearn/MLRepository.html
http://www.cs.waikato.ac.nz/~ml

Download references

Author information

Authors and Affiliations

Shanghai Key Laboratory of Intelligent Information Processing, Dept. of Computer Science and Engineering, Fudan University, Shanghai, 200433, China
Su Yang & Jianning Liang
Dept. of Electronic Engineering, Fudan University, Shanghai, 200433, China
Yuanyuan Wang
National Centre for Geocomputation, Dept. of Computer Science, National University of Ireland, Maynooth, Co. Kildare, Ireland
Adam Winstanley

Authors

Su Yang
View author publications
You can also search for this author in PubMed Google Scholar
Jianning Liang
View author publications
You can also search for this author in PubMed Google Scholar
Yuanyuan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Adam Winstanley
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, National Tsing Hua University, HsinChu, Taiwan
Long-Wen Chang
Department of Electrical Engineering, National Chung Cheng University, 621, Chia-Yi, Taiwan, ROC
Wen-Nung Lie

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yang, S., Liang, J., Wang, Y., Winstanley, A. (2006). Feature Selection Based on Run Covering. In: Chang, LW., Lie, WN. (eds) Advances in Image and Video Technology. PSIVT 2006. Lecture Notes in Computer Science, vol 4319. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11949534_21

Download citation

DOI: https://doi.org/10.1007/11949534_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68297-4
Online ISBN: 978-3-540-68298-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics