Skip to main content

Information Gain, Correlation and Support Vector Machines

  • Chapter
Feature Extraction

Part of the book series: Studies in Fuzziness and Soft Computing ((STUDFUZZ,volume 207))

Abstract

We report on our approach, CBAmethod3E, which was submitted to the NIPS 2003 Feature Selection Challenge on Dec. 8, 2003. Our approach consists of combining filtering techniques for variable selection, information gain and feature correlation, with Support Vector Machines for induction. We ranked 13th overall and ranked 6th as a group. It is worth pointing out that our feature selection method was very successful in selecting the second smallest set of features among the top-20 submissions, and in identifying almost all probes in the datasets, resulting in the challenge’s best performance on the latter benchmark.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • B.E. Boser, I. Guyon, and V. Vapnik. A training algorithm for optimal margin classifiers. In Fifth Annual Workshop on Computational Learning Theory, pages 144–152. ACM, 1992.

    Google Scholar 

  • P.S. Bradley and O.L. Mangasarian. Feature selection via concave minimization and support vector machines. In Proc15th IntĊonfṀachine Learning, pages 82–90, 1998.

    Google Scholar 

  • C.C. Chang and C.J. Lin. LIBSVM: a library for support vector machines, 2001. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.

  • C. Cortes and V. Vapnik. Support vector networks. Machine Learning, 20(3):273–297, 1995.

    MATH  Google Scholar 

  • U. Fayyad and K. Irani. Multi-interval discretization of continuous-valued attributes for classification learning. In Proc10th IntĊonfṀachine Learning, pages 194–201, 1993.

    Google Scholar 

  • I. Guyon, J. Weston, S. Barnhill, and V. Vapnik. Gene selection for cancer classification using support vector machines. Machine Learning, 46:389–422, 2002.

    Article  MATH  Google Scholar 

  • G. Karakoulas. Cost-effective classification for credit scoring. In Proc3rd IntĊonfȦI Applications on Wall Street, 1995.

    Google Scholar 

  • T. Mitchell. Machine Learning. McGraw-Hill, New York, 1997.

    MATH  Google Scholar 

  • M. Momma and K.P. Bennett. A pattern search method for model selection of support vector regression. In R. Grossman, J. Han, V. Kumar, H. Mannila, and R. Motwani, editors, Proceedings of the Second SIAM International Conference on Data Mining, pages 261–274. SIAM, 2002.

    Google Scholar 

  • J. Platt. Fast Training of Support Vector Machines using Sequential Minimal Optimization, chapter 12, pages 185–208. MIT Press, 1999.

    Google Scholar 

  • P. Turney. Types of cost in inductive concept learning. In Workshop cost-sensitive learning, Proc. 17th Int. Conf. Machine Learning, pages 15–21, 2000.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Roobaert, D., Karakoulas, G., Chawla, N.V. (2006). Information Gain, Correlation and Support Vector Machines. In: Guyon, I., Nikravesh, M., Gunn, S., Zadeh, L.A. (eds) Feature Extraction. Studies in Fuzziness and Soft Computing, vol 207. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-35488-8_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-35488-8_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-35487-1

  • Online ISBN: 978-3-540-35488-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics