The Effect of Attribute Scaling on the Performance of Support Vector Machines

Edwards, Catherine; Raskutti, Bhavani

doi:10.1007/978-3-540-30549-1_44

Catherine Edwards²⁰ &
Bhavani Raskutti²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3339))

Included in the following conference series:

Australasian Joint Conference on Artificial Intelligence

2597 Accesses
6 Citations

Abstract

This paper presents some empirical results showing that simple attribute scaling in the data preprocessing stage can improve the performance of linear binary classifiers. In particular, a class specific scaling method that utilises information about the class distribution of the training sample can significantly improve classification accuracy. This form of scaling can boost the performance of a simple centroid classifier to similar levels of accuracy as the more complex, and computationally expensive, support vector machine and regression classifiers. Further, when SVMs are used, scaled data produces better results, for smaller amounts of training data, and with smaller regularisation constant values, than unscaled data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Berry, M.J.A., Linoff, G.: Data Mining Techniques: For Marketing, Sales and Customer Support. Wiley, New York (1997)
Google Scholar
Pyle, D.: Data Preparation for Data Mining. Morgan Kaufmann Publishers, Inc., California (1999)
Google Scholar
Corte, C., Vapnik, V.: Support-vector networks. Machine Learning 20, 273–297 (1995)
Google Scholar
Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and other kernel-based learning methods. Cambridge University Press, Cambridge (2000)
Google Scholar
Vapnik, V.: Statistical learning theory. Wiley, Chichester (1998)
MATH Google Scholar
Kimeldorf, G., Whaba, G.: A correspondence between Bayesian estimation of stochastic processes and smoothing by splines. Ann. Math. Statist. 41, 495–502 (1970)
Article MATH MathSciNet Google Scholar
Schölkopf, B., Smola, A.J.: Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond. MIT Press, Cambridge (2001)
Google Scholar
Girosi, F., Jones, M., Poggio, T.: Regularization theory and neural networks architectures. Neural Computation 7, 219–269 (1995)
Article Google Scholar
Rocchio, J.J.: Relevance feedback in information retrieval. In: Salton, G. (ed.) The SMART Retrieval System: Experiments in Automatic Document Processing, pp. 313–323. Prentice-Hall, Englewood Cliffs (1971)
Google Scholar
Kowalczyk, A., Raskutti, B.: Exploring Fringe Settings of SVMs for Classification. In: Lavrač, N., Gamberger, D., Todorovski, L., Blockeel, H. (eds.) PKDD 2003. LNCS (LNAI), vol. 2838, pp. 278–290. Springer, Heidelberg (2003)
Chapter Google Scholar
Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition 30(7), 1145–1159 (1997)
Article Google Scholar
Weiss, G., Provost, F.: The effect of class distribution on classifier learning. Technical report, Rutgers University (2001)
Google Scholar
Centor, R.: Signal detectability: The use of ROC curves and their analysis. Med. Decis. Making 11, 102–106 (1991)
Article Google Scholar
Fawcett, T.: ROC Graphs: Notes and practical considerations for data mining researchers. In: HP Labs Tech Report HPL-2003-4 (2003)
Google Scholar
Bamber, D.: The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. J. Math. Psych. 12, 387–415 (1975)
Article MATH MathSciNet Google Scholar
Hand, D.J., Till, R.J.: A simple generalisation of the area under the ROC curve for multiple class classification problems. Machine Learning 45, 171–186 (2001)
Article MATH Google Scholar
Hsu, C., Chang, C., Lin, C.: A practical guide to support vector classification (2003), http://www.csie.ntu.tw/cjlin/papers/guide/guide.pdf
Sarle, W.: Neural network FAQ (1997), ftp://ftp.sas.com/pub/neural/FAQ2.html

Download references

Author information

Authors and Affiliations

Telstra Research Laboratories, Telstra Corporation, 770 Blackburn Road, Clayton, Victoria, Australia
Catherine Edwards & Bhavani Raskutti

Authors

Catherine Edwards
View author publications
You can also search for this author in PubMed Google Scholar
Bhavani Raskutti
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Information Technology, Monash University, VIC 3800, Australia
Geoffrey I. Webb
Science, Engineering and Technology Portfolio, Royal Melbourne Institute of Technology, VIC 3001, Melbourne, Australia
Xinghuo Yu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Edwards, C., Raskutti, B. (2004). The Effect of Attribute Scaling on the Performance of Support Vector Machines. In: Webb, G.I., Yu, X. (eds) AI 2004: Advances in Artificial Intelligence. AI 2004. Lecture Notes in Computer Science(), vol 3339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30549-1_44

Download citation

DOI: https://doi.org/10.1007/978-3-540-30549-1_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24059-4
Online ISBN: 978-3-540-30549-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics