Advertisement

A New Binary Classifier: Clustering-Launched Classification

  • Tung-Shou Chen
  • Chih-Chiang Lin
  • Yung-Hsing Chiu
  • Hsin-Lan Lin
  • Rong-Chang Chen
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4114)

Abstract

One of the powerful classifiers is Support Vector Machine (SVM), which has been successfully applied to many fields. Despite its remarkable achievement, SVM is time-consuming in many situations where the data distribution is unknown, causing it to spend much time on selecting a suitable kernel and setting parameters. Previous studies proposed understanding the data distribution before classification would assist the classification. In this paper, we exquisitely combined with clustering and classification to develop a novel classifier, Clustering-Launched Classification (CLC), which only needs one parameter. CLC employs clustering to group data to characterize the features of the data and then adopts the one-against-the-rest and nearest-neighbor to find the support vectors. In our experiments, CLC is compared with two well-known SVM tools: LIBSVM and mySVM. The accuracy of CLC is comparable to LIBSVM and mySVM. Furthermore, CLC is insensitive to parameter, while the SVM is sensitive, showing CLC is easier to use.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Dunham, M.H.: Data Ming: Introductory and Advanced Topics. Prentice-Hall, Englewood Cliffs (2003)Google Scholar
  2. 2.
    Roiger, R.J., Geatz, M.W.: Data Mining: A Tutorial-Based Primer. Addison-Wesley, Reading (2003)Google Scholar
  3. 3.
    Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann, San Francisco (2000)Google Scholar
  4. 4.
    Liang, X.: Mathematical Analysis of Classifying Convex Clusters Based on Support Functionals. In: Li, X., Wang, S., Dong, Z.Y. (eds.) ADMA 2005. LNCS (LNAI), vol. 3584, pp. 761–768. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  5. 5.
    Chen, T.S., Chen, R.C., Lin, C.C., Tsai, T.H., Li, S.Y., Liang, X.: Classification of Microarray Gene Expression Data Using a New Binary Support Vector System. In: Proceedings of IEEE International Conference on Neural Networks and Brain (ICNN&B), pp. 485–489 (2005)Google Scholar
  6. 6.
    Chen, R.C., Chen, T.S., Lin, C.C.: A New Binary Support Vector Approach for Increasing Detection Rate of Credit Card Fraud. International Journal of Pattern Recognition and Artificial Intelligence 20(2), 227–239 (2006)CrossRefGoogle Scholar
  7. 7.
    Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1995)zbMATHGoogle Scholar
  8. 8.
    Sun, B.Y., Huang, D.S., Fang, H.T.: Lidar Signal De-noising Using Least Squares Support Vector Machine. IEEE Signal Processing Letter 12, 101–104 (2005)CrossRefGoogle Scholar
  9. 9.
    Zhao, X.M., Huang, D.S., Cheung, Y.M., Wang, H.Q., Huang, X.: A Novel Hybrid GA/SVM System for Protein Sequences Classification. LNCS, pp. 11–16 (2004)Google Scholar
  10. 10.
    Chen, R.C., Chen, J., Chen, T.S., Hsieh, C.H., Chen, T.Y., Wu, K.Y.: Building an Intrusion Detection System Based on Support Vector Machine and Genetic Algorithm. LNCS, pp. 409–414 (2005)Google Scholar
  11. 11.
    Qiao, H., Zhang, S., Zhang, B., Keane, J.: Intelligent Robots and Systems. In: Proceedings of 2004 IEEE/RSJ International Conference (IROS), vol. 2, pp. 2015–2020 (2004)Google Scholar
  12. 12.
    Chen, R.C., Chen, T.S., Chien, Y.E., Yang, Y.R.: Novel Questionnaire-Responded Transaction Approach with SVM for Credit Card Fraud Detection. LNCS, pp. 916–921 (2005)Google Scholar
  13. 13.
    Chen, R.C., Chiu, M.L., Huang, Y.L., Chen, L.T.: Detecting Credit Card Fraud by Using Questionnaire-Responded Transaction Model Based on Support Vector Machines. LNCS, pp. 800–806 (2004)Google Scholar
  14. 14.
    Chen, T.S., Chen, Y.T., Lin, C.C., Chen, R.C.: A Combined K-Means and Hierarchical Clustering Method for Improving the Clustering Efficiency of Microarrary. In: Proceedings of International Symposium on Intelligent Signal Processing and Communications Systems (ISPACS), pp. 405–408 (2005)Google Scholar
  15. 15.
    Chen, T.S., Tu, B.J., Li, S.C.: A Distance-between-clusters Based Gene Selection Technique on Microarray. In: Proceedings of International Conference on Informatics, Cybernetics, and Systems (ICICS), pp. 1532–1537 (2003)Google Scholar
  16. 16.
    Cover, T.M., Hart, P.E.: Nearest Neighbor Pattern Classification. IEEE Transactions on Information Theory, 21–27 (1967)Google Scholar
  17. 17.
    Chang, C.C., Lin, C.J.: LIBSVM: A Library for Support Vector Machines (2001), Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
  18. 18.
    Cover, T.M., Hart, P.E.: Nearest Neighbor Pattern Classification. IEEE Transactions on Information Theory, 2711–2719 (1967)Google Scholar
  19. 19.
    Chang, C.C., Lin, C.J.: LIBSVM: A Library for Support Vector Machines 20 (2001),Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
  20. 20.
    Ruping, S.: mySVM-Manual, University of Dortmund. Lehrstuhl Informatik 8, 21 (2000)Google Scholar
  21. 21.
    Newman, D.J., Hettich, S., Blake, C.L., Merz, C.J.: UCI Repository ofMachine Learning Databases (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Tung-Shou Chen
    • 1
  • Chih-Chiang Lin
    • 1
  • Yung-Hsing Chiu
    • 1
  • Hsin-Lan Lin
    • 2
  • Rong-Chang Chen
    • 3
  1. 1.Graduate School of Computer Science and Information TechnologyTaiwan
  2. 2.Graduate School of Business AdministrationTaiwan
  3. 3.Department of Logistics Engineering and ManagementNational Taichung Institute of TechnologyTaiwan

Personalised recommendations