Skip to main content

Stopping Criterion Selection for Efficient Semi-supervised Time Series Classification

  • Chapter

Part of the book series: Studies in Computational Intelligence ((SCI,volume 149))

Summary

High-quality classifiers generally require significant amount of labeled data. However, in many real-life applications and domains, labeled positive training data are difficult to obtain, while unlabeled data are largely available. To resolve the problem, many researchers have proposed semi-supervised learning methods that can build good classifiers by using only handful of labeled data. However, the main problem of the previous approaches for time series domains is the difficulty in selecting an optimal stopping criterion. This work therefore proposes a novel stopping criterion for semi-supervised time series classification, together with an integration of Dynamic Time Warping distance measure to improve the data selection during a self training. The experimental results show that this method can build a better classifier that achieves higher classification accuracy than the previous approach. In addition, the extended proposed work is shown to have satisfactory result for multi-cluster and multi-class semi-supervised time series classifier.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bennett, K.P., Demiriz, A.: Semi-Supervised Support Vector Machines. In: Proceedings of the 1998 Conference on Advances in Neural Information Processing Systems II (1999)

    Google Scholar 

  2. Blum, A., Lafferty, J.: Learning from Labeled and Unlabeled Data using Graph Mincuts. In: Proceedings of 18th International Conference on Machine Learning (2001)

    Google Scholar 

  3. Blum, A., Mitchell, T.: Combining Labeled and Unlabeled Data with Co-Training. In: Proceedings of 11th Annual Conference on Computational Learning Theory, Madison, Wisconsin, United States (1998)

    Google Scholar 

  4. Chapelle, O., Schölkopf, B., Zien, A.: Semi-Supervised Learning. MIT Press, Cambridge (2006)

    Google Scholar 

  5. Cohen, I., Cozman, F.G., Sebe, N., Cirelo, M.C., Huang, T.S.: Semi-Supervised Learning of Classifiers: Theory, Algorithms, and Their Application to Human-Computer Interaction. IEEE Transaction on Pattern Analysis and Machine Intelligence (2004)

    Google Scholar 

  6. Keogh, E.: The UCR Time Series Classification/Clustering Homepage (January 2008), http://www.cs.ucr.edu/~eamonn/timeseriesdata/

  7. Li, M., Zhou, Z.H.: SETRED: Self-Training with Editing. In: Proceedings of 9th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2005) (2005)

    Google Scholar 

  8. Nigam, K., Mccallum, A.K., Thrun, S., Mitchell, T.: Machine Learning (2000)

    Google Scholar 

  9. Ratanamahatana, C.A., Keogh, E.: Everything you know about Dynamic Time Warping is wrong. In: Proceedings of 3rd Workshop on Mining Temporal and Sequential Data, In Conjunction with 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2004) (2004)

    Google Scholar 

  10. Ratanamahatana, C.A., Keogh, E.: Making Time-Series Classification More Accurate Using Learned Constraints. In: Proceedings of SIAM International Conference on Data Mining (2004)

    Google Scholar 

  11. Sakoe, H., Chiba, S.: Dynamic Programming Algorithm Optimization for Spoken Word Recognition. Morgan Kaufmann, San Francisco (1990)

    Google Scholar 

  12. Shahshahani, B.M., Landgrebe, D.A.: The Effect of Unlabeled Samples in Reducing the Small Sample Size Problem and Mitigating the Hughes Phenomenon. IEEE Transactions on Geoscience and Remote Sensing (1994)

    Google Scholar 

  13. Wei, L.: Self Training dataset (May 2007), http://www.cs.ucr.edu/~wli/selfTraining/

  14. Wei, L., Keogh, E.: Semi-Supervised Time Series Classification. In: Proceedings 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)

    Google Scholar 

  15. Zhang, R., Alexander, I.R.: A New Data Selection Principle for Semi-Supervised Incremental Learning. In: Proceedings of 18th International Conference on Pattern Recognition (ICPR 2006) (2006)

    Google Scholar 

  16. Zhu, X.: Semi-Supervised Learning Literature Survey. Technical report, no.1530, Computer Sciences, University of Wisconsin-Madison (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Roger Lee

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Ratanamahatana, C.A., Wanichsan, D. (2008). Stopping Criterion Selection for Efficient Semi-supervised Time Series Classification. In: Lee, R. (eds) Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing. Studies in Computational Intelligence, vol 149. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70560-4_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-70560-4_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-70559-8

  • Online ISBN: 978-3-540-70560-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics