Stopping Criterion Selection for Efficient Semi-supervised Time Series Classification

Ratanamahatana, Chotirat Ann; Wanichsan, Dechawut

doi:10.1007/978-3-540-70560-4_1

Part of the book series: Studies in Computational Intelligence (SCI, volume 149)

Summary

High-quality classifiers generally require a significant amount of labeled data. However, in many real-life applications and domains, labeled positive training data are difficult to obtain, while unlabeled data are widely available. To address this problem, many researchers have proposed semi-supervised learning methods that can build good classifiers from only a handful of labeled data. The main difficulty with previous approaches in time series domains is selecting an optimal stopping criterion. This work therefore proposes a novel stopping criterion for semi-supervised time series classification, together with the integration of the Dynamic Time Warping distance measure to improve data selection during self-training. The experimental results show that this method builds a classifier that achieves higher classification accuracy than the previous approach. In addition, an extension of the proposed work is shown to give satisfactory results for multi-cluster and multi-class semi-supervised time series classification.
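As a rough illustration of the ingredients named in the summary, the following Python sketch pairs a textbook Dynamic Time Warping distance with a simple self-training loop that repeatedly moves the unlabeled series nearest to the positive set into it. It is only a sketch under stated assumptions: the function names `dtw_distance` and `self_train`, the squared pointwise cost, the absence of a warping window, and above all the fixed budget `max_added` used as the stopping rule are hypothetical placeholders, not the criterion proposed in the chapter.

```python
import math


def dtw_distance(a, b):
    """Textbook dynamic-programming DTW (squared pointwise cost, no warping window)."""
    n, m = len(a), len(b)
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = (a[i - 1] - b[j - 1]) ** 2
            # Extend the cheapest of the three admissible predecessor cells.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return math.sqrt(cost[n][m])


def self_train(positives, unlabeled, max_added):
    """Move the unlabeled series closest to the positive set into it, one per round.

    max_added is a placeholder stopping rule (an assumption for illustration),
    NOT the stopping criterion proposed in the chapter.
    """
    positives = list(positives)
    unlabeled = list(unlabeled)
    min_dists = []  # distance of each selected series at the time it was added
    while unlabeled and len(min_dists) < max_added:
        best_idx, best_dist = None, math.inf
        for idx, series in enumerate(unlabeled):
            d = min(dtw_distance(series, p) for p in positives)
            if d < best_dist:
                best_idx, best_dist = idx, d
        positives.append(unlabeled.pop(best_idx))
        min_dists.append(best_dist)
    return positives, unlabeled, min_dists


labeled, remaining, dists = self_train(
    positives=[[0, 0, 0]],
    unlabeled=[[5, 5, 5], [0, 0, 1], [0, 1, 1]],
    max_added=2,
)
```

The returned `dists` trace records how far each newly added series was from the positive set; a trace of this kind is one signal that a stopping rule could inspect in place of the fixed budget used here.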

Author information

Authors and Affiliations

Department of Computer Engineering, Chulalongkorn University, Phayathai Rd., Pathumwan, Bangkok, 10330, Thailand
Chotirat Ann Ratanamahatana & Dechawut Wanichsan

Editor information

Roger Lee

Cite this chapter

Ratanamahatana, C.A., Wanichsan, D. (2008). Stopping Criterion Selection for Efficient Semi-supervised Time Series Classification. In: Lee, R. (eds) Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing. Studies in Computational Intelligence, vol 149. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70560-4_1

DOI: https://doi.org/10.1007/978-3-540-70560-4_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70559-8
Online ISBN: 978-3-540-70560-4
eBook Packages: Engineering, Engineering (R0)
