Summary
High-quality classifiers generally require significant amount of labeled data. However, in many real-life applications and domains, labeled positive training data are difficult to obtain, while unlabeled data are largely available. To resolve the problem, many researchers have proposed semi-supervised learning methods that can build good classifiers by using only handful of labeled data. However, the main problem of the previous approaches for time series domains is the difficulty in selecting an optimal stopping criterion. This work therefore proposes a novel stopping criterion for semi-supervised time series classification, together with an integration of Dynamic Time Warping distance measure to improve the data selection during a self training. The experimental results show that this method can build a better classifier that achieves higher classification accuracy than the previous approach. In addition, the extended proposed work is shown to have satisfactory result for multi-cluster and multi-class semi-supervised time series classifier.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Bennett, K.P., Demiriz, A.: Semi-Supervised Support Vector Machines. In: Proceedings of the 1998 Conference on Advances in Neural Information Processing Systems II (1999)
Blum, A., Lafferty, J.: Learning from Labeled and Unlabeled Data using Graph Mincuts. In: Proceedings of 18th International Conference on Machine Learning (2001)
Blum, A., Mitchell, T.: Combining Labeled and Unlabeled Data with Co-Training. In: Proceedings of 11th Annual Conference on Computational Learning Theory, Madison, Wisconsin, United States (1998)
Chapelle, O., Schölkopf, B., Zien, A.: Semi-Supervised Learning. MIT Press, Cambridge (2006)
Cohen, I., Cozman, F.G., Sebe, N., Cirelo, M.C., Huang, T.S.: Semi-Supervised Learning of Classifiers: Theory, Algorithms, and Their Application to Human-Computer Interaction. IEEE Transaction on Pattern Analysis and Machine Intelligence (2004)
Keogh, E.: The UCR Time Series Classification/Clustering Homepage (January 2008), http://www.cs.ucr.edu/~eamonn/timeseriesdata/
Li, M., Zhou, Z.H.: SETRED: Self-Training with Editing. In: Proceedings of 9th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2005) (2005)
Nigam, K., Mccallum, A.K., Thrun, S., Mitchell, T.: Machine Learning (2000)
Ratanamahatana, C.A., Keogh, E.: Everything you know about Dynamic Time Warping is wrong. In: Proceedings of 3rd Workshop on Mining Temporal and Sequential Data, In Conjunction with 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2004) (2004)
Ratanamahatana, C.A., Keogh, E.: Making Time-Series Classification More Accurate Using Learned Constraints. In: Proceedings of SIAM International Conference on Data Mining (2004)
Sakoe, H., Chiba, S.: Dynamic Programming Algorithm Optimization for Spoken Word Recognition. Morgan Kaufmann, San Francisco (1990)
Shahshahani, B.M., Landgrebe, D.A.: The Effect of Unlabeled Samples in Reducing the Small Sample Size Problem and Mitigating the Hughes Phenomenon. IEEE Transactions on Geoscience and Remote Sensing (1994)
Wei, L.: Self Training dataset (May 2007), http://www.cs.ucr.edu/~wli/selfTraining/
Wei, L., Keogh, E.: Semi-Supervised Time Series Classification. In: Proceedings 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2006)
Zhang, R., Alexander, I.R.: A New Data Selection Principle for Semi-Supervised Incremental Learning. In: Proceedings of 18th International Conference on Pattern Recognition (ICPR 2006) (2006)
Zhu, X.: Semi-Supervised Learning Literature Survey. Technical report, no.1530, Computer Sciences, University of Wisconsin-Madison (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Ratanamahatana, C.A., Wanichsan, D. (2008). Stopping Criterion Selection for Efficient Semi-supervised Time Series Classification. In: Lee, R. (eds) Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing. Studies in Computational Intelligence, vol 149. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70560-4_1
Download citation
DOI: https://doi.org/10.1007/978-3-540-70560-4_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70559-8
Online ISBN: 978-3-540-70560-4
eBook Packages: EngineeringEngineering (R0)