Advertisement

A Reverse Nearest Neighbor Based Active Semi-supervised Learning Method for Multivariate Time Series Classification

  • Yifei Li
  • Guoliang HeEmail author
  • Xuewen Xia
  • Yuanxiang Li
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9827)

Abstract

Time series widely exist in many areas. In reality, the number of labeled time series data is often small and there is a huge number of unlabeled data. Manually labeling these unlabeled examples is time-consuming and expensive, and sometimes it is even impossible. To reduce manual cost and obtain high confident labeled training data for multivariate time series classification, in this paper a reverse nearest neighbor based active semi-supervised learning method is proposed. First, based on information entropy and distribution density of the training data, a sampling strategy is introduced to select the most informative examples for manual annotation. Second, in terms of the newly labeled example by experts, a reverse nearest neighbor based semi-supervised learning method is presented to automatically and accurately label some confident examples. We evaluate our work with a comprehensive set of experiments on diverse multivariate time series data. Experimental results show that our approach can obtain a confident labeled training data with less manual cost.

Keywords

Multivariate time series Active learning Semi-supervised learning 

References

  1. 1.
    Wei, L., Keogh, E.: Semi-supervised time series classification. In: KDD (2006)Google Scholar
  2. 2.
    Begum, N., Hu, B., Rakthanmanon, T., Keogh, E.: Towards a minimum description length based stopping criterion for semi-supervised time series classification. In: IEEE IRI (2013)Google Scholar
  3. 3.
    Nguyen, M.N., Li, X.-L., Ng, S.-K.: Positive unlabeled learning for time series classification. In: IJCAI (2011)Google Scholar
  4. 4.
    Nguyen, M.N., Li, X.-L., Ng, S.-K.: Ensemble based positive unlabeled learning for time series classification. In: Lee, S.-g., Peng, Z., Zhou, X., Moon, Y.-S., Unland, R., Yoo, J. (eds.) DASFAA 2012, Part I. LNCS, vol. 7238, pp. 243–257. Springer, Heidelberg (2012)CrossRefGoogle Scholar
  5. 5.
    Mabel, G., Chrisoph, B., Isaac, T., Taneet, R., Kosé, B.: On the stopping criteria for k-nearest neighbor in positive unlabeled time series classification problems. Inf. Sci. 328, 42–59 (2016)CrossRefGoogle Scholar
  6. 6.
    He, G., Duan, Y., Peng, R., Jing, X., Qian, T., Wang, L.: Early classification on multivariate time series. Neurocomputing 149, 777–787 (2015)CrossRefGoogle Scholar
  7. 7.
    Yifan, F., Zhu, X., Li, B.: A survey on instance selection for active learning. Knowl. Inf. Sys. 35, 249–283 (2013)CrossRefGoogle Scholar
  8. 8.
    Settles, B.: Active learning literature survey. Computer Sciences Technical report, University of Wisconsin–Madison, 26 January 2010Google Scholar
  9. 9.
    Guo, H., Wang, W.: An active learning-based SVM multi-class classification model. Pattern Recogn. 48(5), 1577–1597 (2015)CrossRefGoogle Scholar
  10. 10.
    Huang, S.-J., Jinm, R., Zhou, Z.: Active learning by querying informative and representative examples. IEEE Trans. Pattern Anal. Mach. Intell. 36(10), 1936–1949 (2014)CrossRefGoogle Scholar
  11. 11.
    Hady, M.F.A., Schwenker, F.: Combing committee-based semi-supervised learning and active learning. J. Comput. Sci. Technol. 25(4), 681–698 (2010)CrossRefGoogle Scholar
  12. 12.
    Seung, H., Opper, M., Sompolinsky, H.: Query by committee. In: ACM Workshop on Computational Learning Theory, pp. 287–294 (1992)Google Scholar
  13. 13.
    He, G., Duan, Y., Li, Y., Qian, T., He, J., Jia, X.: Active learning for multivariate time series classification with positive unlabeled data. In: ICTAI (2015)Google Scholar
  14. 14.
    Zhu, J., Wang, H., Tsou, B.K., Ma, M.: Active learning with sampling by uncertainty and density for data annotations. IEEE Trans. Audio Speech Lang. Process. 18(6), 1323–1331 (2010)CrossRefGoogle Scholar
  15. 15.
    Huang, H., He, Q., He, J., Ma, L.: RADRA: rare category detection iva computation of boundary degree. In: PAKDD (2011)Google Scholar
  16. 16.
    Xia, C., Hsu, W., Lee, M.L., Ooi, B.C.: BORDER: efficient computation of boundary points. IEEE Trans. Knowl. Data Eng. 18(3), 289–303 (2006)CrossRefGoogle Scholar
  17. 17.
  18. 18.
  19. 19.

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  • Yifei Li
    • 1
  • Guoliang He
    • 1
    • 2
    Email author
  • Xuewen Xia
    • 3
  • Yuanxiang Li
    • 1
    • 2
  1. 1.State Key Laboratory of Software EngineeringWuhan UniversityWuhanChina
  2. 2.College of Computer ScienceWuhan UniversityWuhanChina
  3. 3.School of SoftwareEast China Jiaotong UniversityNanchangChina

Personalised recommendations