International Conference on Speech and Computer

SPECOM 2015: Speech and Computer pp 389-396 | Cite as

Speaker Identification Using Semi-supervised Learning

  • Nikos Fazakis
  • Stamatis Karlos
  • Sotiris Kotsiantis
  • Kyriakos Sgarbas
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9319)

Abstract

Semi-supervised classification methods use available unlabeled data, along with a small set of labeled examples, to increase the classification accuracy in comparison with training a supervised method using only the labeled data. In this work, a new semi-supervised method for speaker identification is presented. We present a comparison with other well-known semi-supervised and supervised classification methods on benchmark datasets and verify that the presented technique exhibits better accuracy in most cases.

Keywords

Semi-supervised learning Speaker identification Classification using labeled Unlabeled data 

References

  1. 1.
    Khaled, D.: Wavelet entropy and neural network for text-independent speaker identification. Engg. Appl. Artif. Intell. 24, 796–802 (2011)CrossRefGoogle Scholar
  2. 2.
    Grimaldi, M., Cummins, F.: Speaker identification using instantaneous frequencies. IEEE TASLP 16(6), 1097–1111 (2008)Google Scholar
  3. 3.
    Manikandan, J., Venkataramani, B.: Design of a real time automatic speech recognition system using modified one against all SVM classifier. Microproc. Microsyst. 35(6), 568–578 (2011)CrossRefGoogle Scholar
  4. 4.
    Dileep, A., Chandra, C.: Speaker recognition using pyramid match kernel based support vector machines. Int. J. Speech Technol. 15(3), 365–379 (2012)CrossRefGoogle Scholar
  5. 5.
    Friedhelm, S., Edmondo, T.: Pattern classification and clustering: a review of partially supervised learning approaches. Pattern Recogn. Lett. 37, 4–14 (2014)CrossRefGoogle Scholar
  6. 6.
    Lan, Y., Hu, Z., Soh, Y.C., Huang, G.-B.: An extreme learning machine approach for speaker recognition. Neural Comput. Appl. 22(3–4), 417–425 (2013)CrossRefGoogle Scholar
  7. 7.
    Zhi-Hua, Z., Li, M.: Tri-training: exploiting unlabeled data using three classifiers. IEEE TKDE 17(11), 1529–1541 (2005)Google Scholar
  8. 8.
    Chapelle, O., Schlkopf, B., Zien, A.: Semi-supervised learning. MIT Press, Cambridge (2006)CrossRefGoogle Scholar
  9. 9.
    Shuang, W., Linsheng, W., Licheng, J., Hongying, L.: Improve the performance of co-training by committee with refinement of class probability estimations. Neurocomputing 136, 30–40 (2014)CrossRefGoogle Scholar
  10. 10.
    Xu, J., He, H., Man, H.: DCPE co-training for classification. Neurocomputing 86, 75–85 (2012)CrossRefGoogle Scholar
  11. 11.
    Li, M., Zhou, Z.: Improve computer-aided diagnosis with machine learning techniques using undiagnosed samples. IEEE TSMC 37, 1088–1098 (2007)Google Scholar
  12. 12.
    Hady, M., Schwenker, F.: Co-training by committee: a new semi-supervised learning framework, In: Proceedings of the IEEE International Conference on Data Mining Workshops, pp. 563–572 (2008)Google Scholar
  13. 13.
    Zhou, Y., Goldman, S.: Democratic co-learning. In: 16th IEEE International Conference on Tools with Artificial Intelligence (ICTAI’04), pp. 594–202 (2004)Google Scholar
  14. 14.
    Hyon, S., Dang, J., Feng, H., Wang, H., Honda, K.: Detection of speaker individual information using a phoneme effect suppression method. Speech Commun. 57, 87–100 (2014)CrossRefGoogle Scholar
  15. 15.
    Sun, S.: A survey of multi-view machine learning. Neural Comput. Appl. 23(7–8), 2031–2038 (2013)CrossRefGoogle Scholar
  16. 16.
    Deng, C., Guo, M.Z.: A new co-training-style random forest for computer aided diagnosis. J. Intell. Inf. Syst. 36, 253–281 (2011)CrossRefGoogle Scholar
  17. 17.
    Wang, J., Luo, S., Zeng, X.: A random subspace method for co-training. In: IEEE International Joint Conference on Computational Intelligence, pp. 195–200 (2008)Google Scholar
  18. 18.
    Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, Cambridge (2000)CrossRefGoogle Scholar
  19. 19.
    Susan, S., Sharma, S.: A fuzzy nearest neighbor classifier for speaker identification. In: 4th International Conference on Computational Intelligence and Communication Networks, CICN 2012, pp. 842–845 (2012)Google Scholar
  20. 20.
    Jiang, Z., Zhang, S., Zeng, J.: A hybrid generative/discriminative method for semi-supervised classification. Knowl.-Based Syst. 37, 137–145 (2013)CrossRefGoogle Scholar
  21. 21.
    Didaci, L., Fumera, G., Roli, F.: Analysis of co-training algorithm with very small training sets. In: Gimel’farb, G., Hancock, E., Imiya, A., Kuijper, A., Kudo, M., Omachi, S., Windeatt, T., Yamada, K. (eds.) SSPR &SPR 2012, vol. 7626, pp. 719–726. Springer, Berlin (2012)CrossRefGoogle Scholar
  22. 22.
    Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.: The WEKA data mining software: an update. SIGKDD Explor. 11(1), 10–18 (2009)CrossRefGoogle Scholar
  23. 23.
    Frank, E., Hall, M., Pfahringer, B.: Locally weighted naive Bayes. In: 19th Conference on Uncertainty in Artificial Intelligence. Mexico (2003)Google Scholar
  24. 24.
    Domingos, P., Pazzani, M.: On the optimality of the simple Bayesian classifier under zero-one loss. Mach. Learn. 29, 103–130 (1997)CrossRefMATHGoogle Scholar
  25. 25.
    Du, J., Ling, C.X., Zhou, Z.-H.: When does cotraining work in real data? IEEE TKDE 23(5), 788–799 (2011)Google Scholar
  26. 26.
    Goldberg, X.: Introduction to semi-supervised learning. In: Synthesis Lectures on Artificial Intelligence and Machine Learning. Morgan Claypool (2009)Google Scholar
  27. 27.
    Bourouba, H., Korba, C.A., Djemili, R.: Novel approach in speaker identification using SVM and GMM. Control Engg. Appl. Inf. 15(3), 87–95 (2013)Google Scholar
  28. 28.
    Pal, A., Bose, S., Basak, G.K., Mukhopadhyay, A.: Speaker identification by aggregating Gaussian mixture models (GMMs) based on uncorrelated MFCC-derived features. Int. J. Pattern Recogn. Artif. Intell. 28(4), 25 (2014)Google Scholar
  29. 29.
    Zhao, X., Wang, Y., Wang, D.: Robust speaker identification in noisy and reverberant conditions. IEEE TASLP 22(4), 836–845 (2014)Google Scholar
  30. 30.
    Alcal-Fdez, J., Fernandez, A., Luengo, J., Derrac, J., Garca, S., Snchez, L.: KEEL data-mining software tool: data set repository, integration of algorithms and experimental analysis framework. J. Multi.-Valued Logic Soft Comput. 17(2–3), 255–287 (2011)Google Scholar
  31. 31.
    Triguero, I., Garca, S., Herrera, F.: Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study. Knowl. Inf. Syst. 42(2), 245–284 (2015)CrossRefGoogle Scholar
  32. 32.
    Namrata, D.: Feature extraction methods LPC, PLP and MFCC in speech recognition. Int. J. Adv. Res. Engg Technol. 1(6), 1–4 (2013)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Nikos Fazakis
    • 1
  • Stamatis Karlos
    • 1
  • Sotiris Kotsiantis
    • 1
  • Kyriakos Sgarbas
    • 1
  1. 1.University of PatrasPatrasGreece

Personalised recommendations