Clustering Based Under-Sampling for Improving Speaker Verification Decisions Using AdaBoost

Altınçay, Hakan; Ergün, Cem

doi:10.1007/978-3-540-27868-9_76

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3138)

Included in the following conference series:

Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR)

Abstract

The class imbalance problem arises naturally in classification problems where far fewer training samples are available for one class than for another. Random sampling based methods are generally used to deal with this problem. This paper proposes a clustering based sampling technique that selects a subset of the majority class, which contains a much larger amount of training data. The proposed approach is evaluated by designing a post-classifier using AdaBoost to improve speaker verification decisions. Experiments conducted on the NIST99 speaker verification corpus show that, in general, the proposed sampling technique provides better equal error rates (EER) than random sampling.
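
The abstract does not say which clustering algorithm is used or how samples are drawn from the clusters, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes plain k-means over the majority class and keeps, for each cluster, the real sample closest to the centroid. The function names (`kmeans`, `cluster_undersample`) and the parameter `n_keep` are hypothetical, and the AdaBoost post-classifier is not shown.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def assign(points, centroids):
    """Group points by the index of their nearest centroid."""
    clusters = [[] for _ in centroids]
    for p in points:
        nearest = min(range(len(centroids)),
                      key=lambda i: squared_distance(p, centroids[i]))
        clusters[nearest].append(p)
    return clusters


def kmeans(points, k, iterations=20, seed=0):
    """Plain k-means (an assumption; the paper may cluster differently)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct samples
    for _ in range(iterations):
        clusters = assign(points, centroids)
        updated = [
            tuple(sum(col) / len(members) for col in zip(*members))
            if members else centroids[i]  # keep old centroid if cluster is empty
            for i, members in enumerate(clusters)
        ]
        if updated == centroids:
            break
        centroids = updated
    return centroids, assign(points, centroids)


def cluster_undersample(majority, n_keep, seed=0):
    """Return at most n_keep majority samples, one per non-empty cluster."""
    centroids, clusters = kmeans(majority, n_keep, seed=seed)
    return [
        min(members, key=lambda p: squared_distance(p, c))
        for c, members in zip(centroids, clusters)
        if members
    ]


if __name__ == "__main__":
    rng = random.Random(1)
    # Synthetic 2-D majority class; real inputs would be verification scores
    # or features, which the abstract does not describe.
    majority = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(200)]
    subset = cluster_undersample(majority, n_keep=10)
    print(len(subset), "majority samples kept out of", len(majority))
```

Compared with drawing `n_keep` samples uniformly at random, a selection like this spreads the retained samples across the regions the majority class occupies, which is one plausible reason a clustering based subset could help a downstream classifier.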

Author information

Authors and Affiliations

Advanced Technology Research and Development Institute, Eastern Mediterranean University, Gazi Mağusa KKTC, Mersin 10, Turkey
Hakan Altınçay & Cem Ergün

Editor information

Editors and Affiliations

Instituto Superior Técnico, Instituto de Telecomunicações, Lisbon, Portugal
Ana Fred
RSISE, the Australian National University, ACT 0200, Canberra, Australia
Terry M. Caelli
Information and Communication Theory Group, Delft University of Technology, P.O. Box 5031, 2600GA, Delft, The Netherlands
Robert P. W. Duin
FEUP - Faculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, 4200-465, Porto, Portugal
Aurélio C. Campilho
Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, Information and Communication Theory Group, Delft, The Netherlands
Dick de Ridder

About this paper

Cite this paper

Altınçay, H., Ergün, C. (2004). Clustering Based Under-Sampling for Improving Speaker Verification Decisions Using AdaBoost. In: Fred, A., Caelli, T.M., Duin, R.P.W., Campilho, A.C., de Ridder, D. (eds) Structural, Syntactic, and Statistical Pattern Recognition. SSPR/SPR 2004. Lecture Notes in Computer Science, vol 3138. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27868-9_76

DOI: https://doi.org/10.1007/978-3-540-27868-9_76
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22570-6
Online ISBN: 978-3-540-27868-9
eBook Packages: Springer Book Archive
