Skip to main content

A Fishervoice-SVM Language Identification System

  • Conference paper
Computational Processing of the Portuguese Language (PROPOR 2012)

Abstract

In this paper, a language identification system is described that implements the Fishervoice approach in order to reduce the dimensionality of the data. Fishervoice performs two-dimensional Principal Component Analysis (2D-PCA) and Linear Discriminant Analysis (LDA) to project the data into a discriminative subspace. After this transformation the speech utterances are transformed into supervectors and classified by means of a Support Vector Machine (SVM). Experiments performed on KALAKA-2 database, which includes speech in Spanish, Catalan, English, Basque, Galician and Portuguese, show that the Fishervoice-SVM system achieves good identification results while reducing dramatically the number of features needed to represent the speech utterances.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abad, A., Koller, O., Trancoso, I.: The L2F Language Verification Systems for Albayzin-2010 Evaluation. In: Proceedings of FALA 2010 - VI Jornadas en Tecnología Del Habla and II Iberian SLTech Workshop, pp. 383–388 (2010)

    Google Scholar 

  2. Anthony, G., Gregg, H., Tshilidzi, M.: Image Classification Using SVMs: One-Against-One Vs One-Against-All. In: Proceedings of the 28th Asian Conference on Remote Sensing (2007)

    Google Scholar 

  3. Burges, C.J.C.: A Tutorial on Support Vector Machines for Pattern Recognition. Data Mining and Knowledge Discovery 2, 121–167 (1998)

    Article  Google Scholar 

  4. Castaldo, F., Colibro, D., Cumani, S., Dalmasso, E., Laface, P., Vair, C.: Loquendo-Politecnico di Torino System for the 2009 NIST Language Recognition Evaluation. In: Proceedings of ICASSP, pp. 5002–5005 (2010)

    Google Scholar 

  5. Chang, C.-C., Lin, C.-J.: LIBSVM: a Library for Support Vector Machines. ACM Transactions on Intelligent Systems and Technology 2(3), article 27 (2011), http://www.csie.ntu.edu.tw/~cjlin/libsvm

  6. Crystal, D.: The Cambridge Encyclopedia of the English Language, pp. 6–8. Cambridge University Press, Cambridge (2003)

    Google Scholar 

  7. Hazen, T.J., Hetherington, I.L., Park, A.: FST-Based Recognition Techniques for Multi-Lingual and Multi-Domain Spontaneous Speech. In: Proceedings of the European Conference on Speech Communication and Technology (2001)

    Google Scholar 

  8. Jing, X.Y., Wong, H.S., Zhang, D.: Face Recognition Based on 2D Fisherface Approach. Pattern Recognition 39(4), 707–710 (2006)

    Article  MATH  Google Scholar 

  9. KALAKA-2. Speech database created for the Albayzin, Language Recognition Evaluation, organized by the Spanish Network on Speech Technology. Produced by the Software Technologies Working Group (GTTS ), University of the Basque Country (2010), http://gtts.ehu.es

  10. Kovács, G., Tóth, L.: Phone Recognition Experiments with 2D-DCT Spectro-Temporal Features. In: 6th IEEE International Symposium on Applied Computational Intelligence and Informatics, pp. 143–146 (2011)

    Google Scholar 

  11. Lapesa, R.: Historia de la Lengua Española. Escelicer, Gredos (1981)

    Google Scholar 

  12. Lopez-Otero, P., Docio-Fernandez, L., Garcia-Mateo, C.: A Fishervoice-based Speaker Identification System. In: Proceedings of FALA 2010 - VI Jornadas en Tecnología del Habla and II Iberian SLTech Workshop, pp. 139–142 (2010)

    Google Scholar 

  13. Lopez-Otero, P., Docio-Fernandez, L., Garcia-Mateo, C.: The UVigo-GTM Language Verification Systems for the Albayzin 2010 Evaluation. In: Proceedings of FALA 2010 - VI Jornadas en Tecnología del Habla and II Iberian SLTech Workshop, pp. 389–392 (2010)

    Google Scholar 

  14. Martin, A., Greenberg, C.: The 2009 NIST Language Recognition Evaluation. In: Odyssey 2010 - The Speaker and Language Recognition Workshop, paper 030 (2010)

    Google Scholar 

  15. Martínez, D., Villalba, J., Miguel, A., Ortega, A., Lleida, E.: ViVoLab UZ Language Recognition System for Albayzin 2010 LRE. In: Proceedings of FALA 2010 - VI Jornadas en Tecnología del Habla and II Iberian SLTech Workshop, pp. 375–376 (2010)

    Google Scholar 

  16. Matějka, P., Schwarz, P., Černocký, J., Chytil, P.: Phonotactic Language Identification using High Quality Phoneme Recognition. In: Proceedings of Interspeech, pp. 2237–2240 (2005)

    Google Scholar 

  17. Rodriguez-Fuentes, L.J., Penagarikano, M., Varona, A., Diez, M., Bordel, G.: Overview of the Albayzin 2010 Language Recognition Evaluation: Database Design, Evaluation Plan and Preliminary Analysis of Results. In: Proceedings of VI Jornadas en Technologa del Habla and II Iberian SLTech Workshop, pp. 309–316 (2010)

    Google Scholar 

  18. Saeidi, R., Soufifar, M., Kinnunen, T., Svendsen, T., Fränti, P.: UEF-NTNU System Description for Albayzin 2010 Language Recognition Evaluation. In: Proceedings of FALA 2010 - VI Jornadas en Tecnología del Habla and II Iberian SLTech Workshop, pp. 377–382 (2010)

    Google Scholar 

  19. Torres-Carrasquillo, P.A., Singer, E., Kohler, M.A., Green, R.J., Reynolds, D.A., Deller Jr, J.R.: Approaches to Language Identification Using Gaussian Mixture Models and Shifted Delta Cepstral Features. In: Proceedings of ICSLP, pp. 89–92 (2002)

    Google Scholar 

  20. Woehrling, C., de Mareil, P.B., Adda-Decker, M.: Linguistically-Motivated Automatic Classification of Regional French Varieties. In: Proceedings of Interspeech, pp. 2183–2186 (2009)

    Google Scholar 

  21. Wong, E., Pelenacos, J., Myers, S., Sridharan, S.: Language Identification Using Efficient Gaussian Mixture Model Analysis. In: Proceedings of Australian International Conference on Speech Science and Technology, pp. 300–305 (2000)

    Google Scholar 

  22. Zissman, M.A., Berkling, K.M.: Automatic Language Identification. In: Proceedings of the ESCA-NATO Workshop on Multi-Lingual Interoperability in Speech Technology, MIST (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lopez-Otero, P., Docio-Fernandez, L., Garcia-Mateo, C. (2012). A Fishervoice-SVM Language Identification System. In: Caseli, H., Villavicencio, A., Teixeira, A., Perdigão, F. (eds) Computational Processing of the Portuguese Language. PROPOR 2012. Lecture Notes in Computer Science(), vol 7243. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28885-2_43

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-28885-2_43

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-28884-5

  • Online ISBN: 978-3-642-28885-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics