Combination of Features for Crosslingual Speaker Identification with the Constraint of Limited Data

Conference paper
Part of the Lecture Notes in Electrical Engineering book series (LNEE, volume 221)


Mel frequency cepstral coefficients (MFCC) has proven to be effective in speaker identification, but does not provide satisfactory performance in limited data condition. This paper presents a combination of features from different languages for Crosslingual speaker identification with the constraint of limited data. However, combined features can increase the complexity of the speaker identification system by doubling the dimensionality of the features. Frame reduction and smoothing are achieved using an adaptive weighted-sum algorithm. Experiment results show that the proposed method gives an average 11 % improved in performance over conventional MFCC method.


Frame reduction Crosslingual MFCC 



This work is supported by Visvesvraya Technological University (VTU), Belgaum-590018, Karnataka, India.


  1. 1.
    Atal BS (1976) Automatic recognition of speakers from their voices. Proc IEEE 64(4):460–475CrossRefGoogle Scholar
  2. 2.
    Halsband U (2006) Bilingual and multilingual language processing. J. Physiol Paris 99:355–369Google Scholar
  3. 3.
    Arjun PH (2005) Speaker recognition in Indian languages: A feature based approach. Ph.D. dissertation, Indian Institute of Technology, KharagpurGoogle Scholar
  4. 4.
    Nagaraja BG, Jayanna HS (2012) Mono and cross lingual speaker identification with the constraint of limited data. In: Proceedings of IEEE, PRIME-2012, Periyar University, Salem, pp 439–443Google Scholar
  5. 5.
    Durou G (1999) Multilingual text-independent speaker identification. In: Proceedings of MIST 1999 workshop, Leusden, pp 115–118Google Scholar
  6. 6.
    Arjun PH, Sitaram S, Sharma E (2009) DA-IICT cross-lingual and multilingual corpora for speaker recognition. In: Proceedings of IEEE advances in pattern recognition, Kolkata, pp 187–190Google Scholar
  7. 7.
    Jayanna HS (2009) Limited data speaker recognition. Ph.D. dissertation, Indian Institute of Technology, GuwahatiGoogle Scholar
  8. 8.
    Picone JW (1993) Signal modeling techniques in speech recognition. Proc IEEE 81(9):1215–1247CrossRefGoogle Scholar
  9. 9.
    Nuratch S, Boonpramuk P, Wutiwiwatchai C (2010) Feature smoothing and frame reduction for speaker recognition. In: Proceedings of IEEE international conference on Asian language processing, pp 311–314Google Scholar

Copyright information

© Springer India 2013

Authors and Affiliations

  1. 1.Department of Information Science and EngineeringSiddaganga Institute of TechnologyTumkurIndia

Personalised recommendations