Skip to main content

Gaussian Filter-Based Speech Segmentation Algorithm for Gujarati Language

  • Conference paper
  • First Online:
Smart Computing Techniques and Applications

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 224))

  • 552 Accesses

Abstract

Automatic speech segmentation is a main step in speech signal production and analysis process. Great advancement in speech synthesis has already been made using concatenative algorithms. Syllable is most suitable speech unit for concatenative speech synthesis because it does not require extensive prosodic models and provide better co-articulations than other sound units. To get natural sounding, output speech segmentation plays very important role. Speech segmentation is the process of dividing speech signal in to smaller units of sound. So accurate selection of speech unit and detection of boundaries are very important. In this research work, Gujarati language is used for segmentation and database is created. This paper suggests a method of syllable segmentation to detect boundaries of syllable by means of start point of syllable and end point of syllable. Performance parameters such as accuracy and peak signal to noise ratio (PSNR) are evaluated. Producing natural sounding speech signal in different Indian languages is a very demanding and ongoing problem.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Geeta, S., Muralidhara, B.L.: Syllable as the Basic Unit for Kannada Speech Synthesis. IEEE WiSPNET 2017 Conference

    Google Scholar 

  2. Geethaa, K., Vadivel, R.: Syllable segmentation of tamil speech signals using vowel onset point and spectral transition measure. automatic control and computer sciences (2018)

    Google Scholar 

  3. Shah, N.J., Patil, H.A., Madhavi, M.C., Sailor, H.B., Patel, T.B.: Deterministic annealing EM algorithm for developing TTS system in Gujarati. 2014 IEEE

    Google Scholar 

  4. Patil, H.A., Patel, T., Talesara, S., Shah, N., Sailor, H., Vachhani, B., Akhani, J., Kanakiya, B., Gaur, Y., Prajapati, V.: Algorithms for speech segmentation at syllable-level for text-to-speech synthesis system in Gujarati. In: 2013 International Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)

    Google Scholar 

  5. Talesara, S., Patil, H.A., Patel, T., Sailor, H., Shah, N.: A novel gaussian filter-based automatic labeling of speech data for TTS system in Gujarati Language. In: 2013 International Conference on Asian Language Processing

    Google Scholar 

  6. Takamichi, S., Toda, T., Shiga, Y., Sakti, S., Neubig, G., Nakamura, S.: Parameter generation methods with rich context models for high-quality and flexible text-to-speech synthesis. IEEE J. Sel. Top. Sign. Process. 8(2) (2014)

    Google Scholar 

  7. Kiruthiga, S., Krishnamoorthy, K.: Design issues in developing speech corpus for Indian languages—a survey. In: 2012 International Conference on Computer Communication and Informatics (ICCCI-2012), Jan. 10–12, 2012, Coimbatore, India

    Google Scholar 

  8. Bellur, A., Badri Narayan, K., Raghava Krishnan, K., Murthy, H.A.: Prosody modeling for syllable-based concatenative speech synthesis of Hindi and Tamil IEEE (2011)

    Google Scholar 

  9. Shreekanth, T., Udayashankara, V., Arun Kumar, C.: An unit selection based hindi text to speech synthesis system using syllable as a basic unit. IOSR J. VLSI Sign. Process. (IOSR-JVSP) (2014)

    Google Scholar 

  10. Kurian, A.S., Narayan, B., Nagarajan Madasamy, Bellur, A., Krishan, R.: Indian language screen reader and syllable based festival text to speech synthesis system. Assoc. Comput. Linguist. 63–72 (2011)

    Google Scholar 

  11. Saraswathi, S., Geetha, T.V.: Design of language models at various phases of Tamil speech recognition system. Int. J. Eng. Sci. Technol. (2010)

    Google Scholar 

  12. Patil, H.A., Patel, T.B., Shah, N.J., Sailor, H.B., Krishnan, R., Kasthuri, G.R., Nagarajan, T., Christina, L., Kumar, N., Raghavendra, V., Kishore, S.P., Prasanna, S.R.M., Adiga, N., Ranbir Singh, S., Anand, K., Kumar, P., Chandra Singh, B., Binil Kumar, S.L., Bhadran, T.G., Sajini, T., Saha, A., Basu, T., Sreenivasa Rao, K., Narendra, N.P., Sao, A.K., Kumar, R., Talukdar, P., Acharyaa, P., Chandra, S., Lata, S., Murthy, H.A.: A syllable-based framework for unit selection synthesis in 13 Indian languages. In: 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)

    Google Scholar 

  13. Panda, S.P., Nayak, A.K.: A waveform concatenation technique for text-to-speech synthesis. Speech Technol. (2017)

    Google Scholar 

  14. Black, A.W., Zen, H., Tokuda, K.: Statistical parametric speech synthesis. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

    Google Scholar 

  15. Geethaa, K., Vadivel, R.: Syllable segmentation of tamil speech signals using vowel onset point and spectral transition measure. Autom. Control Comput. Sci. (2018)

    Google Scholar 

  16. Gangashetty, S.V., Sekhar, C.C., Yegnanarayana, B.: Spotting multilingual consonant-vowel units of speech using neural network models. In: International Conference on Nonlinear Analyses and Algorithms for Speech Processing; Lect. Notes Comput. Sci. (2005)

    Google Scholar 

  17. Fu, M., Pan, W., Hu, W., Chen, S.: Syllable segmentation of pumi speech with GABPNN based on fractal dimension and DWPTMFCC. In: 2016 International Conference on Information System and Artificial Intelligence

    Google Scholar 

  18. Kishore, S., Black, A., Kumar, R., Sangal, R.: Experiments with unit selection speech databases for Indian languages. National seminar on Language Technology Tools: Implementation of Telugu, October 2003, Hyderabad, India

    Google Scholar 

  19. Kamakshi Prasad, V., Nagarajan, T., Murthy, H.A.: Automatic segmentation of continuous speech using minimum phase group delay functions. Speech Communication 2004

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gujarathi, P.V., Patil, S.R. (2021). Gaussian Filter-Based Speech Segmentation Algorithm for Gujarati Language. In: Satapathy, S.C., Bhateja, V., Favorskaya, M.N., Adilakshmi, T. (eds) Smart Computing Techniques and Applications. Smart Innovation, Systems and Technologies, vol 224. Springer, Singapore. https://doi.org/10.1007/978-981-16-1502-3_74

Download citation

Publish with us

Policies and ethics