Gaussian Filter-Based Speech Segmentation Algorithm for Gujarati Language

Gujarathi, Priyanka Vishwas; Patil, Sandip Raosaheb

doi:10.1007/978-981-16-1502-3_74

Priyanka Vishwas Gujarathi⁷ &
Sandip Raosaheb Patil⁸

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 224))

552 Accesses

Abstract

Automatic speech segmentation is a main step in speech signal production and analysis process. Great advancement in speech synthesis has already been made using concatenative algorithms. Syllable is most suitable speech unit for concatenative speech synthesis because it does not require extensive prosodic models and provide better co-articulations than other sound units. To get natural sounding, output speech segmentation plays very important role. Speech segmentation is the process of dividing speech signal in to smaller units of sound. So accurate selection of speech unit and detection of boundaries are very important. In this research work, Gujarati language is used for segmentation and database is created. This paper suggests a method of syllable segmentation to detect boundaries of syllable by means of start point of syllable and end point of syllable. Performance parameters such as accuracy and peak signal to noise ratio (PSNR) are evaluated. Producing natural sounding speech signal in different Indian languages is a very demanding and ongoing problem.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Geeta, S., Muralidhara, B.L.: Syllable as the Basic Unit for Kannada Speech Synthesis. IEEE WiSPNET 2017 Conference
Google Scholar
Geethaa, K., Vadivel, R.: Syllable segmentation of tamil speech signals using vowel onset point and spectral transition measure. automatic control and computer sciences (2018)
Google Scholar
Shah, N.J., Patil, H.A., Madhavi, M.C., Sailor, H.B., Patel, T.B.: Deterministic annealing EM algorithm for developing TTS system in Gujarati. 2014 IEEE
Google Scholar
Patil, H.A., Patel, T., Talesara, S., Shah, N., Sailor, H., Vachhani, B., Akhani, J., Kanakiya, B., Gaur, Y., Prajapati, V.: Algorithms for speech segmentation at syllable-level for text-to-speech synthesis system in Gujarati. In: 2013 International Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)
Google Scholar
Talesara, S., Patil, H.A., Patel, T., Sailor, H., Shah, N.: A novel gaussian filter-based automatic labeling of speech data for TTS system in Gujarati Language. In: 2013 International Conference on Asian Language Processing
Google Scholar
Takamichi, S., Toda, T., Shiga, Y., Sakti, S., Neubig, G., Nakamura, S.: Parameter generation methods with rich context models for high-quality and flexible text-to-speech synthesis. IEEE J. Sel. Top. Sign. Process. 8(2) (2014)
Google Scholar
Kiruthiga, S., Krishnamoorthy, K.: Design issues in developing speech corpus for Indian languages—a survey. In: 2012 International Conference on Computer Communication and Informatics (ICCCI-2012), Jan. 10–12, 2012, Coimbatore, India
Google Scholar
Bellur, A., Badri Narayan, K., Raghava Krishnan, K., Murthy, H.A.: Prosody modeling for syllable-based concatenative speech synthesis of Hindi and Tamil IEEE (2011)
Google Scholar
Shreekanth, T., Udayashankara, V., Arun Kumar, C.: An unit selection based hindi text to speech synthesis system using syllable as a basic unit. IOSR J. VLSI Sign. Process. (IOSR-JVSP) (2014)
Google Scholar
Kurian, A.S., Narayan, B., Nagarajan Madasamy, Bellur, A., Krishan, R.: Indian language screen reader and syllable based festival text to speech synthesis system. Assoc. Comput. Linguist. 63–72 (2011)
Google Scholar
Saraswathi, S., Geetha, T.V.: Design of language models at various phases of Tamil speech recognition system. Int. J. Eng. Sci. Technol. (2010)
Google Scholar
Patil, H.A., Patel, T.B., Shah, N.J., Sailor, H.B., Krishnan, R., Kasthuri, G.R., Nagarajan, T., Christina, L., Kumar, N., Raghavendra, V., Kishore, S.P., Prasanna, S.R.M., Adiga, N., Ranbir Singh, S., Anand, K., Kumar, P., Chandra Singh, B., Binil Kumar, S.L., Bhadran, T.G., Sajini, T., Saha, A., Basu, T., Sreenivasa Rao, K., Narendra, N.P., Sao, A.K., Kumar, R., Talukdar, P., Acharyaa, P., Chandra, S., Lata, S., Murthy, H.A.: A syllable-based framework for unit selection synthesis in 13 Indian languages. In: 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)
Google Scholar
Panda, S.P., Nayak, A.K.: A waveform concatenation technique for text-to-speech synthesis. Speech Technol. (2017)
Google Scholar
Black, A.W., Zen, H., Tokuda, K.: Statistical parametric speech synthesis. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Google Scholar
Geethaa, K., Vadivel, R.: Syllable segmentation of tamil speech signals using vowel onset point and spectral transition measure. Autom. Control Comput. Sci. (2018)
Google Scholar
Gangashetty, S.V., Sekhar, C.C., Yegnanarayana, B.: Spotting multilingual consonant-vowel units of speech using neural network models. In: International Conference on Nonlinear Analyses and Algorithms for Speech Processing; Lect. Notes Comput. Sci. (2005)
Google Scholar
Fu, M., Pan, W., Hu, W., Chen, S.: Syllable segmentation of pumi speech with GABPNN based on fractal dimension and DWPTMFCC. In: 2016 International Conference on Information System and Artificial Intelligence
Google Scholar
Kishore, S., Black, A., Kumar, R., Sangal, R.: Experiments with unit selection speech databases for Indian languages. National seminar on Language Technology Tools: Implementation of Telugu, October 2003, Hyderabad, India
Google Scholar
Kamakshi Prasad, V., Nagarajan, T., Murthy, H.A.: Automatic segmentation of continuous speech using minimum phase group delay functions. Speech Communication 2004
Google Scholar

Download references

Author information

Authors and Affiliations

JSPM’s Rajarshi Shahu College of Engineering, Tathawade, Pune, 411033, India
Priyanka Vishwas Gujarathi
Bharati Vidyapeeth’s College of Engineering for Women, Dhankawadi, Pune, 411043, India
Sandip Raosaheb Patil

Authors

Priyanka Vishwas Gujarathi
View author publications
You can also search for this author in PubMed Google Scholar
Sandip Raosaheb Patil
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Engineering, KIIT University, Bhubaneswar, Odisha, India
Suresh Chandra Satapathy
Department of Electronics and Communication Engineering, Shri Ramswaroop Memorial Group of Professional Colleges (SRMGPC), Lucknow, Uttar Pradesh, India
Vikrant Bhateja
Informatics and Computer Techniques, Reshetnev Siberian State University of Science and Technologies, Krasnoyarsk, Russia
Margarita N. Favorskaya
Department of Computer Science and Engineering, Vasavi College of Engineering, Hyderabad, India
T. Adilakshmi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gujarathi, P.V., Patil, S.R. (2021). Gaussian Filter-Based Speech Segmentation Algorithm for Gujarati Language. In: Satapathy, S.C., Bhateja, V., Favorskaya, M.N., Adilakshmi, T. (eds) Smart Computing Techniques and Applications. Smart Innovation, Systems and Technologies, vol 224. Springer, Singapore. https://doi.org/10.1007/978-981-16-1502-3_74

Download citation

DOI: https://doi.org/10.1007/978-981-16-1502-3_74
Published: 14 July 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-1501-6
Online ISBN: 978-981-16-1502-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics