Abstract
Automatic speech segmentation is a key step in speech signal production and analysis. Concatenative algorithms have already brought great advances in speech synthesis. The syllable is the most suitable speech unit for concatenative speech synthesis because it does not require extensive prosodic models and provides better co-articulation than other sound units. Speech segmentation, the process of dividing a speech signal into smaller units of sound, plays a very important role in obtaining natural-sounding output, so accurate selection of the speech unit and detection of its boundaries are essential. In this research work, the Gujarati language is used for segmentation, and a database is created. This paper proposes a syllable segmentation method that detects syllable boundaries by locating the start point and end point of each syllable. Performance parameters such as accuracy and peak signal-to-noise ratio (PSNR) are evaluated. Producing natural-sounding speech in different Indian languages remains a demanding and ongoing problem.
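The abstract does not spell out the algorithm, so the following is only a minimal sketch of one common way to locate segment start and end points: threshold the short-time energy after smoothing it with a Gaussian kernel. The function names, frame sizes, `sigma`, and `threshold_ratio` are illustrative assumptions, not the authors' settings, and the PSNR helper uses the generic definition (peak of the reference signal over mean squared error), which may differ from the one used in the paper.

```python
import numpy as np

def short_time_energy(signal, frame_len, hop):
    """Sum of squared samples in each frame (frames start every `hop` samples)."""
    n_frames = max(0, 1 + (len(signal) - frame_len) // hop)
    return np.array([np.sum(signal[i * hop:i * hop + frame_len] ** 2)
                     for i in range(n_frames)])

def gaussian_smooth(x, sigma):
    """Convolve x with a normalized Gaussian kernel truncated at 3 sigma."""
    radius = int(3 * sigma)
    t = np.arange(-radius, radius + 1)
    kernel = np.exp(-0.5 * (t / sigma) ** 2)
    kernel /= kernel.sum()
    return np.convolve(x, kernel, mode="same")

def segment_boundaries(signal, sr, frame_ms=20, hop_ms=10,
                       sigma=3.0, threshold_ratio=0.1):
    """Return (start_s, end_s) pairs where smoothed energy exceeds a threshold.

    All parameter defaults are assumptions chosen for illustration.
    """
    frame_len = int(sr * frame_ms / 1000)
    hop = int(sr * hop_ms / 1000)
    energy = short_time_energy(np.asarray(signal, dtype=float), frame_len, hop)
    if energy.size == 0 or energy.max() == 0:
        return []
    smoothed = gaussian_smooth(energy, sigma)
    active = smoothed > threshold_ratio * smoothed.max()
    # Pad with False so segments touching either edge still produce a transition.
    padded = np.concatenate(([False], active, [False])).astype(int)
    diff = np.diff(padded)
    starts = np.flatnonzero(diff == 1)   # first active frame of each segment
    ends = np.flatnonzero(diff == -1)    # first inactive frame after each segment
    return [(s * hop / sr, e * hop / sr) for s, e in zip(starts, ends)]

def psnr(reference, estimate):
    """Peak signal-to-noise ratio in dB, using the peak of `reference`."""
    reference = np.asarray(reference, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    mse = np.mean((reference - estimate) ** 2)
    if mse == 0:
        return float("inf")
    peak = np.max(np.abs(reference))
    return 10 * np.log10(peak ** 2 / mse)
```

Calling `segment_boundaries(signal, sr)` on a mono waveform returns a list of start and end times in seconds. Because the Gaussian smoothing widens each energy burst, the detected boundaries sit slightly outside the true onsets and offsets, and a larger `sigma` trades boundary precision for robustness to short dips in energy.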
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Gujarathi, P.V., Patil, S.R. (2021). Gaussian Filter-Based Speech Segmentation Algorithm for Gujarati Language. In: Satapathy, S.C., Bhateja, V., Favorskaya, M.N., Adilakshmi, T. (eds) Smart Computing Techniques and Applications. Smart Innovation, Systems and Technologies, vol 224. Springer, Singapore. https://doi.org/10.1007/978-981-16-1502-3_74
DOI: https://doi.org/10.1007/978-981-16-1502-3_74
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-1501-6
Online ISBN: 978-981-16-1502-3
eBook Packages: Intelligent Technologies and Robotics (R0)