Abstract
Public address system can provide useful information for the audience, which are especially important at the railway station or the subway station. Effective public address can offer help to evacuate people at these places.
You have full access to this open access chapter, Download conference paper PDF
Similar content being viewed by others
1 Introduction
Public address system can provide useful information for the audience, which are especially important at the railway station or the subway station. Effective public address can offer help to evacuate people at these places.
At present, most speech recognition researches are to predict and assess effects of hearing devices, to do relevant medical legal identification and evaluate the injury degree with low hearing function. In China, large public places basically broadcast through functional address system. It is very important for the public barrier free design to try to take the old people into account.
2 Experimental Method
2.1 Experiment Design
The experiment is conducted in the anechoic room of ergonomics laboratory. The audio files are the length of each sentence is about 4 s, the fade in and out time of each sentence is about 0.15 s and the interval of word front and back is 0.2 s and 0.3 s respectively.
44 subjects were divided into the old group and the young group. The young group includes 21 subjects with the average age of 25.4, and the old group includes 22 subjects with the average of 62.3.
2.2 Experimental Method
The speech recognition experiment adopts the method of subjective assessment of the public sound system [1]. The experiment is divided into two tests. The first one is the hearing level, and after that is the speech recognition test.
The hearing level test is to measure the hearing threshold of subjects to guarantee that their hearing belongs to the normal range of their age (Table 1).
3 Experiment Results and Discussion
3.1 Hearing Level
The hearing level test showed that the hearing attenuation of Chinese young group and the old group have no significant frequency differences with the data according to the international standards of ISO 7029 [2], except in the frequency of 1000 Hz and 4000 Hz for the old group.
3.2 Speech Recognition Threshold
Under the same 50 dB environmental noise, both for the young group and for the old group, a single factor analysis was conducted and there was a significant difference between the two kinds of voices for the two age groups. It is showed in Figs. 1 and 2 that under the same noisy environment, the threshold value of male voice is lower than that of female voice in the both young and old groups. It can be seen that threshold value for the old group for the male voice is lower than to the female voice.
Under the same 50 dB environmental noise, the average speech recognition threshold value for the young group to the male voice is 43 dB and the signal noise ratio is −7 dB while the average value to the female voice is 45 dB and the signal noise ratio is −5 dB. The average speech recognition threshold value for the old group to the male voice is 48 dB and the signal noise ratio is −2 dB while the average value to the female voice is 52 dB and the signal noise ratio is 2 dB. So under the same noisy environment, the signal noise ratio should be at least 2 dB to make sure that more than 50 % people can identify the language correctly.
3.3 The Relationship Between Speech Recognition Score and the Signal Noise Ratio
The speech recognition score refers to the percentage of the speech signals heard and understood by subjects. The hearing loss caused by the age influences the speech recognition score, which is a function of the signal noise ration.
From the Fig. 3, it could be found that when the signal noise ratio reaches 4 dB, the accuracy of the young group to words recognition exceeds 90 % and has no obvious change. When the signal noise ratio is under −4 dB, the difficulty of target words is over 80 % When the signal noise ratio reaches 8 dB and F(2, 35) = 2.111, there is no significant difference in statistics for the young group.
From the Fig. 4, it could be found that when the signal noise ratio of the old group reaches over 8 dB, their accuracy to words recognition is over 90 % and has no significant change. For the old group, when the signal noise ratio reaches 12 dB and F(2, 37) = 2.408, there is no significant difference in statistics.
When the environmental noise is 50 dB, in order to meet the requirements of speech recognition of the young people and to let the old people gain better hearing effects, in the sound pressure level design of audio equipment the signal noise ratio should reach 12 dB to make sure that most people can hear harmonious and beautiful sound level.
References
Kobayashi, M., Morimoto, M., Sato, H., Sato, H.: Optimum speech level to minimize listening difficulty in public spaces. J. Acoust. Soc. Am. 121(1), 251–256 (2007)
ISO 7029:2000: Acoustics statistical distribution of hearing thresholds as a function of age (2000)
Acknowledgment
This work is supported by the National Key Technology R&D Program (project number: 2012BAK28B03-2) and China National Institute of Standardization through the “special funds for the basic R&D undertakings by welfare research institutions” (project number: 522013Y-3055).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Ran, L., Luo, L., Zhang, X., Liu, T., Zhao, C. (2015). The Speech Recognition Ability for Different Age Groups on the Chinese Language System. In: Stephanidis, C. (eds) HCI International 2015 - Posters’ Extended Abstracts. HCI 2015. Communications in Computer and Information Science, vol 528. Springer, Cham. https://doi.org/10.1007/978-3-319-21380-4_54
Download citation
DOI: https://doi.org/10.1007/978-3-319-21380-4_54
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21379-8
Online ISBN: 978-3-319-21380-4
eBook Packages: Computer ScienceComputer Science (R0)